[go: up one dir, main page]

Dataset Preview
Full Screen
The full dataset viewer is not available (click to read why). Only showing a preview of the rows.
The dataset generation failed because of a cast error
Error code:   DatasetGenerationCastError
Exception:    DatasetGenerationCastError
Message:      An error occurred while generating the dataset

All the data files must have the same columns, but at some point there are 6 new columns ({'options', 'normalized_correct_opt', 'correct_answer', 'question_info', 'price', 'normalized_options'}) and 11 missing columns ({'extra_info', 'correct_attempts', 'ep_num', 'category', 'wrong_attempts', 'round_name', 'value', 'coord', 'answer', 'daily_double', 'air_date'}).

This happened while the csv dataset builder was generating data using

hf://datasets/RedBlock/parrot/millionaire.csv (at revision 946987f59cfda8a40f58ae159e29ceb6e15ad5d9)

Please either edit the data files to have matching columns, or separate them into different configurations (see docs at https://hf.co/docs/hub/datasets-manual-configuration#multiple-configurations)
Traceback:    Traceback (most recent call last):
                File "/src/services/worker/.venv/lib/python3.9/site-packages/datasets/builder.py", line 2013, in _prepare_split_single
                  writer.write_table(table)
                File "/src/services/worker/.venv/lib/python3.9/site-packages/datasets/arrow_writer.py", line 585, in write_table
                  pa_table = table_cast(pa_table, self._schema)
                File "/src/services/worker/.venv/lib/python3.9/site-packages/datasets/table.py", line 2302, in table_cast
                  return cast_table_to_schema(table, schema)
                File "/src/services/worker/.venv/lib/python3.9/site-packages/datasets/table.py", line 2256, in cast_table_to_schema
                  raise CastError(
              datasets.table.CastError: Couldn't cast
              question_info: string
              question: string
              options: string
              correct_answer: string
              price: int64
              normalized_options: string
              normalized_correct_opt: string
              -- schema metadata --
              pandas: '{"index_columns": [{"kind": "range", "name": null, "start": 0, "' + 1134
              to
              {'ep_num': Value(dtype='string', id=None), 'air_date': Value(dtype='string', id=None), 'extra_info': Value(dtype='string', id=None), 'round_name': Value(dtype='string', id=None), 'coord': Value(dtype='string', id=None), 'category': Value(dtype='string', id=None), 'value': Value(dtype='string', id=None), 'daily_double': Value(dtype='string', id=None), 'question': Value(dtype='string', id=None), 'answer': Value(dtype='string', id=None), 'correct_attempts': Value(dtype='float64', id=None), 'wrong_attempts': Value(dtype='float64', id=None)}
              because column names don't match
              
              During handling of the above exception, another exception occurred:
              
              Traceback (most recent call last):
                File "/src/services/worker/src/worker/job_runners/config/parquet_and_info.py", line 1396, in compute_config_parquet_and_info_response
                  parquet_operations = convert_to_parquet(builder)
                File "/src/services/worker/src/worker/job_runners/config/parquet_and_info.py", line 1045, in convert_to_parquet
                  builder.download_and_prepare(
                File "/src/services/worker/.venv/lib/python3.9/site-packages/datasets/builder.py", line 1029, in download_and_prepare
                  self._download_and_prepare(
                File "/src/services/worker/.venv/lib/python3.9/site-packages/datasets/builder.py", line 1124, in _download_and_prepare
                  self._prepare_split(split_generator, **prepare_split_kwargs)
                File "/src/services/worker/.venv/lib/python3.9/site-packages/datasets/builder.py", line 1884, in _prepare_split
                  for job_id, done, content in self._prepare_split_single(
                File "/src/services/worker/.venv/lib/python3.9/site-packages/datasets/builder.py", line 2015, in _prepare_split_single
                  raise DatasetGenerationCastError.from_cast_error(
              datasets.exceptions.DatasetGenerationCastError: An error occurred while generating the dataset
              
              All the data files must have the same columns, but at some point there are 6 new columns ({'options', 'normalized_correct_opt', 'correct_answer', 'question_info', 'price', 'normalized_options'}) and 11 missing columns ({'extra_info', 'correct_attempts', 'ep_num', 'category', 'wrong_attempts', 'round_name', 'value', 'coord', 'answer', 'daily_double', 'air_date'}).
              
              This happened while the csv dataset builder was generating data using
              
              hf://datasets/RedBlock/parrot/millionaire.csv (at revision 946987f59cfda8a40f58ae159e29ceb6e15ad5d9)
              
              Please either edit the data files to have matching columns, or separate them into different configurations (see docs at https://hf.co/docs/hub/datasets-manual-configuration#multiple-configurations)

Need help to make the dataset viewer work? Make sure to review how to configure the dataset viewer, and open a discussion for direct support.

ep_num
string
air_date
string
extra_info
string
round_name
string
coord
string
category
string
value
string
daily_double
string
question
string
answer
string
correct_attempts
float64
wrong_attempts
float64
1
1984-09-10
Premiere episode with Alex Trebek as host.
Jeopardy
(1, 1)
LAKES & RIVERS
(100,)
False
River mentioned most often in the Bible
the Jordan
1
1
1
1984-09-10
Premiere episode with Alex Trebek as host.
Jeopardy
(2, 1)
INVENTIONS
(100,)
False
Marconi's wonderful wireless
the radio
0
3
1
1984-09-10
Premiere episode with Alex Trebek as host.
Jeopardy
(3, 1)
ANIMALS
(100,)
False
These rodents first got to America by stowing away on ships
rats
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Jeopardy
(4, 1)
FOREIGN CUISINE
(100,)
False
The "coq" in coq au vin
chicken
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Jeopardy
(5, 1)
ACTORS & ROLES
(100,)
False
Video in which Michael Jackson plays a werewolf & a zombie
"Thriller"
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Jeopardy
(1, 2)
LAKES & RIVERS
(200,)
False
Scottish word for lake
loch
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Jeopardy
(2, 2)
INVENTIONS
(200,)
False
In 1869 an American minister created this "oriental" transportation
the rickshaw
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Jeopardy
(3, 2)
ANIMALS
(200,)
False
There are about 40,000 muscles & tendons in this part of an elephant's body
the trunk
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Jeopardy
(4, 2)
FOREIGN CUISINE
(200,)
False
A British variety is called "bangers", a Mexican variety, "chorizo"
sausages
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Jeopardy
(5, 2)
ACTORS & ROLES
(200,)
False
2 "Saturday Night" alumni who tried "Trading Places"
Dan Aykroyd & Eddie Murphy
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Jeopardy
(1, 3)
LAKES & RIVERS
(800,)
True
River in this famous song:
the Volga River
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Jeopardy
(2, 3)
INVENTIONS
(300,)
False
A 1920's hunting trip to Canada inspired Birdseye's food preserving method
freezing
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Jeopardy
(3, 3)
ANIMALS
(300,)
False
When husbands "pop" for an ermine coat, they're actually buying this fur
a weasel
0
3
1
1984-09-10
Premiere episode with Alex Trebek as host.
Jeopardy
(4, 3)
FOREIGN CUISINE
(300,)
False
Jewish crepe filled with cheese
a blintz
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Jeopardy
(5, 3)
ACTORS & ROLES
(300,)
False
He may "Never Say Never Again" when asked to be Bond
Sean Connery
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Jeopardy
(1, 4)
LAKES & RIVERS
(400,)
False
American river only 33 miles shorter than the Mississippi
the Missouri
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Jeopardy
(2, 4)
INVENTIONS
(400,)
False
This fastener gets its name from a brand of galoshes it was used on
a zipper
0
3
1
1984-09-10
Premiere episode with Alex Trebek as host.
Jeopardy
(3, 4)
ANIMALS
(400,)
False
Close relative of the pig, though its name means "river horse"
the hippopotamus
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Jeopardy
(4, 4)
FOREIGN CUISINE
(400,)
False
French for a toothsome cut of beef served to a twosome
Châteaubriand
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Jeopardy
(5, 4)
ACTORS & ROLES
(400,)
False
The blonde preferred in the film "Gentlemen Prefer Blondes"
Marilyn Monroe
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Jeopardy
(1, 5)
LAKES & RIVERS
(500,)
False
World's largest lake, nearly 5 times as big as Superior
the Caspian Sea
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Jeopardy
(3, 5)
ANIMALS
(500,)
False
If this species of hybrid's parents were reversed, you'd get a hinny
mule
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Jeopardy
(5, 5)
ACTORS & ROLES
(500,)
False
Sam Shepard played this barrier breaker in "The Right Stuff"
Colonel Chuck Yeager
1
1
1
1984-09-10
Premiere episode with Alex Trebek as host.
Double Jeopardy
(1, 1)
THE BIBLE
(200,)
False
When "Joshua Fit The Battle Of Jericho", these took a tumble
the walls
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Double Jeopardy
(2, 1)
'50'S TV
(200,)
False
Occupation of Richard Diamond, Peter Gunn & Mike Hammer
private eyes (or private detectives)
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Double Jeopardy
(3, 1)
NATIONAL LANDMARKS
(200,)
False
She came from France to harbor America's freedom
the Statue of Liberty
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Double Jeopardy
(5, 1)
NOTORIOUS
(200,)
False
It was probably a lyre, not a fiddle, if he played it while Rome burned
Nero
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Double Jeopardy
(6, 1)
4-LETTER WORDS
(200,)
False
Pulled the trigger or what's in a jigger
shot
1
1
1
1984-09-10
Premiere episode with Alex Trebek as host.
Double Jeopardy
(1, 2)
THE BIBLE
(400,)
False
His price was 30 pieces of silver
Judas
1
1
1
1984-09-10
Premiere episode with Alex Trebek as host.
Double Jeopardy
(2, 2)
'50'S TV
(400,)
False
She was "Our Miss Brooks"
Eve Arden
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Double Jeopardy
(3, 2)
NATIONAL LANDMARKS
(400,)
False
When he was home, George Washington slept here
Mount Vernon
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Double Jeopardy
(5, 2)
NOTORIOUS
(400,)
False
His book, translated as "My Struggle", outlined plans to conquer Europe
Adolf Hitler
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Double Jeopardy
(6, 2)
4-LETTER WORDS
(400,)
False
Basketball defense or Serling's twilight area
zone
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Double Jeopardy
(1, 3)
THE BIBLE
(600,)
False
According to the Bible, it wasn't necessarily an apple
the forbidden fruit (or the fruit of the Tree of Knowledge)
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Double Jeopardy
(2, 3)
'50'S TV
(600,)
False
Amount Michael Anthony gave out each week on behalf of John Beresford Tipton
$1 million
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Double Jeopardy
(3, 3)
NATIONAL LANDMARKS
(600,)
False
The cornerstone of Massachusetts, it bears the date 1620
Plymouth Rock
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Double Jeopardy
(5, 3)
NOTORIOUS
(600,)
False
Lenin called him ruthless, and his purges proved he was
Stalin
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Double Jeopardy
(6, 3)
4-LETTER WORDS
(600,)
False
Little girls do it with a rope, Van Halen does it in a song
jump
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Double Jeopardy
(1, 4)
THE BIBLE
(800,)
False
Though its name means "city of peace", it's seen over 30 wars, the last in 1967
Jerusalem
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Double Jeopardy
(2, 4)
'50'S TV
(800,)
False
His card read "Have gun, will travel"
Paladin (Richard Boone)
1
1
1
1984-09-10
Premiere episode with Alex Trebek as host.
Double Jeopardy
(3, 4)
NATIONAL LANDMARKS
(800,)
False
Site where John Hancock signed his "John Hancock"
Independence Hall
1
1
1
1984-09-10
Premiere episode with Alex Trebek as host.
Double Jeopardy
(6, 4)
4-LETTER WORDS
(1000,)
True
It's the first 4-letter word in "The Star Spangled Banner"
what
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Double Jeopardy
(1, 5)
THE BIBLE
(1000,)
False
According to 1st Timothy, it is the "root of all evil"
the love of money
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Double Jeopardy
(2, 5)
'50'S TV
(1000,)
False
Name under which experimenter Don Herbert taught viewers all about science
Mr. Wizard
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Double Jeopardy
(3, 5)
NATIONAL LANDMARKS
(1000,)
False
D.C. building shaken by November '83 bomb blast
the Capitol
1
2
1
1984-09-10
Premiere episode with Alex Trebek as host.
Double Jeopardy
(5, 5)
NOTORIOUS
(1000,)
False
After the deed, he leaped to the stage shouting "Sic semper tyrannis"
John Wilkes Booth
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Double Jeopardy
(6, 5)
4-LETTER WORDS
(1000,)
False
The president takes one before stepping into office
oath
1
0
1
1984-09-10
Premiere episode with Alex Trebek as host.
Final Jeopardy
(1, 1)
HOLIDAYS
(2500, 3700, 300)
False
The third Monday of January starting in 1986
Martin Luther King Day
3
0
2
1984-09-11
Second episode. Three-way tie at zero.
Jeopardy
(1, 1)
STATE CAPITALS
(100,)
False
Until 1875 its dual capitals were New Haven & Hartford
Connecticut
1
1
2
1984-09-11
Second episode. Three-way tie at zero.
Jeopardy
(2, 1)
THE '40'S
(100,)
False
Though faster than a speeding bullet, he was ruled 4-F
Superman
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Jeopardy
(4, 1)
TRANSPORTATION
(100,)
False
Type of auto engine, or a tomato cocktail
V8
0
3
2
1984-09-11
Second episode. Three-way tie at zero.
Jeopardy
(5, 1)
COUNTRY MUSIC
(100,)
False
A grandma before her 30th birthday, she's the real "Coal Miner's Daughter"
Loretta Lynn
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Jeopardy
(6, 1)
BY THE NUMBERS
(100,)
False
The 2 digits that give James Bond license to kill
00
1
1
2
1984-09-11
Second episode. Three-way tie at zero.
Jeopardy
(1, 2)
STATE CAPITALS
(200,)
False
This N.M. town is the oldest city that's a state capital
Santa Fe
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Jeopardy
(2, 2)
THE '40'S
(200,)
False
By war's end, they had banked over 13 million units of blood for plasma
the Red Cross
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Jeopardy
(4, 2)
TRANSPORTATION
(200,)
False
Lindbergh was not first to fly the Atlantic, but first to do it this way
solo
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Jeopardy
(5, 2)
COUNTRY MUSIC
(200,)
False
Nashville's Ryman Auditorium was its home for years
the Grand Ole Opry
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Jeopardy
(6, 2)
BY THE NUMBERS
(200,)
False
Three Dog Night called this the loneliest number
"One"
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Jeopardy
(1, 3)
STATE CAPITALS
(300,)
False
Crossing the Delaware on Xmas, 1776, Washington defeated the Hessians at this N.J. capital
Trenton
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Jeopardy
(2, 3)
THE '40'S
(300,)
False
Not his singing, but a lack of lunch caused fan to swoon at his Paramount Theater concert
Frank Sinatra
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Jeopardy
(4, 3)
TRANSPORTATION
(300,)
False
Derived from words "American", "travel", & "track", it provides most U.S. passenger rail service
Amtrak
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Jeopardy
(5, 3)
COUNTRY MUSIC
(300,)
False
In "True Grit", he played a cowboy, but not his famous Rhinestone one
Glen Campbell
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Jeopardy
(6, 3)
BY THE NUMBERS
(300,)
False
Number of red stripes on current U.S. flag
7
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Jeopardy
(1, 4)
STATE CAPITALS
(400,)
False
It actually is 5,280 feet above sea level
Denver
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Jeopardy
(2, 4)
THE '40'S
(400,)
False
Destination of MacArthur's "I shall return"
the Philippines
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Jeopardy
(4, 4)
TRANSPORTATION
(400,)
False
London's, not New York's, was the first ever built
subway (underground, metro)
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Jeopardy
(5, 4)
COUNTRY MUSIC
(400,)
False
His famed San Quentin concert inspired inmate Merle Haggard
Johnny Cash
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Jeopardy
(6, 4)
BY THE NUMBERS
(400,)
True
Total of Disney's Dalmatians and dwarfs
108
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Jeopardy
(1, 5)
STATE CAPITALS
(500,)
False
The name shows its founder, Roger Williams, believed God led him there
Providence
0
3
2
1984-09-11
Second episode. Three-way tie at zero.
Jeopardy
(2, 5)
THE '40'S
(500,)
False
Wartime pseudonym of Mrs. I. Toguri D'Aquino
Tokyo Rose
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Jeopardy
(4, 5)
TRANSPORTATION
(500,)
False
Changing lines, you could have at one time ridden these from Freeport, IL to Utica, NY
streetcars
1
2
2
1984-09-11
Second episode. Three-way tie at zero.
Jeopardy
(6, 5)
BY THE NUMBERS
(500,)
False
Broadway hit that takes Fellini film a ½ step further
Nine
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Double Jeopardy
(1, 1)
WILD WEST
(200,)
False
One of its newspapers was appropriately called "The Epitaph"
Tombstone
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Double Jeopardy
(2, 1)
OPERA
(200,)
False
The Lone Ranger's theme is the opera's overture
(The) William Tell (Overture)
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Double Jeopardy
(3, 1)
SPORTS
(200,)
False
The Greeks reckoned time from this event, first held in Olympia in 776 B.C.
the Olympics
0
3
2
1984-09-11
Second episode. Three-way tie at zero.
Double Jeopardy
(4, 1)
BIOLOGY
(200,)
False
Genus Rana; frequent victim of biology class dissections
a frog
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Double Jeopardy
(5, 1)
FOREIGN PHRASES
(200,)
False
In Germany, said before a toast & after a sneeze
Gesundheit
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Double Jeopardy
(6, 1)
RELIGION
(200,)
False
He was the doubter among the Apostles
(St.) Thomas
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Double Jeopardy
(1, 2)
WILD WEST
(400,)
False
She once shot a cigarette from the mouth of the German crown prince
Annie Oakley
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Double Jeopardy
(2, 2)
OPERA
(400,)
False
Profession of Rossini's Figaro
a barber
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Double Jeopardy
(4, 2)
BIOLOGY
(400,)
False
L.B.J.'s hound dog or Darwin's ship
the Beagle
1
1
2
1984-09-11
Second episode. Three-way tie at zero.
Double Jeopardy
(5, 2)
FOREIGN PHRASES
(400,)
False
American equivalent to English "the bonnet on a lorry"
the hood of a truck
1
1
2
1984-09-11
Second episode. Three-way tie at zero.
Double Jeopardy
(6, 2)
RELIGION
(400,)
False
Continent with the largest Jewish population
North America
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Double Jeopardy
(1, 3)
WILD WEST
(600,)
False
In 650,000 miles the mail was lost only once
the Pony Express
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Double Jeopardy
(2, 3)
OPERA
(1000,)
True
This aria from "Pagliacci" gave him the first million-selling record ever
Enrico Caruso
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Double Jeopardy
(4, 3)
BIOLOGY
(600,)
False
The basic unit of life; 3 billion die every minute in your body
cells
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Double Jeopardy
(5, 3)
FOREIGN PHRASES
(600,)
False
From French, it literally means "a pen name"
nom de plume
1
1
2
1984-09-11
Second episode. Three-way tie at zero.
Double Jeopardy
(6, 3)
RELIGION
(600,)
False
Color of smoke signifying election of new Pope
white
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Double Jeopardy
(1, 4)
WILD WEST
(800,)
False
Brothers Virgil & Morgan were shot here, but Wyatt Earp emerged unscathed
(the gunfight at) the OK Corral
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Double Jeopardy
(2, 4)
OPERA
(800,)
False
Lt. Pinkerton's girlfriend Cio-Cio-San
Madame Butterfly
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Double Jeopardy
(4, 4)
BIOLOGY
(800,)
False
It puts the green in greenery
chlorophyll
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Double Jeopardy
(5, 4)
FOREIGN PHRASES
(800,)
False
The vidi, in "Veni, vidi, vici"
I saw
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Double Jeopardy
(6, 4)
RELIGION
(800,)
False
This Buddhist sect seeks truth through concepts like "the sound of one hand clapping"
Zen
1
1
2
1984-09-11
Second episode. Three-way tie at zero.
Double Jeopardy
(1, 5)
WILD WEST
(1000,)
False
It ran 2,000 miles, from Independence, Missouri to Oregon City, Oregon
the Oregon Trail
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Double Jeopardy
(2, 5)
OPERA
(1000,)
False
1976 was also the bicentennial of this famed Milan opera house
Teatro alla Scala
0
3
2
1984-09-11
Second episode. Three-way tie at zero.
Double Jeopardy
(4, 5)
BIOLOGY
(1000,)
False
Deoxyribonucleic acid
DNA
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Double Jeopardy
(6, 5)
RELIGION
(1000,)
False
This word for the Mohammedan religion means "submission to the will of God"
Islam
1
0
2
1984-09-11
Second episode. Three-way tie at zero.
Final Jeopardy
(1, 1)
THE CALENDAR
(1100, 5000, 9500)
False
Calendar date with which the 20th century began
January 1, 1901
0
3
3
1984-09-12
Third episode.
Jeopardy
(1, 1)
U.S. HISTORY
(100,)
False
This colonial inventor suggested Daylight Saving Time
Benjamin Franklin
1
0
3
1984-09-12
Third episode.
Jeopardy
(2, 1)
3-LETTER WORDS
(100,)
False
A fedora, homburg or derby
a hat
1
0
End of preview.

Dataset Card for PARROT

Performance Assessment of Reasoning and Responses on Trivia (PARROT) is a validated LLM benchmarking dataset that leverages game show data for a more realistic evaluation of Large Language Models (LLMs). Curated by Redblock, this dataset offers unique challenges through its open-ended and closed-ended question formats, derived from the popular game shows Jeopardy and Who Wants to Be a Millionaire.

Dataset Details

Dataset Description

PARROT is designed to provide a robust evaluation of LLM performance through diverse QA tasks. It is comprised of two distinct datasets:

  1. PARROT-Jeopardy: A dataset consisting of questions from the game show Jeopardy, featuring short, concise questions for testing reasoning and ambiguity handling.

  2. PARROT-Millionaire: A dataset consisting of questions from the game show Who Wants to Be a Millionaire, known for its straightforward nature and broad range of topics. This dataset is valuable for evaluating an LLM's knowledge.

  • Curated by: Redblock
  • Shared by: Redblock
  • License: cc-by-4.0

Uses

Direct Use

PARROT is designed to benchmark the performance of Large Language Models (LLMs) in Question-Answering tasks, particularly over trivia.

Dataset Structure

PARROT-Jeopardy

  • ep_num: Episode number from the season.
  • air_date: Date when the episode aired.
  • extra_info: Additional episode information, including the host's name.
  • round_name: The round being played (e.g., Jeopardy, Double Jeopardy, Final Jeopardy).
  • coord: Coordinates of the clues on the game board.
  • category: Clue category.
  • value: Monetary value of the clue.
  • daily_double: Boolean indicating if the clue is part of the Daily Double round.
  • question: The clue itself.
  • answer: Labeled answer or guess.
  • correct_attempts: Count of contestants who answered correctly.
  • wrong_attempts: Count of contestants who answered incorrectly.

PARROT-Millionaire

  • question_info: Describes the price value and the current question number.
  • question: The question in text form.
  • options: Four predefined options corresponding to the question.
  • correct_answer: Labeled correct answer.
  • price: Engineered feature from Question Info, indicating the dollar value of the question.
  • normalized_options: Engineered feature providing text normalization for the options.
  • normalized_correct_opt: Engineered feature providing text normalization for the correct answer.

Dataset Creation

Curation Rationale

PARROT was created to address the need for a more realistic and challenging benchmarking dataset for LLMs. By using game show data, the dataset captures a wide range of question types and difficulties, providing a comprehensive evaluation tool.

Source Data

Data Collection and Processing

  • Data for PARROT-Jeopardy was curated from seven key seasons of Jeopardy the game show to ensure a representative sample across the show's timeline. The data was scraped from the J!Archive, a fan-created archive containing over 500,000 clues.

  • PARROT-Millionaire was created by scraping data from the Millionaire Fandom site. The data was organized and processed to ensure consistency and reliability.

Who are the source data producers?

The original data for PARROT-Jeopardy was sourced from the fan-created archive of the original show Jeopardy, while the data for PARROT-Millionaire was sourced from the Millionaire Fandom site.

Personal and Sensitive Information

The dataset does not contain personal, sensitive, or private information.

Citation

BibTeX: If you use this dataset in your research, please cite it as follows:

@dataset{parrot2024,
author = {Redblock AI Team},
title = {PARROT: Performance Assessment of Reasoning and Responses on Trivia},
year = 2024,
publisher = {Redblock},
url = {https://huggingface.co/datasets/redblock/parrot},
license = {CC BY 4.0}
}

APA:

Redblock AI Team. (2024). PARROT: Performance Assessment of Reasoning and Responses on Trivia. Redblock. Available at https://huggingface.co/datasets/redblock-ai/parrot.

More Information

For more information, visit redblock.ai.

Dataset Card Authors

Redblock AI Team

Dataset Card Contact

For inquiries, visit redblock.ai.

dataset_info.json

{
  "description": "PARROT is a validated LLM benchmarking dataset that leverages game show data for a more realistic evaluation of Large Language Models (LLMs). Curated by Redblock, this dataset offers unique challenges through its open-ended and closed-ended question formats, derived from the popular game shows Jeopardy! and Who Wants to Be a Millionaire?",
  "homepage": "https://huggingface.co/datasets/redblock-ai/parrot",
  "license": "cc-by-4.0",
  "citation": "@dataset{parrot2024,\nauthor = {Redblock AI Team},\ntitle = {PARROT: Performance Assessment of Reasoning and Responses on Trivia},\nyear = 2024,\npublisher = {Redblock},\nurl = {https://huggingface.co/datasets/redblock/parrot},\nlicense = {CC BY 4.0}\n}",
  "version": "1.0.0",
  "splits": {
    "jeopardy": {"num_examples": 61462},
    "millionaire": {"num_examples": 22698}
  },
  "download_size": 20000000,
  "dataset_size": 20000000
}

Disclaimer

Important Notice:
The datasets curated in this benchmark include content derived from fan-created sites related to Who Wants to Be a Millionaire? Fandom and J! Archive. These datasets are intended solely for research, educational purposes, and non-commercial use. Redblock does not claim ownership of, nor does it have any affiliation with, the creators or copyright holders of Who Wants to Be a Millionaire? and J! Archive.

Who Wants to Be a Millionaire? Fandom and J! Archive are registered trademarks of their respective owners. Redblock's use of these materials is protected under the fair use doctrine as defined by U.S. copyright law, which permits the use of copyrighted material for purposes such as criticism, commentary, news reporting, teaching, scholarship, and research.

Redblock has modified these datasets in compliance with U.S. law to ensure that the content remains within the boundaries of fair use. Any modifications or derived works created from these datasets should also adhere to the principles of fair use and respect the intellectual property rights of the original content creators.

This benchmark is provided by Redblock "as-is" without any guarantee of accuracy or fitness for a particular purpose. Users of this benchmark are encouraged to respect copyright laws and the intellectual property rights of the original content creators. The datasets should not be used for commercial purposes without obtaining proper authorization from the rights holders.

Downloads last month
43