How should we interpret the consecutive occurrences of "A" and "Z" in the librispeech-lm-norm.txt file? #4970

@MeeElves

Hi,

I have been looking through librispeech-lm-norm.txt while trying to figure out how to prepare a *-lm-norm.txt file for another dataset.

I am having trouble understanding the lines that begin with consecutive occurrences of "A" and "Z" in librispeech-lm-norm.txt.

I could not find a book or textbook that explains this. A friend suggested that I post an issue here to ask about it.

Thank you!

Examples of lines with consecutive "A" and "Z" in librispeech-lm-norm.txt:
A A
A A A
A A A A
A A A A A
A A A A A A A A A A A A A A
A A A A A A ARE THE PARTS OF THE FRAMEWORK THE DIMENSIONS OF WHICH IN FEET AND INCHES ARE GIVEN
A A A A A AH
A A A A A AH THE CRY WAS WRUNG FROM JOHNNIE
A A A A A BOVE SECOND SINGER DIMINUENDO
A A A A A MEN
A A A A A Y
A A A A AHOWOOH
A A A A ALL ABOARD
A A A A ARE FOUR PIECES OF WIRE OF THE SAME THICKNESS AS USED FOR THE PRECEDING NET
A A A A CITY IN SOUTH AMERICA
A A A A H
A A A A L L S WELL
A A A A OBSERVED M'TELA INTERESTEDLY
A A A A ONE OF THE UNITED STATES
A A A A RIVER IN SOUTH AMERICA
A A A A Y
A A A AH
A A A AH A A A AH
...
ZYNOOL'S LOAN WAS MUCH NEEDED AND WAS QUICKLY SPENT
ZYOBOR WILL BE A PART OF THE GREAT WATERS
ZYPS OF ZIRL THE ALPS OF THE TYROL ARE DARK WITH PINES WHERE FOAMING UNDER THE MOUNTAIN SPINES THE INN'S LONG WATER SOUNDS AND SHINES
ZZ
ZZ FIGHT DOGS
ZZ ING
ZZ SHIPLAPS FITTING UNDER THE POINT AND EDGE OF THE MOULD BOARD
ZZZ
ZZZ Z EEEE
ZZZZ
ZZZZ IT WAS A BOLD THING TO DO SAID MY UNCLE SHIFTING THE VENUE FROM THE REGION OF HONOUR TO THE REGION OF COURAGE
ZZZZ LYING FLAT ON THE GROUND PELLE CREPT OVER THE GRASS IMITATING THE MADDENING BUZZ OF THE GAD FLY
ZZZZ WELL THAT'S ONE WAY GEORGE
ZZZZZ IP A DEVIL NOISE A DEATH THAT SHRIEKED TAUNTED AND TRIUMPHED
ZZZZZRUPP THE SLITTING SOUND WAS CLEAR AS THEY BURST INTO THE HALL
ZZZZZZ
ZZZZZZ IP HEAVEN AND EARTH BLURRED TOGETHER BLENDED BY THE GIANT BRUSH OF EDDYING SMOKE
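
To check how often lines like the ones above occur in a corpus, a small filter can help. The following is a minimal sketch, not something taken from the LibriSpeech tooling: the function names and the two regular expressions are hypothetical, and they encode just one possible definition of a "repeated letter" line (a leading run of the same single capital letter as separate tokens, or a first token made of one letter repeated).

```python
import re
from typing import Iterable, Iterator

# Assumed definition 1: the line starts with the same single capital letter
# repeated as separate tokens, e.g. "A A" or "A A A AH".
_REPEATED_TOKENS = re.compile(r"^([A-Z])(?: \1)+(?: |$)")

# Assumed definition 2: the first token is one capital letter repeated,
# e.g. "ZZ" or "ZZZZ WELL THAT'S ONE WAY GEORGE".
_REPEATED_LETTERS = re.compile(r"^([A-Z])\1+(?: |$)")


def has_repeated_letter_prefix(line: str) -> bool:
    """Return True if the line starts with a run of one repeated letter."""
    text = line.strip()
    return bool(_REPEATED_TOKENS.match(text) or _REPEATED_LETTERS.match(text))


def find_repeated_letter_lines(lines: Iterable[str]) -> Iterator[str]:
    """Yield the matching lines, with the trailing newline removed."""
    for line in lines:
        if has_repeated_letter_prefix(line):
            yield line.rstrip("\n")


# Example usage (the path is an assumption; point it at your own copy):
# with open("librispeech-lm-norm.txt", encoding="utf-8") as f:
#     for match in find_repeated_letter_lines(f):
#         print(match)
```

Under these assumed patterns, a line such as "ZZZZZRUPP THE SLITTING SOUND ..." is not flagged, because its first token continues with other letters; loosen the second pattern if those lines should be included too.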
