This repository was archived by the owner on Sep 25, 2025. It is now read-only.

Conversation

Utkarsh352
I made the README file more grammatically correct. I hope you will like it.

preemptable -> preemptible
The current version reads the entire train file into memory and then
loads it with Dataset.from_tensor_slices. This can result in very large
allocations which can either OOM or cause a bad_alloc. To avoid this,
the new version now writes a TFRecord file to the `output_dir`.
External references should all be to colab.research.
It seems the [] on CLS and SEP were added afterwards and the masking was not
realigned.
comments: fix the alignment of type_ids
Fix BERT acronym to be consistent with paper
Added colab link in the readme
Fixed the colab notebook link.
Correct Wikipedia size correlation comment.
…v-patch-1

Update multilingual.md to correct Wikipedia size correlation comment.
I made it more grammatically correct. I hope you will like it