Corpus Onboarding
Upload 3 to 10 books that you have the rights to use. Good first sources: Project Gutenberg, Litteraturbanken and Språkbanken open texts, and DOAB or other open-license books.
Starter Corpus Health
Books: 3
Benchmark ready: 2
Avg chunks: 144
Rights safe: Yes
Languages: sv, en
Genres: roman, poetry
Embeddings complete: Yes
Health score: 96/100
Import Status
Rights: Public domain
Benchmark: Allowed
Chapters: 0
Chunks: 0
Status: Uploaded
Uploaded > Text > Cleaned > Chapters > Chunks > Embeddings > Book DNA > Ready

Rights: Public domain
Benchmark: Ready
Chapters: 48
Chunks: 76
Status: Benchmark ready
Uploaded > Text > Cleaned > Chapters > Chunks > Embeddings > Book DNA > Ready
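
The progress strip under each book lists the import stages in order (Uploaded, Text, Cleaned, Chapters, Chunks, Embeddings, Book DNA, Ready). As a rough illustration only, the sketch below models that order as an enum. The names `ImportStage`, `next_stage`, and `progress` are hypothetical and not taken from the product, and treating "Text" and "Cleaned" as two separate stages is an assumption based on how the labels appear in the strip.

```python
from enum import IntEnum


class ImportStage(IntEnum):
    """Hypothetical model of the stage order shown in the progress strip."""
    UPLOADED = 0
    TEXT = 1
    CLEANED = 2
    CHAPTERS = 3
    CHUNKS = 4
    EMBEDDINGS = 5
    BOOK_DNA = 6
    READY = 7


def next_stage(stage):
    """Return the stage that follows `stage`, or None once the book is Ready."""
    if stage is ImportStage.READY:
        return None
    return ImportStage(stage + 1)


def progress(stage):
    """Fraction of the pipeline completed: 0.0 at Uploaded, 1.0 at Ready."""
    return stage / ImportStage.READY


if __name__ == "__main__":
    # A freshly uploaded book (0 chapters, 0 chunks) sits at the first stage.
    current = ImportStage.UPLOADED
    print(current.name, "->", next_stage(current).name)
    print(progress(ImportStage.READY))
```

An ordered enum keeps comparisons simple: a check such as `stage >= ImportStage.CHUNKS` would express "chunking has finished" without a lookup table.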