Project title:
Tesseract Training
Posted by:
External project from PeoplePerHour
Started:
19-Mar-2025 10:12 GMT
Description:
I have some text, a single word per TIFF file, prepared for training eng_custom.traineddata. I currently use the commands below, which seem sane and do not produce any error until the last step.

Important: I don't want to change the current approach, because my goal is to train on each of 1000 TIFF files with the same parameters, and I have already prepared the corresponding tessRead and box files for each TIFF.

# Make lstmf file
tesseract test_sample.tiff test_sample \
  --tessdata-dir /home/j/img2/tess_files \
  --psm 7 --oem 1 -l eng_custom \
  /home/j/tesseract/tessdata/configs/lstm.train
echo "test_sample.lstmf" > single_lstmf_file.txt

# Train LSTM model
lstmtraining \
  --model_output tess_training.lstm \
  --continue_from /home/j/img2/tess_files/eng.lstm \
  --traineddata /home/j/img2/tess_files/eng_custom.traineddata \
  --train_listfile single_lstmf_file.txt \
  --max_iterations 1

# Stop training and finalize model
lstmtraining --stop_training \
  --continue_from tess_training.lstm_checkpoint \
  --traineddata /home/j/img2/tess_files/eng_custom.traineddata \
  --model_output /home/j/img2/tess_files/eng_final.lstm

# Update traineddata with new LSTM model
mkdir -p /home/j/img2/base_model
combine_tessdata -u /home/j/img2/tess_files/eng_custom.traineddata /home/j/img2/base_model/eng_custom
cp /home/j/img2/tess_files/eng_final.lstm /home/j/img2/base_model/eng.lstm
combine_tessdata /home/j/img2/base_model/eng_custom
cp /home/j/img2/base_model/eng_custom.traineddata /home/j/img2/tess_files/eng_custom.traineddata

But I get a problem during the final step:

j@j:~/t$ tesseract test_sample.tiff stdout -l eng_custom --tessdata-dir /home/j/img2/tess_files/
index = 0:Error:Assert failed:in file /home/j/tesseract4/src/ccutil/strngs.cpp, line 266
Aborted (core dumped)

Question: How should I amend the commands above so that I can combine eng_final.lstm with eng_custom.traineddata?

Environment:

/home/j/img2/tess_files/
  eng.traineddata  eng_custom.traineddata  eng.lstm  eng_final.lstm

/home/j/img2/base_model/
  eng_custom.bigram-dawg        eng_custom.normproto      eng_custom.word-dawg
  eng_custom.freq-dawg          eng_custom.number-dawg    eng.lstm
  eng_custom.inttemp            eng_custom.pffmtable      eng.lstm-number-dawg
  eng_custom.lstm               eng_custom.punc-dawg      eng.lstm-punc-dawg
  eng_custom.lstm-number-dawg   eng_custom.shapetable     eng.lstm-recoder
  eng_custom.lstm-punc-dawg     eng_custom.traineddata    eng.lstm-unicharset
  eng_custom.lstm-recoder       eng_custom.unicharambigs  eng.lstm-word-dawg
  eng_custom.lstm-unicharset    eng_custom.unicharset     eng.version
  eng_custom.lstm-word-dawg     eng_custom.version

Any guidance would be greatly appreciated.

Thanks!
Jacob
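
Since the goal is to repeat the same steps for about 1000 TIFF files, the list file passed to --train_listfile could be generated rather than written by hand. The following is only a minimal sketch, not part of the original commands: the function name build_lstmf_list and the example paths are hypothetical, and it assumes each .lstmf file sits next to its .tiff with the same base name (as test_sample.lstmf does for test_sample.tiff above).

```shell
#!/bin/sh
# Hypothetical helper (not from the original post): write one .lstmf path per
# line for every .tiff file found in a directory.
# Assumption: each .lstmf shares the base name and directory of its .tiff.
build_lstmf_list() {
  tiff_dir="$1"    # directory holding the .tiff files
  list_file="$2"   # list file to create or overwrite
  : > "$list_file" # start from an empty list file
  for tiff in "$tiff_dir"/*.tiff; do
    # If nothing matches, the glob stays literal; skip that non-existent path.
    [ -e "$tiff" ] || continue
    printf '%s\n' "${tiff%.tiff}.lstmf" >> "$list_file"
  done
}

# Example call (placeholder paths):
# build_lstmf_list /path/to/tiffs all_lstmf_files.txt
```

The resulting file can then be used in place of single_lstmf_file.txt; whether to train on all samples in one run or one file at a time remains a choice for the project owner.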
Project ID:
3426853