1001 Freelance Projects
Latest Projects from
Freelance Marketplaces
View Project
View this project in detail
(Note: you will be redirected to external marketplace)
Project title:
Tesseract Training
Posted by:
External project from PeoplePerHour
Started:
19-Mar-2025 10:12 GMT
Description:
Description: I have some text, which is single word on tiff file, designed to train eng_custom.traineddata. Currently I use syntax below which seem sane and does not produce any error before last step.

Important: I don't want to change current approach as my goal to train each of 1000 tiff files with same parameters, since I prepared corresponding tessRead and boxes for each tiff.

#Make lstmf file

tesseract test_sample.tiff test_sample \
--tessdata-dir /home/j/img2/tess_files \
--psm 7 --oem 1 -l eng_custom \
/home/j/tesseract/tessdata/configs/lstm.train

echo "test_sample.lstmf" single_lstmf_file.txt

#Train LSTM model

lstmtraining \
--model_output tess_training.lstm \
--continue_from /home/j/img2/tess_files/eng.lstm \
--traineddata /home/j/img2/tess_files/eng_custom.traineddata \
--train_listfile single_lstmf_file.txt \
--max_iterations 1

Stop training and finalize model

lstmtraining --stop_training \
--continue_from tess_training.lstm_checkpoint \
--traineddata /home/j/img2/tess_files/eng_custom.traineddata \
--model_output /home/j/img2/tess_files/eng_final.lstm

Update traineddata with new LSTM model

mkdir -p /home/j/img2/base_model
combine_tessdata -u /home/j/img2/tess_files/eng_custom.traineddata /home/j/img2/base_model/eng_custom
cp /home/j/img2/tess_files/eng_final.lstm /home/j/img2/base_model/eng.lstm
combine_tessdata /home/j/img2/base_model/eng_custom
cp /home/j/img2/base_model/eng_custom.traineddata /home/j/img2/tess_files/eng_custom.traineddata

But I get problem during final step:

j@j:~/t$ tesseract test_sample.tiff stdout -l eng_custom --tessdata-dir /home/j/img2/tess_files/
index = 0:Error:Assert failed:in file /home/j/tesseract4/src/ccutil/strngs.cpp, line 266
Aborted (core dumped)

Question: How to amend above commands so I can combine eng_final.lstm with eng_custom.traineddata

Environment:

/home/j/img2/tess_files/

eng.traineddata eng_custom.traineddata eng.lstm eng_final.lstm

/home/j/img2/base_model/

eng_custom.bigram-dawg eng_custom.normproto
eng_custom.word-dawg eng_custom.freq-dawg
eng_custom.number-dawg eng.lstm eng_custom.inttemp
eng_custom.pffmtable eng.lstm-number-dawg eng_custom.lstm
eng_custom.punc-dawg eng.lstm-punc-dawg eng_custom.lstm-number-dawg eng_custom.shapetable
eng.lstm-recoder eng_custom.lstm-punc-dawg eng_custom.traineddata
eng.lstm-unicharset eng_custom.lstm-recoder
eng_custom.unicharambigs eng.lstm-word-dawg eng_custom.lstm-unicharset eng_custom.unicharset eng.version eng_custom.lstm-word-dawg eng_custom.version

Any guidance would be greatly appreciated.

Thanks!

Jacob
Project ID:
3426853
Project category:
Project budget:
View this project in detail
(Note: you will be redirected to external marketplace)
Last Projects / Browse Projects
  Project Started
I need a podcast editor 21 Apr 2025 20:49 GMT
Data entry: 1100 invoices (3 lines-items/invoice) 21 Apr 2025 20:48 GMT
3D Renders + Lighting floor plan 21 Apr 2025 20:45 GMT
Spanish voice actor to provide services for various projects. 21 Apr 2025 20:34 GMT
Design Facebook ads 21 Apr 2025 20:29 GMT
Business Development Specialist for Knitwear Manufacturer 21 Apr 2025 19:58 GMT
Template Updated Using HTML with embedded CSS and JavaScript. 21 Apr 2025 19:41 GMT
Podcast editing 21 Apr 2025 18:27 GMT
Website Redesign 21 Apr 2025 18:10 GMT
Labelling and packaging expert 21 Apr 2025 17:47 GMT
Social media manager for start up fashion brand 21 Apr 2025 17:29 GMT
Project A6 birthday card layout for printing. 21 Apr 2025 17:17 GMT
Get ranked on Google top reaults 21 Apr 2025 17:17 GMT
Novel editing service 21 Apr 2025 17:16 GMT
Project A6 birthday card layout for printing. 21 Apr 2025 17:03 GMT
Browse All Projects
Projects by Skills ...
android
ajax
asp
aspnet
cms
cpp
csharp
css
delphi
design
drupal
excel
facebook
flash
html
java
javascript
joomla
iphone
mysql
photoshop
php
python
ruby
seo
sql
sysadm
translate
typing
twitter
vbnet
xml
wordpress
writing
New!
Проекты на русском
(Projects in Russian)

Copyright © 2005-2024
1001 Freelance Projects