Java Jive
2019-10-05 15:59:57 UTC
Cross-posting to one Linux and one Windows group, because I'm happy to
run a workable solution on either OS.
As per the post's title, I have a need for some software, preferably
free or at least low cost, to run on either OS, which can recognise
handwriting. The context is that I've inherited a trunkful of old
Macfarlane documents, nearly all hand-written, which I'm scanning my way
through, having completed probably three-quarters or more of it.
The documents consists of:
1) Some old deeds, wills, etc written on parchment (stretched animal
skin). These are so large that they have had to be scanned in sections,
anything from 9-16 in number, and then the individual images joined
together into one so that each line of writing continues seamlessly over
all the joins. This is the stage I'm currently at, and it's taking a
long time because the business of joining the images together is very,
very fiddly. It takes an entire evening to create a single image of
such a document, then, currently, after each is done, I convert it into
text manually, as that's the best way to ensure that the resulting
complete image is fully readable, and even this latter can take another
evening just for one document.
2) Many family letters
3) A few pages of accounts
4) Family trees
5) Log books of sea-voyages, diaries of holidays, hand-written books
containing historical research notes, etc.
6) Many loose pages of notes concerning family history, clan history,
and Scottish history, nearly all of which are on foolscap pages, which
again had to be scanned in two sections to capture the top and bottom of
the page and then joined together to form one image.
Some of the notes were in pencil, which required me altering the
exposure settings. Worse still, quite a number of pages had both ink
and pencil, for which it was difficult to find settings that showed both
to best advantage.
As I'm currently nearing the end of the scanning phase, next will be
sorting the documents into boxes and the files into sub-directories -
of course they are already, because I was trying to make sense of it all
as I worked through it, but only after I know entirely what is there
will I be able to finalise how best it should be divided up into
meaningful chunks of information. After that will come the probably
much longer phase of converting as much of it as possible into
searchable text. It's this last stage I really would like to shorten as
much as possible, because I'd like to have a chance of finishing it
before I die !-)
So, does anyone here have any experience of software to recognise
hand-writing in an image file (*.png) and produce a text file from it?
run a workable solution on either OS.
As per the post's title, I have a need for some software, preferably
free or at least low cost, to run on either OS, which can recognise
handwriting. The context is that I've inherited a trunkful of old
Macfarlane documents, nearly all hand-written, which I'm scanning my way
through, having completed probably three-quarters or more of it.
The documents consists of:
1) Some old deeds, wills, etc written on parchment (stretched animal
skin). These are so large that they have had to be scanned in sections,
anything from 9-16 in number, and then the individual images joined
together into one so that each line of writing continues seamlessly over
all the joins. This is the stage I'm currently at, and it's taking a
long time because the business of joining the images together is very,
very fiddly. It takes an entire evening to create a single image of
such a document, then, currently, after each is done, I convert it into
text manually, as that's the best way to ensure that the resulting
complete image is fully readable, and even this latter can take another
evening just for one document.
2) Many family letters
3) A few pages of accounts
4) Family trees
5) Log books of sea-voyages, diaries of holidays, hand-written books
containing historical research notes, etc.
6) Many loose pages of notes concerning family history, clan history,
and Scottish history, nearly all of which are on foolscap pages, which
again had to be scanned in two sections to capture the top and bottom of
the page and then joined together to form one image.
Some of the notes were in pencil, which required me altering the
exposure settings. Worse still, quite a number of pages had both ink
and pencil, for which it was difficult to find settings that showed both
to best advantage.
As I'm currently nearing the end of the scanning phase, next will be
sorting the documents into boxes and the files into sub-directories -
of course they are already, because I was trying to make sense of it all
as I worked through it, but only after I know entirely what is there
will I be able to finalise how best it should be divided up into
meaningful chunks of information. After that will come the probably
much longer phase of converting as much of it as possible into
searchable text. It's this last stage I really would like to shorten as
much as possible, because I'd like to have a chance of finishing it
before I die !-)
So, does anyone here have any experience of software to recognise
hand-writing in an image file (*.png) and produce a text file from it?