Transcribe Geometry Model Data from a PDF report to an ASCII file Helicopter
BRL-CAD
Status: Closed
Time to complete: 100 hrs
Mentors: Isaac Kamga, Deepak
Tags: docs, geometry, transcription, documentation, convert, text, 3D, ocr, pdf
Beginner

We have scans (PDF) of a number of reports documenting early geometric models in the COMGEOM format (a now obsolete format, but the models are interesting nonetheless). These reports contain the actual geometry defining the model as pages and pages of numbers and letters. Unfortunately, the quality is sufficiently poor that optical character recognition (OCR) has a very high rate of error.

This task is to attempt the manual transcription of a portion of the Black Hawk Helicopter model described in the report "Computer Description of Black Hawk Helicopter" (see the References list below for the link that will let you download the PDF). One possible approach is to use Acrobat Reader or some other PDF reader to select and copy the OCR text, paste it into a text file as a starting point, and then correct it manually. There may also be patterns that allow for semi-automated processing: for example, if five zeros in a row are commonly rendered as the letter "O" instead of the digit 0, a search and replace is in order. However you wish to approach it is fine, but remember that the goal is not just to extract the OCR text but to produce an accurate transcription of the file. The OCR text can be used as a starting point, but it will NOT be accurate.

The preferred format to provide the pages in is a comma-separated value ASCII text file, which is suitable for post-processing.
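As a rough sketch of what producing such a file might look like, the snippet below turns whitespace-separated transcription lines into CSV text with Python's standard `csv` module and lists any cells that contain non-ASCII characters. The sample rows and the helper names `rows_to_csv` and `non_ascii_cells` are made up for illustration; the actual column layout of the report's tables is not assumed here.

```python
import csv
import io


def rows_to_csv(lines):
    """Convert whitespace-separated transcription lines to CSV text."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    for line in lines:
        fields = line.split()
        if fields:  # skip blank lines
            writer.writerow(fields)
    return buffer.getvalue()


def non_ascii_cells(csv_text):
    """Return (row, column, value) for every cell with non-ASCII characters."""
    problems = []
    for r, row in enumerate(csv.reader(io.StringIO(csv_text)), start=1):
        for c, value in enumerate(row, start=1):
            if not value.isascii():
                problems.append((r, c, value))
    return problems


if __name__ == "__main__":
    # Hypothetical sample rows, not taken from the report.
    text = rows_to_csv(["1 RPP 0.0 10.5", "", "2 BOX 1.0 2.0"])
    print(text, end="")
    print(non_ascii_cells(text))
```

A check like `non_ascii_cells` only catches stray characters left over from copy and paste; it says nothing about whether the numbers themselves match the scan.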

The eventual goal is to have a file that can be fed to BRL-CAD's comgeom-g importer to generate an accurate .g file. The description of this target is a couple hundred text pages (which will take much longer than a single GCI task if you're doing correctness checking!), so there will be multiple tasks for pieces of the file. For this task, please submit a CSV file with the content of the tables on pages 71-101.

References:

Please discuss your progress with the developers.

Additional information on comgeom

Uploaded Work
File name/URL | File size | Date submitted
a073444(71-101).csv | 8.0 MB | January 10 2015 05:41 UTC
Comments
Daksh kalra on December 5 2014 14:42 UTC: Task Claimed

I would like to work on this task.

Deepak on December 5 2014 14:43 UTC: Task Assigned

This task has been assigned to Daksh kalra. You have 100 hours to complete this task, good luck!

Daksh kalra on December 6 2014 01:55 UTC: Claim Removed

The claim on this task has been removed, someone else can claim it now.

Daksh kalra on December 6 2014 05:27 UTC: Task Claimed

I would like to work on this task.

Gauravjeet Singh on December 6 2014 05:28 UTC: Task Assigned

This task has been assigned to Daksh kalra. You have 100 hours to complete this task, good luck!

Daksh kalra on December 6 2014 05:33 UTC: Claim Removed

The claim on this task has been removed, someone else can claim it now.

MaxitoTheGreat on December 8 2014 22:13 UTC: Task Claimed

I would like to work on this task.

MaxitoTheGreat on December 8 2014 22:25 UTC

TABLE OF CONTENTS

Page

I. INTRODUCTION .............................. 11
II. DISCUSSION ............................... 11
    A. Combinatorial Geometry (COMGEOM) Method ... 11
    B. Specific Approach to COMGEOM Modeling ..... 14
    C. Computer Target Description ............... 17
III. CONCLUSIONS ............................. 18
APPENDIX ..................................... 69
DISTRIBUTION LIST ............................ 323

Mihai Neacsu on December 9 2014 02:45 UTC: Task Assigned

This task has been assigned to MaxitoTheGreat. You have 100 hours to complete this task, good luck!

Melange on December 13 2014 06:45 UTC: Task Reopened

Melange has detected that the final deadline has passed and it has reopened the task.

YolandaCenKam on December 26 2014 11:00 UTC: Task Claimed

I would like to work on this task.

YolandaCenKam on December 26 2014 11:01 UTC: Claim Removed

The claim on this task has been removed, someone else can claim it now.

Luke on December 29 2014 14:56 UTC: Task Claimed

I would like to work on this task.

Luke on December 29 2014 14:58 UTC: Claim Removed

The claim on this task has been removed, someone else can claim it now.

Duckie on December 29 2014 20:53 UTC: Task Claimed

I would like to work on this task.

Deepak on December 29 2014 20:55 UTC: Task Assigned

This task has been assigned to Duckie. You have 100 hours to complete this task, good luck!

Duckie on January 2 2015 07:17 UTC: Claim Removed

The claim on this task has been removed, someone else can claim it now.

Duckie on January 2 2015 07:17 UTC: Task Claimed

I would like to work on this task.

Mihai Neacsu on January 2 2015 07:24 UTC: Task Assigned

This task has been assigned to Duckie. You have 100 hours to complete this task, good luck!

Duckie on January 5 2015 16:24 UTC: Claim Removed

The claim on this task has been removed, someone else can claim it now.

Duckie on January 5 2015 16:24 UTC: Task Claimed

I would like to work on this task.

Deepak on January 5 2015 16:26 UTC: Task Assigned

This task has been assigned to Duckie. You have 100 hours to complete this task, good luck!

Duckie on January 9 2015 06:00 UTC: Claim Removed

The claim on this task has been removed, someone else can claim it now.

Duckie on January 9 2015 06:00 UTC: Task Claimed

I would like to work on this task.

Deepak on January 9 2015 06:01 UTC: Task Assigned

This task has been assigned to Duckie. You have 100 hours to complete this task, good luck!

Duckie on January 10 2015 05:42 UTC: Ready for review

The work on this task is ready to be reviewed.

Duckie on January 10 2015 05:45 UTC: Note to Submission

Hello,


I just wanted to mention something about my submission. Some of the numbers in the document were too unclear to read, so I had to guess them.


- Duckie 

Sean on January 11 2015 07:47 UTC: Task Closed

Congratulations, this task has been completed successfully.

Sean on January 11 2015 07:49 UTC: how long?

Duckie,


This looks like excellent work. How long did this one take you? Note that there are several other transcription tasks if you're interested! And there are more that can be added if we run out... ;)