AWS SageMaker + Textract + ML for automated document analysis
Project details
About us:
Investment group with 30+ years of experience in emerging-market businesses. This project will be executed in one of our companies, www.megalojista.com.br.
Job details:
Looking for a machine learning professional to extract and classify document entities, build training pipelines, and perform binary classification for automated document analysis.
Create and document the custom workflow for the provided contract analysis use case.
Build and optimize the SageMaker model.
Use SageMaker Neo to compile and optimize the trained model, speeding up inference while maintaining accuracy.
Deliverables:
-SageMaker Jupyter workflow, with the model optimized and deployed in our account (entity models and business classification rules will be provided)
-SageMaker user training and review interface
-SageMaker tutorial and documentation
-S3 output bucket with the extracted data as JSON and the binary machine learning analysis result (approved / not approved)
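For reference, a minimal sketch of what one output record in the S3 bucket could look like. The field names, the "results/" prefix, and the helper functions are hypothetical and only illustrate the expected shape (extracted data plus an approved / not approved decision); the final schema is open for discussion.

```python
import json
from datetime import datetime, timezone
from typing import Dict, Optional


def build_result_record(source_key: str,
                        entities: Dict[str, str],
                        approved: bool,
                        score: Optional[float] = None) -> dict:
    """Assemble one output record (hypothetical schema, not a fixed spec)."""
    record = {
        "source_document": source_key,   # S3 key of the analyzed PDF
        "entities": entities,            # data extracted from the document
        "decision": "approved" if approved else "not approved",
        "processed_at": datetime.now(timezone.utc).isoformat(),
    }
    if score is not None:
        record["score"] = score          # optional classifier confidence
    return record


def output_key_for(source_key: str, prefix: str = "results/") -> str:
    """Derive the output object key from the input key (assumed naming rule)."""
    base = source_key.rsplit("/", 1)[-1]
    stem = base.rsplit(".", 1)[0] if "." in base else base
    return f"{prefix}{stem}.json"


def serialize(record: dict) -> str:
    """Serialize to JSON; ensure_ascii=False keeps accented characters readable."""
    return json.dumps(record, ensure_ascii=False, indent=2)


# Uploading would then be a single boto3 call, for example:
#   boto3.client("s3").put_object(Bucket=<output bucket>,
#                                 Key=output_key_for(source_key),
#                                 Body=serialize(record).encode("utf-8"))
```

Keeping the decision as a plain string next to the extracted entities would let the review interface show both without a second lookup.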
Installation and configuration:
-Amazon Augmented AI with Textract, with proper S3 bucket configuration and IAM permissions (500+ PDF files for testing and training will be provided)
-Amazon SageMaker (Python development in Jupyter notebooks) with the appropriate Lambda function calls
-AWS Machine Learning (analyze the Textract entities and use ML for binary classification, or a better model such as decision trees, to automatically approve or not approve the contract based on predefined business rules)
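To illustrate the Textract step above, here is a minimal sketch of turning a Textract forms response into plain key/value pairs that a classifier could consume. It assumes the response comes from a forms analysis (FeatureTypes=["FORMS"]) and follows the documented Blocks structure; the function names and the sample data are hypothetical.

```python
from typing import Dict, List


def _text_of(block: dict, by_id: Dict[str, dict]) -> str:
    """Join the text of a block's CHILD words and selection marks."""
    parts: List[str] = []
    for rel in block.get("Relationships", []):
        if rel["Type"] != "CHILD":
            continue
        for child_id in rel["Ids"]:
            child = by_id[child_id]
            if child["BlockType"] == "WORD":
                parts.append(child["Text"])
            elif child["BlockType"] == "SELECTION_ELEMENT":
                parts.append(child["SelectionStatus"])
    return " ".join(parts)


def extract_key_values(response: dict) -> Dict[str, str]:
    """Map each form KEY block to the text of its linked VALUE block(s)."""
    by_id = {b["Id"]: b for b in response["Blocks"]}
    pairs: Dict[str, str] = {}
    for block in response["Blocks"]:
        if block["BlockType"] != "KEY_VALUE_SET":
            continue
        if "KEY" not in block.get("EntityTypes", []):
            continue
        value_parts: List[str] = []
        for rel in block.get("Relationships", []):
            if rel["Type"] == "VALUE":
                for value_id in rel["Ids"]:
                    value_parts.append(_text_of(by_id[value_id], by_id))
        pairs[_text_of(block, by_id)] = " ".join(p for p in value_parts if p)
    return pairs


# Hand-built sample in the shape of a Textract response (illustrative data only).
sample_response = {"Blocks": [
    {"Id": "k1", "BlockType": "KEY_VALUE_SET", "EntityTypes": ["KEY"],
     "Relationships": [{"Type": "VALUE", "Ids": ["v1"]},
                       {"Type": "CHILD", "Ids": ["w1", "w2"]}]},
    {"Id": "v1", "BlockType": "KEY_VALUE_SET", "EntityTypes": ["VALUE"],
     "Relationships": [{"Type": "CHILD", "Ids": ["w3"]}]},
    {"Id": "w1", "BlockType": "WORD", "Text": "Contract"},
    {"Id": "w2", "BlockType": "WORD", "Text": "value:"},
    {"Id": "w3", "BlockType": "WORD", "Text": "12000"},
]}

print(extract_key_values(sample_response))
```

In the real workflow the response would come from Textract itself (multi-page PDFs generally need the asynchronous analysis API), and the resulting pairs would be converted to features for the binary classifier.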
Requirements:
-Real-world deployment experience in a scenario like this project's
-Experience with OCR document analysis and machine learning NLP