Welcome to Nanonets!
Welcome to the complete guide to getting started with Nanonets. Learn how to capture data from any document, in any format and in any language in the world. Automate repetitive tasks, even reading data from the most complex documents, and make your team more productive!
Whether you're looking to start a car rental service, share benefits with your loyal customers, onboard more restaurants, verify data for employees you recruit or manage your Accounts Payable, Nanonets helps you effortlessly capture data from any document you can imagine, so that you can focus on what matters to you. You decide how you work, and build the perfect setup for your needs.
Unlike a traditional data extraction system, Nanonets gives you the power of an AI-enabled, template-agnostic solution. It is highly flexible: it connects with your information, captures only what you need, and structures it as you need it, all in one place.
Who is this guide for?
Technical and non-technical folks, users who are brand-new to Nanonets, and anyone who just wants to brush up on their Nanonets knowledge.
First, sign up at app.nanonets.com to access the Nanonets universe.
All you need to do is:
1. Enter your email address
2. Choose a password
3. Verify your account using the link you receive by email
When you first open Nanonets, you'll see a panel on the left-hand side and a set of colorful icons (more on these later).
The left-hand panel contains all your key account information:
- My Models: All your models and their unique Model IDs in one place
- API Keys: Your API keys to access your model
- Billing: Contains all your billing history and information
- Team: Team members that have access to your models
Try Your First Model
The colorful icons you see when you first open Nanonets are different AI models that you can train or use right away. There are three ways to get started:
1. Build your own OCR Model
2. Build your Document Classification Model
3. Use a pre-made Nanonets Model
Building Your Own OCR Model
Let's take a fun, fictitious example: say you are starting a new restaurant business and want to capture all monthly expenses in a CSV at the end of the month.
1. Upload some pages of your document type to teach the AI Model on the Upload Files screen.
We require a minimum of 10 images to train a custom model.
2. Specify the fields that you want the AI to capture from your document on the Specify Fields screen.
In this case, your labels will look something like this -
Once you're ready, click on Start Training.
3. Show the AI Model where the fields are present on your pages.
On the first page, create rectangular boxes around the text associated with each label. Enter the name of the label (or field) you want to associate the text with, and hit Save. This process is called annotating.
We need a minimum of 10 annotations per label to train a model. We recommend starting with 50 and adding more files depending on the accuracy you see.
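Before you start training, it can help to check how many annotations each label has. The snippet below is a minimal sketch of such a check and is not part of Nanonets itself. The function name, the input format (one (file, label) pair per box drawn) and the label names are all hypothetical, and it assumes the recommended 50 refers to annotations per label.

```python
from collections import Counter

MIN_ANNOTATIONS_PER_LABEL = 10  # minimum stated in this guide
RECOMMENDED_ANNOTATIONS = 50    # assumption: the recommended 50 is per label


def annotation_report(annotations, labels):
    """Count annotations per label and classify each label.

    `annotations` is an iterable of (file_name, label) pairs, one per box drawn.
    Returns a dict mapping label -> (count, status).
    """
    counts = Counter(label for _, label in annotations)
    report = {}
    for label in labels:
        n = counts.get(label, 0)
        if n < MIN_ANNOTATIONS_PER_LABEL:
            status = "below minimum"
        elif n < RECOMMENDED_ANNOTATIONS:
            status = "trainable, below recommended"
        else:
            status = "ok"
        report[label] = (n, status)
    return report


# Hypothetical example data: file names and label names are placeholders.
example = [(f"receipt_{i}.jpg", "total_amount") for i in range(12)]
example += [(f"receipt_{i}.jpg", "vendor_name") for i in range(4)]

report = annotation_report(example, ["total_amount", "vendor_name", "date"])
for label, (n, status) in report.items():
    print(f"{label}: {n} annotations ({status})")
```

Any label reported as "below minimum" needs more annotated pages before training.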
It takes about 40 minutes before you can see some magic. Watch your inbox: we'll send you an email once your model is ready!
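To make the restaurant example concrete, here is a minimal sketch of turning captured fields into the month-end expenses CSV using only the Python standard library. It does not call Nanonets. The field names and sample records are made up for illustration, so substitute the labels you defined on the Specify Fields screen and the data your model actually extracts.

```python
import csv
import io

# Hypothetical field names; replace with your own labels.
FIELDS = ["vendor_name", "invoice_date", "total_amount"]


def expenses_to_csv(records, fields=FIELDS):
    """Serialize a list of dicts (one per document) to CSV text."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fields)
    writer.writeheader()
    for record in records:
        # Missing fields become empty cells; fields not in `fields` are dropped.
        writer.writerow({f: record.get(f, "") for f in fields})
    return buffer.getvalue()


# Made-up sample records standing in for extracted data.
records = [
    {"vendor_name": "Fresh Produce Co", "invoice_date": "2024-01-05", "total_amount": "182.40"},
    {"vendor_name": "City Utilities", "total_amount": "96.10"},
]

csv_text = expenses_to_csv(records)
print(csv_text)
```

Writing to an in-memory buffer keeps the sketch self-contained; to save a file instead, write `csv_text` to a path of your choice.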
Best Practices to Train the Model
Refer to this in-depth article to train the best-performing model for your needs: https://nanonets.com/help/ocr/best-practices-for-high-accuracy-models
Pre-made Nanonets Model
You can quickly use a pre-made Nanonets model for these document types: Invoices, Receipts / Bills, US Driver's Licenses, Passports, and Menu Cards. Simply upload your files on the Extract Data screen and review the extracted data.
Each of these models is already trained on thousands and thousands of pages and works well out of the box for any language! What's more, we can quickly enable these models for any additional language.
We frequently add new models to our Product Universe. If you can't find a model for your document type, drop an email to firstname.lastname@example.org and we'll see if we can quickly build one for you.