Data mining in the cloud
Do you need structured data? Our Artificial intelligence will gather it from the web.
JJill runs an e-shop.
- Her users want to search products by params. e.g. sort digital cameras by resolution:
- Her users want correct parameters and the newest products
Digital camera | Type | Res. |
Canon PS S100 | ? | ? |
Nikon S3200 | Compact | 24 Mpx |
Canon EOS M | DSLR | 18 Mpx |
Samsung NX210 | ? | ? |
Nikon Coolpix S800C | Compact | 160 Mpx |
She has some data but collecting it is:
- expensive
- labor intensive
- error-prone
Digital camera | Type | Res. |
Canon PS S100 | Compact | 12 Mpx |
Nikon S3200 | DSLR | 24 Mpx |
Canon EOS M | DSLR | 18 Mpx |
Samsung NX210 | DSLR | 20 Mpx |
Nikon Coolpix S800C | Compact | 16 Mpx |
She tries MagicTable
- we automatically learn structure of her data
- crawl the relevant information from the web
- check for any error
- add new product
Structure analysis
Our algorithm examines your data
- strings/categories - What values are possible?
- numbers and units - What is correct?
Focused crawling
& natural language
We gather relevant web pages by our crawler.
We analyse the texts
and identity information.
Machine learning
We combine the data from all the web pages and from your own data to correct bad values and add new products.