Data mining in the cloud

Do you need structured data? Our Artificial intelligence will gather it from the web.

JJill runs an e-shop.

  • Her users want to search products by params. e.g. sort digital cameras by resolution:
  • Her users want correct parameters and the newest products

Digital camera Type Res.
Canon PS S100 ? ?
Nikon S3200 Compact 24 Mpx
Canon EOS M DSLR 18 Mpx
Samsung NX210 ? ?
Nikon Coolpix S800C Compact 160 Mpx

She has some data but collecting it is:

  • expensive
  • labor intensive
  • error-prone

 

Digital camera Type Res.
Canon PS S100 Compact 12 Mpx
Nikon S3200 DSLR 24 Mpx
Canon EOS M DSLR 18 Mpx
Samsung NX210 DSLR 20 Mpx
Nikon Coolpix S800C Compact 16 Mpx

She tries MagicTable

  • we automatically learn structure of her data
  • crawl the relevant information from the web
  • check for any error
  • add new product

Structure analysis

Our algorithm examines your data

  • strings/categories - What values are possible?
  • numbers and units - What is correct?

Focused crawling
& natural language

We gather relevant web pages by our crawler.
We analyse the texts and identity information.

Machine learning

We combine the data from all the web pages and from your own data to correct bad values and add new products.