Projet numéro 601 : "PRICE COMPARISON MATCHING MODULE"


Initiateur du projet :
Description projet :

 

PROJECT BACKGROUND:

Our company develops a Price comparison web service. when a visitor searches  a product, we display all the products corresponding to its search by grouping those that have identical “model”. A product is characterized among others by a category (e.g. digital camera), a brand (e.g. Sony), a model (e.g. DSC T9)…

 

Example:

A user searches on our site for a digital camera: he clicks on the “HiFi-Photo-Video” category then on “digital Camera” category. We then display all the digital cameras numerical models. For example, those whose model is “DSC T9” are gathered and the price range 319€ to 369€ is displayed.

 

The product data is sent to us from about 50 internet vendors (Virgin, Fnac ...) and affiliation networks (NetAffiliation, TradeDoubler…). This is received every night using xml or csv files that we need to proceed and then load into our database. We receive about a total of 500.000 products. The products are characterized in these flows by a category, a brand, a product name, a model, a price, a stock…

 

Example:

The flow of merchant A contains the following product data:

Category: Digital Camera
Brand: Sony

Model:
ProductName: Cybershot DSC T9
Description: This digital camera is the last born from Sony, it combines… Price: 349€

 

The flow of the merchant B contains the following product data:

Category: Camera
Brand: Sony
Model:
ProductName: DSC-T9
Description: Sony Cybershot T9 is a of 5 megapixel digital camera, it… Price: 369€

 

The flow of the merchant C contains the following product data:
Category: Numerical camera

Brand: Sony
Model: Cybershot DSC T9
ProductName: Sony T9
Description: T9 of Sony has a stabilizer of image which…
Price: 319€

 

 

PROBLEM:

There are 2 distinct problems:
1. The model is sometimes not defined in the model field, but is specified in the ProductName or in the description.
2. the same model is named in different ways (here DSC T9, DSC-T9, Cybershot DSC T9…) It is impossible for us to ask all the internet merchants to use the same model naming and to write the model always in the "model" field

 

DESCRIPTION OF THE NEED:

What we need is the development of a "matching module" to identify in this example that the products from merchants A, B and C correspond actually to the same model (same Sony Cybershot DSC T9). To do this matching, it is possible to use a database or a table that contains all "known" brands and models. This module will have to do the matching automatically every night when the product data feeds are received from the internet merchants. Part of the module can be manual to validate new brands or models. The module can be developed in any language but will need afterwards to be integrated in our platform that is based on PHP and MYSQL.

 
 



Les sujets suivants peuvent également vous intéresser

recherche consultant phpcodeur web
prestataire internetrealisation site internet
graphiste freelance paristeletravail xml
independant microsoftprogrammeur net
freelance traitement imagecherche independant developpement sites php mysql ajax

Projets "en attente" similaires


TIBCO-REPUBLICATION
site de jeux en ligne flash
Creation site de E commerce
Parametrage serveur RED5 - republication
Realisation site web - gestionnaire de demande de stages

Projets "en cours" similaires


URGENT-Retranscription en mot a mot d-interview
URGENT - Developpement d un module en Java ou Flex pour de la video - REPUBLICATION
Creation site CMS Open Source
Modification de module vtiger
Amelioration referencement site Internet - migration vers Joomla 1.5

Projets "en archive" similaires


Creation logo charte graphique
Module Dolibarr
Vectorisation05
Carte interactive FLASH projet transport en commun en Afrique
Un site de belote en ligne

Projets "VIP" similaires


site comparatif de voyages
Developpement site web complexe
Realisation magazine participatif web2.0
Recrutement video
Developpements Telecom

 

 

 

 

(c)