Hi!
Purple is the color.
My name is Tomasz Kustra, and I am from Poland.
I am interested in this project.
getting ALL product listing will be rather not possible, because;
- of amazons data organization - there are no full listings of all products, every product is placed on more than one
- of time - getting all those data needs time and resources like proxies, server.
It just happened that I am involved right now in quite similar project, and it needs dozens of proxies and its own server. And it still can get only a fraction of amazon listing per day.
It is basic calculation - in long term you want be able to get more than one page per second from one IP, often it is even slower.
So 4000 pages per hour = 100.000 pages per day, each page can contain one (product page) or up to 20 products (listing), so you can't get more than 2mln products per day. And from listings you can get only basic information, for full ones you need products pages.
And there is over 500mln products.
and listings contain duplicated products, so you have to get even more data.
So 25 proxies only to get listings, and 500 proxies to get whole info.
not to mention how much transfer it would need (up to 3TB per day for 500 proxies)
So your project will have to be hardly cut.
Look into my reviews and let me know if you are interested, than we can talk about details and price.
Regards Tomasz