The websites of blablacar, flixbus, and various trains sites are hostile toward the idea of travelers getting all the data they need to plan a trip.
E.g. Flixbus is tor-hostile, but even when access is granted you cannot just ask for the cheapest trip from A to B over a range of days. The motherfuckers force us to search one day at a time and just one destination at a time.
Fuck that. How can we get the data? I know these sites have bot-hostility so scraping it seems like a huge effort.
Some countries have “open data” laws that requires sharing the data, but that only works if the gov gets the data to begin with. If the gov does not get the Flixbus data, then there is no legal requirement to share it.
Personalised pricing is evil indeed. You make an interesting point because exposing that evil likely gives an angle on why carriers resist open data which I had not considered.
When I raised the question, I did not mean to limit the request to official sources. In fact, I somewhat expect that a dataset would come from an independent 3rd party. Even if the prices are biased for a particular person, it’s relative pricing that’s most interesting anyway.
Finding the cheapest is quite useful even if there are slight markups/markdowns with whatever vendor sells the ticket. Flixbus discriminates against Americans by adding $1 to every ticket from US IP addresses, but a US dataset would still help me decide outside of the US which route is the cheapest.
Note as well that routes and schedules are useful even without accurate pricing – for BlaBlaCar in particular because people offering seats in their car have no periodic schedule.
Is there any type of browser extension that could help people crowdsource data? It’s the only way I see how to build this dataset.
I had the same idea but AFAIK it does not exist.