So, craigslist experiences some blowback.. [Archive]

View Full Version : So, craigslist experiences some blowback..

Tony_SS

10-30-2010, 04:04 PM

The short story is that there was a site called Craiglook. It was great, you could type in your zip code and it would search ALL of craigslist based solely on your zipcode. This was great for me, since I live in a rural area, I could search 5-10 miles away for things I needed.

So then, craigslist sent a nasty letter to craiglook, demanding they shut down for infringing their rights and so on, even though Craiglook only search CACHED rss feeds, it did not tax their servers at all. It actually brought them more users! But they shut down, people went nuts and complained to craigslist.

Now Craiglook has changed their site. Its http://claz.org/ and its now a free classified site and a very strong competitor to craigslist. I hope they really take off.. the search engine is amazing, something craigslist has always lacked.

So for those who like to buy other peoples junk or hard to find car parts, check it out!

dropit69

10-30-2010, 04:41 PM

Thanks like i really needed more sites to buy stuff..lol..also try www.backpage.com its like craigslist..

WS6

10-30-2010, 06:43 PM

does claz have an adult services section? I really need a message.

novaderrik

10-30-2010, 09:09 PM

does claz have an adult services section? I really need a message.

i could send you a private message if you really want one that bad..
i think i'll go on Craigslist and find someone to give me a massage first..

79-TA

10-30-2010, 11:01 PM

Thanks for the heads up! Now I have yet another venue to find cars to buy . . . and throw money at. Yea, I'm blaming you when I'm broke.

Tony_SS

10-31-2010, 03:37 AM

Thanks like i really needed more sites to buy stuff..lol..also try www.backpage.com its like craigslist..

claz already searches backpage. :) It also searches the ebay free classifieds.

does claz have an adult services section? I really need a message.

I wouldn't know.. I have a wife for that! Try your local Tokyo sauna. :)

wmhjr

10-31-2010, 05:07 AM

WS6

10-31-2010, 07:31 AM

claz already searches backpage. :) It also searches the ebay free classifieds.

I wouldn't know.. I have a wife for that! Try your local Tokyo sauna. :)

Well, I just moved to Pampa Texas and my initial drive around town this morning has me thinking I just made a mistake, lol. I doubt there's even a sauna in this town.

And holy crap I suck at spelling. Message, Ha!

Nessumsar

10-31-2010, 07:48 AM

There's also www.Searchtempest.com

T_Raven

10-31-2010, 08:08 AM

I use to use craiglook all the time. I was pissed when it changed to claz.org. If craigslist would let you search a distance range in the first place I never would've needed craiglook. I live in the middle of no where and I'm willing to pay shipping or go get what I want, so I like to look nationwide.

WS6

10-31-2010, 03:27 PM

I think CL would be foolish to not look at what brought people over to craiglook and then try and incorporate that into their website. I think that would be a smart move. I consider the people over at CL to be fairly bright people so I would expect it in the future.

Flash68

10-31-2010, 03:29 PM

Jaxed is a good craiglist search engine for auto related stuff.

Tony_SS

10-31-2010, 05:28 PM

Tony, it's probably not what you want to hear, but I can understand why CL would be concerned about craiglook operating the way they were. It really isn't whether or not craiglook was running queries directly against CL databases. It's probably more of a concern that it takes people away from the CL UI, which means that should CL ever decide to modify their operating model and get revenue from page based ads, craiglook undermines that. That means that in terms of company value, craiglook reduced the value of their product. CL is a for profit company, 25% owned by eBay. It's pretty hard to blame CL for wanting to protect their UI. Not saying I necessarily agree with them, but it's their company and their data.
What do you mean by 'UI'? I don't know what craigslist was claiming.. and I'm not sure how they were violated, search engines do this all the time. Craiglook was doing just that? But it sound like Craiglook folded under a bluff and became Claz. So now craiglist and ebay have more competition and users - and that's fine by me. Options are a good thing.

Well, I just moved to Pampa Texas and my initial drive around town this morning has me thinking I just made a mistake, lol. I doubt there's even a sauna in this town.
Pampa!? I thought you moved to Colorado?

There's also www.Searchtempest.com
Yes, they are good too. What confuses me is that they are still up and Craiglook is not.

I use to use craiglook all the time. I was pissed when it changed to claz.org. If craigslist would let you search a distance range in the first place I never would've needed craiglook. I live in the middle of no where and I'm willing to pay shipping or go get what I want, so I like to look nationwide.
Same here man! Don't hold your breath... they made it clear the craigslist is a LOCAL only thing. I don't see their search improving. I hope it does though.

I think CL would be foolish to not look at what brought people over to craiglook and then try and incorporate that into their website. I think that would be a smart move. I consider the people over at CL to be fairly bright people so I would expect it in the future.
That was my understanding too... why on earth would you not want that traffic? Unless of course Ebay pulled some strings.. it directly cuts in on their action, not craigslist.

wmhjr

10-31-2010, 06:10 PM

What do you mean by 'UI'? I don't know what craigslist was claiming.. and I'm not sure how they were violated, search engines do this all the time. Craiglook was doing just that? But it sound like Craiglook folded under a bluff and became Claz. So now craiglist and ebay have more competition and users - and that's fine by me. Options are a good thing.

"UI" means user interface. I'm very sure that there are patent implications and just like with college teams having to deal with high schools as well as actual licensing pirates to defend their intellectual property, CL has to look out for their own product. If they allow some organizations to violate their intellectual property they find themselves at risk of not being to enforce it in other instances. Search engines look for metadata matches. This is completely different. This other company us specifically searching for content of a privately owned company with a patent protected product (there is an amount of assumption there) and returning content only from that provider. It makes no difference if they gather that data from RSS feeds rather that directly from CL infrastructure. CL apparently feels that the content is their property, and I can understand how they would believe that it's what makes their product valuable. CL can make the argument that the other company is using CL technology without license. I would not be so certain that the letter was a "bluff". It's hard to say. The point is that unless you are familiar with patent law around information technology products, I wouldn't be so certain.

Tony_SS

11-01-2010, 05:00 AM

79-TA

11-01-2010, 10:38 AM

Craiglook was better known and so it was taken down first. That's a result of both being more visible and greater potential threat in whatever capacities already discussed. The precedent has been set; Seachtempest is still under the radar.

Jon, thanks for the tip on search tempest!

wmhjr

11-01-2010, 11:22 AM

I might believe that if craigslist tried to enforce their claims consistently. Like mentioned, there are other craigslist 'search' sites that do the same, exact thing - yet they are still operating. Maybe they didn't cave into their threats.

If we apply that logic, then any lawyer could sue any search engine for searching cached RSS files.

I really don't know what's going on behind the scenes. We're all making guesses. But remember that CL likely has patents - and has been adament that they want to stay "local". It's part of their "brand identity". They have actively opposed "non-local" CL use. So, like I said I can understand the reasoning behind some of what they're doing. Nor do either of us have a strong understanding of IT patent law, though I'm relatively familiar at a general level due to my current job. Whether you or I believe it doesn't change the fact that neither of us really knows exactly what patents they have, exactly how they felt that a particular company was violating it, and how it potentially impacts their product. Remember - all they have is their product, and their product is information entered by users. The bottom line is that it's their data. They have a right to try and protect it to suit their needs.

DEIGuy38

11-01-2010, 08:58 PM

Craiglook was great. Maybe someone watched that movie Social Network and got all butthurt so CL decided to shut down craiglook.

donaz

11-02-2010, 05:35 AM

I liked using Craiglook and getting daily e-mail notifications of new listings. Claz sends out e-mail notifications of listings it seems from everywhere except Craigslist. Without searching all the "search all craigslist" sites has anyone found a good one comparible to Craiglook that actually sends e-mail from CL listings?

TonyL

11-02-2010, 05:54 AM

you guys need to use Crazedlist.org also. You can search all of the region you are interested in, all on one page. It's very good, you do have to change the "referrer" settings in firefox or chrome, but its easy and tells you how to do this. I HIGHLY recommend that site.

TonyL

11-02-2010, 05:57 AM

Tony_SS

11-02-2010, 08:23 AM

Oh, and the reason for craiglook having to change it's name? "Craig" in the title. It's all about brand dilution, it's infringing on his brand, and contributing to confusion, people might think that Craig has something to do with it, and he doesn't. Hence lawsuity time.

I could understand that.. but I think they should have just changed the name and left the search feature. One of their claims in their C&D letter was that Craiglook was putting and extra load on their servers - which is BS since it only searched cached RSS feed.. so it did not even touch their servers.

wmhjr

11-02-2010, 09:00 AM

I could understand that.. but I think they should have just changed the name and left the search feature. One of their claims in their C&D letter was that Craiglook was putting and extra load on their servers - which is BS since it only searched cached RSS feed.. so it did not even touch their servers.

That is not true. The SEARCH may not have put extra load on. However the hits on the SEARCH RESULTS would. In other words, CL likely has their hosting design based on the assumption (and intent) that CL is "local only". However, the larger search indexing would result in people for example from NY starting to look at ads from TX. That's more volume, in an area that CL does not want. That would increase requirements for I/O, CPU, Memory and bandwidth. They DO NOT WANT people from NY searching all over the country. That is not the intent of their service, and they've been specific and clear about that.

I understand your position Tony, but it IS their company and their own proprietary property. It is their data. I know you're a private enterprise kind of guy and it surprises me a little that you have a different opinion here. No insult or criticism intended - just an observation.

Bottom line is that again - IT IS THEIR COMPANY AND THEIR PROPERTY.

Tony_SS

11-02-2010, 10:25 AM

I am pro-private enterprise and believe in intellectual property, but in regards to information property, that's where things get into a gray area. There have been absurd allegations and lawsuits from a law firm that makes it's living off of suing online forums, like this one, for someone posting links and copy from other various news outlets. Read more here (http://www.techdirt.com/articles/20100816/03474210634.shtml). And here (http://www.wired.com/threatlevel/2010/07/copyright-trolling-for-dollars/).

There has been much debate about information intellectual property. Other than using the name 'Craiglook' I don't believe they infringed on craigslist, based on what I know. We don't have all the details about situation here. Obviously it appears they were singled out, as others still exists, so I have to question that. We can only speculate on a lot things - no sense in debating the unknown unless we have some facts to go by.

But I believe there is a distinct difference between stealing and sharing information. Fair use exists for a reason. It keeps the copyright & lawfirm trolls from shaking down everyone and everything for their own greed.

wmhjr

11-02-2010, 11:02 AM

But Tony, your position is exactly opposite of one supporting private enterprise. "Craigslist" is a for profit business having a product with brand value, and the fundamental value of their product is with the data/information that people directly input into Craigslist. Without that data, their product is worthless. In order to differentiate them from the spam filled "global" or "national" players, they have instead consistently concentrated on making their product "local". They also have built an infrastructure designed to handle expected load based on the use cases and characteristics for which the product was designed. So, when somebody else comes along and attempts to take that data, which directly violates the CL temrs of use, it's a far different case than what you describe above. Trying to make CL out to be like Righthaven is ridiculous. CL isn't trying to gain revenue by use of patent enforcement. They are simply trying to protect their own product, which in case you forgot, is FREE to almost everyone.

If CL took data from multiple sources it would be a very different story. However, because CL has marketed itselt first and foremost as a "local" and "community" service, and expressly forbits any data entry that is not manual by the user, then the nature is very different. Again, in the terms of use, CL has for a long time expressly forbidden data mining of its data by external parties without the express permission of CL. To suggest that data mining of CL is "fair use" is more than a stretch. I can tell you that it does not legally hold water. It is THEIR data. Were they to choose, they could restrict viewing to paying customers. Fact is, CL has been extremely open about access, providing a service at almost no cost. Asking them to open themselves up to even more operating cost is unreasonable.

Bottom line is that participation of CL binds ALL users to the Terms of Use, which expressly forbid data mining, meaning that factually Craiglook vioated the terms of use and infringed on CL. It may not be convenient for you and I, but it is that simple. You have to ask yourself, what evil reasoning can you ascribe to this action? How exactly is CL acting like Righthaven? Where's the beef? What money did CL get for enforcing this? It's like Occums razor. Once you eliminate all the other reasons, the obvious comes to mind. And in this case trying to make CL look like the bad guy is going to be pretty tough to do.

Tony_SS

11-02-2010, 12:47 PM

So, is google 'data mining'? I'm sure I could find a lawyer who thinks they are. I'm not trying to make craigslist the boogey man - I don't think they are. I'm saying that whether or not Craiglook (now Claz) infringed on craigslist claimed property is highly debatable based on the what search engines do. This is nothing new. I understand you see it differently.

wmhjr

11-02-2010, 09:13 PM

First of all, Google does not focus specifically on one information source. Second, Google does not allow you to preset parameters which exactly match the user interface of the core technology like these specific search engines are. If you are familiar with how search engines such as google, yahoo, etc operate then you must admit that frankly they operate completely differently - frankly for one reason so that they can avoid this specific type of problem. Craiglook specifically and solely targes ONLY CL data. That is something completely different. It is in fact data mining and in no way, shape or form resembles how the search algorithms function with actual "search engines" such as google. And I mean in absolutely no way whatsoever does Craiglook (or whatever other ones exist) resemble true "search engines". None. Google, Yahoo, etc use very complicated (largely unpublished and VERY proprietary) algorithms to generate result sets with rankings based on a number of specified and unspecified variables. Craiglook simply uses the exact metadata, and passes the request to get results that have been generated by craigslist resources having completed indexing on a ton of data. Data that they solely own.

But the real problem you keep missing is that this is private data, owned by CL, and managed by CL, with specific, published terms of use that explicitly prohibit this behavior in order to protect both their brand, and their intellectual property. It costs them money to provide this service, and they have certain methods to generate revenue to offset that. Without that data and that user interface (UI) they have absolutely nothing. They believe (probably with some merit) that their brand is uniquely "local" and want to avoid "national" or "regional" activity. I have no issue whatsoever with that. Furthermore, many of these "hack" search tools do NOT simply parse RSS feeds. They attempt to use bots and spiders to gather index data. But even the ones like this one that may use RSS feeds also still add load to CL systems - in 2 ways. First of all, it is inevitable that if you're parsing RSS feeds, the content will be dated, and you'll be returning outdated results - meaning that entire tables are scanned potentially if a user clicks on one of those links. That takes processing power, and potentially memory because obviously the data will not be cached. Beyond that, as I already mentioned CL explicitly does NOT WANT multi-region searches. When Craiglook returns a result set, actually selecting one of those results does not get run against an RSS feed. It then gets run against the production system, meaning more resources are being used than designed, intended or budgeted by CL based on their INTENDED service offering. Therefore, such use generates increased cost because they have to invest in order to provide adequate performance to all, even though some of it is being consumed by what are effectively "non-customers".

That to me is not acceptable - that is, to force CL to spend money so that somebody else can use their data in a manner that their branding explicitly opposes.

wmhjr

11-02-2010, 09:24 PM

BTW, just for the record I want to explain why I'm spending time on this. I'm kind of in this business. I'm retired from my first career (active military) and working on my second. I run a group which focuses on delivering what can best be described as IT enabled services. Part of that is designing and hosting commercial, shared service, single instance systems. Unfortunately, a good deal of my time and budget is spent trying to avoid where possible, and mitigate where not possible, unintended use of our systems which costs me time, effort and money. Because IT in a very real way has become so consumer friendly, something I often refer to as the "Best Buy Syndrome" has grown. That is, because people can walk into a local Best Buy and pick up a 1TB drive for a couple hundred bucks, and get residential broadband for $50/month, they think this stuff is all real cheap to deliver with high availability in large scale. It's a good news/bad news kind of thing. People are far more aware of the potential of the technology and it has become far easier to use, but that has resulted in far FAR higher cost that nobody recognizes. Here's an example. There has been more unstructured data created and saved to storage in the past 2 days than was created and stored from this history of man to 2008 - in total. Fact. Resource utilization is not a linear function. It is an exponential function. And companies that deliver online services have to pay their bills. Last year alone I spent (business wise) over $2.1 million dollars in ONLY network circuit costs. Or that even using enterprise discounted pricing, MS SQL Server enterprise for commercial use is licensed at around $16 thousand - per core. So, a highly available clustered database with an active/passive 2 node 4 core cluster plus a disaster recovery location costs $256K annually - just for DB licensing. Don't even ask about Oracle. You know what they say - after a while you're talking real money :) I'm just saying that it's easy to point the finger and say CL is being unreasonable - so long as you don't know what it costs them to provide that free service that you're not even paying for.

Tony_SS

11-03-2010, 03:27 AM

Craiglook specifically and solely targes ONLY CL data.
Wrong. They searched Oodle, Backpage and Ebay free classifieds. It was a free classifieds search engine that searched cached files.

wmhjr

11-03-2010, 06:07 AM

Wrong. They searched Oodle, Backpage and Ebay free classifieds. It was a free classifieds search engine that searched cached files.

Didn't know that. Doesnt' matter though. The point remains intact that it's not a typical "search engine" and instead parses the RSS feed to take advantage of filtering already performed by CL. The interesting thing is that Craiglook/claz themselves prove my point. There's a page showing the impact to traffic going to CL up to, and following the change. You can see a dramatic drop in traffic - which is apparently what CL wants for reasons I already mentioned. The people I see complaining about it are typically the same people who complain that MS is evil, all software should be open source, etc. Of course, they're not complaining about how much development has happened as a result of MS as an example.

We'll just have to agree to disagree.

Tony_SS

11-03-2010, 07:14 AM

Didn't know that.

You're forming opinions based on assumptions. So what about the traffic? Does CL own their traffic too? lol You can't claim infringement based on traffic. And now you're resorting to strawman's with your Microsoft 'open source'.

Yours and CL's cries of infringement are a pretty weak case against a free search engine searching the cached files of classified sites.

Agree to disagree. Cheers.

wmhjr

11-03-2010, 07:23 AM

You're forming opinions based on assumptions. So what about the traffic? Does CL own their traffic too? lol You can't claim infringement based on traffic. And now you're resorting to strawman's with your Microsoft 'open source'.

Yours and CL's cries of infringement are a pretty weak case against a free search engine searching the cached files of classified sites.

Agree to disagree. Cheers.

No, I'm saying that the increased traffic COSTS CL MONEY! They have to pay for resources to handle that traffic. Bandwidth. Load balancer capacity. Web server capacity. Memory. Storage, as the indexing of unstructured data typically takes up between 50-150% additional storage. Firewall processing capacity. Those are facts and not assumptions. Unless there is some sort of magic pixie dust that handles the technology. It is your opinion that the argument is weak. Unfortunately, it isn't a legal opinion. I am not resorting to "strawman" arguments. I'm pointing out comparisons. I would recommend that if you're interested, you pick up a couple books on Information Technology Law, Privacy and Intellectual Property law. As I said, this is my business, so I may well have a different perspective. That being said, my perspective is based on doing this. There is not a week that goes by that I am not talking for at least a couple hours with in-house counsel AND external counsel for issues such as this. The internet is not a "free for all". It is a communications vector. Frankly, if you walk down the street and see an unlocked car with the keys in the ignition, there is nothing technical preventing you from driving it away. It is still theft. Allowing other vendors to access indexed data wholely owned by CL, indexed by CL and supported by CL, for which only the INDEX is cached and all hits are routed through CL once displayed, for the sole purpose of expanding the scope of their product and brand, when they explicitly want "their product" to be "local", which subsequently results in more cost to CL, seems to be a decision that CL should be able to make.

But, let's leave it at you have an opinion. I have a different opinion. I completely support CL in this regard and understand their position. I would likely do the exact same thing.

Tony_SS

11-03-2010, 08:54 AM

And CL is profiting off the increased traffic. That is the cost of business with higher profits. Claz is a separate private enterprise. I believe its fair use. And that is why we have fair use laws. Otherwise we would have monopolies that would own everything they could to eliminate competition. Small business and entrepreneurs wouldn't flourish or exist. JMHO. A judge could see it either way. Information property is certainly a new issue with a lot of grey area.

fantasygoat

11-03-2010, 11:35 AM

And CL is profiting off the increased traffic.

I'm curious how you think CL is profiting from increased traffic. They don't charge for most ads, they don't have any click count based revenue model. Increased traffic actually *reduces* profits because they have to pay for it.

I manage similar systems for a living so this discussion was interesting to watch unfold. You clearly don't understand that infrastructure costs money. Having some third party site scrape your site for data, without compensating you, is not good business. It is CL's prerogative to control the use of their data as they see fit, and in this regard I agree with their decision.

As a consumer I would like to have more options on CL such as area based search, but such changes should be through CL itself, not some third party they can't control.

Tony_SS

11-03-2010, 02:20 PM

Fair enough.. that assumption was made assuming they price their services (that they do sell and profit on) based on traffic volume - the same way an ad is more expensive to purchase from a magazine with a high circulation rate.

So whats the infringement with searching cached data? Its not site scraping. That is what Search Tempest does.They and other 3rd party sites, that DO site scrape still exist - yet no one want to acknowledge that. They are not under the radar either.. Unless those are taken down, it just seems odd Claz was targeted only.

fantasygoat

11-05-2010, 07:46 AM

That "cached data" has to come from CL at some point, to be cached.

I can't speak for the others, perhaps they are not big enough to appear on their network graphs.

wmhjr

11-05-2010, 09:26 AM

Fair enough.. that assumption was made assuming they price their services (that they do sell and profit on) based on traffic volume - the same way an ad is more expensive to purchase from a magazine with a high circulation rate.

I understand. So just to be clear, CL does not operate under this business model. Increased traffic actually decreases their profit to some extent. I suppose you could try to make an argument that increased visibility would result in increased success of sales, which would then result in increased use. Problem is, only a very small subset of transactions are "fee based" so it would be an extremely difficult argument to substantiate. As opposed to magazines, who see revenue from not only the advertisement segment but also from circulation, CL gets no circulation revenue whatsoever. Circulation is actually a cost center and not a profit center for them. Completely different model.

So whats the infringement with searching cached data? Its not site scraping. That is what Search Tempest does.They and other 3rd party sites, that DO site scrape still exist - yet no one want to acknowledge that. They are not under the radar either.. Unless those are taken down, it just seems odd Claz was targeted only.

That's a more difficult question, and it's one that nobody on this site can answer definitively or with any authority. A lot has to do with how their intellectual property is actually protected, and we can only make some assumptions there. I'd say from just a general perspective, a great deal has to do with their published terms of use - which frankly govern both people posting there as well as people using those posts. They have clearly established the limits of their product, and how that private data is permitted to be used. Much like NFL football games which are broadcast over the public airwaves via OTA. You are perfectly legal in tuning to those stations (I'm not talking cable, dbs, etc - just OTA). You're also OK to record them for your own use. However, you are legally prevented from replaying those in a commercial establishment without the express permission of the NFL. That structured data which has been indexed at some cost (initially and recurring) by CL is wholely and completely owned by CL. There is zero confusion about that. The entire question here is how it is allowed to be used. CL has established limits for the use of THEIR data. They have given permission for it to be used in very specific terms. Violation of those terms could be considered infringement on a number of fronts, starting with but not limited to patent or copyright infringement.