DailyWebb.com

Men’s daily Webb Round-Up

  • Home
  • About
  • Advertise
  • Disclaimer
  • Get Email Updates!
  • LINKS
  • Play Sudoku Online

News Web Sites Seek More Search Control

Posted by admin in November 29th 2007  

NEW YORK (AP) - The desire for greater control over how search engines index and display Web sites is driving an effort by leading news organizations and other publishers to revise a 13-year-old technology for restricting access.

Currently, Google Inc. (GOOG) (GOOG), Yahoo Inc. (YHOO) (YHOO) and other top search companies voluntarily respect a Web site’s wishes as declared in a text file known as “robots.txt,” which a search engine’s indexing software, called a crawler, knows to look for on a site.

The formal rules allow a site to block indexing of individual Web pages, specific directories or the entire site, though some search engines have added their own commands.

The new proposal, to be unveiled Thursday at the global headquarters of The Associated Press, seeks to have those extra commands - and more - apply across the board. Sites, for instance, could try to limit how long search engines may retain copies in their indexes, or tell the crawler not to follow any of the links that appear within a Web page.

The current system doesn’t give sites “enough flexibility to express our terms and conditions on access and use of content,” said Angela Mills Wade, executive director of the European Publishers Council, one of the groups behind the proposal. “That is not surprising. It was invented in the 1990s and things move on.”

Robots.txt was developed in 1994 following concerns that some crawlers were taxing Web sites by visiting them repeatedly or rapidly. Although the system has never been sanctioned by any standards body, major search engines have voluntarily complied.

As search engines expanded to offer services for displaying news and scanning printed books, news organizations and book publishers began to complain.

The proposed extensions, known as Automated Content Access Protocol, partly grew out of those disputes. Leading the ACAP effort were groups representing publishers of newspapers, magazines, online databases, books and journals. The AP is one of dozens of organizations that have joined ACAP.

News publishers complained that Google was posting their news summaries, headlines and photos without permission. Google claimed that “fair use” provisions of copyright laws applied, though it eventually settled a lawsuit with Agence France-Presse and agreed to pay the AP without a lawsuit filed. Financial terms haven’t been disclosed.

Wade said ACAP could thwart future legal battles and make Web sites more comfortable about putting more material online, including scholarly journals and other items requiring subscriptions.

The new ACAP commands will use the same robots.txt file that search engines now recognize. Web sites can start using them Thursday alongside the existing commands.

Like the current robots.txt, ACAP’s use would be voluntary, so search engines ultimately would have to agree to recognize the new commands. Search engines also could ignore them and leave it to courts to rule on any disputes over fair use.

Google spokeswoman Jessica Powell said the company supports all efforts to bring Web sites and search engines together but needed to evaluate ACAP to ensure it can meet the needs of millions of Web sites - not just those of a single community.

“Before you go and take something entirely on board, you need to make sure it works for everyone,” Powell said.

ACAP organizers tested their system with French search engine Exalead Inc. but had only informal discussions with others. Wade said organizers wanted to focus first on getting sites to adopt the system, figuring search engines will follow once a critical mass is reached.

Danny Sullivan, editor in chief of the industry Web site Search Engine Land, said robots.txt “certainly is long overdue for some improvements.”

But he questioned whether ACAP would do much to prevent future legal battles.

And being an initiative of news publishers, he said, it might lack attributes that blogs, online retailers and other Web sites might need in an updated robots.txt.

Francis Cave, ACAP’s technical project manager, said Thursday’s plan was only “a first stab. … We full expect we will need to add to that.”

Already contemplated is support for video files, not just text and still images. Cave said online archives such as the British Library and the Internet Archive might also need special commands.

Source: news.ask.com

Share This

under: Internet, Technology
Digg it Add to del.icio.us Stumble it add to technorati

Related Post

  • iPhone 1.1.3 Firmware Bugs (January 22nd, 2008)
  • 5 Reasons to Use WordPress as CMS (January 9th, 2008)
  • Bill Gates: the exit interview (January 8th, 2008)
  • Casual Blogging Not Just Lunch Money Now (December 27th, 2007)
  • Facebook suing Ontario porn firm (December 17th, 2007)

No Comment Received

Leave A Reply

Please Note: Comments maybe under moderation after you submit your comments so there is no need to resubmit your comment again

« Britney and Paris top Santa’s naughty list: poll
Hidden Dangers in Visiting Porn Sites »

Feeds

feeds
get latest updates on news and subscribes to our feeds

More About The Site

DailyWebb.com - It's all about men and what it takes to be a man.

Subscribes

  • PageRank Checker
  • stumble
  • technorati add aol netvibes rojo myyahoo modern freedictionary subrss chicklet plusmo newsburst ngsub wwgthis subscribes

Advertisment

1 1

Your ad here, contact us

Tags

  • babe Babes bikini boob boobs breast breasts flowers fun Funny funny jokes funny pics funny pictures girl girls girlz gossip greek jokes greeks hot hot girls hotter hotties huge humour joke jokes lingerie links model photo photos pics pictures roses sex Sexy sexy lingerie sexy pics sexy Pictures tit tits Video Videos xxx

Search

Categories

  • Uncategorized (76)
  • Technology (34)
  • World (63)
  • Sports (26)
  • Entertainment (58)
  • Auto-Moto (19)
  • Internet (22)
  • Fashion (15)
  • Celebrity (80)
  • Funny (121)
  • pictures (28)
  • Videos (31)
  • Odd Stuff (29)
  • Sexy (139)
  • Babes (146)
  • DailyBabe (52)
  • HOT Links (12)

Links

Archives

  • August 2008 (108)
  • July 2008 (84)
  • June 2008 (99)
  • May 2008 (77)
  • April 2008 (51)
  • March 2008 (39)
  • February 2008 (59)
  • January 2008 (69)
  • December 2007 (59)
  • November 2007 (102)

Pages

  • About
  • Advertise
  • Disclaimer
  • Get Email Updates!
  • LINKS
  • Play Sudoku Online

Meta

  • Login
  • Valid XHTML
  • Valid CSS
  • WordPress
Close
  • Social Web
  • E-mail
  • del.icio.us
  • Digg
  • Furl
  • Netscape
  • Yahoo! My Web
  • StumbleUpon
  • Google Bookmarks
  • Technorati
  • BlinkList
  • Newsvine
  • ma.gnolia
  • reddit
  • Windows Live
  • Tailrank
E-mail It

Recent Entries

  • DAD, there’s this Girl at School
  • Daily FUN - 3 Nurses and a Wish
  • Viviana
  • Val
  • Daily Babe - Maureen Eggleton
  • White B&W
  • barca
  • WARNING - Hot Water is very not what you would expect
  • Daily FUN - Home remedies
  • Dayana
  • Red swimwear
  • colorado model
  • Cassidy

Recent Comments

  • Anonymous in The Greek way of doing business
  • Anonym in corner
  • Girls On You Tu… in Daily FUN - Pet Hate
  • boris in Comics - Russia vs Georgia
  • Iris in Sweet elegance
  • Russia vs Georg… in Comics - Russia vs Georgia
  • Sam in Beijing 2008 Olympic Pictograms
  • z3n in The Greek way of doing business
  • The Greek way o… in The Greek way of doing business
  • Why can’t… in According to McCain: Vladimir Putin…

Most Comments

  • According to McCain: Vladimir Putin is the President of Germany?! (35)
  • The Greek way of doing business (27)
  • MUM ! I'm OK (8)
  • PHOTOS: Christina Aguilera pregnant nude (6)
  • 7 Odd - Dream Vs Reality Pictures (4)
  • 9 dead in Nebraska mall shooting (4)
  • Hulk Hogan Headed for Divorce (3)
  • Snow and Ice Plaster Midwest; 3 Killed (3)
  • Lindsay Lohan is the dumbest person in Hollywood (3)
  • Lazy Police ? here is how you should deal with them (3)
  • The Poseidon Resort - the first underwater HOTEL in the world (3)
  • Nope it's Soap - the poop look-alike SOAP // WTF (3)
Box-Tube Box Modulize WordPress Theme By Dezzain Studio
©2006-2008 DailyWebb.com
Powered by WordPress 2.3.1    Valid XHTML    Valid CSS