[FCE] Reddit sues AI company Anthropic for allegedly ‘scraping’ user comments to train chatbot | Technology | The Guardian

收听本期播客

阅读正文

In a striking development in the technology sector, the widely-used social media platform Reddit has initiated legal proceedings against Anthropic, an artificial intelligence company. Reddit, a hub for millions of users who engage in daily discussions and share opinions, alleges that Anthropic has unlawfully utilized its vast collection of user comments to train a chatbot named Claude. The accusation centers on Anthropic’s use of automated systems, commonly known as bots, to extract data from the site—a practice called ‘scraping.’ Reddit contends that this unauthorized data collection infringes on the privacy and rights of its users.

To grasp the context of this dispute, it’s important to understand how modern artificial intelligence operates. Many AI tools, including chatbots, rely on extensive datasets often sourced from public platforms like Reddit. With approximately 100 million active users contributing content daily, Reddit offers a rich reservoir of human dialogue, making it an attractive target for companies building language models. Nevertheless, Reddit maintains that such data should only be accessed through formal agreements. The company has already established partnerships with major tech firms like Google and OpenAI, permitting the use of its data under strict conditions that prioritize user privacy and allow individuals to delete their contributions if desired.

The legal action was lodged in a San Francisco court in California. Reddit’s chief legal officer, Ben Lee, issued a firm statement highlighting the necessity for AI companies to adhere to clear guidelines regarding the use of personal information. He argued that scraping data without consent or boundaries is unacceptable and could jeopardize the individuals whose personal views are being exploited for commercial gain.

This lawsuit has reignited discussions about data privacy and the ethical implications of AI development. Many are now questioning the extent to which companies should be permitted to harvest publicly available information for profit. The resolution of this case could establish a significant precedent, potentially reshaping the operational practices of AI firms in the years ahead. As the tech community awaits Anthropic’s response to these serious allegations, the outcome remains uncertain but is sure to have a lasting impact.

This clash between innovation and privacy raises critical questions about the future of data usage in the digital age. It serves as a reminder of the delicate balance between technological advancement and the protection of individual rights.

阅读练习

1. What is the main reason Reddit is taking legal action against Anthropic?

  • A. Anthropic created a competing social media platform.
  • B. Anthropic used Reddit’s user data without permission.
  • C. Anthropic failed to invest in Reddit’s technology.
  • D. Anthropic posted harmful content on Reddit.

2. According to the article, why is Reddit’s content valuable to AI companies?

  • A. It provides advanced coding solutions.
  • B. It offers a large amount of user-generated dialogue.
  • C. It contains exclusive business data.
  • D. It has unique visual content.

3. What can be inferred about Reddit’s agreements with companies like Google and OpenAI?

  • A. They prevent users from posting on the platform.
  • B. They allow unlimited use of data without restrictions.
  • C. They include measures to protect user privacy.
  • D. They exclude the possibility of deleting user content.

4. What is Ben Lee’s attitude towards data scraping without consent?

  • A. He considers it a minor issue.
  • B. He believes it is necessary for AI progress.
  • C. He views it as unacceptable and harmful.
  • D. He thinks it should be encouraged.

5. What does the word ‘reignited’ in the third paragraph most likely mean?

  • A. Started for the first time
  • B. Brought to an end
  • C. Revived or brought back attention to
  • D. Completely ignored