AI

Salesforce Study Finds LLM Agents Flunk CRM and Confidentiality Tests 21

A new Salesforce-led study found that LLM-based AI agents struggle with real-world CRM tasks, achieving only 58% success on simple tasks and dropping to 35% on multi-step ones. They also demonstrated poor confidentiality awareness. "Agents demonstrate low confidentiality awareness, which, while improvable through targeted prompting, often negatively impacts task performance," a paper published at the end of last month said. The Register reports: The Salesforce AI Research team argued that existing benchmarks failed to rigorously measure the capabilities or limitations of AI agents, and largely ignored an assessment of their ability to recognize sensitive information and adhere to appropriate data handling protocols.

The research unit's CRMArena-Pro tool is fed a data pipeline of realistic synthetic data to populate a Salesforce organization, which serves as the sandbox environment. The agent takes user queries and decides between an API call or a response to the users to get more clarification or provide answers.

"These findings suggest a significant gap between current LLM capabilities and the multifaceted demands of real-world enterprise scenarios," the paper said. [...] AI agents might well be useful, however, organizations should be wary of banking on any benefits before they are proven.
Businesses

The US Navy Is More Aggressively Telling Startups, 'We Want You' (techcrunch.com) 20

An anonymous reader quotes a report from TechCrunch: While Silicon Valley executives like those from Palantir, Meta, and OpenAI are grabbing headlines for trading their Brunello Cucinelli vests for Army Reserve uniforms, a quieter transformation has been underway in the U.S. Navy. How so? Well, the Navy's chief technology officer, Justin Fanelli, says he has spent the last two and a half years cutting through the red tape and shrinking the protracted procurement cycles that once made working with the military a nightmare for startups. The efforts represent a less visible but potentially more meaningful remaking that aims to see the government move faster and be smarter about where it's committing dollars.

"We're more open for business and partnerships than we've ever been before," Fanelli told TechCrunch in a recent episode of StrictlyVC Download. "We're humble and listening more than before, and we recognize that if an organization shows us how we can do business differently, we want that to be a partnership." Right now, many of these partnerships are being facilitated through what Fanelli calls the Navy's innovation adoption kit, a series of frameworks and tools that aim to bridge the so-called Valley of Death, where promising tech dies on its path from prototype to production. "Your granddaddy's government had a spaghetti chart for how to get in," Fanelli said. "Now it's a funnel, and we are saying, if you can show that you have outsized outcomes, then we want to designate you as an enterprise service."

In one recent case, the Navy went from a Request for Proposal (RFP) to pilot deployment in under six months with Via, an eight-year-old, Somerville, Massachusetts-based cybersecurity startup that helps big organizations protect sensitive data and digital identities through, in part, decentralization, meaning the data isn't stored in one central spot that can be hacked. (Another of Via's clients is the U.S. Air Force.) The Navy's new approach operates on what Fanelli calls a "horizon" model, borrowed and adapted from McKinsey's innovation framework. Companies move through three phases: evaluation, structured piloting, and scaling to enterprise services. The key difference from traditional government contracting, Fanelli says, is that the Navy now leads with problems rather than predetermined solutions. "Instead of specifying, 'Hey, we'd like this problem solved in a way that we've always had it,' we just say, 'We have a problem, who wants to solve this, and how will you solve it?'" Fanelli said.

Businesses

Obscure Chinese Stock Scams Dupe American Investors by the Thousands (msn.com) 34

Thousands of American investors have lost millions of dollars to sophisticated pump-and-dump schemes involving small Chinese companies listed on Nasdaq, prompting the Justice Department to declare the fraud a priority under the Trump administration's white-collar enforcement program.

The scams recruit victims through social media ads and WhatsApp messages, directing them to purchase shares in obscure Chinese firms whose stock prices are artificially inflated before collapsing. Since 2020, nearly 60 China-based companies have conducted initial public offerings on Nasdaq raising $15 million or less each, with more than one-third experiencing sudden single-day price drops exceeding 50%. In one recent case, seven traders earned over $480 million by defrauding 600 victims who purchased shares in China Liberal Education Holdings.
Businesses

Dutch Court Confirms Apple Abused Dominant Position in Dating Apps (yahoo.com) 8

A Dutch court on Monday confirmed a 2021 consumer watchdog's ruling saying that Apple had abused its dominant position by imposing unfair conditions on providers of dating apps in the App Store. From a report: The Rotterdam District Court ruled that the Dutch Authority for Consumers and Markets (ACM) was therefore right to impose an order subject to a penalty for non-compliance. The court ruled that ACM was right in finding that dating app providers had to use Apple's own payment system, were not allowed to refer to payment options outside the App Store, and had to pay a 30% commission (15% for small providers) to Apple.
Transportation

Walmart's Drone Deliveries Expand, Now in Five Different US States (cnbc.com) 44

"Walmart is bringing drone deliveries to three more states," reports CNBC: On Thursday, the big-box retailer said it plans to launch the speedier delivery option at 100 stores in Atlanta, Charlotte, Houston, Orlando and Tampa within the coming year. With the expansion, Walmart's drone deliveries will be available in a total of five states: [parts of northwest] Arkansas, Florida, Georgia, North Carolina and [the Dallas-Fort Worth area of] Texas... The drone operator will have an up to a six-mile range from stores.
Walmart tells CNBC the most frequently delivered items include ice cream, fresh fruit, and pet food, as well as "urgent items, such as hamburger buns for a cookout, eggs to make brownies or Tylenol or cold medicine needed when sick." It's all part of Walmart's effort to compete with Amazon: With more than 4,600 Walmart stores across the U.S., the retailer has used its large footprint to get online orders to customers faster. It has an Express Delivery service that drops purchases at customers' doors in as fast as 30 minutes, along with InHome, a subscription-based service, that puts items directly into people's fridges. The company began same-day prescription deliveries last fall and has expanded the service across the country.... Walmart stores have an assortment of over 150,000 items in a location. Over 50% of those can be delivered by drone, said Greg Cathey [Walmart's senior VP for U.S. transformation and innovation]...

Walmart's drone delivery count so far is modest. The company did not share the specific count, but said it has racked up a total of more than 150,000 drone deliveries since 2021.

United States

New York State Begins Asking Employers to Offically Identify Layoffs Caused by AI (entrepreneur.com) 32

The state of New York is "asking companies to disclose whether AI is the reason for their layoffs," reports Entrepreneur: The move applies to New York State's existing Worker Adjustment and Retraining Notification (WARN) system and took effect in March, Bloomberg reported. New York is the first state in the U.S. to add the disclosure, which could help regulators understand AI's effects on the labor market.

The change takes the form of a checkbox added to a form employers fill out at least 90 days before a mass layoff or plant closure through the WARN system. Companies have to select whether "technological innovation or automation" is a reason for job cuts. If they choose that option, they are directed to a second menu where they are asked to name the specific technology responsible for layoffs, like AI or robots.

Transportation

17-Year-Old Student Builds 3D-printed Drone In Garage, Interests DoD and MIT (yahoo.com) 63

"Cooper Taylor is only 17 years old, but he's already trying to revolutionize the drone industry," writes Business Insider: His design makes the drone more efficient, customizable, and less expensive to construct, he says. He's built six prototypes, 3D printing every piece of hardware, programming the software, and even soldering the control circuit board. He says building his drone cost one-fifth of the price of buying a comparable machine, which sells for several thousand dollars. Taylor told Business Insider he hopes that "if you're a first responder or a researcher or an everyday problem solver, you can have access to this type of drone."

His innovation won him an $8,000 scholarship in April at the Junior Science and Humanities Symposium, funded by the Defense Department. Then, on May 16, he received an even bigger scholarship of $15,000 from the US Navy, which he won after presenting his research at the Regeneron International Science and Engineering Fair...

It all started when Taylor's little sister got a drone, and he was disappointed to see that it could fly for only about 30 minutes before running out of power. He did some research and found that a vertical take-off and landing, or VTOL, drone would last longer. This type of drone combines the multi-rotor helicopter style with the fixed wings of an airplane, making it extremely versatile. It lifts off as a helicopter, then transitions into plane mode. That way, it can fly farther than rotors alone could take it, which was the drawback to Taylor's sister's drone. Unlike a plane-style drone, though, it doesn't need a runway, and it can hover with its helicopter rotors.

Taylor designed a motor "that could start out helicopter-style for liftoff, then tilt back to become an airplane-style motor," according to the article.

And now this summer he'll be "working on a different drone project through a program with the Reliable Autonomous Systems Lab at the Massachusetts Institute of Technology."

Thanks to Slashdot reader Agnapot for sharing the news.
United States

Executives from Meta, OpenAI, and Palantir Commissioned Into the US Army Reserve (theregister.com) 184

Meta's CTO, Palantir's CTO, and OpenAI's chief product officer are being appointed as lieutenant colonels in America's Army Reserve, reports The Register. (Along with OpenAI's former chief revenue officer).

They've all signed up for Detachment 201: Executive Innovation Corps, "an effort to recruit senior tech executives to serve part-time in the Army Reserve as senior advisors," according to the official statement. "In this role they will work on targeted projects to help guide rapid and scalable tech solutions to complex problems..." "Our primary role will be to serve as technical experts advising the Army's modernization efforts," [Meta CTO Andrew Bosworth] said on X...

As for Open AI's involvement, the company has been building its ties with the military-technology complex for some years now. Like Meta, OpenAI is working with Anduril on military ideas and last year scandalized some by watering down its past commitment to developing non-military products only. The Army wasn't answering questions on Friday but an article referenced by [OpenAI Chief Product Officer Kevin] Weil indicated that the four will have to serve a minimum of 120 hours a year, can work remotely, and won't have to pass basic training...

"America wins when we unite the dynamism of American innovation with the military's vital missions," [Palantir CTO Shyam] Sankar said on X. "This was the key to our triumphs in the 20th century. It can help us win again. I'm humbled by this new opportunity to serve my country, my home, America."

AI

Enterprise AI Adoption Stalls As Inferencing Costs Confound Cloud Customers 18

According to market analyst firm Canalys, enterprise adoption of AI is slowing due to unpredictable and often high costs associated with model inferencing in the cloud. Despite strong growth in cloud infrastructure spending, businesses are increasingly scrutinizing cost-efficiency, with some opting for alternatives to public cloud providers as they grapple with volatile usage-based pricing models. The Register reports: [Canalys] published stats that show businesses spent $90.9 billion globally on infrastructure and platform-as-a-service with the likes of Microsoft, AWS and Google in calendar Q1, up 21 percent year-on-year, as the march of cloud adoption continues. Canalys says that growth came from enterprise users migrating more workloads to the cloud and exploring the use of generative AI, which relies heavily on cloud infrastructure.

Yet even as organizations move beyond development and trials to deployment of AI models, a lack of clarity over the ongoing recurring costs of inferencing services is becoming a concern. "Unlike training, which is a one-time investment, inference represents a recurring operational cost, making it a critical constraint on the path to AI commercialization," said Canalys senior director Rachel Brindley. "As AI transitions from research to large-scale deployment, enterprises are increasingly focused on the cost-efficiency of inference, comparing models, cloud platforms, and hardware architectures such as GPUs versus custom accelerators," she added.

Canalys researcher Yi Zhang said many AI services follow usage-based pricing models that charge on a per token or API call basis. This makes cost forecasting hard as the use of the services scale up. "When inference costs are volatile or excessively high, enterprises are forced to restrict usage, reduce model complexity, or limit deployment to high-value scenarios," Zhang said. "As a result, the broader potential of AI remains underutilized." [...] According to Canalys, cloud providers are aiming to improve inferencing efficiency via a modernized infrastructure built for AI, and reduce the cost of AI services.
The report notes that AWS, Azure, and Google Cloud "continue to dominate the IaaS and PaaS market, accounting for 65 percent of customer spending worldwide."

"However, Microsoft and Google are slowly gaining ground on AWS, as its growth rate has slowed to 'only' 17 percent, down from 19 percent in the final quarter of 2024, while the two rivals have maintained growth rates of more than 30 percent."
Java

UK Universities Sign $13.3 Million Deal To Avoid Oracle Java Back Fees (theregister.com) 30

An anonymous reader quotes a report from The Register: UK universities and colleges have signed a framework worth up to 9.86 million pounds ($13.33 million) with Oracle to use its controversial Java SE Universal Subscription model, in exchange for a "waiver of historic fees due for any institutions who have used Oracle Java since 2023." Jisc, a membership organization that runs procurement for higher and further education establishments in the UK, said it had signed an agreement to purchase the new subscription licenses after consultation with members. In a procurement notice, it said institutions that use Oracle Java SE are required to purchase subscriptions. "The agreement includes the waiver of historic fees due for any institutions who have used Oracle Java since 2023," the notice said.

The Java SE Universal Subscription was introduced in January 2023 to an outcry from licensing experts and analysts. It moved licensing of Java from a per-user basis to a per-employee basis. At the time, Oracle said it was "a simple, low-cost monthly subscription that includes Java SE Licensing and Support for use on Desktops, Servers or Cloud deployments." However, licensing advisors said early calculations to help some clients showed that the revamp might increase costs by up to ten times. Later, analysis from Gartner found the per-employee subscription model to be two to five times more expensive than the legacy model.

"For large organizations, we expect the increase to be two to five times, depending on the number of employees an organization has," Nitish Tyagi, principal Gartner analyst, said in July 2024. "Please remember, Oracle defines employees as part-time, full-time, temporary, agents, contractors, as in whosoever supports internal business operations has to be licensed as per the new Java Universal SE Subscription model." Since the introduction of the new Oracle Java licensing model, user organizations have been strongly advised to move off Oracle Java and find open source alternatives for their software development and runtime environments. A survey of Oracle users found that only one in ten was likely to continue to stay with Oracle Java, in part as a result of the licensing changes.

Biotech

23andMe's Founder Anne Wojcicki Wins Bid For DNA Testing Firm (reuters.com) 11

Anne Wojcicki, co-founder of 23andMe, has regained control of the bankrupt DNA-testing company after a nonprofit she controls outbid Regeneron Pharmaceuticals with a $305 million offer. The company filed for bankruptcy in March due to declining demand and fallout from a major 2023 data breach.

"The agreement with non-profit TTAM Research Institute is the result of a final round of bidding that occurred earlier today between TTAM and Regeneron Pharmaceuticals," the company said in a statement.
Businesses

GameStop CEO Says The Company's Future Isn't In Games (gamespot.com) 30

GameStop is leaning heavily to trading cards as part of its future strategy, according to CEO Ryan Cohen. The news comes as a part of larger strategy shift to buy and hold a lot of bitcoin. From a report: Cohen has said that continuing to focus on trading cards, including the incredibly popular recent Pokemon card sets, is a "natural extension" of GameStop's business. He added that the collectibles could have potential for high profit margins.

Pokemon cards have a seen a gigantic resurgence recently. Stores regularly sell of sets, including the Destined Rivals set that launched on May 30. Cards have become increasingly hard to find as scalpers buy up supply and sell Pokemon card products -- including cards, special boxes, and accessories -- at exorbitant prices.

The Almighty Buck

Walmart and Amazon Are Exploring Issuing Their Own Stablecoins (msn.com) 51

Walmart and Amazon are exploring the possibility of issuing their own stablecoins in the United States, WSJ reported Friday, potentially shifting billions of dollars in transaction volume away from traditional banks and card networks. The retail giants, along with Expedia Group and several airlines, have recently discussed launching corporate stablecoins that would allow them to circumvent the existing payments infrastructure dominated by Visa and Mastercard.

The companies' final decisions hinge on passage of the Genius Act, legislation currently moving through Congress that would establish a regulatory framework for stablecoins. These digital currencies maintain a one-to-one exchange ratio with dollars and are backed by cash or Treasury reserves, offering merchants the potential for faster payment settlement and significantly reduced processing fees compared to traditional card transactions that can take days to clear.
AI

Salesforce Blocks AI Rivals From Using Slack Data (theinformation.com) 9

An anonymous reader shares a report: Slack, an instant-messaging service popular with businesses, recently blocked other software firms from searching or storing Slack messages even if their customers permit them to do so, according to a public disclosure from Slack's owner, Salesforce.

The move, which hasn't previously been reported, could hamper fast-growing artificial intelligence startups that have used such access to power their services, such as Glean. Since the Salesforce change, Glean and other applications can no longer index, copy or store the data they access via the Slack application programming interface on a long-term basis, according to the disclosure. Salesforce will continue allowing such firms to temporarily use and store their customers' Slack data, but they must delete the data, the company said.

Advertising

Amazon Doubles Prime Video Ads to 6 Minutes Per Hour 79

Amazon has quietly doubled the ad load on Prime Video to 4-6 minutes per hour, up from the 2-3.5 minutes initially discussed when ads launched in 2024. AdWeek reports: According to six ad buyers and documents reviewed by ADWEEK, the current ad load on Prime Video now ranges from four to six minutes per hour. And while that could bring down CPMs, buyers will be watching whether this impacts user experience. "Prime Video ad load has gradually increased to four to six minutes per hour," an Amazon representative wrote to an ad buyer in an email obtained by ADWEEK. The exchange occurred earlier this month.

The increase, which Amazon had telegraphed to investors but has not publicly acknowledged to consumers, gives the company significantly more inventory to sell across its rapidly expanding streaming business. "They told us the ad load would be increasing," said Kendra Tang, programmatic supervisor at Rain the Growth Agency. "That's been confirmed recently when we noticed more avails in the system."
Government

CISA Loses Another Senior Exec (theregister.com) 34

An anonymous reader quotes a report from The Register: The US Cybersecurity and Infrastructure Security Agency has lost another senior leader: executive director Bridget Bean departed on Wednesday. Bean, who served as the de facto agency boss for five months between former CISA director Jen Easterly's departure in January and Madhu Gottumukkala's appointment to the deputy director post last month, said she was "officially retiring from Federal service once again" in a LinkedIn post. "My time at CISA has been truly remarkable," she wrote. "Having had the privilege to serve as the Senior Official Performing the Duties of Director of CISA for 5 months has been a profound honor."

CISA's executive leadership page now lists Gottumukkala as its acting director, and the agency remains without a Senate-confirmed leader. President Trump nominated Sean Plankey to serve as the agency's director, and his nomination is scheduled for consideration (PDF) by the Senate's Homeland Security and Governmental Affairs Committee today. However, his appointment still requires a full Senate vote. Senator Ron Wyden (D-OR) has said he will continue to block Plankey's confirmation until CISA releases an unclassified report on American telecommunications networks' weak security.

At the time of her departure, Bean had spent three and a half years with CISA and more than three decades with the federal government, including a job as the Federal Emergency Management Agency's third-ranking official. Before accepting the executive director post, she was CISA's first chief integration officer. In this position, she "led the integration of the agency's operations and ensured CISA's frontline of regional staff seamlessly supported the critical infrastructure that Americans rely on every hour of every day," according to her bio on the agency's website. [...] Bean's retirement comes during a talent exodus from CISA -- and other federal government agencies -- with some folks getting fired and others taking the Trump administration's buyout offer to resign from public service. As of May 30, the heads of five of CISA's six operational divisions and six of its 10 regional offices had left the agency, and around 1,000 people, nearly one-third of its total staff, have reportedly left CISA since Trump took office.

Businesses

Native-Immigrant Entrepreneurial Synergies 23

The abstract of a study on NBER: We examine the performance of startups co-founded by immigrant and native teams. Leveraging unique data linking startups to founders' and employees' employment and education histories, we find native-migrant teams outperform native-only and migrant-only teams.

Native-migrant startups have larger employment three years after founding, are more likely to secure funding, access larger funding rounds, and achieve more successful exits. An instrumental variables strategy based on native shares in university-degree programs confirms native-migrant teams are larger and more likely to receive funding. Superior access to diverse labor pools, successful VCs, and expanded product markets are key factors in driving native-migrant outperformance.
Businesses

Canva Now Requires Use of LLMs During Coding Interviews 85

An anonymous reader quotes a report from The Register: Australian SaaS-y graphic design service Canva now requires candidates for developer jobs to use AI coding assistants during the interview process. [...] Canva's hiring process previously included an interview focused on computer science fundamentals, during which it required candidates to write code using only their actual human brains. The company now expects candidates for frontend, backend, and machine learning engineering roles to demonstrate skill with tools like Copilot, Cursor, and Claude during technical interviews, Canva head of platforms Simon Newton wrote in a Tuesday blog post.

His rationale for the change is that nearly half of Canva's frontend and backend engineers use AI coding assistants daily, that it's now expected behavior, and that the tools are "essential for staying productive and competitive in modern software development." Yet Canva's old interview process "asked candidates to solve coding problems without the very tools they'd use on the job," Newton admitted. "This dismissal of AI tools during the interview process meant we weren't truly evaluating how candidates would perform in their actual role," he added. Candidates were already starting to use AI assistants during interview tasks -- and sometimes used subterfuge to hide it. "Rather than fighting this reality and trying to police AI usage, we made the decision to embrace transparency and work with this new reality," Newton wrote. "This approach gives us a clearer signal about how they'll actually perform when they join our team."
The initial reaction among engineers "was worry that we were simply replacing rigorous computer science fundamentals with what one engineer called 'vibe-coding sessions,'" Newton said.

The company addressed these concerns with a recruitment process that sees candidates expected to use their preferred AI tools, to solve what Newton described as "the kind of challenges that require genuine engineering judgment even with AI assistance." Newton added: "These problems can't be solved with a single prompt; they require iterative thinking, requirement clarification, and good decision-making."
The Internet

Abandoned Subdomains from Major Institutions Hijacked for AI-Generated Spam (404media.co) 17

A coordinated spam operation has infiltrated abandoned subdomains belonging to major institutions including Nvidia, Stanford University, NPR, and the U.S. government's vaccines.gov site, flooding them with AI-generated content that subsequently appears in search results and Google's AI Overview feature.

The scheme, reports 404 Media, posted over 62,000 articles on Nvidia's events.nsv.nvidia.com subdomain before the company took it offline within two hours of being contacted by reporters. The spam articles, which included explicit gaming content and local business recommendations, used identical layouts and a fake byline called "Ashley" across all compromised sites. Each targeted domain operates under different names -- "AceNet Hub" on Stanford's site, "Form Generation Hub" on NPR, and "Seymore Insights" on vaccines.gov -- but all redirect traffic to a marketing spam page. The operation exploits search engines' trust in institutional domains, with Google's AI Overview already serving the fabricated content as factual information to users searching for local businesses.
AI

Large Language Models, Small Labor Market Effects (nber.org) 18

The abstract of a study featured on NBER: We examine the labor market effects of AI chatbots using two large-scale adoption surveys (late 2023 and 2024) covering 11 exposed occupations (25,000 workers, 7,000 workplaces), linked to matched employer-employee data in Denmark.

AI chatbots are now widespread -- most employers encourage their use, many deploy in-house models, and training initiatives are common. These firm-led investments boost adoption, narrow demographic gaps in take-up, enhance workplace utility, and create new job tasks. Yet, despite substantial investments, economic impacts remain minimal. Using difference-in-differences and employer policies as quasi-experimental variation, we estimate precise zeros: AI chatbots have had no significant impact on earnings or recorded hours in any occupation, with confidence intervals ruling out effects larger than 1%. Modest productivity gains (average time savings of 3%), combined with weak wage pass-through, help explain these limited labor market effects. Our findings challenge narratives of imminent labor market transformation due to Generative AI.

Slashdot Top Deals