Become a fan of Slashdot on Facebook

 



Forgot your password?
typodupeerror
×
Crime Stats

London's Crime Hot Spots Predicted Using Mobile Phone Data 64

KentuckyFC (1144503) writes A growing number of police forces around the world are using data on past crimes to predict the likelihood of crimes in the future. These predictions can be made more accurate by combining crime data with local demographic data about the local population. However, this data is time consuming and expensive to collect and so only updated rarely. Now a team of data experts have shown how combing crime data with data collected from mobile phones can make the prediction of future crimes even more accurate. The team used an anonymised dataset of O2 mobile phone users in the London metropolitan area during December 2012 and January 2013. They then used a small portion of the data to train a machine learning algorithm to find correlations between this and local crime statistics in the same period. Finally, they used the trained algorithm to predict future crime rates in the same areas. Without the mobile phone data, the predictions have an accuracy of 62 per cent. But the phone data increases this accuracy significantly to almost 70 per cent. What's more, the data is cheap to collect and can be gathered in more or less real time. Whether the general population would want their data used in this way is less clear but either way Minority Report-style policing is looking less far-fetched than when the film appeared in 2002.
This discussion has been archived. No new comments can be posted.

London's Crime Hot Spots Predicted Using Mobile Phone Data

Comments Filter:
  • Crime reduction is certainly a worthy reward, but as the article says, lots of people might not be too happy with having their information shared this way. Let's hope it is truly anonymous (which I doubt) and see how it goes.
    • Re:Price of safety (Score:5, Insightful)

      by CanHasDIY ( 1672858 ) on Thursday September 18, 2014 @09:45AM (#47935905) Homepage Journal

      Crime reduction is certainly a worthy reward, but as the article says, lots of people might not be too happy with having their information shared this way.

      Especially considering that said "information sharing" leads to a mere 8% increase in accuracy.

      Let's hope it is truly anonymous (which I doubt) and see how it goes.

      Let's assume that it's not, and see how it's used nefariously. That's not cynicism, that's realism.

      • The real question is "Is it worth invading privacy for any increase?" And, to be fair, it's an 8 percentage-point increase, or a 13% increase in accuracy.
        • by pnutjam ( 523990 )
          Not even an 8% increase, that press release (article) says almost 70%. It is certainly not 70% or they would be saying almost 75%.
      • Expounding on your statistics point as I agree that there is no significant increase in accuracy, notice the key phrase in the article.

        The team used an anonymised dataset of O2 mobile phone users in the London metropolitan area during December 2012 and January 2013. They then used a small portion of the data to train a machine learning algorithm to find correlations between this and local crime statistics in the same period.

        In other words, they took everything they gathered and pulled a subset that matched criteria that would back the claim that they could detect future crimes.

        Computers can surely show what law enforcement already knows. E.G. That area is a known crime area. Computers don't make tea leaf reading possible, which is the claim that both Governments and Tech companies peddling so

        • In other words, they took everything they gathered and pulled a subset that matched criteria that would back the claim that they could detect future crimes.

          While it's possible that they did in fact pull a biased sample, this methodology is what I was taught in academia as a legit way to test machine learning. If you have one sample set, first split it into two. Use one set, usually much smaller, to train the neural network. That data set, because it's tuned to find those specific correlations, obviously

          • by s.petry ( 762400 )

            They did not submit a smaller sample as academia would teach, they submitted a small set of "select" data (their words, not mine).

            If you are only teaching a bias only a bias will be understood.

            • I read "select" as meaning specific fields (as in, select types of data), not deliberately selected subsets of data... got a quote that helps clarify things?

              • by s.petry ( 762400 )
                Nope, I don't have anything not provided in TFA. I always trust that the language used is intentional. If they meant something other that what was stated they could and should have used different language.
            • by pnutjam ( 523990 )
              To be fair, that press release (article) had alot of weasel words.
      • Especially considering that said "information sharing" leads to a mere 8% increase in accuracy.

        Speaking of accuracy, it went from 62% to 70%. While indeed that is eight percentage points, it's a nearly 13% improvement.

      • Especially considering that said "information sharing" leads to a mere 8% increase in accuracy.

        Well, closer to 22%. While it's true that 8% of the predictions are more accurate, what is important is that ~22% of the predictions that used to be wrong are no longer. In much the same way as if it went to 100% accurate, you don't get to bitch about it being only a 38% increase in accuracy. You get to talk about whether it's worth the cost, and how we can get something only 62% as accurate without the cost.

      • New Dartmouth smartphone app reveals users' mental health, performance, behavior

        Dartmouth researchers and their colleagues have built the first smartphone app that automatically reveals students' mental health, academic performance and behavioral trends. In other words, your smartphone knows your state of mind -- even if you don't -- and how that affects you.

        The StudentLife app, which compares students' happiness, stress, depression and loneliness to their academic performance, also may be used in the gener

        • I don't buy it - an app that monitors every sensor, plus apparently monitoring abstract stuff like "stress level" somehow, 24/7?

          Wouldn't that pretty much lock up and drain the battery of almost every phone on the market today? Hey, maybe that's how they determined stress level - using the accelerometer to determine how hard the student threw the phone against the wall when it froze up on them for the last time.

    • by Anonymous Coward

      "Let's hope" increasingly is not good enough. There is a point where just going forward without applying what we learned or even learning from what happened before in similar situations (your "hope" here) becomes criminal negligence. If we're not past that point for everybody yet, and apparently not past it for those who should be paying attention to this sort of thing, we're certainly past it for those who do pay attention to what happens in this space.

    • by PPH ( 736903 )

      Let's hope it is truly anonymous

      Some interesting data could still be collected. If the same phone repeatedly appears near the scene of a crime, one could deduce that crimes will occur in the future in its proximity.

      From TFA:

      Their analysis shows that some mobile phone data is more important than others. For example, the data relating to whether or not the phone owner was at home, was particularly strongly correlated with crime patterns.

      Not so anonymous, IMO.

  • by badzilla ( 50355 ) <ultrak3wlNO@SPAMgmail.com> on Thursday September 18, 2014 @09:40AM (#47935867)
    More phones in an area = more people. More people = more crime.
  • fuck them. Almost 2/3 prediction from existing crime stats. Gee I know a lot of cops aren't the brightest but really? Thats not enough of a leg up?

  • by pr0t0 ( 216378 ) on Thursday September 18, 2014 @09:47AM (#47935933)

    The "machine learning algorithm" is a euphemism for three hairless teenagers floating in pools of milk.

    Watch out for the spiders.

  • Doubtful (Score:4, Interesting)

    by Anonymous Coward on Thursday September 18, 2014 @09:48AM (#47935935)

    62% to 70% isn't exactly groundbreaking for something that varies greatly. This increase looks suspiciously like selecting results for passing a statistical test instead of using a statistical test to verify the significance of a given result. Relevant xkcd: Significant [xkcd.com].

    Also, there is no such thing as anonymised phone data.

  • by Spamalope ( 91802 ) on Thursday September 18, 2014 @10:01AM (#47936015)
    This just in: If you track the location of criminal's cell phones you can predict areas at higher risk for crime.
    • Or as an alternative: If you track the location of cop's cell phones you can predict areas at higher risk for crimes, after they've been called in.

  • Any article citing statistics is invalid when they don't understand the difference between percent and per cent. Getting 62 things right per US penny is a VERY cost effective system, probably regardless of what information we want to get right.

    Unfortunately, all this says is that if we place our population under total surveillance with trackers, we can increase anticipation of crime by 8% (accuracy of 62 to ALMOST 70%). This says nothing about preventing those crimes or what type of crimes it prevents.

    • I expect it's protection against invasion of privacy is limited.

    • Any article citing statistics is invalid when they don't understand the difference between percent and per cent.

      FYI: "The one-word percent is standard in American English. Percent is not absent from other varieties of English, but most publications still prefer the two-word per cent. The older forms per-cent, per cent. (per cent followed by a period), and the original per centum have mostly disappeared from the language (although the latter sometimes appears in legal writing).

      "There is no difference betw

      • by Paco103 ( 758133 )

        Well, I will consider myself schooled! Thank you for educating me. That has always been a huge annoyance of mine and many others I know, but I guess it actually does make more sense when considering the origin of the word. I am saddened that one of my huge pet peeves is apparently unjustified, but in time I will adjust.

        On the other hand, I still love finding unnecessary quotes [unnecessaryquotes.com] in public!

  • Big Brother is watching you. Again. Even when they say they're not.
  • by Anonymous Coward

    ... in London... and in which parts...

    But we're not allowed to say...

  • If I determine that this area is more likely to have a crime and increase police presence, then the crime doesn't happen because there's too much "heat" then haven't I skewed my results?

    Or do you intend to have the cops lay low so they can "catch them in the act" or at least catch them quicker "after the fact"?
    • by dkman ( 863999 )
      As far as the 8% being insignificant, if the 8% is cheap to gain then I view that as significant.

      As for public data being collectively aggregated without permission - that's another story.

      Hell, they should be handing out cell phones for free they use your data for so much nowadays.
    • by AHuxley ( 892839 )
      The GCHQ was very aware of this in the 1960's on and did all it could to ensure people saw radomes and satellite dishes as been for tracking Soviet movements deep into Eastern Europe.
      ie not a gov ground station getting domestic calls.
      UK law enforcement and political parties where more interested in phone calls, later cell phone tracking, rapid decryption of consumer grade computer encryption and getting legally safe convictions in closed courts.
      Government Technical Assistance Centre (GCHQ Technical Assis
  • by Baby Duck ( 176251 ) on Thursday September 18, 2014 @11:18AM (#47936685) Homepage
    This is not like Minority Report at all. It predicts which locations at which times have a higher probability of a crime committing. It does not predict the particular crime, transgressor, or victim. It won't actually stop any crime from happening. The best it can do is allow a police force to more intelligently deploy their forces. They will be more able to rapidly respond to crimes after they happen, since statistically, they will more often have officers already dispatched to the nearby crime area.
    • If you have a small enough town with a small enough cell size, it should be blindingly obvious which handset IMSI numbers where usually in the area when a crime was committed.

      With enough data, you can simply map out the handset IMSI of the most probable perpetrators. There were 5 instances of a street robbery, at night, and the only common denominator is IMSI xyz that has been in the vicinity and moving around the time of all 5 robberies. It either is a totally unlucky individual or the most likely suspect.

  • by ledow ( 319597 )

    "significantly"

    I do not think that word means what you think it means.

  • Ummm, re " ... this [crime] data is time consuming and expensive to collect and so only updated rarely....": Wrong! Not at all so. Most CAD's - Computer-Aided-Dispatch - including our own free, Open Source contribution at www.ticketscad.org - do exactly that. It's inherent to the task of dispatch management. AS
  • We know that people that commit crimes are much more often from certain social and cultural backgrounds. There are untold numbers of "anecdotal evidence" around, but we don't want that to be true. So we tell ourselves white lies, blame victims, discount hundreds of incidents as "anecdotal evidence", pinpoint the few cases outside the norm and fabricate elaborate excuses about why such and such were practically forced to commit crime. We are constantly telling ourselves how we are to blame for not paying eno

"Protozoa are small, and bacteria are small, but viruses are smaller than the both put together."

Working...