bluebones.net – Page 16 – Adventures in Computer Programming

September 25, 2008July 27, 2010

How Soon Is Now?

I know they can’t win, but when does this end?

Update: Wired says Muxtape was “most heartbreaking death of 2008?

August 2, 2008July 27, 2010

PHP Wrapper for Google Maps API Geocoding

My PHP wrapper for the Google Maps API geocoding service. See below the code for the back story. Please do report bugs or ask questions in the comments.

Code

Example:


<?php

/*
A PHP wrapper for the Google Maps API geocoding services.
Requires json_decode be available.
Licensed under the MIT/X11 license (see below).
Thomas David Baker, <bakert@gmail.com>

Example:

// Can take streetnames ("Broadway"), longer addresses ("High Street, Kensington"), 
// postcodes/zipcodes ("SW1A 1AA", "90210") or points of interest ("Buckingham Palace", "Mount Everest").

$results = Geocoder::simpleGeocode("Broadway");
foreach ($results as $result) {
    echo $result['address'] . "\n";
    echo $result['longitude'] . "\n";
    echo $result['latitude'] . "\n";
}
*/

class Geocoder {
    
    // Use your google maps API key here, or provide it as a parameter on each call.
    const API_KEY = null;
    // You may want to change this here, or you can provide it as a parameter on each call.
    const HOST = "maps.google.co.uk";

    // Get an array of possible geocoding matches for an address or an empty array if none found.
    // Matches are of the type array('q' => <original search string>, 'address' => <best effort at a street address', 
    // 'longitude' => <longitude>, 'latitude' => <latitude>
    // Return value of null signals an error somewhere along the way.
    public static function simpleGeocode($addr, $host=self::HOST, $key=self::API_KEY) {
        $data = self::geocode($addr, $host, $key);
        if (! ($data && $data['Status']['code'])) {
            return null;
        }
        $statusCode = $data['Status']['code'];
        if ($statusCode == "602" || ! $data['Placemark']) {
            return array();
        } else if ($statusCode != "200") {
            return null;
        }
        $result = array();
        foreach ($data['Placemark'] as $placemark) {
           $result[] = self::parsePlacemark($placemark);
        }
        return $result;
    }

    // Get the Google Maps API JSON output as an assoc. array for the specified address.
    // Return value of null means the data could not be retrieved, false means could not be decoded.
    public static function geocode($addr, $host=self::HOST, $key=self::API_KEY) {
        if (! $key) { throw new Exception("Add your Google Maps API key to the source to use this function without passing it as a parameter."); }
        $url = "http://" . self::HOST . "/maps/geo?output=json&oe=utf-8&q=" . urlencode($addr) . "&key=" . $key;
        $json = file_get_contents($url);
        if (! $json) { return null; }
        $data = json_decode($json, true);
        if (! $json) { return false; }
        return $data;
    }

    // Takes a member of the Google Maps API 'Placemark' array and converts it to something flatter and more manageable.
    // Return value is assoc array with keys 'address', 'longitude' and 'latitude'
    public static function parsePlacemark($placemark) {
        $result = array();
        $result['address'] = $placemark['address'];
        $coordinates = (($placemark['Point']['coordinates']) ? $placemark['Point']['coordinates'] : array());
        $result['longitude'] = $coordinates[0];
        $result['latitude'] = $coordinates[1];
        return $result;
    }

}

/*
Copyright (c) 2008 Thomas David Baker

Permission is hereby granted, free of charge, to any person
obtaining a copy of this software and associated documentation
files (the "Software"), to deal in the Software without
restriction, including without limitation the rights to use,
copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the
Software is furnished to do so, subject to the following
conditions:

The above copyright notice and this permission notice shall be
included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND,
EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES
OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND
NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT
HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY,
WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING
FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR
OTHER DEALINGS IN THE SOFTWARE.
*/

Back Story …

Every since Google made their Google Maps API geocoding services available for the UK last year I’ve been meaning to turn off the handmade scraper I use on London Cinema and use that.

Today London Cinema sent me an email because the scraper had identified the canonical form of a user-entered address as being and it couldn’t find a latitude and longitude for it:

hotels%5Cx26ie%3DUTF8%5Cx26hl%3Den%5Cx26f%3Dq%5Cx26sampleq%3D1%22+onclick%3D%22return+loadUrl%28this.href%29%22%5Cx3ehotels%5Cx3c%2Fa%5Cx3e%5Cx3cbr%2F%5Cx3e%5Cx3ca+href%3D%22%2Fmaps%3Fq%3Dhotels+in+manchester%2C+lancashire%5Cx26ie%3DUTF8%5Cx26hl%3Den%5Cx26f%3Dq%5Cx26sampleq%3D1%22+onclick%3D%22return+loadUrl%28this.href%29%22%5Cx3ehotels+in+manchester%2C+lancashire%5Cx3c%2Fa%5Cx3e%5Cx3c%2Fdiv%5Cx3e%5Cx3cfont+size%3D%22+1%22%5Cx3eLocation%3A%5Cx3c%2Ffont%5Cx3e%5Cx3cbr%2F%5Cx3e%5Cx3cform+id%3Drnl_form+action%3D%22%2Fmaps%22+onSubmit%3D%22document.getElementById%28%27user_q%27%29.value+%3Ddocument.getElementById%28%27q_d%27%29.value%22%5Cx3e%5Cx3cinput+type%3Dtext+size%3D22+id%3Drnl_near+name%3Dnear+%2F%5Cx3e+%5Cx3cinput+type%3Dhidden+id%3Duser_q+name%3Dq+value%3D%22%22%2F%5Cx3e%5Cx3cinput+type%3Dhidden+name%3Df+value%3Dp+%2F%5Cx3e%5Cx3cinput+type%3Dsubmit+name%3DbtnG+value%3D%22Search+Maps%22%2F%5Cx3e%5Cx3cbr%2F%5Cx3e%5Cx3cfont+size%3D-1%5Cx3e%5Cx3cinput+type%3Dcheckbox+name%3Drl+checked+value%3D1+%2F%5Cx3e+Make+this+my+default+location%5Cx3cbr%2F%5Cx3e%5Cx3c%2Ffont%5Cx3e%5Cx3c%2Fform%5Cx3e%5Cx3cbr%2F%5Cx3e%5Cx3cb%5Cx3eExamples%3A%5Cx3c%2Fb%5Cx3e%5Cx3cbr%2F%5Cx3e%5Cx26nbsp%3B%5Cx26nbsp%3B%5Cx3cb%5Cx3e%5Cx26%23183%3B%5Cx3c%2Fb%5Cx3e+%5Cx3cspan+dir%3Dltr%5Cx3eglasgow%5Cx3c%2Fspan%5Cx3e%5Cx3cbr%2F%5Cx3e%5Cx26nbsp%3B%5Cx26nbsp%3B%5Cx3cb%5Cx3e%5Cx26%23183%3B%5Cx3c%2Fb%5Cx3e+%5Cx3cspan+dir%3Dltr%5Cx3ebuckingham+palace+road+SW1%5Cx3c%2Fspan%5Cx3e%5Cx3cbr%2F%5Cx3e%5Cx3cbr%5Cx3e%5Cx3cspan+id%3Dfeatco+class%3D%22noprint+hdr%22+style%3Dfont-weight%3Abold%5Cx3eBrowse+popular+maps%5Cx3c%2Fspan%5Cx3e%5Cx3cdiv+id%3Dfc_0+class%3Dnoprint+style%3D%22padding-top%3A+2pt%22%5Cx3e%5Cx3ca+href%3D%22%2Fmaps%2Fms%3Fmsa%3D0

That seemed like a good reason to get around to it!

June 28, 2008

Long Pause ssh-ing to Ubuntu Hardy

I just installed Hardy Heron on a webserver and found ssh over the LAN paused for a long time after connecting and before asking for the password (or connecting with ssh key).

The problem is some kind of reverse DNS lookup that doesn’t work on the LAN. After some digging and going down some blind alleys with Kerberos I found the answer buried in this thread.

On the machine being connected to (remote side/server side)

/etc/ssh/sshd_config

UseDNS no

April 28, 2008June 7, 2008

String Representation of XML Objects in PHP

There’s got to be an easier way to do this.

Perhaps I am unsophisticated. But sometimes when I am debugging I just want to print strings to see what is going on. When working with PHP’s DOM XML stuff, this is difficult. var_dump and print_r don’t do what I’d like. What I really want is just to see the XML of the DOMDocument or the DOMElement or the node list or whatever it is I happen to have in my variable (I may not even know).

There may be a much easier way to get a string representation of an arbitrary XML object in PHP. If so, please link it up in the comments. Failing that, here’s a rough pass at the kind of function I need:

    function printXml($xml) {
        $s = self::xmlToString($xml);
        print "<pre>" . htmlentities($s) . "</pre>";
    }

    function xmlToString($xml) {
        if ($xml instanceof DOMDocument) {
            $s = $xml->saveXml();
        } else if ($xml->length) {
            $s = '';
            foreach ($xml as $element) {
                $s .= self::xmlToString($element);
            }
        } else {
            $s = self::xmlToStringProper($xml);
        }
        return $s;
    }

    function xmlToStringProper($node) {
        $dom = new DOMDocument();
        $xmlContent = $dom->importNode($node, true);
        $dom->appendChild($xmlContent);
        return $dom->saveXml();
    }

April 15, 2008June 23, 2008

History Meme

11:41:13 bakert@bluebones:~$  history | awk '{a[$2]++}END{for(i in a){print a[i] " " i}}' | sort -rn | head
4611 vi
3728 u
3367 cd
2817 ruby
1832 ls
1589 svn
1230 td
852 mysql
790 ssh
635 gd

April 10, 2008June 13, 2008

My First Program

I wrote my first programs at the age of 8. They are preserved on Tom 1, a 15 minute cassette tape. It starts with copied listings from the ZX Spectrum Introduction book. But before long there is a program called ‘askage’ which I’m pretty sure is an original composition. Here’s the code in it’s entirety:

10 PRINT "How old are you?"
20 INPUT A$
30 PRINT "Really? You look much older than that!"

April 4, 2008June 7, 2008

Programmatically Find Tests With PHPUnit

Nothing annoys me more than having to manually add tests to a central location in order to get them to run. Here’s some code that automatically and recursively (down the directory tree) finds files ending in Test.php and loads the tests within. Now all you need to do is create the tests.

< ?php

require_once 'PHPUnit/Framework.php';
require_once 'PHPUnit/TextUI/TestRunner.php';

define(Test, '/path/to/test/dir');

class AllTests {
    public static function main() {
        PHPUnit_TextUI_TestRunner::run(self::suite());
    }

    public static function suite() {
        $suite = new PHPUnit_Framework_TestSuite('My Test Suite');
        foreach (self::find_all_tests(TEST) as $path) {
            require_once($path);
            $class = preg_replace('%.*/(.*)\.php%', '$1', $path);
            $suite->addTestSuite($class);
        }
        return $suite;
    }

    private static function find_all_tests($start_dir) {
        $res = array();
        foreach (glob("$start_dir/*Test.php") as $path) {
            $res[] = $path;
        }
        foreach (glob($start_dir . '/*', GLOB_ONLYDIR) as $subdir) {
            $res = array_merge($res, self::find_all_tests($subdir));
        }
        return $res;
    }
}

?>

March 15, 2008June 7, 2008

PEAR and MAMP

To use PEAR and MAMP together run the following command:

/Applications/MAMP/bin/php5/bin/pear config-show

Look up the value of “PEAR directory” in the output and add this directory to include_path in /Applications/MAMP/conf/php5/php.ini

Change the php5 in the paths to php4 if you are using that version for some godforsaken reason.

March 2, 2008March 7, 2012

Deal or No Deal Player Selection Is Not Random

Fairly obvious one, this. UK Deal or No Deal contestant’s names flash up at the beginning of the show as if one is being selected at random. However, we can see very easily that this is not really happening.

If selection was random (one of 22 potential contestants randomly selected at the start of the show) 9.3% of contestants would have to wait 50 or more shows for their turn. 1 in 100 contestants would have to wait 97 shows or more. In practice, this doesn’t happen. There have currently been more than 500 contestants and only one contestant (Lucy Harrington) has had to wait as long as 50 shows (she waited exactly 50). Filming 3 shows per day (15 per week), it would not be practical for contestants to ever wait much longer than 30 shows.

Spoiling my not-very-amazing detective work, Producer Glenn Hugill is on record as saying:

“No it’s not random. For some reason the Radio Times said it was, but that didn’t come from us. It’s always been a selection otherwise we can’t guarantee people that they will play within a reasonable time period/have people in the audience etc. The players for each week are selected on Monday and confirmed in threes each weekday morning.”

But I didn’t read that until after I’d written a little ruby program to tell me how long I might have to wait under random conditions!

Here’s a table of wait times in a truly random scenario:

Will sit out 0 shows ... 4.5 percent chance (cumulative: 4.5)
Will sit out 1 show ... 4.3 percent chance (cumulative: 8.9)
Will sit out 2 shows ... 4.1 percent chance (cumulative: 13.0)
Will sit out 3 shows ... 4.0 percent chance (cumulative: 17.0)
Will sit out 4 shows ... 3.8 percent chance (cumulative: 20.8)
Will sit out 5 shows ... 3.6 percent chance (cumulative: 24.4)
Will sit out 6 shows ... 3.4 percent chance (cumulative: 27.8)
Will sit out 7 shows ... 3.3 percent chance (cumulative: 31.1)
Will sit out 8 shows ... 3.1 percent chance (cumulative: 34.2)
Will sit out 9 shows ... 3.0 percent chance (cumulative: 37.2)
Will sit out 10 shows ... 2.9 percent chance (cumulative: 40.1)
Will sit out 11 shows ... 2.7 percent chance (cumulative: 42.8)
Will sit out 12 shows ... 2.6 percent chance (cumulative: 45.4)
Will sit out 13 shows ... 2.5 percent chance (cumulative: 47.9)
Will sit out 14 shows ... 2.4 percent chance (cumulative: 50.2)
Will sit out 15 shows ... 2.3 percent chance (cumulative: 52.5)
Will sit out 16 shows ... 2.2 percent chance (cumulative: 54.7)
Will sit out 17 shows ... 2.1 percent chance (cumulative: 56.7)
Will sit out 18 shows ... 2.0 percent chance (cumulative: 58.7)
Will sit out 19 shows ... 1.9 percent chance (cumulative: 60.6)
Will sit out 20 shows ... 1.8 percent chance (cumulative: 62.4)
Will sit out 21 shows ... 1.7 percent chance (cumulative: 64.1)
Will sit out 22 shows ... 1.6 percent chance (cumulative: 65.7)
Will sit out 23 shows ... 1.6 percent chance (cumulative: 67.3)
Will sit out 24 shows ... 1.5 percent chance (cumulative: 68.7)
Will sit out 25 shows ... 1.4 percent chance (cumulative: 70.2)
Will sit out 26 shows ... 1.4 percent chance (cumulative: 71.5)
Will sit out 27 shows ... 1.3 percent chance (cumulative: 72.8)
Will sit out 28 shows ... 1.2 percent chance (cumulative: 74.1)
Will sit out 29 shows ... 1.2 percent chance (cumulative: 75.2)
Will sit out 30 shows ... 1.1 percent chance (cumulative: 76.4)
Will sit out 31 shows ... 1.1 percent chance (cumulative: 77.4)
Will sit out 32 shows ... 1.0 percent chance (cumulative: 78.5)
Will sit out 33 shows ... 1.0 percent chance (cumulative: 79.4)
Will sit out 34 shows ... 0.9 percent chance (cumulative: 80.4)
Will sit out 35 shows ... 0.9 percent chance (cumulative: 81.3)
Will sit out 36 shows ... 0.9 percent chance (cumulative: 82.1)
Will sit out 37 shows ... 0.8 percent chance (cumulative: 82.9)
Will sit out 38 shows ... 0.8 percent chance (cumulative: 83.7)
Will sit out 39 shows ... 0.7 percent chance (cumulative: 84.4)
Will sit out 40 shows ... 0.7 percent chance (cumulative: 85.2)
Will sit out 41 shows ... 0.7 percent chance (cumulative: 85.8)
Will sit out 42 shows ... 0.6 percent chance (cumulative: 86.5)
Will sit out 43 shows ... 0.6 percent chance (cumulative: 87.1)
Will sit out 44 shows ... 0.6 percent chance (cumulative: 87.7)
Will sit out 45 shows ... 0.6 percent chance (cumulative: 88.2)
Will sit out 46 shows ... 0.5 percent chance (cumulative: 88.8)
Will sit out 47 shows ... 0.5 percent chance (cumulative: 89.3)
Will sit out 48 shows ... 0.5 percent chance (cumulative: 89.8)
Will sit out 49 shows ... 0.5 percent chance (cumulative: 90.2)
Will sit out 50 shows ... 0.4 percent chance (cumulative: 90.7)
Will sit out 51 shows ... 0.4 percent chance (cumulative: 91.1)
Will sit out 52 shows ... 0.4 percent chance (cumulative: 91.5)
Will sit out 53 shows ... 0.4 percent chance (cumulative: 91.9)
Will sit out 54 shows ... 0.4 percent chance (cumulative: 92.3)
Will sit out 55 shows ... 0.4 percent chance (cumulative: 92.6)
Will sit out 56 shows ... 0.3 percent chance (cumulative: 92.9)
Will sit out 57 shows ... 0.3 percent chance (cumulative: 93.3)
Will sit out 58 shows ... 0.3 percent chance (cumulative: 93.6)
Will sit out 59 shows ... 0.3 percent chance (cumulative: 93.9)
Will sit out 60 shows ... 0.3 percent chance (cumulative: 94.1)
Will sit out 61 shows ... 0.3 percent chance (cumulative: 94.4)
Will sit out 62 shows ... 0.3 percent chance (cumulative: 94.7)
Will sit out 63 shows ... 0.2 percent chance (cumulative: 94.9)
Will sit out 64 shows ... 0.2 percent chance (cumulative: 95.1)
Will sit out 65 shows ... 0.2 percent chance (cumulative: 95.4)
Will sit out 66 shows ... 0.2 percent chance (cumulative: 95.6)
Will sit out 67 shows ... 0.2 percent chance (cumulative: 95.8)
Will sit out 68 shows ... 0.2 percent chance (cumulative: 96.0)
Will sit out 69 shows ... 0.2 percent chance (cumulative: 96.1)
Will sit out 70 shows ... 0.2 percent chance (cumulative: 96.3)
Will sit out 71 shows ... 0.2 percent chance (cumulative: 96.5)
Will sit out 72 shows ... 0.2 percent chance (cumulative: 96.6)
Will sit out 73 shows ... 0.2 percent chance (cumulative: 96.8)
Will sit out 74 shows ... 0.1 percent chance (cumulative: 96.9)
Will sit out 75 shows ... 0.1 percent chance (cumulative: 97.1)
Will sit out 76 shows ... 0.1 percent chance (cumulative: 97.2)
Will sit out 77 shows ... 0.1 percent chance (cumulative: 97.3)
Will sit out 78 shows ... 0.1 percent chance (cumulative: 97.5)
Will sit out 79 shows ... 0.1 percent chance (cumulative: 97.6)
Will sit out 80 shows ... 0.1 percent chance (cumulative: 97.7)
Will sit out 81 shows ... 0.1 percent chance (cumulative: 97.8)
Will sit out 82 shows ... 0.1 percent chance (cumulative: 97.9)
Will sit out 83 shows ... 0.1 percent chance (cumulative: 98.0)
Will sit out 84 shows ... 0.1 percent chance (cumulative: 98.1)
Will sit out 85 shows ... 0.1 percent chance (cumulative: 98.2)
Will sit out 86 shows ... 0.1 percent chance (cumulative: 98.3)
Will sit out 87 shows ... 0.1 percent chance (cumulative: 98.3)
Will sit out 88 shows ... 0.1 percent chance (cumulative: 98.4)
Will sit out 89 shows ... 0.1 percent chance (cumulative: 98.5)
Will sit out 90 shows ... 0.1 percent chance (cumulative: 98.5)
Will sit out 91 shows ... 0.1 percent chance (cumulative: 98.6)
Will sit out 92 shows ... 0.1 percent chance (cumulative: 98.7)
Will sit out 93 shows ... 0.1 percent chance (cumulative: 98.7)
Will sit out 94 shows ... 0.1 percent chance (cumulative: 98.8)
Will sit out 95 shows ... 0.1 percent chance (cumulative: 98.9)
Will sit out 96 shows ... 0.1 percent chance (cumulative: 98.9)
Will sit out 97 shows ... 0.0 percent chance (cumulative: 99.0)
Will sit out 98 shows ... 0.0 percent chance (cumulative: 99.0)
Will sit out 99 shows ... 0.0 percent chance (cumulative: 99.0)

January 21, 2008June 7, 2008

Typing

So basically, there are 4 dimensions:

Static (expressions have types) vs. dynamic (values have types)

Strong (values cannot be coerced to other types without a cast) vs. weak (the runtime performs a variety of coercions for convenience)

Latent (no type declarations) vs. manifest (type declarations)

Nominal (subtyping relations are declared explicitly) vs. structural (subtyping relations are inferred from the operations available on types)

And you can place most languages on one of these 4 axes, though several support multiple forms of typing:

Ocaml: static, strong, latent, structural typing

Haskell: static, strong, latent, structural typing, with nominal typing available via newtype and manifest typing through optional type declarations.

Erlang: dynamic, strong, latent, structural typing

Scheme: dynamic, strong, latent, structural typing, with nominal typing available in many object systems.

Common Lisp: dynamic, strong, latent or manifest typing. Same note about structural vs. nominal typing as Scheme, but nominal subtyping is used more often in practice.

Python & Ruby: dynamic, strong, latent, structural typing. Nominal subtyping is available via isinstance or Ruby equivalent, but good practice frowns upon it.

PHP: dynamic, weak, latent, nominal or structural typing. Culture is much friendlier to nominal subtyping than Python or Ruby, but it’s not required.

Java & C : mostly static, strong, manifest, nominal typing. The casts give you a form of weak-typing when necessary, and C templates are structurally typed.

C: static, generally weak, manifest, nominal typing.

Assembly: dynamic, weak, latent, structural typing.

Reddit comment by Nostrademons