Inspired by this tweet from the always awesome Fermat's Library.
Challenge
In the language of your choice, write a script that will determine if two strings are an anagram for each other. You can use the approach outlined in the tweet, or — for bonus points — come up with your own solution 🤓.
Via Wikipedia:
An anagram is word or phrase formed by rearranging the letters of a different word or phrase, typically using all the original letters exactly once.
For our challenge, we'll use the strict definition — each letter must be used exactly once.
Expected Behavior:
isAnagram(stressed, desserts) // TRUE
isAnagram(happy, sad) // FALSE
Have fun!
Latest comments (38)
I feel we need SPOILER warnings for the comments
So there was a perl6 version of the using this method posted on twitter. But it had a wee bug (it was summing not multiplying the value). Here's the fixed version:
I realise any version of Perl can look like line noise so I'll break it done.
Firstly the
*@wordsindicates that all the arguments passed into the function should be put into an Array called@words.state %primesstate variables are defined once and then reused each time the function is called so the first time you call theanafunction the primes hash is populated.(^Inf).grep(*.is-prime)this gives a lazy list of integers from 0 to Infinity (but then only returns those that are prime).'A'..'Z'gives the Range of characters from A to Z (inclusive).The
Zmeta operator applies the infix operator on it's right (in this case the=>Pair constructor) to each element in the Positional objeect on it's left and right until one is used up. (That important, don't use a Z operator on a couple of lazy lists... it won't go well).So that gives us a hash starting
{ A => 2, B => 3, C => 5 ...and so on.@words.mapapplies the code block inside the map call to each item in @words. Inside the block$_is assign the current value.$_.combsplits the word into individual characters and%primes{$_.comb}references each of the characters and gets it's value in the prime array.[*]and[==]are cases of applying the[]meta operator to an infix operator. It acts like taking the list on the right and putting the operator between each item.So for the string "ABC"
%primes{$_.comb}returns the list(2 3 5)and then you have[*] (2 3 5)which can be thought of as2 * 3 * 5or30.The
[==]takes our list of products and puts==between them. In this case it makes use of Perl6 operator chaining which allow you to write something like30 == 30 == 30or5 < 10 > 7in both the cases each individual Boolean check is made and then these are all combined into a sets of AND checks.Hope that makes sense. Note that this function only works on uppercase strings. It would be easy enough to make it case insensitive and to also add any Unicode characters for accents found in the expected strings.
This algorithm is actually an excellent illustration of why clever algorithms aren't always better.
In fact the clever algorithm here is worse in basically every way imaginable. It's more complicated, limited in scope, error prone, can fail in unexpected ways, and possibly less efficient than the obvious solution.
I'm surprised that no one has talked about using
ordin python (It Returns the Unicode code point for a string) and comparing the sumIt even performs better than sorting
It performs better but it's wrong. Comparing sums can give you false positives.
Rust way, could be done better.
This just compares the char vector directly after sorting them.
C++
I like the solution to "subtract" all characters of the shorter word from the (possibly) longer word, and check if there is some character left:
Ok, this is more fun, but it somehow reimplemented the library function ( \ \ ) (which can be read as the difference operation in set theory)
I'm sorry, it took me a dinner out to see, this was unnecessarily complicated. :) The delete functions perfectly fits a right fold. And if the words aren't of equal length, they are not anagrams.
delete could also be imported from Data.List
This is close to cheating and must have terrible space and runtime complexity:
The solution based on sort:
The library functions sort and permutations seem to be such an integral part of the solution, they should be implemented, here. :)
I stick with mappings to prime numbers. Haskell, without considerations of performance:
lookup and charToPrime return a Maybe value, without catching the eye. Umlauts or digits result in a Nothing value for the character and in a product of 1 for the word. The handling of those cases could be improved.
Always happy to provide Clojure answers ;)
And of course I snooped the algorithm from all the other answers here.