For reasons I can't explain, our SiteMinder-protected web site is logging user in two different formats, one that just has the simple user name, and another that has the domain prefix; so for a given user, we have web server access logs that contain both "myname" and "MyDom//MyName".
My goal is to "normalize" these so that when I perform stats against user, I get both of these variants aggregated into one count for the one user.
Might seem like a job for a simple RegEx, BUT... notice that the cases are different! "myname" gets logged in lower case, but "MyDom//MyName" gets logged in mixed case. Through RegEx and use of Upper(), I've been able to get the two variants to display the same in a report... but they are still getting reported distinctly with separate counts. I tried to dedup based on the "normalized" value, but then it only returned one of the two variants, with only the counts for that variant (not both of them aggregated.)
Any ideas?
... View more