Is there any reg ex pattern we need to follow to validate the company name before we trigger search?
I see company names are more of mix of special characters however just concerned if we add XML tag or some character with different format (different language…)
company names are more of mix of special characters
Do you mean that they allow characters from the unicode range? Here’s an example:
FC011780 - this is COÖPERATIEVE RABOBANK U.A.
however just concerned if we add XML tag or some character with different format (different language…)
I’m not sure what you’re asking - the JSON input to search in the API is in the UTF-8 character set. The search seems to be reasonably sensible e.g. searching for “Cooperatieve Rabobank” (without umlaut o) brings up the company FC011780 above as the first item. I don’t know how far you can take this - but then I believe most company names are registered using standard ASCII characters.
I don’t know what you’re referring to with the “XML tag” part. Is that something to do with the XML gateway? If so then maybe try posting to that forum.
Ah, understand. You’re issuing a “GET” so the company name when you search should be url encoded. So special characters you input would be as per that. I don’t know how CH would interpret e.g. UTF-8 characters. You can always test e.g. get your browser / javascript to send something! My previous:
…wasn’t correct e.g. here you’re not sending in JSON, it’s url-encoded strings.
As to what data you may get out, the UTF-8 / JSON comment stands. As to odd characters which are present in company names, here’s a selection I’ve encountered:
Some more quirky names: 11363219 ; (they have exemption from use of Limited in name) 07846401 AUS %U2013 SERVICE LIMITED - did someone urlencode the n-dash by mistake when incorporating it? 11338331 COCO & ROX LIMITED - more encoding issues?
We’ve not found any emoji-names yet, probably only a matter of time…
Just as a by-the-way some of the companies listed by @dmw will not be found in the API as they’ve been dissolved more than 6 years ago (you can see them in webCHeck).