Category:Data quality rule

From CDL
Jump to: navigation, search

Business rules which are used to ensure data quality and data management performance in the Corporate Data League (complete list: Excel).

 Has description
Business partner name missingIt is necessary that each business partner has at least one name. With respect to the CDL data model it is at least required that a name of type LOCAL or INTERNATIONAL is present.
Cadastro de Pessoa Fisica invalid (Brazil)The CPF number is an identification number of Brazilian citizens emitted by the Brazilian Ministry of Revenue, which is called "Ministério da Fazenda". CPF stands for "Cadastro de Pessoa Física" (literally, physical person registration) as opposed to the CNPJ number for companies. CPF consists of C1...C11. Where C1...C9 consists of random numbers and C10, C11 are the check numbers. The check number can be calculated by the following methods.

- From right to left all digits are multiplied by a descending sequence starting with 9. - The sum of all products is computed. - The sum of step 2 is taken modulo 11. - The result of step 3 is taken modulo 10.

- The checkdigit found is appended to the number and steps 1 to 4.
Care of information misplacedCare of (typicall indicated by "c/o") information must not be maintained in the business partner's name, locality or thoroughfare but is to be managed as care of attribute. If there is care of information found as an attribute value other than "care of" the rule is violated.
EIN format invalid (United States)This rule checks the format of Employer identification number (United States) as described in the additional information tab
Fundamental address parts missingIt is necessary that an address, PO Box- or street address, comprises at least a post code or locality.
HST presence consistency (Harmonized Sales Tax number (Canada))The HST It is used in provinces where both the federal goods and services tax (GST) and the regional provincial sales tax (PST) have been combined into a single value added sales tax. HST is applied in the provinces: New Brunswick, Newfoundland and Labrador, Nova Scotia, Ontario and Prince Edward Island. This rules checks presence of only HST tax for the business entity: if there re also GST, PST identifiers available as well as HST for the business entity, the rule is violated.
Identifier Format invalid(Tax identification Number(Italy))Tax Identification Number(Italy) is known as Codice Fiscale and consists of 16 digits of characters, where C1 to C6 Alphabetic, C7, and C8 is Numeric, C9 is Alphabetic, C10, and C11 is Numeric, C12 is Alphabetic, C13 to C15 belongs to Numeric and C16 is the Numeric.

C1 C2 C3 - Are letters for the last name. C4 C5 C6 - Are letter for the name. C7 C8 - Are numbers for the year of birth. C9 - Is a letter for the month of birth. C10 C11 -Are numbers for the day of birth and sex. C12 C13 C14 C15 - Are one letter and three numbers for the Italian town or to the foreign state of birth.

C16 - Have a supervisory function. It is a checksum digit.
Identifier format invalid (Business Registration Number (China))The Chinese business registration number consists of 18 alpha-numeric digits. If there are not exactly 18 digits then the rule is violated.
  • The first number= registering authority (typically a 9, representing AIC)
  • The second number= registered entity type (enterprise, personal owned business, etc.)
  • Third to eight numbers= registering region code (highest administrative division, associated city, etc.)
  • Ninth to seventeenth numbers = organization code (likely the old tax registration number)
  • The eighteenth number= check digit (either a number or letter)
Identifier format invalid (Business number (Canada))The Canadian business number consists of exactly 9 numerical digits. The rule checks whether there are exactly 9 numerical digits available not considering possible whitespaces or other delimiters between the digits.
Identifier format invalid (CIF number (Spain))Certificado de Identificación Fiscal consists of 9 alphanumeric characters (a letter followed by 8 digits) with the following structure:

T P P N N N N N C

Being:

  • T: Organization type letter (1 alpha/letter digit) indicates the type of the organization, for example, A for Public Limited company or B for Limited Liability Company.
  • P: Provincial Code (2 numerical digits), correspond to an identifier of the geographical Spanish province in which the organization’s headquarters of the company is located. For example 28 for Madrid or 58 for Barcelona.
  • N: Sequential numbering ( 5 numerical digits) within province determined for letters P in the previous point.
  • C: Control digit (1 digit, letter or numerical).

Meaning of the first letter:

  • A Sociedad Anónima
  • B Sociedad de Responsabilidad Limitada
  • C Sociedades Colectivas
  • D Sociedades Comanditarias
  • E Comunidades de Bienes
  • F Sociedades Cooperativas
  • G Asociaciones
  • H Comunidades de propietarios en régimen de propiedad horizontal
  • J Sociedades civiles con o sin personalidad jurídica
  • P Corporaciones Locales)
  • Q Organismos Autónomos
  • R Congregaciones e instituciones religiosas
  • S Órganos de la administración del estado y de las Comunidades Autónomas
  • U Uniones Temporales de Empresas
  • V – Otros tipos no definidos en el resto de claves.
... further results
 Has description
Inconsistency between BIC FI and EU VAT ID FI (Finland)The Business identity code in Finland and the European VAT number in Finland share identical values. The numerical digits of the EU VAT are identical with the business identity code. There is an inconsistency when the 8 digits after the FI prefix of the EU VAT are different to the 8 digits of the business identity code (usually formatted like this "1234567-8", the dash and the EU VAT prefix "FI" are omitted by this rule)
Inconsistency of country and country codeCountry and country code information must be consistent according to the specified reference country names and codes
Inconsistency of post code and localityPost code and locality information must be consistent according to postal registries in a given country. The CDL uses the public GeoNames database to validate consistency of post code and locality information.
Natural person identifiedIn the CDL no natural persons are managed. This rule checks whether a record represents a natural person. In course of the CDL collaboration process these records are preventively deleted.
Post code invalidThe value of a post code must be registered as a post code in a given country. The CDL uses mainly the public GeoNames database to validate post codes.
Post code invalid (United States)This rule checks whether a given US post code is is available in the CDL reference data.
Unknown administrative area level 1This rule checks wheter a given administrative area is known by the CDL reference data. The CDL refers to the region list as maintained by ISO 3166.
VAT ID missing (Switzerland)In Switzerland each company has a National identifier (Switzerland)UID
 Has description
Company is greylistedThe rule checks whether a given business partner is known to be inactive by means of being "out of business", "in liquidation" or in a similar status. For this purpose the rule searches for information in the CDL business partner repository and in addition in several connected data sources. These are:
  • UK: Companies House
  • CH: Swiss business register
  • FR: French business register (SIREN)
  • ... further will follow
Contact information misplacedContact information is not allowed in the registered name, trade name or international name. This rule checks whether contact information is misplaced by identifying e.g. common keywords such as "attn:" or "z.Hd." and additionally parsing the company name for natural person names that are not meant to be part of the legal name (e.g. when natural person names are placed after the legal form)
Identifier format inaccurate (AFM number (Greece))The AFM number (Greece) for legal entities consists of 9 digits. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated. This rule checks the syntax, i.e. format of the AFM number (Greece) with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.
Identifier format inaccurate (Australian Company Number (Australia))The Australian Company Number (Australia) consists of 9 digits. There is a convention to display the SCN in the format XXX XXX XXX; three blocks of three characters, each block separated by a blank. This is to assist readability and the inserted blanks do not form part of the ACN. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value in the places where they are not supposed to be. If there are any whitespaces, dots or hyphens then the rule is violated. This rule checks the syntax, i.e. format of the Australian Company Number (Australia) with respect to the format. Any deviation (i.e. white spaces where they are not specified) result in a violation.
Identifier format inaccurate (Business Number (Australia))This rule checks the syntax, i.e. format of Australian Business Number (ABN) with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation. Australian Business Number (ABN) consists of 11 digits in the format: "99 999 999 999". This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces in a places where no whitespace is expected to be, hyphens or dots then the rule is violated.
Identifier format inaccurate (Business number (Canada))This rule checks the syntax, i.e. format of the Business number in Canada with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation. The Canadian business number consists of exactly 9 numerical digits. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.
Identifier format inaccurate (CIF number (Spain))This rule checks the syntax, i.e. format of the CIF number in Spain with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation. CIF number (Spain) consists of a letter followed by 8 digits. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.
Identifier format inaccurate (CNPJ number (Brazil))This rule checks the syntax, i.e. format of the CNPJ number (Brazil) with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation. The CNPJ consists of a 14-digit number formatted as 00.000.000/0001-00. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots situated in the places where they are not expected to be then the rule is violated.
Identifier format inaccurate (CUIT number (Argentina))This rule checks the syntax, i.e. format of CUIT number (Argentina) with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation. The CUIT number in Argentina consists of 11 numerical digits. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.
Identifier format inaccurate (CURP number (Mexico))CURP number (Mexico) consists of 18 characters. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, dots or hyphens then the rule is violated. This rule checks the syntax, i.e. format of the CURP number (Mexico) with respect to the format. Any deviation (i.e. white spaces where they are not specified) result in a violation.
... further results

Pages in category "Data quality rule"

The following 200 pages are in this category, out of 1,289 total.

(previous page) (next page)

I

(previous page) (next page)