Dataset Statistics Sample Clauses

Dataset Statistics. To get an idea of the distribution of atoms in the datasets, the number of unique atoms in the subject, predicate, and object positions was computed and tabulated in Table 2.1. Additionally, the number of atoms that could take part in each of the three possible joins between different positions was calculated and tabulated in Table 2.2. The abbreviations S, P, and O denote subject, predicate, and object respectively.

Dataset    Uniq Subj  Uniq Pred  Uniq Obj
DBpedia    136108     8878       282199
Uniprot    592639     79         294676
SP2Bench   31629      61         81919

Table 2.1: Subject, Predicate, Object statistics

Dataset    SP  SO      PO
DBpedia    0   48085   0
Uniprot    0   105560  8
SP2Bench   0   14816   0

Table 2.2: Join statistics
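The clause does not say how these counts were produced. As a minimal sketch, assuming the triples fit in memory as (subject, predicate, object) tuples and that an atom "takes part in a join" when it occurs in both positions, the per-position and join counts could be computed as follows. The function name `position_stats` and the toy triples are hypothetical and used only for illustration.

```python
def position_stats(triples):
    """Count unique atoms per position and atoms shared between two positions."""
    subjects = {s for s, _, _ in triples}
    predicates = {p for _, p, _ in triples}
    objects = {o for _, _, o in triples}
    return {
        "uniq_subj": len(subjects),
        "uniq_pred": len(predicates),
        "uniq_obj": len(objects),
        # An atom can join two positions only if it occurs in both of them.
        "SP": len(subjects & predicates),
        "SO": len(subjects & objects),
        "PO": len(predicates & objects),
    }


# Toy data invented for illustration; not drawn from DBpedia, Uniprot, or SP2Bench.
toy_triples = [
    ("alice", "knows", "bob"),
    ("bob", "knows", "carol"),
    ("carol", "type", "Person"),
]
stats = position_stats(toy_triples)
print(stats)
```

For datasets of the size reported in the tables, the same set intersections would apply, though the sets might need to be built in a streaming or on-disk fashion.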
Dataset Statistics. The new dataset was created from a pool of 6512 application resumes at the School of Nursing at Emory University. All of the resumes were submitted for one specific job, Clinical Research Coordinator (CRC), which is divided into four levels: CRC I, CRC II, CRC III, and CRC IV. Each CRC level may include multiple distinct jobs: there are 108 jobs for CRC I, 88 jobs for CRC II, 29 jobs for CRC III, and 6 jobs for CRC IV. Each of the 6512 unique resumes corresponds to one applicant, and some applicants applied for multiple jobs within the same level or across levels, giving 25027 applications in total. Because one applicant could submit multiple applications, a further cleaning step grouped applicants by the highest level they applied for. For example, an applicant who applied for both CRC I and CRC II was grouped as a CRC II applicant. In this way, the ratio of applicants across the four levels was 28:12:8:2. Then 2025 resumes were randomly selected to form the dataset: 1134 CRC I applicants’ resumes, 486 CRC II applicants’ resumes, 324 CRC III applicants’ resumes, and 81 CRC IV applicants’ resumes. Annotation of these 2025 resumes is explained in more detail in Section 3.2.
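The grouping and sampling steps can be sketched in a few lines. This is only an illustration under stated assumptions: applications are taken to be (applicant, level) pairs, the per-level quotas are derived from the 28:12:8:2 ratio by integer division, and the helper names (`highest_level`, `quotas`, `sample_by_level`) and the toy applicants are hypothetical. The clause does not describe the actual procedure or tooling used.

```python
import random

LEVEL_RANK = {"CRC I": 1, "CRC II": 2, "CRC III": 3, "CRC IV": 4}


def highest_level(applications):
    """Map each applicant to the highest CRC level among their applications."""
    best = {}
    for applicant, level in applications:
        if applicant not in best or LEVEL_RANK[level] > LEVEL_RANK[best[applicant]]:
            best[applicant] = level
    return best


def quotas(total, ratio):
    """Split `total` across levels in proportion to `ratio`.

    Integer division floors the result, so this is exact only when
    each share divides evenly, as it does for 2025 and 28:12:8:2.
    """
    denom = sum(ratio.values())
    return {level: share * total // denom for level, share in ratio.items()}


def sample_by_level(groups, quota, seed=0):
    """Randomly pick up to quota[level] applicants from each level."""
    rng = random.Random(seed)  # fixed seed (an assumption) for repeatability
    picked = {}
    for level, count in quota.items():
        members = sorted(a for a, lvl in groups.items() if lvl == level)
        picked[level] = rng.sample(members, min(count, len(members)))
    return picked


# Toy applications invented for illustration.
toy_applications = [
    ("a1", "CRC I"), ("a1", "CRC II"),
    ("a2", "CRC I"),
    ("a3", "CRC IV"), ("a3", "CRC III"),
]
groups = highest_level(toy_applications)
sizes = quotas(2025, {"CRC I": 28, "CRC II": 12, "CRC III": 8, "CRC IV": 2})
picked = sample_by_level(groups, {"CRC I": 1, "CRC II": 1, "CRC III": 1, "CRC IV": 1})
print(groups)
print(sizes)
```

With the ratio and total stated in the clause, the quotas come out to 1134, 486, 324, and 81, matching the reported counts.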

Related to Dataset Statistics

  • Statistics 1. Each Party shall provide to the other Party statistics that are required by domestic laws and regulations, and, upon request, other available statistical information as may be reasonably required for the purpose of reviewing the operation of the air services.

  • Usage Statistics The Distributor shall ensure that the Publisher will provide access to both composite system-wide use data and itemized data for the Licensee, the Participating Institutions, individual campuses and labs, on a monthly basis. The statistics shall meet or exceed the most recent project Counting Online Usage of NeTworked Electronic Resources ("COUNTER") Code of Practice Release, including but not limited to its provisions on customer confidentiality. When a release of a new COUNTER Code of Practice is issued, the Distributor shall ensure that the Publisher will comply with the implementation time frame specified by COUNTER to provide usage statistics in the new standard format. It is more than desirable that the Standardized Usage Statistics Harvesting Initiative (SUSHI) Protocol is available for the Licensee to harvest the statistics.

  • Statistical Analysis F-tests and t-tests will be used to analyze OV and Quality Acceptance data. The F-test is a comparison of variances to determine if the OV and Quality Acceptance population variances are equal. The t-test is a comparison of means to determine if the OV and Quality Acceptance population means are equal. In addition to these two types of analyses, independent verification and observation verification will also be used to validate the Quality Acceptance test results.

  • Statistical Sampling Documentation a. A copy of the printout of the random numbers generated by the “Random Numbers” function of the statistical sampling software used by the IRO.

  • Statistical Information Any third-party statistical and market-related data included in the Registration Statement, the Time of Sale Disclosure Package and the Prospectus are based on or derived from sources that the Company believes to be reliable and accurate in all material respects.

  • Aggregated Statistics Notwithstanding anything to the contrary in this Agreement, Provider may monitor Client’s use of the Services and collect and compile Aggregated Statistics. As between Provider and Client, all right, title, and interest in Aggregated Statistics, and all intellectual property rights therein, belong to and are retained solely by Provider. Client acknowledges that Provider may compile Aggregated Statistics based on Client Data input into the Services. Client agrees that Provider may (i) make Aggregated Statistics publicly available in compliance with applicable law, and (ii) use Aggregated Statistics to the extent and in the manner permitted under applicable law; provided that such Aggregated Statistics do not identify Client or Client’s Confidential Information.

  • Data To permit evaluation of requests under paragraph (c) of this clause based on unreasonable cost, the Contractor shall include the following information and any applicable supporting data based on the survey of suppliers:

    Foreign and Domestic Construction Materials Cost Comparison

    Construction material description    Unit of measure    Quantity    Cost (dollars) *
    Item 1:
      Foreign construction material
      Domestic construction material
    Item 2:
      Foreign construction material
      Domestic construction material

    [List name, address, telephone number, and contact for suppliers surveyed. Attach copy of response; if oral, attach summary.]
    [Include other applicable supporting information.]
    [*Include all delivery costs to the construction site.]

  • Metering Data At Developer’s expense, the metered data shall be telemetered to one or more locations designated by Connecting Transmission Owner, Developer and NYISO. Such telemetered data shall be used, under normal operating conditions, as the official measurement of the amount of energy delivered from the Large Generating Facility to the Point of Interconnection.

  • Authoritative Root Database To the extent that ICANN is authorized to set policy with regard to an authoritative root server system (the “Authoritative Root Server System”), ICANN shall use commercially reasonable efforts to (a) ensure that the authoritative root will point to the top-level domain nameservers designated by Registry Operator for the TLD, (b) maintain a stable, secure, and authoritative publicly available database of relevant information about the TLD, in accordance with ICANN publicly available policies and procedures, and (c) coordinate the Authoritative Root Server System so that it is operated and maintained in a stable and secure manner; provided, that ICANN shall not be in breach of this Agreement and ICANN shall have no liability in the event that any third party (including any governmental entity or internet service provider) blocks or restricts access to the TLD in any jurisdiction.

  • Exchange Control Information Exchange control reporting is required for cash transactions exceeding AUD10,000 and for international fund transfers. If an Australian bank is assisting with the transaction, the bank will file the report on your behalf.
