Data set Collection Sample Clauses

Data set Collection. From the total available corpus (70k documents), we currently have access to ~60,000 excavation reports and related documents, such as appendices, drawings and maps. These texts have been gathered by DANS (Digital Archiving and Networked Services) in the Netherlands, over the past 20 years. We received the documents from DANS as PDF files, and have used the pdftotext tool (Glyph & Cog LLC, 1996) to convert these to plain text. This data set contains 30,152,318 lines and 657,808,600 words (as counted by the command line tool “wc”). The texts are quite diverse; the dates of publication span decades with the earlier ones having been scanned and OCRd from hardcopies created in the 80s. The other temporal variation is in how old the found artefacts are, ranging from 200,000 BC to the present. Also, the type of research can be very different between reports, some might describe a short desk evaluation of a small area without any fieldwork, while others detail huge excavations over multiple years with detailed analysis by a team of specialists. To get a representative sample across all these ranges, a random sampling strategy would not be ideal, and we instead opted to manually select documents, taking into account the variation described above. We selected a total of 15 documents as annotation candidates (~42,000 tokens). For the purposes of calculating the IAA and evaluating the annotation guide- lines, we manually selected roughly 100 sentences from these documents contain- ing all the entity types (Table 3.1, explained below) and specific difficult cases as validation set, annotated by all annotators. Entity Description Examples Artefact An archaeological object found in the ground. Axe, pot, stake, arrow head, coin Time Period A defined (archaeological) period in time. Middle Ages, Neolithic, 500 BC, 4000 BP Location A placename or (part of) an address. Amsterdam, Xxxxx- xxxxxx 0, Xxxxxxxxxx Context An anthropogenic, definable part of a stratigraphy. Something that can contain Artefacts Rubbish pit, burial mound, stake hole Material The material an Artefact is made of. Bronze, wood, flint, glass Species A species’ name (in Latin or Dutch) Cow, Corvus Corax, oak Table 3.1: Descriptions and examples for each entity type. Examples are trans- lated from Dutch.
AutoNDA by SimpleDocs
Data set Collection. ‌ From the total available corpus (70k documents), we currently have access to ~60,000 excavation reports and related documents, such as appendices, drawings and maps. These texts have been gathered by DANS (Digital Archiving and Networked Services) in the Netherlands, over the past 20 years. We received the documents from DANS as PDF files, and have used the pdftotext tool (Glyph & Cog LLC, 1996) to convert these to plain text. This data set contains 30,152,318 lines and 657,808,600 words (as counted by the command line tool “wc”). The texts are quite diverse; the dates of publication span decades with the earlier ones having been scanned and OCRd from hardcopies created in the 80s. The other temporal variation is in how old the found artefacts are, ranging from 200,000 BC to the present. Also, the type of research can be very different between reports, some might describe a short desk evaluation of a small area without any fieldwork, while others detail huge excavations over multiple years with detailed analysis by a team of specialists. To get a representative sample across all these ranges, a random sampling strategy would not be ideal, and we instead opted to manually select documents, taking into account the variation described above. We selected a total of 15 documents as annotation candidates (~42,000 tokens). For the purposes of calculating the IAA and evaluating the annotation guide- lines, we manually selected roughly 100 sentences from these documents contain- ing all the entity types (Table 3.1, explained below) and specific difficult cases as validation set, annotated by all annotators.

Related to Data set Collection

  • Data Collection Some downloaded software included in the Materials may generate and collect information about the software and usage and transmit it to Intel to help improve Intel’s products and services. This collected information may include product name, product version, time of event collection, license type, support type, installation status, hardware and software performance, and use. 9.

  • Sample Collection The collection and testing of the samples shall be performed only by a laboratory and by a physician or health care professional qualified and authorized to administer and determine the meaning of any test results. The laboratory performing the test shall be one that is certified by the National Institute of Drug Abuse (NIDA). The laboratory chosen must be agreed to between the Union and the Employer. The laboratory used shall also be one whose procedures are periodically tested by the NIDA where they analyze unknown samples sent to an independent party. The results of employee’s tests shall be made available to the Medical Review Officer. Collection of urine samples shall be conducted in a manner, which provides the highest degree of security for the sample and freedom from adulteration. Recognized strict chain of custody procedures must be followed for all samples as set by NIDA. The Union and the Employer agree that security of the biological urine samples is absolutely necessary therefore the Employer agrees that if the security of the sample is compromised in anyway, any positive test shall be invalid and may not be used for any purpose. Urine samples will be submitted as per NIDA Standards. Employees have the right for Union or legal counsel representative to be present during the submission of the sample. A split sample shall be reserved in all cases for an independent analysis in the event of a positive test result. All samples must be stored in a scientific acceptable preserved manner as established by NIDA. All positive confirmed samples and related paperwork must be retained by the laboratory for at least six (6) months or for the duration of any grievance, disciplinary action or legal proceedings whichever is longer. At the conclusion of this period, the paperwork and specimen shall be destroyed. Tests shall be conducted in a manner to ensure that an employee’s legal drug use and diet does not affect the test results.

  • Debt Collection Unpaid licensing fees and charges for cleaning, damage to property, equipment, and furnishings are an obligation by the occupant to Housing Services. Any unpaid account balances will be sent to an outside collection agency and may be reported to one or more credit bureau reporting service(s). After internal collection efforts have failed to result in full payment, and in accordance with RCW 19.16.500, collection fees of up to 50% of the unpaid balance will be assessed to your account, and you are responsible for paying these fees together with all costs and expenses, including reasonable attorney's fees and court costs, necessary for the collection of your delinquent account. Requests for future housing will be considered only if payments are current.

  • Payment and Collection Your bill will be based on monthly meter readings provided to XOOM Energy by your NGDC. If there is an error in your meter reading, XOOM Energy will adjust its bill to you upon your NGDC providing a corrected meter reading to XOOM Energy. You represent that you are financially able and willing to fulfill the terms and conditions of this Agreement and that you have not filed, are not in the process of filing or plan to begin any bankruptcy proceedings. Your first bill payment will be due to the NGDC on the date specified in the NGDC bill. If you do not pay it on time, you could be subject to interest and late charges imposed by the NGDC, and your service could be disconnected. In all events, you shall remain obligated to pay for all natural gas received by you and any interest, fees and penalties incurred by XOOM Energy. You will also be responsible for all costs, including legal fees, associated with the collection of amounts owed to XOOM Energy.

  • Billing and Collection The Originating party shall xxxx and collect such information service charges and shall remit the amounts collected to the Terminating Party less:

  • Information Collection Information collection activities performed under this award are the responsibility of the awardee, and NSF support of the project does not constitute NSF approval of the survey design, questionnaire content or information collection procedures. The awardee shall not represent to respondents that such information is being collected for or in association with the National Science Foundation or any other Government agency without the specific written approval of such information collection plan or device by the Foundation. This requirement, however, is not intended to preclude mention of NSF support of the project in response to an inquiry or acknowledgment of such support in any publication of this information.

  • Data Collection and Reporting 1. Grantee shall develop and use a local reporting unit that will provide an assigned location for all clients served within the Hospital. This information shall also be entered into Client Assignment and Registration (CARE)when reporting on beds utilized at the Hospital.

  • Collections All collections of monies or other property in respect, or which are to become part, of the Property (but not the safekeeping thereof upon receipt by PFPC Trust) shall be at the sole risk of the Fund. If payment is not received by PFPC Trust within a reasonable time after proper demands have been made, PFPC Trust shall notify the Fund in writing, including copies of all demand letters, any written responses and memoranda of all oral responses and shall await instructions from the Fund. PFPC Trust shall not be obliged to take legal action for collection unless and until reasonably indemnified to its satisfaction. PFPC Trust shall also notify the Fund as soon as reasonably practicable whenever income due on securities is not collected in due course and shall provide the Fund with periodic status reports of such income collected after a reasonable time.

  • Credit, Payment and Collection You will receive a single monthly bill for both your natural gas and the delivery of such natural gas from your utility distribution company. Payment is due by the date set forth on the invoice. Should you fail to pay the monthly bill or fail to meet any agreed upon payment arrangement, your service may be terminated in accordance with your local utility’s tariffs and your contract with XOOM may be automatically terminated, leading to XOOM seeking cost recovery fees as set out herein. You represent that you are financially able and willing to fulfill the terms and conditions of this Agreement and that you have not filed, are not in the process of filing or plan to begin any bankruptcy proceedings. If accepted as a customer, XOOM may report your payment experience. Bills not paid by their due date are subject to a late payment fee at the greater of the rate of 1.5%, or the maximum permitted by law, based on your total outstanding balance per month. XOOM will charge a $35 return check fee for all returned checks or the maximum allowed by law. XOOM may terminate your commodity service and may suspend services under procedures approved by law. In all events, you shall remain obligated to pay for all natural gas received by you and any interest, fees and penalties incurred by XOOM. You will also be responsible for all costs, including legal fees, associated with the collection of amounts owed to XOOM.

  • COLLECTION OF CHARGES 16.1 A Sector Association may request the consent of the Administrator to collect charges due from Operators to the Administrator in respect of facilities under the charging scheme.

Draft better contracts in just 5 minutes Get the weekly Law Insider newsletter packed with expert videos, webinars, ebooks, and more!