PDF Extraction Sample Clauses

PDF Extraction. The approach by Xxxxx et al. [5, 6] works with PDF files, in particular those making use of Type 1 fonts. The tool is able to extract the name, size, font name and style, baseline coordinates and bounding box of each character within the file. To achieve this, image processing of a rasterised version of the file is carried out in conjunction with analysis of the PDF. Initially, a clip is made, which discovers the bounding box of every glyph. The dimensions of the clip, along with its page number and associated file, are passed to the PDF extractor. The extractor uses this information to find the correct page within the file in order to start the extraction process.

Each page within a PDF file has a content stream, which holds instructions for placing and displaying characters, lines and images, along with a number of other resources, including font dictionaries, which contain, directly or indirectly, the name and family of a font, the names of all of its characters and various other information. In the first pass over the file, all of this information is collated. In the second pass, the content stream is processed. Each time a text environment is entered within a content stream, the font and font size are both stated, followed by a command that transforms the current position and the byte value for each character to be displayed. When the procedure encounters byte values, the appropriate character is obtained from the font extracted in the first pass. The remaining instructions within the text environment are then processed sequentially.

In addition to text environments, content streams also contain drawing environments. These are also processed, as the lines drawn here often represent characters, such as fraction lines and other extendible characters. At this stage, even though the exact information on the characters within the PDF file has been obtained, the exact bounding boxes of the characters are not known.
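The second pass described above can be illustrated with a minimal sketch. It is not the authors' extractor: it assumes a heavily simplified content stream that uses only the standard PDF operators BT, ET, Tf, Td and Tj, and it assumes the first pass has already produced a hypothetical `FONTS` table mapping byte values to glyph names and advance widths. The names `FONTS`, `TOKEN` and `extract_chars` are illustrative only.

```python
import re

# Hypothetical output of the first pass: for each font resource, a map from
# byte value to glyph name and to advance width (in 1/1000 of the font size).
FONTS = {
    "F1": {
        "names": {65: "A", 66: "B"},
        "widths": {65: 722, 66: 667},
    }
}

# A token is either a parenthesised string or a run of non-space characters.
TOKEN = re.compile(r"\((?:[^()\\]|\\.)*\)|\S+")
OPERATORS = {"BT", "ET", "Tf", "Td", "Tj"}


def extract_chars(stream, fonts):
    """Walk a simplified content stream; return one record per character."""
    chars, operands = [], []
    font, size = None, 0.0
    line_x = line_y = x = y = 0.0
    in_text = False
    for tok in TOKEN.findall(stream):
        if tok not in OPERATORS:
            operands.append(tok)  # operands precede their operator
            continue
        if tok == "BT":  # enter a text environment
            in_text = True
            line_x = line_y = x = y = 0.0
        elif tok == "ET":  # leave the text environment
            in_text = False
        elif in_text and tok == "Tf":  # select font and size
            font, size = operands[0].lstrip("/"), float(operands[1])
        elif in_text and tok == "Td":  # move to the start of a new line
            line_x += float(operands[0])
            line_y += float(operands[1])
            x, y = line_x, line_y
        elif in_text and tok == "Tj":  # show a string, byte by byte
            for byte in operands[0][1:-1].encode("latin-1"):
                name = fonts[font]["names"].get(byte, "?")
                chars.append({"name": name, "font": font, "size": size,
                              "x": x, "y": y})
                # advance the baseline position by the glyph width
                x += fonts[font]["widths"].get(byte, 0) / 1000 * size
        operands = []
    return chars


for ch in extract_chars("BT /F1 10 Tf 100 700 Td (AB) Tj ET", FONTS):
    print(ch)
```

The sketch ignores text matrices, kerning arrays, string escapes and drawing environments, all of which a real extractor would need to handle.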
Whilst the baseline information is sufficient for recognising simple text on a given line, more accurate data is required when trying to calculate 2D relationships between characters, which is necessary for formula recognition. This task is completed by registering the characters with the bounding boxes obtained during image analysis. This involves overlaying the connected components and character information, implementing special rules to deal with symbols consisting of more than one connected component, such as , =, and symbo...
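As a rough illustration of the registration step, the sketch below attaches connected-component boxes to extracted characters on a single text line. The matching rule used here (a component belongs to the character whose horizontal span contains the component's centre, and all matched components are merged) is an assumption made for the example; the special rules the authors actually apply are not given in the text above. The names `union` and `register`, and the `advance` field, are hypothetical, and both inputs are assumed to be in the same coordinate system already.

```python
def union(boxes):
    """Smallest box (x0, y0, x1, y1) that encloses all the given boxes."""
    return (min(b[0] for b in boxes), min(b[1] for b in boxes),
            max(b[2] for b in boxes), max(b[3] for b in boxes))


def register(chars, components):
    """Give each character the union of the components centred in its span.

    chars: dicts with 'name', 'x' (baseline start) and 'advance' (width).
    components: (x0, y0, x1, y1) boxes found by image analysis.
    """
    registered = []
    for ch in chars:
        lo, hi = ch["x"], ch["x"] + ch["advance"]
        # a component matches when its horizontal centre lies in the span
        hits = [b for b in components if lo <= (b[0] + b[2]) / 2 <= hi]
        registered.append({**ch, "bbox": union(hits) if hits else None})
    return registered


chars = [{"name": "x", "x": 0.0, "advance": 5.0},
         {"name": "equal", "x": 6.0, "advance": 8.0}]
components = [(0.5, 0.0, 4.5, 4.0),    # the letter x
              (6.5, 1.0, 13.0, 1.5),   # lower bar of the equals sign
              (6.5, 3.0, 13.0, 3.5)]   # upper bar of the equals sign
for ch in register(chars, components):
    print(ch["name"], ch["bbox"])
```

Merging every matched component is what lets a symbol made of two separate bars receive a single bounding box; a fuller version would also compare vertical positions so that characters on different lines are not confused.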
Related to PDF Extraction

  • System Logging The system must maintain an automated audit trail which can identify the user or system process which initiates a request for PHI COUNTY discloses to CONTRACTOR or CONTRACTOR creates, receives, maintains, or transmits on behalf of COUNTY, or which alters such PHI. The audit trail must be date and time stamped, must log both successful and failed accesses, must be read only, and must be restricted to authorized users. If such PHI is stored in a database, database logging functionality must be enabled. Audit trail data must be archived for at least 3 years after occurrence.

  • RE-WEIGHING PRODUCT Deliveries are subject to re-weighing at the point of destination by the Authorized User. If shrinkage occurs which exceeds that normally allowable in the trade, the Authorized User shall have the option to require delivery of the difference in quantity or to reduce the payment accordingly. Such option shall be exercised in writing by the Authorized User.

  • Meter Testing Company shall provide at least twenty-four (24) hours' notice to Seller prior to any test it may perform on the revenue meters or metering equipment. Seller shall have the right to have a representative present during each such test. Seller may request, and Company shall perform, if requested, tests in addition to the every fifth-year test and Seller shall pay the cost of such tests. Company may, in its sole discretion, perform tests in addition to the fifth year test and Company shall pay the cost of such tests. If any of the revenue meters or metering equipment is found to be inaccurate at any time, as determined by testing in accordance with this Section 10.2 (Meter Testing), Company shall promptly cause such equipment to be made accurate, and the period of inaccuracy, as well as an estimate for correct meter readings, shall be determined in accordance with Section 10.3 (Corrections).

  • SAMPLE (If applicable and the project has specifications, insert the specifications into this section.)

  • Laboratory Testing All laboratories selected by UPS Freight for analyzing Controlled Substances Testing will be HHS certified.

  • Compressed Work Week The Company and Union recognize the concept of the compressed work week. It is further understood that the compressed work week conditions will apply only to those departments that are on the compressed work week.

  • Access Toll Connecting Trunk Group Architecture 9.2.1 If CSTC chooses to subtend a Verizon access Tandem, CSTC’s NPA/NXX must be assigned by CSTC to subtend the same Verizon access Tandem that a Verizon NPA/NXX serving the same Rate Center Area subtends as identified in the LERG. 9.2.2 CSTC shall establish Access Toll Connecting Trunks pursuant to applicable access Tariffs by which it will provide Switched Exchange Access Services to Interexchange Carriers to enable such Interexchange Carriers to originate and terminate traffic to and from CSTC’s Customers. 9.2.3 The Access Toll Connecting Trunks shall be two-way trunks. Such trunks shall connect the End Office CSTC utilizes to provide Telephone Exchange Service and Switched Exchange Access to its Customers in a given LATA to the access Tandem(s) Verizon utilizes to provide Exchange Access in such LATA. 9.2.4 Access Toll Connecting Trunks shall be used solely for the transmission and routing of Exchange Access to allow CSTC’s Customers to connect to or be connected to the interexchange trunks of any Interexchange Carrier which is connected to a Verizon access Tandem.

  • Random Testing Notwithstanding any provisions of the Collective Agreement or any special agreements appended thereto, section 4.6 of the Canadian Model will not be applied by agreement. If applied to a worker dispatched by the Union, it will be applied or deemed to be applied unilaterally by the Employer. The Union retains the right to grieve the legality of any imposition of random testing in accordance with the Grievance Procedure set out in this Collective Agreement.

  • Solution The Supplier’s contractually committed technical approach for solving an information technology business objective and associated Requirements as defined and authorized by the scope of the Contract or any order or Statement of Work issued under the Contract. Solution means all Supplier and Supplier’s third-party providers’ components making up the Solution, including but not limited to Software, Product, configuration design, implementation, Supplier-developed interfaces, Services and Work Product.

  • Calibration The comparison of a measurement system or device of unverified accuracy with a measurement system of known and greater accuracy to detect deviation of the unverified measurement system from required performance specifications (of the unverified measurement system or device) and to quantify all measured values to applicable units of the international system of units.
