A tax collecting authority increased 40% in tax collection with Xtract.io data aggregation.

Background and business need

A principal tax collecting agency in the U.S. state was in charge of administering the state's tax laws and collection of state taxes for nearly 40 programs including 1.8% of Transient Lodging rent.

Without adequate data about the transient lodging facilities (revenue, number of rooms, exact locations, etc.) the tax collecting authority faced a challenge in monitoring whether the Transient Lodging facilities are paying tax to the government as per the occupancy and required standards.

The Department of revenue was suspecting the lodging facilities to be forging the occupancy rate by altering the number of rented spaces booked ( occupancy rate is calculated as the ratio of rented or used space to the total amount of available space).

The pressing need to aggregate information for these different categories got them to Xtract.io and they decided to implement it for their business.

The data and insights they wanted to aggregate on partnering with Xtract.io

  • To get the daily occupancy rate of the lodging facilities

  • To validate the occupancy of the lodging facilities throughout the state on a daily basis

  • Cross-check whether the taxes paid as per the occupancy rate is the same as the bookings made across online platforms

  • Monthly report creation for audit purposes

Challenges faced by the customer

The complexity involved in aggregating this data was multi-fold. Here’s a quick overview of the expertise they required from Xtract.io

  • A fully automated approach was required to find out the occupancy rate of the lodging facilities

  • The data on occupancy was required to be aggregated from 10+ third-party websites periodically (roughly 4 times a day)

  • Site-specific bots were required to crawl multiple websites

Xtract.io Solutions and analysis

The data experts at Xtract.io analyzed the challenges and implemented a step-by-step solution.


A custom-built platform

A TOT(Transient Occupancy Tracker) platform was built to monitor the transient lodging information from sites on a periodic basis. We deployed Mobito, a proprietary web-crawler platform to crawl and aggregate this data.

The changes in booking status are identified and loaded onto the TOT database against the property reference. This is done daily, four times a day.


Data aggregation from multiple websites/channels

The extraction of the data from multiple websites is done with the help of multiple site-specific bots. These multiple site-specific bots extract data from the respective site and populate the information in the database.

By using the web change monitoring bot the daily change in the online sites will be monitored and the daily occupancy can be calculated.

Here’s the overview of different bots we built to serve different purposes:

  • Multiple site-specific bots were created in Python. Each of these bots performs a specific iterative process such as extracting all links in a website, automatically downloading the HTML pages of the extracted links, etc.

  • All bots are placed sequentially and in parallel, as required & connected using a workflow that defines the flow of data from one bot to the other

  • The data is standardized and delivered through custom-built APIs


Data delivery and reports + analysis

Once the data is collected and the details of the present and the past day are found, the occupancy details are stored in the form of pdf/CSV/HTML. This is done four times a day.

TOT system helps users configure periodic reports (daily/weekly/fortnightly/monthly/quarterly) on transient occupancy for the state/regions for delivery to the user’s secure email or to secure channels (SFTP/dropbox). TOT provides a secure login for up to 6 administrative users.

Results and outcomes

Xtract.io helped the Department of Revenue identify the tax evaders by calculating the occupancy rates. The results are quantified and are as follows.

The solution covered more than 95% of all transient lodging facilities of the State

Tax evaders were identified based on variation in the tax paid and occupancy data

40% Increase in tax collection

Looking out for a similar solution?

Have a similar requirement? Get started with your data aggregation project now!

© 2024 Xtract.io Technology Solutions Pvt Ltd | All Rights Reserved | A Mobius Venture.

© 2024 Xtract.io Technology Solutions Pvt Ltd | All Rights Reserved | A Mobius Venture.