{"id":2519,"date":"2018-08-31T07:30:42","date_gmt":"2018-08-31T07:30:42","guid":{"rendered":"https:\/\/devbloglavaprotocols.nityo.in\/how-can-food-trucks-owners-determine-locations-that-bring-in-the-most\/"},"modified":"2018-08-31T07:30:42","modified_gmt":"2018-08-31T07:30:42","slug":"how-can-food-trucks-owners-determine-locations-that-bring-in-the-most","status":"publish","type":"post","link":"https:\/\/www.lavaprotocols.com\/the-cloud-blog\/how-can-food-trucks-owners-determine-locations-that-bring-in-the-most\/","title":{"rendered":"How Can Food Trucks Owners Determine Locations That Bring in The Most $?"},"content":{"rendered":"<div style=\"margin-top: 0px; margin-bottom: 0px;\" class=\"sharethis-inline-share-buttons\" ><\/div><p><em><strong>By Peter Murray,\u00a0Content Markerting Manager, Carto<\/strong><\/em><\/p>\n<p><!--more--><\/p>\n<p>Advances made possible thanks to location data continues to transform business practices and processes when it comes to<a href=\"https:\/\/lavalabs.net\/spatial-analysis-site-planning\/\" target=\"_blank\" rel=\"noopener\">\u00a0site planning<\/a>. Now, with insights from\u00a0new data streams,\u00a0it is possible to determine which sites are most likely to<a href=\"https:\/\/lavaprotocols.com\/2017\/12\/07\/location-intelligent\/\" target=\"_blank\" rel=\"noopener\"> increase sales<\/a> for seasonal, temporary, and mobile businesses.<\/p>\n<p>Food trucks, a lunch-time staple for some, operate on a location-dependent business model. Generally speaking, food trucks offer similar meal options for roughly the same price, which makes it difficult for them to differentiate themselves from nearby competitors. As a result, food truck location can determine whether a business succeeds or fails.<\/p>\n<p>Recently, Carto helped a local food truck business determine prime spots for their trucks with revenue prediction models. The company provided one month\u2019s worth of anonymised transaction data for each of its 10 food carts. With this information, the team of data scientists from Carto were able to determine current performance, build increasingly confident revenue models, and, finally, predict the six best performing food truck locations.<\/p>\n<h2 id=\"measuring-current-performance\"><strong>Measuring Current Performance<\/strong><\/h2>\n<p>Before predicting what locations should be selected to drive future sales, the team had to figure out a way to measure the current performance of each site in Manhattan and Brooklyn.<\/p>\n<p>To get started,\u00a0<a href=\"https:\/\/carto.com\/team\/wenfei\/\" target=\"_blank\" rel=\"noopener\">Wenfei<\/a>\u00a0and\u00a0<a href=\"https:\/\/carto.com\/team\/dongjie-fan\/\" target=\"_blank\" rel=\"noopener\">Dongjie<\/a>, two of Carto\u2019s data scientists, first aggregated the data by truck by hour to find a measure for the\u00a0<strong>average spend per hour<\/strong>.<\/p>\n<div class=\"Wrap\">\n <picture>\n  <img decoding=\"async\" class=\"aligncenter\" src=\"https:\/\/static.hsstatic.net\/BlogImporterAssetsUI\/ex\/missing-image.png\" alt=\"Average spend per hour\" data-recalc-dims=\"1\">\n <\/picture>\n<\/div>\n<p>The graphs show that hourly revenue for each food truck usually peaks around lunch-time, although sometimes there are spikes in sales around breakfast-time as well. Next, Wenfei and\u00a0<a href=\"https:\/\/carto.com\/team\/mamata-akella\/\" target=\"_blank\" rel=\"noopener\">Mamata<\/a>, Carto\u2019s head of cartography (science of drawing maps), mapped food truck sales using proportional circles reflecting revenue amounts for each location across Manhattan and Brooklyn.<\/p>\n<div class=\"Wrap\">\n <picture>\n  <img decoding=\"async\" class=\"aligncenter\" src=\"https:\/\/static.hsstatic.net\/BlogImporterAssetsUI\/ex\/missing-image.png\" alt=\"Food Truck Sales\" data-recalc-dims=\"1\">\n <\/picture>\n<\/div>\n<p>As expected, food trucks located in high volume traffic areas\u2013Grand Central Station, SoHo, Times Square, etc.\u2013are the most lucrative locations for this company.<\/p>\n<p>Now the team then wanted to figure where the best locations are for increasing sales, which means they\u2019ll need to identify some variables near and around the current locations that can serve as predictors in the revenue model. Traditionally, these predictors are identified using data from the census and points of interest (POI) data.<\/p>\n<p>The demographic insights available from census data are helpful for\u00a0<a href=\"https:\/\/carto.com\/blog\/new-approach-customer-segmentation-location-data\/\" target=\"_blank\" rel=\"noopener\">segmenting<\/a>\u00a0target customers, but this use case illustrates one of the significant limitations of working with census data.<\/p>\n<div class=\"Wrap\">\n <picture>\n  <img decoding=\"async\" class=\"aligncenter\" src=\"https:\/\/static.hsstatic.net\/BlogImporterAssetsUI\/ex\/missing-image.png\" alt=\"Census Tracts\" data-recalc-dims=\"1\">\n <\/picture>\n<\/div>\n<p>The census provides residential data for area of operations, and in the image above this information is presented at the census tract level. However, many food truck customers are workers who commute into the city or tourists visiting New York landmarks, which is likely why the Grand Central Station and Times Square are among the most profitable locations.<\/p>\n<p>As such, residential data offers few insights relevant to increasing sales among this target customer base.<\/p>\n<p>POI data will be more useful here for finding patterns of nearby attractions around high-performing food trucks that can serve as a predictor for our models.<\/p>\n<div class=\"Wrap\">\n <picture>\n  <img decoding=\"async\" class=\"aligncenter\" src=\"https:\/\/static.hsstatic.net\/BlogImporterAssetsUI\/ex\/missing-image.png\" alt=\"Point of Interests\" data-recalc-dims=\"1\">\n <\/picture>\n<\/div>\n<p>The first map shows every POI in Manhattan and Brooklyn, but there\u2019s so much noise that it\u2019s hard to determine which attractions appear and reappear near and around each of our food trucks. Since many customers select food trucks based on proximity, 200 meter radius buffers were created around each cart, which is about a 2 \u00bd to 3 minute walking time, so predictor features could more easily be identified in the second map.<\/p>\n<h2 id=\"building-more-precise-models-with-new-data-streams\"><strong>Building More Precise Models with New Data Streams<\/strong><\/h2>\n<p>Now we\u2019re ready to start building a\u00a0<a href=\"https:\/\/medium.com\/mlreview\/gradient-boosting-from-scratch-1e317ae4587d\" target=\"_blank\" rel=\"noopener\">gradient boosted regression (GBR) model\u00a0<\/a>that will allow us to determine which features from this data are most important when considering where to place our food trucks. In short, the GBR model will help us rank feature importance that will provide us a list of predictors to look for when considering a potential food truck location.<\/p>\n<p>The first revenue model was created using only traditional data sources, specifically census and POI data:<\/p>\n<div class=\"Wrap\">\n <picture>\n  <img decoding=\"async\" class=\"aligncenter\" src=\"https:\/\/static.hsstatic.net\/BlogImporterAssetsUI\/ex\/missing-image.png\" alt=\"Model One\" data-recalc-dims=\"1\">\n <\/picture>\n<\/div>\n<p>The GBR model returned an R-squared score, a measure of the variability within the data set from 0-1 that can gauge confidence in the model. An R-squared score of .38 means that there is a range of variability in the data. This means that the current data does not provide a high enough\u00a0 confidence on what features are most important to consider when selecting a food truck location.\u00a0 More data is needed to increase the score.<\/p>\n<p>To improve the model, MasterCard spend data was added and the same equation was performed to see whether the R-squared score would increase.<\/p>\n<div class=\"Wrap\">\n <picture>\n  <img decoding=\"async\" class=\"aligncenter\" src=\"https:\/\/static.hsstatic.net\/BlogImporterAssetsUI\/ex\/missing-image.png\" alt=\"Spend Score\" data-recalc-dims=\"1\">\n <\/picture>\n<\/div>\n<p>MasterCard spend scores provide aggregated and anonymised merchant-level transaction insights on where, when, and how people spend money. More specifically, the transaction percentile score provides a frequency measure that is important. Because most food carts offer similar types of food for around the same price, the frequency measure provides insights on customer volume for each cart.<\/p>\n<div class=\"Wrap\">\n <picture>\n  <img decoding=\"async\" class=\"aligncenter\" src=\"https:\/\/static.hsstatic.net\/BlogImporterAssetsUI\/ex\/missing-image.png\" alt=\"Model Two\" data-recalc-dims=\"1\">\n <\/picture>\n<\/div>\n<p>Here we see a sizeable score increase and greater alignment among points in the scatter plot. However, the R-squared score could be stronger so a layer of foot traffic data was added to the model.<\/p>\n<div class=\"Wrap\">\n <picture>\n  <img decoding=\"async\" class=\"aligncenter\" src=\"https:\/\/static.hsstatic.net\/BlogImporterAssetsUI\/ex\/missing-image.png\" alt=\"Model Three\" data-recalc-dims=\"1\">\n <\/picture>\n<\/div>\n<p>Here the R-squared score has increased by 18 points since model one, which makes a lot of sense and confirms our earlier assumption with POI buffers that food trucks rely on foot traffic from nearby customers.<\/p>\n<p>It is significant to note that when additional derivative data layers were added to our model there was an improvement in our R-squared score.<\/p>\n<blockquote>\n<p>Without these new data streams, we would not be in a position to identify with much confidence where the best locations are for each food truck.<\/p>\n<\/blockquote>\n<div class=\"Wrap\">\n <picture>\n  <img decoding=\"async\" class=\"aligncenter\" src=\"https:\/\/static.hsstatic.net\/BlogImporterAssetsUI\/ex\/missing-image.png\" alt=\"Feature Importance\" data-recalc-dims=\"1\">\n <\/picture>\n<\/div>\n<p>The image above presents the 12 features that our model identified as having a statistically significant impact on food truck sales, and the top four features were selected to serve as predictors for identifying new locations: 1. Foot Traffic from previous hour, 2. Foot Traffic from current hour, 3. Day of the week, and 4. Mastercard frequency score.<\/p>\n<h2 id=\"revenue-predictions\"><strong>Revenue Predictions<\/strong><\/h2>\n<p>Now it is time to start mapping the selected predictors across New York City using 100\u00d7100 meter grid tiles (roughly the size of a city block). Next, using a histogram, we looked at the sales distribution across the city and calculated the weekly sales average per truck to be approximately $2,786 (approx. RM11,360).<\/p>\n<p>Since the goal is to find new locations that are likely to increase sales revenue, we selected the higher end of the revenue distribution and then clustered them into revenue areas. Because the model\u2019s R-squared score was .63 there\u2019s not quite enough confidence to pinpoint the exact location for each truck. Instead, these revenue areas were clustered to locate regions within a neighbourhood with a higher likelihood of being profitable.<\/p>\n<div class=\"Wrap\">\n <picture>\n  <img decoding=\"async\" class=\"aligncenter\" src=\"https:\/\/static.hsstatic.net\/BlogImporterAssetsUI\/ex\/missing-image.png\" alt=\"Model Three\" data-recalc-dims=\"1\">\n <\/picture>\n<\/div>\n<p>The image above shows the changes to the map that each of these operations yielded. In the end, six locations were identified with revenue predictions for each. Below, the six locations are ranked highest to lowest by weekly sales average for each locations.<\/p>\n<ol>\n<li>Corona Park: $6,128 (approx. RM 24, 900) weekly sales average<\/li>\n<li>Penn Station: $5,975 (approx. RM 24, 370) weekly sales average<\/li>\n<li>SoHo: $5,911 (approx. RM 24, 110) weekly sales average<\/li>\n<li>Grand Central Station: $5,766 (approx. RM 23, 520) weekly sales average<\/li>\n<li>West Village: $5,234 (approx. RM 21, 350) weekly sales average<\/li>\n<li>DUMBO: $5,193 (approx. RM 21, 140) weekly sales average<\/li>\n<\/ol>\n<p>While there are the usual suspects on this list (Penn Station, Grand Central, etc.), it is surprising that Corona Park turns out to be the best location for increasing food truck sales revenue. When nearby tourist attractions and the area\u2019s population density are taken into consideration, the results make sense.<\/p>\n<h2 id=\"a-new-era-of-site-planning\"><strong>A New Era of Site Planning<\/strong><\/h2>\n<p>New data streams are ushering in a new era of site planning, therefore making previously impossible solutions possible. Indeed, as this food truck example highlights, the future of site planning depends on accessing and working with various types of data, from traditional sources to new derivative data sets, to identify, understand, and quantify the impact that mobility patterns will have on your sales revenue.<\/p>\n<p><a href=\"https:\/\/carto.com\/blog\/optimizing-food-truck-locations\/\" target=\"_blank\" rel=\"noopener\">Article<\/a> first appeared on the Carto Blog.<\/p>\n<p><strong><i>Lava is an <\/i><a href=\"https:\/\/lavaprotocols.com\/location-intelligence\/\"><i>authorised Partner of Carto in Malaysia, <\/i><\/a><i>a location intelligence software that analyses and visualizes complex and large amounts of data. With more than a decade of experience in the industry, we\u2019re proud to say we\u2019re one of the leading cloud solutions providers and consultants in the Asia Pacific region.<\/i><\/strong><\/p>\n<p><span class=\"et_bloom_bottom_trigger\"><\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>With new data streams,\u00a0it&#8217;s possible to determine food truck locations that are most likely to increase sales for seasonal and mobile businesses.<\/p>\n","protected":false},"author":1,"featured_media":2520,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[18],"class_list":["post-2519","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-uncategorized","tag-blog"],"jetpack_featured_media_url":"https:\/\/www.lavaprotocols.com\/the-cloud-blog\/wp-content\/uploads\/2024\/10\/LabsAug_070818.jpg","_links":{"self":[{"href":"https:\/\/www.lavaprotocols.com\/the-cloud-blog\/wp-json\/wp\/v2\/posts\/2519","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.lavaprotocols.com\/the-cloud-blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.lavaprotocols.com\/the-cloud-blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.lavaprotocols.com\/the-cloud-blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.lavaprotocols.com\/the-cloud-blog\/wp-json\/wp\/v2\/comments?post=2519"}],"version-history":[{"count":0,"href":"https:\/\/www.lavaprotocols.com\/the-cloud-blog\/wp-json\/wp\/v2\/posts\/2519\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.lavaprotocols.com\/the-cloud-blog\/wp-json\/wp\/v2\/media\/2520"}],"wp:attachment":[{"href":"https:\/\/www.lavaprotocols.com\/the-cloud-blog\/wp-json\/wp\/v2\/media?parent=2519"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.lavaprotocols.com\/the-cloud-blog\/wp-json\/wp\/v2\/categories?post=2519"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.lavaprotocols.com\/the-cloud-blog\/wp-json\/wp\/v2\/tags?post=2519"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}