<html>

  <head>

    <meta http-equiv="Content-Type" content="text/html; charset=UTF-8">

  </head>

  <body>

    <div class="moz-cite-prefix">Regarding where to have this discussion

      - I'm just gonna continue the email chain cause I haven't followed

      where to put the discussion on the wiki, but someone feel free to

      steer it over there if we want to!</div>

    <div class="moz-cite-prefix"><br>

      A brief update regarding establishing a larger list of employers

      to include in the dataset: basic contact information for all

      seattle businesses, sorted by the North American Industry

      Classification System, is available at <a

href="https://web6.seattle.gov/fas/slimbizsearch/ResultsPage.aspx?NAICList=Top100">https://web6.seattle.gov/fas/slimbizsearch/ResultsPage.aspx?NAICList=Top100</a>,

      but it's a huge list of course, with no info on number of

      employees or revenue to filter out the smaller ones. Still, I did

      send off an email about getting a copy of the database just for

      purposes of cross-referencing names and such.<br>

      On 8/10/19 6:42 PM, Katie Wilson wrote:<br>

    </div>

    <blockquote type="cite"

      cite="mid:09AFFEF2-2BC9-4B67-9BDE-D526954EF246@transitriders.org">

      <meta http-equiv="Content-Type" content="text/html; charset=UTF-8">

      For “neighborhood” I think it makes sense to use the <a

href="https://www.seattle.gov/transportation/projects-and-programs/programs/transportation-options-program/commute-trip-reduction-program/draft-2019-2023-networks-and-targets"

        class="" moz-do-not-send="true">“CTR Network Areas” as defined

        here</a>.<br class="">

      <div class=""><br class="">

      </div>

      <div class="">For “industry” I think it makes sense to use the <a

href="https://www.seattle.gov/Documents/Departments/SDOT/TransportationOptionsProgram/CTR_Draft_Strategic_Plan_Jan2019.pdf"

          class="" moz-do-not-send="true">“Employment Sector” categories

          listed on Page 12 of this CTR strategic plan.</a></div>

      <div class=""><br class="">

      </div>

      <div class="">

        <div class="">On the ratings, I think it does make sense to

          lump "piggy bank" and "brown tortoise" in the same rating (0),

          and then add a tortoise badge for employers that aren’t even

          doing the pre-tax thing.</div>

        <div class=""><br class="">

        </div>

        <div class="">Another simplification option to consider would be

          to lump together 3 and 4 leaves. But let’s leave them separate

          for now and depending on how things shake out we can easily

          combine them later.</div>

      </div>

      <div class=""><br class="">

      </div>

      <div class="">We don’t have any major sources of data on what

        benefits employers provide other than:</div>

      <div class="">— <a

href="https://seattletransitpasses-research.pbworks.com/w/page/133438080/First%20Public%20Records%20Request"

          class="" moz-do-not-send="true">Metro public disclosure

          request spreadsheet</a></div>

      <div class="">— Our commute survey</div>

      <div class="">— Info gleaned online from company websites, asking

        around, glassdoor etc (what I’ve found I’ve added to the <a

href="https://seattletransitpasses-research.pbworks.com/w/page/132177123/Employers"

          class="" moz-do-not-send="true">relevant tables in the wiki</a>,

        on CTR employers and “potential poster children” and “likely

        target assessment” and “hotels”)</div>

      <div class=""><br class="">

      </div>

      <div class="">Maybe it makes sense to have another string

        indicating sufficient certainty — when we have two sources, or

        one very reliable source, we enter an X or whatever, and that

        gives us the green light to display that data. Also it may not

        make sense to put a lot of work into categorizing employers into

        Network Area and Employment Sector until we have reliable data

        on what benefits they’re offering.</div>

      <div class=""><br class="">

      </div>

      <div class="">Speaking of Seattle Coffee Works, I spoke with their

        HR person a few months ago and actually employees have to pay

        $20/month (pre-tax $) if they want an ORCA card. Still a great

        deal but not 100% subsidy as reported in the Metro data— which,

        I then learned, is self-reported by the company. Metro only

        knows that all those companies are signed up for the Passport

        program. I noted the real situation <a

href="https://seattletransitpasses-research.pbworks.com/w/page/133439169/Potential%20Poster%20Children"

          class="" moz-do-not-send="true">on this page</a>. Anyway, the

        point is we should probably crosscheck the Metro data as much as

        we can with our survey or other sources of information.

        <div class=""><br class="">

        </div>

        <div class="">(Also speaking of Seattle Coffee Works they have

          locations in <a

            href="https://www.seattlecoffeeworks.com/our-cafes.aspx"

            class="" moz-do-not-send="true">Capitol Hill & Cascade

            too</a>. From talking with the HR person I’m pretty sure all

          are include in their passport program, and the employees swap

          around a lot from location to location. They probably use the

          Ballard location as home base for transit pass purposes since

          that’s the least expensive zone.)</div>

      </div>

      <div class=""><br class="">

      </div>

      <div class="">One project would be to come up with a list of

        employers that have name recognition (or that we are interested

        in for some other reason) and put a little work into attaining

        sufficient certainty. If we posted the list to a page and put a

        call out on social media and email I bet we’d get some answers.</div>

      <div class=""><br class="">

      </div>

      <div class="">

        <div class="">

          <div>

            <blockquote type="cite" class="">

              <div class="">On Aug 8, 2019, at 5:26 PM, Stephen DeSanto

                <<a href="mailto:rachidian@gmail.com" class=""

                  moz-do-not-send="true">rachidian@gmail.com</a>>

                wrote:</div>

              <br class="Apple-interchange-newline">

              <div class="">

                <div dir="ltr" class="">

                  <div class="">Hi everyone,</div>

                  <div class=""><br class="">

                  </div>

                  <div class="">I've taken a first pass at the data

                    schema for showing employer transit benefits in our

                    upcoming web app. In this draft, each employer

                    record is represented as follows:</div>

                  <div class=""><br class="">

                  </div>

                  <div class="">{<br class="">

                        "employer": string,<br class="">

                        "industry": [string],<br class="">

                        "neighborhood": [string],<br class="">

                        "alias": [string],<br class="">

                        "rating": int,</div>

                  <div class="">    "description": string<br class="">

                  </div>

                  <div class="">    "badges": [string]<br class="">

                    }</div>

                  <div class=""><br class="">

                  </div>

                  <div class=""><b class="">Employer</b> is a plain text

                    string.</div>

                  <div class=""><b class="">Industry</b> is a list of

                    strings (or a single string, if we want to limit one

                    employer = one industry).</div>

                  <div class=""><b class="">Neighborhood</b> is treated

                    similarly to industry<br class="">

                  </div>

                  <div class=""><b class="">Alias</b> is a list of other

                    names for the same company. For example, <br

                      class="">

                  </div>

                  <div class=""><b class="">Rating</b> is a numerical

                    scale that represents the "worker's monthly cost of

                    an unlimited transit pass". The scale provided

                    during the meeting went from "4 leaves" to "brown

                    tortoise"; aligning to the leaves, that gives us a

                    scale of [-1, 0, 1, 2, 3, 4]. We could adjust this

                    up to 0-5, or lump "piggy bank" and "brown tortoise"

                    in the same rating.</div>

                  <div class=""><b class="">Description</b> is a string

                    that describes the employer's transit benefits, i.e.

                    why they got the rating they did.<br class="">

                  </div>

                  <div class=""><b class="">Badges</b> is a list of

                    strings that represent any additional categories we

                    want to assign to a company (e.g. "industry leader",

                    "polluter").</div>

                  <div class=""><br class="">

                  </div>

                  <div class="">We can make changes to this schema if it

                    makes it easier to work with our underlying data

                    visualization platform (e.g. Tableau? DataTables?),

                    but hopefully this is a suitable starting place.</div>

                  <div class=""><br class="">

                  </div>

                  <div class="">As an example, take a hypothetical

                    record for Seattle Coffee Works.</div>

                  <div class=""><br class="">

                  </div>

                  <div class="">{</div>

                  <div class="">    "employer": "Seattle Coffee Works",<br

                      class="">

                        "industry": ["restaurant"],<br class="">

                        "neighborhood": ["cbd", "ballard"],<br class="">

                        "alias": ["Ballard Coffee Works"],<br class="">

                        "rating": 4,</div>

                  <div class="">    "description": "Provides 100% ORCA

                    Passport subsidy."<br class="">

                  </div>

                  <div class="">    "badges": ["leader"]</div>

                  <div class="">}<br class="">

                  </div>

                  <div class=""><br class="">

                  </div>

                  <div class=""><b class="">Where Our Data Lives (For

                      Now)</b><br class="">

                  </div>

                  <div class=""><br class="">

                  </div>

                  <div class="">I've also taken a rough chop at getting

                    started with the data. Here, I've just taken the raw

                    list of ORCA Business Passport employers and

                    assigned a score based on their subsidy percentage,

                    as an example:</div>

                  <div class=""><br class="">

                  </div>

                  <div class=""><a

href="https://docs.google.com/spreadsheets/d/1HmOcG7hJLD1G0unCMPcsDnXr4RIA_PMKEE5ne-hhQR8/edit?usp=sharing"

                      class="" moz-do-not-send="true">https://docs.google.com/spreadsheets/d/1HmOcG7hJLD1G0unCMPcsDnXr4RIA_PMKEE5ne-hhQR8/edit?usp=sharing</a></div>

                  <div class=""><br class="">

                  </div>

                  <div class="">The spreadsheet contains columns for

                    each item of the employer record, as well as some

                    additional columns to record the raw data we have on

                    file for that employer, so we can use that data to

                    automatically or manually determine an employer's

                    rating.<br class="">

                  </div>

                  <div class=""><br class="">

                  </div>

                  <div class="">If we have data from other sources not

                    listed (e.g. survey data, City of Seattle data), the

                    "source_" columns can be renamed or added to

                    represent that source's data. For example, if I want

                    to add data from the TRU survey, I might rename

                    "__source_b" to "__TRU Survey", then include results

                    from that survey in that column for each company.

                    (The columns beginning with two underscores are ones

                    I don't expect to be publicly available.)</div>

                  <div class=""><br class="">

                  </div>

                  <div class="">PBworks feels really inadequate for

                    editing large data sets, and I don't know where else

                    to put it, so it's living in Google Sheets for now.

                    Set to read-only with the link, for now, but please

                    request editing permissions so you can add stuff to

                    the sheet.<br class="">

                  </div>

                  <div class=""><br class="">

                  </div>

                  <div class="">Currently, my expectation is that the

                    spreadsheet will be hand-edited in Google Sheets,

                    and then when we're ready to put live data in the

                    web app, we can export the sheet to a flat file,

                    which we can then import into a format appropriate

                    for the website (big ol' JSON file, database,

                    whatever). Manual process, but probably fine for a

                    project of this scale; I'm open to alternatives. </div>

                  <div class=""><br class="">

                  </div>

                  <div class=""><b class="">Things To Do Next</b><br

                      class="">

                  </div>

                  <div class=""><br class="">

                  </div>

                  <div class="">Aside from the ORCA Passport data and

                    the data we collected through TRU survey / legwork

                    (on PBworks), do we have any other data sources that

                    would provide context for a score?</div>

                  <div class=""><br class="">

                  </div>

                  <div class="">For the data sources we have, we'll have

                    to start filling out the rest of the spreadsheet, I

                    guess?<br class="">

                  </div>

                  <div class=""><br class="">

                  </div>

                  <div class="">Also, we will need to determine:</div>

                  <div class="">a) master list of "industries" we want

                    to support, and</div>

                  <div class="">b) "industry" field(s) for each employer</div>

                  <div class="">c) "neighborhood" field(s) for each

                    employer we don't have one for (or being more

                    precise than what I have now)<br class="">

                  </div>

                  <div class="">d) which companies get tagged with which

                    badges</div>

                  <div class=""><br class="">

                  </div>

                  <div class="">Hope that helps.<br class="">

                  </div>

                  <div class=""><br class="">

                  </div>

                  <div class="">In solidarity,</div>

                  <div class=""><br class="">

                  </div>

                  <div class="">Stephen<br class="">

                  </div>

                  <div class=""><br class="">

                  </div>

                </div>

                <br class="">

              </div>

            </blockquote>

          </div>

          <br class="">

        </div>

      </div>

      <br>

      <fieldset class="mimeAttachmentHeader"></fieldset>

      <pre class="moz-quote-pre" wrap="">

</pre>

    </blockquote>

  </body>

</html>