Smartcar Outage Runbook

Operational runbook for Smartcar or OEM telematics outages that affect lock/unlock, telemetry, and rental session completion.

Levy Fleets Team | February 11, 2026 | 11 min read

When Smartcar or an OEM connected-services backend degrades, rental sessions can fail at the most sensitive steps: unlock, lock, and telemetry-dependent billing. This runbook defines how to maintain safety and reduce customer impact during those incidents.

Outage Signals

Treat issues as a potential outage when you observe one or more of:

  • Sudden spike in lock/unlock failures across multiple vehicles or brands
  • Telemetry timestamps not advancing for a broad fleet segment
  • High timeout rates on command requests
  • Concurrent customer reports across different locations

If failures are isolated to one vehicle, follow normal troubleshooting first.
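The distinction between a fleet-wide outage and a single faulty vehicle can be sketched in code. The following is a minimal illustration, not part of any Smartcar or Levy API: the `CommandResult` type, the function names, and the thresholds (3 vehicles, 30% failure rate, 10 minutes of telemetry staleness) are all assumptions chosen for the example and should be tuned to your own baseline.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class CommandResult:
    """Hypothetical record of one lock/unlock command attempt."""
    vehicle_id: str
    brand: str
    ok: bool


def looks_like_outage(results, min_failed_vehicles=3, failure_rate_threshold=0.3):
    """Return True when failures span several vehicles, not just one.

    Both thresholds are illustrative assumptions.
    """
    if not results:
        return False
    failed = [r for r in results if not r.ok]
    failed_vehicles = {r.vehicle_id for r in failed}
    failure_rate = len(failed) / len(results)
    return (
        len(failed_vehicles) >= min_failed_vehicles
        and failure_rate >= failure_rate_threshold
    )


def stale_vehicles(last_seen, now, max_age=timedelta(minutes=10)):
    """Return sorted IDs of vehicles whose telemetry timestamp has not advanced.

    last_seen maps vehicle_id -> datetime of the latest telemetry reading.
    """
    return sorted(v for v, ts in last_seen.items() if now - ts > max_age)
```

Requiring several distinct failing vehicles keeps one misbehaving car from triggering an incident, which matches the guidance to follow normal troubleshooting for isolated failures.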


Severity Levels

Severity | Criteria | Default Policy
SEV-1 | Widespread inability to unlock/lock; active returns blocked | Freeze new session starts, prioritize safe session closure
SEV-2 | Partial brand/region degradation; intermittent failures | Restrict affected cohorts, monitor every 15 minutes
SEV-3 | Elevated error rate but acceptable fallback success | Continue operations with alerting and manual readiness
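One way to keep the severity criteria and their default policies consistent across tooling is to encode them as data. The sketch below assumes the three boolean inputs are determined by your own monitoring; `classify_severity` and its parameter names are hypothetical, and the function presumes an incident has already been declared.

```python
from enum import Enum


class Severity(Enum):
    SEV1 = "SEV-1"
    SEV2 = "SEV-2"
    SEV3 = "SEV-3"


# Default policies copied from the severity table in this runbook.
DEFAULT_POLICY = {
    Severity.SEV1: "Freeze new session starts, prioritize safe session closure",
    Severity.SEV2: "Restrict affected cohorts, monitor every 15 minutes",
    Severity.SEV3: "Continue operations with alerting and manual readiness",
}


def classify_severity(widespread_failure, returns_blocked, partial_degradation):
    """Map observed conditions to a severity level (illustrative logic only)."""
    if widespread_failure or returns_blocked:
        return Severity.SEV1
    if partial_degradation:
        return Severity.SEV2
    # Elevated error rate, but fallbacks are still succeeding.
    return Severity.SEV3
```

For example, `DEFAULT_POLICY[classify_severity(False, False, True)]` yields the SEV-2 policy text.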

First 15 Minutes

  1. Declare Incident: Open an internal incident record with timestamp, observed symptoms, and affected brands/regions.
  2. Scope Impact: Identify how many active sessions and pending check-ins depend on affected vehicles.
  3. Set Temporary Policy: Choose a mode: full hold on new starts, partial hold by brand/region, or monitored continue.
  4. Notify Support and Ops: Broadcast response instructions to support, operator teams, and on-call responders.
  5. Start Customer Messaging: Send short in-app and email notices to impacted customers.
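The information gathered in the first 15 minutes can be captured in a single structured record so that support and ops see the same facts. This is a minimal sketch: the `IncidentRecord` class, its field names, and the mode strings are assumptions for illustration, not an existing schema.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone

# Hypothetical labels for the three temporary policies named above.
HOLD_MODES = ("full_hold", "partial_hold", "monitored_continue")


@dataclass
class IncidentRecord:
    symptoms: list
    affected_brands: list
    affected_regions: list
    hold_mode: str = "monitored_continue"
    active_sessions_at_risk: int = 0
    pending_checkins_at_risk: int = 0
    opened_at: datetime = field(default_factory=lambda: datetime.now(timezone.utc))

    def __post_init__(self):
        # Reject typos early so the broadcast never carries an unknown mode.
        if self.hold_mode not in HOLD_MODES:
            raise ValueError(f"unknown hold mode: {self.hold_mode}")

    def summary(self):
        """One-line summary suitable for a support/ops broadcast."""
        return (
            f"[{self.opened_at:%Y-%m-%d %H:%M} UTC] mode={self.hold_mode} "
            f"brands={','.join(self.affected_brands)} "
            f"regions={','.join(self.affected_regions)} "
            f"sessions={self.active_sessions_at_risk} "
            f"checkins={self.pending_checkins_at_risk}"
        )
```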


Fallback Operation Modes

Mode A: Safe-Return Priority (SEV-1)

  • Pause new rental starts on affected vehicles.
  • Allow active rentals to proceed to return with operator assistance.
  • Prioritize lock confirmation during check-out.
  • If lock verification is unavailable, require manual safety verification before finalizing.

Mode B: Limited Continue (SEV-2)

  • Block only impacted brands/regions.
  • Allow unaffected fleet segments to operate normally.
  • Add proactive warnings on affected check-in flows.

Mode C: Monitor and Retry (SEV-3)

  • Keep operations running.
  • Increase retry windows and operator alerts.
  • Prepare to escalate if failure rates worsen.
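The three modes above differ mainly in whether a new rental may start on a given vehicle. A minimal gating sketch follows, assuming affected cohorts are tracked as `(brand, region)` pairs; the function name and that representation are hypothetical.

```python
def allows_new_start(mode, brand, region, affected_cohorts):
    """Decide whether a new rental may start on a vehicle.

    mode: "A" (safe-return priority), "B" (limited continue),
          or "C" (monitor and retry).
    affected_cohorts: set of (brand, region) tuples under degradation.
    """
    if mode not in ("A", "B", "C"):
        raise ValueError(f"unknown mode: {mode}")
    if mode == "C":
        # Mode C keeps operations running; retries and alerts absorb failures.
        return True
    # Modes A and B both pause starts on affected vehicles and leave
    # unaffected fleet segments running.
    return (brand, region) not in affected_cohorts
```

In this sketch Modes A and B gate starts identically; what separates them in practice is the scope of the affected set and the extra return handling that Mode A requires.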

Command Failure Handling

Step | Failure | Fallback
Check-in unlock | Command times out/fails | Retry, operator-assisted unlock, or reassign vehicle
In-trip lock/unlock | Intermittent failure | Retry with user guidance and operator escalation path
Check-out lock | Cannot confirm lock state | Manual verification protocol before force-close
Telemetry read | Stale odometer/fuel | Use manual photo evidence and post-incident reconciliation
Telemetry readStale odometer/fuelUse manual photo evidence and post-incident reconciliation

When a customer hits repeated failures, route them to support immediately rather than retrying blindly.
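A bounded retry with an explicit escalation result is one way to avoid blind retries. In the sketch below, `send_command` is a hypothetical zero-argument callable that returns True on success (it stands in for whatever issues the lock/unlock request), and the attempt count and backoff delays are assumed values.

```python
import time


def send_with_bounded_retry(send_command, max_attempts=3, base_delay=1.0,
                            sleep=time.sleep):
    """Try a command a few times with exponential backoff, then escalate.

    send_command: hypothetical callable returning True on success; it may
    raise TimeoutError when the upstream service does not respond.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            if send_command():
                return {"status": "ok", "attempts": attempt}
        except TimeoutError:
            pass  # Treat a timeout like any other failed attempt.
        if attempt < max_attempts:
            # Back off: base_delay, then 2x, 4x, ...
            sleep(base_delay * 2 ** (attempt - 1))
    # Stop retrying and hand the session to support.
    return {"status": "escalate_to_support", "attempts": max_attempts}
```

Passing `sleep` as a parameter lets tests run without real delays. The caller is expected to route any `escalate_to_support` result into the manual-assistance queue.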


Customer Communication Templates

Active Incident Banner

"Some connected-vehicle commands are currently delayed. We are actively working on a fix and support is available if your rental is affected."

Check-In Impacted

"Your vehicle connection is temporarily unavailable. We are retrying now and can help reassign your booking if needed."

Check-Out Impacted

"Return confirmation is delayed due to a temporary connectivity issue. Keep the vehicle secured and follow in-app instructions while we finalize your session."


Operator Checklist During Outage

  • Keep a queue of sessions needing manual assistance.
  • Record all manual interventions with reason and timestamps.
  • Capture supporting photos for any force-end or delayed-finalization flow.
  • Do not charge disputed overages until telemetry or evidence is reconciled.
  • Confirm vehicle physical security for every manually closed check-out.
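The checklist above implies a minimum set of facts to record before a session is closed manually. A possible shape for that record is sketched here; the `ManualIntervention` class and the `can_force_close` rule are illustrative assumptions rather than an existing Levy data model.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone


@dataclass
class ManualIntervention:
    session_id: str
    reason: str
    operator: str
    physically_secured: bool
    photo_refs: list = field(default_factory=list)
    recorded_at: datetime = field(default_factory=lambda: datetime.now(timezone.utc))


def can_force_close(record):
    """Allow a manual close only when the checklist evidence is present:
    a stated reason, at least one photo, and confirmed physical security."""
    return (
        record.physically_secured
        and bool(record.photo_refs)
        and bool(record.reason.strip())
    )
```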

Recovery Phase

When upstream systems recover:

  1. Confirm command success and telemetry freshness return to baseline.
  2. Re-enable paused session-start policies in stages.
  3. Reconcile sessions that were force-closed or delayed.
  4. Review potential billing corrections (late fees, surcharges, extensions caused by outage).
  5. Send closure communication to impacted customers.
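Steps 1 and 2 of the recovery list can be expressed as a small staged ramp. The sketch below is one possible approach: the stage fractions (10%, 50%, 100%) and the 98% baseline success rate are assumed values, and holding at the current stage when the rate is below baseline is a design choice of this example, not something the runbook prescribes.

```python
def next_enabled_fraction(current, success_rate, baseline=0.98,
                          stages=(0.1, 0.5, 1.0)):
    """Return the fraction of paused vehicles to re-enable next.

    Advances one stage only while command success stays at or above
    baseline; otherwise holds at the current stage.
    """
    if success_rate < baseline:
        return current
    for stage in stages:
        if stage > current:
            return stage
    return stages[-1]
```

Calling this once per monitoring interval moves the fleet from 0 to 10%, then 50%, then 100%, pausing whenever success dips below baseline.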

Post-Incident Review

Capture these points within 24 hours:

  • Start/end timestamps and total impact window
  • Affected vehicles, sessions, and customers
  • Number of manual interventions and unresolved cases
  • Revenue or support impact
  • What changed in monitoring, policy, or tooling after the incident

Use this review to tighten future response time and reduce repeat disruption.

Billing Integrity

During outages, avoid automatic penalties that rely on uncertain timestamps or stale telemetry. Reconcile first, then bill.
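The "reconcile first, then bill" rule can be enforced with a simple guard in the billing path. This is a sketch under stated assumptions: `penalty_action` is a hypothetical function, all datetimes are assumed to be in the same timezone convention, and `incident_end=None` is taken to mean the incident is still open.

```python
from datetime import datetime


def penalty_action(charge_time, incident_start, incident_end, telemetry_reconciled):
    """Return "hold_for_review" for unreconciled penalties that fall inside
    the incident window, otherwise "bill"."""
    after_incident = incident_end is not None and charge_time > incident_end
    in_window = charge_time >= incident_start and not after_incident
    if in_window and not telemetry_reconciled:
        return "hold_for_review"
    return "bill"
```

Held charges would then be revisited during the recovery-phase billing review, once telemetry or manual evidence is available.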

Need Help?

For incident response assistance, contact support@levyelectric.com and include your incident start time, affected subaccount, and representative session IDs.