A simple, minimalistic plugin for working with Databricks & PySpark locally
[Image: nvim-databricks example cluster view]
[Image: nvim-databricks example run output]
> [!IMPORTANT]
> This plugin is no longer maintained in favor of my new plugin, lazydbrix, which solves the same problem, but better.
Plugin to view, start/stop, and pick a Databricks cluster when working with Spark locally through databricks-connect. Offers a custom run-output window and support for picking a cluster when debugging with nvim-dap.
- Neovim 0.9.2
- `jq` CLI; install it for your specific OS
- Databricks Connect (see the docs)
- Databricks CLI == `v0.212.2`; follow the docs for installation, and set up your `~/.databrickscfg` file like:
```ini
[DEFAULT]
host = <your_host>
token = <your_token>
cluster_id = <your_cluster_id>
org_id = <your_org_id>
jobs-api-version = 2.1

[other_profile]
...
```
NOTE: All profiles in this file need to have valid setups, otherwise the plugin won't work; a quick way to sanity-check them is sketched after this list.
- nvim-dap (optional), for setting the picked Databricks cluster when running the debugger
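A minimal sketch of such a sanity check from inside Neovim, using the same CLI call the plugin makes; the profile names are placeholders taken from the sample config above:

```lua
-- Sketch: verify that each profile in ~/.databrickscfg answers a simple
-- Databricks API call before relying on the plugin.
-- 'DEFAULT' and 'other_profile' are placeholder profile names.
for _, profile in ipairs({ 'DEFAULT', 'other_profile' }) do
  local out = vim.fn.system({ 'databricks', '--profile', profile, 'clusters', 'list' })
  if vim.v.shell_error ~= 0 then
    print('Profile ' .. profile .. ' is misconfigured: ' .. out)
  end
end
```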
Use your favorite package manager, e.g., lazy:
```lua
{
  'phdah/nvim-databricks',
  dependencies = {
    'mfussenegger/nvim-dap', -- Optional dependency
  },
}
```
and in your `init.lua`, put:

```lua
require('nvim-databricks').setup()
```
For adding config, add a setup input like:

```lua
require('nvim-databricks').setup({
  DBConfigFile = '~/.databrickscfg', -- Path to the Databricks Connect config file
  python = 'python3',                -- Python interpreter used by DBRun
  dap = true,                        -- Enable setting nvim-dap Python environment variables for cluster selection
})
```
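With `dap` enabled, a reasonable sketch of the debugger wiring looks like the following; this assumes the separate nvim-dap-python plugin, which is not part of nvim-databricks, and that the plugin exposes the picked cluster through the `DATABRICKS_CONFIG_PROFILE` and `DATABRICKS_CLUSTER_ID` environment variables mentioned under `DBRun` below:

```lua
-- Assumption: nvim-dap-python is installed and the interpreter has debugpy.
-- With dap enabled, nvim-databricks selects the cluster for the debugged
-- process via environment variables, so no cluster id is hardcoded here.
require('dap-python').setup('python3')
```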
For more advanced configurations, refer to Advanced Configurations.
NOTE: The values above are the defaults, so if this is what you want, just leave the setup empty. By default, `python3` is used to be compatible with Mac users, and should pick up any `venv` dynamically.
Here are the commands available to use:
| Command | Description |
|---|---|
| `DBOpen` | Opens a list of clusters for all of the profiles set up in the configuration file. Choose a cluster, and it will update the plugin state, using that cluster for all following runs of a Python script with `DBRun` throughout the current Neovim session. |
| `DBRun` | Runs your currently open Python Neovim buffer, using the profile and cluster selected with `DBOpen`. If no selection has been made, this uses the `DEFAULT` profile from your config, or the environment variables `DATABRICKS_CONFIG_PROFILE` and `DATABRICKS_CLUSTER_ID`. |
| `DBPrintState` | Prints the currently selected `profile`, `cluster id`, and `cluster name`. |
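You can bind these commands to keymaps of your choice, e.g.: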
```lua
vim.api.nvim_set_keymap('n', '<new_keymap>', ':DBOpen<CR>', { noremap = true })
vim.api.nvim_set_keymap('n', '<new_keymap>', ':DBRun<CR>', { noremap = true })
vim.api.nvim_set_keymap('n', '<new_keymap>', ':DBPrintState<CR>', { noremap = true })
```
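On Neovim 0.7+, `vim.keymap.set('n', '<new_keymap>', ':DBOpen<CR>')` is an equivalent, more modern alternative.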
To customize the Python interpreter used by this plugin, you can specify it directly in the setup configuration as a string, or use the `require` function to fetch it from another plugin. The `python` configuration runs within the same process as the Neovim subprocess shell. For example, if you're also using the venv-selector.nvim plugin to manage your virtual environments, you can reference its active path like:
```lua
require('nvim-databricks').setup({
  python = require('venv-selector').get_active_path(),
})
```
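Note that `get_active_path()` is evaluated once, when `setup()` is called, so the interpreter is captured at startup rather than tracked dynamically as you switch environments.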
- Currently only supports PySpark.
| Feature | Status |
|---|---|
| Tabs for workspaces | ✅ |
| Setup workspace-specific output | ✅ |
| Build a run module, to utilize Lua in-memory variables for cluster choosing | ✅ |
| Optimize window opening by persisting buffers | ✅ |
| Optimize window using asynchronous Databricks API calls | ✅ |
| Add Nerd Font icons for cluster status etc. | ✅ |
| Add a complete UI, which displays graphical objects | 🟨 |
| Add graphical objects to indicate, e.g., selected cluster and its state | 🟨 |
| Add a file type check for how to run with `DBRun`, e.g., if `.py`, use `python` | 🟨 |
| Add a `y` key for yanking all relevant output from the `DBRun` output window | 🟨 |
| Add an `h` key for `help`, showing all the keymaps; the list of them is too long now | 🟨 |
NOTE: If the plugin doesn't work, it's important to first check that any potential VPNs or Databricks tokens are set up correctly. This would most likely show with a message like `Error executing command: databricks --profile prod clusters list ...` when running `DBOpen`. This error is due to a failing Databricks API call.
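Running the same command, e.g., `databricks --profile <profile> clusters list`, directly in a terminal is a quick way to confirm whether the problem lies with the plugin or with your Databricks setup.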
| Bugs | Status | Solution |
|---|---|---|
| JSON output of `databricks clusters list` differs between versions, so parsing breaks. | ✅ | Standardized on Databricks CLI `v0.212.2`, which outputs a list of JSON objects |