Qfrom_slim

Qfrom is a unified and simple to use tool for data manipulation and data analysis. This Project is based on Python 3.10.0

I come from C# and I'm used to and in love with the Linq lib and pandas dataframes annoyed me and like this the Qfrom package was born. This is my take and adventure into datamanagement, numpy and python, from which, not least, a better understanding of pandas has emerged. warning this project vaiolates also some of the holy python conventions. there is also a bit of coding black magic involved. in some caeses it even feels Qfrom can read your mind (#args-not-specified, #function-returning-multiple-columns). but just in case you asked your self i can asssure you there is no AI involved. now i wish you the best of fun with a first example of Qfrom in use.

For a first impression here is a example of an imaginary company. This company consist of four peoble.

data = {
    'name': ['Emma', 'Bob', 'Steve', 'Ann'],
    'job': ['manager', 'employee', 'employee', 'freelancer'],
    'salary': [90_000, 61_000, 48_000, 56_000]
}

To turn this data info a Qfrom object, just pass the data into the constructor.

q = Qfrom(data)
q

> Qfrom
> name	job	salary
> Emma	manager	90000
> Bob	employee	61000
> Steve	employee	48000
> Ann	freelancer	56000

One day Emma the manager asks Bob the data scientist for a list of all employees. Bob just enters the following into his IDE

q.where(func=lambda job: job=='employee')

> Qfrom
> name	job	salary
> Bob	employee	61000
> Steve	employee	48000

Because Bob finished is task so quickly, Emma wants to increase everyones salary by 1,000 $

q = q.map('salary', lambda x: x+1_000)
q

> Qfrom
> name	job	salary
> Emma	manager	91000
> Bob	employee	62000
> Steve	employee	49000
> Ann	freelancer	57000

Now Emma wants to see the average salary per job title. A tricky one, bot not for Bob...

(q.groupby('job')
    .map('group', func.vec(lambda x: x['salary']))
    .map('group', func.vec(lambda x: x.agg(agg.mean)))
    .rename({'key': 'job', 'group': 'mean salary'}))

> Qfrom
> job	mean salary
> manager	91000.0
> employee	55500.0
> freelancer	57000.0

For a full desciption of all Qfrom functionality just browse throght the following documentation.

Qfrom_slim
Contents
class Qfrom
class col
- 0 -> 1 functions
  - id
- 1 -> 1 functions
  - normalize
  - abs
  - shift
  - not
- n -> 1 functions
  - any
  - all
  - min
  - min_colname
  - max
  - max_colname
  - sum
  - mean
  - median
  - var
  - eq
  - agg
  - state
  - lod_and
  - lod_or
  - lod_xor
- 1 -> n functions
  - copy
  - flatten
- n -> m functions
  - ml_models
class func
- vec
- multicol
class agg
- any
- all
- min
- min_id
- max
- max_id
- sum
- mean
- median
- var
- len
- size
- state
class plot
- plot
- bar
- hist
- box
- scatter
class out
- list
- set
- dict
- array
- mtx
- df
- csv
- csv file
- json
- json file
class trans
- shuffle
Performance Tests
- setup
- runtime tests
  - append
  - getitem
  - iter
  - select
  - map
  - orderby
  - where
  - groupby
  - agg

class Qfrom

import Qfrom like this

from QfromPackage.Qfrom_slim import Qfrom

import list

l = [1, 2, 3]
Qfrom(l)

> Qfrom
> y
> 1
> 2
> 3

l = [
    (1, 4),
    (2, 5),
    (3, 6)
    ]
Qfrom(l)

> Qfrom
> y0	y1
> 1	4
> 2	5
> 3	6

l = [
    {'a': 1, 'b': 4},
    {'a': 2, 'b': 5},
    {'a': 3, 'b': 6}
    ]
Qfrom(l)

> Qfrom
> a	b
> 1	4
> 2	5
> 3	6


np	0.16 s	1.0%
df	0.163 s	1.015%
qs	0.438 s	2.736%
l	0.044 s	0.275%
np_mtx	0.06 s	0.375%


np	0.244 s	1.0%
df	6.877 s	28.15%
qs	0.234 s	0.958%
l	0.431 s	1.764%
df_lcph	0.357 s	1.461%
df_np	0.212 s	0.867%
np_iter	1.419 s	5.809%


np	0.174 s	1.0%
df	8.22 s	47.184%
qs	0.492 s	2.823%
l	0.003 s	0.016%


np	0.144 s	1.0%
df	5.486 s	38.089%
qs	0.161 s	1.119%
l	0.119 s	0.826%


np	0.0 s	0%
df	0.0 s	0%
qs	0.0 s	0%
l	0.0 s	0%


np	0.0 s	0%
df	0.106 s	0%
qs	0.0 s	0%
l	0.0 s	0%

Files

README.md

Latest commit

History

README.md

File metadata and controls

Qfrom_slim

Contents

class Qfrom

import list

import dict

import set

import array

import matrix

import DataFrame

import csv

import json

import generator

eq

str

repr

append

setitem

set row

set column

set cell

getitem

get row

get column

get cell

contains

iter

len

keys

values

items

remove

rename

select

string

dynamic column selection

tuple

map

out not specified

args not specified

args dynamic column selection

func not specified respectively copying column

vectorize function

function returning multiple columns

function returning a scalar

function returning a generator

orderby

where

groupy

flatten

unique

value counts

agg

join

join cross

join outer

join outer left

join outer right

join id

join id outer

join id outer left

join id outer right

concat

concat outer

concat outer left

concat outer right

calculate

call

class col

0 -> 1 functions

id

1 -> 1 functions

normalize

abs

shift