1.1 Introduction to HTML: Online HTML pad; Dev API Docs; Web Archive Machine; Emmet Docs; Atom shortcut cheatsheet: Ctrl + `. Heading: levels 1 to 6 only; search Google for "heading html mdn" to see more. HTML template: type html and hit Enter; html:4t ...
CMU Course Recommendations: CMU course descriptions and prerequisite lookup. 17437: Web Application Development (Django); 17214: Principles of Software Construction. Keynotes: 15110: Principles of Computing; 15112: Fundamentals of Programming and C...
Cloud Computing: Core Concepts. Scalability: A distributed program is considered scalable if it remains effective when the numbers of users, data, and resources increase significantly. From sequential programming to distributed programming. Speedup formula: Ts: sequential pro...
Distributed Programming: Synchronous vs Asynchronous. Synchronous: A distributed program is synchronous if and only if the component tasks operate in lockstep. Such mechanisms usually have an important effect on performance. Asynchronous: Asynchronous p...
Distributed Programming: Two Traditional Communication Mechanisms. Shared memory vs message passing. Contents: Shared Memory; Key Abstraction; Feature; Synchronisation Mechanism (1. Semaphore, 2. Locks, 3. Barrier); Example; The Message-Passing Programming Model; Key Abstraction; Feature; Examp...
Database Manipulation in Java (NetBeans & Derby). Contents: Set up Derby Database in NetBeans (1. Create a new Database, 2. Get the Database URL, 3. Add library to Java, 4. Code Skeleton); Load the Database Driver; Connect to Database; Create Table; Insert Records using Sta...
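Leetcode: Valid Parentheses. Problem description. Approach: handle the special cases first; use a stack to store the symbols that are still unmatched; advance the pointer, and if the current symbol matches the top of the stack, pop the stack; advance the pointer, and if it does not match the top of the stack but is one of the given symbols, push it; the string is valid when the traversal has finished and the stack is empty. class Solution { public boolean isValid(String s) { // corner case 1 if (s...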
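The stack-based approach described in the Valid Parentheses entry can be sketched roughly as follows. This is a minimal Python sketch, not the post's own Java solution (which is truncated above). The names `PAIRS` and `is_valid` are illustrative, and rejecting characters outside the six bracket symbols is an assumption the excerpt does not spell out.

```python
# Hypothetical helper names; a sketch of the push/pop rule from the excerpt.
PAIRS = {")": "(", "]": "[", "}": "{"}  # closing bracket -> expected opener


def is_valid(s: str) -> bool:
    stack = []  # holds symbols that have not been matched yet
    for ch in s:
        if stack and PAIRS.get(ch) == stack[-1]:
            # Current symbol closes the bracket on top of the stack: pop it.
            stack.pop()
        elif ch in "()[]{}":
            # No match with the top, but it is one of the given symbols: push it.
            stack.append(ch)
        else:
            # Assumption: any other character makes the string invalid.
            return False
    # Valid only if the traversal finished with nothing left unmatched.
    return not stack
```

A mismatched closing bracket is simply pushed, so it stays on the stack and makes the final emptiness check fail; for instance `is_valid("(]")` is false while `is_valid("{[]}")` is true.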
Spark RDD Operations
    // Read a file; compressed files and paths are both accepted
    val lines = sc.textFile("hdfs://localhost:9000/...") // hdfs
    val textFile = sc.textFile("file:///usr/local/spark/mycode/wordcount/word.txt") // local
    // Read from the driver's memory
    val array = Ar...
Spark in Practice (5): DataFrame Basics, Handling Missing Values. Drop Missing Value:
    from pyspark.sql import SparkSession
    spark = SparkSession.builder.appName('aggs').getOrCreate()
    df = spark.read.csv('sales_info.csv', inferSchema=True, header=True)
    df.printSche...
Spark in Practice (5): DataFrame Basics, GroupBy and Aggregate. Contents: groupBy(); Aggregation; Aggregation Function. groupBy():
    from pyspark.sql import SparkSession
    spark = SparkSession.builder.appName('aggs').getOrCreate()
    df = spark.read.csv('sales_info.csv', inferSche...
Spark in Practice (4): DataFrame Basics, Filtering Data. Contents: filter, style 1; filter, style 2; Condition operators; Getting the result. filter, style 1:
    from pyspark.sql import SparkSession
    spark = SparkSession.builder.appName('ops').getOrCreate()
    df = spark.read.csv('appe_stock.csv', inferSchema=True, heade...
Spark in Practice (3): DataFrame Basics, Row/Column Operations and SQL. Contents: Row and column operations; SQL operations. Row and column operations:
    df['age']  # returns only a Column object
    df.select('age').show()  # returns a DataFrame with one column, which supports the show() method
    # see the first two rows
    df.head(2)  # r...
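Cloud Computing: Distributed Programming (1). Basic concepts. Sequential: the CPU executes only one program at a time and runs all programs in order. Concurrent: multiple tasks take turns using the CPU, sharing a single CPU over time. Parallel: multiple tasks execute simultaneously on multiple CPUs. Distributed program: parallel tasks execute on different, interconnected machines (not just CPUs). Multitasking: the user can run multiple tasks at the same time (...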
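Spark in Practice (2): DataFrame Basics, Creating a DataFrame. RDD syntax used to dominate, but it was hard to learn and use. With DataFrame, Spark is now easier to operate and use. Contents: Creating a DataFrame; Creating a DataFrame (with a specified schema). Creating a DataFrame:
    from pyspark.sql import SparkSession
    # create a new session
    spark = SparkSession.builder.appName('Basics')...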
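Spark in Practice (1): Configuring AWS EMR and Zeppelin Notebook. What is the difference between SparkContext and SparkSession, and which one should be used? SparkContext: used before Spark 2.0.0; connects to the cluster through a resource manager such as YARN; a SparkConf must be passed in to create the SparkContext object; to use the SQL, Hive, or Streaming APIs, a separate context must be created for each.
    val conf = new SparkConf() .setA...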