31
Jul

Modifying Dynamic Variables in ODS Graphics

If you are familiar with the output delivery system (ODS), then you know that you can modify the tables and graphs that analytical procedures display by modifying table and graph templates. Perhaps less familiar is the fact that you can also modify dynamic variables. Procedures use dynamic variables to control […]

The post Modifying Dynamic Variables in ODS Graphics appeared first on Graphically Speaking.

Read More
31
Jul

Map of US domestic cannabis eradication

In my quest for interesting data to graph, I found some Drug Enforcement Administration (DEA) data on US domestic cannabis eradication. Does the data say anything interesting? Read on to find out! ... While doing some searches for other data, I happened across a table on the DEA website titled […]

The post Map of US domestic cannabis eradication appeared first on The SAS Training Post.

Read More
31
Jul

SAS 9.4: Transport Layer Security (HTTPS) and SAS

Encryption and SAS is a wide ranging topic – so wide it gets its own book and features strongly in both the SAS(R) 9.4 Intelligence Platform: Security Administration Guide, Second Edition and SAS(R) 9.4 Intelligence Platform: Middle-Tier Administration Guide, Third Edition. In this blog we’ll take a high level look at […]

SAS 9.4: Transport Layer Security (HTTPS) and SAS was published on SAS Users.

Read More
31
Jul

Large matrices in SAS/IML 14.1

Last week, SAS released the 14.1 version of its analytics products, which are shipped as part of the third maintenance release of 9.4. If you run SAS/IML programs from a 64-bit Windows PC, you might be interested to know that you can now create matrices with about 231 ≈ 2 […]

The post Large matrices in SAS/IML 14.1 appeared first on The DO Loop.

Read More
31
Jul

Super-Easy Outlier Check with Proc Freq

Sometimes, you can just eyeball it. Really, if something truly is an outlier, you ought to be able to spot it. Take this plot, for example. It should be pretty obvious that the vast majority of our sample for the Fish Lake game were students in grades, 4, 5 and 6. Those in the lower […] Read More
30
Jul

How Not to Get Fired from a Software Team

I’m really busy here in Brazil. Honest. You think I am sitting here like this: But really, it’s like this: Still, I have taken time away to explain to you how not to get your sorry ass fired, so listen up niños . I had the benefit of starting writing software at a large organization, […] Read More
29
Jul

Unicode in Formatted Data - SAS 9.40M3

SAS 9.4 Maintenance release 3 was released on July 14.  The ODS Graphics procedures include many important, useful and cool features in this release, some that have been requested by you for a while.  In the next few articles, I will cover some of these features.  Last time I covered […]

The post Unicode in Formatted Data - SAS 9.40M3 appeared first on Graphically Speaking.

Read More
29
Jul

50 million illegal aliens apprehended in the US

There's been quite a bit of controversy about the number of undocumented immigrants in the US lately - for example, Ann Coulter claims that number is 30 million, whereas others claim it's about 11 million (readers of my blog are data-savvy, and would dig into the details of such claims, […]

The post 50 million illegal aliens apprehended in the US appeared first on The SAS Training Post.

Read More
29
Jul

Deploy edx spark environment to DigitalOcean

This summer I took the Spark courses at edx CS100 and CS190, and had wonderful experience.
The two classes apply a Vagrant virtual machine containing Spark and all teaching materials. There are two challenges with the virtual machine —
  1. The labs usually take long time to finish, say 8-10 hours. If the host machine is closed, the RDDs will be lost and the pipeline has to be run again.
  2. Some RDD operations take a lot computation/communication powers, such as groupByKey and distinct. Many of my 50k classmates complained about the waiting time. And my most used laptop is a Chromebook and doesn’t even have options to install Virtual Box.
To deploy the learning environment to a cloud may be an alternative. DigitalOcean is a good choice because it uses mirrors for most packages, and the network speed is amazingly fast that is almost 100MB/s (thanks to the SSD infrastructure DigitalOcean implements for the cloud, otherwise the hard disk may not stand this rapid IO; see my deployment records GitHub).

I found that a Linux box with 1 GB memory and 1 CPU at DigitalOcean that costs 10 dollars a month will handle most labs fairly easy with IPython and Spark. A 2 GB memory and 2 CPU droplet will be ideal since it is the minimal requirement for a simulated cluster. It costs 20 dollars a month, but is still much cheaper than the cost to earn the big data certificate that is $100 (50 for each). I just need to write Python scripts to install IPython notebook with SSL, and download Spark and the course materials.
  • The DevOps tool is Fabric and the fabfile is at GitHub.
  • The deployment pipeline is also at GitHub
Read More
29
Jul

SAS Enterprise Guide now updates itself

I returned to work from a 2+ week vacation this morning. When I fired up SAS Enterprise Guide (as I do each work day and occasionally on weekends), I was greeted with this message: As a SAS insider, I knew this was coming. It's a new feature that was added […]

The post SAS Enterprise Guide now updates itself appeared first on The SAS Dummy.

Read More
Back to Top