Charles De Vera

Hi! I'm Chad and I'm currently a fourth year undergraduate computer science student at University of California at Irvine. I'm into coding, web dev and web design, machine learning, and linguistics.

My coding skills include python (flask, spaCy, ntlk, Pytorch, sklearn, pandas, numpy), HTML/CSS, javascript(react, react-native, node.js), Java, C, and C++. I also know some database management (SQL, Firebase), Git and APIs.

My goal right now is to get a software engineering or data science job/internship and work on my grad school applications.

When I get frustrated with work, I like to play and watch basketball, watch anime, playing video games like Super Smash Bros and League of Legends, listening to any music, and drink water.

my linkedin, my github

last updated on 5/10/2020

TandomTV Mobile App

react-native (javascript), google firebase, Twitch.tv api

I helped create a simple mobile application for TandomTV with a team of 6 peoeple. This company aims to solve the pain points of watching videos on platforms such as Twitch on your AdnroidTV or smart TV by enabling users to do tasks such as logging in, clipping streams, and sending bits on your mobile device. This was done as part of a project course at UCI.
I helped develop the app itself and the backend, by integrating Google firebase to "connect" the TV and the phone. This is done by using the realtime database, allowing the phone and the TV to interact with each other in real time. Moreover, I integrated the Twitch.tv API to enable users to log in and do actions on te app on their behalf.

no github repo - project done for a private company, check out the app when it releases publicly

A Sudoku Solver

c++, vim, command line

This is a simple sudoku solver done in the command line; all coding was done by me and the puzzle is solved using backtracking search with forward checking for slightly faster results.

github

A Basic Search Engine

python, sqlite, beautifulsoup, spacetime-crawler, tkinter

This is a simple search engine that I worked on with a partner that searches webpages in UCI's ics domain. We used tdf-idf along with cosine similarity to rank the webpages according to the search query. Pages were stored and retrieved in an sqlite database.

no github repo - school project

Computation of Language Lab (CoLaLab)

python (spaCy, ntlk, pandas, pytorch, sklearn, numpy), google colab, html, css, javascript, jQuery

I am currently working with Dr. Lisa Pearl with the help of Dr. Richard Futrell on a project where we try to detect if a document is an imitation. For example, suppose we have a document X written by author A. If we are given a document Y which some author B wrote that imitates the style of author A, we try to detect that document Y is an imitation and not written by author A.
We use a variety of NLP techniques to accomplish this task and the quality of each method is captured using precision, recall, and accuracy. We are also using a dataset provided by Dr. Pearl's former student although we are in the process of getting more data from Amazon MTurk. Thus, the current results are based on very little data. Without going into too much detail, here are some of the approaches we have used (click the colored text for code):

SVM With Sentential Embeddings: Here, we use spaCy to access a word's sentential embedding. For the features, we combined the mean embedding vector of the original document, the mean embedding vector of the target document, and the the difference between these vectors. This creates a 900-dimension feature vector which we use an SVM to classify "real" documents and "fake" documents.
SpaCy's textcat Pipeline With BERT: Now, we use spaCy's own pipeline with our data. It is modified with the use of the BERT language model. The info can be found here on spaCy's website.
PyTorch LSTM With Sentential Embeddings: This was one of the more interesting approches as we used the same features as in the first bullet point, but we did not include the target document vector. Here, we abuse the LSTM as it can tell the difference between each vector. I learned to implement this in PyTorch and it gave the most promising results. However, we need to wait on more data to be completely sure.

paper in progress

Learning and Decision Neuroscience Lab

matlab, python, r

Still learning the ropes here but currently working on verifying MATLAB code written by a graduate student who already graduated. This involves two models, one based on reinforcement learning and one based on probabilistic graphical models, that model behavior on different schedules of reinforcement. I am focusing on verifying these models and potentially rebuilding them if needed.

Under construction!

charles de vera