Skip to content

Commit

Permalink
test: add Github workflow to build the branch 'cc'
Browse files Browse the repository at this point in the history
  • Loading branch information
sebastian-nagel committed Oct 29, 2024
1 parent 9a5a66f commit a16d3cf
Show file tree
Hide file tree
Showing 2 changed files with 59 additions and 3 deletions.
56 changes: 56 additions & 0 deletions .github/workflows/cc-build.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,56 @@
# Licensed to the Apache Software Foundation (ASF) under one or more
# contributor license agreements. See the NOTICE file distributed with
# this work for additional information regarding copyright ownership.
# The ASF licenses this file to You under the Apache License, Version 2.0
# (the "License"); you may not use this file except in compliance with
# the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

name: cc ci

on:
push:
branches: [cc]
pull_request:
types: [opened, synchronize, reopened]
branches: [cc]
jobs:
test:
strategy:
matrix:
java: ['11']
os: [ubuntu-latest]
runs-on: ${{ matrix.os }}
steps:
- uses: actions/checkout@v4
- name: Set up JDK ${{ matrix.java }}
uses: actions/setup-java@v4
with:
java-version: ${{ matrix.java }}
distribution: 'temurin'
- name: Install CLD2
run: |
sudo apt-get update
sudo apt-get install libcld2-0 libcld2-dev
- name: Install language-detection-cld2
run: |
git clone https://github.com/commoncrawl/language-detection-cld2.git
cd language-detection-cld2/
mvn install
- name: Install crawler-commons development version
run: |
git clone https://github.com/crawler-commons/crawler-commons.git
cd crawler-commons/
mvn install
- name: Install recent public suffix list
run: |
curl https://publicsuffix.org/list/public_suffix_list.dat -o conf/effective_tld_names.dat
- name: Test
run: ant clean test -buildfile build.xml
6 changes: 3 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -12,17 +12,17 @@ Notable additions in Common Crawl's fork of Nutch (not yet pushed to upstream Nu
How to install additional requirements to build this fork of Nutch:
- [crawler-commons](/crawler-commons/crawler-commons) development snapshot package:
```
git clone git@github.com:crawler-commons/crawler-commons.git
git clone https://github.com/crawler-commons/crawler-commons.git
cd crawler-commons/
mvn install
```
- install the latest public suffix list into `conf/` to ensure that it is definitely used (see #17):
- install the latest public suffix list into `conf/` to ensure that it is definitely used (see [#17](https://github.com/commoncrawl/nutch/issues/17)):
```
wget https://publicsuffix.org/list/public_suffix_list.dat -O conf/effective_tld_names.dat
```
- [Java wrapper for CLD2 language detection](/commoncrawl/language-detection-cld2)
```
git clone git@github.com:commoncrawl/language-detection-cld2.git
git clone https://github.com/commoncrawl/language-detection-cld2.git
cd language-detection-cld2/
mvn install
```
Expand Down

0 comments on commit a16d3cf

Please sign in to comment.