-
-
Notifications
You must be signed in to change notification settings - Fork 190
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[feature] Added intergration with openwisp-monitoring #488
Conversation
b07090d
to
d6cd4ad
Compare
2387c29
to
0794151
Compare
d5667db
to
92bb4e4
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Tests are failing on Django 3.2.0 here.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is still failing, I restarted the build to find out whether it was a temporary failure because the error looked weird (had to do with a non-existent time).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The build passed, let's keep testing this a bit more before merging.
e02a821
to
04db169
Compare
04db169
to
bfaabe7
Compare
bfaabe7
to
798c300
Compare
ca60da8
to
3a4da6d
Compare
3a4da6d
to
e94c22b
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Problem found during testing: the count of total users should be taken from the SQL DB because influxdb doesn't know. The goal of the total users chart is to show the total amount of users registered in the system at any given time.
e94c22b
to
dd2a83b
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Let's add more space between date filter and the pie charts.
Can we change the default colors of the RADIUS charts to differentiate between traffic charts?
They look too similar to the traffic charts and do not stand out.
Eg: (total: #3A3335, #FDF0D5, #C6D8D3, #F0544F, #D81E5B, #048A81, #343E8A)
I noticed that if the called-station-id of the session does not match exactly the mac address of a device, the total session and session traffic charts do not count these sessions, why? I think this is wrong, it can lead to loss of important data (eg: if conversion of called-station-id doesn't work temporarily or when traffic is performed by NASes which are not registered in OW).
Let's also move the radius sessions tab in the device page to a similar position as the WiFi sessions.
We found these shortcomings during the review
|
dd2a83b
to
c0cfe23
Compare
Summary of Changes: The current implementation faced a challenge wherein the charts for "User Registration" and "Total User Registration" started from zero, displaying the number of users registered since the feature deployment rather than the total count of users registered on the system. To address this, the proposed solution was to write the total number of registered users to InfluxDB once a day, eliminating the need for a data point write on every user registration event. Solution Exploration: In the latest iteration, I delved into the feasibility of the proposed solution. Initial investigations revealed a necessity to write metric points every hour instead of once a day, as the chart resolution for "last 7 days" draws a point for each hour. Implementation Details: To accommodate this, the following logic was implemented in Python: def write_user_registration_metrics():
now = timezone.now()
metric_data = []
# Get the registration data for the past hour.
# The query returns a tuple of organization_id, registration_method and
# count of users who registered with that organization and method.
registered_users = (
RegisteredUser.objects.filter(
user__openwisp_users_organizationuser__created__lte=now,
user__openwisp_users_organizationuser__created__gt=now
- timezone.timedelta(hours=1),
)
.values_list('user__openwisp_users_organizationuser__organization_id', 'method')
.annotate(count=Count('user', distinct=True))
)
for org_id, registration_method, count in registered_users:
registration_method = clean_registration_method(registration_method)
metric, _ = Metric._get_or_create(
configuration='user_signups_new',
name='User SignUps',
key='user_signups_new',
object_id=None,
content_type=None,
extra_tags={
'organization_id': str(org_id),
'method': registration_method,
},
)
metric_data.append((metric, {'value': count, 'time': now}))
Metric.batch_write(metric_data) Caveats Identified: However, two caveats were encountered:
Decision and Refinement: After discussion with @nemesifier, it was concluded that the current implementation, which allows a DISTINCT operation on user IDs, resolves the first caveat effectively. To address the second caveat, it was decided to update the logic to write a metric point for all users created. For users without a RegisteredUser object, the metric point will be assigned an "unspecified" method. Resolution: This adjustment resolves the mismatch between the total user count displayed on charts and the user's changelist page. Furthermore, to rectify the initial problem, a celery task will be added to retroactively write metric points for old data from the database. This celery task would get triggered on the post_migration signal, thus aligning the implementation with the desired outcome. |
c009ba0
to
a0ac82c
Compare
I introduced a task to retroactively write RADIUS metrics from the database to InfluxDB. This task aimed to populate InfluxDB with historical data, ensuring comprehensive metric tracking. Task Implementation: The task, outlined below, iterates through user data to extract relevant metrics: @shared_task
def post_migrate_retroactive_write():
"""
This task retroactively writes the RADIUS metrics
to InfluxDB using the data present in the database.
"""
metric_data = []
for user in (
User.objects.select_related('registered_user')
.only('id', 'registered_user__method')
.iterator()
):
try:
registration_method = user.registered_user.method
except RegisteredUser.DoesNotExist:
registration_method = 'unspecified'
registration_method = clean_registration_method(registration_method)
org_query = list(
user.openwisp_users_organizationuser.values_list(
'organization_id',
'created',
)
)
if not org_query:
# This user is not a member of any organization.
# Use "None" for organization_id for this user.
org_query = [(None, user.date_joined)]
for (org_id, time) in org_query:
metric = _get_user_signup_metric(
organization_id=str(org_id), registration_method=registration_method
)
metric_data.append(
# (metric, {'value': sha1_hash(str(user.id)), 'time': time})
(metric, {'value': user.username, 'time': time})
)
Metric.batch_write(metric_data) Encountered Caveats: During testing with real-world data, two significant challenges were identified: Retention Policy Limitations: InfluxDB's retention policy imposes constraints on storing data beyond defined limits. Consequently, storing metrics from the installation's inception becomes infeasible. Moreover, InfluxDB's continuous deletion of older data introduces inconsistency in total user values over time. Chart Display Limitations: The chart displays totals based on the selected time range exclusively. For instance, if there are a total of 60 users – 59 registered within the last year and one registered within the last two years – the chart would only depict 59 users when the time range is set to "last 365 days." Resolution Considerations: In response to the encountered caveats, a proposal was made to implement a separate retention period for this data. While this would address the first caveat, the second remains a concern. @nemesifier suggested reverting to the previous implementation, where the number of users registered with each method for each organization is recorded in InfluxDB every hour. To resolve the caveats with this approach:
The total count of registered users will be recorded in InfluxDB with a special value (e.g., We would write the number of registered users for each organization separately. This values would be used when the chart is filtered by an organization or when a non-superuser is viewing the chart.
If a user does not have a related RegisteredUser object, then use "unspecified" method for that user. User the special value as organization_id for such users. |
d36e1e7
to
d512e90
Compare
- Removed obsolete code
- Rename "User Registration" to "User Registrations" - Rename "Unique RADIUS Session Count" to "Unique RADIUS Sessions" - Rename "Total RADIUS Sessions Traffic" to "Traffic of RADIUS Sessions"
1bdf7b6
to
edeaa79
Compare
8a8cb1c
to
4a61e80
Compare
4a61e80
to
36e7b01
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ready! 🏁 🔥 🚀
Blockers
Closes openwisp/openwisp-monitoring#532
Closes openwisp/openwisp-monitoring#534