Russian QnA 333K
nyuuzyou

folder main (2 files)
filedata.parquet 225.87MB
fileREADME.md 1.73kB
Type: Dataset
Tags:

Bibtex:
@article{,
title= {Russian QnA 333K},
journal= {},
author= {nyuuzyou},
year= {},
url= {https://huggingface.co/datasets/nyuuzyou/ru-QnA-333K},
abstract= {# Dataset Card for Russian QnA

### Dataset Summary
This dataset contains a collection of questions and answers in Russian. The dataset includes questions across various categories with corresponding answers, ratings, and metadata.

### Languages
The dataset content is primarily in Russian:
- Russian (ru)

## Dataset Structure

### Data Files
- Single file containing all Q&A records: `data.parquet`

### Data Fields
Each record contains the following fields:
- `question_id`: Unique identifier for the question.
- `question_title`: Title/subject of the question.
- `question_description`: Extended description or body of the question.
- `question_images`: Array of image URLs associated with the question.
- `category`: Category/topic area of the question (e.g., "здоровье и медицина").
- `tags`: Array of tags associated with the question.
- `question_rating`: Rating/score of the question.
- `answers`: Array of answer objects, each containing:
  - `answer_text`: Text content of the answer
  - `answer_images`: Array of image URLs in the answer
  - `answer_rating`: Rating/score of the answer

### Data Splits
The dataset contains a single split with all Q&A records:

| Split   | Description                      | Number of Examples |
| :------ | :------------------------------- | -----------------: |
| `train` | All question-answer pairs        | 333,029            |
},
keywords= {},
terms= {},
license= {},
superseded= {}
}

No stats to report yet.

Send Feedback Start
   0.000008
DB Connect
   0.000434
Lookup hash in DB
   0.000410
Get torrent details
   0.000123
Get torrent details, finished
   0.000203
Get authors
   0.000026
Parse bibtex
   0.000127
Write header
   0.000293
get stars
   0.000099
home tab
   0.000227
render right panel
   0.000007
render ads
   0.000351
fetch current hosters
   0.000230
Start get stats
   0.000380
End get stats
   0.000001
related datasets
   0.009300
Done