Russian QnA 333K
nyuuzyou

Files in main (2):
data.parquet 225.87 MB
README.md 1.73 kB
Type: Dataset

Bibtex:
@article{,
title= {Russian QnA 333K},
journal= {},
author= {nyuuzyou},
year= {},
url= {https://huggingface.co/datasets/nyuuzyou/ru-QnA-333K},
abstract= {# Dataset Card for Russian QnA

### Dataset Summary
This dataset contains 333,029 Russian-language questions across various categories. Each question comes with its answers, ratings for both the question and the answers, and metadata such as category, tags, and image URLs.

### Languages
The dataset content is primarily in Russian:
- Russian (ru)

## Dataset Structure

### Data Files
- Single file containing all Q&A records: `data.parquet`

### Data Fields
Each record contains the following fields:
- `question_id`: Unique identifier for the question.
- `question_title`: Title/subject of the question.
- `question_description`: Extended description or body of the question.
- `question_images`: Array of image URLs associated with the question.
- `category`: Category/topic area of the question (e.g., "здоровье и медицина").
- `tags`: Array of tags associated with the question.
- `question_rating`: Rating/score of the question.
- `answers`: Array of answer objects, each containing:
  - `answer_text`: Text content of the answer
  - `answer_images`: Array of image URLs in the answer
  - `answer_rating`: Rating/score of the answer
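
To illustrate the nested layout described above, here is a minimal sketch that walks one record and picks its highest-rated answer. The sample record is hypothetical: its values are invented, and the concrete types (integer IDs and ratings) are assumptions, since the card does not state them. The helper names `iter_answers` and `top_answer` are illustrative and not part of the dataset.

```python
from typing import Any, Dict, Iterator, List, Optional, Tuple

# Hypothetical record shaped like the field list above; all values are made up
# and the integer types for IDs and ratings are assumptions.
sample_record: Dict[str, Any] = {
    "question_id": 1,
    "question_title": "Пример вопроса",
    "question_description": "Описание вопроса",
    "question_images": [],
    "category": "здоровье и медицина",
    "tags": ["пример"],
    "question_rating": 5,
    "answers": [
        {"answer_text": "Первый ответ", "answer_images": [], "answer_rating": 3},
        {"answer_text": "Второй ответ", "answer_images": [], "answer_rating": 7},
    ],
}


def _answers(record: Dict[str, Any]) -> List[Dict[str, Any]]:
    """Return the record's answers as a list, treating a missing field as empty."""
    raw = record.get("answers")
    return [] if raw is None else list(raw)


def iter_answers(record: Dict[str, Any]) -> Iterator[Tuple[Any, str, Any]]:
    """Yield one (question_id, answer_text, answer_rating) row per answer."""
    for answer in _answers(record):
        yield record["question_id"], answer["answer_text"], answer["answer_rating"]


def top_answer(record: Dict[str, Any]) -> Optional[Dict[str, Any]]:
    """Return the answer with the highest rating, or None if there are none."""
    answers = _answers(record)
    if not answers:
        return None
    return max(answers, key=lambda a: a["answer_rating"])
```

Flattening to one row per answer is convenient for building question-answer pairs, while `top_answer` keeps a single answer per question when only the best-rated one is wanted.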

### Data Splits
The dataset contains a single split with all Q&A records:

| Split   | Description                      | Number of Examples |
| :------ | :------------------------------- | -----------------: |
| `train` | All question-answer pairs        | 333,029            |
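
As a rough sketch of how the single split might be loaded and sanity-checked, the snippet below assumes the Hugging Face `datasets` library is installed and that the repository ID matches the URL given for this dataset (`nyuuzyou/ru-QnA-333K`). The function names `load_train_split` and `row_count_matches` are hypothetical helpers, not part of any library.

```python
# Row count taken from the split table above.
EXPECTED_TRAIN_ROWS = 333_029


def load_train_split():
    """Load the `train` split; requires the `datasets` package and network access."""
    # Imported lazily so this module can be imported without `datasets` installed.
    from datasets import load_dataset

    # Assumption: the repository ID mirrors the dataset URL.
    return load_dataset("nyuuzyou/ru-QnA-333K", split="train")


def row_count_matches(num_rows: int, expected: int = EXPECTED_TRAIN_ROWS) -> bool:
    """Check whether a loaded split has the documented number of examples."""
    return num_rows == expected
```

A typical use would be `ds = load_train_split()` followed by `row_count_matches(len(ds))`. A mismatch would suggest the hosted data has changed since this card was written.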
},
keywords= {},
terms= {},
license= {},
superseded= {}
}

