ETNet is a bounding box detection model based on a CNN feature extractor, a Transformer Encoder, and a prediction head. Given an image, the attention layers built in Transformer can capture long-range spatial relationships between extreme keypoints generated from COCO dataset. The keypoints then grouped using a center grouping method to generate bounding boxes for objects.
-
Notifications
You must be signed in to change notification settings - Fork 1
nasim-ahmed/ETNet
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
No description, website, or topics provided.
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published