; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0013790 (gene) of Snake gourd v1 genome

Gene IDTan0013790
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG06:43887859..43889183
RNA-Seq ExpressionTan0013790
SyntenyTan0013790
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]2.1e-11755.36Show/hide
Query:  MNGASIDDSSQVSFILETLSKSFLQFSSNVVMNKIIYTLATLLNELQNFQSLMRIRTPEAEANVA--YRSYHRGSTSGTKFVASSHPKGKNKKMKKG---
        MNGA ID++SQVSFILE+L +SFLQF SN VMNKI YTL TLLNELQ F+SLM+I+  + EANVA   R +HRGSTSGTK + SS    K KK K G   
Subjt:  MNGASIDDSSQVSFILETLSKSFLQFSSNVVMNKIIYTLATLLNELQNFQSLMRIRTPEAEANVA--YRSYHRGSTSGTKFVASSHPKGKNKKMKKG---

Query:  KADRVAAQKGKKVKELAEKGKCFICNRNDHWKRNYPKFLVERK--NQDKCDLLVTETCLVESNDSACILDSGTTNHVCFSFQGISSWQQLREGGVTLQVG
        KA+  AA+  KK K  A KG CF CN+  HWKRN PK+L E+K   Q K DLLV ETCLVE++DSA I+DSG TNHVC SFQGISSW+QL  G +T++VG
Subjt:  KADRVAAQKGKKVKELAEKGKCFICNRNDHWKRNYPKFLVERK--NQDKCDLLVTETCLVESNDSACILDSGTTNHVCFSFQGISSWQQLREGGVTLQVG

Query:  SGEIVSAAAIGKVKLFFGGKYLLLDNLYIVPGFTRNLVSISFLIEQC-----------------------------------------------------
        +G +VSA A+G ++L     +LLL+N+Y+VP   RNL+S+  L+EQ                                                      
Subjt:  SGEIVSAAAIGKVKLFFGGKYLLLDNLYIVPGFTRNLVSISFLIEQC-----------------------------------------------------

Query:  ------------------HINLNRIDRLVKSGLLSQLEENSLPVCELCLEGKMTKRLFSRKRYRVKELLELIHFDLCGPMSVKARGGYEYFVSFIDDYSR
                          HINLNRI+RLVK+GLLS+LEENSLPVCE CLEGKMTKR F+ K +R KE LEL+H DLCGPM+VKARGG+EYF++F DDYSR
Subjt:  ------------------HINLNRIDRLVKSGLLSQLEENSLPVCELCLEGKMTKRLFSRKRYRVKELLELIHFDLCGPMSVKARGGYEYFVSFIDDYSR

Query:  YGYIYLMHRKSETLEKFKEYKTKVENLLGKTIKTLRSDQGGEYMDTKF
        YGY+YLM  KSE LEKFKEYK +VEN L KTIKT RSD+GGEYMD KF
Subjt:  YGYIYLMHRKSETLEKFKEYKTKVENLLGKTIKTLRSDQGGEYMDTKF

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]5.4e-11855.36Show/hide
Query:  MNGASIDDSSQVSFILETLSKSFLQFSSNVVMNKIIYTLATLLNELQNFQSLMRIRTPEAEANVA--YRSYHRGSTSGTKFVASSHPKGKNKKMKKG---
        MNGA ID++SQVSFILE+L +SFLQF SN VMNKI YTL TLLNELQ F+SLM+I+  + EANVA   R +HRGSTSGTK + SS    K KK K G   
Subjt:  MNGASIDDSSQVSFILETLSKSFLQFSSNVVMNKIIYTLATLLNELQNFQSLMRIRTPEAEANVA--YRSYHRGSTSGTKFVASSHPKGKNKKMKKG---

Query:  KADRVAAQKGKKVKELAEKGKCFICNRNDHWKRNYPKFLVERK--NQDKCDLLVTETCLVESNDSACILDSGTTNHVCFSFQGISSWQQLREGGVTLQVG
        KA+  AA+  KK K  A KG CF CN+  HWKRN PK+L E+K   Q K DLLV ETCLVE++DSA I+DSG TNHVC SFQGISSWQQL  G +T++VG
Subjt:  KADRVAAQKGKKVKELAEKGKCFICNRNDHWKRNYPKFLVERK--NQDKCDLLVTETCLVESNDSACILDSGTTNHVCFSFQGISSWQQLREGGVTLQVG

Query:  SGEIVSAAAIGKVKLFFGGKYLLLDNLYIVPGFTRNLVSISFLIEQC-----------------------------------------------------
        +G +VSA A+G ++L+    +LLL+N+Y+VP   RNL+S+  L+EQ                                                      
Subjt:  SGEIVSAAAIGKVKLFFGGKYLLLDNLYIVPGFTRNLVSISFLIEQC-----------------------------------------------------

Query:  ------------------HINLNRIDRLVKSGLLSQLEENSLPVCELCLEGKMTKRLFSRKRYRVKELLELIHFDLCGPMSVKARGGYEYFVSFIDDYSR
                          HINLNRI+RLVK+GLLS+LEENSLPVCE CLEGKMTKR F+ K +R KE LEL+H +LCGPM+VKARGG+EYF++F DDYSR
Subjt:  ------------------HINLNRIDRLVKSGLLSQLEENSLPVCELCLEGKMTKRLFSRKRYRVKELLELIHFDLCGPMSVKARGGYEYFVSFIDDYSR

Query:  YGYIYLMHRKSETLEKFKEYKTKVENLLGKTIKTLRSDQGGEYMDTKF
        YGY+YLM  KSE LEKFKEYK +VEN L KTIKT RSD+GGEYMD KF
Subjt:  YGYIYLMHRKSETLEKFKEYKTKVENLLGKTIKTLRSDQGGEYMDTKF

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]2.1e-11755.36Show/hide
Query:  MNGASIDDSSQVSFILETLSKSFLQFSSNVVMNKIIYTLATLLNELQNFQSLMRIRTPEAEANVA--YRSYHRGSTSGTKFVASSHPKGKNKKMKKG---
        MNGA ID++SQVSFILE+L +SFLQF SN VMNKI YTL TLLNELQ F+SLM+I+  + EANVA   R +HRGSTSGTK + SS    K KK K G   
Subjt:  MNGASIDDSSQVSFILETLSKSFLQFSSNVVMNKIIYTLATLLNELQNFQSLMRIRTPEAEANVA--YRSYHRGSTSGTKFVASSHPKGKNKKMKKG---

Query:  KADRVAAQKGKKVKELAEKGKCFICNRNDHWKRNYPKFLVERK--NQDKCDLLVTETCLVESNDSACILDSGTTNHVCFSFQGISSWQQLREGGVTLQVG
        KA+  AA+  KK K  A KG CF CN+  HWKRN PK+L E+K   Q K DLLV ETCLVE++DSA I+DSG TNHVC SFQGISSW+QL  G +T++VG
Subjt:  KADRVAAQKGKKVKELAEKGKCFICNRNDHWKRNYPKFLVERK--NQDKCDLLVTETCLVESNDSACILDSGTTNHVCFSFQGISSWQQLREGGVTLQVG

Query:  SGEIVSAAAIGKVKLFFGGKYLLLDNLYIVPGFTRNLVSISFLIEQC-----------------------------------------------------
        +G +VSA A+G ++L     +LLL+N+Y+VP   RNL+S+  L+EQ                                                      
Subjt:  SGEIVSAAAIGKVKLFFGGKYLLLDNLYIVPGFTRNLVSISFLIEQC-----------------------------------------------------

Query:  ------------------HINLNRIDRLVKSGLLSQLEENSLPVCELCLEGKMTKRLFSRKRYRVKELLELIHFDLCGPMSVKARGGYEYFVSFIDDYSR
                          HINLNRI+RLVK+GLLS+LEENSLPVCE CLEGKMTKR F+ K +R KE LEL+H DLCGPM+VKARGG+EYF++F DDYSR
Subjt:  ------------------HINLNRIDRLVKSGLLSQLEENSLPVCELCLEGKMTKRLFSRKRYRVKELLELIHFDLCGPMSVKARGGYEYFVSFIDDYSR

Query:  YGYIYLMHRKSETLEKFKEYKTKVENLLGKTIKTLRSDQGGEYMDTKF
        YGY+YLM  KSE LEKFKEYK +VEN L KTIKT RSD+GGEYMD KF
Subjt:  YGYIYLMHRKSETLEKFKEYKTKVENLLGKTIKTLRSDQGGEYMDTKF

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]2.1e-11755.36Show/hide
Query:  MNGASIDDSSQVSFILETLSKSFLQFSSNVVMNKIIYTLATLLNELQNFQSLMRIRTPEAEANVA--YRSYHRGSTSGTKFVASSHPKGKNKKMKKG---
        MNGA ID++SQVSFILE+L +SFLQF SN VMNKI YTL TLLNELQ F+SLM+I+  + EANVA   R +HRGSTSGTK + SS    K KK K G   
Subjt:  MNGASIDDSSQVSFILETLSKSFLQFSSNVVMNKIIYTLATLLNELQNFQSLMRIRTPEAEANVA--YRSYHRGSTSGTKFVASSHPKGKNKKMKKG---

Query:  KADRVAAQKGKKVKELAEKGKCFICNRNDHWKRNYPKFLVERK--NQDKCDLLVTETCLVESNDSACILDSGTTNHVCFSFQGISSWQQLREGGVTLQVG
        KA+  AA+  KK K  A KG CF CN+  HWKRN PK+L E+K   Q K DLLV ETCLVE++DSA I+DSG TNHVC SFQGISSW+QL  G +T++VG
Subjt:  KADRVAAQKGKKVKELAEKGKCFICNRNDHWKRNYPKFLVERK--NQDKCDLLVTETCLVESNDSACILDSGTTNHVCFSFQGISSWQQLREGGVTLQVG

Query:  SGEIVSAAAIGKVKLFFGGKYLLLDNLYIVPGFTRNLVSISFLIEQC-----------------------------------------------------
        +G +VSA A+G ++L     +LLL+N+Y+VP   RNL+S+  L+EQ                                                      
Subjt:  SGEIVSAAAIGKVKLFFGGKYLLLDNLYIVPGFTRNLVSISFLIEQC-----------------------------------------------------

Query:  ------------------HINLNRIDRLVKSGLLSQLEENSLPVCELCLEGKMTKRLFSRKRYRVKELLELIHFDLCGPMSVKARGGYEYFVSFIDDYSR
                          HINLNRI+RLVK+GLLS+LEENSLPVCE CLEGKMTKR F+ K +R KE LEL+H DLCGPM+VKARGG+EYF++F DDYSR
Subjt:  ------------------HINLNRIDRLVKSGLLSQLEENSLPVCELCLEGKMTKRLFSRKRYRVKELLELIHFDLCGPMSVKARGGYEYFVSFIDDYSR

Query:  YGYIYLMHRKSETLEKFKEYKTKVENLLGKTIKTLRSDQGGEYMDTKF
        YGY+YLM  KSE LEKFKEYK +VEN L KTIKT RSD+GGEYMD KF
Subjt:  YGYIYLMHRKSETLEKFKEYKTKVENLLGKTIKTLRSDQGGEYMDTKF

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]2.1e-11755.36Show/hide
Query:  MNGASIDDSSQVSFILETLSKSFLQFSSNVVMNKIIYTLATLLNELQNFQSLMRIRTPEAEANVA--YRSYHRGSTSGTKFVASSHPKGKNKKMKKG---
        MNGA ID++SQVSFILE+L +SFLQF SN VMNKI YTL TLLNELQ F+SLM+I+  + EANVA   R +HRGSTSGTK + SS    K KK K G   
Subjt:  MNGASIDDSSQVSFILETLSKSFLQFSSNVVMNKIIYTLATLLNELQNFQSLMRIRTPEAEANVA--YRSYHRGSTSGTKFVASSHPKGKNKKMKKG---

Query:  KADRVAAQKGKKVKELAEKGKCFICNRNDHWKRNYPKFLVERK--NQDKCDLLVTETCLVESNDSACILDSGTTNHVCFSFQGISSWQQLREGGVTLQVG
        KA+  AA+  KK K  A KG CF CN+  HWKRN PK+L E+K   Q K DLLV ETCLVE++DSA I+DSG TNHVC SFQGISSW+QL  G +T++VG
Subjt:  KADRVAAQKGKKVKELAEKGKCFICNRNDHWKRNYPKFLVERK--NQDKCDLLVTETCLVESNDSACILDSGTTNHVCFSFQGISSWQQLREGGVTLQVG

Query:  SGEIVSAAAIGKVKLFFGGKYLLLDNLYIVPGFTRNLVSISFLIEQC-----------------------------------------------------
        +G +VSA A+G ++L     +LLL+N+Y+VP   RNL+S+  L+EQ                                                      
Subjt:  SGEIVSAAAIGKVKLFFGGKYLLLDNLYIVPGFTRNLVSISFLIEQC-----------------------------------------------------

Query:  ------------------HINLNRIDRLVKSGLLSQLEENSLPVCELCLEGKMTKRLFSRKRYRVKELLELIHFDLCGPMSVKARGGYEYFVSFIDDYSR
                          HINLNRI+RLVK+GLLS+LEENSLPVCE CLEGKMTKR F+ K +R KE LEL+H DLCGPM+VKARGG+EYF++F DDYSR
Subjt:  ------------------HINLNRIDRLVKSGLLSQLEENSLPVCELCLEGKMTKRLFSRKRYRVKELLELIHFDLCGPMSVKARGGYEYFVSFIDDYSR

Query:  YGYIYLMHRKSETLEKFKEYKTKVENLLGKTIKTLRSDQGGEYMDTKF
        YGY+YLM  KSE LEKFKEYK +VEN L KTIKT RSD+GGEYMD KF
Subjt:  YGYIYLMHRKSETLEKFKEYKTKVENLLGKTIKTLRSDQGGEYMDTKF

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein1.0e-11755.36Show/hide
Query:  MNGASIDDSSQVSFILETLSKSFLQFSSNVVMNKIIYTLATLLNELQNFQSLMRIRTPEAEANVA--YRSYHRGSTSGTKFVASSHPKGKNKKMKKG---
        MNGA ID++SQVSFILE+L +SFLQF SN VMNKI YTL TLLNELQ F+SLM+I+  + EANVA   R +HRGSTSGTK + SS    K KK K G   
Subjt:  MNGASIDDSSQVSFILETLSKSFLQFSSNVVMNKIIYTLATLLNELQNFQSLMRIRTPEAEANVA--YRSYHRGSTSGTKFVASSHPKGKNKKMKKG---

Query:  KADRVAAQKGKKVKELAEKGKCFICNRNDHWKRNYPKFLVERK--NQDKCDLLVTETCLVESNDSACILDSGTTNHVCFSFQGISSWQQLREGGVTLQVG
        KA+  AA+  KK K  A KG CF CN+  HWKRN PK+L E+K   Q K DLLV ETCLVE++DSA I+DSG TNHVC SFQGISSW+QL  G +T++VG
Subjt:  KADRVAAQKGKKVKELAEKGKCFICNRNDHWKRNYPKFLVERK--NQDKCDLLVTETCLVESNDSACILDSGTTNHVCFSFQGISSWQQLREGGVTLQVG

Query:  SGEIVSAAAIGKVKLFFGGKYLLLDNLYIVPGFTRNLVSISFLIEQC-----------------------------------------------------
        +G +VSA A+G ++L     +LLL+N+Y+VP   RNL+S+  L+EQ                                                      
Subjt:  SGEIVSAAAIGKVKLFFGGKYLLLDNLYIVPGFTRNLVSISFLIEQC-----------------------------------------------------

Query:  ------------------HINLNRIDRLVKSGLLSQLEENSLPVCELCLEGKMTKRLFSRKRYRVKELLELIHFDLCGPMSVKARGGYEYFVSFIDDYSR
                          HINLNRI+RLVK+GLLS+LEENSLPVCE CLEGKMTKR F+ K +R KE LEL+H DLCGPM+VKARGG+EYF++F DDYSR
Subjt:  ------------------HINLNRIDRLVKSGLLSQLEENSLPVCELCLEGKMTKRLFSRKRYRVKELLELIHFDLCGPMSVKARGGYEYFVSFIDDYSR

Query:  YGYIYLMHRKSETLEKFKEYKTKVENLLGKTIKTLRSDQGGEYMDTKF
        YGY+YLM  KSE LEKFKEYK +VEN L KTIKT RSD+GGEYMD KF
Subjt:  YGYIYLMHRKSETLEKFKEYKTKVENLLGKTIKTLRSDQGGEYMDTKF

A0A5A7TU93 Gag/pol protein2.6e-11855.36Show/hide
Query:  MNGASIDDSSQVSFILETLSKSFLQFSSNVVMNKIIYTLATLLNELQNFQSLMRIRTPEAEANVA--YRSYHRGSTSGTKFVASSHPKGKNKKMKKG---
        MNGA ID++SQVSFILE+L +SFLQF SN VMNKI YTL TLLNELQ F+SLM+I+  + EANVA   R +HRGSTSGTK + SS    K KK K G   
Subjt:  MNGASIDDSSQVSFILETLSKSFLQFSSNVVMNKIIYTLATLLNELQNFQSLMRIRTPEAEANVA--YRSYHRGSTSGTKFVASSHPKGKNKKMKKG---

Query:  KADRVAAQKGKKVKELAEKGKCFICNRNDHWKRNYPKFLVERK--NQDKCDLLVTETCLVESNDSACILDSGTTNHVCFSFQGISSWQQLREGGVTLQVG
        KA+  AA+  KK K  A KG CF CN+  HWKRN PK+L E+K   Q K DLLV ETCLVE++DSA I+DSG TNHVC SFQGISSWQQL  G +T++VG
Subjt:  KADRVAAQKGKKVKELAEKGKCFICNRNDHWKRNYPKFLVERK--NQDKCDLLVTETCLVESNDSACILDSGTTNHVCFSFQGISSWQQLREGGVTLQVG

Query:  SGEIVSAAAIGKVKLFFGGKYLLLDNLYIVPGFTRNLVSISFLIEQC-----------------------------------------------------
        +G +VSA A+G ++L+    +LLL+N+Y+VP   RNL+S+  L+EQ                                                      
Subjt:  SGEIVSAAAIGKVKLFFGGKYLLLDNLYIVPGFTRNLVSISFLIEQC-----------------------------------------------------

Query:  ------------------HINLNRIDRLVKSGLLSQLEENSLPVCELCLEGKMTKRLFSRKRYRVKELLELIHFDLCGPMSVKARGGYEYFVSFIDDYSR
                          HINLNRI+RLVK+GLLS+LEENSLPVCE CLEGKMTKR F+ K +R KE LEL+H +LCGPM+VKARGG+EYF++F DDYSR
Subjt:  ------------------HINLNRIDRLVKSGLLSQLEENSLPVCELCLEGKMTKRLFSRKRYRVKELLELIHFDLCGPMSVKARGGYEYFVSFIDDYSR

Query:  YGYIYLMHRKSETLEKFKEYKTKVENLLGKTIKTLRSDQGGEYMDTKF
        YGY+YLM  KSE LEKFKEYK +VEN L KTIKT RSD+GGEYMD KF
Subjt:  YGYIYLMHRKSETLEKFKEYKTKVENLLGKTIKTLRSDQGGEYMDTKF

A0A5A7TWB9 Gag/pol protein1.0e-11755.36Show/hide
Query:  MNGASIDDSSQVSFILETLSKSFLQFSSNVVMNKIIYTLATLLNELQNFQSLMRIRTPEAEANVA--YRSYHRGSTSGTKFVASSHPKGKNKKMKKG---
        MNGA ID++SQVSFILE+L +SFLQF SN VMNKI YTL TLLNELQ F+SLM+I+  + EANVA   R +HRGSTSGTK + SS    K KK K G   
Subjt:  MNGASIDDSSQVSFILETLSKSFLQFSSNVVMNKIIYTLATLLNELQNFQSLMRIRTPEAEANVA--YRSYHRGSTSGTKFVASSHPKGKNKKMKKG---

Query:  KADRVAAQKGKKVKELAEKGKCFICNRNDHWKRNYPKFLVERK--NQDKCDLLVTETCLVESNDSACILDSGTTNHVCFSFQGISSWQQLREGGVTLQVG
        KA+  AA+  KK K  A KG CF CN+  HWKRN PK+L E+K   Q K DLLV ETCLVE++DSA I+DSG TNHVC SFQGISSW+QL  G +T++VG
Subjt:  KADRVAAQKGKKVKELAEKGKCFICNRNDHWKRNYPKFLVERK--NQDKCDLLVTETCLVESNDSACILDSGTTNHVCFSFQGISSWQQLREGGVTLQVG

Query:  SGEIVSAAAIGKVKLFFGGKYLLLDNLYIVPGFTRNLVSISFLIEQC-----------------------------------------------------
        +G +VSA A+G ++L     +LLL+N+Y+VP   RNL+S+  L+EQ                                                      
Subjt:  SGEIVSAAAIGKVKLFFGGKYLLLDNLYIVPGFTRNLVSISFLIEQC-----------------------------------------------------

Query:  ------------------HINLNRIDRLVKSGLLSQLEENSLPVCELCLEGKMTKRLFSRKRYRVKELLELIHFDLCGPMSVKARGGYEYFVSFIDDYSR
                          HINLNRI+RLVK+GLLS+LEENSLPVCE CLEGKMTKR F+ K +R KE LEL+H DLCGPM+VKARGG+EYF++F DDYSR
Subjt:  ------------------HINLNRIDRLVKSGLLSQLEENSLPVCELCLEGKMTKRLFSRKRYRVKELLELIHFDLCGPMSVKARGGYEYFVSFIDDYSR

Query:  YGYIYLMHRKSETLEKFKEYKTKVENLLGKTIKTLRSDQGGEYMDTKF
        YGY+YLM  KSE LEKFKEYK +VEN L KTIKT RSD+GGEYMD KF
Subjt:  YGYIYLMHRKSETLEKFKEYKTKVENLLGKTIKTLRSDQGGEYMDTKF

A0A5A7V4M1 Gag/pol protein1.0e-11755.36Show/hide
Query:  MNGASIDDSSQVSFILETLSKSFLQFSSNVVMNKIIYTLATLLNELQNFQSLMRIRTPEAEANVA--YRSYHRGSTSGTKFVASSHPKGKNKKMKKG---
        MNGA ID++SQVSFILE+L +SFLQF SN VMNKI YTL TLLNELQ F+SLM+I+  + EANVA   R +HRGSTSGTK + SS    K KK K G   
Subjt:  MNGASIDDSSQVSFILETLSKSFLQFSSNVVMNKIIYTLATLLNELQNFQSLMRIRTPEAEANVA--YRSYHRGSTSGTKFVASSHPKGKNKKMKKG---

Query:  KADRVAAQKGKKVKELAEKGKCFICNRNDHWKRNYPKFLVERK--NQDKCDLLVTETCLVESNDSACILDSGTTNHVCFSFQGISSWQQLREGGVTLQVG
        KA+  AA+  KK K  A KG CF CN+  HWKRN PK+L E+K   Q K DLLV ETCLVE++DSA I+DSG TNHVC SFQGISSW+QL  G +T++VG
Subjt:  KADRVAAQKGKKVKELAEKGKCFICNRNDHWKRNYPKFLVERK--NQDKCDLLVTETCLVESNDSACILDSGTTNHVCFSFQGISSWQQLREGGVTLQVG

Query:  SGEIVSAAAIGKVKLFFGGKYLLLDNLYIVPGFTRNLVSISFLIEQC-----------------------------------------------------
        +G +VSA A+G ++L     +LLL+N+Y+VP   RNL+S+  L+EQ                                                      
Subjt:  SGEIVSAAAIGKVKLFFGGKYLLLDNLYIVPGFTRNLVSISFLIEQC-----------------------------------------------------

Query:  ------------------HINLNRIDRLVKSGLLSQLEENSLPVCELCLEGKMTKRLFSRKRYRVKELLELIHFDLCGPMSVKARGGYEYFVSFIDDYSR
                          HINLNRI+RLVK+GLLS+LEENSLPVCE CLEGKMTKR F+ K +R KE LEL+H DLCGPM+VKARGG+EYF++F DDYSR
Subjt:  ------------------HINLNRIDRLVKSGLLSQLEENSLPVCELCLEGKMTKRLFSRKRYRVKELLELIHFDLCGPMSVKARGGYEYFVSFIDDYSR

Query:  YGYIYLMHRKSETLEKFKEYKTKVENLLGKTIKTLRSDQGGEYMDTKF
        YGY+YLM  KSE LEKFKEYK +VEN L KTIKT RSD+GGEYMD KF
Subjt:  YGYIYLMHRKSETLEKFKEYKTKVENLLGKTIKTLRSDQGGEYMDTKF

A0A5D3CPJ6 Gag/pol protein1.0e-11755.36Show/hide
Query:  MNGASIDDSSQVSFILETLSKSFLQFSSNVVMNKIIYTLATLLNELQNFQSLMRIRTPEAEANVA--YRSYHRGSTSGTKFVASSHPKGKNKKMKKG---
        MNGA ID++SQVSFILE+L +SFLQF SN VMNKI YTL TLLNELQ F+SLM+I+  + EANVA   R +HRGSTSGTK + SS    K KK K G   
Subjt:  MNGASIDDSSQVSFILETLSKSFLQFSSNVVMNKIIYTLATLLNELQNFQSLMRIRTPEAEANVA--YRSYHRGSTSGTKFVASSHPKGKNKKMKKG---

Query:  KADRVAAQKGKKVKELAEKGKCFICNRNDHWKRNYPKFLVERK--NQDKCDLLVTETCLVESNDSACILDSGTTNHVCFSFQGISSWQQLREGGVTLQVG
        KA+  AA+  KK K  A KG CF CN+  HWKRN PK+L E+K   Q K DLLV ETCLVE++DSA I+DSG TNHVC SFQGISSW+QL  G +T++VG
Subjt:  KADRVAAQKGKKVKELAEKGKCFICNRNDHWKRNYPKFLVERK--NQDKCDLLVTETCLVESNDSACILDSGTTNHVCFSFQGISSWQQLREGGVTLQVG

Query:  SGEIVSAAAIGKVKLFFGGKYLLLDNLYIVPGFTRNLVSISFLIEQC-----------------------------------------------------
        +G +VSA A+G ++L     +LLL+N+Y+VP   RNL+S+  L+EQ                                                      
Subjt:  SGEIVSAAAIGKVKLFFGGKYLLLDNLYIVPGFTRNLVSISFLIEQC-----------------------------------------------------

Query:  ------------------HINLNRIDRLVKSGLLSQLEENSLPVCELCLEGKMTKRLFSRKRYRVKELLELIHFDLCGPMSVKARGGYEYFVSFIDDYSR
                          HINLNRI+RLVK+GLLS+LEENSLPVCE CLEGKMTKR F+ K +R KE LEL+H DLCGPM+VKARGG+EYF++F DDYSR
Subjt:  ------------------HINLNRIDRLVKSGLLSQLEENSLPVCELCLEGKMTKRLFSRKRYRVKELLELIHFDLCGPMSVKARGGYEYFVSFIDDYSR

Query:  YGYIYLMHRKSETLEKFKEYKTKVENLLGKTIKTLRSDQGGEYMDTKF
        YGY+YLM  KSE LEKFKEYK +VEN L KTIKT RSD+GGEYMD KF
Subjt:  YGYIYLMHRKSETLEKFKEYKTKVENLLGKTIKTLRSDQGGEYMDTKF

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.1e-0929.23Show/hide
Query:  INLNRIDRLVKSGLLSQLEENSLPVCELCLEGKMTKRLFS--RKRYRVKELLELIHFDLCGPMSVKARGGYEYFVSFIDDYSRYGYIYLMHRKSETLEKF
        + + R +      LL+ L E S  +CE CL GK  +  F   + +  +K  L ++H D+CGP++        YFV F+D ++ Y   YL+  KS+    F
Subjt:  INLNRIDRLVKSGLLSQLEENSLPVCELCLEGKMTKRLFS--RKRYRVKELLELIHFDLCGPMSVKARGGYEYFVSFIDDYSRYGYIYLMHRKSETLEKF

Query:  KEYKTKVENLLGKTIKTLRSDQGGEYMDTK
        +++  K E      +  L  D G EY+  +
Subjt:  KEYKTKVENLLGKTIKTLRSDQGGEYMDTK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-1935.38Show/hide
Query:  HINLNRIDRLVKSGLLSQLEENSLPVCELCLEGKMTKRLFSRKRYRVKELLELIHFDLCGPMSVKARGGYEYFVSFIDDYSRYGYIYLMHRKSETLEKFK
        H++   +  L K  L+S  +  ++  C+ CL GK  +  F     R   +L+L++ D+CGPM +++ GG +YFV+FIDD SR  ++Y++  K +  + F+
Subjt:  HINLNRIDRLVKSGLLSQLEENSLPVCELCLEGKMTKRLFSRKRYRVKELLELIHFDLCGPMSVKARGGYEYFVSFIDDYSRYGYIYLMHRKSETLEKFK

Query:  EYKTKVENLLGKTIKTLRSDQGGEYMDTKF
        ++   VE   G+ +K LRSD GGEY   +F
Subjt:  EYKTKVENLLGKTIKTLRSDQGGEYMDTKF

Q12491 Transposon Ty2-B Gag-Pol polyprotein3.1e-0726.81Show/hide
Query:  HINLNRIDRLVKSGLLSQLEENSLP-------VCELCLEGKMTKRLF---SRKRYRVK-ELLELIHFDLCGPMSVKARGGYEYFVSFIDDYSRYGYIYLM
        H N   I + +K   ++ L+E+ +         C  CL GK TK      SR +Y+   E  + +H D+ GP+    +    YF+SF D+ +R+ ++Y +
Subjt:  HINLNRIDRLVKSGLLSQLEENSLP-------VCELCLEGKMTKRLF---SRKRYRVK-ELLELIHFDLCGPMSVKARGGYEYFVSFIDDYSRYGYIYLM

Query:  H--RKSETLEKFKEYKTKVENLLGKTIKTLRSDQGGEY
        H  R+   L  F      ++N     +  ++ D+G EY
Subjt:  H--RKSETLEKFKEYKTKVENLLGKTIKTLRSDQGGEY

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.7e-1023.42Show/hide
Query:  ILDSGTTNHVCFSFQGISSWQQLREGGVTLQVGSGEIVSAAAIGKVKLFFGGKYLLLDNLYIVPGFTRNLVSI----------------SFLIEQCHINL
        +LDSG T+H+   F  +S  Q    GG  + V  G  +  +  G   L    + L L N+  VP   +NL+S+                SF ++  +  +
Subjt:  ILDSGTTNHVCFSFQGISSWQQLREGGVTLQVGSGEIVSAAAIGKVKLFFGGKYLLLDNLYIVPGFTRNLVSI----------------SFLIEQCHINL

Query:  NRIDRLVKSGLLSQLEENSLPV-----------------------------------------------CELCLEGKMTKRLFSRKRYRVKELLELIHFD
          +    K  L      +S PV                                               C  CL  K  K  FS+        LE I+ D
Subjt:  NRIDRLVKSGLLSQLEENSLPV-----------------------------------------------CELCLEGKMTKRLFSRKRYRVKELLELIHFD

Query:  LCGPMSVKARGGYEYFVSFIDDYSRYGYIYLMHRKSETLEKFKEYKTKVENLLGKTIKTLRSDQGGEYM
        +     + +   Y Y+V F+D ++RY ++Y + +KS+  E F  +K  +EN     I T  SD GGE++
Subjt:  LCGPMSVKARGGYEYFVSFIDDYSRYGYIYLMHRKSETLEKFKEYKTKVENLLGKTIKTLRSDQGGEYM

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.1e-0720.96Show/hide
Query:  GASIDDSSQVSFILETLSKSFLQFSSNVVMNKIIYTLATLLNELQNFQS-LMRIRTPEAEANVAYRSYHRGSTSGTKFVASSHPKGKNKKMKKGKADRVA
        G  +D   QV  +LE L   +      +       +L  +   L N +S L+ + + E     A    HR + +          +  N    +  + + +
Subjt:  GASIDDSSQVSFILETLSKSFLQFSSNVVMNKIIYTLATLLNELQNFQS-LMRIRTPEAEANVAYRSYHRGSTSGTKFVASSHPKGKNKKMKKGKADRVA

Query:  AQKGKKVKELAEK--GKCFICNRNDHWKRNYPKF--LVERKNQDKCDLLVT-----ETCLVES--NDSACILDSGTTNHVCFSFQGISSWQQLREGGVTL
        +   +      +   G+C IC+   H  +  P+        NQ +     T         V S  N +  +LDSG T+H+   F  + S+ Q   GG  +
Subjt:  AQKGKKVKELAEK--GKCFICNRNDHWKRNYPKF--LVERKNQDKCDLLVT-----ETCLVES--NDSACILDSGTTNHVCFSFQGISSWQQLREGGVTL

Query:  QVGSGEIVSAAAIGKVKLFFGGKYLLLDNLYIVPGFTRNLVSI----------------SFLIEQCHINLNRIDRLVKSGLL------------------
         +  G  +     G   L    + L L+ +  VP   +NL+S+                SF ++  +  +  +    K  L                   
Subjt:  QVGSGEIVSAAAIGKVKLFFGGKYLLLDNLYIVPGFTRNLVSI----------------SFLIEQCHINLNRIDRLVKSGLL------------------

Query:  --------------------SQLEENSLPV---------CELCLEGKMTKRLFSRKRYRVKELLELIHFDLCGPMSVKARGGYEYFVSFIDDYSRYGYIY
                            S +  +SLPV         C  C   K  K  FS       + LE I+ D+     + +   Y Y+V F+D ++RY ++Y
Subjt:  --------------------SQLEENSLPV---------CELCLEGKMTKRLFSRKRYRVKELLELIHFDLCGPMSVKARGGYEYFVSFIDDYSRYGYIY

Query:  LMHRKSETLEKFKEYKTKVENLLGKTIKTLRSDQGGEYM
         + +KS+  + F  +K+ VEN     I TL SD GGE++
Subjt:  LMHRKSETLEKFKEYKTKVENLLGKTIKTLRSDQGGEYM

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACGGAGCTTCGATCGATGATTCGAGCCAGGTCAGCTTCATCTTGGAGACTCTTTCAAAGAGTTTTCTTCAGTTTAGTAGCAATGTTGTTATGAACAAAATTATCTA
TACACTAGCCACCCTTCTGAACGAGCTACAAAATTTCCAGTCCTTGATGAGGATCAGGACACCAGAAGCTGAGGCAAATGTTGCCTACAGGTCCTATCACAGGGGTTCGA
CCTCTGGGACAAAATTTGTGGCTTCTTCTCACCCGAAAGGGAAGAATAAGAAAATGAAGAAGGGTAAAGCTGACCGAGTTGCCGCCCAAAAGGGCAAAAAGGTCAAGGAA
CTTGCAGAAAAAGGAAAGTGTTTCATCTGCAATCGGAACGACCACTGGAAGCGAAACTATCCCAAGTTCCTTGTCGAGAGGAAGAATCAAGATAAATGTGATTTACTAGT
AACTGAAACTTGTTTAGTGGAGAGTAATGATTCTGCCTGTATATTGGATTCGGGCACCACTAACCACGTTTGTTTTTCTTTTCAGGGAATTAGTTCTTGGCAGCAGCTGC
GAGAGGGTGGGGTGACTCTACAGGTTGGATCTGGGGAGATAGTCTCTGCTGCAGCGATCGGCAAAGTGAAGCTCTTTTTCGGCGGGAAATACTTATTATTAGATAATTTG
TATATAGTCCCAGGGTTTACTAGAAACCTTGTTTCTATTTCCTTCCTTATTGAACAATGCCACATTAATCTCAATAGGATTGACAGACTAGTGAAGAGTGGACTTCTAAG
CCAGTTGGAAGAAAACTCTTTACCGGTATGTGAGTTATGCCTCGAAGGCAAAATGACCAAACGTCTTTTTAGTAGAAAAAGATATAGAGTCAAAGAGCTCCTTGAGCTTA
TACATTTTGACCTCTGTGGTCCGATGAGTGTTAAAGCACGAGGTGGTTACGAATACTTTGTATCTTTTATCGATGACTATTCAAGGTATGGGTATATTTACCTAATGCAT
AGGAAGTCTGAAACTCTTGAAAAGTTCAAGGAGTACAAGACTAAGGTTGAGAACCTGTTAGGTAAAACGATTAAAACACTTCGATCGGATCAAGGTGGAGAGTATATGGA
CACTAAATTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAACGGAGCTTCGATCGATGATTCGAGCCAGGTCAGCTTCATCTTGGAGACTCTTTCAAAGAGTTTTCTTCAGTTTAGTAGCAATGTTGTTATGAACAAAATTATCTA
TACACTAGCCACCCTTCTGAACGAGCTACAAAATTTCCAGTCCTTGATGAGGATCAGGACACCAGAAGCTGAGGCAAATGTTGCCTACAGGTCCTATCACAGGGGTTCGA
CCTCTGGGACAAAATTTGTGGCTTCTTCTCACCCGAAAGGGAAGAATAAGAAAATGAAGAAGGGTAAAGCTGACCGAGTTGCCGCCCAAAAGGGCAAAAAGGTCAAGGAA
CTTGCAGAAAAAGGAAAGTGTTTCATCTGCAATCGGAACGACCACTGGAAGCGAAACTATCCCAAGTTCCTTGTCGAGAGGAAGAATCAAGATAAATGTGATTTACTAGT
AACTGAAACTTGTTTAGTGGAGAGTAATGATTCTGCCTGTATATTGGATTCGGGCACCACTAACCACGTTTGTTTTTCTTTTCAGGGAATTAGTTCTTGGCAGCAGCTGC
GAGAGGGTGGGGTGACTCTACAGGTTGGATCTGGGGAGATAGTCTCTGCTGCAGCGATCGGCAAAGTGAAGCTCTTTTTCGGCGGGAAATACTTATTATTAGATAATTTG
TATATAGTCCCAGGGTTTACTAGAAACCTTGTTTCTATTTCCTTCCTTATTGAACAATGCCACATTAATCTCAATAGGATTGACAGACTAGTGAAGAGTGGACTTCTAAG
CCAGTTGGAAGAAAACTCTTTACCGGTATGTGAGTTATGCCTCGAAGGCAAAATGACCAAACGTCTTTTTAGTAGAAAAAGATATAGAGTCAAAGAGCTCCTTGAGCTTA
TACATTTTGACCTCTGTGGTCCGATGAGTGTTAAAGCACGAGGTGGTTACGAATACTTTGTATCTTTTATCGATGACTATTCAAGGTATGGGTATATTTACCTAATGCAT
AGGAAGTCTGAAACTCTTGAAAAGTTCAAGGAGTACAAGACTAAGGTTGAGAACCTGTTAGGTAAAACGATTAAAACACTTCGATCGGATCAAGGTGGAGAGTATATGGA
CACTAAATTCTAG
Protein sequenceShow/hide protein sequence
MNGASIDDSSQVSFILETLSKSFLQFSSNVVMNKIIYTLATLLNELQNFQSLMRIRTPEAEANVAYRSYHRGSTSGTKFVASSHPKGKNKKMKKGKADRVAAQKGKKVKE
LAEKGKCFICNRNDHWKRNYPKFLVERKNQDKCDLLVTETCLVESNDSACILDSGTTNHVCFSFQGISSWQQLREGGVTLQVGSGEIVSAAAIGKVKLFFGGKYLLLDNL
YIVPGFTRNLVSISFLIEQCHINLNRIDRLVKSGLLSQLEENSLPVCELCLEGKMTKRLFSRKRYRVKELLELIHFDLCGPMSVKARGGYEYFVSFIDDYSRYGYIYLMH
RKSETLEKFKEYKTKVENLLGKTIKTLRSDQGGEYMDTKF