CuGenDBv2

Lag0028415 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0028415
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Gag/pol protein
Genome location: chr8:21111017..21117316
RNA-Seq Expression: Lag0028415
Synteny: Lag0028415
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001878 - Zinc finger, CCHC-type
  IPR012337 - Ribonuclease H-like superfamily
  IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
  IPR036397 - Ribonuclease H superfamily
  IPR036875 - Zinc finger, CCHC-type superfamily
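
As a small illustration of how the genome location field can be read, the sketch below parses the string chr8:21111017..21117316 and reports the span length. It assumes the coordinates are 1-based and inclusive, which is an assumption about this database's convention rather than something the record states. The function name `parse_location` is hypothetical.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr8:21111017..21117316")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)  # chr8 21111017 21117316 6300
```

Under that assumption the locus spans 6,300 bp on chr8.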


Homology
GenBank top hits (e value, % identity, alignment)
KAA0054839.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 4.9e-121, % identity: 56.76)
Query:  QHQDTTKKPLLVFSLRIKRLKFRSNAMMNKITFNLTSLLNELQLYQSLLKNKGQMEGEANVVHSKRKFEKGSSSGTKSVATSSKKTQKKNGNKGKALGSA
        Q  +  +   ++ SL    + F++NA +NKI FNLT LLNELQ +Q+L K KG+ E EANV  +KRKF++GSSS +K   + S +  +K G KGK     
Subjt:  QHQDTTKKPLLVFSLRIKRLKFRSNAMMNKITFNLTSLLNELQLYQSLLKNKGQMEGEANVVHSKRKFEKGSSSGTKSVATSSKKTQKKNGNKGKALGSA

Query:  AKSKGKTKVVADKGKCFHCNVDGHLKRNCPKYLAEKKEEKEGASNHLSSSFQENNSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGHTNIDR--------
           + K K  A++GKC+HC  +GH  RNCPKYLA KK EK+         F E +S++ L++GE+TL+VGTGE++SAKAVG  KL     DR        
Subjt:  AKSKGKTKVVADKGKCFHCNVDGHLKRNCPKYLAEKKEEKEGASNHLSSSFQENNSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGHTNIDR--------

Query:  IGRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGRKSEALEKFKEFKAEC
        IGRLVK+GLLS+LED+SLP C+SCLEGKMTKR F GKG RAK PL L+HSDLCGPMNVKA GGYEYFI+FIDDYSRYG++YL+  K ++ EKFKE+KAE 
Subjt:  IGRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGRKSEALEKFKEFKAEC

Query:  SAPSFAAAQ--NLLPQVAQQQILHKMIS--KASIL--------PNGLKHAMNDEDNDEWVKAMNLEMESMHSNSVWELADLPNGVKPIGCKWIYKRKRDA
           S  A    N +P     +  +++    KA I+        P   + AMND D D+W+K+M+LEMESM+SNSVW L + PN VKPIGCKWIYKRKRD 
Subjt:  SAPSFAAAQ--NLLPQVAQQQILHKMIS--KASIL--------PNGLKHAMNDEDNDEWVKAMNLEMESMHSNSVWELADLPNGVKPIGCKWIYKRKRDA

Query:  AGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLSIATYYDYEI
        AGK QTFKARLVA GYTQKEG+D EETFSPVAM+KSIRILLSIAT+YDYEI
Subjt:  AGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLSIATYYDYEI

KAA0058278.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 5.6e-117, % identity: 48.49)
Query:  QMEGEANVVHSKRKFEKGSSSGTKSVATSSKKTQKKNGNKGKALGSAAKSKGKTKVVADKGKCFHCNVDGHLKRNCPKYLAEKKEEKEGASNHLSSSFQE
        Q++ EANV HSKR+F    S         S+K QK+   KGK    A + KGK K VA K KCFHCNVD H K NCPKYL  KK+EKEGA+NH+ SS QE
Subjt:  QMEGEANVVHSKRKFEKGSSSGTKSVATSSKKTQKKNGNKGKALGSAAKSKGKTKVVADKGKCFHCNVDGHLKRNCPKYLAEKKEEKEGASNHLSSSFQE

Query:  NNSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGHTNIDRIGRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKAR
         +SF+QL D EMTL+VGTG+VISA+AVG AKLGH N+DRIGRLVKNGLL+KL+D SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKAR
Subjt:  NNSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGHTNIDRIGRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKAR

Query:  GGYEYFISFIDDYSRYGYLYLMGRKSEALEKFKEFKAE-------------------------------CSAP--------SFAAAQNLLPQVAQQQIL-
        GG+EYFISFIDDYSRYGYLYLM  KSEALEKFKE+K E                               C A         +   A ++L  V  + +  
Subjt:  GGYEYFISFIDDYSRYGYLYLMGRKSEALEKFKEFKAE-------------------------------CSAP--------SFAAAQNLLPQVAQQQIL-

Query:  ------------------------------------------------------------------------------HKMISKASIL------------
                                                                                      HK  SK  +             
Subjt:  ------------------------------------------------------------------------------HKMISKASIL------------

Query:  -------------------------------------------------------PNGLKHAMNDEDNDEWVKAMNLEMESMHSNSVWELADLPNGVKPI
                                                               P   K AMND D D+WVKAM+LEMESM+ NSVWEL DL  GVKPI
Subjt:  -------------------------------------------------------PNGLKHAMNDEDNDEWVKAMNLEMESMHSNSVWELADLPNGVKPI

Query:  GCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLSIATYYDYEI
        GCKWIYKRKRD+AGKVQTFKARLVAKGYTQ+EGVD EETFS VAMLKSIRILLSIAT+YDYEI
Subjt:  GCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLSIATYYDYEI

KAA0061169.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 1.2e-116, % identity: 48.16)
Query:  QMEGEANVVHSKRKFEKGSSSGTKSVATSSKKTQKKNGNKGKALGSAAKSKGKTKVVADKGKCFHCNVDGHLKRNCPKYLAEKKEEKEGASNHLSSSFQE
        Q++ EANV HSKR+F    S         S+K QK+   KGK    A + KGK K VA K KCFHCNVD H K NCPKYL  KK+ KEGA+NH+ SS +E
Subjt:  QMEGEANVVHSKRKFEKGSSSGTKSVATSSKKTQKKNGNKGKALGSAAKSKGKTKVVADKGKCFHCNVDGHLKRNCPKYLAEKKEEKEGASNHLSSSFQE

Query:  NNSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGHTNIDRIGRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKAR
         +SF+QL D EMTL+VGTG+VISA+AVG AKLGH N+DRIGRLVKNGLL+KL+D SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKAR
Subjt:  NNSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGHTNIDRIGRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKAR

Query:  GGYEYFISFIDDYSRYGYLYLMGRKSEALEKFKEFKAE-----------CSAP------------------------SFA------------AAQNLLPQ
        GG+EYFISFIDDYSRYGYLYLM  KSEALEKFKE+K E            SAP                        S+A             A ++L  
Subjt:  GGYEYFISFIDDYSRYGYLYLMGRKSEALEKFKEFKAE-----------CSAP------------------------SFA------------AAQNLLPQ

Query:  VAQQQIL-------------------------------------------------------------------------------HKMISKASIL----
        V  + +                                                                                HK  SK  +     
Subjt:  VAQQQIL-------------------------------------------------------------------------------HKMISKASIL----

Query:  ---------------------------------------------------------------PNGLKHAMNDEDNDEWVKAMNLEMESMHSNSVWELAD
                                                                       P   K AMND D D+WVKAM+LEMESM+ NSVWEL D
Subjt:  ---------------------------------------------------------------PNGLKHAMNDEDNDEWVKAMNLEMESMHSNSVWELAD

Query:  LPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLSIATYYDYEI
        LP GVKPIGCKWIYKRKRD+A KVQTFKARLVAKGYTQ+EGVD EETFSPVAMLKSIRILLSIAT+YDYEI
Subjt:  LPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLSIATYYDYEI

TYK04171.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 1.7e-118, % identity: 48.85)
Query:  QMEGEANVVHSKRKFEKGSSSGTKSVATSSKKTQKKNGNKGKALGSAAKSKGKTKVVADKGKCFHCNVDGHLKRNCPKYLAEKKEEKEGASNHLSSSFQE
        Q++ EANV HSKR+F    S         S+K QK+   KGK    A + KGK K VA K KCFHCNVD H K NCPKYL  KK+EKEGA+NH+ SS QE
Subjt:  QMEGEANVVHSKRKFEKGSSSGTKSVATSSKKTQKKNGNKGKALGSAAKSKGKTKVVADKGKCFHCNVDGHLKRNCPKYLAEKKEEKEGASNHLSSSFQE

Query:  NNSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGHTNIDRIGRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKAR
         +SF+QL D EMTL+VGTG+VISA+AVG AKLGH N+DRIGRLVKNGLL+KL+D SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKAR
Subjt:  NNSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGHTNIDRIGRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKAR

Query:  GGYEYFISFIDDYSRYGYLYLMGRKSEALEKFKEFKAE-------------------------------------CSAP---------------------
        GG+EYFISFIDDYSRYGYLYLM  KSEALEKFKE+K E                                      SAP                     
Subjt:  GGYEYFISFIDDYSRYGYLYLMGRKSEALEKFKEFKAE-------------------------------------CSAP---------------------

Query:  ---SFA------------AAQNLLPQVAQQQIL-------------------------------------------------------------------
           S+A             A ++L  V  + +                                                                    
Subjt:  ---SFA------------AAQNLLPQVAQQQIL-------------------------------------------------------------------

Query:  ------------HKMISKASI-----------------------LPNGLKH--------------AMNDEDNDEWVKAMNLEMESMHSNSVWELADLPNG
                    HK+  K  +                         +G  H              AMND D D+WVKAM+LEMESM+ NSVWEL DLP G
Subjt:  ------------HKMISKASI-----------------------LPNGLKH--------------AMNDEDNDEWVKAMNLEMESMHSNSVWELADLPNG

Query:  VKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLSIATYYDYEI
        VKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYTQ+EGVD EETFSPVAMLKSIRILLSIAT+YDYEI
Subjt:  VKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLSIATYYDYEI

TYK28885.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 1.9e-117, % identity: 48.23)
Query:  QMEGEANVVHSKRKFEKGSSSGTKSVATSSKKTQKKNGNKGKALGSAAKSKGKTKVVADKGKCFHCNVDGHLKRNCPKYLAEKKEEKEGASNHLSSSFQE
        Q++ EANV HS R+F   SS         S+K QK+   KGK    A + KGKTKVV  KGKCFHC+VD H K NCPKYL  KK+EKEGA+NH+ SS QE
Subjt:  QMEGEANVVHSKRKFEKGSSSGTKSVATSSKKTQKKNGNKGKALGSAAKSKGKTKVVADKGKCFHCNVDGHLKRNCPKYLAEKKEEKEGASNHLSSSFQE

Query:  NNSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGHTNIDRIGRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKAR
         +SF+QL D EMTL+VGTG+VISA+AVG AKLGH N+++IGRL+KNGLL+KLEDDSLP CESC EGKMTKRPFT KGYRAKEPLELIHSDLCGPMNVKAR
Subjt:  NNSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGHTNIDRIGRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKAR

Query:  GGYEYFISFIDDYSRYGYLYLMGRKSEALEKFKEFKAE-------------------------------------CSAPSFAAAQNLLPQVAQQQILHKM
        GG+EYFISFIDDYSRYGYLYLM  KSEALEKFKE+KAE                                      SAP     QN + +   + +L K+
Subjt:  GGYEYFISFIDDYSRYGYLYLMGRKSEALEKFKEFKAE-------------------------------------CSAPSFAAAQNLLPQVAQQQILHKM

Query:  ISKASIL---------------------------------------------------------------------------------------------
         S  S                                                                                               
Subjt:  ISKASIL---------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------PNGLKHAMNDEDNDEWVKAMNLEMESMHSNSVWELADLPNGVKP
                                                                P   K AMND D D+WVKA++LEMESM+ NSVWEL DLP GVKP
Subjt:  --------------------------------------------------------PNGLKHAMNDEDNDEWVKAMNLEMESMHSNSVWELADLPNGVKP

Query:  IGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLSIATYYDYEI
        IGCKWIYKRKRD+AGKVQTFKARLVAKGYTQ+EGVD EETFSPVAMLKSIRILLSIAT+YDYEI
Subjt:  IGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLSIATYYDYEI

TrEMBL top hits (e value, % identity, alignment)
A0A5A7USZ2 Gag/pol protein (e value: 2.7e-117, % identity: 48.49)
Query:  QMEGEANVVHSKRKFEKGSSSGTKSVATSSKKTQKKNGNKGKALGSAAKSKGKTKVVADKGKCFHCNVDGHLKRNCPKYLAEKKEEKEGASNHLSSSFQE
        Q++ EANV HSKR+F    S         S+K QK+   KGK    A + KGK K VA K KCFHCNVD H K NCPKYL  KK+EKEGA+NH+ SS QE
Subjt:  QMEGEANVVHSKRKFEKGSSSGTKSVATSSKKTQKKNGNKGKALGSAAKSKGKTKVVADKGKCFHCNVDGHLKRNCPKYLAEKKEEKEGASNHLSSSFQE

Query:  NNSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGHTNIDRIGRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKAR
         +SF+QL D EMTL+VGTG+VISA+AVG AKLGH N+DRIGRLVKNGLL+KL+D SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKAR
Subjt:  NNSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGHTNIDRIGRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKAR

Query:  GGYEYFISFIDDYSRYGYLYLMGRKSEALEKFKEFKAE-------------------------------CSAP--------SFAAAQNLLPQVAQQQIL-
        GG+EYFISFIDDYSRYGYLYLM  KSEALEKFKE+K E                               C A         +   A ++L  V  + +  
Subjt:  GGYEYFISFIDDYSRYGYLYLMGRKSEALEKFKEFKAE-------------------------------CSAP--------SFAAAQNLLPQVAQQQIL-

Query:  ------------------------------------------------------------------------------HKMISKASIL------------
                                                                                      HK  SK  +             
Subjt:  ------------------------------------------------------------------------------HKMISKASIL------------

Query:  -------------------------------------------------------PNGLKHAMNDEDNDEWVKAMNLEMESMHSNSVWELADLPNGVKPI
                                                               P   K AMND D D+WVKAM+LEMESM+ NSVWEL DL  GVKPI
Subjt:  -------------------------------------------------------PNGLKHAMNDEDNDEWVKAMNLEMESMHSNSVWELADLPNGVKPI

Query:  GCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLSIATYYDYEI
        GCKWIYKRKRD+AGKVQTFKARLVAKGYTQ+EGVD EETFS VAMLKSIRILLSIAT+YDYEI
Subjt:  GCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLSIATYYDYEI

A0A5A7V634 Gag/pol protein (e value: 6.0e-117, % identity: 48.16)
Query:  QMEGEANVVHSKRKFEKGSSSGTKSVATSSKKTQKKNGNKGKALGSAAKSKGKTKVVADKGKCFHCNVDGHLKRNCPKYLAEKKEEKEGASNHLSSSFQE
        Q++ EANV HSKR+F    S         S+K QK+   KGK    A + KGK K VA K KCFHCNVD H K NCPKYL  KK+ KEGA+NH+ SS +E
Subjt:  QMEGEANVVHSKRKFEKGSSSGTKSVATSSKKTQKKNGNKGKALGSAAKSKGKTKVVADKGKCFHCNVDGHLKRNCPKYLAEKKEEKEGASNHLSSSFQE

Query:  NNSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGHTNIDRIGRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKAR
         +SF+QL D EMTL+VGTG+VISA+AVG AKLGH N+DRIGRLVKNGLL+KL+D SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKAR
Subjt:  NNSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGHTNIDRIGRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKAR

Query:  GGYEYFISFIDDYSRYGYLYLMGRKSEALEKFKEFKAE-----------CSAP------------------------SFA------------AAQNLLPQ
        GG+EYFISFIDDYSRYGYLYLM  KSEALEKFKE+K E            SAP                        S+A             A ++L  
Subjt:  GGYEYFISFIDDYSRYGYLYLMGRKSEALEKFKEFKAE-----------CSAP------------------------SFA------------AAQNLLPQ

Query:  VAQQQIL-------------------------------------------------------------------------------HKMISKASIL----
        V  + +                                                                                HK  SK  +     
Subjt:  VAQQQIL-------------------------------------------------------------------------------HKMISKASIL----

Query:  ---------------------------------------------------------------PNGLKHAMNDEDNDEWVKAMNLEMESMHSNSVWELAD
                                                                       P   K AMND D D+WVKAM+LEMESM+ NSVWEL D
Subjt:  ---------------------------------------------------------------PNGLKHAMNDEDNDEWVKAMNLEMESMHSNSVWELAD

Query:  LPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLSIATYYDYEI
        LP GVKPIGCKWIYKRKRD+A KVQTFKARLVAKGYTQ+EGVD EETFSPVAMLKSIRILLSIAT+YDYEI
Subjt:  LPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLSIATYYDYEI

A0A5D3BWT8 Gag/pol protein (e value: 8.4e-119, % identity: 48.85)
Query:  QMEGEANVVHSKRKFEKGSSSGTKSVATSSKKTQKKNGNKGKALGSAAKSKGKTKVVADKGKCFHCNVDGHLKRNCPKYLAEKKEEKEGASNHLSSSFQE
        Q++ EANV HSKR+F    S         S+K QK+   KGK    A + KGK K VA K KCFHCNVD H K NCPKYL  KK+EKEGA+NH+ SS QE
Subjt:  QMEGEANVVHSKRKFEKGSSSGTKSVATSSKKTQKKNGNKGKALGSAAKSKGKTKVVADKGKCFHCNVDGHLKRNCPKYLAEKKEEKEGASNHLSSSFQE

Query:  NNSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGHTNIDRIGRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKAR
         +SF+QL D EMTL+VGTG+VISA+AVG AKLGH N+DRIGRLVKNGLL+KL+D SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKAR
Subjt:  NNSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGHTNIDRIGRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKAR

Query:  GGYEYFISFIDDYSRYGYLYLMGRKSEALEKFKEFKAE-------------------------------------CSAP---------------------
        GG+EYFISFIDDYSRYGYLYLM  KSEALEKFKE+K E                                      SAP                     
Subjt:  GGYEYFISFIDDYSRYGYLYLMGRKSEALEKFKEFKAE-------------------------------------CSAP---------------------

Query:  ---SFA------------AAQNLLPQVAQQQIL-------------------------------------------------------------------
           S+A             A ++L  V  + +                                                                    
Subjt:  ---SFA------------AAQNLLPQVAQQQIL-------------------------------------------------------------------

Query:  ------------HKMISKASI-----------------------LPNGLKH--------------AMNDEDNDEWVKAMNLEMESMHSNSVWELADLPNG
                    HK+  K  +                         +G  H              AMND D D+WVKAM+LEMESM+ NSVWEL DLP G
Subjt:  ------------HKMISKASI-----------------------LPNGLKH--------------AMNDEDNDEWVKAMNLEMESMHSNSVWELADLPNG

Query:  VKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLSIATYYDYEI
        VKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYTQ+EGVD EETFSPVAMLKSIRILLSIAT+YDYEI
Subjt:  VKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLSIATYYDYEI

A0A5D3DWF1 Gag/pol protein (e value: 2.4e-121, % identity: 56.76)
Query:  QHQDTTKKPLLVFSLRIKRLKFRSNAMMNKITFNLTSLLNELQLYQSLLKNKGQMEGEANVVHSKRKFEKGSSSGTKSVATSSKKTQKKNGNKGKALGSA
        Q  +  +   ++ SL    + F++NA +NKI FNLT LLNELQ +Q+L K KG+ E EANV  +KRKF++GSSS +K   + S +  +K G KGK     
Subjt:  QHQDTTKKPLLVFSLRIKRLKFRSNAMMNKITFNLTSLLNELQLYQSLLKNKGQMEGEANVVHSKRKFEKGSSSGTKSVATSSKKTQKKNGNKGKALGSA

Query:  AKSKGKTKVVADKGKCFHCNVDGHLKRNCPKYLAEKKEEKEGASNHLSSSFQENNSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGHTNIDR--------
           + K K  A++GKC+HC  +GH  RNCPKYLA KK EK+         F E +S++ L++GE+TL+VGTGE++SAKAVG  KL     DR        
Subjt:  AKSKGKTKVVADKGKCFHCNVDGHLKRNCPKYLAEKKEEKEGASNHLSSSFQENNSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGHTNIDR--------

Query:  IGRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGRKSEALEKFKEFKAEC
        IGRLVK+GLLS+LED+SLP C+SCLEGKMTKR F GKG RAK PL L+HSDLCGPMNVKA GGYEYFI+FIDDYSRYG++YL+  K ++ EKFKE+KAE 
Subjt:  IGRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGRKSEALEKFKEFKAEC

Query:  SAPSFAAAQ--NLLPQVAQQQILHKMIS--KASIL--------PNGLKHAMNDEDNDEWVKAMNLEMESMHSNSVWELADLPNGVKPIGCKWIYKRKRDA
           S  A    N +P     +  +++    KA I+        P   + AMND D D+W+K+M+LEMESM+SNSVW L + PN VKPIGCKWIYKRKRD 
Subjt:  SAPSFAAAQ--NLLPQVAQQQILHKMIS--KASIL--------PNGLKHAMNDEDNDEWVKAMNLEMESMHSNSVWELADLPNGVKPIGCKWIYKRKRDA

Query:  AGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLSIATYYDYEI
        AGK QTFKARLVA GYTQKEG+D EETFSPVAM+KSIRILLSIAT+YDYEI
Subjt:  AGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLSIATYYDYEI

A0A5D3DZX8 Gag/pol protein (e value: 9.3e-118, % identity: 48.23)
Query:  QMEGEANVVHSKRKFEKGSSSGTKSVATSSKKTQKKNGNKGKALGSAAKSKGKTKVVADKGKCFHCNVDGHLKRNCPKYLAEKKEEKEGASNHLSSSFQE
        Q++ EANV HS R+F   SS         S+K QK+   KGK    A + KGKTKVV  KGKCFHC+VD H K NCPKYL  KK+EKEGA+NH+ SS QE
Subjt:  QMEGEANVVHSKRKFEKGSSSGTKSVATSSKKTQKKNGNKGKALGSAAKSKGKTKVVADKGKCFHCNVDGHLKRNCPKYLAEKKEEKEGASNHLSSSFQE

Query:  NNSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGHTNIDRIGRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKAR
         +SF+QL D EMTL+VGTG+VISA+AVG AKLGH N+++IGRL+KNGLL+KLEDDSLP CESC EGKMTKRPFT KGYRAKEPLELIHSDLCGPMNVKAR
Subjt:  NNSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGHTNIDRIGRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKAR

Query:  GGYEYFISFIDDYSRYGYLYLMGRKSEALEKFKEFKAE-------------------------------------CSAPSFAAAQNLLPQVAQQQILHKM
        GG+EYFISFIDDYSRYGYLYLM  KSEALEKFKE+KAE                                      SAP     QN + +   + +L K+
Subjt:  GGYEYFISFIDDYSRYGYLYLMGRKSEALEKFKEFKAE-------------------------------------CSAPSFAAAQNLLPQVAQQQILHKM

Query:  ISKASIL---------------------------------------------------------------------------------------------
         S  S                                                                                               
Subjt:  ISKASIL---------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------PNGLKHAMNDEDNDEWVKAMNLEMESMHSNSVWELADLPNGVKP
                                                                P   K AMND D D+WVKA++LEMESM+ NSVWEL DLP GVKP
Subjt:  --------------------------------------------------------PNGLKHAMNDEDNDEWVKAMNLEMESMHSNSVWELADLPNGVKP

Query:  IGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLSIATYYDYEI
        IGCKWIYKRKRD+AGKVQTFKARLVAKGYTQ+EGVD EETFSPVAMLKSIRILLSIAT+YDYEI
Subjt:  IGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLSIATYYDYEI

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 1.4e-14, % identity: 30.6)
Query:  PQVA---QQQILHKMISKASIL----PNGLKHAMNDEDNDEWVKAMNLEMESMHSNSVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYT
        PQ++   +   L+K++  A  +    PN        +D   W +A+N E+ +   N+ W +   P     +  +W++  K +  G    +KARLVA+G+T
Subjt:  PQVA---QQQILHKMISKASIL----PNGLKHAMNDEDNDEWVKAMNLEMESMHSNSVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYT

Query:  QKEGVDCEETFSPVAMLKSIRILLSIATYYDYEI
        QK  +D EETF+PVA + S R +LS+   Y+ ++
Subjt:  QKEGVDCEETFSPVAMLKSIRILLSIATYYDYEI

P04146 Copia protein (e value: 1.4e-09, % identity: 31.06)
Query:  ISAKAVGAAKLGHTNIDRI--GRLVKNGLLSKLEDDSL--------PPCESCLEGKMTKRPFTGKGYRA--KEPLELIHSDLCGPMNVKARGGYEYFISF
        I+AK     +L H     I  G+L++    +   D SL          CE CL GK  + PF     +   K PL ++HSD+CGP+         YF+ F
Subjt:  ISAKAVGAAKLGHTNIDRI--GRLVKNGLLSKLEDDSL--------PPCESCLEGKMTKRPFTGKGYRA--KEPLELIHSDLCGPMNVKARGGYEYFISF

Query:  IDDYSRYGYLYLMGRKSEALEKFKEFKAECSA
        +D ++ Y   YL+  KS+    F++F A+  A
Subjt:  IDDYSRYGYLYLMGRKSEALEKFKEFKAECSA

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.2e-21, % identity: 43.52)
Query:  PNGLKHAMNDEDNDEWVKAMNLEMESMHSNSVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLSI
        P  LK  ++  + ++ +KAM  EMES+  N  ++L +LP G +P+ CKW++K K+D   K+  +KARLV KG+ QK+G+D +E FSPV  + SIR +LS+
Subjt:  PNGLKHAMNDEDNDEWVKAMNLEMESMHSNSVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLSI

Query:  ATYYDYEI
        A   D E+
Subjt:  ATYYDYEI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 4.9e-15, % identity: 34.58)
Query:  KLGHTNIDRIGRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGRKSEALE
        ++GH +   +  L K  L+S  +  ++ PC+ CL GK  +  F     R    L+L++SD+CGPM +++ GG +YF++FIDD SR  ++Y++  K +  +
Subjt:  KLGHTNIDRIGRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGRKSEALE

Query:  KFKEFKA
         F++F A
Subjt:  KFKEFKA

P92520 Uncharacterized mitochondrial protein AtMg00820 (e value: 3.5e-13, % identity: 40.7)
Query:  WVKAMNLEMESMHSNSVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLSIA
        W +AM  E++++  N  W L   P     +GCKW++K K  + G +   KARLVAKG+ Q+EG+   ET+SPV    +IR +L++A
Subjt:  WVKAMNLEMESMHSNSVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLSIA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 1.4e-14, % identity: 42.16)
Query:  PNGLKHAMNDEDNDEWVKAMNLEMESMHSNSVWELA-DLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLS
        P     A+ DE    W  AM  E+ +   N  W+L    P+ V  +GC+WI+ +K ++ G +  +KARLVAKGY Q+ G+D  ETFSPV    SIRI+L 
Subjt:  PNGLKHAMNDEDNDEWVKAMNLEMESMHSNSVWELA-DLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLS

Query:  IA
        +A
Subjt:  IA

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 6.4e-15, % identity: 43.14)
Query:  PNGLKHAMNDEDNDEWVKAMNLEMESMHSNSVWELA-DLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLS
        P     AM D   D W +AM  E+ +   N  W+L    P  V  +GC+WI+ +K ++ G +  +KARLVAKGY Q+ G+D  ETFSPV    SIRI+L 
Subjt:  PNGLKHAMNDEDNDEWVKAMNLEMESMHSNSVWELA-DLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLS

Query:  IA
        +A
Subjt:  IA

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 1.9e-22, % identity: 48.39)
Query:  WVKAMNLEMESMHSNSVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLSIATYYDYEI
        W  AM+ E+ +M +   WE+  LP   KPIGCKW+YK K ++ G ++ +KARLVAKGYTQ+EG+D  ETFSPV  L S++++L+I+  Y++ +
Subjt:  WVKAMNLEMESMHSNSVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLSIATYYDYEI

ATMG00300.1 Gag-Pol-related retrotransposon family protein (e value: 2.8e-05, % identity: 35.29)
Query:  AKLGHTNIDRIGRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNV
        ++L H +   +  LVK G L   +  SL  CE C+ GK  +  F+   +  K PL+ +HSDL G  +V
Subjt:  AKLGHTNIDRIGRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNV
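
The % identity figures appear consistent with the usual BLAST convention of identical positions divided by alignment length. The sketch below checks this for the ATMG00300.1 hit by counting letters on the middle line as identities (a + marks a conservative substitution and a space a mismatch). The helper name `percent_identity` is hypothetical, and this reading of the middle line is an assumption based on standard BLAST output, not something the page states.

```python
def percent_identity(query: str, midline: str) -> float:
    """Identical positions (letters on the midline) over alignment length, in percent."""
    identical = sum(ch.isalpha() for ch in midline)
    return round(100 * identical / len(query), 2)

# Query and middle line of the ATMG00300.1 alignment, without the row labels.
query = "AKLGHTNIDRIGRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNV"
midline = "++L H +   +  LVK G L   +  SL  CE C+ GK  +  F+   +  K PL+ +HSDL G  +V"

print(percent_identity(query, midline))  # 35.29 (24 identities over 68 positions)
```

The result agrees with the 35.29 reported for this hit, which supports the assumed convention for at least this alignment.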

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) (e value: 2.5e-14, % identity: 40.7)
Query:  WVKAMNLEMESMHSNSVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLSIA
        W +AM  E++++  N  W L   P     +GCKW++K K  + G +   KARLVAKG+ Q+EG+   ET+SPV    +IR +L++A
Subjt:  WVKAMNLEMESMHSNSVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDCEETFSPVAMLKSIRILLSIA


Sequences
CDS sequence
ATGGAAAGAAGATTAAATAGGAAATCTGTTGGTAGCGTCGAGACGCTAGGGTCACAGCGTCGCGACGCTTTGACCTTTTGCCACCTGAATCAAAAAAGGAAAGGCAGCGT
TGAGACACTCAAAGAGCAGCGTCCCGACGCTGATTTAGCTAGTAAAAGGGCAGAATGGTTAAGAAATCTAATTGCTGATATACCTGTATTTTCAATTGGCAACCCAGCTA
TCCCTTTACATTGTGACAGTCAAGCTACTTTAGCTAATGCTAATAATAAAATATATAATGGGAAATGCAGGCATATCAGAATTAGACATAATTCTATTAGACAATTATTG
TCTCATGGTGTGATTTCTCTTGATTTTGTCAGGTCAGAAGAGAACCTAGCGGATCCGTTTACAAAAGGCTTAGCGGGCAAGCGAGTTTCTGAATCATCGCGGGGAATGAG
ACTAAAGCCTATAAGGATCTTCCCTGCTTCTGATGATCAACACCAAGACACCACCAAAAAGCCTTTACTAGTCTTCTCCTTAAGAATAAAGAGGCTAAAGTTTCGAAGCA
ATGCAATGATGAATAAAATAACATTCAACCTGACTAGCCTCCTGAATGAGCTACAACTCTATCAGTCTCTTCTTAAGAACAAGGGACAGATGGAAGGAGAGGCAAACGTT
GTTCACTCTAAAAGAAAGTTCGAGAAGGGTTCATCCTCTGGAACTAAATCTGTAGCCACTTCTTCAAAGAAAACTCAGAAGAAGAATGGAAACAAGGGGAAAGCTCTCGG
TTCTGCTGCTAAAAGCAAGGGAAAAACCAAAGTTGTGGCTGACAAGGGCAAGTGTTTCCACTGCAATGTAGATGGACACTTGAAGAGAAACTGCCCAAAGTACCTTGCTG
AGAAAAAGGAGGAAAAGGAAGGAGCAAGTAATCATCTTTCCTCTTCTTTTCAGGAAAATAATTCCTTTCGGCAGTTGAACGATGGGGAAATGACACTCAGGGTCGGAACT
GGAGAAGTCATTTCAGCTAAAGCAGTGGGAGCTGCGAAACTTGGTCATACAAACATCGATAGAATCGGTCGTTTGGTAAAGAATGGACTTCTAAGCAAGTTAGAAGATGA
TTCACTACCACCTTGTGAATCTTGTCTCGAGGGAAAAATGACCAAGAGACCCTTTACTGGAAAAGGTTACAGAGCCAAAGAACCCTTAGAGTTAATACATTCGGATCTAT
GTGGTCCAATGAATGTAAAAGCTCGAGGAGGGTACGAATATTTCATCTCATTTATAGATGATTATTCAAGATATGGTTACTTGTACCTAATGGGACGTAAGTCTGAAGCC
CTTGAAAAGTTTAAGGAGTTTAAGGCAGAATGTTCTGCTCCCAGTTTTGCTGCAGCACAGAATTTGCTCCCTCAAGTCGCCCAACAACAAATTCTCCACAAAATGATTTC
CAAAGCTTCCATTCTTCCAAATGGGTTAAAACATGCAATGAATGACGAAGATAATGATGAATGGGTCAAAGCCATGAACCTTGAAATGGAGTCAATGCACTCCAATTCTG
TATGGGAACTCGCAGATCTACCAAATGGGGTAAAACCCATAGGGTGCAAATGGATCTATAAGAGAAAAAGGGATGCAGCTGGAAAGGTACAGACCTTCAAAGCTAGACTA
GTGGCAAAGGGTTATACCCAAAAGGAAGGGGTTGACTGTGAGGAAACCTTTTCTCCTGTAGCCATGTTAAAGTCCATAAGAATTCTCTTATCCATAGCCACATATTATGA
CTATGAAATATGA
mRNA sequence
ATGGAAAGAAGATTAAATAGGAAATCTGTTGGTAGCGTCGAGACGCTAGGGTCACAGCGTCGCGACGCTTTGACCTTTTGCCACCTGAATCAAAAAAGGAAAGGCAGCGT
TGAGACACTCAAAGAGCAGCGTCCCGACGCTGATTTAGCTAGTAAAAGGGCAGAATGGTTAAGAAATCTAATTGCTGATATACCTGTATTTTCAATTGGCAACCCAGCTA
TCCCTTTACATTGTGACAGTCAAGCTACTTTAGCTAATGCTAATAATAAAATATATAATGGGAAATGCAGGCATATCAGAATTAGACATAATTCTATTAGACAATTATTG
TCTCATGGTGTGATTTCTCTTGATTTTGTCAGGTCAGAAGAGAACCTAGCGGATCCGTTTACAAAAGGCTTAGCGGGCAAGCGAGTTTCTGAATCATCGCGGGGAATGAG
ACTAAAGCCTATAAGGATCTTCCCTGCTTCTGATGATCAACACCAAGACACCACCAAAAAGCCTTTACTAGTCTTCTCCTTAAGAATAAAGAGGCTAAAGTTTCGAAGCA
ATGCAATGATGAATAAAATAACATTCAACCTGACTAGCCTCCTGAATGAGCTACAACTCTATCAGTCTCTTCTTAAGAACAAGGGACAGATGGAAGGAGAGGCAAACGTT
GTTCACTCTAAAAGAAAGTTCGAGAAGGGTTCATCCTCTGGAACTAAATCTGTAGCCACTTCTTCAAAGAAAACTCAGAAGAAGAATGGAAACAAGGGGAAAGCTCTCGG
TTCTGCTGCTAAAAGCAAGGGAAAAACCAAAGTTGTGGCTGACAAGGGCAAGTGTTTCCACTGCAATGTAGATGGACACTTGAAGAGAAACTGCCCAAAGTACCTTGCTG
AGAAAAAGGAGGAAAAGGAAGGAGCAAGTAATCATCTTTCCTCTTCTTTTCAGGAAAATAATTCCTTTCGGCAGTTGAACGATGGGGAAATGACACTCAGGGTCGGAACT
GGAGAAGTCATTTCAGCTAAAGCAGTGGGAGCTGCGAAACTTGGTCATACAAACATCGATAGAATCGGTCGTTTGGTAAAGAATGGACTTCTAAGCAAGTTAGAAGATGA
TTCACTACCACCTTGTGAATCTTGTCTCGAGGGAAAAATGACCAAGAGACCCTTTACTGGAAAAGGTTACAGAGCCAAAGAACCCTTAGAGTTAATACATTCGGATCTAT
GTGGTCCAATGAATGTAAAAGCTCGAGGAGGGTACGAATATTTCATCTCATTTATAGATGATTATTCAAGATATGGTTACTTGTACCTAATGGGACGTAAGTCTGAAGCC
CTTGAAAAGTTTAAGGAGTTTAAGGCAGAATGTTCTGCTCCCAGTTTTGCTGCAGCACAGAATTTGCTCCCTCAAGTCGCCCAACAACAAATTCTCCACAAAATGATTTC
CAAAGCTTCCATTCTTCCAAATGGGTTAAAACATGCAATGAATGACGAAGATAATGATGAATGGGTCAAAGCCATGAACCTTGAAATGGAGTCAATGCACTCCAATTCTG
TATGGGAACTCGCAGATCTACCAAATGGGGTAAAACCCATAGGGTGCAAATGGATCTATAAGAGAAAAAGGGATGCAGCTGGAAAGGTACAGACCTTCAAAGCTAGACTA
GTGGCAAAGGGTTATACCCAAAAGGAAGGGGTTGACTGTGAGGAAACCTTTTCTCCTGTAGCCATGTTAAAGTCCATAAGAATTCTCTTATCCATAGCCACATATTATGA
CTATGAAATATGA
Protein sequence
MERRLNRKSVGSVETLGSQRRDALTFCHLNQKRKGSVETLKEQRPDADLASKRAEWLRNLIADIPVFSIGNPAIPLHCDSQATLANANNKIYNGKCRHIRIRHNSIRQLL
SHGVISLDFVRSEENLADPFTKGLAGKRVSESSRGMRLKPIRIFPASDDQHQDTTKKPLLVFSLRIKRLKFRSNAMMNKITFNLTSLLNELQLYQSLLKNKGQMEGEANV
VHSKRKFEKGSSSGTKSVATSSKKTQKKNGNKGKALGSAAKSKGKTKVVADKGKCFHCNVDGHLKRNCPKYLAEKKEEKEGASNHLSSSFQENNSFRQLNDGEMTLRVGT
GEVISAKAVGAAKLGHTNIDRIGRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGRKSEA
LEKFKEFKAECSAPSFAAAQNLLPQVAQQQILHKMISKASILPNGLKHAMNDEDNDEWVKAMNLEMESMHSNSVWELADLPNGVKPIGCKWIYKRKRDAAGKVQTFKARL
VAKGYTQKEGVDCEETFSPVAMLKSIRILLSIATYYDYEI
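
The relationship between the CDS and protein sequences above can be spot-checked with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly. The names `CODON_TABLE` and `translate` are hypothetical, and only the first 30 and last 21 nucleotides of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code (assumed), laid out in TCAG order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGAAAGAAGATTAAATAGGAAATCTGTT"))  # MERRLNRKSV (start of the protein)
print(translate("TATTATGACTATGAAATATGA"))           # YYDYEI (end of the protein, before TGA)
```

Both fragments match the corresponding ends of the listed protein sequence. Passing the full CDS, joined into a single string, to the same function would be the natural way to check the whole record.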