CuGenDBv2

Tan0006942 (gene) of Snake gourd v1 genome

Gene ID: Tan0006942
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG10:60283993..60285507
RNA-Seq Expression: Tan0006942
Synteny: Tan0006942
Gene Ontology terms: NA
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
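The genome location field can be split into a sequence name and coordinates for downstream use. The following is a minimal sketch; the helper name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the page does not state.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

seqid, start, end = parse_location("LG10:60283993..60285507")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 1515 bp on LG10.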


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 1.7e-123; % identity: 51.3)
Query:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYYDYE------------
        M+ E+ESM+FNSVWDLVD+PDGVKPIGCKW+YKRKRG DGKVQTFKARLVAKGYTQVEGVDYEETF PVAM+KSIRILL IAAY+DYE            
Subjt:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYYDYE------------

Query:  ------------------------------------------------------------------------------------------------EWLA
                                                                                                        +WLA
Subjt:  ------------------------------------------------------------------------------------------------EWLA

Query:  MQFQMKDLGDAQFVLGNQIVWNRKNKILALSQASYLDKVLSRFKMQDSKKGLLLFRHVIHLSKEQCPKTPQGVEDMRRIPYASPFGRAMYAMLCTRPDIH
         QFQMKDLG+AQFVLG QI  +RKNK+LALSQASY+DK++ ++ MQ+SK+GLL FRH + LSKEQCPKTPQ VE+MR IPYAS  G  MYAMLCTRPDI 
Subjt:  MQFQMKDLGDAQFVLGNQIVWNRKNKILALSQASYLDKVLSRFKMQDSKKGLLLFRHVIHLSKEQCPKTPQGVEDMRRIPYASPFGRAMYAMLCTRPDIH

Query:  FC-----------------------------------------------------DWDGR-------------------VKQGCIADSTMEAEYVAACEA
        +                                                      D D R                   +KQGCIADSTMEAEYVAACEA
Subjt:  FC-----------------------------------------------------DWDGR-------------------VKQGCIADSTMEAEYVAACEA

Query:  AKEVVWLRKFMLNLEVVPNMTLPITMYCDNSGAVANSREPRSHKRGKHIERKYHLIGEIVHRGDVIITKIASEHNIVDPFTKALTAKVFESHLEGLGLQ
        AKE VWLR F+++LEVVPNM+ PIT+YCDNSGAVANSREPRSHKRGKHIERKYHLI EIVHRGDVI+T+IAS HN+ DPFTK LTAKVFE HLE LGL+
Subjt:  AKEVVWLRKFMLNLEVVPNMTLPITMYCDNSGAVANSREPRSHKRGKHIERKYHLIGEIVHRGDVIITKIASEHNIVDPFTKALTAKVFESHLEGLGLQ

KAA0054309.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 1.1e-127; % identity: 58.18)
Query:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYYDYE------------
        M+ E+ESM+FNSVWDLVD+PDGVKPIGCKW+YKRKRG DGKVQTFKARLVAKGYTQVEGVDYEETF  VAM+KSIRILL IAAY++YE            
Subjt:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYYDYE------------

Query:  -------------------------EWLAMQFQMKDLGDAQFVLGNQIVWNRKNKILALSQASYLDKVLSRFKMQDSKKGLLLFRHVIHLSKEQCPKTPQ
                                 +WLA QFQMKDLG+AQFVLG QI  +RKNK+LALSQASY+DK++ ++ MQ+SK+ LL FRH + LSKEQCPKTPQ
Subjt:  -------------------------EWLAMQFQMKDLGDAQFVLGNQIVWNRKNKILALSQASYLDKVLSRFKMQDSKKGLLLFRHVIHLSKEQCPKTPQ

Query:  GVEDMRRIPYASPFGRAMYAMLCTRPDIHFC-----------------------------------------------------DWDGR-----------
         VE+MR IPYAS  G  MYAMLCTRPDI +                                                      D D R           
Subjt:  GVEDMRRIPYASPFGRAMYAMLCTRPDIHFC-----------------------------------------------------DWDGR-----------

Query:  --------VKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLPITMYCDNSGAVANSREPRSHKRGKHIERKYHLIGEIVHRGDVIITKIA
                +KQGCIADSTME EYVAACEAAKEVVWLR F+++LEVVPNM+ PIT+YCDN+G VANSREPRSHKRGKHIERKYHLI EIVHRGDVI+T+I 
Subjt:  --------VKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLPITMYCDNSGAVANSREPRSHKRGKHIERKYHLIGEIVHRGDVIITKIA

Query:  SEHNIVDPFTKALTAKVFESHLEGLGLQ
        S HN+ DPFTK LTAKVFE HLE LG++
Subjt:  SEHNIVDPFTKALTAKVFESHLEGLGLQ

KAA0055498.1 gag-pol fusion protein [Cucumis melo var. makuwa] (e value: 3.9e-128; % identity: 59.24)
Query:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYYDYE------------
        M+ E+ESM+FNSVWDLVD+PDGVKPIGCKW+YKRKRG DGKVQTFKARLVAKGYTQVEGVDYEETF PVAM+KSIRILL IAAY+DYE            
Subjt:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYYDYE------------

Query:  -------------------------EWLAMQFQMKDLGDAQFVLGNQIVWNRKNKILALSQASYLDKVLSRFKMQDSKKGLLLFRHVIHLSKEQCPKTPQ
                                 +WLA QFQMKDLG+AQFVLG QI  +RKNKILALSQASY+DK++ ++ MQ+SK+GLL FRH + LSKEQCPKTPQ
Subjt:  -------------------------EWLAMQFQMKDLGDAQFVLGNQIVWNRKNKILALSQASYLDKVLSRFKMQDSKKGLLLFRHVIHLSKEQCPKTPQ

Query:  GVEDMRRIPYASPFGRAMYAMLCTRPDIHFC-----------------------------------------------------DWDGR-----------
         VE+MR IPYAS  G  MY MLCTRPDI +                                                      D D R           
Subjt:  GVEDMRRIPYASPFGRAMYAMLCTRPDIHFC-----------------------------------------------------DWDGR-----------

Query:  --------VKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLPITMYCDNSGAVANSREPRSHKRGKHIERKYHLIGEIVHRGDVIITKIA
                +KQGCIADST+EAEYVAACEAAKE VWLR F+++LEVVPNM+ PIT+YCDNSGAVANSREPRSHKRGKHIERKYHLI EIVHRGDVI+T+IA
Subjt:  --------VKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLPITMYCDNSGAVANSREPRSHKRGKHIERKYHLIGEIVHRGDVIITKIA

Query:  SEHNIVDPFTKALTAKVFESHL
        S HN+ DPFTK L AKVFE HL
Subjt:  SEHNIVDPFTKALTAKVFESHL

KAA0059546.1 retrotransposon protein, putative, Ty1-copia subclass [Cucumis melo var. makuwa] (e value: 1.2e-129; % identity: 59.11)
Query:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYYDYE------------
        M+ E+ES++FNSVWDLVD+PDGVKPIGCKW+YKRKRG DGKVQTFKARLVAKGYTQVEGVDYEETF PVAM+KSIRILL IAAY+DYE            
Subjt:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYYDYE------------

Query:  -------------------------EWLAMQFQMKDLGDAQFVLGNQIVWNRKNKILALSQASYLDKVLSRFKMQDSKKGLLLFRHVIHLSKEQCPKTPQ
                                 +WLA QFQMKDLG+AQFVLG QI  +RKNK+LALSQASY+DK++ ++ MQ+SK+GLL FRH + LSKEQCPKTPQ
Subjt:  -------------------------EWLAMQFQMKDLGDAQFVLGNQIVWNRKNKILALSQASYLDKVLSRFKMQDSKKGLLLFRHVIHLSKEQCPKTPQ

Query:  GVEDMRRIPYASPFGRAMYAMLCTRPDIHF-----------------------------------------------------CDWDGR-----------
         VE+MR IPYAS  G  +YAMLCTRPDI +                                                      D D R           
Subjt:  GVEDMRRIPYASPFGRAMYAMLCTRPDIHF-----------------------------------------------------CDWDGR-----------

Query:  --------VKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLPITMYCDNSGAVANSREPRSHKRGKHIERKYHLIGEIVHRGDVIITKIA
                +KQGCIADSTMEAEYVAACEAAKE VWLR F+++LEVVPNM+ PIT+YCDNSGAVANSREPRSHKRGKHIERKYHLI EIVHR DVI+T+IA
Subjt:  --------VKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLPITMYCDNSGAVANSREPRSHKRGKHIERKYHLIGEIVHRGDVIITKIA

Query:  SEHNIVDPFTKALTAKVFESHLEGLGLQ
        S HN+ DPFTK L AKVFESHLE LGL+
Subjt:  SEHNIVDPFTKALTAKVFESHLEGLGLQ

TYJ97618.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 1.7e-123; % identity: 51.3)
Query:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYYDYE------------
        M+ E+ESM+FNSVWDLVD+PDGVKPIGCKW+YKRKRG DGKVQTFKARLVAKGYTQVEGVDYEETF PVAM+KSIRILL IAAY+DYE            
Subjt:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYYDYE------------

Query:  ------------------------------------------------------------------------------------------------EWLA
                                                                                                        +WLA
Subjt:  ------------------------------------------------------------------------------------------------EWLA

Query:  MQFQMKDLGDAQFVLGNQIVWNRKNKILALSQASYLDKVLSRFKMQDSKKGLLLFRHVIHLSKEQCPKTPQGVEDMRRIPYASPFGRAMYAMLCTRPDIH
         QFQMKDLG+AQFVLG QI  +RKNK+LALSQASY+DK++ ++ MQ+SK+GLL FRH + LSKEQCPKTPQ VE+MR IPYAS  G  MYAMLCTRPDI 
Subjt:  MQFQMKDLGDAQFVLGNQIVWNRKNKILALSQASYLDKVLSRFKMQDSKKGLLLFRHVIHLSKEQCPKTPQGVEDMRRIPYASPFGRAMYAMLCTRPDIH

Query:  FC-----------------------------------------------------DWDGR-------------------VKQGCIADSTMEAEYVAACEA
        +                                                      D D R                   +KQGCIADSTMEAEYVAACEA
Subjt:  FC-----------------------------------------------------DWDGR-------------------VKQGCIADSTMEAEYVAACEA

Query:  AKEVVWLRKFMLNLEVVPNMTLPITMYCDNSGAVANSREPRSHKRGKHIERKYHLIGEIVHRGDVIITKIASEHNIVDPFTKALTAKVFESHLEGLGLQ
        AKE VWLR F+++LEVVPNM+ PIT+YCDNSGAVANSREPRSHKRGKHIERKYHLI EIVHRGDVI+T+IAS HN+ DPFTK LTAKVFE HLE LGL+
Subjt:  AKEVVWLRKFMLNLEVVPNMTLPITMYCDNSGAVANSREPRSHKRGKHIERKYHLIGEIVHRGDVIITKIASEHNIVDPFTKALTAKVFESHLEGLGLQ

TrEMBL top hits (columns: e value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein (e value: 8.1e-124; % identity: 51.3)
Query:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYYDYE------------
        M+ E+ESM+FNSVWDLVD+PDGVKPIGCKW+YKRKRG DGKVQTFKARLVAKGYTQVEGVDYEETF PVAM+KSIRILL IAAY+DYE            
Subjt:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYYDYE------------

Query:  ------------------------------------------------------------------------------------------------EWLA
                                                                                                        +WLA
Subjt:  ------------------------------------------------------------------------------------------------EWLA

Query:  MQFQMKDLGDAQFVLGNQIVWNRKNKILALSQASYLDKVLSRFKMQDSKKGLLLFRHVIHLSKEQCPKTPQGVEDMRRIPYASPFGRAMYAMLCTRPDIH
         QFQMKDLG+AQFVLG QI  +RKNK+LALSQASY+DK++ ++ MQ+SK+GLL FRH + LSKEQCPKTPQ VE+MR IPYAS  G  MYAMLCTRPDI 
Subjt:  MQFQMKDLGDAQFVLGNQIVWNRKNKILALSQASYLDKVLSRFKMQDSKKGLLLFRHVIHLSKEQCPKTPQGVEDMRRIPYASPFGRAMYAMLCTRPDIH

Query:  FC-----------------------------------------------------DWDGR-------------------VKQGCIADSTMEAEYVAACEA
        +                                                      D D R                   +KQGCIADSTMEAEYVAACEA
Subjt:  FC-----------------------------------------------------DWDGR-------------------VKQGCIADSTMEAEYVAACEA

Query:  AKEVVWLRKFMLNLEVVPNMTLPITMYCDNSGAVANSREPRSHKRGKHIERKYHLIGEIVHRGDVIITKIASEHNIVDPFTKALTAKVFESHLEGLGLQ
        AKE VWLR F+++LEVVPNM+ PIT+YCDNSGAVANSREPRSHKRGKHIERKYHLI EIVHRGDVI+T+IAS HN+ DPFTK LTAKVFE HLE LGL+
Subjt:  AKEVVWLRKFMLNLEVVPNMTLPITMYCDNSGAVANSREPRSHKRGKHIERKYHLIGEIVHRGDVIITKIASEHNIVDPFTKALTAKVFESHLEGLGLQ

A0A5A7UKD1 Gag-pol fusion protein (e value: 1.9e-128; % identity: 59.24)
Query:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYYDYE------------
        M+ E+ESM+FNSVWDLVD+PDGVKPIGCKW+YKRKRG DGKVQTFKARLVAKGYTQVEGVDYEETF PVAM+KSIRILL IAAY+DYE            
Subjt:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYYDYE------------

Query:  -------------------------EWLAMQFQMKDLGDAQFVLGNQIVWNRKNKILALSQASYLDKVLSRFKMQDSKKGLLLFRHVIHLSKEQCPKTPQ
                                 +WLA QFQMKDLG+AQFVLG QI  +RKNKILALSQASY+DK++ ++ MQ+SK+GLL FRH + LSKEQCPKTPQ
Subjt:  -------------------------EWLAMQFQMKDLGDAQFVLGNQIVWNRKNKILALSQASYLDKVLSRFKMQDSKKGLLLFRHVIHLSKEQCPKTPQ

Query:  GVEDMRRIPYASPFGRAMYAMLCTRPDIHFC-----------------------------------------------------DWDGR-----------
         VE+MR IPYAS  G  MY MLCTRPDI +                                                      D D R           
Subjt:  GVEDMRRIPYASPFGRAMYAMLCTRPDIHFC-----------------------------------------------------DWDGR-----------

Query:  --------VKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLPITMYCDNSGAVANSREPRSHKRGKHIERKYHLIGEIVHRGDVIITKIA
                +KQGCIADST+EAEYVAACEAAKE VWLR F+++LEVVPNM+ PIT+YCDNSGAVANSREPRSHKRGKHIERKYHLI EIVHRGDVI+T+IA
Subjt:  --------VKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLPITMYCDNSGAVANSREPRSHKRGKHIERKYHLIGEIVHRGDVIITKIA

Query:  SEHNIVDPFTKALTAKVFESHL
        S HN+ DPFTK L AKVFE HL
Subjt:  SEHNIVDPFTKALTAKVFESHL

A0A5A7ULH1 Gag/pol protein (e value: 5.4e-128; % identity: 58.18)
Query:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYYDYE------------
        M+ E+ESM+FNSVWDLVD+PDGVKPIGCKW+YKRKRG DGKVQTFKARLVAKGYTQVEGVDYEETF  VAM+KSIRILL IAAY++YE            
Subjt:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYYDYE------------

Query:  -------------------------EWLAMQFQMKDLGDAQFVLGNQIVWNRKNKILALSQASYLDKVLSRFKMQDSKKGLLLFRHVIHLSKEQCPKTPQ
                                 +WLA QFQMKDLG+AQFVLG QI  +RKNK+LALSQASY+DK++ ++ MQ+SK+ LL FRH + LSKEQCPKTPQ
Subjt:  -------------------------EWLAMQFQMKDLGDAQFVLGNQIVWNRKNKILALSQASYLDKVLSRFKMQDSKKGLLLFRHVIHLSKEQCPKTPQ

Query:  GVEDMRRIPYASPFGRAMYAMLCTRPDIHFC-----------------------------------------------------DWDGR-----------
         VE+MR IPYAS  G  MYAMLCTRPDI +                                                      D D R           
Subjt:  GVEDMRRIPYASPFGRAMYAMLCTRPDIHFC-----------------------------------------------------DWDGR-----------

Query:  --------VKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLPITMYCDNSGAVANSREPRSHKRGKHIERKYHLIGEIVHRGDVIITKIA
                +KQGCIADSTME EYVAACEAAKEVVWLR F+++LEVVPNM+ PIT+YCDN+G VANSREPRSHKRGKHIERKYHLI EIVHRGDVI+T+I 
Subjt:  --------VKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLPITMYCDNSGAVANSREPRSHKRGKHIERKYHLIGEIVHRGDVIITKIA

Query:  SEHNIVDPFTKALTAKVFESHLEGLGLQ
        S HN+ DPFTK LTAKVFE HLE LG++
Subjt:  SEHNIVDPFTKALTAKVFESHLEGLGLQ

A0A5A7UZE3 Retrotransposon protein, putative, Ty1-copia subclass (e value: 5.8e-130; % identity: 59.11)
Query:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYYDYE------------
        M+ E+ES++FNSVWDLVD+PDGVKPIGCKW+YKRKRG DGKVQTFKARLVAKGYTQVEGVDYEETF PVAM+KSIRILL IAAY+DYE            
Subjt:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYYDYE------------

Query:  -------------------------EWLAMQFQMKDLGDAQFVLGNQIVWNRKNKILALSQASYLDKVLSRFKMQDSKKGLLLFRHVIHLSKEQCPKTPQ
                                 +WLA QFQMKDLG+AQFVLG QI  +RKNK+LALSQASY+DK++ ++ MQ+SK+GLL FRH + LSKEQCPKTPQ
Subjt:  -------------------------EWLAMQFQMKDLGDAQFVLGNQIVWNRKNKILALSQASYLDKVLSRFKMQDSKKGLLLFRHVIHLSKEQCPKTPQ

Query:  GVEDMRRIPYASPFGRAMYAMLCTRPDIHF-----------------------------------------------------CDWDGR-----------
         VE+MR IPYAS  G  +YAMLCTRPDI +                                                      D D R           
Subjt:  GVEDMRRIPYASPFGRAMYAMLCTRPDIHF-----------------------------------------------------CDWDGR-----------

Query:  --------VKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLPITMYCDNSGAVANSREPRSHKRGKHIERKYHLIGEIVHRGDVIITKIA
                +KQGCIADSTMEAEYVAACEAAKE VWLR F+++LEVVPNM+ PIT+YCDNSGAVANSREPRSHKRGKHIERKYHLI EIVHR DVI+T+IA
Subjt:  --------VKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLPITMYCDNSGAVANSREPRSHKRGKHIERKYHLIGEIVHRGDVIITKIA

Query:  SEHNIVDPFTKALTAKVFESHLEGLGLQ
        S HN+ DPFTK L AKVFESHLE LGL+
Subjt:  SEHNIVDPFTKALTAKVFESHLEGLGLQ

A0A5D3CPJ6 Gag/pol protein (e value: 8.1e-124; % identity: 51.3)
Query:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYYDYE------------
        M+ E+ESM+FNSVWDLVD+PDGVKPIGCKW+YKRKRG DGKVQTFKARLVAKGYTQVEGVDYEETF PVAM+KSIRILL IAAY+DYE            
Subjt:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYYDYE------------

Query:  ------------------------------------------------------------------------------------------------EWLA
                                                                                                        +WLA
Subjt:  ------------------------------------------------------------------------------------------------EWLA

Query:  MQFQMKDLGDAQFVLGNQIVWNRKNKILALSQASYLDKVLSRFKMQDSKKGLLLFRHVIHLSKEQCPKTPQGVEDMRRIPYASPFGRAMYAMLCTRPDIH
         QFQMKDLG+AQFVLG QI  +RKNK+LALSQASY+DK++ ++ MQ+SK+GLL FRH + LSKEQCPKTPQ VE+MR IPYAS  G  MYAMLCTRPDI 
Subjt:  MQFQMKDLGDAQFVLGNQIVWNRKNKILALSQASYLDKVLSRFKMQDSKKGLLLFRHVIHLSKEQCPKTPQGVEDMRRIPYASPFGRAMYAMLCTRPDIH

Query:  FC-----------------------------------------------------DWDGR-------------------VKQGCIADSTMEAEYVAACEA
        +                                                      D D R                   +KQGCIADSTMEAEYVAACEA
Subjt:  FC-----------------------------------------------------DWDGR-------------------VKQGCIADSTMEAEYVAACEA

Query:  AKEVVWLRKFMLNLEVVPNMTLPITMYCDNSGAVANSREPRSHKRGKHIERKYHLIGEIVHRGDVIITKIASEHNIVDPFTKALTAKVFESHLEGLGLQ
        AKE VWLR F+++LEVVPNM+ PIT+YCDNSGAVANSREPRSHKRGKHIERKYHLI EIVHRGDVI+T+IAS HN+ DPFTK LTAKVFE HLE LGL+
Subjt:  AKEVVWLRKFMLNLEVVPNMTLPITMYCDNSGAVANSREPRSHKRGKHIERKYHLIGEIVHRGDVIITKIASEHNIVDPFTKALTAKVFESHLEGLGLQ

SwissProt top hits (columns: e value, % identity, alignment)
P04146 Copia protein (e value: 1.0e-22; % identity: 21.74)
Query:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYY---------------
        ++ E+ +   N+ W +  +P+    +  +WV+  K    G    +KARLVA+G+TQ   +DYEETF PVA + S R +L +   Y               
Subjt:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYY---------------

Query:  ---------------------------------------------------------------------------------------------DYEEWLA
                                                                                                     +++ +L 
Subjt:  ---------------------------------------------------------------------------------------------DYEEWLA

Query:  MQFQMKDLGDAQFVLGNQIVWNRKNKILALSQASYLDKVLSRFKMQDSKKGLLLFRHVIHL----SKEQCPKTPQGVEDMRRIPYASPFGRAMYAMLCTR
         +F+M DL + +  +G +I   +++KI  LSQ++Y+ K+LS+F M++           I+     S E C             P  S  G  MY MLCTR
Subjt:  MQFQMKDLGDAQFVLGNQIVWNRKNKILALSQASYLDKVLSRFKMQDSKKGLLLFRHVIHL----SKEQCPKTPQGVEDMRRIPYASPFGRAMYAMLCTR

Query:  PDIHFC----------------------------------------------------DWDG------------------------RVKQGCIADSTMEA
        PD+                                                       DW G                          +Q  +A S+ EA
Subjt:  PDIHFC----------------------------------------------------DWDG------------------------RVKQGCIADSTMEA

Query:  EYVAACEAAKEVVWLRKFMLNLEVVPNMTLPITMYCDNSGAVANSREPRSHKRGKHIERKYHLIGEIVHRGDVIITKIASEHNIVDPFTKALTAKVFESH
        EY+A  EA +E +WL+  + ++ +   +  PI +Y DN G ++ +  P  HKR KHI+ KYH   E V    + +  I +E+ + D FTK L A  F   
Subjt:  EYVAACEAAKEVVWLRKFMLNLEVVPNMTLPITMYCDNSGAVANSREPRSHKRGKHIERKYHLIGEIVHRGDVIITKIASEHNIVDPFTKALTAKVFESH

Query:  LEGLGL
         + LGL
Subjt:  LEGLGL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 5.7e-42; % identity: 26.6)
Query:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYYDYE------------
        M +EMES+  N  + LV+ P G +P+ CKWV+K K+  D K+  +KARLV KG+ Q +G+D++E F PV  + SIR +L +AA  D E            
Subjt:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYYDYE------------

Query:  ---------------------------------------EW----------------------------------------------------------L
                                               +W                                                          L
Subjt:  ---------------------------------------EW----------------------------------------------------------L

Query:  AMQFQMKDLGDAQFVLGNQIVWNRKNKILALSQASYLDKVLSRFKMQDSKKGLLLFRHVIHLSKEQCPKTPQGVEDMRRIPYASPFGRAMYAMLCTRPDI
        +  F MKDLG AQ +LG +IV  R ++ L LSQ  Y+++VL RF M+++K         + LSK+ CP T +   +M ++PY+S  G  MYAM+CTRPDI
Subjt:  AMQFQMKDLGDAQFVLGNQIVWNRKNKILALSQASYLDKVLSRFKMQDSKKGLLLFRHVIHLSKEQCPKTPQGVEDMRRIPYASPFGRAMYAMLCTRPDI

Query:  -----------------HF--------------------------------------------------------CDWDGRVKQGCIADSTMEAEYVAAC
                         H+                                                          W  ++ Q C+A ST EAEY+AA 
Subjt:  -----------------HF--------------------------------------------------------CDWDGRVKQGCIADSTMEAEYVAAC

Query:  EAAKEVVWLRKFMLNLEVVPNMTLPITMYCDNSGAVANSREPRSHKRGKHIERKYHLIGEIVHRGDVIITKIASEHNIVDPFTKALTAKVFESHLEGLGL
        E  KE++WL++F+  L +         +YCD+  A+  S+    H R KHI+ +YH I E+V    + + KI++  N  D  TK +    FE   E +G+
Subjt:  EAAKEVVWLRKFMLNLEVVPNMTLPITMYCDNSGAVANSREPRSHKRGKHIERKYHLIGEIVHRGDVIITKIASEHNIVDPFTKALTAKVFESHLEGLGL

P92520 Uncharacterized mitochondrial protein AtMg00820 (e value: 7.3e-13; % identity: 38.14)
Query:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYYDYEEWLAMQFQM
        M +E++++  N  W LV  P     +GCKWV+K K   DG +   KARLVAKG+ Q EG+ + ET+ PV    +IR +L +A   +  + +   F+M
Subjt:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYYDYEEWLAMQFQM
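The % identity reported for the P92520 hit can be recomputed from the alignment above. The sketch below assumes the BLAST-style midline convention, in which a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch. The function name `percent_identity` is hypothetical, and the alignment length is taken to be the length of the aligned query row.

```python
def percent_identity(query: str, midline: str) -> float:
    """Identical positions (letters in the midline) over the alignment length."""
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)

# Query row and midline copied from the P92520 alignment.
query = ("MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVA"
         "MVKSIRILLVIAAYYDYEEWLAMQFQM")
midline = ("M +E++++  N  W LV  P     +GCKWV+K K   DG +   KARLVAKG+ Q EG+ + ET+ PV"
           "    +IR +L +A   +  + +   F+M")

identity = round(percent_identity(query, midline), 2)
print(identity)
```

Here 37 identical positions over 97 aligned columns give 38.14, matching the value listed for this hit.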

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 1.5e-13; % identity: 46.99)
Query:  MDQEMESMHFNSVWDLVDKPDG-VKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIA
        M  E+ +   N  WDLV  P   V  +GC+W++ +K   DG +  +KARLVAKGY Q  G+DY ETF PV    SIRI+L +A
Subjt:  MDQEMESMHFNSVWDLVDKPDG-VKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIA

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 1.9e-13; % identity: 46.99)
Query:  MDQEMESMHFNSVWDLV-DKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIA
        M  E+ +   N  WDLV   P  V  +GC+W++ +K   DG +  +KARLVAKGY Q  G+DY ETF PV    SIRI+L +A
Subjt:  MDQEMESMHFNSVWDLV-DKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIA

Arabidopsis top hits (columns: e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 9.8e-21; % identity: 49.43)
Query:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYYDY
        MD E+ +M     W++   P   KPIGCKWVYK K   DG ++ +KARLVAKGYTQ EG+D+ ETF PV  + S++++L I+A Y++
Subjt:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYYDY

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) (e value: 5.2e-14; % identity: 38.14)
Query:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYYDYEEWLAMQFQM
        M +E++++  N  W LV  P     +GCKWV+K K   DG +   KARLVAKG+ Q EG+ + ET+ PV    +IR +L +A   +  + +   F+M
Subjt:  MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYYDYEEWLAMQFQM


Sequences
CDS sequence
ATGGACCAAGAAATGGAGTCTATGCACTTCAATTCTGTTTGGGATCTTGTAGATAAGCCTGATGGGGTAAAACCTATAGGTTGTAAGTGGGTCTACAAGAGAAAACGTGG
TGTAGATGGGAAGGTGCAAACCTTTAAAGCTAGGCTAGTAGCAAAGGGTTATACCCAGGTTGAAGGGGTTGACTATGAGGAGACCTTTTTACCTGTTGCTATGGTAAAGT
CTATTCGTATCCTTCTTGTCATTGCCGCATATTATGACTATGAGGAATGGCTAGCTATGCAATTCCAAATGAAAGATTTGGGTGATGCGCAGTTTGTTCTTGGGAACCAG
ATTGTCTGGAACCGCAAGAATAAAATACTAGCCTTGTCTCAAGCATCATACTTAGACAAAGTGTTGTCAAGATTTAAGATGCAAGATTCCAAAAAGGGCTTGTTGCTTTT
TAGACATGTAATCCATTTGTCTAAGGAACAATGTCCTAAGACACCTCAAGGAGTTGAGGATATGAGACGAATTCCTTACGCATCACCGTTTGGGAGAGCGATGTACGCCA
TGTTGTGTACTAGGCCCGACATCCATTTTTGCGATTGGGATGGTCGCGTCAAGCAAGGATGCATCGCTGATTCCACCATGGAGGCCGAATATGTAGCAGCTTGTGAAGCA
GCTAAGGAGGTCGTTTGGTTAAGGAAATTTATGCTAAATTTGGAAGTTGTTCCAAATATGACTTTGCCCATCACAATGTATTGTGATAACAGTGGTGCAGTGGCAAATTC
GAGGGAACCCCGAAGTCACAAGAGGGGAAAGCACATAGAGCGGAAGTATCATCTCATCGGGGAGATCGTGCATAGAGGAGACGTGATTATCACGAAGATAGCCTCAGAGC
ACAACATTGTTGATCCTTTTACAAAGGCTCTCACGGCTAAAGTTTTTGAGAGTCACCTAGAAGGTCTAGGTTTACAAGTCTTCTCCAACTAG
mRNA sequence
ATGGACCAAGAAATGGAGTCTATGCACTTCAATTCTGTTTGGGATCTTGTAGATAAGCCTGATGGGGTAAAACCTATAGGTTGTAAGTGGGTCTACAAGAGAAAACGTGG
TGTAGATGGGAAGGTGCAAACCTTTAAAGCTAGGCTAGTAGCAAAGGGTTATACCCAGGTTGAAGGGGTTGACTATGAGGAGACCTTTTTACCTGTTGCTATGGTAAAGT
CTATTCGTATCCTTCTTGTCATTGCCGCATATTATGACTATGAGGAATGGCTAGCTATGCAATTCCAAATGAAAGATTTGGGTGATGCGCAGTTTGTTCTTGGGAACCAG
ATTGTCTGGAACCGCAAGAATAAAATACTAGCCTTGTCTCAAGCATCATACTTAGACAAAGTGTTGTCAAGATTTAAGATGCAAGATTCCAAAAAGGGCTTGTTGCTTTT
TAGACATGTAATCCATTTGTCTAAGGAACAATGTCCTAAGACACCTCAAGGAGTTGAGGATATGAGACGAATTCCTTACGCATCACCGTTTGGGAGAGCGATGTACGCCA
TGTTGTGTACTAGGCCCGACATCCATTTTTGCGATTGGGATGGTCGCGTCAAGCAAGGATGCATCGCTGATTCCACCATGGAGGCCGAATATGTAGCAGCTTGTGAAGCA
GCTAAGGAGGTCGTTTGGTTAAGGAAATTTATGCTAAATTTGGAAGTTGTTCCAAATATGACTTTGCCCATCACAATGTATTGTGATAACAGTGGTGCAGTGGCAAATTC
GAGGGAACCCCGAAGTCACAAGAGGGGAAAGCACATAGAGCGGAAGTATCATCTCATCGGGGAGATCGTGCATAGAGGAGACGTGATTATCACGAAGATAGCCTCAGAGC
ACAACATTGTTGATCCTTTTACAAAGGCTCTCACGGCTAAAGTTTTTGAGAGTCACCTAGAAGGTCTAGGTTTACAAGTCTTCTCCAACTAG
Protein sequence
MDQEMESMHFNSVWDLVDKPDGVKPIGCKWVYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFLPVAMVKSIRILLVIAAYYDYEEWLAMQFQMKDLGDAQFVLGNQ
IVWNRKNKILALSQASYLDKVLSRFKMQDSKKGLLLFRHVIHLSKEQCPKTPQGVEDMRRIPYASPFGRAMYAMLCTRPDIHFCDWDGRVKQGCIADSTMEAEYVAACEA
AKEVVWLRKFMLNLEVVPNMTLPITMYCDNSGAVANSREPRSHKRGKHIERKYHLIGEIVHRGDVIITKIASEHNIVDPFTKALTAKVFESHLEGLGLQVFSN
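The protein sequence above should be recoverable from the CDS by translation. The following is a minimal sketch that assumes the standard nuclear genetic code (the page does not name a translation table). The function `translate` is a hypothetical helper, and only the first 30 and last 15 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGACCAAGAAATGGAGTCTATGCACTTC"  # first 30 nt of the CDS
cds_end = "GTCTTCTCCAACTAG"                   # last 15 nt, including the stop codon
print(translate(cds_start), translate(cds_end))
```

The two fragments translate to MDQEMESMHF and VFSN, which agree with the first ten and last four residues of the listed protein.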