; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001050 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001050
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTransposon Ty1-DR4 Gag-Pol polyprotein
Genome locationchr4:23033291..23037923
RNA-Seq ExpressionLag0001050
SyntenyLag0001050
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU49939.1 hypothetical protein TSUD_290960 [Trifolium subterraneum]1.4e-6530.21Show/hide
Query:  HAAGNFNSQGGNNGGSSAYIAT---PKIVNDPKWLADSEATNHITCDARNMAVKMDYAGKETLTVGNDTKLKISHTGSSAVASSLDKSPIILNNILHVPG
        H+A +     G +   SA++A    P     P WL DS A++HIT D +N+++K +Y G + L V N   L I+H GS+++ +    S + L+N+L VP 
Subjt:  HAAGNFNSQGGNNGGSSAYIAT---PKIVNDPKWLADSEATNHITCDARNMAVKMDYAGKETLTVGNDTKLKISHTGSSAVASSLDKSPIILNNILHVPG

Query:  IKRNLISIARLTADNNVIIEFHSDYCLVKDKVSRKVMLNGMLKNNLYQIELPSIQTPKSCARSSLSKSL-------------------------SSSDKC
        + +NLIS+++L   N V IEF   +  VKD  + +++L G+ ++N+Y++    +Q   S A+++++  L                          SS+KC
Subjt:  IKRNLISIARLTADNNVIIEFHSDYCLVKDKVSRKVMLNGMLKNNLYQIELPSIQTPKSCARSSLSKSL-------------------------SSSDKC

Query:  LG--SSNFH----AQKSMSCNVPLNLWHS----------------RFGHASSRVIQSVLKSC---NATSSAHLPSQEQLVTAVGSPTHDGSLSPIIGTLD
        +   S+  H    ++ S++ + PL + +S                 F   +S    + L        TSS  +P  + +     +P    +L+P I + +
Subjt:  LG--SSNFH----AQKSMSCNVPLNLWHS----------------RFGHASSRVIQSVLKSC---NATSSAHLPSQEQLVTAVGSPTHDGSLSPIIGTLD

Query:  TGVSDIP---PVEDQCDSHPP--VVSEPGD-----------------TVVQPYVPGSHPMQTRAKSGIFKPKNWGVFVANSTTVSSEVEPKSVKEALQCS
            + P   P+E Q   H P    SEP                   T+  P+   +H M TR K+GIFKPK   +F      +    EP  V +AL+  
Subjt:  TGVSDIP---PVEDQCDSHPP--VVSEPGD-----------------TVVQPYVPGSHPMQTRAKSGIFKPKNWGVFVANSTTVSSEVEPKSVKEALQCS

Query:  SWKDAVNIEMTALIANNTWTLVPPPPNLNLIG-----------------------------------------------------FAVAKGWSIHQLDVN
         WK A++ E TAL+ N TW+LVPP P+ N+IG                                                      A++K W++ Q+DVN
Subjt:  SWKDAVNIEMTALIANNTWTLVPPPPNLNLIG-----------------------------------------------------FAVAKGWSIHQLDVN

Query:  NAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLLNWGFGNSRADNSVFYFVSSNMVILVLIYVDEILVTGDNNSFTDSF
        NAFLNG L E VYM QP GF+ QS P++VCKLH ++YGL+Q+PRAWY+   + + N+GF  S++D S+F +   ++V   L+YVD+IL+TG++++F D+F
Subjt:  NAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLLNWGFGNSRADNSVFYFVSSNMVILVLIYVDEILVTGDNNSFTDSF

Query:  VKRLNSTFALKDLSPLSYY
           L + F+LKD+   S++
Subjt:  VKRLNSTFALKDLSPLSYY

GAU51573.1 hypothetical protein TSUD_85540 [Trifolium subterraneum]1.0e-7934.53Show/hide
Query:  CFLRFEESFN-DPHAAGNFNSQGGNNGGSSAYIATPKIVNDPKWLADSEATNHITCDARNMAVKMDYAGKETLTVGNDTKLKISHTGSSAVASSLDKSPI
        CF RF+++++   H+A + N          A++A+   V D  W  DS A+NH+T    N     ++ GK +L VGN  KLKI  TGSS + S      +
Subjt:  CFLRFEESFN-DPHAAGNFNSQGGNNGGSSAYIATPKIVNDPKWLADSEATNHITCDARNMAVKMDYAGKETLTVGNDTKLKISHTGSSAVASSLDKSPI

Query:  ILNNILHVPGIKRNLISIARLTADNNVIIEFHSDYCLVKDKVSRKVMLNGMLKNNLYQIELPSIQTPKSCARSSLSKSLSSSDKCLGSSNFHAQKSMSCN
         L++IL+VP I +NL+S+++L ADNN++++F  + C VKDK++ KV+L G+LK+ LYQ+ L + + P   A  S+ +S                      
Subjt:  ILNNILHVPGIKRNLISIARLTADNNVIIEFHSDYCLVKDKVSRKVMLNGMLKNNLYQIELPSIQTPKSCARSSLSKSLSSSDKCLGSSNFHAQKSMSCN

Query:  VPLNLWHSRFGHASSRVIQSVLKSCNATSSAHLPSQEQLVTAVGSPTHDGSLSPIIGTLDTGVSDIPPVEDQC--DSHPPVVSEPGDTVVQPYVPGSHPM
             WH R GH ++ ++  VLKSC++ + A  P  E+ + A  +      ++      DT  +D    ED+   +  P +V +             + +
Subjt:  VPLNLWHSRFGHASSRVIQSVLKSCNATSSAHLPSQEQLVTAVGSPTHDGSLSPIIGTLDTGVSDIPPVEDQC--DSHPPVVSEPGDTVVQPYVPGSHPM

Query:  QTRAKSGIFKPKNWGVFVANSTTVSSEVEPKSVKEALQCSSWKDAVNIEMTALIANNTWTLVPPPPNLNLI--------------------------GF-
         TR+KSGI KPK    +V    T +  +EP + KEAL    WK+A+  E  AL++N TW LVP     N++                          GF 
Subjt:  QTRAKSGIFKPKNWGVFVANSTTVSSEVEPKSVKEALQCSSWKDAVNIEMTALIANNTWTLVPPPPNLNLI--------------------------GF-

Query:  --------------------------AVAKGWSIHQLDVNNAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLLNWGFG
                                  AV   W + QLD+NNAFLNG LKE V+M+QP GF+D ++P+++CKL  AIYGL+Q+PRAW+D  K  LLNWGF 
Subjt:  --------------------------AVAKGWSIHQLDVNNAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLLNWGFG

Query:  NSRADNSVFYFVSSNMVILVLIYVDEILVTGDNNSFTDSFVKRLNSTFALKDLSPLSYY
        N+++D+S+F     + +  +LIYVD+I+VTG+NN F  +F+K+LN  F+LKDL  L Y+
Subjt:  NSRADNSVFYFVSSNMVILVLIYVDEILVTGDNNSFTDSFVKRLNSTFALKDLSPLSYY

KAG7563269.1 Reverse transcriptase RNA-dependent DNA polymerase [Arabidopsis suecica]2.6e-6732.66Show/hide
Query:  AYIATPKIVNDPKWLADSEATNHITCDARNMAVKMDYAGKETLTVGNDTKLKISHTGSSAVASSLDKSPIILNNILHVPGIKRNLISIARLTADNNVIIE
        A++AT  + N   WL DS AT+H+T D  N+++   Y G E +T+ + + + ISHTGS+ + +      + LN++L+VP +++NLIS+ R+   N V +E
Subjt:  AYIATPKIVNDPKWLADSEATNHITCDARNMAVKMDYAGKETLTVGNDTKLKISHTGSSAVASSLDKSPIILNNILHVPGIKRNLISIARLTADNNVIIE

Query:  FHSDYCLVKDKVSRKVMLNGMLKNNLYQ------------------IELPSIQTPKS---CARSSLSKSLSSSDKCLGSSNFHAQKSMSCNVPLNLWHSR
        F   +  VKD  +   +L G  KN LY+                  +  P IQ P     C+    S S S+      +SN  + + +S   P     + 
Subjt:  FHSDYCLVKDKVSRKVMLNGMLKNNLYQ------------------IELPSIQTPKS---CARSSLSKSLSSSDKCLGSSNFHAQKSMSCNVPLNLWHSR

Query:  FGHASSRVIQSVLKSCNATSSAHLPSQEQLVTAVGSPTHDGSLSPIIGTLDTGVSDIPPVEDQCDSHPPVVSEPGDTVVQPYVPGSHPMQTRAKSGIFKP
          H     I   L   +  +++  PS         SP+H  S SP          +  P      S  P+   P      P  P  HPM+TR+K+ I KP
Subjt:  FGHASSRVIQSVLKSCNATSSAHLPSQEQLVTAVGSPTHDGSLSPIIGTLDTGVSDIPPVEDQCDSHPPVVSEPGDTVVQPYVPGSHPMQTRAKSGIFKP

Query:  KNWGVFVANSTTVSSEVEPKSVKEALQCSSWKDAVNIEMTALIANNTWTLVPPPPNLNLIG---------------------------------------
        K     VA +T +  ++ PK+V EALQ  +W+ A+  E+ A   N T+ LVPP    N+IG                                       
Subjt:  KNWGVFVANSTTVSSEVEPKSVKEALQCSSWKDAVNIEMTALIANNTWTLVPPPPNLNLIG---------------------------------------

Query:  --------------FAVAKGWSIHQLDVNNAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLLNWGFGNSRADNSVFYF
                       AV+KGWSI Q+DVNNAFL G L E VY+KQP GF+D+  P+YVC+L  A+YGL+Q+PRAWY + +  LL+ GF NS AD S+F  
Subjt:  --------------FAVAKGWSIHQLDVNNAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLLNWGFGNSRADNSVFYF

Query:  VSSNMVILVLIYVDEILVTGDNNSFTDSFVKRLNSTFALKDLSPLSYY
         +   +  VL+YVD++L+TG+N +F  +F+  L++ F+LKDL  +SY+
Subjt:  VSSNMVILVLIYVDEILVTGDNNSFTDSFVKRLNSTFALKDLSPLSYY

RVW72548.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]7.2e-6229.22Show/hide
Query:  AYIATPKIVNDPKWLADSEATNHITCDARNMAVKMDYAGKETLTVGNDTKLKISHTGSSAVASSLDKSPIILNNILHVPGIKRNLISIARLTADNNVIIE
        A +A+   ++D  W  D  AT+H++     ++    Y G + + VGN   L+I HT ++   SS       L  +LHVP I  NLIS+++  ADNN   E
Subjt:  AYIATPKIVNDPKWLADSEATNHITCDARNMAVKMDYAGKETLTVGNDTKLKISHTGSSAVASSLDKSPIILNNILHVPGIKRNLISIARLTADNNVIIE

Query:  FHSDYCLVKDKVSRKVMLNGMLKNNLYQIELPSIQTPKSCARSSLSKSLSSSDKCLGSSNFHAQKSMSCNVPLNLWHSRFGHASSRVIQSVLKSCNATSS
        FH  +  VKD+V++K++L G L++ LY+     + +P +   SS  +S                 ++S      LWHSR GH +  +++ +L SCN +  
Subjt:  FHSDYCLVKDKVSRKVMLNGMLKNNLYQIELPSIQTPKSCARSSLSKSLSSSDKCLGSSNFHAQKSMSCNVPLNLWHSRFGHASSRVIQSVLKSCNATSS

Query:  AH---------------LPSQEQL----------------VTAVGSPT-----------------------HDGSLS-----------------------
         H               LP    +                 T++ S T                        D +LS                       
Subjt:  AH---------------LPSQEQL----------------VTAVGSPT-----------------------HDGSLS-----------------------

Query:  ---------------------------PIIGTLD--TGVSDIPPVEDQCDSHPPVVSEPGDT-------------------------------VVQPYVP
                                   P   TLD  + V  IP +     S PP+ S P  T                                 +P+  
Subjt:  ---------------------------PIIGTLD--TGVSDIPPVEDQCDSHPPVVSEPGDT-------------------------------VVQPYVP

Query:  GSHPMQTRAKSGIFKPKNWGVFVANSTTVSSEVEPKSVKEALQCSSWKDAVNIEMTALIANNTWTLVPPPPNLNLIG-----------------------
          HPM TRAK+GI K K     V  S+ +S   EP +  +A++ S+W  A+  E +AL  NNTW LVPPP N N+IG                       
Subjt:  GSHPMQTRAKSGIFKPKNWGVFVANSTTVSSEVEPKSVKEALQCSSWKDAVNIEMTALIANNTWTLVPPPPNLNLIG-----------------------

Query:  ------------------------------FAVAKGWSIHQLDVNNAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLL
                                       A++  WS+HQLDV NAFL+G L+E V+M QP GFI+   P +VCKL+ A+YGL+Q+PRAWY +   +LL
Subjt:  ------------------------------FAVAKGWSIHQLDVNNAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLL

Query:  NWGFGNSRADNSVFYFVSSNMVILVLIYVDEILVTGDNNSFTDSFVKRLNSTFALKDLSPLSYY
         WGF  SRAD+S+F   S++ V+++LIYVD+ILVTG N++   SF+ RLNS+FAL+DL  ++Y+
Subjt:  NWGFGNSRADNSVFYFVSSNMVILVLIYVDEILVTGDNNSFTDSFVKRLNSTFALKDLSPLSYY

RZB67542.1 Retrovirus-related Pol polyprotein from transposon RE1 [Glycine soja]3.1e-6529.69Show/hide
Query:  SAYIATPKIVNDPKWLADSEATNHITCDARNMAVKMDYAGKETLTVGNDTKLKISHTGSSAVASSLDKSPIILNNILHVPGIKRNLISIARLTADNNVII
        SA+IA+P    D +W  DS A+NH+T     +    +  GK +L VGN  +L I  +GS+ +      + + L+N+L+VP I +NL+S+++LTADNN ++
Subjt:  SAYIATPKIVNDPKWLADSEATNHITCDARNMAVKMDYAGKETLTVGNDTKLKISHTGSSAVASSLDKSPIILNNILHVPGIKRNLISIARLTADNNVII

Query:  EFHSDYCLVKDKVSRKVMLNGMLKNNLYQIELPSIQTPKSCARSSLSKSLSSSDKCLGSSNFHAQKSMSCNVPLNLWHSRFGHASSRVIQSVLKSCNATS
        EF ++ C VKDK++ K +L G L++ LYQ+              S  KS  + D C   S           V  N WH + GH +++V++ VLK+CN  +
Subjt:  EFHSDYCLVKDKVSRKVMLNGMLKNNLYQIELPSIQTPKSCARSSLSKSLSSSDKCLGSSNFHAQKSMSCNVPLNLWHSRFGHASSRVIQSVLKSCNATS

Query:  SAH-------------------------------------------------------------------LPSQEQLVTAV-------------------
        S++                                                                   L  + + +TA                    
Subjt:  SAH-------------------------------------------------------------------LPSQEQLVTAV-------------------

Query:  ---GSPTHDGSLSPIIGTLDTGVSDIPPVEDQ------CDSHPPVVSEPG-DTVV--------------------------QPYVPGSHPMQTRAKSGIF
           G  T   ++ P I  ++    D+   EDQ       ++      EP  DT V                          Q     +H M+TR+K+GI+
Subjt:  ---GSPTHDGSLSPIIGTLDTGVSDIPPVEDQ------CDSHPPVVSEPG-DTVV--------------------------QPYVPGSHPMQTRAKSGIF

Query:  KPKNWGVFVANSTTVSSEVEPKSVKEALQCSSWKDAVNIEMTALIANNTWTLVPPPPNLNLI--------------------------GF----------
        KPK    ++  + T   + EP++V EA     WK+A++ E  AL AN+TW LVP     N+I                          GF          
Subjt:  KPKNWGVFVANSTTVSSEVEPKSVKEALQCSSWKDAVNIEMTALIANNTWTLVPPPPNLNLI--------------------------GF----------

Query:  -----------------AVAKGWSIHQLDVNNAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLLNWGFGNSRADNSVF
                         AV   W + QLD+NNAFLNG LKE V+M QP G+ID +RP ++CKL  AIYGL+Q+PRAW+D+ K TLL WGF N+++D+S+F
Subjt:  -----------------AVAKGWSIHQLDVNNAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLLNWGFGNSRADNSVF

Query:  YFVSSNMVILVLIYVDEILVTGDNNSFTDSFVKRLNSTFALKDLSPLSYY
            ++ + L+LIYVD+I+VTG N  F ++F+ +LN  F+LKDL  L Y+
Subjt:  YFVSSNMVILVLIYVDEILVTGDNNSFTDSFVKRLNSTFALKDLSPLSYY

TrEMBL top hitse value%identityAlignment
A0A2N9J6B8 Uncharacterized protein3.8e-6931.4Show/hide
Query:  CFLRFEESFNDPHAAGNFNSQGGNNGGSSAYIATPKIVNDPKWLADSEATNHITCDARNMAVKM-DYAGKETLTVGNDTKLKISHTGSSAVASSLDKSPI
        C+ RF+ +F    +A              AY +T +   DP W  D+ ATNH+T D  N+ V+  +Y G + + VGN   L ++HTG+S +  S   S  
Subjt:  CFLRFEESFNDPHAAGNFNSQGGNNGGSSAYIATPKIVNDPKWLADSEATNHITCDARNMAVKM-DYAGKETLTVGNDTKLKISHTGSSAVASSLDKSPI

Query:  ILNNILHVPGIKRNLISIARLTADNNVIIEFHSDYCLVKDKVSRKVMLNGMLKNNLYQIELPSIQTPKSCARSSLSKSLSSSDKCLGSSNFHAQKSMSCN
        ILNN+LHVP I +NLIS+ + T+D +  +EFH  Y LVKD+ ++K++  G  K+ LY    P   +  S    +L    +S D+                
Subjt:  ILNNILHVPGIKRNLISIARLTADNNVIIEFHSDYCLVKDKVSRKVMLNGMLKNNLYQIELPSIQTPKSCARSSLSKSLSSSDKCLGSSNFHAQKSMSCN

Query:  VPLNLWHSRFGHASSRVIQSVL----------KSCNATSSAHLPSQEQLVTAVGSPTHDGSLSPIIGTLDTGVSDIPPVEDQCDSHPPVVSEPGDTVVQP
             WHSR GH + +V+  +L           + + +  A L S+ + +    SPT   +   +I T    V   P       S PP  ++P  +   P
Subjt:  VPLNLWHSRFGHASSRVIQSVL----------KSCNATSSAHLPSQEQLVTAVGSPTHDGSLSPIIGTLDTGVSDIPPVEDQCDSHPPVVSEPGDTVVQP

Query:  YVP--------GSHPMQTRAKSGIFKPKNW-----------GVFVANSTTVSSEVEPKSVKEALQCSSWKDAVNIEMTALIANNTWTLVPPPPNLNLIG-
         +P         SHPM TR+K  I KPK +            +   N  ++S   EP     A++   W++A+N E  AL+ N+TWTLVP     NL+G 
Subjt:  YVP--------GSHPMQTRAKSGIFKPKNW-----------GVFVANSTTVSSEVEPKSVKEALQCSSWKDAVNIEMTALIANNTWTLVPPPPNLNLIG-

Query:  ----------------------------------------------------FAVAKGWSIHQLDVNNAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLH
                                                             A++K W + QLDV NAFL+GCL E VYM QP GF     P++VCKLH
Subjt:  ----------------------------------------------------FAVAKGWSIHQLDVNNAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLH

Query:  NAIYGLRQSPRAWYDQFKATLLNWGFGNSRADNSVFYFVSSNMVILVLIYVDEILVTGDNNSFTDSFVKRLNSTFALKDLSPLSYY
         A+YGL+Q+PRAW+ +    LL++GF  S++D+S+F +  ++  +  LIYVD+I++T    S   S + +L S FA+KDL  L+Y+
Subjt:  NAIYGLRQSPRAWYDQFKATLLNWGFGNSRADNSVFYFVSSNMVILVLIYVDEILVTGDNNSFTDSFVKRLNSTFALKDLSPLSYY

A0A2Z6P9A9 Reverse transcriptase Ty1/copia-type domain-containing protein4.8e-8034.53Show/hide
Query:  CFLRFEESFN-DPHAAGNFNSQGGNNGGSSAYIATPKIVNDPKWLADSEATNHITCDARNMAVKMDYAGKETLTVGNDTKLKISHTGSSAVASSLDKSPI
        CF RF+++++   H+A + N          A++A+   V D  W  DS A+NH+T    N     ++ GK +L VGN  KLKI  TGSS + S      +
Subjt:  CFLRFEESFN-DPHAAGNFNSQGGNNGGSSAYIATPKIVNDPKWLADSEATNHITCDARNMAVKMDYAGKETLTVGNDTKLKISHTGSSAVASSLDKSPI

Query:  ILNNILHVPGIKRNLISIARLTADNNVIIEFHSDYCLVKDKVSRKVMLNGMLKNNLYQIELPSIQTPKSCARSSLSKSLSSSDKCLGSSNFHAQKSMSCN
         L++IL+VP I +NL+S+++L ADNN++++F  + C VKDK++ KV+L G+LK+ LYQ+ L + + P   A  S+ +S                      
Subjt:  ILNNILHVPGIKRNLISIARLTADNNVIIEFHSDYCLVKDKVSRKVMLNGMLKNNLYQIELPSIQTPKSCARSSLSKSLSSSDKCLGSSNFHAQKSMSCN

Query:  VPLNLWHSRFGHASSRVIQSVLKSCNATSSAHLPSQEQLVTAVGSPTHDGSLSPIIGTLDTGVSDIPPVEDQC--DSHPPVVSEPGDTVVQPYVPGSHPM
             WH R GH ++ ++  VLKSC++ + A  P  E+ + A  +      ++      DT  +D    ED+   +  P +V +             + +
Subjt:  VPLNLWHSRFGHASSRVIQSVLKSCNATSSAHLPSQEQLVTAVGSPTHDGSLSPIIGTLDTGVSDIPPVEDQC--DSHPPVVSEPGDTVVQPYVPGSHPM

Query:  QTRAKSGIFKPKNWGVFVANSTTVSSEVEPKSVKEALQCSSWKDAVNIEMTALIANNTWTLVPPPPNLNLI--------------------------GF-
         TR+KSGI KPK    +V    T +  +EP + KEAL    WK+A+  E  AL++N TW LVP     N++                          GF 
Subjt:  QTRAKSGIFKPKNWGVFVANSTTVSSEVEPKSVKEALQCSSWKDAVNIEMTALIANNTWTLVPPPPNLNLI--------------------------GF-

Query:  --------------------------AVAKGWSIHQLDVNNAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLLNWGFG
                                  AV   W + QLD+NNAFLNG LKE V+M+QP GF+D ++P+++CKL  AIYGL+Q+PRAW+D  K  LLNWGF 
Subjt:  --------------------------AVAKGWSIHQLDVNNAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLLNWGFG

Query:  NSRADNSVFYFVSSNMVILVLIYVDEILVTGDNNSFTDSFVKRLNSTFALKDLSPLSYY
        N+++D+S+F     + +  +LIYVD+I+VTG+NN F  +F+K+LN  F+LKDL  L Y+
Subjt:  NSRADNSVFYFVSSNMVILVLIYVDEILVTGDNNSFTDSFVKRLNSTFALKDLSPLSYY

A0A803NU85 Uncharacterized protein3.5e-7030.07Show/hide
Query:  CFLRFEESFNDPHAAGNFNSQGGNNGGSSAYIATPKIVNDPKWLADSEATNHITCDARNMAVKMDYAGKETLTVGNDTKLKISHTGSSAVASSLDKSPII
        C+ R++ESF         N++  ++   SA +A P+++ND  W ADS A+NH+T D   +  K +Y GKE +T+G+ +KL I H G+  + S    SP++
Subjt:  CFLRFEESFNDPHAAGNFNSQGGNNGGSSAYIATPKIVNDPKWLADSEATNHITCDARNMAVKMDYAGKETLTVGNDTKLKISHTGSSAVASSLDKSPII

Query:  LNNILHVPGIKRNLISIARLTADNNVIIEFHSDYCLVKDKVSRKVMLNGMLKNNLYQIELPSIQTPKSCARSSLSKSLSSSDKCLGSSNFHAQKSMSCNV
        L+N+LHVP I +NLIS+++LT+DNNV +EF SD C+VK++ + +V+L G LK+ LYQ+  PS        RSS S S S S      S+      +  N 
Subjt:  LNNILHVPGIKRNLISIARLTADNNVIIEFHSDYCLVKDKVSRKVMLNGMLKNNLYQIELPSIQTPKSCARSSLSKSLSSSDKCLGSSNFHAQKSMSCNV

Query:  PLNLWHSRFGHASSRVIQSVLK-------------------------------------------------------SCNATS--SAHLPSQEQLVTAVG
          ++WH + GH S+ V+  VLK                                                       SC  TS  +     + + +  +G
Subjt:  PLNLWHSRFGHASSRVIQSVLK-------------------------------------------------------SCNATS--SAHLPSQEQLVTAVG

Query:  SPTHDGSLSP-------------IIGTLDT-------------------------GVSDIPPVED-----------QC------DSHP--PVVSEPGDTV
              +  P             +I  L T                         GV+  P +             +C      +SH     +S  G   
Subjt:  SPTHDGSLSP-------------IIGTLDT-------------------------GVSDIPPVED-----------QC------DSHP--PVVSEPGDTV

Query:  VQ----------PYVPG-----------------------------------------SHPMQTRA----KSGIFKPKNWGVFVANSTTVSSEVEPKSVK
        +           P+ PG                                         + P+   A    ++GIFKP+   VF+  +T      EP SV+
Subjt:  VQ----------PYVPG-----------------------------------------SHPMQTRA----KSGIFKPKNWGVFVANSTTVSSEVEPKSVK

Query:  EALQCSSWKDAVNIEMTALIANNTWTLVPPPPNLNLIG-----------------------------------------------------FAVAKGWSI
        +AL    W  A+ +E+ AL  N TW LVPP P+ N++G                                                      AV+K W I
Subjt:  EALQCSSWKDAVNIEMTALIANNTWTLVPPPPNLNLIG-----------------------------------------------------FAVAKGWSI

Query:  HQLDVNNAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLLNWGFGNSRADNSVFYFVSSNMVILVLIYVDEILVTGDNN
         QLD+NNAFLNG L+E VYM QP GF D  +P YVCKL  +IYGL+Q+PRAWY+Q K TL  W F NS+AD+S+F F   N VI+VLIYVD+I+VTG ++
Subjt:  HQLDVNNAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLLNWGFGNSRADNSVFYFVSSNMVILVLIYVDEILVTGDNN

Query:  SFTDSFVKRLNSTFALKDLSPLSYY
           D F+ +LN +F+LKDL PL Y+
Subjt:  SFTDSFVKRLNSTFALKDLSPLSYY

A0A803NUC9 Uncharacterized protein1.3e-7734.01Show/hide
Query:  CFLRFEESF--NDPHAAGNFNSQGGNNGGSSAYIATPKIVNDPKWLADSEATNHITCDARNMAVKMDYAGKETLTVGNDTKLKISHTGSSAVASSLDKSP
        C+ R++E++  N P+ AG+ + Q       SA IATP+++ND  W ADS A+NH+T D   +  K +Y GKE +T+G+  KL ISH GS  + +    +P
Subjt:  CFLRFEESF--NDPHAAGNFNSQGGNNGGSSAYIATPKIVNDPKWLADSEATNHITCDARNMAVKMDYAGKETLTVGNDTKLKISHTGSSAVASSLDKSP

Query:  IILNNILHVPGIKRNLISIARLTADNNVIIEFHSDYCLVKDKVSRKVMLNGMLKNNLYQIELPSIQTPKSCARSSLSKSLSSSD--------------KC
        ++L N+LH+P I +NLIS+++LT DNNV +EF SD+C VKD+ + KV+L+  LK+ LYQ+   S Q+ +S +  S   +++ +               + 
Subjt:  IILNNILHVPGIKRNLISIARLTADNNVIIEFHSDYCLVKDKVSRKVMLNGMLKNNLYQIELPSIQTPKSCARSSLSKSLSSSD--------------KC

Query:  LGSSNFHAQKSM----SC----------------------------NVPLNLWHSRF---------------GHASS-RVIQS------VLKSCNATSSA
           S   AQ  +    SC                             +P   W   F                H S   VI S       LKS   T  +
Subjt:  LGSSNFHAQKSM----SC----------------------------NVPLNLWHSRF---------------GHASS-RVIQS------VLKSCNATSSA

Query:  ---HLP-SQEQLVTAVGSPTHDGSLSPIIGT-LDTG--VSDIPPVEDQCDSHPPVVSEPGDTVVQPYVPGS-----HPMQTRAKSGIFKPKNWGVFVANS
            LP + +    +   P+H  S    + T +D G  +S +P    +    P   S+P   V    +P +     HPM TR K GIFKP+   + ++ +
Subjt:  ---HLP-SQEQLVTAVGSPTHDGSLSPIIGT-LDTG--VSDIPPVEDQCDSHPPVVSEPGDTVVQPYVPGS-----HPMQTRAKSGIFKPKNWGVFVANS

Query:  TTVSSEVEPKSVKEALQCSSWKDAVNIEMTALIANNTWTLVPPPPNLNLIG-------------------------------------------------
        T+     EP SV+EAL    W +A+  EM AL  N TW LVPP P+ +++G                                                 
Subjt:  TTVSSEVEPKSVKEALQCSSWKDAVNIEMTALIANNTWTLVPPPPNLNLIG-------------------------------------------------

Query:  ----FAVAKGWSIHQLDVNNAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLLNWGFGNSRADNSVFYFVSSNMVILVL
             AVA  W I QLD+NNAFLNG L E VYM QP GF D ++P YVCKL  +IYGL+Q+PRAWY++ K TL  W F NS+AD S F     + VI+VL
Subjt:  ----FAVAKGWSIHQLDVNNAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLLNWGFGNSRADNSVFYFVSSNMVILVL

Query:  IYVDEILVTGDNNSFTDSFVKRLNSTFALKDLSPLSYY
        IYVD+I+VTG  +     F+ RLN  F+LKDL  L Y+
Subjt:  IYVDEILVTGDNNSFTDSFVKRLNSTFALKDLSPLSYY

A0A803PYD1 Uncharacterized protein9.5e-6829.72Show/hide
Query:  CFLRFEESFNDPHAAGNFNSQGGNNGGSSAYIATPKIVNDPKWLADSEATNHITCDARNMAVKMDYAGKETLTVGNDTKLKISHTGSSAVASSLDKSPII
        C  RF+ES+        FN         S  +ATP+ ++D  W ADS ATNH+T D+  +  K++Y GKE + VG+ TKL I H GS+ V +  D   +I
Subjt:  CFLRFEESFNDPHAAGNFNSQGGNNGGSSAYIATPKIVNDPKWLADSEATNHITCDARNMAVKMDYAGKETLTVGNDTKLKISHTGSSAVASSLDKSPII

Query:  LNNILHVPGIKRNLISIARLTADNNVIIEFHSDYCLVKDKVSRKVMLNGMLKNNLYQIELPSIQTPKSCARSSLSKSLSSSDKCLGSSNFHAQKSMSCNV
        L ++LHVP I +NLISI+ LT+DN+V +EF SD+C VKD+ + KV+L   LK+ LYQ    ++           S S+S       S +F + K      
Subjt:  LNNILHVPGIKRNLISIARLTADNNVIIEFHSDYCLVKDKVSRKVMLNGMLKNNLYQIELPSIQTPKSCARSSLSKSLSSSDKCLGSSNFHAQKSMSCNV

Query:  PLNLWHSRFGHASSRVIQSVLKSCNA------TSSAH----------------------------------LPSQE-----------------QLVTAVG
          + WH R GH S+ V+  VL   N         S H                                  LP+ +                 + +   G
Subjt:  PLNLWHSRFGHASSRVIQSVLKSCNA------TSSAH----------------------------------LPSQE-----------------QLVTAVG

Query:  S-------------------------------------------------------PTHDGSLSPI----IGTLD--------------TGVSDIPPVED
        S                                                       P H G  +      + TLD              TG SD  P   
Subjt:  S-------------------------------------------------------PTHDGSLSPI----IGTLD--------------TGVSDIPPVED

Query:  QCDSHP----------------------------------PVVSEPGDTVVQPYVPGSHPMQTRAKSGIFKPKNWGVFVANSTTVSSEVEPKSVKEALQC
           S P                                  P+ S      VQ     +HPM TRAK+GIFKPK    +++++     +  P SV EALQ 
Subjt:  QCDSHP----------------------------------PVVSEPGDTVVQPYVPGSHPMQTRAKSGIFKPKNWGVFVANSTTVSSEVEPKSVKEALQC

Query:  SSWKDAVNIEMTALIANNTWTLVPPPPNLNLIGF--------------------AVAKG---------------------------------WSIHQLDV
          W  A++ E  AL    TW+LVP     N++G                      VAKG                                 W + Q+++
Subjt:  SSWKDAVNIEMTALIANNTWTLVPPPPNLNLIGF--------------------AVAKG---------------------------------WSIHQLDV

Query:  NNAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLLNWGFGNSRADNSVFYFVSSNMVILVLIYVDEILVTGDNNSFTDS
        NNAFLNG   E VYM QP GF D  +PD+VCKLH +IYGL+Q+PRAWYDQ ++ LL+W F NS+AD+S F      + I++L+YVD+I+VTG+N+   +S
Subjt:  NNAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLLNWGFGNSRADNSVFYFVSSNMVILVLIYVDEILVTGDNNSFTDS

Query:  FVKRLNSTFALKDLSPLSYY
        F+ RLN  F+LK L  L Y+
Subjt:  FVKRLNSTFALKDLSPLSYY

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.4e-2229.55Show/hide
Query:  VANSTTVSSEVEPKSVKEAL---QCSSWKDAVNIEMTALIANNTWTLVPPPPNLNLI--------------------GFAVAKGWS--------------
        V N+ T+ ++V P S  E       SSW++A+N E+ A   NNTWT+   P N N++                       VA+G++              
Subjt:  VANSTTVSSEVEPKSVKEAL---QCSSWKDAVNIEMTALIANNTWTLVPPPPNLNLI--------------------GFAVAKGWS--------------

Query:  -------------------IHQLDVNNAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLLNWGFGNSRADNSVFYFVSS
                           +HQ+DV  AFLNG LKE +YM+ P G    S  D VCKL+ AIYGL+Q+ R W++ F+  L    F NS  D  ++     
Subjt:  -------------------IHQLDVNNAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLLNWGFGNSRADNSVFYFVSS

Query:  NM--VILVLIYVDEILVTGDNNSFTDSFVKRLNSTFALKDLSPLSYY
        N+   I VL+YVD++++   + +  ++F + L   F + DL+ + ++
Subjt:  NM--VILVLIYVDEILVTGDNNSFTDSFVKRLNSTFALKDLSPLSYY

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.7e-1935.07Show/hide
Query:  LIGFAVAKGWSIHQLDVNNAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLLNWGFGNSRADNSVFY-FVSSNMVILVL
        ++  A +    + QLDV  AFL+G L+E +YM+QP GF    +   VCKL+ ++YGL+Q+PR WY +F + + +  +  + +D  V++   S N  I++L
Subjt:  LIGFAVAKGWSIHQLDVNNAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLLNWGFGNSRADNSVFY-FVSSNMVILVL

Query:  IYVDEILVTGDNNSFTDSFVKRLNSTFALKDLSP
        +YVD++L+ G +          L+ +F +KDL P
Subjt:  IYVDEILVTGDNNSFTDSFVKRLNSTFALKDLSP

P25600 Putative transposon Ty5-1 protein YCL074W8.7e-1832.52Show/hide
Query:  LDVNNAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLLNWGFGNSRADNSVFYFVSSNMVILVLIYVDEILVTGDNNSF
        +DV+ AFLN  + E +Y+KQP GF+++  PDYV +L+  +YGL+Q+P  W +    TL   GF     ++ +++  +S+  I + +YVD++LV   +   
Subjt:  LDVNNAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLLNWGFGNSRADNSVFYFVSSNMVILVLIYVDEILVTGDNNSF

Query:  TDSFVKRLNSTFALKDLSPLSYY
         D   + L   +++KDL  +  +
Subjt:  TDSFVKRLNSTFALKDLSPLSYY

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.3e-4233.64Show/hide
Query:  SQEQLVTAVGSPTHDGSLSPIIGTLDTGVSDIPPVEDQCDSHPPVVSEPGDTVVQPYVPGSHPMQTRAKSGIFKPKNWGVFVANSTTVSSEVEPKSVKEA
        S  QL  ++ +P    S SP   T  +  S  P         PP +++  +   Q  +  +H M TRAK+GI KP       + + ++++E EP++  +A
Subjt:  SQEQLVTAVGSPTHDGSLSPIIGTLDTGVSDIPPVEDQCDSHPPVVSEPGDTVVQPYVPGSHPMQTRAKSGIFKPKNWGVFVANSTTVSSEVEPKSVKEA

Query:  LQCSSWKDAVNIEMTALIANNTWTLVPPPP--------------------NLN----------------------------------LIGFAVAKGWSIH
        L+   W++A+  E+ A I N+TW LVPPPP                    +LN                                  ++G AV + W I 
Subjt:  LQCSSWKDAVNIEMTALIANNTWTLVPPPP--------------------NLN----------------------------------LIGFAVAKGWSIH

Query:  QLDVNNAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLLNWGFGNSRADNSVFYFVSSNMVILVLIYVDEILVTGDNNS
        QLDVNNAFL G L + VYM QP GFID+ RP+YVCKL  A+YGL+Q+PRAWY + +  LL  GF NS +D S+F       ++ +L+YVD+IL+TG++ +
Subjt:  QLDVNNAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLLNWGFGNSRADNSVFYFVSSNMVILVLIYVDEILVTGDNNS

Query:  FTDSFVKRLNSTFALKDLSPLSYY
           + +  L+  F++KD   L Y+
Subjt:  FTDSFVKRLNSTFALKDLSPLSYY

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.4e-1540.29Show/hide
Query:  WLADSEATNHITCDARNMAVKMDYAGKETLTVGNDTKLKISHTGSSAVASSLDKSPIILNNILHVPGIKRNLISIARLTADNNVIIEFHSDYCLVKDKVS
        WL DS AT+HIT D  N+++   Y G + + V + + + ISHTGS+++  S    P+ L+NIL+VP I +NLIS+ RL   N V +EF      VKD  +
Subjt:  WLADSEATNHITCDARNMAVKMDYAGKETLTVGNDTKLKISHTGSSAVASSLDKSPIILNNILHVPGIKRNLISIARLTADNNVIIEFHSDYCLVKDKVS

Query:  RKVMLNGMLKNNLYQIELPSIQTPKSCARSSLSKSLSSS
           +L G  K+ LY+  + S Q P S   S  SK+  SS
Subjt:  RKVMLNGMLKNNLYQIELPSIQTPKSCARSSLSKSLSSS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.0e-4232.63Show/hide
Query:  SSAHLPSQEQLVTAVGSPTHDGSLSPIIGTLDTGVSDIPPVEDQCDSHPPVVSEPGDTVVQPYVP-GSHPMQTRAKSGIFKPKNWGVFVANSTTVSSEVE
        SS H+P+    ++   SP+   + +P +                    PPV+  P    V    P  +H M TRAK GI KP       + +T++++  E
Subjt:  SSAHLPSQEQLVTAVGSPTHDGSLSPIIGTLDTGVSDIPPVEDQCDSHPPVVSEPGDTVVQPYVP-GSHPMQTRAKSGIFKPKNWGVFVANSTTVSSEVE

Query:  PKSVKEALQCSSWKDAVNIEMTALIANNTWTLVPPPP--------------------NLN----------------------------------LIGFAV
        P++  +A++   W+ A+  E+ A I N+TW LVPPPP                    +LN                                  ++G AV
Subjt:  PKSVKEALQCSSWKDAVNIEMTALIANNTWTLVPPPP--------------------NLN----------------------------------LIGFAV

Query:  AKGWSIHQLDVNNAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLLNWGFGNSRADNSVFYFVSSNMVILVLIYVDEIL
         + W I QLDVNNAFL G L + VYM QP GF+D+ RPDYVC+L  AIYGL+Q+PRAWY + +  LL  GF NS +D S+F       +I +L+YVD+IL
Subjt:  AKGWSIHQLDVNNAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLLNWGFGNSRADNSVFYFVSSNMVILVLIYVDEIL

Query:  VTGDNNSFTDSFVKRLNSTFALKDLSPLSYY
        +TG++       +  L+  F++K+   L Y+
Subjt:  VTGDNNSFTDSFVKRLNSTFALKDLSPLSYY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.4e-1435.53Show/hide
Query:  AYIATPKIVNDPKWLADSEATNHITCDARNMAVKMDYAGKETLTVGNDTKLKISHTGSSAVASSLDKSPIILNNILHVPGIKRNLISIARLTADNNVIIE
        A +A     N   WL DS AT+HIT D  N++    Y G + + + + + + I+HTGS+++ +S     + LN +L+VP I +NLIS+ RL   N V +E
Subjt:  AYIATPKIVNDPKWLADSEATNHITCDARNMAVKMDYAGKETLTVGNDTKLKISHTGSSAVASSLDKSPIILNNILHVPGIKRNLISIARLTADNNVIIE

Query:  FHSDYCLVKDKVSRKVMLNGMLKNNLYQIELPSIQTPKSCARSSLSKSLSSS
        F      VKD  +   +L G  K+ LY+  + S Q     A S  SK+  SS
Subjt:  FHSDYCLVKDKVSRKVMLNGMLKNNLYQIELPSIQTPKSCARSSLSKSLSSS

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.5e-2830.42Show/hide
Query:  VSSEVEPKSVKEALQCSSWKDAVNIEMTALIANNTWTLVPPPPNLNLIGF--------------------AVAKGW------------------------
        ++   EP +  EA +   W  A++ E+ A+   +TW +   PPN   IG                      VAKG+                        
Subjt:  VSSEVEPKSVKEALQCSSWKDAVNIEMTALIANNTWTLVPPPPNLNLIGF--------------------AVAKGW------------------------

Query:  ---------SIHQLDVNNAFLNGCLKEAVYMKQPSGFI----DQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLLNWGFGNSRADNSVFYFVSSNMVIL
                 ++HQLD++NAFLNG L E +YMK P G+     D   P+ VC L  +IYGL+Q+ R W+ +F  TL+ +GF  S +D++ F  +++ + + 
Subjt:  ---------SIHQLDVNNAFLNGCLKEAVYMKQPSGFI----DQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLLNWGFGNSRADNSVFYFVSSNMVIL

Query:  VLIYVDEILVTGDNNSFTDSFVKRLNSTFALKDLSPLSYY
        VL+YVD+I++  +N++  D    +L S F L+DL PL Y+
Subjt:  VLIYVDEILVTGDNNSFTDSFVKRLNSTFALKDLSPLSYY

ATMG00810.1 DNA/RNA polymerases superfamily protein2.5e-0445Show/hide
Query:  VLIYVDEILVTGDNNSFTDSFVKRLNSTFALKDLSPLSYY
        +L+YVD+IL+TG +N+  +  + +L+STF++KDL P+ Y+
Subjt:  VLIYVDEILVTGDNNSFTDSFVKRLNSTFALKDLSPLSYY

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.6e-0644Show/hide
Query:  MQTRAKSGIFK--PKNWGVFVANSTTVSSEVEPKSVKEALQCSSWKDAVNIEMTALIANNTWTLVPPPPNLNLIG
        M TR+K+GI K  PK      + + T + + EPKSV  AL+   W  A+  E+ AL  N TW LVPPP N N++G
Subjt:  MQTRAKSGIFK--PKNWGVFVANSTTVSSEVEPKSVKEALQCSSWKDAVNIEMTALIANNTWTLVPPPPNLNLIG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAACGTGCCGCAACTTCCACAGGTTCATGAAGGTCCAGCAGCAGCAAACCCCCAGCAGAATCCGTTGCTGCAGCAAAACCCACTGTTTGAGCAAAATGAGCAGCG
AAATAATCAGGCTGAGAATCCTATCTTGATAGCGAACGATAGGACCAGAGCCATTCGAGCGTATGCTGTCCCGATGTTTAATGAGTTGAATCCAGGGATTGCACGTCCCC
AAATCCAAGCGGCGAATTTTGAAATGAAACCGGTAATGTTTCAGATGTTGCAAACCGTGGGGCAATTCCATGGTTTGACATCTGAAGACCCTCATTTACATCTTAAGTCT
TTTCTAGGAGTTAGATATTCTTTTGTAATTCAAGGAGTGCCTAGAGATGCTCTTAGATTAACTTTGTTCCCGTATTCTCTTAGAGATGGAGCAAAGTCATGGTTAAACTC
TTTTGCTCCAGGATCAATTAGGACGTGGGATGAGTTAGCTGAAAAATTTTTGATTATGCTGCTGAAATTGTTGCTCAATCACTCTCTCATCTCATATCTTCGAGCTTGTT
TCTTCAAGCATTCCCTTCCCATTCCTTATCACTATAGACTCCCACAAGTCATTCTTGGCCCGGAGAATAGTAGGGAAGACTCTAGAGGCTGTAAAATCGAAGAAAACTGG
AGAATATGCAAGCAGAAAACAGCAACAGCAGCTGTTGGTCGTGAGAGCGTGGAGACGCTCTTCATGAAGCGTCCCGAAGCTGTAGAAAAAAATTCCAGGCGTCCAGATTT
TCCAGAAAATTGGTTCGGTCCAGCTGCTGTAGCTCATTTACTATTACAGAGAACTTGTAGCTTGTGGGACATTAAGACATGGCAGGAGTTAAGTTCTATACTAGTAACTT
TTGAAGGCACATTAGCTAGATATACTACTCCCACCAATGTCACAAATGAACTTCCTGACTCGGCTGCACATTTAGTCTATAATCAACAAGCGTGTTTTCTTCGTTTTGAG
GAGAGCTTCAATGATCCTCATGCTGCTGGAAATTTCAATTCTCAGGGTGGTAACAATGGTGGCTCATCTGCTTACATAGCCACCCCTAAAATAGTAAATGATCCAAAATG
GTTGGCTGATAGTGAGGCAACTAATCACATTACATGTGATGCAAGAAACATGGCCGTGAAGATGGACTATGCTGGTAAGGAGACTCTTACAGTGGGTAACGACACTAAAC
TTAAAATATCTCATACTGGTTCTAGTGCAGTAGCTTCAAGTCTTGATAAATCTCCTATTATCTTGAATAATATTCTTCATGTGCCTGGAATCAAAAGGAATTTGATTAGT
ATTGCACGTCTCACTGCTGATAATAATGTAATTATTGAATTTCACTCTGATTACTGTCTTGTGAAAGACAAGGTATCAAGAAAGGTGATGTTGAACGGAATGCTTAAGAA
CAACCTCTATCAGATTGAGCTTCCTTCAATTCAAACTCCAAAGTCGTGTGCCAGGTCTTCTTTAAGTAAAAGTCTCAGTTCTTCTGATAAGTGTCTTGGTTCGTCTAATT
TTCATGCTCAAAAGTCCATGTCTTGTAATGTGCCTCTTAATCTTTGGCATAGTCGTTTTGGTCATGCCTCGTCTAGAGTTATTCAAAGTGTCTTGAAGTCCTGTAATGCA
ACATCATCAGCCCATTTACCTTCACAAGAGCAGTTGGTCACGGCTGTTGGGAGCCCTACACATGATGGTTCTTTGTCTCCTATTATTGGCACTCTTGATACTGGAGTTTC
TGATATTCCGCCTGTTGAAGATCAGTGTGATTCTCATCCGCCTGTTGTTTCTGAACCTGGTGATACTGTAGTACAACCTTATGTTCCTGGCTCTCATCCAATGCAAACTC
GAGCCAAAAGTGGCATATTTAAACCTAAGAATTGGGGTGTTTTTGTGGCTAACTCTACGACAGTTTCTTCTGAGGTTGAACCTAAGTCAGTCAAAGAGGCTCTTCAATGC
AGTTCTTGGAAGGATGCGGTGAACATTGAGATGACTGCCTTGATTGCTAATAATACTTGGACTTTAGTCCCGCCTCCGCCTAATCTCAATCTCATTGGCTTTGCGGTTGC
TAAAGGTTGGTCTATTCATCAGCTCGATGTCAACAATGCATTCTTAAATGGATGTTTGAAGGAAGCCGTGTATATGAAACAACCATCTGGTTTCATTGATCAATCTCGAC
CAGATTATGTGTGTAAATTGCACAATGCAATCTATGGTCTTCGTCAGTCTCCGCGAGCCTGGTATGATCAGTTTAAAGCCACTCTTTTGAATTGGGGTTTTGGTAACTCT
CGTGCAGATAATTCTGTGTTTTACTTTGTCTCCTCCAATATGGTTATCCTAGTTCTTATTTACGTGGACGAAATTCTGGTGACTGGCGATAATAACTCATTCACTGACAG
CTTTGTAAAGAGGTTGAATTCTACTTTTGCCTTAAAGGACTTAAGCCCGTTGAGCTACTATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAACGTGCCGCAACTTCCACAGGTTCATGAAGGTCCAGCAGCAGCAAACCCCCAGCAGAATCCGTTGCTGCAGCAAAACCCACTGTTTGAGCAAAATGAGCAGCG
AAATAATCAGGCTGAGAATCCTATCTTGATAGCGAACGATAGGACCAGAGCCATTCGAGCGTATGCTGTCCCGATGTTTAATGAGTTGAATCCAGGGATTGCACGTCCCC
AAATCCAAGCGGCGAATTTTGAAATGAAACCGGTAATGTTTCAGATGTTGCAAACCGTGGGGCAATTCCATGGTTTGACATCTGAAGACCCTCATTTACATCTTAAGTCT
TTTCTAGGAGTTAGATATTCTTTTGTAATTCAAGGAGTGCCTAGAGATGCTCTTAGATTAACTTTGTTCCCGTATTCTCTTAGAGATGGAGCAAAGTCATGGTTAAACTC
TTTTGCTCCAGGATCAATTAGGACGTGGGATGAGTTAGCTGAAAAATTTTTGATTATGCTGCTGAAATTGTTGCTCAATCACTCTCTCATCTCATATCTTCGAGCTTGTT
TCTTCAAGCATTCCCTTCCCATTCCTTATCACTATAGACTCCCACAAGTCATTCTTGGCCCGGAGAATAGTAGGGAAGACTCTAGAGGCTGTAAAATCGAAGAAAACTGG
AGAATATGCAAGCAGAAAACAGCAACAGCAGCTGTTGGTCGTGAGAGCGTGGAGACGCTCTTCATGAAGCGTCCCGAAGCTGTAGAAAAAAATTCCAGGCGTCCAGATTT
TCCAGAAAATTGGTTCGGTCCAGCTGCTGTAGCTCATTTACTATTACAGAGAACTTGTAGCTTGTGGGACATTAAGACATGGCAGGAGTTAAGTTCTATACTAGTAACTT
TTGAAGGCACATTAGCTAGATATACTACTCCCACCAATGTCACAAATGAACTTCCTGACTCGGCTGCACATTTAGTCTATAATCAACAAGCGTGTTTTCTTCGTTTTGAG
GAGAGCTTCAATGATCCTCATGCTGCTGGAAATTTCAATTCTCAGGGTGGTAACAATGGTGGCTCATCTGCTTACATAGCCACCCCTAAAATAGTAAATGATCCAAAATG
GTTGGCTGATAGTGAGGCAACTAATCACATTACATGTGATGCAAGAAACATGGCCGTGAAGATGGACTATGCTGGTAAGGAGACTCTTACAGTGGGTAACGACACTAAAC
TTAAAATATCTCATACTGGTTCTAGTGCAGTAGCTTCAAGTCTTGATAAATCTCCTATTATCTTGAATAATATTCTTCATGTGCCTGGAATCAAAAGGAATTTGATTAGT
ATTGCACGTCTCACTGCTGATAATAATGTAATTATTGAATTTCACTCTGATTACTGTCTTGTGAAAGACAAGGTATCAAGAAAGGTGATGTTGAACGGAATGCTTAAGAA
CAACCTCTATCAGATTGAGCTTCCTTCAATTCAAACTCCAAAGTCGTGTGCCAGGTCTTCTTTAAGTAAAAGTCTCAGTTCTTCTGATAAGTGTCTTGGTTCGTCTAATT
TTCATGCTCAAAAGTCCATGTCTTGTAATGTGCCTCTTAATCTTTGGCATAGTCGTTTTGGTCATGCCTCGTCTAGAGTTATTCAAAGTGTCTTGAAGTCCTGTAATGCA
ACATCATCAGCCCATTTACCTTCACAAGAGCAGTTGGTCACGGCTGTTGGGAGCCCTACACATGATGGTTCTTTGTCTCCTATTATTGGCACTCTTGATACTGGAGTTTC
TGATATTCCGCCTGTTGAAGATCAGTGTGATTCTCATCCGCCTGTTGTTTCTGAACCTGGTGATACTGTAGTACAACCTTATGTTCCTGGCTCTCATCCAATGCAAACTC
GAGCCAAAAGTGGCATATTTAAACCTAAGAATTGGGGTGTTTTTGTGGCTAACTCTACGACAGTTTCTTCTGAGGTTGAACCTAAGTCAGTCAAAGAGGCTCTTCAATGC
AGTTCTTGGAAGGATGCGGTGAACATTGAGATGACTGCCTTGATTGCTAATAATACTTGGACTTTAGTCCCGCCTCCGCCTAATCTCAATCTCATTGGCTTTGCGGTTGC
TAAAGGTTGGTCTATTCATCAGCTCGATGTCAACAATGCATTCTTAAATGGATGTTTGAAGGAAGCCGTGTATATGAAACAACCATCTGGTTTCATTGATCAATCTCGAC
CAGATTATGTGTGTAAATTGCACAATGCAATCTATGGTCTTCGTCAGTCTCCGCGAGCCTGGTATGATCAGTTTAAAGCCACTCTTTTGAATTGGGGTTTTGGTAACTCT
CGTGCAGATAATTCTGTGTTTTACTTTGTCTCCTCCAATATGGTTATCCTAGTTCTTATTTACGTGGACGAAATTCTGGTGACTGGCGATAATAACTCATTCACTGACAG
CTTTGTAAAGAGGTTGAATTCTACTTTTGCCTTAAAGGACTTAAGCCCGTTGAGCTACTATTAA
Protein sequenceShow/hide protein sequence
MENVPQLPQVHEGPAAANPQQNPLLQQNPLFEQNEQRNNQAENPILIANDRTRAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHLKS
FLGVRYSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAEKFLIMLLKLLLNHSLISYLRACFFKHSLPIPYHYRLPQVILGPENSREDSRGCKIEENW
RICKQKTATAAVGRESVETLFMKRPEAVEKNSRRPDFPENWFGPAAVAHLLLQRTCSLWDIKTWQELSSILVTFEGTLARYTTPTNVTNELPDSAAHLVYNQQACFLRFE
ESFNDPHAAGNFNSQGGNNGGSSAYIATPKIVNDPKWLADSEATNHITCDARNMAVKMDYAGKETLTVGNDTKLKISHTGSSAVASSLDKSPIILNNILHVPGIKRNLIS
IARLTADNNVIIEFHSDYCLVKDKVSRKVMLNGMLKNNLYQIELPSIQTPKSCARSSLSKSLSSSDKCLGSSNFHAQKSMSCNVPLNLWHSRFGHASSRVIQSVLKSCNA
TSSAHLPSQEQLVTAVGSPTHDGSLSPIIGTLDTGVSDIPPVEDQCDSHPPVVSEPGDTVVQPYVPGSHPMQTRAKSGIFKPKNWGVFVANSTTVSSEVEPKSVKEALQC
SSWKDAVNIEMTALIANNTWTLVPPPPNLNLIGFAVAKGWSIHQLDVNNAFLNGCLKEAVYMKQPSGFIDQSRPDYVCKLHNAIYGLRQSPRAWYDQFKATLLNWGFGNS
RADNSVFYFVSSNMVILVLIYVDEILVTGDNNSFTDSFVKRLNSTFALKDLSPLSYY