; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028214 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028214
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr8:15582674..15585285
RNA-Seq ExpressionLag0028214
SyntenyLag0028214
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_006487889.1 uncharacterized protein LOC102617714 [Citrus sinensis]1.5e-7835.94Show/hide
Query:  MTKTKCGLDKVEKIRDLLNFECSFGVPCEGKSDGLAILWKEEINLHIRSFFMGHIDTTVKEDSG-WWRFTGLCGNPITSKRKNSWELLEKLSEDSNLPWI
        +++TK    ++E  R +L FE  F V   G   GLA+LW  +++L ++S+   HID  +  ++G  WR T + G+P + ++K++W LL +L+  S+LPW+
Subjt:  MTKTKCGLDKVEKIRDLLNFECSFGVPCEGKSDGLAILWKEEINLHIRSFFMGHIDTTVKEDSG-WWRFTGLCGNPITSKRKNSWELLEKLSEDSNLPWI

Query:  W--------------GGGILTRSSLLPRKREVEKETKLRCNLSERPWT---------IVISK----------IWGRNRLQGSIR--SAISRKEGEIIEIL
                       GG     S +   ++ V     L   L   P+T         I+ +K          +W +    G  +    +  K   I    
Subjt:  W--------------GGGILTRSSLLPRKREVEKETKLRCNLSERPWT---------IVISK----------IWGRNRLQGSIR--SAISRKEGEIIEIL

Query:  ADRDVIKYQELEKAEKELEVLLEEEEIYWKFRSREDWLRWGDRNTKWFHSKASSRRKQNKIEGLFDNMGKWITHEEELGKEASGYFEKLFATSSPHPEDI
        +  D     EL+K E +++ +L++EEI+WK RSR DWL+ GD+NTK+FH+KAS+RRK+N+I G+ D  GKW    +E+ +    +F  LF+T++P  E +
Subjt:  ADRDVIKYQELEKAEKELEVLLEEEEIYWKFRSREDWLRWGDRNTKWFHSKASSRRKQNKIEGLFDNMGKWITHEEELGKEASGYFEKLFATSSPHPEDI

Query:  KRTIEGISAKISEAQSRSLNAPFTKSEVEKTLKGINPSKAPGEDGAHALFYQNYWDIIGPKVTHVCLEVLNEGKDVGPFNKMLIALIPIKKKPTRMKEFR
            +  SAK++E  +  L+APF + E+ + L  + P+KAPG DG  A F+Q +W  +   V   CL +LN+  ++ P N   IALIP   KP  + EFR
Subjt:  KRTIEGISAKISEAQSRSLNAPFTKSEVEKTLKGINPSKAPGEDGAHALFYQNYWDIIGPKVTHVCLEVLNEGKDVGPFNKMLIALIPIKKKPTRMKEFR

Query:  PISLCNVIYKITAKALANRMKRVLNDIISPAQSAFVPSRLITDNVLIGFECIHSINLRKKGKIGQAALKLDIIQARLSRRQPLDSTSCSFSQLGLVSS
        PISLCNVIY+I AK++AN +K +L+ I+SP QSAF+ +RLITDN++IG+E ++ I   K  K G  ALKLDI +A    R   +   C+  +LG  S+
Subjt:  PISLCNVIYKITAKALANRMKRVLNDIISPAQSAFVPSRLITDNVLIGFECIHSINLRKKGKIGQAALKLDIIQARLSRRQPLDSTSCSFSQLGLVSS

XP_012477795.1 PREDICTED: uncharacterized protein LOC105793429 [Gossypium raimondii]6.7e-7437.92Show/hide
Query:  MTKTKCGLDKVEKIRDLLNFECSFGVPCEGKSDGLAILWKEEINLHIRSFFMGHIDTTVKEDSGW--WRFTGLCGNPITSKRKNSWELLEKLSEDSNLP-
        + +TK    ++E IR    F     V   G   G+ + W+EE+ +H+RS  + HID  VK +     WRFTG  G+P +  +  SW+LL+ L ++   P 
Subjt:  MTKTKCGLDKVEKIRDLLNFECSFGVPCEGKSDGLAILWKEEINLHIRSFFMGHIDTTVKEDSGW--WRFTGLCGNPITSKRKNSWELLEKLSEDSNLP-

Query:  ------WIWGGGILTRSSLLPRKREVEKETKLRCNLSERPWTIVISKI-WGRN--RLQGSIRSAISRKEGEIIEILADRDVIKYQELEKAEKELEVLLEE
              + +     T  S+    RE  K      ++ E+   + IS + W ++  + +  +   IS++  ++++   D DV+  Q ++K    L + +E+
Subjt:  ------WIWGGGILTRSSLLPRKREVEKETKLRCNLSERPWTIVISKI-WGRN--RLQGSIRSAISRKEGEIIEILADRDVIKYQELEKAEKELEVLLEE

Query:  EEIYWKFRSREDWLRWGDRNTKWFHSKASSRRKQNKIEGLFDNMGKWITHEEELGKEASGYFEKLFATSSPHPEDIKRTIEGISAKISEAQSRSLNAPFT
        EE+YW+ R+R +WL+  D+NT +FH  AS RR+ N I  L  + G+ IT E E+ + A+ YF+KLF+T+     D    + GI   IS   +  L   FT
Subjt:  EEIYWKFRSREDWLRWGDRNTKWFHSKASSRRKQNKIEGLFDNMGKWITHEEELGKEASGYFEKLFATSSPHPEDIKRTIEGISAKISEAQSRSLNAPFT

Query:  KSEVEKTLKGINPSKAPGEDGAHALFYQNYWDIIGPKVTHVCLEVLNEGKDVGPFNKMLIALIPIKKKPTRMKEFRPISLCNVIYKITAKALANRMKRVL
          E+   +KG+  +KAPG DG   LF+Q YWDI+G  + + CLEVLN GKDV   N   I LIP K  PT + +FRPISLC V+YKI AKA+ANR++ V+
Subjt:  KSEVEKTLKGINPSKAPGEDGAHALFYQNYWDIIGPKVTHVCLEVLNEGKDVGPFNKMLIALIPIKKKPTRMKEFRPISLCNVIYKITAKALANRMKRVL

Query:  NDIISPAQSAFVPSRLITDNVLIGFECIHSINLRKKGKIGQAALKLDIIQA
           I  AQSAFVP RLI+DNVLI +E +H++  ++KGK G  A+KLD+ +A
Subjt:  NDIISPAQSAFVPSRLITDNVLIGFECIHSINLRKKGKIGQAALKLDIIQA

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]1.4e-7433.33Show/hide
Query:  VEKIRDLLNFECSFGVPCEGKSDGLAILWKEEINLHIRSFFMGHIDTTVKEDS--GWWRFTGLCGNPITSKRKNSWELLEKLSEDSNLPWI---------
        +E+I++ + F     VPC G+S G+A+LW  EINL ++S+   HID  + E S    WR TG  G+P T KR +SW LL  L+    LPW+         
Subjt:  VEKIRDLLNFECSFGVPCEGKSDGLAILWKEEINLHIRSFFMGHIDTTVKEDS--GWWRFTGLCGNPITSKRKNSWELLEKLSEDSNLPWI---------

Query:  -----WGG---------------------------------------------------------------------GILTRSSLLPRKREVEKETKLRC
             +GG                                                                           +LL     +    +++ 
Subjt:  -----WGG---------------------------------------------------------------------GILTRSSLLPRKREVEKETKLRC

Query:  NLSERPWT------IVISKIWG-------------------------RNRLQGSIRSAISRKEGEIIEI-LADRDVIKYQELEKAEKELEVLLEEEEIYW
           E  WT       +I   WG                          + + G I   I  K   +  + + + D     E+ +  +E+  LL++EE YW
Subjt:  NLSERPWT------IVISKIWG-------------------------RNRLQGSIRSAISRKEGEIIEI-LADRDVIKYQELEKAEKELEVLLEEEEIYW

Query:  KFRSREDWLRWGDRNTKWFHSKASSRRKQNKIEGLFDNMGKWITHEEELGKEASGYFEKLFATSSPHPEDIKRTIEGISAKISEAQSRSLNAPFTKSEVE
          R++  WL+ GDRNTK+FH++AS RRKQN I G++D  G+W  +EE + + A  YF  ++  SS HP  I+   E I  K++E  + SL   FTK EV 
Subjt:  KFRSREDWLRWGDRNTKWFHSKASSRRKQNKIEGLFDNMGKWITHEEELGKEASGYFEKLFATSSPHPEDIKRTIEGISAKISEAQSRSLNAPFTKSEVE

Query:  KTLKGINPSKAPGEDGAHALFYQNYWDIIGPKVTHVCLEVLNEGKDVGPFNKMLIALIPIKKKPTRMKEFRPISLCNVIYKITAKALANRMKRVLNDIIS
          LK I+P+KAPG DG  A+F+Q YW I+G  VT + L VLN    +   NK  I+LIP    P RM +FRPISLCNV+YK+ +K LANR+K +L  IIS
Subjt:  KTLKGINPSKAPGEDGAHALFYQNYWDIIGPKVTHVCLEVLNEGKDVGPFNKMLIALIPIKKKPTRMKEFRPISLCNVIYKITAKALANRMKRVLNDIIS

Query:  PAQSAFVPSRLITDNVLIGFECIHSINLRKKGKIGQAALKLDIIQA
          QSAF   RLITDNVL+ FE +H ++ +  GK G  A+KLD+ +A
Subjt:  PAQSAFVPSRLITDNVLIGFECIHSINLRKKGKIGQAALKLDIIQA

XP_024038343.1 uncharacterized protein LOC112097373 [Citrus clementina]4.1e-7935.33Show/hide
Query:  MTKTKCGLDKVEKIRDLLNFECSFGVPCEGKSDGLAILWKEEINLHIRSFFMGHIDTTVKEDSG-WWRFTGLCGNPITSKRKNSWELLEKLSEDS--NLP
        + +TK    ++ ++   LN+E  F V   GK  GLA+LW  E  + I+SF   HID  ++ ++G   R TG+ G+P T +RK++W LL +LS+ S    P
Subjt:  MTKTKCGLDKVEKIRDLLNFECSFGVPCEGKSDGLAILWKEEINLHIRSFFMGHIDTTVKEDSG-WWRFTGLCGNPITSKRKNSWELLEKLSEDS--NLP

Query:  WIWGGGIL-------------------------------------------------------TRSSLL----------PRKREVEKETKLR-CNLSERP
        + W  G                                                          R++L+            K  +EKE  L+ C  +  P
Subjt:  WIWGGGIL-------------------------------------------------------TRSSLL----------PRKREVEKETKLR-CNLSERP

Query:  WTIV--ISK-------IWGRNRLQGSIRSAISRKEGEIIEILADR-DVIKYQELEKAEKELEVLLEEEEIYWKFRSREDWLRWGDRNTKWFHSKASSRRK
         ++   +SK       +W +   +G  +  + +   ++  +   R   +K  ++++ E++++ +L ++EIYWK RSR DWL+ GD+NTK+FH KASSR+K
Subjt:  WTIV--ISK-------IWGRNRLQGSIRSAISRKEGEIIEILADR-DVIKYQELEKAEKELEVLLEEEEIYWKFRSREDWLRWGDRNTKWFHSKASSRRK

Query:  QNKIEGLFDNMGKWITHEEELGKEASGYFEKLFATSSPHPEDIKRTIEGISAKISEAQSRSLNAPFTKSEVEKTLKGINPSKAPGEDGAHALFYQNYWDI
        +N+I G+ +  G WI + E +  E + YF  LF TS P+ + I   + GIS ++S   + SL  PFT  EV + L  + P+KAPG DG  A+F+Q +W  
Subjt:  QNKIEGLFDNMGKWITHEEELGKEASGYFEKLFATSSPHPEDIKRTIEGISAKISEAQSRSLNAPFTKSEVEKTLKGINPSKAPGEDGAHALFYQNYWDI

Query:  IGPKVTHVCLEVLNEGKDVGPFNKMLIALIPIKKKPTRMKEFRPISLCNVIYKITAKALANRMKRVLNDIISPAQSAFVPSRLITDNVLIGFECIHSINL
        +   V   CL +LN+  DV PFN   I LI  K KP ++ +FRPISLCNVIY+I AKA+ANR+K VL ++ISP QSAF+P+ LITDN+++G+EC+H I  
Subjt:  IGPKVTHVCLEVLNEGKDVGPFNKMLIALIPIKKKPTRMKEFRPISLCNVIYKITAKALANRMKRVLNDIISPAQSAFVPSRLITDNVLIGFECIHSINL

Query:  RKKGKIGQAALKLDIIQA
         K  K G  ALKLD+ +A
Subjt:  RKKGKIGQAALKLDIIQA

XP_024044510.1 uncharacterized protein LOC112100177 [Citrus clementina]5.3e-7938.17Show/hide
Query:  MTKTKCGLDKVEKIRDLLNFECSFGVPCEGKSDGLAILWKEEINLHIRSFFMGHIDTTVKEDSG-WWRFTGLCGNPITSKRKNSWELLEKLSEDSNLPWI
        + +TK  + ++++    L ++  F V  EG   GLA+LW EE+ + I+S+   H+D  V  ++G +WR TG+ G+P + K+K++WELL +L++ S+LPW+
Subjt:  MTKTKCGLDKVEKIRDLLNFECSFGVPCEGKSDGLAILWKEEINLHIRSFFMGHIDTTVKEDSG-WWRFTGLCGNPITSKRKNSWELLEKLSEDSNLPWI

Query:  WGGG----ILTRSSLLPRKREVEKETKLRCNLSERPWTIVIS----KIWGRNRLQGSIRSAISRKEGEIIEILADRDVIKYQELEKAEKELEVLLEEEEI
          G     +     +  +++ V +  + R  L +   T + S      W   R    I       E ++   L +R    + + EKA   L     +   
Subjt:  WGGG----ILTRSSLLPRKREVEKETKLRCNLSERPWTIVIS----KIWGRNRLQGSIRSAISRKEGEIIEILADRDVIKYQELEKAEKELEVLLEEEEI

Query:  YWKFRSREDWLRWGDRNTKWFHSKASSRRKQNKIEGLFDNMGKWITHEEELGKEASGYFEKLFATSSPHPEDIKRTIEGISAKISEAQSRSLNAPFTKSE
             SR DWL+ GDRNTK+FHSKAS+RRK+N+I+G+ D  G W    E + +    YF  +F TSSP P  +   +E +  K+S   +  L+  FTK E
Subjt:  YWKFRSREDWLRWGDRNTKWFHSKASSRRKQNKIEGLFDNMGKWITHEEELGKEASGYFEKLFATSSPHPEDIKRTIEGISAKISEAQSRSLNAPFTKSE

Query:  VEKTLKGINPSKAPGEDGAHALFYQNYWDIIGPKVTHVCLEVLNEGKDVGPFNKMLIALIPIKKKPTRMKEFRPISLCNVIYKITAKALANRMKRVLNDI
        + + L  + P+KAPG DG  A F+Q +W  +   V   CL +LNEG ++   N   IAL+P   KP ++ EFRPI+LCNVIY+I AK +ANR+K VLNDI
Subjt:  VEKTLKGINPSKAPGEDGAHALFYQNYWDIIGPKVTHVCLEVLNEGKDVGPFNKMLIALIPIKKKPTRMKEFRPISLCNVIYKITAKALANRMKRVLNDI

Query:  ISPAQSAFVPSRLITDNVLIGFECIHSINLRKKGKIGQAALKLDIIQA
        IS  QSAFVP+RLITDN++IG+EC++ I   +  + G  A+KLDI +A
Subjt:  ISPAQSAFVPSRLITDNVLIGFECIHSINLRKKGKIGQAALKLDIIQA

TrEMBL top hitse value%identityAlignment
A0A2N9ELU5 Uncharacterized protein1.2e-8135.04Show/hide
Query:  IIERIRQTEKTNISHKTQVNGALESIQNESKDLKAGNKGTQLDKLESPEGIFLEIKDNIKNLHHTRQGEENGIKKEPIPNNMKWKRLAQTEPNTIFPNQR
        +I  +      N++   +V+    S   ESK+    +KG  L +  SP    ++++ ++ + +   + E + ++       +K K + Q  PN +F    
Subjt:  IIERIRQTEKTNISHKTQVNGALESIQNESKDLKAGNKGTQLDKLESPEGIFLEIKDNIKNLHHTRQGEENGIKKEPIPNNMKWKRLAQTEPNTIFPNQR

Query:  SPENKEVGGKMTKTKCGLDKVEKIRDLLNFECSFGVPCEGKSDGLAILWKEEINLHIRSFFMGHIDTTVKEDSG--WWRFTGLCGNPITSKRKNSWELLE
                  + + K    ++ KIR  L F+  F VP EG+S G+A++W +   + I+++   HID  V++ SG   WR TG  G+P T+ R+  WELLE
Subjt:  SPENKEVGGKMTKTKCGLDKVEKIRDLLNFECSFGVPCEGKSDGLAILWKEEINLHIRSFFMGHIDTTVKEDSG--WWRFTGLCGNPITSKRKNSWELLE

Query:  KLSEDSNLPWIWGGGILTR---SSLLPRK---REVEKETKLRCNLSERPWTIVISKIWGRNRLQGSIRSAISRKEGEI------IEILADRDVIKYQE--
         L   S LPW        R    ++  R     E+ K++ L    SE P  +    +    RL    ++   + E +I      ++ L + +V    E  
Subjt:  KLSEDSNLPWIWGGGILTR---SSLLPRK---REVEKETKLRCNLSERPWTIVISKIWGRNRLQGSIRSAISRKEGEI------IEILADRDVIKYQE--

Query:  LEKAEKELEVLLEEEEIYWKFRSREDWLRWGDRNTKWFHSKASSRRKQNKIEGLFDNMGKWITHEEELGKEASGYFEKLFATSSPHPEDIKRTIEGISAK
         +   +EL   ++ EE+ W+ RSR  WL  GDRNT++FH+KA+ RRKQ+++  + D +G+ +   +++G+ A  YFEK++  S  HP  +     GI  K
Subjt:  LEKAEKELEVLLEEEEIYWKFRSREDWLRWGDRNTKWFHSKASSRRKQNKIEGLFDNMGKWITHEEELGKEASGYFEKLFATSSPHPEDIKRTIEGISAK

Query:  ISEAQSRSLNAPFTKSEVEKTLKGINPSKAPGEDGAHALFYQNYWDIIGPKVTHVCLEVLNEGKDVGPFNKMLIALIPIKKKPTRMKEFRPISLCNVIYK
        ++ A + +L   FT  EV   +K + P+KAPG DG  ALFYQ YW I G  VT   L +LN+       NK  IAL+P  K P RM E+RPISLCNV YK
Subjt:  ISEAQSRSLNAPFTKSEVEKTLKGINPSKAPGEDGAHALFYQNYWDIIGPKVTHVCLEVLNEGKDVGPFNKMLIALIPIKKKPTRMKEFRPISLCNVIYK

Query:  ITAKALANRMKRVLNDIISPAQSAFVPSRLITDNVLIGFECIHSINLRKKGKIGQAALKLDIIQA
        I +K LANR K VL  IIS  QSAFVP RLITDNVL+ FE IH +N +++G+    ALKLD+ +A
Subjt:  ITAKALANRMKRVLNDIISPAQSAFVPSRLITDNVLIGFECIHSINLRKKGKIGQAALKLDIIQA

A0A2N9I611 Uncharacterized protein6.1e-8136.33Show/hide
Query:  MTKTKCGLDKVEKIRDLLNFECSFGVPCEGKSDGLAILWKEEINLHIRSFFMGHIDTTVKEDSGWWRFTGLCGNPITSKRKNSWELLEKLSEDSNLPWI-
        +++TK  ++++E IR  L ++ +F VP +G+S GLA+LW E+++L I S+   HID  +K   G WRFTG  G+P T+KRK SW LL+ L E S LPW+ 
Subjt:  MTKTKCGLDKVEKIRDLLNFECSFGVPCEGKSDGLAILWKEEINLHIRSFFMGHIDTTVKEDSGWWRFTGLCGNPITSKRKNSWELLEKLSEDSNLPWI-

Query:  --------------------------WGGGILTRSSLLPRKREVEKETK-----------------LRCN-----------LSERPWT------IVISKI
                                  + GG L R       +E+ K                    L CN             E+ WT       VI  +
Subjt:  --------------------------WGGGILTRSSLLPRKREVEKETK-----------------LRCN-----------LSERPWT------IVISKI

Query:  W-------------------------GRNRLQ-GSIRSAISRKEGEIIEILADRDVIKYQELEKAEKELEVLLEEEEIYWKFRSREDWLRWGDRNTKWFH
        W                         G ++   G+I+  +   E  + ++ ++       +++  + E+  LLE+EE+YWK R+R  WL+ GDRNTK+FH
Subjt:  W-------------------------GRNRLQ-GSIRSAISRKEGEIIEILADRDVIKYQELEKAEKELEVLLEEEEIYWKFRSREDWLRWGDRNTKWFH

Query:  SKASSRRKQNKIEGLFDNMGKWITHEEELGKEASGYFEKLFATSSPHPEDIKRTIEGISAKISEAQSRSLNAPFTKSEVEKTLKGINPSKAPGEDGAHAL
        SKA+ R+K+N ++GL D  G+W     ++ + A  YFE +F  +S +  D+  + EGI   +++A ++SL   F + EV++ +  ++ SKAPG DG  A 
Subjt:  SKASSRRKQNKIEGLFDNMGKWITHEEELGKEASGYFEKLFATSSPHPEDIKRTIEGISAKISEAQSRSLNAPFTKSEVEKTLKGINPSKAPGEDGAHAL

Query:  FYQNYWDIIGPKVTHVCLEVLNEGKDVGPFNKMLIALIPIKKKPTRMKEFRPISLCNVIYKITAKALANRMKRVLNDIISPAQSAFVPSRLITDNVLIGF
        FYQ YW+ +GPKV    L +LN+   +   N   I LIP KK P  M EFRPISLCNVIYKI AK LANR+K +L  +IS  QSAFVP RLIT+N+LI +
Subjt:  FYQNYWDIIGPKVTHVCLEVLNEGKDVGPFNKMLIALIPIKKKPTRMKEFRPISLCNVIYKITAKALANRMKRVLNDIISPAQSAFVPSRLITDNVLIGF

Query:  ECIHSINLRKKGKIGQAALKLDI
        E +H +N R++GK    ALKLD+
Subjt:  ECIHSINLRKKGKIGQAALKLDI

A0A2N9IPS8 Reverse transcriptase domain-containing protein5.7e-7934.41Show/hide
Query:  MTKTKCGLDKVEKIRDLLNFECSFGVPCEGKSDGLAILWKEEINLHIRSFFMGHIDTTV--KEDSGWWRFTGLCGNPITSKRKNSWELLEKLSEDSNLPW
        +++T+     VE++R  + F+ +F VP  G   GLA+LW  ++++ + ++   HID  +  KE    +R TG  GNP T KRK SW LL+ LS  S+ PW
Subjt:  MTKTKCGLDKVEKIRDLLNFECSFGVPCEGKSDGLAILWKEEINLHIRSFFMGHIDTTV--KEDSGWWRFTGLCGNPITSKRKNSWELLEKLSEDSNLPW

Query:  IWGG-------------------------------------GILTRSSLLPRKRE---------------------------------------------
        +  G                                     G +  S    RKR+                                             
Subjt:  IWGG-------------------------------------GILTRSSLLPRKRE---------------------------------------------

Query:  ----VEKETKL-----------RCN-----------LSERPWTIVISKI---------WGRNRLQGSIRSAISRKEGEIIEILADRDVIKYQELEKAEKE
            V+++ KL           +C                P  +V+ K+         W R R  GS+ S+I RK  ++  ++ +        + + + +
Subjt:  ----VEKETKL-----------RCN-----------LSERPWTIVISKI---------WGRNRLQGSIRSAISRKEGEIIEILADRDVIKYQELEKAEKE

Query:  LEVLLEEEEIYWKFRSREDWLRWGDRNTKWFHSKASSRRKQNKIEGLFDNMGKWITHEEELGKEASGYFEKLFATSSPHPEDIKRTIEGISAKISEAQSR
        L  LLE+EEI+W+ RSR  W+  GD+NTK+FH++ + RR+ N I GL D  G W T + ++ + A  YF+ +F +S+P  E I   ++G+ + ++ A + 
Subjt:  LEVLLEEEEIYWKFRSREDWLRWGDRNTKWFHSKASSRRKQNKIEGLFDNMGKWITHEEELGKEASGYFEKLFATSSPHPEDIKRTIEGISAKISEAQSR

Query:  SLNAPFTKSEVEKTLKGINPSKAPGEDGAHALFYQNYWDIIGPKVTHVCLEVLNEGKDVGPFNKMLIALIPIKKKPTRMKEFRPISLCNVIYKITAKALA
         L A FTK EV   LK + P+KAPG DG  A+FYQ YWDI+GP+VT   L +L+ G  +   N   IALIP  K P  + +FRPISLCNVIYKI +K LA
Subjt:  SLNAPFTKSEVEKTLKGINPSKAPGEDGAHALFYQNYWDIIGPKVTHVCLEVLNEGKDVGPFNKMLIALIPIKKKPTRMKEFRPISLCNVIYKITAKALA

Query:  NRMKRVLNDIISPAQSAFVPSRLITDNVLIGFECIHSINLRKKGKIGQAALKLDIIQA
        NR+K+VL  +IS AQSAFVP RLITDNVL+ FE +HS++L++KGK GQ ALKLD+ +A
Subjt:  NRMKRVLNDIISPAQSAFVPSRLITDNVLIGFECIHSINLRKKGKIGQAALKLDIIQA

A0A803P941 Uncharacterized protein1.7e-7837Show/hide
Query:  MTKTKCGLDKVEKIRDLLNFECSFGVPCEGKSDGLAILWKEEINLHIRSFFMGHIDTTVKEDSGW-WRFTGLCGNPITSKRKNSWELLEKLSEDSNLPWI
        +++T+     +E+IR +L F+  F V  +GKS GLA+LWK+   ++I+SF + HID  V+   G+ WRFTG  G+P    RK++W L+E+L      PW+
Subjt:  MTKTKCGLDKVEKIRDLLNFECSFGVPCEGKSDGLAILWKEEINLHIRSFFMGHIDTTVKEDSGW-WRFTGLCGNPITSKRKNSWELLEKLSEDSNLPWI

Query:  WGGGILTRSSLLPRKREVEKETKLRCN-----LSERPWTIVISKIWGRNRLQGSIRSAISR-----------KEGEIIEILADRDVIK------------
         GG          +K     + K R +      SE     ++  +W      GS++    R            +G+  E+ A    +K            
Subjt:  WGGGILTRSSLLPRKREVEKETKLRCN-----LSERPWTIVISKIWGRNRLQGSIRSAISR-----------KEGEIIEILADRDVIK------------

Query:  --YQELEKAEKELEVLLEEEEIYWKFRSREDWLRWGDRNTKWFHSKASSRRKQNKIEGLFDNMGKWITHEEELGKEASGYFEKLFATSSPHPEDIKRTIE
          +Q+  + E++L    ++EE+ WK RSR  WL  GDRNTK+FH KAS R+K+N I GLFD   +W   ++E+ +    Y+ +LF++S P+   I+    
Subjt:  --YQELEKAEKELEVLLEEEEIYWKFRSREDWLRWGDRNTKWFHSKASSRRKQNKIEGLFDNMGKWITHEEELGKEASGYFEKLFATSSPHPEDIKRTIE

Query:  GISAKISEAQSRSLNAPFTKSEVEKTLKGINPSKAPGEDGAHALFYQNYWDIIGPKVTHVCLEVLNEGKDVGPFNKMLIALIPIKKKPTRMKEFRPISLC
         +  ++S   +  L   FTK EV++ +  I+P KAPG+DG   LFY N+W+ +G +V   CLEVLN G +    N  L+ LIP  K PT + EFRP+SLC
Subjt:  GISAKISEAQSRSLNAPFTKSEVEKTLKGINPSKAPGEDGAHALFYQNYWDIIGPKVTHVCLEVLNEGKDVGPFNKMLIALIPIKKKPTRMKEFRPISLC

Query:  NVIYKITAKALANRMKRVLNDIISPAQSAFVPSRLITDNVLIGFECIHSINLRKKGKIG---QAALKLDIIQA
        NVIYK+ +K LANRMK  +  +IS +QSAF+  R I DN +IGFE +H +   KKG+ G   + ALKLD+ +A
Subjt:  NVIYKITAKALANRMKRVLNDIISPAQSAFVPSRLITDNVLIGFECIHSINLRKKGKIG---QAALKLDIIQA

A0A803Q6L7 Uncharacterized protein7.7e-7634.67Show/hide
Query:  TKTKCGLDKVEKIRDLLNFECSFGVPCEGKSDGLAILWKEEINLHIRSFFMGHIDTTV-KEDSGWWRFTGLCGNPITSKRKNSWELLEKLSEDSNLPWIW
        ++T+   +K E +R  L+FE  F V   GKS GL +LW   I+ +I SF   HID+ + KE+   WRFTG  G+P  ++R  SW+LL ++    + PW+ 
Subjt:  TKTKCGLDKVEKIRDLLNFECSFGVPCEGKSDGLAILWKEEINLHIRSFFMGHIDTTV-KEDSGWWRFTGLCGNPITSKRKNSWELLEKLSEDSNLPWIW

Query:  GGG---ILTRSSLL-------------------PRKREVEKETK----------------LRC------NLSERPW----------------TIVISKIW
        GG    IL R   +                      +EV+ E                  L+C      ++++  W                + ++++ W
Subjt:  GGG---ILTRSSLL-------------------PRKREVEKETK----------------LRC------NLSERPW----------------TIVISKIW

Query:  GRNRLQGSIRSAISRK--------------------------EGEIIEILADRDVIKYQELEKAEKELEVLLEEEEIYWKFRSREDWLRWGDRNTKWFHS
         R R   +    + +K                          E +I  +        +Q L++ E++  +LL++EE +WK +SR  WL+ GDR TK+FH 
Subjt:  GRNRLQGSIRSAISRK--------------------------EGEIIEILADRDVIKYQELEKAEKELEVLLEEEEIYWKFRSREDWLRWGDRNTKWFHS

Query:  KASSRRKQNKIEGLFDNMGKWITHEEELGKEASGYFEKLFATSSPHPEDIKRTIEGISAKISEAQSRSLNAPFTKSEVEKTLKGINPSKAPGEDGAHALF
        KA++R+K+N I G+ D    W+T  + +G+ A  YF++LFA +S   E+++     I  +IS   +  L APF+  +V + ++ I+P KAPG DG   LF
Subjt:  KASSRRKQNKIEGLFDNMGKWITHEEELGKEASGYFEKLFATSSPHPEDIKRTIEGISAKISEAQSRSLNAPFTKSEVEKTLKGINPSKAPGEDGAHALF

Query:  YQNYWDIIGPKVTHVCLEVLNEGKDVGPFNKMLIALIPIKKKPTRMKEFRPISLCNVIYKITAKALANRMKRVLNDIISPAQSAFVPSRLITDNVLIGFE
        Y+ +W  IG +VT VCL +LN+GK +   N  LI LIP  +KPTRM  FRPISLCNV+YKI AK LA RMK  L+  IS  QSAFV  RLI DN +IGFE
Subjt:  YQNYWDIIGPKVTHVCLEVLNEGKDVGPFNKMLIALIPIKKKPTRMKEFRPISLCNVIYKITAKALANRMKRVLNDIISPAQSAFVPSRLITDNVLIGFE

Query:  CIHSINLRKKGKIGQAALKLDIIQA
         +H +  R+ G   + ALKLD+ +A
Subjt:  CIHSINLRKKGKIGQAALKLDIIQA

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.1e-1528.12Show/hide
Query:  QELEKAEKELEVLLEEEEIYWKFRSREDWLRWGDRNTKWFHSKA---SSRRKQNKIEGLFDNMGKWITHEEELGKEASGYFEKLFATSSPHPEDIKRTIE
        QE+ K   EL+  +E ++   K      W  + +R  K     A     +R++N+I+ + ++ G   T   E+      Y++ L+A    + E++   ++
Subjt:  QELEKAEKELEVLLEEEEIYWKFRSREDWLRWGDRNTKWFHSKA---SSRRKQNKIEGLFDNMGKWITHEEELGKEASGYFEKLFATSSPHPEDIKRTIE

Query:  GIS-AKISEAQSRSLNAPFTKSEVEKTLKGINPSKAPGEDGAHALFYQNYWDIIGPKVTHVCLEVLNEGKDVGPFNKMLIALIPIKKKPTRMKE-FRPIS
          +  ++++ +  SLN P T SE+   +  +   K+PG DG  A FYQ Y + + P +  +   +  EG     F +  I LIP   + T  KE FRPIS
Subjt:  GIS-AKISEAQSRSLNAPFTKSEVEKTLKGINPSKAPGEDGAHALFYQNYWDIIGPKVTHVCLEVLNEGKDVGPFNKMLIALIPIKKKPTRMKE-FRPIS

Query:  LCNVIYKITAKALANRMKRVLNDIISPAQSAFVPSRLITDNVLIGFECIHSINLRK
        L N+  KI  K LANR+++ +  +I   Q  F+P      N+      I  IN  K
Subjt:  LCNVIYKITAKALANRMKRVLNDIISPAQSAFVPSRLITDNVLIGFECIHSINLRK

P08548 LINE-1 reverse transcriptase homolog4.3e-1526.09Show/hide
Query:  QELEKAEKELEVLLEEEEIYWKFRSREDWLRWGDRNTKWFHSKASSRRKQNKIEGLFDNMGKWITHEEELGKEASGYFEKLFATSSPHPEDIKRTIEGIS
        +E+ K   EL  +  +  I    +S+  +    ++  K   +    +R ++ I  + +   +  T   E+ K  + Y++KL++    + ++I + +E   
Subjt:  QELEKAEKELEVLLEEEEIYWKFRSREDWLRWGDRNTKWFHSKASSRRKQNKIEGLFDNMGKWITHEEELGKEASGYFEKLFATSSPHPEDIKRTIEGIS

Query:  -AKISEAQSRSLNAPFTKSEVEKTLKGINPSKAPGEDGAHALFYQNYWDIIGPKVTHVCLEVLNEGKDVGPFNKMLIALIPIK-KKPTRMKEFRPISLCN
          ++S+ +   LN P + SE+  T++ +   K+PG DG  + FYQ + + + P + ++   +  EG     F +  I LIP   K PTR + +RPISL N
Subjt:  -AKISEAQSRSLNAPFTKSEVEKTLKGINPSKAPGEDGAHALFYQNYWDIIGPKVTHVCLEVLNEGKDVGPFNKMLIALIPIK-KKPTRMKEFRPISLCN

Query:  VIYKITAKALANRMKRVLNDIISPAQSAFVPSRLITDNVLIGFECIHSINLRK
        +  KI  K L NR+++ +  II   Q  F+P      N+      I  IN  K
Subjt:  VIYKITAKALANRMKRVLNDIISPAQSAFVPSRLITDNVLIGFECIHSINLRK

P11369 LINE-1 retrotransposable element ORF2 protein2.3e-1626.55Show/hide
Query:  AISRKEGEIIEILADRDVIKYQ-ELEKAEKELEVLLEEEEIYWKFRSREDWLRWGDRNTKWFHSKASSRRKQNKIEGLFDNMGKWITHEEELGKEASGYF
        A+ +KE    +    +++IK + E+ + E    +    +   W F       +   R TK    K    + +N+        G   T  EE+      ++
Subjt:  AISRKEGEIIEILADRDVIKYQ-ELEKAEKELEVLLEEEEIYWKFRSREDWLRWGDRNTKWFHSKASSRRKQNKIEGLFDNMGKWITHEEELGKEASGYF

Query:  EKLFATSSPHPEDIKRTIEGISA-KISEAQSRSLNAPFTKSEVEKTLKGINPSKAPGEDGAHALFYQNYWDIIGPKVTHVCLEVLNEGKDVGPFNKMLIA
        ++L++T   + +++ + ++     K+++ Q   LN+P +  E+E  +  +   K+PG DG  A FYQ + + + P +  +  ++  EG     F +  I 
Subjt:  EKLFATSSPHPEDIKRTIEGISA-KISEAQSRSLNAPFTKSEVEKTLKGINPSKAPGEDGAHALFYQNYWDIIGPKVTHVCLEVLNEGKDVGPFNKMLIA

Query:  LIP-IKKKPTRMKEFRPISLCNVIYKITAKALANRMKRVLNDIISPAQSAFVPSRLITDNVLIGFECIHSINLRK
        LIP  +K PT+++ FRPISL N+  KI  K LANR++  +  II P Q  F+P      N+      IH IN  K
Subjt:  LIP-IKKKPTRMKEFRPISLCNVIYKITAKALANRMKRVLNDIISPAQSAFVPSRLITDNVLIGFECIHSINLRK

P14381 Transposon TX1 uncharacterized 149 kDa protein1.9e-2327.61Show/hide
Query:  LQGSIRSAISRKEGEIIEIL-----ADRDVIKYQELEKAEKELEVLLEEEEIYWKF-RSREDWLRWGDRNTKWFHSKASSRRKQNKIEGLFDNMGKWITH
        + G   + I    GE++++      ++   ++ + LE+  KE    +E+ +    F RSR   L   DR +++F++    +  + +I  LF   G  +  
Subjt:  LQGSIRSAISRKEGEIIEIL-----ADRDVIKYQELEKAEKELEVLLEEEEIYWKF-RSREDWLRWGDRNTKWFHSKASSRRKQNKIEGLFDNMGKWITH

Query:  EEELGKEASGYFEKLFATSSPHPEDIKRTIEGISAKISEAQSRSLNAPFTKSEVEKTLKGINPSKAPGEDGAHALFYQNYWDIIGPKVTHVCLEVLNEGK
         E +   A  +++ LF+     P+  +   +G+   +SE +   L  P T  E+ + L+ +  +K+PG DG    F+Q +WD +GP    V  E   +G+
Subjt:  EEELGKEASGYFEKLFATSSPHPEDIKRTIEGISAKISEAQSRSLNAPFTKSEVEKTLKGINPSKAPGEDGAHALFYQNYWDIIGPKVTHVCLEVLNEGK

Query:  DVGPFNKMLIALIPIKKKPTRMKEFRPISLCNVIYKITAKALANRMKRVLNDIISPAQSAFVPSRLITDNVLIGFECIHSINLRKKGKIGQAALKLD
              + +++L+P K     +K +RP+SL +  YKI AKA++ R+K VL ++I P QS  VP R I DNV +  + +H     ++  +  A L LD
Subjt:  DVGPFNKMLIALIPIKKKPTRMKEFRPISLCNVIYKITAKALANRMKRVLNDIISPAQSAFVPSRLITDNVLIGFECIHSINLRKKGKIGQAALKLD

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.7e-1122.78Show/hide
Query:  GRNRLQGSIRSAISRKEGEIIEILADRDVIKYQELEKAEKELEVLLEEEEIYWKFRSREDWLRWGDRNTKWFHSKASSRRKQNKIEGLFDNMGKWITHEE
        G   +Q   + A+   E    ++L +     ++    A K+        E +++ +SR  WL+ GD NT++FH    + + +N I+ L  +    + +  
Subjt:  GRNRLQGSIRSAISRKEGEIIEILADRDVIKYQELEKAEKELEVLLEEEEIYWKFRSREDWLRWGDRNTKWFHSKASSRRKQNKIEGLFDNMGKWITHEE

Query:  ELGKEASGYFEKLFATSSP--HPEDIKRTIEGISAKISEAQSRSLNAPFTKSEVEKTLKGINPSKAPGEDGAHALFYQNYWDIIGPKVTHVCLEVLNEGK
        ++ +    Y+  L  + S    P+ ++R  +    + ++  +  L+A  +  E+   +  +  +KAPG D   A F+   W ++         E    G 
Subjt:  ELGKEASGYFEKLFATSSP--HPEDIKRTIEGISAKISEAQSRSLNAPFTKSEVEKTLKGINPSKAPGEDGAHALFYQNYWDIIGPKVTHVCLEVLNEGK

Query:  DVGPFNKMLIALIPIKKKPTRMKEFRPISLCNVIYKI
         +  FN   I LIP      ++  FRP+S C V+YKI
Subjt:  DVGPFNKMLIALIPIKKKPTRMKEFRPISLCNVIYKI

AT4G20520.1 RNA binding;RNA-directed DNA polymerases9.9e-0741.67Show/hide
Query:  LANRMKRVLNDIISPAQSAFVPSRLITDNVLIGFECIHSINLRKKGKIGQAALKLDIIQA
        +  R+K ++ ++I PAQ++F+P R+ TDN++   E +HS+  RKKG  G   LKLD+ +A
Subjt:  LANRMKRVLNDIISPAQSAFVPSRLITDNVLIGFECIHSINLRKKGKIGQAALKLDIIQA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAATTACAAATACAAGAATCAAGAAAAGAAAACTGTAAAGGAGAAGATTCAGAGAGCTATGGACAGAAATCGAAAGAAGGAAATCAGGAAGGAGGAAGAGAGAA
CCGACAAGATACAAACCAAAAAGGCCCGATTAGCAGAGAGGAAGGGAACACAGACAACAATGGCAAAAACAAGGCCATAATCGAGCGCATAAGGCAGACGGAGAAGACAA
ACATCAGTCATAAGACCCAAGTGAATGGGGCTCTCGAATCCATTCAGAATGAAAGCAAAGACTTAAAAGCAGGGAATAAAGGAACACAATTGGACAAGTTGGAATCCCCA
GAGGGAATTTTTTTAGAGATCAAAGATAACATCAAGAATTTACATCACACAAGGCAGGGGGAGGAAAATGGCATTAAGAAAGAACCCATACCCAATAATATGAAATGGAA
AAGGCTGGCACAAACGGAACCAAATACTATATTCCCTAATCAGAGGTCACCTGAGAATAAGGAAGTGGGAGGGAAAATGACTAAAACTAAGTGTGGTTTGGACAAAGTGG
AAAAGATTAGAGATTTGTTAAATTTCGAGTGTAGTTTCGGGGTGCCTTGTGAAGGAAAGAGCGACGGGCTAGCTATCCTTTGGAAGGAAGAGATTAACCTGCACATTAGA
TCATTTTTCATGGGTCACATCGACACTACAGTAAAAGAGGACAGTGGGTGGTGGAGATTTACTGGTTTGTGTGGAAATCCGATTACTAGCAAGAGGAAAAACTCGTGGGA
GCTCCTTGAGAAGCTGAGCGAGGACTCTAATCTCCCTTGGATTTGGGGGGGAGGGATTTTAACGAGATCCTCCTTGCTTCCGAGAAAAAGGGAGGTGGAGAAAGAAACCA
AGCTCAGATGCAATCTTTCAGAGAGGCCGTGGACAATTGTAATCTCCAAGATCTGGGGTAGGAATCGGTTGCAAGGCTCTATTAGATCGGCCATAAGTAGAAAAGAAGGG
GAGATTATAGAGATCCTAGCCGACAGGGATGTGATCAAATATCAAGAATTGGAAAAGGCTGAAAAAGAGTTGGAAGTGCTCCTGGAAGAGGAAGAGATTTACTGGAAATT
CAGATCTCGGGAGGATTGGTTGAGATGGGGAGACCGCAACACTAAATGGTTCCACTCTAAGGCTAGTTCGAGGAGAAAACAAAACAAAATTGAGGGACTTTTCGACAACA
TGGGGAAATGGATAACTCACGAAGAAGAGCTAGGAAAGGAGGCCTCGGGGTATTTTGAAAAGTTGTTCGCCACTTCTTCCCCTCACCCAGAGGATATCAAACGAACGATA
GAAGGGATATCAGCAAAGATCTCAGAAGCTCAAAGCAGATCTCTCAATGCCCCTTTCACCAAATCAGAAGTTGAAAAAACCTTAAAAGGAATCAACCCTAGCAAAGCGCC
TGGAGAAGATGGAGCTCATGCCTTGTTTTATCAGAATTATTGGGATATTATAGGCCCAAAAGTTACTCATGTGTGCTTAGAAGTGCTTAATGAAGGAAAAGACGTAGGGC
CTTTCAACAAAATGTTAATTGCCTTGATTCCCATAAAGAAGAAACCTACAAGGATGAAGGAATTTAGGCCAATTAGCCTTTGCAATGTTATATACAAAATCACTGCGAAA
GCTTTGGCTAACAGAATGAAAAGGGTCCTCAATGACATTATATCTCCGGCCCAGTCCGCTTTTGTTCCTAGCAGATTAATCACTGACAATGTTTTGATAGGGTTTGAATG
CATCCATTCTATTAACTTGAGGAAAAAAGGAAAAATTGGTCAAGCAGCGCTTAAACTCGATATAATTCAGGCCCGTTTGTCTCGACGCCAGCCTCTTGACTCGACTTCCT
GCTCCTTTTCTCAGCTTGGTCTCGTTTCTTCTCGGTTCGGGGCTTCAATCTTTGGTTTTCGATGTCATAATTCAGGCATGCTTCAGCACAATTGGGTCTCTTCACCTCCT
CTTCGTCCAAGACATCGAGATGGCTCCAAAAACTCCGTAATTGGCCCAAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAATTACAAATACAAGAATCAAGAAAAGAAAACTGTAAAGGAGAAGATTCAGAGAGCTATGGACAGAAATCGAAAGAAGGAAATCAGGAAGGAGGAAGAGAGAA
CCGACAAGATACAAACCAAAAAGGCCCGATTAGCAGAGAGGAAGGGAACACAGACAACAATGGCAAAAACAAGGCCATAATCGAGCGCATAAGGCAGACGGAGAAGACAA
ACATCAGTCATAAGACCCAAGTGAATGGGGCTCTCGAATCCATTCAGAATGAAAGCAAAGACTTAAAAGCAGGGAATAAAGGAACACAATTGGACAAGTTGGAATCCCCA
GAGGGAATTTTTTTAGAGATCAAAGATAACATCAAGAATTTACATCACACAAGGCAGGGGGAGGAAAATGGCATTAAGAAAGAACCCATACCCAATAATATGAAATGGAA
AAGGCTGGCACAAACGGAACCAAATACTATATTCCCTAATCAGAGGTCACCTGAGAATAAGGAAGTGGGAGGGAAAATGACTAAAACTAAGTGTGGTTTGGACAAAGTGG
AAAAGATTAGAGATTTGTTAAATTTCGAGTGTAGTTTCGGGGTGCCTTGTGAAGGAAAGAGCGACGGGCTAGCTATCCTTTGGAAGGAAGAGATTAACCTGCACATTAGA
TCATTTTTCATGGGTCACATCGACACTACAGTAAAAGAGGACAGTGGGTGGTGGAGATTTACTGGTTTGTGTGGAAATCCGATTACTAGCAAGAGGAAAAACTCGTGGGA
GCTCCTTGAGAAGCTGAGCGAGGACTCTAATCTCCCTTGGATTTGGGGGGGAGGGATTTTAACGAGATCCTCCTTGCTTCCGAGAAAAAGGGAGGTGGAGAAAGAAACCA
AGCTCAGATGCAATCTTTCAGAGAGGCCGTGGACAATTGTAATCTCCAAGATCTGGGGTAGGAATCGGTTGCAAGGCTCTATTAGATCGGCCATAAGTAGAAAAGAAGGG
GAGATTATAGAGATCCTAGCCGACAGGGATGTGATCAAATATCAAGAATTGGAAAAGGCTGAAAAAGAGTTGGAAGTGCTCCTGGAAGAGGAAGAGATTTACTGGAAATT
CAGATCTCGGGAGGATTGGTTGAGATGGGGAGACCGCAACACTAAATGGTTCCACTCTAAGGCTAGTTCGAGGAGAAAACAAAACAAAATTGAGGGACTTTTCGACAACA
TGGGGAAATGGATAACTCACGAAGAAGAGCTAGGAAAGGAGGCCTCGGGGTATTTTGAAAAGTTGTTCGCCACTTCTTCCCCTCACCCAGAGGATATCAAACGAACGATA
GAAGGGATATCAGCAAAGATCTCAGAAGCTCAAAGCAGATCTCTCAATGCCCCTTTCACCAAATCAGAAGTTGAAAAAACCTTAAAAGGAATCAACCCTAGCAAAGCGCC
TGGAGAAGATGGAGCTCATGCCTTGTTTTATCAGAATTATTGGGATATTATAGGCCCAAAAGTTACTCATGTGTGCTTAGAAGTGCTTAATGAAGGAAAAGACGTAGGGC
CTTTCAACAAAATGTTAATTGCCTTGATTCCCATAAAGAAGAAACCTACAAGGATGAAGGAATTTAGGCCAATTAGCCTTTGCAATGTTATATACAAAATCACTGCGAAA
GCTTTGGCTAACAGAATGAAAAGGGTCCTCAATGACATTATATCTCCGGCCCAGTCCGCTTTTGTTCCTAGCAGATTAATCACTGACAATGTTTTGATAGGGTTTGAATG
CATCCATTCTATTAACTTGAGGAAAAAAGGAAAAATTGGTCAAGCAGCGCTTAAACTCGATATAATTCAGGCCCGTTTGTCTCGACGCCAGCCTCTTGACTCGACTTCCT
GCTCCTTTTCTCAGCTTGGTCTCGTTTCTTCTCGGTTCGGGGCTTCAATCTTTGGTTTTCGATGTCATAATTCAGGCATGCTTCAGCACAATTGGGTCTCTTCACCTCCT
CTTCGTCCAAGACATCGAGATGGCTCCAAAAACTCCGTAATTGGCCCAAATTAA
Protein sequenceShow/hide protein sequence
MEELQIQESRKENCKGEDSESYGQKSKEGNQEGGRENRQDTNQKGPISREEGNTDNNGKNKAIIERIRQTEKTNISHKTQVNGALESIQNESKDLKAGNKGTQLDKLESP
EGIFLEIKDNIKNLHHTRQGEENGIKKEPIPNNMKWKRLAQTEPNTIFPNQRSPENKEVGGKMTKTKCGLDKVEKIRDLLNFECSFGVPCEGKSDGLAILWKEEINLHIR
SFFMGHIDTTVKEDSGWWRFTGLCGNPITSKRKNSWELLEKLSEDSNLPWIWGGGILTRSSLLPRKREVEKETKLRCNLSERPWTIVISKIWGRNRLQGSIRSAISRKEG
EIIEILADRDVIKYQELEKAEKELEVLLEEEEIYWKFRSREDWLRWGDRNTKWFHSKASSRRKQNKIEGLFDNMGKWITHEEELGKEASGYFEKLFATSSPHPEDIKRTI
EGISAKISEAQSRSLNAPFTKSEVEKTLKGINPSKAPGEDGAHALFYQNYWDIIGPKVTHVCLEVLNEGKDVGPFNKMLIALIPIKKKPTRMKEFRPISLCNVIYKITAK
ALANRMKRVLNDIISPAQSAFVPSRLITDNVLIGFECIHSINLRKKGKIGQAALKLDIIQARLSRRQPLDSTSCSFSQLGLVSSRFGASIFGFRCHNSGMLQHNWVSSPP
LRPRHRDGSKNSVIGPN