; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy07g001990 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy07g001990
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr07:5749208..5760293
RNA-Seq ExpressionLcy07g001990
SyntenyLcy07g001990
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]1.5e-10235.19Show/hide
Query:  MAAFKINLSKVYDRVEWVYLKEIMLKLGFNIKWVNLILRCISTANFSILLNREKKGSFPSSRGLRQGDPLSPYLFLLAAEELSHLISKENIRGSISGL--
        + A K+++SK YDRVEW +L++ M  LGF+ KW++LI+ CI+T  FS+L+N    G     RGLRQG PLSPYLF+L AE  S+L+++      I GL  
Subjt:  MAAFKINLSKVYDRVEWVYLKEIMLKLGFNIKWVNLILRCISTANFSILLNREKKGSFPSSRGLRQGDPLSPYLFLLAAEELSHLISKENIRGSISGL--

Query:  -----------------------------------------------------------DRGDYLSSILGVNKVEDFGKYLGVSSVFSRKKFKDLSYLLE
                                                                   ++   + SI  +  V  + KYLG+  +  R K      +  
Subjt:  -----------------------------------------------------------DRGDYLSSILGVNKVEDFGKYLGVSSVFSRKKFKDLSYLLE

Query:  KVWKSVQGWKGSLFSIVGKEILIKSVGPAIPSYVMSVFRLPKGICDEITKSFARFWWGSKENKKKLHWVSWKKLCLPKSLGVLNFRNIEGFNQALIAKKV
        KV   +  W   LFS  GKEILIK+V  A+P+Y MSVF+LPKG+C++I K  ARFWWG+K++K  +HW  W  +   K  G L FR++  FNQAL+AK+ 
Subjt:  KVWKSVQGWKGSLFSIVGKEILIKSVGPAIPSYVMSVFRLPKGICDEITKSFARFWWGSKENKKKLHWVSWKKLCLPKSLGVLNFRNIEGFNQALIAKKV

Query:  WRLISNPESLVARFLKGIYFENSNILEADLGRNPSFLWKSLLWGRELLMKGIRFRIGQG---------------------------------------DW
        WRL+  P SL+AR +K  Y++NS    A +G NPSF+W+S+LWG +++ KG+R+RIG G                                        W
Subjt:  WRLISNPESLVARFLKGIYFENSNILEADLGRNPSFLWKSLLWGRELLMKGIRFRIGQG---------------------------------------DW

Query:  DIGKLSNGVLSSDIN-IIKCIPINKHLKDKLIWHFDRIGKFTVKSGYKVFLKSKIDGVSSSSKTMGPIWKKMWNLNIPTKIKHFCWRALNNTIPTRTNLI
         + +L    +  DI  I+K +  +   +D+++WHFD+ G+++VKSGY++ L         SS +   +WK  W L++P K+K F WRAL N +PT  NL 
Subjt:  DIGKLSNGVLSSDIN-IIKCIPINKHLKDKLIWHFDRIGKFTVKSGYKVFLKSKIDGVSSSSKTMGPIWKKMWNLNIPTKIKHFCWRALNNTIPTRTNLI

Query:  SRGIISQATCPICYNHVESTDHALCDCVRAREIWKLTFDKVFLDESFNDSF----QDRWSKISLSFSSKELELVAVACWAIWSDRNKVLHE
         R  + +  C  C   VE+  H L +C  AR+IW L    V   +  N  F    Q+ WS+     S+ E EL+ V CW IWS RNK + E
Subjt:  SRGIISQATCPICYNHVESTDHALCDCVRAREIWKLTFDKVFLDESFNDSF----QDRWSKISLSFSSKELELVAVACWAIWSDRNKVLHE

XP_023901347.1 uncharacterized protein LOC112013188 [Quercus suber]3.6e-9935.03Show/hide
Query:  AFKINLSKVYDRVEWVYLKEIMLKLGFNIKWVNLILRCISTANFSILLNREKKGSFPSSRGLRQGDPLSPYLFLLAAEELSHLISKENIRGSISG-----
        A K+++SK YDRVEW +L+++M KLGFN +W+ L++ C+ T ++S+L+N E KG    +RG+RQGDPLSP+LFLL  E L++LI K    GSI G     
Subjt:  AFKINLSKVYDRVEWVYLKEIMLKLGFNIKWVNLILRCISTANFSILLNREKKGSFPSSRGLRQGDPLSPYLFLLAAEELSHLISKENIRGSISG-----

Query:  ---------------------------------------------------------LDRGDYLSSILGVNKVEDFGKYLGVSSVFSRKKFKDLSYLLEK
                                                                 LD  + +   LGV ++  + KYLG+ S   + K     Y+ E+
Subjt:  ---------------------------------------------------------LDRGDYLSSILGVNKVEDFGKYLGVSSVFSRKKFKDLSYLLEK

Query:  VWKSVQGWKGSLFSIVGKEILIKSVGPAIPSYVMSVFRLPKGICDEITKSFARFWWGSKENKKKLHWVSWKKLCLPKSLGVLNFRNIEGFNQALIAKKVW
        VW+ +QGW+ SL S  G+E+LIK+V  AIP+Y M  F+LP G+C+EI K    FWWG + +++K+HWV W+K+C PKS G + F+ +  FN AL+AK+ W
Subjt:  VWKSVQGWKGSLFSIVGKEILIKSVGPAIPSYVMSVFRLPKGICDEITKSFARFWWGSKENKKKLHWVSWKKLCLPKSLGVLNFRNIEGFNQALIAKKVW

Query:  RLISNPESLVARFLKGIYFENSNILEADLGRNPSFLWKSLLWGRELLMKGIRFRIGQGD----WD----------------IGKLSNGVLSSDIN-----
        RL+ N  SL  +  K  +F N +I+EA  G   S+ W+S+L GRE++ +G ++R+G G+    W                   +L N  +SS IN     
Subjt:  RLISNPESLVARFLKGIYFENSNILEADLGRNPSFLWKSLLWGRELLMKGIRFRIGQGD----WD----------------IGKLSNGVLSSDIN-----

Query:  -----------------IIKCIPINKHL-KDKLIWHFDRIGKFTVKSGYKVFLKSKIDGVS----SSSKTMGPIWKKMWNLNIPTKIKHFCWRALNNTIP
                         I+K IP+ +   +D L W + + G++T KSGY  FLK++   V+    S ++ M P+WKK+W+L++P K+++F WRA  N IP
Subjt:  -----------------IIKCIPINKHL-KDKLIWHFDRIGKFTVKSGYKVFLKSKIDGVS----SSSKTMGPIWKKMWNLNIPTKIKHFCWRALNNTIP

Query:  TRTNLISRGIISQATCPICYNHVESTDHALCDCVRAREIWKLTFDKVFLDESFNDSFQDRWSKISLSFSSK-ELELVAVACWAIWSDRNKV
        T  NL  R ++  + C +C  H E   HAL  C    ++W       F  ++    F D W  I   F S    EL A+  W IW  RNKV
Subjt:  TRTNLISRGIISQATCPICYNHVESTDHALCDCVRAREIWKLTFDKVFLDESFNDSFQDRWSKISLSFSSK-ELELVAVACWAIWSDRNKV

XP_024950112.1 uncharacterized protein LOC112496847 [Citrus sinensis]6.9e-10337.22Show/hide
Query:  MAAFKINLSKVYDRVEWVYLKEIMLKLGFNIKWVNLILRCISTANFSILLNREKKGSFPSSRGLRQGDPLSPYLFLLAAEELSHL--ISKEN--IRG---
        + A K+++SK YDRVEW +L++ MLK+GF    V LI+RC+++ +FS+L+N   KG     RGLRQG PLSPYLF+L AE LS+L  ++++N  IRG   
Subjt:  MAAFKINLSKVYDRVEWVYLKEIMLKLGFNIKWVNLILRCISTANFSILLNREKKGSFPSSRGLRQGDPLSPYLFLLAAEELSHL--ISKEN--IRG---

Query:  ----SISGLDRGD-----------------------YLSSILGVNKVEDFGKYLGVSSVFSRKKFKDLSYLLEKVWKSVQGWKGSLFSIVGKEILIKSVG
            SI+ L   D                        +  I  +N V    KYLG+ S+  RKK    + +  KV   + GW+    S  GKE+LIK+  
Subjt:  ----SISGLDRGD-----------------------YLSSILGVNKVEDFGKYLGVSSVFSRKKFKDLSYLLEKVWKSVQGWKGSLFSIVGKEILIKSVG

Query:  PAIPSYVMSVFRLPKGICDEITKSFARFWWGSKENKKKLHWVSWKKLCLPKSLGVLNFRNIEGFNQALIAKKVWRLISNPESLVARFLKGIYFENSNILE
         AIP+Y MSVF+LP+G CD+I ++ A+FWWGSK +K+ +HW  W+KL   K  G L FR    FNQAL+AK+ WRL+  P SLV+R L+  YF NS+ L 
Subjt:  PAIPSYVMSVFRLPKGICDEITKSFARFWWGSKENKKKLHWVSWKKLCLPKSLGVLNFRNIEGFNQALIAKKVWRLISNPESLVARFLKGIYFENSNILE

Query:  ADLGRNPSFLWKSLLWGRELLMKGIRFRIGQG---------------------------------------DWDIGKLSNGVLSSDINIIKCIPI-NKHL
        A  G N S++W+S++WGR+++ KG+R+RIG G                                        WD  KL    L  D   I  IP+  +  
Subjt:  ADLGRNPSFLWKSLLWGRELLMKGIRFRIGQG---------------------------------------DWDIGKLSNGVLSSDINIIKCIPI-NKHL

Query:  KDKLIWHFDRIGKFTVKSGYKVFLKSKIDGVSSSSKTMGPIWKKMWNLNIPTKIKHFCWRALNNTIPTRTNLISRGIISQATCPICYNHVESTDHALCDC
        +D+++WH+D+ G ++VKSGY++ L+SK    +S ++     W  +W L +P K+K F WRA NN +P+  NL  R ++ + TC  C   VE+  HAL +C
Subjt:  KDKLIWHFDRIGKFTVKSGYKVFLKSKIDGVSSSSKTMGPIWKKMWNLNIPTKIKHFCWRALNNTIPTRTNLISRGIISQATCPICYNHVESTDHALCDC

Query:  VRAREIW-KLTFDKVFLDESFNDSFQDRWSKISLSFSSKELELVAVACWAIWSDRNKVLHE--EDTPQVTIEKVE
          AR+IW +  F    L+ +  D F      ++      +LEL+   CW+ W  RNK + +  E  P ++  K E
Subjt:  VRAREIW-KLTFDKVFLDESFNDSFQDRWSKISLSFSSKELELVAVACWAIWSDRNKVLHE--EDTPQVTIEKVE

XP_030483666.1 uncharacterized protein LOC115700238 [Cannabis sativa]2.5e-10035.04Show/hide
Query:  AAFKINLSKVYDRVEWVYLKEIMLKLGFNIKWVNLILRCISTANFSILLNREKKGSFPSSRGLRQGDPLSPYLFLLAAEELSHLISKE----NIRG----
        +  K+++SK +DRVEW+Y++E+M K+GF+ KW++LI+ C+ST +FS +LN E+ G    +RGLRQGDPLSPYLFL+ +E LS L+  E    N+RG    
Subjt:  AAFKINLSKVYDRVEWVYLKEIMLKLGFNIKWVNLILRCISTANFSILLNREKKGSFPSSRGLRQGDPLSPYLFLLAAEELSHLISKE----NIRG----

Query:  ----SISGLDRGD--------------------------------------------------YLSSILGVNKVEDFGKYLGVSSVFSRKKFKDLSYLLE
            SIS L   D                                                  +   ILG+   E   +YLG+ +   R K +  S + E
Subjt:  ----SISGLDRGD--------------------------------------------------YLSSILGVNKVEDFGKYLGVSSVFSRKKFKDLSYLLE

Query:  KVWKSVQGWKGSLFSIVGKEILIKSVGPAIPSYVMSVFRLPKGICDEITKSFARFWWGSKENKKKLHWVSWKKLCLPKSLGVLNFRNIEGFNQALIAKKV
        ++W+ +  W   LFS+ GKE+L+K+V  +IP+Y MS FRLP   C+++    A FWWGS ++  K+HW SWK LC  K  G + FR+   FNQAL+AK+ 
Subjt:  KVWKSVQGWKGSLFSIVGKEILIKSVGPAIPSYVMSVFRLPKGICDEITKSFARFWWGSKENKKKLHWVSWKKLCLPKSLGVLNFRNIEGFNQALIAKKV

Query:  WRLISNPESLVARFLKGIYFENSNILEADLGRNPSFLWKSLLWGRELLMKGIRFRIGQG--------------------------------------DWD
        WR+   P  L++R LK  YF N+  LEA LG +PS  W+ + WGR+LL +G+RF+IG G                                      +W+
Subjt:  WRLISNPESLVARFLKGIYFENSNILEADLGRNPSFLWKSLLWGRELLMKGIRFRIGQG--------------------------------------DWD

Query:  IGKLSNGVLSSDINIIKCIPINKH-LKDKLIWHFDRIGKFTVKSGYKVFLKSKIDGVSSSSKTMGPIWKKMWNLNIPTKIKHFCWRALNNTIPTRTNLIS
        +  L+      D++ I  IP++     D+ IWH    G +TV SG+ +      +  SS S T    WK  W L++P+K+K F WR + N +P  T L  
Subjt:  IGKLSNGVLSSDINIIKCIPINKH-LKDKLIWHFDRIGKFTVKSGYKVFLKSKIDGVSSSSKTMGPIWKKMWNLNIPTKIKHFCWRALNNTIPTRTNLIS

Query:  RGIISQATCPICYNHVESTDHALCDCVRAREIWKLTFDKVFLDESFNDSFQDRWSKISLSFSSKELELVAVACWAIWSDRNKVLH
        R +I+ ATC +C N  ES  HA+ +C +A+++W+ T  K+  + + N    D    +S   S  + EL+    WAIW +RNKV H
Subjt:  RGIISQATCPICYNHVESTDHALCDCVRAREIWKLTFDKVFLDESFNDSFQDRWSKISLSFSSKELELVAVACWAIWSDRNKVLH

XP_030496634.1 uncharacterized protein LOC115712492 [Cannabis sativa]2.4e-10334.35Show/hide
Query:  AAFKINLSKVYDRVEWVYLKEIMLKLGFNIKWVNLILRCISTANFSILLNREKKGSFPSSRGLRQGDPLSPYLFLLAAEELSHLISKENIRGSISGL---
        +A K+++SK +DRVEW+Y++E+M K+GF++KW+ +I+ C+S+ +FS +LN E+ G    +RGLRQGDPLSPYLFL+ +E LS L+  E     + GL   
Subjt:  AAFKINLSKVYDRVEWVYLKEIMLKLGFNIKWVNLILRCISTANFSILLNREKKGSFPSSRGLRQGDPLSPYLFLLAAEELSHLISKENIRGSISGL---

Query:  -----------------------------------------------------------DRGDYLSSILGVNKVEDFGKYLGVSSVFSRKKFKDLSYLLE
                                                                      D+    LG+   +   +YLG+ +   R K +  S + E
Subjt:  -----------------------------------------------------------DRGDYLSSILGVNKVEDFGKYLGVSSVFSRKKFKDLSYLLE

Query:  KVWKSVQGWKGSLFSIVGKEILIKSVGPAIPSYVMSVFRLPKGICDEITKSFARFWWGSKENKKKLHWVSWKKLCLPKSLGVLNFRNIEGFNQALIAKKV
        ++W+ +  W   LFS+ GKE+L+K+V  +IP+Y MS FRLP   C ++    A FWWGS ++  K+HW SWK LC  K  G + FR+   FN+AL+AK+ 
Subjt:  KVWKSVQGWKGSLFSIVGKEILIKSVGPAIPSYVMSVFRLPKGICDEITKSFARFWWGSKENKKKLHWVSWKKLCLPKSLGVLNFRNIEGFNQALIAKKV

Query:  WRLISNPESLVARFLKGIYFENSNILEADLGRNPSFLWKSLLWGRELLMKGIRFRIGQG--------------------------------------DWD
        WR++  P SL++R LK  YF N+N LEA LG +PS  W+ + WGRELL++G+R++IG G                                      +W+
Subjt:  WRLISNPESLVARFLKGIYFENSNILEADLGRNPSFLWKSLLWGRELLMKGIRFRIGQG--------------------------------------DWD

Query:  IGKLSNGVLSSDINIIKCIPIN-KHLKDKLIWHFDRIGKFTVKSGYKVFLKSKIDGVSSSSKTMGPIWKKMWNLNIPTKIKHFCWRALNNTIPTRTNLIS
        I  L+    S D++ I  IP++     D+LIWH    G +TV SG+ +      +  + +S +    WK  WNL +P+K+K F WR + N +P  T L+ 
Subjt:  IGKLSNGVLSSDINIIKCIPIN-KHLKDKLIWHFDRIGKFTVKSGYKVFLKSKIDGVSSSSKTMGPIWKKMWNLNIPTKIKHFCWRALNNTIPTRTNLIS

Query:  RGIISQATCPICYNHVESTDHALCDCVRAREIW---KLTFDKVFLDESFNDSFQDRWSKISLSFSSKELELVAVACWAIWSDRNKVLH
        R +I  ATC +C N  ES  HAL +C  AR++W   K T D       +N  +    S +    S ++LEL+    WAIW +RNKV+H
Subjt:  RGIISQATCPICYNHVESTDHALCDCVRAREIW---KLTFDKVFLDESFNDSFQDRWSKISLSFSSKELELVAVACWAIWSDRNKVLH

TrEMBL top hitse value%identityAlignment
A0A2N9FYH3 CCHC-type domain-containing protein2.0e-10337.66Show/hide
Query:  AFKINLSKVYDRVEWVYLKEIMLKLGFNIKWVNLILRCISTANFSILLNREKKGSFPSSRGLRQGDPLSPYLFLLAAEELSHLISKENIRGSISGLD--R
        A K+++SK YDRVEW YLK++MLKLGF  +WV LI+ C+++ ++SIL+N E KG    SRGLRQGDPLSPYLFL+ AE L+ L+ K      + G+   R
Subjt:  AFKINLSKVYDRVEWVYLKEIMLKLGFNIKWVNLILRCISTANFSILLNREKKGSFPSSRGLRQGDPLSPYLFLLAAEELSHLISKENIRGSISGLD--R

Query:  GDYLSSILGVNKVEDFGKYLGVSSVFSRKKFKDLSYLLEKVWKSVQGWKGSLFSIVGKEILIKSVGPAIPSYVMSVFRLPKGICDEITKSFARFWWGSKE
        G              F KYLG+  V  R K +  S + +++W+ +QGWK    S  GK +LIK+V  AIP+Y MS F+ P G+C+EI+    RFWWG KE
Subjt:  GDYLSSILGVNKVEDFGKYLGVSSVFSRKKFKDLSYLLEKVWKSVQGWKGSLFSIVGKEILIKSVGPAIPSYVMSVFRLPKGICDEITKSFARFWWGSKE

Query:  NKKKLHWVSWKKLCLPKSLGVLNFRNIEGFNQALIAKKVWRLISNPESLVARFLKGIYFENSNILEADLGRNPSFLWKSLLWGRELLMKGIRFRIGQGD-
          +K+HW+S KKLC  K  G + FR+++ FNQAL+A++ WRL+ NP+SLV RFLK  YF +++ +EA +  N S+LW+S+   + +L  G+R+R+G G+ 
Subjt:  NKKKLHWVSWKKLCLPKSLGVLNFRNIEGFNQALIAKKVWRLISNPESLVARFLKGIYFENSNILEADLGRNPSFLWKSLLWGRELLMKGIRFRIGQGD-

Query:  ----------------------------------------WDIGKLSNGVLSSDINIIKCIPIN-KHLKDKLIWHFDRIGKFTVKSGYKVFLKSKIDGV-
                                                W++  L    L  D+ +I  IP++ +  +D LIW   + G FTVKS Y + L     G  
Subjt:  ----------------------------------------WDIGKLSNGVLSSDINIIKCIPIN-KHLKDKLIWHFDRIGKFTVKSGYKVFLKSKIDGV-

Query:  -SSSSKTMGPIWKKMWNLNIPTKIKHFCWRALNNTIPTRTNLISRGIISQATCPICYNHVESTDHALCDCVRAREIWKLTFDKVFLDESFNDSFQDRWSK
         SSSS+ +   WK +W+  +  K+K F WRA  N +PT+T L  +G+ + ++C  C    E+ DH L  C  A+++WK +  K+    + N SF D  + 
Subjt:  -SSSSKTMGPIWKKMWNLNIPTKIKHFCWRALNNTIPTRTNLISRGIISQATCPICYNHVESTDHALCDCVRAREIWKLTFDKVFLDESFNDSFQDRWSK

Query:  ISLSFSSKELELVAVACWAIWSDRNKVLHEE
                 +E+     W++W  RN+++ E+
Subjt:  ISLSFSSKELELVAVACWAIWSDRNKVLHEE

A0A803NHG3 Uncharacterized protein1.3e-10234.19Show/hide
Query:  AFKINLSKVYDRVEWVYLKEIMLKLGFNIKWVNLILRCISTANFSILLNREKKGSFPSSRGLRQGDPLSPYLFLLAAEELSHLISKENIRGSISGLDRGD
        A K++++K YDRVEW +L+E+ML+LG++ +W+  I+ C+++  FS L+N E +G     RG+RQGDPLSP+LFL  AE  S L+ +E     + G+  G 
Subjt:  AFKINLSKVYDRVEWVYLKEIMLKLGFNIKWVNLILRCISTANFSILLNREKKGSFPSSRGLRQGDPLSPYLFLLAAEELSHLISKENIRGSISGLDRGD

Query:  Y-------------------------------------LSSILGVNKVEDFGKYLGVSSVFSRKKFKDLSYLLEKVWKSVQGWKGSLFSIVGKEILIKSV
                                              L+  LGV  V++ GKYLG+ S+  R K +    +  +VW  ++GWKG +FS+  KE+LIK++
Subjt:  Y-------------------------------------LSSILGVNKVEDFGKYLGVSSVFSRKKFKDLSYLLEKVWKSVQGWKGSLFSIVGKEILIKSV

Query:  GPAIPSYVMSVFRLPKGICDEITKSFARFWWGSKENKKKLHWVSWKKLCLPKSLGVLNFRNIEGFNQALIAKKVWRLISNPESLVARFLKGIYFENSNIL
          AIP+Y MS +RL K     I +  ARFWWGS   KKK+HW  W+ LC PK  G L FR++E FNQAL+AK++WR +  P SL ++ LK  YF + ++L
Subjt:  GPAIPSYVMSVFRLPKGICDEITKSFARFWWGSKENKKKLHWVSWKKLCLPKSLGVLNFRNIEGFNQALIAKKVWRLISNPESLVARFLKGIYFENSNIL

Query:  EADLGRNPSFLWKSLLWGRELLMKGIRFRIGQGD---------------------------------------WDIGKLSNGVLSSDIN-IIKCIPINKH
         A  G + SF+W+SL+WG+E+++KG R+R+G G                                        WD   +       D   I++  P+++ 
Subjt:  EADLGRNPSFLWKSLLWGRELLMKGIRFRIGQGD---------------------------------------WDIGKLSNGVLSSDIN-IIKCIPINKH

Query:  LKDKLIWHFDRIGKFTVKSGYKVFLKSKIDGVSSSSKTMGPIWKKMWNLNIPTKIKHFCWRALNNTIPTRTNLISRGIISQATCPICYNHV-ESTDHALC
        L+DK++WH+ R G++TV+SGY++  + +    +   + M   W+K+W L +P K+KHF W+  N+ +PT +NL++R +++  TC  C N V E+  HAL 
Subjt:  LKDKLIWHFDRIGKFTVKSGYKVFLKSKIDGVSSSSKTMGPIWKKMWNLNIPTKIKHFCWRALNNTIPTRTNLISRGIISQATCPICYNHV-ESTDHALC

Query:  DCVRAREIWKLTFDKVFLDESFNDSFQDRWSKISLSFSSKELELVAVACWAIWSDRNKVLHEEDTPQVTIEKVEEEWMKYLL
         C   + IWKL+  K  +     +       +++   +    E   V CW +W  RN   H    PQ    +V E   +YLL
Subjt:  DCVRAREIWKLTFDKVFLDESFNDSFQDRWSKISLSFSSKELELVAVACWAIWSDRNKVLHEEDTPQVTIEKVEEEWMKYLL

A0A803P2K3 Uncharacterized protein5.7e-10336.25Show/hide
Query:  AAFKINLSKVYDRVEWVYLKEIMLKLGFNIKWVNLILRCISTANFSILLNREKKGSFPSSRGLRQGDPLSPYLFLLAAEELSHLISKENIRGSISGLDRG
        +A K+++SK +DRVEW Y++E+M  +GF+ +W+ +I+ C+S+ +FS +LN E+ G    +RGLRQGDPLSPYLFL+ +E LS L+  E    ++ GL   
Subjt:  AAFKINLSKVYDRVEWVYLKEIMLKLGFNIKWVNLILRCISTANFSILLNREKKGSFPSSRGLRQGDPLSPYLFLLAAEELSHLISKENIRGSISGLDRG

Query:  DYLSSILGVNKVED-----------------------FGKYLGVSSVFSRKKFKDLSYLLEKVWKSVQGWKGSLFSIVGKEILIKSVGPAIPSYVMSVFR
         +  S+  +   +D                             V  +F R K +  S + E++W+ +  W   LFS+ GKE+L+K+V  +IP+Y MS FR
Subjt:  DYLSSILGVNKVED-----------------------FGKYLGVSSVFSRKKFKDLSYLLEKVWKSVQGWKGSLFSIVGKEILIKSVGPAIPSYVMSVFR

Query:  LPKGICDEITKSFARFWWGSKENKKKLHWVSWKKLCLPKSLGVLNFRNIEGFNQALIAKKVWRLISNPESLVARFLKGIYFENSNILEADLGRNPSFLWK
        LP   C+++    A FWWGS  +  K+HW SWK LC  K  G + FR+   FN+AL+AK+ WR+   P SL++R LK  YF N+N LEA LG +PS  W+
Subjt:  LPKGICDEITKSFARFWWGSKENKKKLHWVSWKKLCLPKSLGVLNFRNIEGFNQALIAKKVWRLISNPESLVARFLKGIYFENSNILEADLGRNPSFLWK

Query:  SLLWGRELLMKGIRFRIGQG--------------------------------------DWDIGKLSNGVLSSDINIIKCIPINKH-LKDKLIWHFDRIGK
         + WGRELL++G+R++IG G                                      +W+I  L+    + D++ I  IP++    +D+LIWH    G 
Subjt:  SLLWGRELLMKGIRFRIGQG--------------------------------------DWDIGKLSNGVLSSDINIIKCIPINKH-LKDKLIWHFDRIGK

Query:  FTVKSGYKVFLKSKIDGVSSSSKTMGPIWKKMWNLNIPTKIKHFCWRALNNTIPTRTNLISRGIISQATCPICYNHVESTDHALCDCVRAREIW---KLT
        +TV SG+ +      +  S +S +    WK  W+LN+P+K+K F WR + N +PT T L  R +I  A C +C N  ES  HAL +C +A+++W   K T
Subjt:  FTVKSGYKVFLKSKIDGVSSSSKTMGPIWKKMWNLNIPTKIKHFCWRALNNTIPTRTNLISRGIISQATCPICYNHVESTDHALCDCVRAREIW---KLT

Query:  FDKVFLDESFNDSFQDRWSKISLSFSSKELELVAVACWAIWSDRNKVLH
         D       FN    D    +S   S ++ EL+    WAIW DRN+VLH
Subjt:  FDKVFLDESFNDSFQDRWSKISLSFSSKELELVAVACWAIWSDRNKVLH

A0A803PW06 Uncharacterized protein1.1e-10134.75Show/hide
Query:  AAFKINLSKVYDRVEWVYLKEIMLKLGFNIKWVNLILRCISTANFSILLNREKKGSFPSSRGLRQGDPLSPYLFLLAAEELSHLISKENIRGSISGL---
        +A K+++SK +DRVEW+Y++E+M K+ F++KW+ +I+ C+S+ +FS +LN E+ G    +RGLRQGDPLSPYLFL+ +E LS L+  E     + GL   
Subjt:  AAFKINLSKVYDRVEWVYLKEIMLKLGFNIKWVNLILRCISTANFSILLNREKKGSFPSSRGLRQGDPLSPYLFLLAAEELSHLISKENIRGSISGL---

Query:  -----------------------------------------------------------DRGDYLSSILGVNKVEDFGKYLGVSSVFSRKKFKDLSYLLE
                                                                      D+    LG+   +   +YLG+ +   R K +  S + E
Subjt:  -----------------------------------------------------------DRGDYLSSILGVNKVEDFGKYLGVSSVFSRKKFKDLSYLLE

Query:  KVWKSVQGWKGSLFSIVGKEILIKSVGPAIPSYVMSVFRLPKGICDEITKSFARFWWGSKENKKKLHWVSWKKLCLPKSLGVLNFRNIEGFNQALIAKKV
        +VW+ +  W   LFS+ GKE+L+K+V  +IP+Y MS FRLP   C ++    A FWWGS ++  K+HW SWK LC  K  G + FR+   FN+AL+AK+ 
Subjt:  KVWKSVQGWKGSLFSIVGKEILIKSVGPAIPSYVMSVFRLPKGICDEITKSFARFWWGSKENKKKLHWVSWKKLCLPKSLGVLNFRNIEGFNQALIAKKV

Query:  WRLISNPESLVARFLKGIYFENSNILEADLGRNPSFLWKSLLWGRELLMKGIRFRIGQG--------------------------------------DWD
        WR+   P SL++R LK  YF N+N LEA LG +PS  W+ + WGRELL++G+R++IG G                                      +W+
Subjt:  WRLISNPESLVARFLKGIYFENSNILEADLGRNPSFLWKSLLWGRELLMKGIRFRIGQG--------------------------------------DWD

Query:  IGKLSNGVLSSDINIIKCIPINKH-LKDKLIWHFDRIGKFTVKSGYKVF--LKSKIDGVSSSSKTMGPIWKKMWNLNIPTKIKHFCWRALNNTIPTRTNL
        I  L+    S D++ I  IP++     D+LIWH    G +TV   +  F  L+ K    SSS  T    WK  WNL +P+K+K F WR + N +P  T L
Subjt:  IGKLSNGVLSSDINIIKCIPINKH-LKDKLIWHFDRIGKFTVKSGYKVF--LKSKIDGVSSSSKTMGPIWKKMWNLNIPTKIKHFCWRALNNTIPTRTNL

Query:  ISRGIISQATCPICYNHVESTDHALCDCVRAREIW---KLTFDKVFLDESFNDSFQDRWSKISLSFSSKELELVAVACWAIWSDRNKVLH
        + R +I  ATC +C N  ES  H+L +C  AR++W   K T D       +N  +    S +    S ++LEL+    WAIW +RNKV+H
Subjt:  ISRGIISQATCPICYNHVESTDHALCDCVRAREIW---KLTFDKVFLDESFNDSFQDRWSKISLSFSSKELELVAVACWAIWSDRNKVLH

A0A803Q8E0 Uncharacterized protein7.5e-10334.58Show/hide
Query:  AAFKINLSKVYDRVEWVYLKEIMLKLGFNIKWVNLILRCISTANFSILLNREKKGSFPSSRGLRQGDPLSPYLFLLAAEELSHLISKENIRGSISGL---
        +A K+++SK +DRVEW Y++E+M K+GF++KW+ +I+ C+S+  FS +LN E+ G    +RGLRQGDPLSPYLFL+ +E LS L+  E     + GL   
Subjt:  AAFKINLSKVYDRVEWVYLKEIMLKLGFNIKWVNLILRCISTANFSILLNREKKGSFPSSRGLRQGDPLSPYLFLLAAEELSHLISKENIRGSISGL---

Query:  -----------------------------------------------------------DRGDYLSSILGVNKVEDFGKYLGVSSVFSRKKFKDLSYLLE
                                                                      D+    LG+   +   +YLG+ +   R K +  S + E
Subjt:  -----------------------------------------------------------DRGDYLSSILGVNKVEDFGKYLGVSSVFSRKKFKDLSYLLE

Query:  KVWKSVQGWKGSLFSIVGKEILIKSVGPAIPSYVMSVFRLPKGICDEITKSFARFWWGSKENKKKLHWVSWKKLCLPKSLGVLNFRNIEGFNQALIAKKV
        ++W+ +  W   LFS+ GKE+L+K+V  +IP+Y MS FRLP   C ++    A FWWGS ++  K+HW SWK LC  K  G + FR+   FN+AL+AK+ 
Subjt:  KVWKSVQGWKGSLFSIVGKEILIKSVGPAIPSYVMSVFRLPKGICDEITKSFARFWWGSKENKKKLHWVSWKKLCLPKSLGVLNFRNIEGFNQALIAKKV

Query:  WRLISNPESLVARFLKGIYFENSNILEADLGRNPSFLWKSLLWGRELLMKGIRFRIGQG--------------------------------------DWD
        WR+   P SL++R LK  YF N+N LEA LG +PS +W+ + WGRELL++G+R++IG G                                      +W+
Subjt:  WRLISNPESLVARFLKGIYFENSNILEADLGRNPSFLWKSLLWGRELLMKGIRFRIGQG--------------------------------------DWD

Query:  IGKLSNGVLSSDINIIKCIPINKH-LKDKLIWHFDRIGKFTVKSGYKVFLKSKIDGVSSSSKTMGPIWKKMWNLNIPTKIKHFCWRALNNTIPTRTNLIS
        I  L+    S D++ I  IP++     D+LIWH    G +TV SG+ +      +  + +S +    WK  WNL++P+K+K F WR + N +P  T L+ 
Subjt:  IGKLSNGVLSSDINIIKCIPINKH-LKDKLIWHFDRIGKFTVKSGYKVFLKSKIDGVSSSSKTMGPIWKKMWNLNIPTKIKHFCWRALNNTIPTRTNLIS

Query:  RGIISQATCPICYNHVESTDHALCDCVRAREIWKLTFDKVFLDESFNDSFQDRWSKISLS--FSSKELELVAVACWAIWSDRNKVLH
        R +I  ATC +C N  ES  HAL +C  AR++W+ T  K  +D +   +  +    I LS   S ++LEL+    WAIW +RNKV+H
Subjt:  RGIISQATCPICYNHVESTDHALCDCVRAREIWKLTFDKVFLDESFNDSFQDRWSKISLS--FSSKELELVAVACWAIWSDRNKVLH

SwissProt top hitse value%identityAlignment
P08548 LINE-1 reverse transcriptase homolog4.1e-0533.33Show/hide
Query:  INLSKVYDRVEWVYLKEIMLKLGFNIKWVNLILRCISTANFSILLNREKKGSFPSSRGLRQGDPLSPYLFLLAAEELSHLISKENIRGSISGLDRG
        I+  K +D ++  ++   + K+G    ++ LI    S    +I+LN  K  SFP   G RQG PLSP LF +  E L+  I +E    +I G+  G
Subjt:  INLSKVYDRVEWVYLKEIMLKLGFNIKWVNLILRCISTANFSILLNREKKGSFPSSRGLRQGDPLSPYLFLLAAEELSHLISKENIRGSISGLDRG

P0C2F6 Putative ribonuclease H protein At1g657502.4e-2926.13Show/hide
Query:  LLEKVWKSVQGWKGSLFSIVGKEILIKSVGPAIPSYVMSVFRLPKGICDEITKSFARFWWGSKENKKKLHWVSWKKLCLPKSLGVLNFRNIEGFNQALIA
        +LE+V   + GW+    S  G+  L K+V  ++P + MS   LP+ I + + +    F WGS   KKK H V W K+C PK  G L  R  +  N+ALI+
Subjt:  LLEKVWKSVQGWKGSLFSIVGKEILIKSVGPAIPSYVMSVFRLPKGICDEITKSFARFWWGSKENKKKLHWVSWKKLCLPKSLGVLNFRNIEGFNQALIA

Query:  KKVWRLISNPESLVARFLKGIY----FENSNILEADLGRNPSFLWKSLLWG-RELLMKGIRFRIGQGD--------WDIGK----LSNGVLSSDINII--
        K  WRL+    SL    L+  Y      +S  L      + S  W+S+  G R+++  G+ +  G G         W  GK    L NG   +D + +  
Subjt:  KKVWRLISNPESLVARFLKGIY----FENSNILEADLGRNPSFLWKSLLWG-RELLMKGIRFRIGQGD--------WDIGK----LSNGVLSSDINII--

Query:  -------------KCIPINKH----------------LKDKLIWHFDRIGKFTVKSGYKVFLKSKIDGVSSSSKTMGPIWKKMWNLNIPTKIKHFCWRAL
                     K  P   +                 +D+L W F + G+F+V+S Y++    ++         M   +  +W + +P ++K F W   
Subjt:  -------------KCIPINKH----------------LKDKLIWHFDRIGKFTVKSGYKVFLKSKIDGVSSSSKTMGPIWKKMWNLNIPTKIKHFCWRAL

Query:  NNTIPTRTNLISRGIISQATCPICYNHVESTDHALCDCVRAREIW--------KLTFDKVFLDESFNDSFQDRWSKISLSFSSKELELVAVACWAIWSDR
        N  + T      R + +   C +C   VES  H L DC     IW        +  F    L E   D+  DR     + +S+    + AV  W  W  R
Subjt:  NNTIPTRTNLISRGIISQATCPICYNHVESTDHALCDCVRAREIW--------KLTFDKVFLDESFNDSFQDRWSKISLSFSSKELELVAVACWAIWSDR

Query:  NKVLHEEDTPQVTIEKVEEEW
           +  E+T      K  +EW
Subjt:  NKVLHEEDTPQVTIEKVEEEW

P11369 LINE-1 retrotransposable element ORF2 protein1.9e-0720Show/hide
Query:  INLSKVYDRVEWVYLKEIMLKLGFNIKWVNLILRCISTANFSILLNREKKGSFPSSRGLRQGDPLSPYLFLLAAEELSHL-----------ISKENIRGS
        ++  K +D+++  ++ +++ + G    ++N+I    S    +I +N EK  + P   G RQG PLSPYLF +  E L+             I KE ++ S
Subjt:  INLSKVYDRVEWVYLKEIMLKLGFNIKWVNLILRCISTANFSILLNREKKGSFPSSRGLRQGDPLSPYLFLLAAEELSHL-----------ISKENIRGS

Query:  ISGLDRGDYLSSILG-----VNKVEDFG----------------------------------------KYLGVSSVFSRKKFKDLSY--LLEKVWKSVQG
        +   D   Y+S         +N +  FG                                        KYLGV+     K   D ++  L +++ + ++ 
Subjt:  ISGLDRGDYLSSILG-----VNKVEDFG----------------------------------------KYLGVSSVFSRKKFKDLSY--LLEKVWKSVQG

Query:  WKGSLFSIVGKEILIKS--VGPAIPSYVMSVFRLPKGICDEITKSFARFWWGSKENKKKLHWVSWKKLCLPKSLGVLNFRNIEGFNQALIAKKVW
        WK    S +G+  ++K   +  AI  +     ++P    +E+  +  +F W +K+ +     ++   L   ++ G +   +++ + +A++ K  W
Subjt:  WKGSLFSIVGKEILIKS--VGPAIPSYVMSVFRLPKGICDEITKSFARFWWGSKENKKKLHWVSWKKLCLPKSLGVLNFRNIEGFNQALIAKKVW

P92555 Uncharacterized mitochondrial protein AtMg012503.1e-0548Show/hide
Query:  LLNREKKGSFPSSRGLRQGDPLSPYLFLLAAEELSHLISKENIRGSISGL
        ++N   +G    SRGLRQGDPLSPYLF+L  E LS L  +   +G + G+
Subjt:  LLNREKKGSFPSSRGLRQGDPLSPYLFLLAAEELSHLISKENIRGSISGL

P93295 Uncharacterized mitochondrial protein AtMg003103.3e-3146.21Show/hide
Query:  AIPSYVMSVFRLPKGICDEITKSFARFWWGSKENKKKLHWVSWKKLCLPK-SLGVLNFRNIEGFNQALIAKKVWRLISNPESLVARFLKGIYFENSNILE
        A+P Y MS FRL K +C ++T +   FWW S ENK+K+ WV+W+KLC  K   G L FR++  FNQAL+AK+ +R+I  P +L++R L+  YF +S+++E
Subjt:  AIPSYVMSVFRLPKGICDEITKSFARFWWGSKENKKKLHWVSWKKLCLPK-SLGVLNFRNIEGFNQALIAKKVWRLISNPESLVARFLKGIYFENSNILE

Query:  ADLGRNPSFLWKSLLWGRELLMKGIRFRIGQG
          +G  PS+ W+S++ GRELL +G+   IG G
Subjt:  ADLGRNPSFLWKSLLWGRELLMKGIRFRIGQG

Arabidopsis top hitse value%identityAlignment
AT2G02650.1 Ribonuclease H-like superfamily protein3.1e-0825.41Show/hide
Query:  MWNLNIPTKIKHFCWRALNNTIPTRTNLISRGIISQATCPICYNHVESTDHALCDCVRAREIWKLTFDKVFLDESFNDSFQDRWSKI-----SLSFSSKE
        +W L++  KIKHF WR +   + T T L SR I +   C  C    E+  H + +C   + +W+     +        SF+D  +++     + + +S +
Subjt:  MWNLNIPTKIKHFCWRALNNTIPTRTNLISRGIISQATCPICYNHVESTDHALCDCVRAREIWKLTFDKVFLDESFNDSFQDRWSKI-----SLSFSSKE

Query:  LELVAVACWAIWSDRNKVLHEE
          L     W +W  RN  L ++
Subjt:  LELVAVACWAIWSDRNKVLHEE

AT3G09510.1 Ribonuclease H-like superfamily protein2.7e-2828.85Show/hide
Query:  LKGIYFENSNILEADLGRNPSFLWKSLLWGRELLMKGIRFRIGQGD-----------------------------------------WDIGKLSNGVLSS
        +K  YF++ +IL+A + +  S+ W SLL G  LL KG R  IG G                                          WD  K+S  V  S
Subjt:  LKGIYFENSNILEADLGRNPSFLWKSLLWGRELLMKGIRFRIGQGD-----------------------------------------WDIGKLSNGVLSS

Query:  DINIIKCIPINKHLK-DKLIWHFDRIGKFTVKSGYKVFLKSKIDGVSSSSKTMGPIWKK--MWNLNIPTKIKHFCWRALNNTIPTRTNLISRGIISQATC
        D   I  I + K  K DK+IW+++  G++TV+SGY +        + + +   G I  K  +WNL I  K+KHF WRAL+  + T   L +RG+    +C
Subjt:  DINIIKCIPINKHLK-DKLIWHFDRIGKFTVKSGYKVFLKSKIDGVSSSSKTMGPIWKK--MWNLNIPTKIKHFCWRALNNTIPTRTNLISRGIISQATC

Query:  PICYNHVESTDHALCDCVRAREIWKLTFDKVFLDESFNDSFQDRWSKI-----SLSFSSKELELVAVACWAIWSDRNKVLHEE--DTPQVTIEKVEEEWM
        P C+   ES +HAL  C  A   W+L+   +  ++  ++ F++  S I       + S     L     W IW  RN V+  +  ++P  T+   + E  
Subjt:  PICYNHVESTDHALCDCVRAREIWKLTFDKVFLDESFNDSFQDRWSKI-----SLSFSSKELELVAVACWAIWSDRNKVLHEE--DTPQVTIEKVEEEWM

Query:  KYLLA
         +L A
Subjt:  KYLLA

AT4G29090.1 Ribonuclease H-like superfamily protein3.0e-4828.71Show/hide
Query:  AIPSYVMSVFRLPKGICDEITKSFARFWWGSKENKKKLHWVSWKKLCLPKSLGVLNFRNIEGFNQALIAKKVWRLISNPESLVARFLKGIYFENSNILEA
        A+P+Y M+ F LPK +C +I    A FWW +K+  K +HW +W  L   K+ G + F++IE FN AL+ K++WR++S PESL+A+  K  YF  S+ L A
Subjt:  AIPSYVMSVFRLPKGICDEITKSFARFWWGSKENKKKLHWVSWKKLCLPKSLGVLNFRNIEGFNQALIAKKVWRLISNPESLVARFLKGIYFENSNILEA

Query:  DLGRNPSFLWKSLLWGRELLMKGIRFRIGQGD--------W------------------DIGKLSNGVLSSDI----------NIIKCI-----------
         LG  PSF+WKS+   +E+L +G R  +G G+        W                  +   +S+ +  SD+          ++I+ +           
Subjt:  DLGRNPSFLWKSLLWGRELLMKGIRFRIGQGD--------W------------------DIGKLSNGVLSSDI----------NIIKCI-----------

Query:  --PINKHLKDKLIWHFDRIGKFTVKSGYKV---FLKSKIDGVSSSSKTMGPIWKKMWNLNIPTKIKHFCWRALNNTIPTRTNLISRGIISQATCPICYNH
          P  + + D   W +   G +TVKSGY V    +  +      S  ++ PI++K+W      KI+HF W+ L+N++P    L  R +  ++ C  C + 
Subjt:  --PINKHLKDKLIWHFDRIGKFTVKSGYKV---FLKSKIDGVSSSSKTMGPIWKKMWNLNIPTKIKHFCWRALNNTIPTRTNLISRGIISQATCPICYNH

Query:  VESTDHALCDCVRAREIWKLTFDKVFLDESFNDSF-------------QDRWSKISLSFSSKELELVAVACWAIWSDRNKVLH--EEDTPQVTIEKVE--
         E+ +H L  C  AR  W ++   + L   + DS                +W K S        +LV    W +W +RN+++    E   Q  + + E  
Subjt:  VESTDHALCDCVRAREIWKLTFDKVFLDESFNDSF-------------QDRWSKISLSFSSKELELVAVACWAIWSDRNKVLH--EEDTPQVTIEKVE--

Query:  -EEW
         EEW
Subjt:  -EEW

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.3e-3246.21Show/hide
Query:  AIPSYVMSVFRLPKGICDEITKSFARFWWGSKENKKKLHWVSWKKLCLPK-SLGVLNFRNIEGFNQALIAKKVWRLISNPESLVARFLKGIYFENSNILE
        A+P Y MS FRL K +C ++T +   FWW S ENK+K+ WV+W+KLC  K   G L FR++  FNQAL+AK+ +R+I  P +L++R L+  YF +S+++E
Subjt:  AIPSYVMSVFRLPKGICDEITKSFARFWWGSKENKKKLHWVSWKKLCLPK-SLGVLNFRNIEGFNQALIAKKVWRLISNPESLVARFLKGIYFENSNILE

Query:  ADLGRNPSFLWKSLLWGRELLMKGIRFRIGQG
          +G  PS+ W+S++ GRELL +G+   IG G
Subjt:  ADLGRNPSFLWKSLLWGRELLMKGIRFRIGQG

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)2.2e-0648Show/hide
Query:  LLNREKKGSFPSSRGLRQGDPLSPYLFLLAAEELSHLISKENIRGSISGL
        ++N   +G    SRGLRQGDPLSPYLF+L  E LS L  +   +G + G+
Subjt:  LLNREKKGSFPSSRGLRQGDPLSPYLFLLAAEELSHLISKENIRGSISGL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCTTTTAAAATTAACCTTAGTAAGGTTTATGACCGAGTAGAATGGGTGTATCTAAAGGAAATCATGCTTAAACTAGGCTTTAACATTAAATGGGTTAACCTTAT
TTTGAGATGTATATCTACTGCCAATTTTTCCATTCTTTTAAATAGGGAGAAAAAAGGTAGCTTCCCTTCTTCTAGAGGTCTTAGGCAGGGTGATCCACTGTCACCTTATC
TTTTTCTTCTAGCAGCAGAGGAGCTATCCCATCTTATCTCAAAAGAAAATATTAGAGGAAGTATTTCGGGCCTAGATAGAGGAGATTATCTTAGCAGCATTCTAGGTGTT
AACAAAGTTGAAGATTTTGGAAAATATTTGGGAGTCTCTTCAGTTTTTTCAAGGAAAAAGTTTAAGGATCTTAGTTACCTCTTGGAAAAAGTCTGGAAATCGGTTCAAGG
GTGGAAAGGCTCTTTGTTTTCTATTGTGGGGAAAGAAATCCTTATCAAAAGTGTAGGGCCAGCAATACCTTCCTATGTGATGAGTGTTTTTAGACTCCCTAAGGGTATTT
GTGATGAGATCACTAAAAGTTTTGCAAGATTTTGGTGGGGTTCAAAAGAAAATAAGAAGAAACTTCATTGGGTGAGTTGGAAAAAACTCTGCCTTCCAAAAAGTCTTGGT
GTATTAAACTTTAGGAACATAGAAGGTTTTAACCAAGCCCTCATTGCAAAGAAAGTTTGGAGATTAATCTCAAACCCTGAGTCTTTAGTTGCTAGGTTCCTAAAGGGCAT
CTATTTCGAAAATTCTAATATATTAGAAGCTGATTTGGGTCGCAATCCATCCTTTCTTTGGAAGAGCCTCCTCTGGGGAAGAGAATTGCTGATGAAGGGGATTAGGTTCA
GAATTGGGCAAGGAGACTGGGATATAGGAAAGCTTAGTAATGGGGTCCTCAGCTCAGATATAAACATAATTAAGTGTATCCCTATCAACAAGCACCTAAAAGATAAGCTC
ATATGGCATTTTGATAGAATCGGTAAATTCACGGTTAAGAGTGGCTATAAGGTTTTCCTAAAATCGAAAATAGATGGGGTTTCTTCTAGTTCCAAAACTATGGGGCCAAT
CTGGAAGAAAATGTGGAATCTGAACATCCCAACTAAAATAAAACATTTTTGTTGGAGAGCTTTGAACAATACGATCCCAACTAGGACTAATCTTATATCTAGAGGCATAA
TTTCTCAAGCTACCTGTCCTATTTGCTACAATCATGTTGAATCTACTGATCATGCATTATGTGATTGTGTACGTGCGAGGGAAATTTGGAAGCTAACCTTTGACAAAGTC
TTCCTAGATGAGAGCTTCAACGACAGCTTCCAGGATAGGTGGTCCAAGATAAGTTTGAGCTTTTCTTCAAAGGAACTAGAGCTTGTAGCGGTGGCGTGTTGGGCTATCTG
GTCCGATAGGAACAAGGTCCTGCATGAAGAAGATACCCCCCAAGTCACAATAGAGAAGGTAGAGGAAGAGTGGATGAAATATTTGTTGGCATGTGCACTTGTAATGACAT
TTGAAAATTCCTCACTTGCTTCAACCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGCTTTTAAAATTAACCTTAGTAAGGTTTATGACCGAGTAGAATGGGTGTATCTAAAGGAAATCATGCTTAAACTAGGCTTTAACATTAAATGGGTTAACCTTAT
TTTGAGATGTATATCTACTGCCAATTTTTCCATTCTTTTAAATAGGGAGAAAAAAGGTAGCTTCCCTTCTTCTAGAGGTCTTAGGCAGGGTGATCCACTGTCACCTTATC
TTTTTCTTCTAGCAGCAGAGGAGCTATCCCATCTTATCTCAAAAGAAAATATTAGAGGAAGTATTTCGGGCCTAGATAGAGGAGATTATCTTAGCAGCATTCTAGGTGTT
AACAAAGTTGAAGATTTTGGAAAATATTTGGGAGTCTCTTCAGTTTTTTCAAGGAAAAAGTTTAAGGATCTTAGTTACCTCTTGGAAAAAGTCTGGAAATCGGTTCAAGG
GTGGAAAGGCTCTTTGTTTTCTATTGTGGGGAAAGAAATCCTTATCAAAAGTGTAGGGCCAGCAATACCTTCCTATGTGATGAGTGTTTTTAGACTCCCTAAGGGTATTT
GTGATGAGATCACTAAAAGTTTTGCAAGATTTTGGTGGGGTTCAAAAGAAAATAAGAAGAAACTTCATTGGGTGAGTTGGAAAAAACTCTGCCTTCCAAAAAGTCTTGGT
GTATTAAACTTTAGGAACATAGAAGGTTTTAACCAAGCCCTCATTGCAAAGAAAGTTTGGAGATTAATCTCAAACCCTGAGTCTTTAGTTGCTAGGTTCCTAAAGGGCAT
CTATTTCGAAAATTCTAATATATTAGAAGCTGATTTGGGTCGCAATCCATCCTTTCTTTGGAAGAGCCTCCTCTGGGGAAGAGAATTGCTGATGAAGGGGATTAGGTTCA
GAATTGGGCAAGGAGACTGGGATATAGGAAAGCTTAGTAATGGGGTCCTCAGCTCAGATATAAACATAATTAAGTGTATCCCTATCAACAAGCACCTAAAAGATAAGCTC
ATATGGCATTTTGATAGAATCGGTAAATTCACGGTTAAGAGTGGCTATAAGGTTTTCCTAAAATCGAAAATAGATGGGGTTTCTTCTAGTTCCAAAACTATGGGGCCAAT
CTGGAAGAAAATGTGGAATCTGAACATCCCAACTAAAATAAAACATTTTTGTTGGAGAGCTTTGAACAATACGATCCCAACTAGGACTAATCTTATATCTAGAGGCATAA
TTTCTCAAGCTACCTGTCCTATTTGCTACAATCATGTTGAATCTACTGATCATGCATTATGTGATTGTGTACGTGCGAGGGAAATTTGGAAGCTAACCTTTGACAAAGTC
TTCCTAGATGAGAGCTTCAACGACAGCTTCCAGGATAGGTGGTCCAAGATAAGTTTGAGCTTTTCTTCAAAGGAACTAGAGCTTGTAGCGGTGGCGTGTTGGGCTATCTG
GTCCGATAGGAACAAGGTCCTGCATGAAGAAGATACCCCCCAAGTCACAATAGAGAAGGTAGAGGAAGAGTGGATGAAATATTTGTTGGCATGTGCACTTGTAATGACAT
TTGAAAATTCCTCACTTGCTTCAACCTAA
Protein sequenceShow/hide protein sequence
MAAFKINLSKVYDRVEWVYLKEIMLKLGFNIKWVNLILRCISTANFSILLNREKKGSFPSSRGLRQGDPLSPYLFLLAAEELSHLISKENIRGSISGLDRGDYLSSILGV
NKVEDFGKYLGVSSVFSRKKFKDLSYLLEKVWKSVQGWKGSLFSIVGKEILIKSVGPAIPSYVMSVFRLPKGICDEITKSFARFWWGSKENKKKLHWVSWKKLCLPKSLG
VLNFRNIEGFNQALIAKKVWRLISNPESLVARFLKGIYFENSNILEADLGRNPSFLWKSLLWGRELLMKGIRFRIGQGDWDIGKLSNGVLSSDINIIKCIPINKHLKDKL
IWHFDRIGKFTVKSGYKVFLKSKIDGVSSSSKTMGPIWKKMWNLNIPTKIKHFCWRALNNTIPTRTNLISRGIISQATCPICYNHVESTDHALCDCVRAREIWKLTFDKV
FLDESFNDSFQDRWSKISLSFSSKELELVAVACWAIWSDRNKVLHEEDTPQVTIEKVEEEWMKYLLACALVMTFENSSLAST