CuGenDBv2

Lag0025869 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0025869
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr10:22595431..22601032
RNA-Seq Expression: Lag0025869
Synteny: Lag0025869
Gene Ontology terms: NA
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain
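
The Genome location field above can be unpacked programmatically. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention); the helper names `parse_location` and `span_length` are hypothetical and not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(location: str) -> int:
    """Length of the genomic span, assuming 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


print(parse_location("chr10:22595431..22601032"))  # ('chr10', 22595431, 22601032)
print(span_length("chr10:22595431..22601032"))     # 5602
```

Under that assumption the gene model spans 5602 bp on chr10.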


Homology
GenBank top hits (e value, % identity, alignment)
XP_030479133.1 uncharacterized protein LOC115696372 [Cannabis sativa]    e value: 1.6e-79    % identity: 28.99
Query:  GSSGSSA-QCRRSKAYDRVEWCFLESLLMKLGFHYAWVSLIMECVRTTRLSVLLNG--------------------------------------------
        GS G +A +   SKA+DRVEW F+ +++ K+GF+  W++LIM C+ T   S  +NG                                            
Subjt:  GSSGSSA-QCRRSKAYDRVEWCFLESLLMKLGFHYAWVSLIMECVRTTRLSVLLNG--------------------------------------------

Query:  -------APT-------DDSLVFCQASIEQVWILRNILAQYEQASGQKINIGKSALYFSPNVHHDFRIVLSDLLGMLVVPNLGRYLGVPSAFSRRRREDF
               +P+       DDSL+FCQA+      ++  L  Y +ASGQ++N  KS + FSPN     +     +LGM +      YLG+P+   R + + F
Subjt:  -------APT-------DDSLVFCQASIEQVWILRNILAQYEQASGQKINIGKSALYFSPNVHHDFRIVLSDLLGMLVVPNLGRYLGVPSAFSRRRREDF

Query:  QDIRQRVGQTLQGWKGHMFSMGGNEVLIKSVAQSIPTYLMSCFKLPKTLCSDIHSMMARLWWG-STDTKR---KNLSRICLPKELGGLNFRDMELFN---
         +I++++ + +  W   +FS+GG EVL+K+V QSIPTY MSCF+LP  LC++I +MMA+ WWG S+D K+   K    +C  K  GG+ FR    FN   
Subjt:  QDIRQRVGQTLQGWKGHMFSMGGNEVLIKSVAQSIPTYLMSCFKLPKTLCSDIHSMMARLWWG-STDTKR---KNLSRICLPKELGGLNFRDMELFN---

Query:  -----------------KVIKGRYASQESLLIAPVKSNCSVFWRSFVWARELLTSGV-----------------------------VDVKDHLVANFITP
                         +V+KG Y SQ   + A      S+ W+  VW RELL  G+                                  + VA++IT 
Subjt:  -----------------KVIKGRYASQESLLIAPVKSNCSVFWRSFVWARELLTSGV-----------------------------VDVKDHLVANFITP

Query:  SMAWDLNKLRTVAHVDDVNTIATIPISSIHAEDMWIWHYCSHGKYTVRSGYKLARNLLAEQASPSSNEQRAWWNTLWKLHLPQKVKMFIWKAFHECLPTS
        +  W++  L++     DV+ I  IP+S +   D WIWHY   G Y+V SGY LA +L  E  S  S+ Q  WW + WKL+LP KVK+F WK     +P +
Subjt:  SMAWDLNKLRTVAHVDDVNTIATIPISSIHAEDMWIWHYCSHGKYTVRSGYKLARNLLAEQASPSSNEQRAWWNTLWKLHLPQKVKMFIWKAFHECLPTS

Query:  FGLWKRGVDVSPWCIICGAKWETIDHASF--------------------------GLAIKHL----DEEMFGIACITFWSLWNDRNNYKNQMPVMDGIKR
          L+ R +  S  C +C + WE+I HA F                          G  + HL    ++  F       W +W+DRNN+ +   V   ++ 
Subjt:  FGLWKRGVDVSPWCIICGAKWETIDHASF--------------------------GLAIKHL----DEEMFGIACITFWSLWNDRNNYKNQMPVMDGIKR

Query:  KERILEYWQETRCGTSSQVDNNIPIPSTSDTTMNNAAGGGPLTRGAQMSPTELYTDATVNTLARGSGYGAVILGDDGQMRGAM
          + + Y  + R  TS+        P+ S+ T   +       +    +  +L  DA +++     G G ++    GQ++ A+
Subjt:  KERILEYWQETRCGTSSQVDNNIPIPSTSDTTMNNAAGGGPLTRGAQMSPTELYTDATVNTLARGSGYGAVILGDDGQMRGAM

XP_030495229.1 uncharacterized protein LOC115711029 [Cannabis sativa]    e value: 3.0e-78    % identity: 30.53
Query:  GSSGSSA-QCRRSKAYDRVEWCFLESLLMKLGFHYAWVSLIMECVRTTRLSVLLNG------------------AP------------------------
        GS G +A +   SKA+DRVEW +L +++ K+GF+   +SLIM C+ TT  S L+NG                  +P                        
Subjt:  GSSGSSA-QCRRSKAYDRVEWCFLESLLMKLGFHYAWVSLIMECVRTTRLSVLLNG------------------AP------------------------

Query:  ----------------TDDSLVFCQASIEQVWILRNILAQYEQASGQKINIGKSALYFSPNVHHDFRIVLSDLLGMLVVPNLGRYLGVPSAFSRRRREDF
                         DDSL+FC A+      ++  L  Y +ASGQ +N  KS + FSPN     +   S +LGM V     +YL +P+     ++  F
Subjt:  ----------------TDDSLVFCQASIEQVWILRNILAQYEQASGQKINIGKSALYFSPNVHHDFRIVLSDLLGMLVVPNLGRYLGVPSAFSRRRREDF

Query:  QDIRQRVGQTLQGWKGHMFSMGGNEVLIKSVAQSIPTYLMSCFKLPKTLCSDIHSMMARLWWGSTDTKRKNLSR----ICLPKELGGLNFRDMELFNK--
         D ++R+ + L  W   +FS+GG E+L+K+V QSIPTY MSCFKL K+ CS + +MMA++WWGST  K K   +    IC  K  GGL FR    FN+  
Subjt:  QDIRQRVGQTLQGWKGHMFSMGGNEVLIKSVAQSIPTYLMSCFKLPKTLCSDIHSMMARLWWGSTDTKRKNLSR----ICLPKELGGLNFRDMELFNK--

Query:  ------------------VIKGRYASQESLLIAPVKSNCSVFWRSFVWARELLTSGV-VDVKDHL---------------------------VANFITPS
                          ++K RY      L A    N S+ W+  VW REL+  G+ + V + L                           VA +IT  
Subjt:  ------------------VIKGRYASQESLLIAPVKSNCSVFWRSFVWARELLTSGV-VDVKDHL---------------------------VANFITPS

Query:  MAWDLNKLRTVAHVDDVNTIATIPISSIHAEDMWIWHYCSHGKYTVRSGYKLARNLLAEQASPSSNEQRAWWNTLWKLHLPQKVKMFIWKAFHECLPTSF
          W+L  L+    V D++ I T P+SS   +D W+WH+ +HG+YT +SGY  A +L+ + +S +S+   +WWN  W L++P KVK+F WK     LPT+ 
Subjt:  MAWDLNKLRTVAHVDDVNTIATIPISSIHAEDMWIWHYCSHGKYTVRSGYKLARNLLAEQASPSSNEQRAWWNTLWKLHLPQKVKMFIWKAFHECLPTSF

Query:  GLWKRGVDVSPWCIICGAKWETIDHASF---------GLAIKHLDEE-------------------MFGI--ACITFWSLWNDRNNYKNQMPVMDGIKRK
         L++R +  S  C +C   WE++ HA F         GL+  H D +                    F +     T W +W DRNN  +    ++     
Subjt:  GLWKRGVDVSPWCIICGAKWETIDHASF---------GLAIKHLDEE-------------------MFGI--ACITFWSLWNDRNNYKNQMPVMDGIKRK

Query:  ERILEYWQETRCGTSSQVDNNIPIPSTSDTTMNNAAGGGPLTRGAQMSPTELYTDATVNTLARGSGYGAVILGDDGQM
         + + Y       TS Q  N   + S S  T ++AA   P  +    S  +L  +A  ++     GYGA+I   +GQ+
Subjt:  ERILEYWQETRCGTSSQVDNNIPIPSTSDTTMNNAAGGGPLTRGAQMSPTELYTDATVNTLARGSGYGAVILGDDGQM

XP_030496634.1 uncharacterized protein LOC115712492 [Cannabis sativa]    e value: 2.2e-81    % identity: 29.86
Query:  GSSGSSA-QCRRSKAYDRVEWCFLESLLMKLGFHYAWVSLIMECVRTTRLSVLLNG--------------------------------------------
        G+ G SA +   SKA+DRVEW +++ ++ K+GFH  W+++IM C+ +T  S +LNG                                            
Subjt:  GSSGSSA-QCRRSKAYDRVEWCFLESLLMKLGFHYAWVSLIMECVRTTRLSVLLNG--------------------------------------------

Query:  -------APT-------DDSLVFCQASIEQVWILRNILAQYEQASGQKINIGKSALYFSPNVHHDFRIVLSDLLGMLVVPNLGRYLGVPSAFSRRRREDF
               AP+       DDSL+FC+AS      L+ +L  Y  ASGQ +N  KS + FSPN     +      LGM +     RYLG+P+   R ++E F
Subjt:  -------APT-------DDSLVFCQASIEQVWILRNILAQYEQASGQKINIGKSALYFSPNVHHDFRIVLSDLLGMLVVPNLGRYLGVPSAFSRRRREDF

Query:  QDIRQRVGQTLQGWKGHMFSMGGNEVLIKSVAQSIPTYLMSCFKLPKTLCSDIHSMMARLWWGSTDTKRK----NLSRICLPKELGGLNFRDMELFNK--
         D+++R+ Q L  W   +FS+GG EVL+K+V QSIPTY MSCF+LP T CS + SMMA  WWGST    K    +   +C  K  GG+ FR    FNK  
Subjt:  QDIRQRVGQTLQGWKGHMFSMGGNEVLIKSVAQSIPTYLMSCFKLPKTLCSDIHSMMARLWWGSTDTKRK----NLSRICLPKELGGLNFRDMELFNK--

Query:  ------------------VIKGRYASQESLLIAPVKSNCSVFWRSFVWARELLTSGV---------------------VDVK--------DHLVANFITP
                          ++K RY S  + L A +  + S+ W+   W RELL  G+                      D K           V++FIT 
Subjt:  ------------------VIKGRYASQESLLIAPVKSNCSVFWRSFVWARELLTSGV---------------------VDVK--------DHLVANFITP

Query:  SMAWDLNKLRTVAHVDDVNTIATIPISSIHAEDMWIWHYCSHGKYTVRSGYKLARNLLAEQASPSSNEQRAWWNTLWKLHLPQKVKMFIWKAFHECLPTS
           W+++ L       DV+ I TIP+S     D  IWH+ + G YTV SG+ LA NL   Q + +S+    WW T W L LP KVK+F W+     LP +
Subjt:  SMAWDLNKLRTVAHVDDVNTIATIPISSIHAEDMWIWHYCSHGKYTVRSGYKLARNLLAEQASPSSNEQRAWWNTLWKLHLPQKVKMFIWKAFHECLPTS

Query:  FGLWKRGVDVSPWCIICGAKWETIDHASFG-------------------------------LAIKHLDEEMFGIACITFWSLWNDRNNYKNQMPVMDGIK
         GL +R V  S  C +C   WE++ HA F                                L+  H  +++  I C T W++W++RN       +  G+ 
Subjt:  FGLWKRGVDVSPWCIICGAKWETIDHASFG-------------------------------LAIKHLDEEMFGIACITFWSLWNDRNNYKNQMPVMDGIK

Query:  RKER--------ILEYWQETRCGTSSQVDNNIPIPSTSDTTMNNAAGGGPLTRG---AQMSP-----TELYTDATVNTLARGSGYGAVILGDDGQMRGAM
        +            L  +Q  R     Q ++     S+S +   N +      R       SP      ++  DA VN   +  G GA+I    G +  A+
Subjt:  RKER--------ILEYWQETRCGTSSQVDNNIPIPSTSDTTMNNAAGGGPLTRG---AQMSP-----TELYTDATVNTLARGSGYGAVILGDDGQMRGAM

XP_030497600.1 uncharacterized protein LOC115713257 [Cannabis sativa]    e value: 2.2e-81    % identity: 32.77
Query:  GSSGSSA-QCRRSKAYDRVEWCFLESLLMKLGFHYAWVSLIMECVRTTRLSVLLNG--------------------------------------------
        GS G +A +   SKA+DRVEW FL +++ K+GF    +SLIM C+ T   S L+NG                                            
Subjt:  GSSGSSA-QCRRSKAYDRVEWCFLESLLMKLGFHYAWVSLIMECVRTTRLSVLLNG--------------------------------------------

Query:  -------APT-------DDSLVFCQASIEQVWILRNILAQYEQASGQKINIGKSALYFSPNVHHDFRIVLSDLLGMLVVPNLGRYLGVPSAFSRRRREDF
               +P+       DDSL+FCQA+      ++  L  Y +ASGQ +N  KS + FSPN     ++    +LGM +      YLG+P+   R + + F
Subjt:  -------APT-------DDSLVFCQASIEQVWILRNILAQYEQASGQKINIGKSALYFSPNVHHDFRIVLSDLLGMLVVPNLGRYLGVPSAFSRRRREDF

Query:  QDIRQRVGQTLQGWKGHMFSMGGNEVLIKSVAQSIPTYLMSCFKLPKTLCSDIHSMMARLWWG-STDTKR---KNLSRICLPKELGGLNFRDMELFN---
         +I++R+ + +  W   +FS+GG EVL+K+V Q+IPTY MSCF+L    C  I +MMAR WWG STD K+   KN   +C  K  GGL FR    FN   
Subjt:  QDIRQRVGQTLQGWKGHMFSMGGNEVLIKSVAQSIPTYLMSCFKLPKTLCSDIHSMMARLWWG-STDTKR---KNLSRICLPKELGGLNFRDMELFN---

Query:  -----------------KVIKGRYASQESLLIAPVKSNCSVFWRSFVWARELLTSG-VVDVKD----------------------------HLVANFITP
                         +V+KGRY  Q   + A V    S+ W+  VW RELL+ G ++ + D                            +LVA++IT 
Subjt:  -----------------KVIKGRYASQESLLIAPVKSNCSVFWRSFVWARELLTSG-VVDVKD----------------------------HLVANFITP

Query:  SMAWDLNKLRTVAHVDDVNTIATIPISSIHAEDMWIWHYCSHGKYTVRSGYKLARNLLAEQASPSSNEQRAWWNTLWKLHLPQKVKMFIWKAFHECLPTS
        +  WDL  L       D++ I TIP+S     D W WHY S G YTV+SGY LA +L  +  S SS  Q AWW   W L+LP KV++F W+  +  LP +
Subjt:  SMAWDLNKLRTVAHVDDVNTIATIPISSIHAEDMWIWHYCSHGKYTVRSGYKLARNLLAEQASPSSNEQRAWWNTLWKLHLPQKVKMFIWKAFHECLPTS

Query:  FGLWKRGVDVSPWCIICGAKWETIDHASFG-------------------------------LAIKHLDEEMFGIACITFWSLWNDRNNY
          L+ R V  S  C +C   WE+I HA F                                L+      E+  + C T W +W+DRNNY
Subjt:  FGLWKRGVDVSPWCIICGAKWETIDHASFG-------------------------------LAIKHLDEEMFGIACITFWSLWNDRNNY

XP_030508852.1 uncharacterized protein LOC115723496 [Cannabis sativa]    e value: 4.5e-82    % identity: 30.68
Query:  GSSGSSA-QCRRSKAYDRVEWCFLESLLMKLGFHYAWVSLIMECVRTTRLSVLLNG--------------------------------------------
        GS G +A +   SKA+DRVEW F+  +++K+GF    V LI+ C+++   S LLNG                                            
Subjt:  GSSGSSA-QCRRSKAYDRVEWCFLESLLMKLGFHYAWVSLIMECVRTTRLSVLLNG--------------------------------------------

Query:  -------APT-------DDSLVFCQASIEQVWILRNILAQYEQASGQKINIGKSALYFSPNVHHDFRIVLSDLLGMLVVPNLGRYLGVPSAFSRRRREDF
               AP+       DDS++FC+A+ +    +   L  Y +ASGQ IN  K  L FS N     +I   DLLGM + P   +YLG+PS   + +++ F
Subjt:  -------APT-------DDSLVFCQASIEQVWILRNILAQYEQASGQKINIGKSALYFSPNVHHDFRIVLSDLLGMLVVPNLGRYLGVPSAFSRRRREDF

Query:  QDIRQRVGQTLQGWKGHMFSMGGNEVLIKSVAQSIPTYLMSCFKLPKTLCSDIHSMMARLWWGSTDTKR----KNLSRICLPKELGGLNFRDMELFNK--
          I  ++ + L  WK H+FS GG EVL+K+V Q+IPTY MSCF+LP TLC  I SMMAR WWGST T +    KN + +C  K  GGL FR+   FN+  
Subjt:  QDIRQRVGQTLQGWKGHMFSMGGNEVLIKSVAQSIPTYLMSCFKLPKTLCSDIHSMMARLWWGSTDTKR----KNLSRICLPKELGGLNFRDMELFNK--

Query:  ------------------VIKGRYASQESLLIAPVKSNCSVFWRSFVWARELLTSGV---------VDVK---------------------DHLVANFIT
                          +++ RY S  + LIA + SN S+ WRS VW +ELL  G+         ++ K                     + LVA+ IT
Subjt:  ------------------VIKGRYASQESLLIAPVKSNCSVFWRSFVWARELLTSGV---------VDVK---------------------DHLVANFIT

Query:  PSMAWDLNKLRTVAHVDDVNTIATIPISSIHAEDMWIWHYCSHGKYTVRSGYKLARNLLAEQASPSSNEQRAWWNTLWKLHLPQKVKMFIWKAFHECLPT
            WD+  L T  +  D+N + +IP+S    +D+ IW+    G Y V+SGY  A +L  +  S  SN    WW+  WKL LP KV++F+WK FH  LP 
Subjt:  PSMAWDLNKLRTVAHVDDVNTIATIPISSIHAEDMWIWHYCSHGKYTVRSGYKLARNLLAEQASPSSNEQRAWWNTLWKLHLPQKVKMFIWKAFHECLPT

Query:  SFGLWKRGVDVSPWCIICGAKWETIDHASF---------------------------------GLAIKHLDEEMFGIACITFWSLWNDRNNYKN----QM
        +  L++R +  SP+C IC +  ET+ HA F                                   ++   + E+F + C   WS+W++RN   +    + 
Subjt:  SFGLWKRGVDVSPWCIICGAKWETIDHASF---------------------------------GLAIKHLDEEMFGIACITFWSLWNDRNNYKN----QM

Query:  PVMDGIKRKERILEYWQETRCGTSSQVDNNIPIPSTSDTTMNNAAGGGPLTRGAQMSPTELYTDATVNTLARGSGYGAVILGDDGQMRGAM
        P          + E+ Q           +    PS   +   +A       RG      +L TDA ++      G GAV+   DG +  A+
Subjt:  PVMDGIKRKERILEYWQETRCGTSSQVDNNIPIPSTSDTTMNNAAGGGPLTRGAQMSPTELYTDATVNTLARGSGYGAVILGDDGQMRGAM

TrEMBL top hits (e value, % identity, alignment)
A0A803NM27 Uncharacterized protein    e value: 1.8e-81    % identity: 29.49
Query:  SMELAGCVE----GSSGSSA-QCRRSKAYDRVEWCFLESLLMKLGFHYAWVSLIMECVRTTRLSVLLNGAPT----------------------------
        + EL  C++    G  G SA +   SKA+DRVEW FL +++ K+GF   W++LI+ C++TT+LS ++NGA +                            
Subjt:  SMELAGCVE----GSSGSSA-QCRRSKAYDRVEWCFLESLLMKLGFHYAWVSLIMECVRTTRLSVLLNGAPT----------------------------

Query:  ------------------------------DDSLVFCQASIEQVWILRNILAQYEQASGQKINIGKSALYFSPNVHHDFRIVLSDLLGMLVVPNLGRYLG
                                      DDSL+FC+A       ++ +L  Y +ASGQ++N  KS + FSPN     +    ++LGM +      YLG
Subjt:  ------------------------------DDSLVFCQASIEQVWILRNILAQYEQASGQKINIGKSALYFSPNVHHDFRIVLSDLLGMLVVPNLGRYLG

Query:  VPSAFSRRRREDFQDIRQRVGQTLQGWKGHMFSMGGNEVLIKSVAQSIPTYLMSCFKLPKTLCSDIHSMMARLWWGSTDTKR----KNLSRICLPKELGG
        +P+   R +++ F  I++R+ + L  W   +FS+GG EVL+K+V QSIPTY MSCFKLP   C +I S+M+  WWGST  K+    K    +C  K  GG
Subjt:  VPSAFSRRRREDFQDIRQRVGQTLQGWKGHMFSMGGNEVLIKSVAQSIPTYLMSCFKLPKTLCSDIHSMMARLWWGSTDTKR----KNLSRICLPKELGG

Query:  LNFRDMELFN--------------------KVIKGRYASQESLLIAPVKSNCSVFWRSFVWARELLTSGV-----------------------------V
        L FR+   FN                    +V+KGRY S+   L A      S+ W+ F W RELL  G+                              
Subjt:  LNFRDMELFN--------------------KVIKGRYASQESLLIAPVKSNCSVFWRSFVWARELLTSGV-----------------------------V

Query:  DVKDHLVANFITPSMAWDLNKLRTVAHVDDVNTIATIPISSIHAEDMWIWHYCSHGKYTVRSGYKLARNLLAEQASPSSNEQRAWWNTLWKLHLPQKVKM
            + VA++ITP   W+++KL       DV  I ++P+S     D W+WH  + G+Y V+SGY +A  L  E     S+   +WW + W+L LP KVK+
Subjt:  DVKDHLVANFITPSMAWDLNKLRTVAHVDDVNTIATIPISSIHAEDMWIWHYCSHGKYTVRSGYKLARNLLAEQASPSSNEQRAWWNTLWKLHLPQKVKM

Query:  FIWKAFHECLPTSFGLWKRGVDVSPWCIICGAKWETIDHASFGLAIKH---------------------LDEEMFGIA-----------CITFWSLWNDR
        F WKA H  LP +  L+KR    S  C +C   WE++ HA F  A KH                     +++ +F I+             T WS+W+DR
Subjt:  FIWKAFHECLPTSFGLWKRGVDVSPWCIICGAKWETIDHASFGLAIKH---------------------LDEEMFGIA-----------CITFWSLWNDR

Query:  NNYKN----QMPVMDGIKRKERILEYWQETRCGTSSQVDNNIPIPSTSDTTMNNAAGGGPLTRGAQMSPTELYTDATVNTLARGSGYGAVILGDDGQMRG
        NN  +    Q P +   K    +  +    +    + +       + +DT   + A   P       +  +L  DA  +   +  G+GA+I    G ++ 
Subjt:  NNYKN----QMPVMDGIKRKERILEYWQETRCGTSSQVDNNIPIPSTSDTTMNNAAGGGPLTRGAQMSPTELYTDATVNTLARGSGYGAVILGDDGQMRG

Query:  AM
        AM
Subjt:  AM

A0A803PIB6 Uncharacterized protein    e value: 1.1e-81    % identity: 32.77
Query:  GSSGSSA-QCRRSKAYDRVEWCFLESLLMKLGFHYAWVSLIMECVRTTRLSVLLNG--------------------------------------------
        GS G +A +   SKA+DRVEW FL +++ K+GF    +SLIM C+ T   S L+NG                                            
Subjt:  GSSGSSA-QCRRSKAYDRVEWCFLESLLMKLGFHYAWVSLIMECVRTTRLSVLLNG--------------------------------------------

Query:  -------APT-------DDSLVFCQASIEQVWILRNILAQYEQASGQKINIGKSALYFSPNVHHDFRIVLSDLLGMLVVPNLGRYLGVPSAFSRRRREDF
               +P+       DDSL+FCQA+      ++  L  Y +ASGQ +N  KS + FSPN     ++    +LGM +      YLG+P+   R + + F
Subjt:  -------APT-------DDSLVFCQASIEQVWILRNILAQYEQASGQKINIGKSALYFSPNVHHDFRIVLSDLLGMLVVPNLGRYLGVPSAFSRRRREDF

Query:  QDIRQRVGQTLQGWKGHMFSMGGNEVLIKSVAQSIPTYLMSCFKLPKTLCSDIHSMMARLWWG-STDTKR---KNLSRICLPKELGGLNFRDMELFN---
         +I++R+ + +  W   +FS+GG EVL+K+V Q+IPTY MSCF+L    C  I +MMAR WWG STD K+   KN   +C  K  GGL FR    FN   
Subjt:  QDIRQRVGQTLQGWKGHMFSMGGNEVLIKSVAQSIPTYLMSCFKLPKTLCSDIHSMMARLWWG-STDTKR---KNLSRICLPKELGGLNFRDMELFN---

Query:  -----------------KVIKGRYASQESLLIAPVKSNCSVFWRSFVWARELLTSG-VVDVKD----------------------------HLVANFITP
                         +V+KGRY  Q   + A V    S+ W+  VW RELL+ G ++ + D                            +LVA++IT 
Subjt:  -----------------KVIKGRYASQESLLIAPVKSNCSVFWRSFVWARELLTSG-VVDVKD----------------------------HLVANFITP

Query:  SMAWDLNKLRTVAHVDDVNTIATIPISSIHAEDMWIWHYCSHGKYTVRSGYKLARNLLAEQASPSSNEQRAWWNTLWKLHLPQKVKMFIWKAFHECLPTS
        +  WDL  L       D++ I TIP+S     D W WHY S G YTV+SGY LA +L  +  S SS  Q AWW   W L+LP KV++F W+  +  LP +
Subjt:  SMAWDLNKLRTVAHVDDVNTIATIPISSIHAEDMWIWHYCSHGKYTVRSGYKLARNLLAEQASPSSNEQRAWWNTLWKLHLPQKVKMFIWKAFHECLPTS

Query:  FGLWKRGVDVSPWCIICGAKWETIDHASFG-------------------------------LAIKHLDEEMFGIACITFWSLWNDRNNY
          L+ R V  S  C +C   WE+I HA F                                L+      E+  + C T W +W+DRNNY
Subjt:  FGLWKRGVDVSPWCIICGAKWETIDHASFG-------------------------------LAIKHLDEEMFGIACITFWSLWNDRNNY

A0A803PVM0 Uncharacterized protein    e value: 1.4e-84    % identity: 31.52
Query:  GSSGSSAQCRRSKAYDRVEWCFLESLLMKLGFHYAWVSLIMECVRTTRLSVLLNG------APT------------------------------------
        G S S+ +   SKA+DRVEW +++ +++K+GFH  WV LIM C+ +T+ S +LNG       PT                                    
Subjt:  GSSGSSAQCRRSKAYDRVEWCFLESLLMKLGFHYAWVSLIMECVRTTRLSVLLNG------APT------------------------------------

Query:  ----------------DDSLVFCQASIEQVWILRNILAQYEQASGQKINIGKSALYFSPNVHHDFRIVLSDLLGMLVVPNLGRYLGVPSAFSRRRREDFQ
                        DDSL+FC+A+ +    ++ +L  Y QASGQ +N  KS + FSPN     +    + LGM +     RYLG+P+   R ++E F 
Subjt:  ----------------DDSLVFCQASIEQVWILRNILAQYEQASGQKINIGKSALYFSPNVHHDFRIVLSDLLGMLVVPNLGRYLGVPSAFSRRRREDFQ

Query:  DIRQRVGQTLQGWKGHMFSMGGNEVLIKSVAQSIPTYLMSCFKLPKTLCSDIHSMMARLWWGSTDTKRK----NLSRICLPKELGGLNFRDMELFNK---
        D+++R+ Q L  W   +FS+GG EVL+K+V QSIPTY MSCF+LP T CS + SMMA  WWGS     K    +   +C  K  GG+ FR    FNK   
Subjt:  DIRQRVGQTLQGWKGHMFSMGGNEVLIKSVAQSIPTYLMSCFKLPKTLCSDIHSMMARLWWGSTDTKRK----NLSRICLPKELGGLNFRDMELFNK---

Query:  -----------------VIKGRYASQESLLIAPVKSNCSVFWRSFVWARELLTSGV---------------------VDVK--------DHLVANFITPS
                         ++K RY S  S L A +  + S+ W++  W RELL  G+                      D K        +  VANFIT  
Subjt:  -----------------VIKGRYASQESLLIAPVKSNCSVFWRSFVWARELLTSGV---------------------VDVK--------DHLVANFITPS

Query:  MAWDLNKLRTVAHVDDVNTIATIPISSIHAEDMWIWHYCSHGKYTVRSGYKLARNLLAEQASPSSNEQRAWWNTLWKLHLPQKVKMFIWKAFHECLPTSF
          W++  L       DV+ I TIP+S   + D  IWH+ + G YTV SG+ LA NL  E+ + +S+    WW T W L+LP KVK+F W+     LP + 
Subjt:  MAWDLNKLRTVAHVDDVNTIATIPISSIHAEDMWIWHYCSHGKYTVRSGYKLARNLLAEQASPSSNEQRAWWNTLWKLHLPQKVKMFIWKAFHECLPTSF

Query:  GLWKRGVDVSPWCIICGAKWETIDHASFGL----AIKHLDEEMFGIACITFWSLWNDRNNY------KNQMPVMD-GIKRKERILEYWQETRCGTSSQVD
        GL +R V  S  C +C   WE+I HA F       +  L ++ F     T W++W++RN        KN   ++D  ++  ++ +    + R    + V 
Subjt:  GLWKRGVDVSPWCIICGAKWETIDHASFGL----AIKHLDEEMFGIACITFWSLWNDRNNY------KNQMPVMD-GIKRKERILEYWQETRCGTSSQVD

Query:  NNIPIPSTSDTTMNNAAGGGPLTRGAQMSPTELYTDATVNTLARGSGYGAVILGDDGQMRGAM
         NI   +    ++ +     P T     +  +L  DA VN   +  G GAV+    G +  A+
Subjt:  NNIPIPSTSDTTMNNAAGGGPLTRGAQMSPTELYTDATVNTLARGSGYGAVILGDDGQMRGAM

A0A803Q8E0 Uncharacterized protein    e value: 1.4e-86    % identity: 30.96
Query:  GSSGSSA-QCRRSKAYDRVEWCFLESLLMKLGFHYAWVSLIMECVRTTRLSVLLNG--------------------------------------------
        G+ G SA +   SKA+DRVEW +++ ++ K+GFH  W+++IM C+ +TR S +LNG                                            
Subjt:  GSSGSSA-QCRRSKAYDRVEWCFLESLLMKLGFHYAWVSLIMECVRTTRLSVLLNG--------------------------------------------

Query:  -------APT-------DDSLVFCQASIEQVWILRNILAQYEQASGQKINIGKSALYFSPNVHHDFRIVLSDLLGMLVVPNLGRYLGVPSAFSRRRREDF
               AP+       DDSL+FC+AS      LR +L  Y  ASGQ +N  KS + FSPN     +      LGM +     RYLG+P+   R ++E F
Subjt:  -------APT-------DDSLVFCQASIEQVWILRNILAQYEQASGQKINIGKSALYFSPNVHHDFRIVLSDLLGMLVVPNLGRYLGVPSAFSRRRREDF

Query:  QDIRQRVGQTLQGWKGHMFSMGGNEVLIKSVAQSIPTYLMSCFKLPKTLCSDIHSMMARLWWGSTDTKRK----NLSRICLPKELGGLNFRDMELFNK--
         D+++R+ Q L  W   +FS+GG EVL+K+V QSIPTY MSCF+LP T CS + SMMA  WWGST    K    +   +C  K  GG+ FR    FNK  
Subjt:  QDIRQRVGQTLQGWKGHMFSMGGNEVLIKSVAQSIPTYLMSCFKLPKTLCSDIHSMMARLWWGSTDTKRK----NLSRICLPKELGGLNFRDMELFNK--

Query:  ------------------VIKGRYASQESLLIAPVKSNCSVFWRSFVWARELLTSGV---------------------VDVK--------DHLVANFITP
                          ++K RY S  + L A +  + S+ W+   W RELL  G+                      D K           V++FIT 
Subjt:  ------------------VIKGRYASQESLLIAPVKSNCSVFWRSFVWARELLTSGV---------------------VDVK--------DHLVANFITP

Query:  SMAWDLNKLRTVAHVDDVNTIATIPISSIHAEDMWIWHYCSHGKYTVRSGYKLARNLLAEQASPSSNEQRAWWNTLWKLHLPQKVKMFIWKAFHECLPTS
           W+++ L       DV+ I TIP+S   + D  IWH+ + G YTV SG+ LA NL   Q + +S+    WW T W L LP KVK+F W+     LP +
Subjt:  SMAWDLNKLRTVAHVDDVNTIATIPISSIHAEDMWIWHYCSHGKYTVRSGYKLARNLLAEQASPSSNEQRAWWNTLWKLHLPQKVKMFIWKAFHECLPTS

Query:  FGLWKRGVDVSPWCIICGAKWETIDHASFG-------------------------------LAIKHLDEEMFGIACITFWSLWNDRNNYKNQMPVMDGIK
         GL +R V  S  C +C   WE+I HA F                                L+  H  +++  I C T W++W++RN       +  G+ 
Subjt:  FGLWKRGVDVSPWCIICGAKWETIDHASFG-------------------------------LAIKHLDEEMFGIACITFWSLWNDRNNYKNQMPVMDGIK

Query:  RKER--------ILEYWQETRCGTSSQVDN-NIPIPSTSDTTMNNAAGGGPLTR---GAQMSPTE-----LYTDATVNTLARGSGYGAVILGDDGQMRGA
        +  +         L  +   R     Q +N N+   S +   +NN+A   PL R    +  SP +     +  DA VN   +  G GA+I G  G +  A
Subjt:  RKER--------ILEYWQETRCGTSSQVDN-NIPIPSTSDTTMNNAAGGGPLTR---GAQMSPTE-----LYTDATVNTLARGSGYGAVILGDDGQMRGA

Query:  M
        +
Subjt:  M

A0A803QHU6 Uncharacterized protein    e value: 3.1e-81    % identity: 30.7
Query:  SSAQCRRSKAYDRVEWCFLESLLMKLGFHYAWVSLIMECVRTTRLSVLLNG-------------------------------------------------
        S+ +   SKA+DRVEW +L+++++K+GF YAW SLIM C+ T+  S  LNG                                                 
Subjt:  SSAQCRRSKAYDRVEWCFLESLLMKLGFHYAWVSLIMECVRTTRLSVLLNG-------------------------------------------------

Query:  --APT-------DDSLVFCQASIEQVWILRNILAQYEQASGQKINIGKSALYFSPNVHHDFRIVLSDLLGMLVVPNLGRYLGVPSAFSRRRREDFQDIRQ
          +P+       DDSL+FCQ++ +    ++  L  Y +ASGQ +N  KS + FSPN     +   S  L M +     RYLG+PS   R + E F +I++
Subjt:  --APT-------DDSLVFCQASIEQVWILRNILAQYEQASGQKINIGKSALYFSPNVHHDFRIVLSDLLGMLVVPNLGRYLGVPSAFSRRRREDFQDIRQ

Query:  RVGQTLQGWKGHMFSMGGNEVLIKSVAQSIPTYLMSCFKLPKTLCSDIHSMMARLWWGS----TDTKRKNLSRICLPKELGGLNFRDMELFN--------
        R+ + L  W   +FS+GG EVL+K+V QSIPTY MSCF+L K  C+ + SMMA  WW S    T    K    +C  K  GG+ FR    FN        
Subjt:  RVGQTLQGWKGHMFSMGGNEVLIKSVAQSIPTYLMSCFKLPKTLCSDIHSMMARLWWGS----TDTKRKNLSRICLPKELGGLNFRDMELFN--------

Query:  ------------KVIKGRYASQESLLIAPVKSNCSVFWRSFVWARELLTSGV---------VDVK--------------DHL------VANFITPSMAWD
                    +++K R+ S  + L A +  + S+ W+S  W +ELL  G+         +D K               +L      V++FIT    W+
Subjt:  ------------KVIKGRYASQESLLIAPVKSNCSVFWRSFVWARELLTSGV---------VDVK--------------DHL------VANFITPSMAWD

Query:  LNKLRTVAHVDDVNTIATIPISSIHAEDMWIWHYCSHGKYTVRSGYKLARNLLAEQASPSSNEQRAWWNTLWKLHLPQKVKMFIWKAFHECLPTSFGLWK
        +  L T  H  D + I TIP+S     D  IWH+ S+G YTV+SG+ LA +L  +  S +SN QR WW   W L+LP K+++F+WK  H  LPT+  L+K
Subjt:  LNKLRTVAHVDDVNTIATIPISSIHAEDMWIWHYCSHGKYTVRSGYKLARNLLAEQASPSSNEQRAWWNTLWKLHLPQKVKMFIWKAFHECLPTSFGLWK

Query:  RGVDVSPWCIICGAKWETIDHASFGLAIKH--------------------------------LDEEMFGIACITFWSLWNDRNNYKNQMPVMDGIKR-KE
        + V  S  C +C + WE+I HA FG   KH                                L ++ F +     W +W DRN       V  G  R   
Subjt:  RGVDVSPWCIICGAKWETIDHASFGLAIKH--------------------------------LDEEMFGIACITFWSLWNDRNNYKNQMPVMDGIKR-KE

Query:  RILEY---WQETRCGTSSQV------DNNIPIPSTSDTTMNNAAGGGPLTRGAQMSPTELYTDATVNTLARGSGYGAVILGDDG
         I+ Y   + E      + V       N  P  + S T  + A    PL  G +++      DA  N   +  G GA+I   +G
Subjt:  RILEY---WQETRCGTSSQV------DNNIPIPSTSDTTMNNAAGGGPLTRGAQMSPTELYTDATVNTLARGSGYGAVILGDDG

SwissProt top hits (e value, % identity, alignment)
P08548 LINE-1 reverse transcriptase homolog    e value: 4.9e-07    % identity: 22.7
Query:  DDSLVFCQASIEQVWILRNILAQYEQASGQKINIGKSALYFSPNVHHDFRIVLSDLLGMLVVPNLGRYLGV--PSAFSRRRREDFQDIRQRVGQTLQGWK
        DD +V+ + + +    L  ++ +Y   SG KIN  KS  +   N ++     + D +   VVP   +YLGV          +E+++ +R+ + + +  WK
Subjt:  DDSLVFCQASIEQVWILRNILAQYEQASGQKINIGKSALYFSPNVHHDFRIVLSDLLGMLVVPNLGRYLGV--PSAFSRRRREDFQDIRQRVGQTLQGWK

Query:  GHMFSMGG--NEVLIKSVAQSIPTYLMSCFKLPKTLCSDIHSMMARLWWGSTDTKRKNLSRICL--PKELGGLNFRDMELFNKVI
            S  G  N V +  + ++I  +     K P +   D+  ++    W   + K+  +++  L    + GG+   D+ L+ K I
Subjt:  GHMFSMGG--NEVLIKSVAQSIPTYLMSCFKLPKTLCSDIHSMMARLWWGSTDTKRKNLSRICL--PKELGGLNFRDMELFNKVI

P0C2F6 Putative ribonuclease H protein At1g65750    e value: 3.9e-20    % identity: 22.48
Query:  VPSAFSRRRREDFQDIRQRVGQTLQGWKGHMFSMGGNEVLIKSVAQSIPTYLMSCFKLPKTLCSDIHSMMARLWWGSTDTKRK----NLSRICLPKELGG
        +P    R  ++ F +I +RV   + GW+    S  G   L K+V  S+P + MS   LP+++ + +  +     WGST  K+K      S++C PK+ GG
Subjt:  VPSAFSRRRREDFQDIRQRVGQTLQGWKGHMFSMGGNEVLIKSVAQSIPTYLMSCFKLPKTLCSDIHSMMARLWWGSTDTKRK----NLSRICLPKELGG

Query:  LNFRDMELFNK--------------------VIKGRY---ASQESLLIAPVKSNCSVFWRSF-VWARELLTSGV--------------------------
        L  R  +  N+                    V++ +Y     ++S  + P K + S  WRS  +  R++++ GV                          
Subjt:  LNFRDMELFNK--------------------VIKGRY---ASQESLLIAPVKSNCSVFWRSF-VWARELLTSGV--------------------------

Query:  ------VDVKDHLVANFITPSMAWDLNKLRTVAHVDDVNTIATIPISSI-HAEDMWIWHYCSHGKYTVRSGYKLARNLLAEQASPSSNEQRAWWNTLWKL
               D    +  +   P   WD  K+      +    +  + +  +  A D   W +   G+++VRS Y+    +L     P  N   +++N LWK+
Subjt:  ------VDVKDHLVANFITPSMAWDLNKLRTVAHVDDVNTIATIPISSI-HAEDMWIWHYCSHGKYTVRSGYKLARNLLAEQASPSSNEQRAWWNTLWKL

Query:  HLPQKVKMFIWKAFHECLPTSFGLWKRGVDVSPWCIICGAKWETIDH
         +P++VK F+W   ++ + T     +R +  S  C +C    E++ H
Subjt:  HLPQKVKMFIWKAFHECLPTSFGLWKRGVDVSPWCIICGAKWETIDH

P93295 Uncharacterized mitochondrial protein AtMg00310    e value: 3.0e-12    % identity: 33.33
Query:  SIPTYLMSCFKLPKTLCSDIHSMMARLWWGSTDTKRK----NLSRICLPKE-LGGLNFRDMELFN--------------------KVIKGRYASQESLLI
        ++P Y MSCF+L K LC  + S M   WW S + KRK       ++C  KE  GGL FRD+  FN                    ++++ RY    S++ 
Subjt:  SIPTYLMSCFKLPKTLCSDIHSMMARLWWGSTDTKRK----NLSRICLPKE-LGGLNFRDMELFN--------------------KVIKGRYASQESLLI

Query:  APVKSNCSVFWRSFVWARELLTSGVV
          V +  S  WRS +  RELL+ G++
Subjt:  APVKSNCSVFWRSFVWARELLTSGVV

Arabidopsis top hits (e value, % identity, alignment)
AT2G02650.1 Ribonuclease H-like superfamily protein    e value: 1.8e-09    % identity: 29.87
Query:  KYTVRSGYKLA--RNLLAEQA---SPSSNEQRAWWNTLWKLHLPQKVKMFIWKAFHECLPTSFGLWKRGVDVSPWCIICGAKWETIDHASFGLAIKHLDE
        K  +RSGY +A   +LL E+A    P S E +     +WKLH+  K+K F+W+     L T+  L  R +D  P C  C  + ETI H  F         
Subjt:  KYTVRSGYKLA--RNLLAEQA---SPSSNEQRAWWNTLWKLHLPQKVKMFIWKAFHECLPTSFGLWKRGVDVSPWCIICGAKWETIDHASFGLAIKHLDE

Query:  EMFGIACITFW---SLWNDRNNYKNQMPVMDGIKRKERILEYWQETRCGTSSQV
            I     W   S + D  N   Q+         +R L +W   R   S  V
Subjt:  EMFGIACITFW---SLWNDRNNYKNQMPVMDGIKRKERILEYWQETRCGTSSQV

AT3G09510.1 Ribonuclease H-like superfamily protein    e value: 1.0e-12    % identity: 27.92
Query:  WDLNKLRTVAHVDDVNTIATIPISSIHAEDMWIWHYCSHGKYTVRSGYKL-----ARNLLAEQASPSSNEQRAWWNTLWKLHLPQKVKMFIWKAFHECLP
        WD +K+       D   I  I ++     D  IW+Y + G+YTVRSGY L     + N+ A      S + +     +W L +  K+K F+W+A  + L 
Subjt:  WDLNKLRTVAHVDDVNTIATIPISSIHAEDMWIWHYCSHGKYTVRSGYKL-----ARNLLAEQASPSSNEQRAWWNTLWKLHLPQKVKMFIWKAFHECLP

Query:  TSFGLWKRGVDVSPWCIICGAKWETIDHASFGLAIKHLDEEMFGIACITFWSLWNDRNNYKNQMPVMDGIKRKERILEYWQETRCGTSSQVDNNIPI
        T+  L  RG+ + P C  C  + E+I+HA F      +            W L +D +  +NQ+   D  +    IL + Q+T   T S     +P+
Subjt:  TSFGLWKRGVDVSPWCIICGAKWETIDHASFGLAIKHLDEEMFGIACITFWSLWNDRNNYKNQMPVMDGIKRKERILEYWQETRCGTSSQVDNNIPI

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein    e value: 3.3e-06    % identity: 36.21
Query:  WWNTLWKLHLPQKVKMFIWKAFHECLPTSFGLWKRGVDVSPWCIICGAKWETIDHASF
        W   +W L +  K+K+ IWKA +  LP    L  R + + P+C  C   +ETI H  F
Subjt:  WWNTLWKLHLPQKVKMFIWKAFHECLPTSFGLWKRGVDVSPWCIICGAKWETIDHASF

AT4G29090.1 Ribonuclease H-like superfamily protein    e value: 2.0e-24    % identity: 22.71
Query:  SIPTYLMSCFKLPKTLCSDIHSMMARLWWGSTDTKR----KNLSRICLPKELGGLNFRDMELFN--------------------KVIKGRYASQESLLIA
        ++PTY M+CF LPKT+C  I S++A  WW +    +    K    +   K  GG+ F+D+E FN                    KV K RY  +   L A
Subjt:  SIPTYLMSCFKLPKTLCSDIHSMMARLWWGSTDTKR----KNLSRICLPKELGGLNFRDMELFN--------------------KVIKGRYASQESLLIA

Query:  PVKSNCSVFWRSFVWARELLTSG---VVDVKDHLVA---NFITPSMAWDLNKLRTVAHVDDVNTIATIPISSIHAE------------------------
        P+ S  S  W+S   ++E+L  G   VV   + ++     ++    A    +++ V   +  +  + + +S +  E                        
Subjt:  PVKSNCSVFWRSFVWARELLTSG---VVDVKDHLVA---NFITPSMAWDLNKLRTVAHVDDVNTIATIPISSIHAE------------------------

Query:  ---------DMWIWHYCSHGKYTVRSGYKLARNLLAEQASP---SSNEQRAWWNTLWKLHLPQKVKMFIWKAFHECLPTSFGLWKRGVDVSPWCIICGAK
                 D + W Y S G YTV+SGY +   ++ +++SP   S       +  +WK     K++ F+WK     LP +  L  R +     CI C + 
Subjt:  ---------DMWIWHYCSHGKYTVRSGYKLARNLLAEQASP---SSNEQRAWWNTLWKLHLPQKVKMFIWKAFHECLPTSFGLWKRGVDVSPWCIICGAK

Query:  WETIDHASFGLAIKHLD-----------------------------------EEMFGIACITFWSLWNDRNNY---KNQMPVMDGIKRKERILEYW----
         ET++H  F      L                                    E+   +     W LW +RN       +    + ++R E  LE W    
Subjt:  WETIDHASFGLAIKHLD-----------------------------------EEMFGIACITFWSLWNDRNNY---KNQMPVMDGIKRKERILEYW----

Query:  QETRCGTSSQVDNN
        +   CGT  QV+ +
Subjt:  QETRCGTSSQVDNN

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein    e value: 2.1e-13    % identity: 33.33
Query:  SIPTYLMSCFKLPKTLCSDIHSMMARLWWGSTDTKRK----NLSRICLPKE-LGGLNFRDMELFN--------------------KVIKGRYASQESLLI
        ++P Y MSCF+L K LC  + S M   WW S + KRK       ++C  KE  GGL FRD+  FN                    ++++ RY    S++ 
Subjt:  SIPTYLMSCFKLPKTLCSDIHSMMARLWWGSTDTKRK----NLSRICLPKE-LGGLNFRDMELFN--------------------KVIKGRYASQESLLI

Query:  APVKSNCSVFWRSFVWARELLTSGVV
          V +  S  WRS +  RELL+ G++
Subjt:  APVKSNCSVFWRSFVWARELLTSGVV


Sequences
CDS sequence
ATGAAATCAAGAAAGGCGTTCTTGATATTTCAGAGACGACAACGAAGTCGTTGTGGAGAAGAAGACTCAATGGAGCTTGCGGGTTGCGTAGAAGGCTCAAGTGGTTCGTC
GGCGCAGTGTAGGAGGAGCAAGGCCTATGATCGTGTTGAATGGTGCTTCCTCGAGAGCTTATTGATGAAGCTTGGTTTTCATTATGCATGGGTCAGTCTGATTATGGAGT
GTGTCAGAACCACTCGTTTATCTGTTTTACTAAATGGTGCTCCAACAGATGATAGCCTTGTTTTCTGCCAGGCATCAATTGAACAGGTGTGGATTCTGAGGAATATTTTG
GCTCAATATGAACAGGCATCAGGCCAGAAAATAAATATCGGGAAGTCCGCTTTATATTTCTCCCCAAATGTACATCACGATTTTAGAATTGTGCTATCTGATTTATTGGG
TATGCTAGTAGTTCCAAATTTGGGACGTTATCTGGGGGTACCGTCAGCATTCAGCAGAAGAAGGAGGGAGGACTTCCAAGATATTAGGCAAAGAGTCGGGCAAACACTTC
AGGGATGGAAGGGCCATATGTTCTCTATGGGAGGGAACGAAGTTCTGATTAAGAGTGTAGCTCAGTCCATTCCCACATATCTCATGAGTTGTTTTAAGCTTCCAAAAACT
TTGTGTTCGGATATTCACTCTATGATGGCTCGGTTGTGGTGGGGCTCTACTGACACAAAAAGGAAAAATTTGTCTCGGATTTGTTTACCAAAGGAGCTTGGAGGATTAAA
CTTCAGAGATATGGAGCTTTTCAATAAGGTTATCAAAGGGCGATATGCTAGTCAAGAATCTTTATTAATTGCCCCAGTTAAAAGCAATTGTTCTGTTTTCTGGAGGAGTT
TCGTTTGGGCTCGTGAACTGTTGACTAGTGGTGTTGTTGATGTAAAGGATCATTTGGTTGCCAACTTTATAACACCCTCGATGGCTTGGGATTTAAATAAATTACGTACT
GTGGCGCATGTGGATGATGTCAACACCATTGCAACCATTCCAATCAGCTCTATCCATGCAGAGGATATGTGGATTTGGCACTATTGCTCTCATGGGAAATATACAGTTCG
AAGTGGATATAAGCTTGCTCGTAATCTGTTGGCTGAACAGGCATCCCCTAGTTCTAATGAACAACGGGCATGGTGGAACACACTTTGGAAGTTGCACTTGCCACAAAAAG
TCAAGATGTTTATTTGGAAGGCATTTCATGAATGTTTACCAACTTCTTTTGGTTTGTGGAAGCGGGGTGTTGATGTATCACCTTGGTGTATTATTTGTGGGGCAAAGTGG
GAGACTATTGACCACGCATCGTTTGGTTTGGCTATCAAACACTTAGATGAGGAAATGTTTGGAATAGCCTGTATTACATTTTGGTCTTTATGGAATGACAGGAACAATTA
TAAGAACCAAATGCCAGTTATGGATGGGATCAAACGGAAGGAGAGGATATTGGAGTACTGGCAAGAAACACGTTGTGGGACTTCATCTCAAGTTGACAACAACATCCCAA
TTCCAAGCACTAGTGACACAACAATGAACAATGCTGCAGGTGGCGGTCCACTCACCAGGGGAGCTCAAATGTCGCCTACAGAGCTGTATACAGACGCTACGGTAAACACG
TTAGCACGAGGATCCGGTTACGGTGCAGTCATTTTGGGTGATGATGGGCAGATGCGTGGGGCCATGGAATTTTTTGATGATACGCGTCACAATCCTTTAGTTGCGGAAGT
GAATATGCTTTAA
mRNA sequence
ATGAAATCAAGAAAGGCGTTCTTGATATTTCAGAGACGACAACGAAGTCGTTGTGGAGAAGAAGACTCAATGGAGCTTGCGGGTTGCGTAGAAGGCTCAAGTGGTTCGTC
GGCGCAGTGTAGGAGGAGCAAGGCCTATGATCGTGTTGAATGGTGCTTCCTCGAGAGCTTATTGATGAAGCTTGGTTTTCATTATGCATGGGTCAGTCTGATTATGGAGT
GTGTCAGAACCACTCGTTTATCTGTTTTACTAAATGGTGCTCCAACAGATGATAGCCTTGTTTTCTGCCAGGCATCAATTGAACAGGTGTGGATTCTGAGGAATATTTTG
GCTCAATATGAACAGGCATCAGGCCAGAAAATAAATATCGGGAAGTCCGCTTTATATTTCTCCCCAAATGTACATCACGATTTTAGAATTGTGCTATCTGATTTATTGGG
TATGCTAGTAGTTCCAAATTTGGGACGTTATCTGGGGGTACCGTCAGCATTCAGCAGAAGAAGGAGGGAGGACTTCCAAGATATTAGGCAAAGAGTCGGGCAAACACTTC
AGGGATGGAAGGGCCATATGTTCTCTATGGGAGGGAACGAAGTTCTGATTAAGAGTGTAGCTCAGTCCATTCCCACATATCTCATGAGTTGTTTTAAGCTTCCAAAAACT
TTGTGTTCGGATATTCACTCTATGATGGCTCGGTTGTGGTGGGGCTCTACTGACACAAAAAGGAAAAATTTGTCTCGGATTTGTTTACCAAAGGAGCTTGGAGGATTAAA
CTTCAGAGATATGGAGCTTTTCAATAAGGTTATCAAAGGGCGATATGCTAGTCAAGAATCTTTATTAATTGCCCCAGTTAAAAGCAATTGTTCTGTTTTCTGGAGGAGTT
TCGTTTGGGCTCGTGAACTGTTGACTAGTGGTGTTGTTGATGTAAAGGATCATTTGGTTGCCAACTTTATAACACCCTCGATGGCTTGGGATTTAAATAAATTACGTACT
GTGGCGCATGTGGATGATGTCAACACCATTGCAACCATTCCAATCAGCTCTATCCATGCAGAGGATATGTGGATTTGGCACTATTGCTCTCATGGGAAATATACAGTTCG
AAGTGGATATAAGCTTGCTCGTAATCTGTTGGCTGAACAGGCATCCCCTAGTTCTAATGAACAACGGGCATGGTGGAACACACTTTGGAAGTTGCACTTGCCACAAAAAG
TCAAGATGTTTATTTGGAAGGCATTTCATGAATGTTTACCAACTTCTTTTGGTTTGTGGAAGCGGGGTGTTGATGTATCACCTTGGTGTATTATTTGTGGGGCAAAGTGG
GAGACTATTGACCACGCATCGTTTGGTTTGGCTATCAAACACTTAGATGAGGAAATGTTTGGAATAGCCTGTATTACATTTTGGTCTTTATGGAATGACAGGAACAATTA
TAAGAACCAAATGCCAGTTATGGATGGGATCAAACGGAAGGAGAGGATATTGGAGTACTGGCAAGAAACACGTTGTGGGACTTCATCTCAAGTTGACAACAACATCCCAA
TTCCAAGCACTAGTGACACAACAATGAACAATGCTGCAGGTGGCGGTCCACTCACCAGGGGAGCTCAAATGTCGCCTACAGAGCTGTATACAGACGCTACGGTAAACACG
TTAGCACGAGGATCCGGTTACGGTGCAGTCATTTTGGGTGATGATGGGCAGATGCGTGGGGCCATGGAATTTTTTGATGATACGCGTCACAATCCTTTAGTTGCGGAAGT
GAATATGCTTTAA
Protein sequence
MKSRKAFLIFQRRQRSRCGEEDSMELAGCVEGSSGSSAQCRRSKAYDRVEWCFLESLLMKLGFHYAWVSLIMECVRTTRLSVLLNGAPTDDSLVFCQASIEQVWILRNIL
AQYEQASGQKINIGKSALYFSPNVHHDFRIVLSDLLGMLVVPNLGRYLGVPSAFSRRRREDFQDIRQRVGQTLQGWKGHMFSMGGNEVLIKSVAQSIPTYLMSCFKLPKT
LCSDIHSMMARLWWGSTDTKRKNLSRICLPKELGGLNFRDMELFNKVIKGRYASQESLLIAPVKSNCSVFWRSFVWARELLTSGVVDVKDHLVANFITPSMAWDLNKLRT
VAHVDDVNTIATIPISSIHAEDMWIWHYCSHGKYTVRSGYKLARNLLAEQASPSSNEQRAWWNTLWKLHLPQKVKMFIWKAFHECLPTSFGLWKRGVDVSPWCIICGAKW
ETIDHASFGLAIKHLDEEMFGIACITFWSLWNDRNNYKNQMPVMDGIKRKERILEYWQETRCGTSSQVDNNIPIPSTSDTTMNNAAGGGPLTRGAQMSPTELYTDATVNT
LARGSGYGAVILGDDGQMRGAMEFFDDTRHNPLVAEVNML
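
The CDS and protein sequences in this record can be checked against each other. The following is a minimal Python sketch, assuming the standard genetic code applies to this nuclear gene; the function name `translate` and the compact codon-table construction are illustrative and not part of CuGenDB. Only the first six and last seven codons of the CDS are used here, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order
# (first base varies slowest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, keeping '*' for stop codons."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First six codons of the CDS listed in this record.
print(translate("ATGAAATCAAGAAAGGCG"))     # MKSRKA
# Last seven codons of the CDS, including the stop codon.
print(translate("GCGGAAGTGAATATGCTTTAA"))  # AEVNML*
```

Both fragments agree with the start (MKSRKA) and end (AEVNML) of the protein sequence above. Passing the full CDS, with line breaks removed, to `translate` would allow the same comparison over the whole sequence.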