CuGenDBv2

Lag0034678 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0034678
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Transposon TX1 uncharacterized 149 kDa protein
Genome location: chr3:9720323..9722887
RNA-Seq Expression: Lag0034678
Synteny: Lag0034678
Gene Ontology terms: NA
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
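
The genome location above uses a `seqid:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the string can be parsed and the genomic span computed as follows. The names `GenomeLocation` and `parse_location` are hypothetical helpers introduced for illustration, not part of any database API.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'chr3:9720323..9722887'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end = match.groups()
    return GenomeLocation(seqid, int(start), int(end))


loc = parse_location("chr3:9720323..9722887")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the span for this gene comes out at 2565 bp.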


Homology
GenBank top hits (e value, % identity, alignment)
KAA0044449.1 hypothetical protein E6C27_scaffold46G001820 [Cucumis melo var. makuwa] (e value: 8.0e-43; % identity: 23.5)
Query:  LWIQKIANKRGDFLEVTKVQSSGGKRNLVIPADLEFNGWKSFR-----------------------------------SPEGEPSGGAHSRCLKSAKSSN
        LW+Q I N+RG   E+ +V   G K  +++P  L+  GW  F                                    S + E     ++  + S+ SS+
Subjt:  LWIQKIANKRGDFLEVTKVQSSGGKRNLVIPADLEFNGWKSFR-----------------------------------SPEGEPSGGAHSRCLKSAKSSN

Query:  FQDLWTQEAGEKDVRGINWNETLVITRRDFHEDWGRILDTIQNKLNRH---------------------------------TLLTPFFL--TKPSLNAPR
                + E D        T  + RR FH+DW +I+D ++++ ++                                  T + PF++   K S NA  
Subjt:  FQDLWTQEAGEKDVRGINWNETLVITRRDFHEDWGRILDTIQNKLNRH---------------------------------TLLTPFFL--TKPSLNAPR

Query:  EIWLVCLQQTGDGFRNIPLHLWSLATFKAIEDLFGGFIDYAKANSNLIECMEVAIKVRGNYCGFIPSEVRLIDRD-QTFVAHVVSFENHNLLIGKEVGKH
        +  ++        FR IPLH+W+L TF  I + +GGFID A  + N +E  E  IKV+ NY GF+P+ +++ D +   F+   V+      L  +    H
Subjt:  EIWLVCLQQTGDGFRNIPLHLWSLATFKAIEDLFGGFIDYAKANSNLIECMEVAIKVRGNYCGFIPSEVRLIDRD-QTFVAHVVSFENHNLLIGKEVGKH

Query:  GGFSPEAARIFLVRKWATAQTLWTF------------------------GVFKMGFFVQWLTLCTPVPLWVTTIQKD-PLPNENPTLKPGLHSKK--KKK
        G F+  AA  F   ++      +TF                           KMG           V       + D    N +  ++   H  +  +KK
Subjt:  GGFSPEAARIFLVRKWATAQTLWTF------------------------GVFKMGFFVQWLTLCTPVPLWVTTIQKD-PLPNENPTLKPGLHSKK--KKK

Query:  KGKAVAFEKGSN--SQPSMKRGQIFSTSLAERHV-------ENELEIGSEASFLSISSVEDCPTEDAEEPIVDKEDPPSEFYMCFREAESQDLMEGEERA
        KGK +   + +   S  + KR   FS+   E  +          L++GS  S+  + +    PT+  ++ +         + +  R  E +      ++ 
Subjt:  KGKAVAFEKGSN--SQPSMKRGQIFSTSLAERHV-------ENELEIGSEASFLSISSVEDCPTEDAEEPIVDKEDPPSEFYMCFREAESQDLMEGEERA

Query:  MILVQPQ-----VELGDTPSLAISDQQ--EGTGSEGEGDFPFFRQMVKNLKKWNMCITPISTKGSAAVKKKSREVMNQIRAWEKEMIDENISVEHGL---
           + P      V+LG    L+ +D    E          P    +VK+          +++  + A + + ++    IR   +E  D+ +S +  L   
Subjt:  MILVQPQ-----VELGDTPSLAISDQQ--EGTGSEGEGDFPFFRQMVKNLKKWNMCITPISTKGSAAVKKKSREVMNQIRAWEKEMIDENISVEHGL---

Query:  -----------MKTRLNSM-NRRLVKSIWNAKRIGW--VAIDAVGSAGGVLV--------------------------LWKEDVILVVDSILGLCSDLWC
                     ++ NS+ N R++  +     +G   V+ D +  A  V +                           W+E     ++++  +C   W 
Subjt:  -----------MKTRLNSM-NRRLVKSIWNAKRIGW--VAIDAVGSAGGVLV--------------------------LWKEDVILVVDSILGLCSDLWC

Query:  LAGDFNVVRWISEKSKGGRITRNMKTFNSFIDRAELIDIPMKNGRFTWSDLREEPTATRIDRFLISHSWLTAFKDMSLQKLSRPTSDHFPLLLSLGSQAW
        L GDFNV+RW  E S     + +MK FN+FI    LID P+ N +FTWS+LR + T +R+DRFL S  W   F   + + L+R TSDHFP++L   S +W
Subjt:  LAGDFNVVRWISEKSKGGRITRNMKTFNSFIDRAELIDIPMKNGRFTWSDLREEPTATRIDRFLISHSWLTAFKDMSLQKLSRPTSDHFPLLLSLGSQAW

Query:  GPSPFRFENVWLEHPNF
        GPSPFRF N +L+ P++
Subjt:  GPSPFRFENVWLEHPNF

KAA0056838.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 1.7e-40; % identity: 22.62)
Query:  LWIQKIANKRGDFL--EVTKVQSSGGKRNLVIPADLEFNGWKSFR------------------------------SPEGEPSGGAHSRCLKSAKSSNFQD
        +W++K  NK    +  E+ ++ + G K ++++P   +  GWK F                               S + + S  ++++ L  +   + + 
Subjt:  LWIQKIANKRGDFL--EVTKVQSSGGKRNLVIPADLEFNGWKSFR------------------------------SPEGEPSGGAHSRCLKSAKSSNFQD

Query:  LWTQEAGEKDVR-------------GINWNETLVITRRDFHEDWGRILDTIQNKLNRHTLLTPFFLTKPSLNAPRE-IWLVCLQQTGDG-----------
         +   + +   R             G ++ +T++ITRR FH+DW RI+ +++ +        PF   K  L    +   L+C  +  +G           
Subjt:  LWTQEAGEKDVR-------------GINWNETLVITRRDFHEDWGRILDTIQNKLNRHTLLTPFFLTKPSLNAPRE-IWLVCLQQTGDG-----------

Query:  ----------------------FRNIPLHLWSLATFKAIEDLFGGFIDYAKANSNLIECMEVAIKVRGNYCGFIPSEVRLID-RDQTFVAHVVSFENHNL
                              FR IPLHLW+  TF+ I    GGF+D AK    + + ++  IKVR NY GF+P+ + + D + + F+   V       
Subjt:  ----------------------FRNIPLHLWSLATFKAIEDLFGGFIDYAKANSNLIECMEVAIKVRGNYCGFIPSEVRLID-RDQTFVAHVVSFENHNL

Query:  LIGKEVGKHGGFSPEAARIFLVRKWATAQTLWTFGVFKM---------GFFVQWLTLCTPVPLWVTTIQKDPLPNENPTLKPGLHSKKKKKKGKAVAFEK
        L+ + V  HG F  +AA  F       A+T +T+  F+          G +    +    +       + +   +E       L S ++K+KGKA+    
Subjt:  LIGKEVGKHGGFSPEAARIFLVRKWATAQTLWTFGVFKM---------GFFVQWLTLCTPVPLWVTTIQKDPLPNENPTLKPGLHSKKKKKKGKAVAFEK

Query:  GSNSQPSMKRGQIFSTSLAE-------RHVENELEIGSEASFLSISSVED------CPTEDAEEPIV-----DKEDPPSEFYMCFR---EAESQDLMEGE
          N     KR +  S            +   +  EI ++   L IS++ D       P +  +  +      D ++   +  +  +   E   Q  +  +
Subjt:  GSNSQPSMKRGQIFSTSLAE-------RHVENELEIGSEASFLSISSVED------CPTEDAEEPIV-----DKEDPPSEFYMCFR---EAESQDLMEGE

Query:  ERAMILVQPQVELGDTPSLAISDQQEGTGSEGEGDFPFFRQMVKNLKK---WNMCITPISTKGSAAVKKKSREVMNQIRAWEKEMI----DENISVEHGL
           +  ++  ++  +   L   + Q   G+    D    + +  ++K+    N   +  + +G++   K   E M   RA++++++    +  + +    
Subjt:  ERAMILVQPQVELGDTPSLAISDQQEGTGSEGEGDFPFFRQMVKNLKK---WNMCITPISTKGSAAVKKKSREVMNQIRAWEKEMI----DENISVEHGL

Query:  MKTRLNSMNRRLVKSIWNAKRIGWVAIDAVGSAGGVLVLWKEDVILVVDSILG-----------------------------------------LCSDLW
             +S +  ++ S  N    G      +G  GG+LVLW +    V D  +G                                         LC   W
Subjt:  MKTRLNSMNRRLVKSIWNAKRIGWVAIDAVGSAGGVLVLWKEDVILVVDSILG-----------------------------------------LCSDLW

Query:  CLAGDFNVVRWISEKSKGGRITRNMKTFNSFIDRAELIDIPMKNGRFTWSDLREEPTATRIDRFLISHSWLTAFKDMSLQKLSRPTSDHFPLLLSLGSQA
         +AGDFN+VRW  E +      RNM  FN+FI   ELID P+ N  FTWS+LR  PT +R+DRFL+S  W  AF   + + L R  SDHFP+LL      
Subjt:  CLAGDFNVVRWISEKSKGGRITRNMKTFNSFIDRAELIDIPMKNGRFTWSDLREEPTATRIDRFLISHSWLTAFKDMSLQKLSRPTSDHFPLLLSLGSQA

Query:  WGPSPFRFENVWLEHPNF
        WGP PFR  N  L   +F
Subjt:  WGPSPFRFENVWLEHPNF

KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 6.3e-40; % identity: 22.7)
Query:  LWIQKIANKRGDFL--EVTKVQSSGGKRNLVIPADLEFNGWKSFR------------------------------SPEGEPSGGAHSRCLKSAKSSNFQD
        +W++K  NK    +  E+ ++ + G K ++++P   +  GWKSF                               S + + S  ++++ L  +   + + 
Subjt:  LWIQKIANKRGDFL--EVTKVQSSGGKRNLVIPADLEFNGWKSFR------------------------------SPEGEPSGGAHSRCLKSAKSSNFQD

Query:  LWTQEAGEKDVR-------------GINWNETLVITRRDFHEDWGRILDTIQNKLNRHTLLTPFFLTKPSLNA-PREIWLVCLQQTGDG-----------
         +   + +   R             G ++ +T++ITRR FH+DW RI+ +++ +        PF   K  L   P    L+C  +  +G           
Subjt:  LWTQEAGEKDVR-------------GINWNETLVITRRDFHEDWGRILDTIQNKLNRHTLLTPFFLTKPSLNA-PREIWLVCLQQTGDG-----------

Query:  ----------------------FRNIPLHLWSLATFKAIEDLFGGFIDYAKANSNLIECMEVAIKVRGNYCGFIPSEVRLID-RDQTFVAHVVSFENHNL
                              FR IPLHLW+  TF+ I    GGF+D AK    + + ++  IKVR NY GF+P+ + + D + + F+   V       
Subjt:  ----------------------FRNIPLHLWSLATFKAIEDLFGGFIDYAKANSNLIECMEVAIKVRGNYCGFIPSEVRLID-RDQTFVAHVVSFENHNL

Query:  LIGKEVGKHGGFSPEAARIFLVRKWATAQTLWTFGVFKM---------GFFVQWLTLCTPVPLWVTTIQKDPLPNENPTLKPGLHSKKKKKKGKAVAFEK
        L+ + V  HG F  +AA  F       A+T +T+  F+          G +    +    +       + +   +E       L S ++K+KGKA+    
Subjt:  LIGKEVGKHGGFSPEAARIFLVRKWATAQTLWTFGVFKM---------GFFVQWLTLCTPVPLWVTTIQKDPLPNENPTLKPGLHSKKKKKKGKAVAFEK

Query:  GSNSQPSMKRGQIFSTSLAE-------RHVENELEIGSEASFLSISSVED------CPTEDAEEPIV-----DKEDPPSEFYMCFR---EAESQDLMEGE
          +     KR +  S            +   +  EI ++   L IS++ D       P +  +  +      D ++   +  +  +   E   Q  +  +
Subjt:  GSNSQPSMKRGQIFSTSLAE-------RHVENELEIGSEASFLSISSVED------CPTEDAEEPIV-----DKEDPPSEFYMCFR---EAESQDLMEGE

Query:  ERAMILVQPQVELGDTPSLAISDQQEGTGSEGEGDFPFFRQMVKNLKK---WNMCITPISTKGSAAVKKKSREVMNQIRAWEKEMIDENISVEHGLMKTR
           +  ++  ++  +   L   + Q   G+    D    + +  ++K+    N   +  + +G++   K   E +   RA++++++      E  L    
Subjt:  ERAMILVQPQVELGDTPSLAISDQQEGTGSEGEGDFPFFRQMVKNLKK---WNMCITPISTKGSAAVKKKSREVMNQIRAWEKEMIDENISVEHGLMKTR

Query:  LNSM-NRRLVKSIWNAKRIGWVAIDAVGSAGGVLVLWKEDVILVVDSILG-----------------------------------------LCSDLWCLA
         N + +      I + + +       +G  GG+LVLW +    V D  +G                                         LC   W +A
Subjt:  LNSM-NRRLVKSIWNAKRIGWVAIDAVGSAGGVLVLWKEDVILVVDSILG-----------------------------------------LCSDLWCLA

Query:  GDFNVVRWISEKSKGGRITRNMKTFNSFIDRAELIDIPMKNGRFTWSDLREEPTATRIDRFLISHSWLTAFKDMSLQKLSRPTSDHFPLLLSLGSQAWGP
        GDFN+VRW  E +      RNM  FN+FI   ELID P  N  FTWS+LR  PT +R+DRFL+S  W  AF   + + L R  SDHFP+LL      WGP
Subjt:  GDFNVVRWISEKSKGGRITRNMKTFNSFIDRAELIDIPMKNGRFTWSDLREEPTATRIDRFLISHSWLTAFKDMSLQKLSRPTSDHFPLLLSLGSQAWGP

Query:  SPFRFENVWLEHPNF
         PFR  N  L    F
Subjt:  SPFRFENVWLEHPNF

RVX17353.1 hypothetical protein CK203_003781 [Vitis vinifera] (e value: 2.6e-41; % identity: 38.96)
Query:  KWNMCITPISTKGSAAVKKKSREVMNQIRAWEKEMIDENISVEHGLMKTRLNSMNRRLVKSIWNAKRIGWVAIDAVGSAGGVLVLW--------KEDVIL
        K++M I   +T+G  + KKK R V + +R+ + +++           +T+    +RR V S+W A+   W A+ A G++GG+L++W        +E + +
Subjt:  KWNMCITPISTKGSAAVKKKSREVMNQIRAWEKEMIDENISVEHGLMKTRLNSMNRRLVKSIWNAKRIGWVAIDAVGSAGGVLVLW--------KEDVIL

Query:  VVDSILGLCSDLWCLAGDFNVVRWISEKSKGGRITRNMKTFNSFIDRAELIDIPMKNGRFTWSDLREEPTATRIDRFLISHSWLTAFKDMSLQKLSRPTS
         +  I GL S  WC+ GDFNV+R  SEK  G R+T +MK F+ FI   ELID+P+++  FTWS+++  P   R+DRFL S+ W   F       L R TS
Subjt:  VVDSILGLCSDLWCLAGDFNVVRWISEKSKGGRITRNMKTFNSFIDRAELIDIPMKNGRFTWSDLREEPTATRIDRFLISHSWLTAFKDMSLQKLSRPTS

Query:  DHFPLLLSLGSQAWGPSPFRFENVWLEHPNF
        DH+P++L      WGP+PFRFEN+WL+HP+F
Subjt:  DHFPLLLSLGSQAWGPSPFRFENVWLEHPNF

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 6.3e-40; % identity: 22.7)
Query:  LWIQKIANKRGDFL--EVTKVQSSGGKRNLVIPADLEFNGWKSFR------------------------------SPEGEPSGGAHSRCLKSAKSSNFQD
        +W++K  NK    +  E+ ++ + G K ++++P   +  GWKSF                               S + + S  ++++ L  +   + + 
Subjt:  LWIQKIANKRGDFL--EVTKVQSSGGKRNLVIPADLEFNGWKSFR------------------------------SPEGEPSGGAHSRCLKSAKSSNFQD

Query:  LWTQEAGEKDVR-------------GINWNETLVITRRDFHEDWGRILDTIQNKLNRHTLLTPFFLTKPSLNA-PREIWLVCLQQTGDG-----------
         +   + +   R             G ++ +T++ITRR FH+DW RI+ +++ +        PF   K  L   P    L+C  +  +G           
Subjt:  LWTQEAGEKDVR-------------GINWNETLVITRRDFHEDWGRILDTIQNKLNRHTLLTPFFLTKPSLNA-PREIWLVCLQQTGDG-----------

Query:  ----------------------FRNIPLHLWSLATFKAIEDLFGGFIDYAKANSNLIECMEVAIKVRGNYCGFIPSEVRLID-RDQTFVAHVVSFENHNL
                              FR IPLHLW+  TF+ I    GGF+D AK    + + ++  IKVR NY GF+P+ + + D + + F+   V       
Subjt:  ----------------------FRNIPLHLWSLATFKAIEDLFGGFIDYAKANSNLIECMEVAIKVRGNYCGFIPSEVRLID-RDQTFVAHVVSFENHNL

Query:  LIGKEVGKHGGFSPEAARIFLVRKWATAQTLWTFGVFKM---------GFFVQWLTLCTPVPLWVTTIQKDPLPNENPTLKPGLHSKKKKKKGKAVAFEK
        L+ + V  HG F  +AA  F       A+T +T+  F+          G +    +    +       + +   +E       L S ++K+KGKA+    
Subjt:  LIGKEVGKHGGFSPEAARIFLVRKWATAQTLWTFGVFKM---------GFFVQWLTLCTPVPLWVTTIQKDPLPNENPTLKPGLHSKKKKKKGKAVAFEK

Query:  GSNSQPSMKRGQIFSTSLAE-------RHVENELEIGSEASFLSISSVED------CPTEDAEEPIV-----DKEDPPSEFYMCFR---EAESQDLMEGE
          +     KR +  S            +   +  EI ++   L IS++ D       P +  +  +      D ++   +  +  +   E   Q  +  +
Subjt:  GSNSQPSMKRGQIFSTSLAE-------RHVENELEIGSEASFLSISSVED------CPTEDAEEPIV-----DKEDPPSEFYMCFR---EAESQDLMEGE

Query:  ERAMILVQPQVELGDTPSLAISDQQEGTGSEGEGDFPFFRQMVKNLKK---WNMCITPISTKGSAAVKKKSREVMNQIRAWEKEMIDENISVEHGLMKTR
           +  ++  ++  +   L   + Q   G+    D    + +  ++K+    N   +  + +G++   K   E +   RA++++++      E  L    
Subjt:  ERAMILVQPQVELGDTPSLAISDQQEGTGSEGEGDFPFFRQMVKNLKK---WNMCITPISTKGSAAVKKKSREVMNQIRAWEKEMIDENISVEHGLMKTR

Query:  LNSM-NRRLVKSIWNAKRIGWVAIDAVGSAGGVLVLWKEDVILVVDSILG-----------------------------------------LCSDLWCLA
         N + +      I + + +       +G  GG+LVLW +    V D  +G                                         LC   W +A
Subjt:  LNSM-NRRLVKSIWNAKRIGWVAIDAVGSAGGVLVLWKEDVILVVDSILG-----------------------------------------LCSDLWCLA

Query:  GDFNVVRWISEKSKGGRITRNMKTFNSFIDRAELIDIPMKNGRFTWSDLREEPTATRIDRFLISHSWLTAFKDMSLQKLSRPTSDHFPLLLSLGSQAWGP
        GDFN+VRW  E +      RNM  FN+FI   ELID P  N  FTWS+LR  PT +R+DRFL+S  W  AF   + + L R  SDHFP+LL      WGP
Subjt:  GDFNVVRWISEKSKGGRITRNMKTFNSFIDRAELIDIPMKNGRFTWSDLREEPTATRIDRFLISHSWLTAFKDMSLQKLSRPTSDHFPLLLSLGSQAWGP

Query:  SPFRFENVWLEHPNF
         PFR  N  L    F
Subjt:  SPFRFENVWLEHPNF

TrEMBL top hits (e value, % identity, alignment)
A0A438K826 Endo/exonuclease/phosphatase domain-containing protein (e value: 1.2e-41; % identity: 38.96)
Query:  KWNMCITPISTKGSAAVKKKSREVMNQIRAWEKEMIDENISVEHGLMKTRLNSMNRRLVKSIWNAKRIGWVAIDAVGSAGGVLVLW--------KEDVIL
        K++M I   +T+G  + KKK R V + +R+ + +++           +T+    +RR V S+W A+   W A+ A G++GG+L++W        +E + +
Subjt:  KWNMCITPISTKGSAAVKKKSREVMNQIRAWEKEMIDENISVEHGLMKTRLNSMNRRLVKSIWNAKRIGWVAIDAVGSAGGVLVLW--------KEDVIL

Query:  VVDSILGLCSDLWCLAGDFNVVRWISEKSKGGRITRNMKTFNSFIDRAELIDIPMKNGRFTWSDLREEPTATRIDRFLISHSWLTAFKDMSLQKLSRPTS
         +  I GL S  WC+ GDFNV+R  SEK  G R+T +MK F+ FI   ELID+P+++  FTWS+++  P   R+DRFL S+ W   F       L R TS
Subjt:  VVDSILGLCSDLWCLAGDFNVVRWISEKSKGGRITRNMKTFNSFIDRAELIDIPMKNGRFTWSDLREEPTATRIDRFLISHSWLTAFKDMSLQKLSRPTS

Query:  DHFPLLLSLGSQAWGPSPFRFENVWLEHPNF
        DH+P++L      WGP+PFRFEN+WL+HP+F
Subjt:  DHFPLLLSLGSQAWGPSPFRFENVWLEHPNF

A0A5A7TTA1 DUF4283 domain-containing protein (e value: 3.9e-43; % identity: 23.5)
Query:  LWIQKIANKRGDFLEVTKVQSSGGKRNLVIPADLEFNGWKSFR-----------------------------------SPEGEPSGGAHSRCLKSAKSSN
        LW+Q I N+RG   E+ +V   G K  +++P  L+  GW  F                                    S + E     ++  + S+ SS+
Subjt:  LWIQKIANKRGDFLEVTKVQSSGGKRNLVIPADLEFNGWKSFR-----------------------------------SPEGEPSGGAHSRCLKSAKSSN

Query:  FQDLWTQEAGEKDVRGINWNETLVITRRDFHEDWGRILDTIQNKLNRH---------------------------------TLLTPFFL--TKPSLNAPR
                + E D        T  + RR FH+DW +I+D ++++ ++                                  T + PF++   K S NA  
Subjt:  FQDLWTQEAGEKDVRGINWNETLVITRRDFHEDWGRILDTIQNKLNRH---------------------------------TLLTPFFL--TKPSLNAPR

Query:  EIWLVCLQQTGDGFRNIPLHLWSLATFKAIEDLFGGFIDYAKANSNLIECMEVAIKVRGNYCGFIPSEVRLIDRD-QTFVAHVVSFENHNLLIGKEVGKH
        +  ++        FR IPLH+W+L TF  I + +GGFID A  + N +E  E  IKV+ NY GF+P+ +++ D +   F+   V+      L  +    H
Subjt:  EIWLVCLQQTGDGFRNIPLHLWSLATFKAIEDLFGGFIDYAKANSNLIECMEVAIKVRGNYCGFIPSEVRLIDRD-QTFVAHVVSFENHNLLIGKEVGKH

Query:  GGFSPEAARIFLVRKWATAQTLWTF------------------------GVFKMGFFVQWLTLCTPVPLWVTTIQKD-PLPNENPTLKPGLHSKK--KKK
        G F+  AA  F   ++      +TF                           KMG           V       + D    N +  ++   H  +  +KK
Subjt:  GGFSPEAARIFLVRKWATAQTLWTF------------------------GVFKMGFFVQWLTLCTPVPLWVTTIQKD-PLPNENPTLKPGLHSKK--KKK

Query:  KGKAVAFEKGSN--SQPSMKRGQIFSTSLAERHV-------ENELEIGSEASFLSISSVEDCPTEDAEEPIVDKEDPPSEFYMCFREAESQDLMEGEERA
        KGK +   + +   S  + KR   FS+   E  +          L++GS  S+  + +    PT+  ++ +         + +  R  E +      ++ 
Subjt:  KGKAVAFEKGSN--SQPSMKRGQIFSTSLAERHV-------ENELEIGSEASFLSISSVEDCPTEDAEEPIVDKEDPPSEFYMCFREAESQDLMEGEERA

Query:  MILVQPQ-----VELGDTPSLAISDQQ--EGTGSEGEGDFPFFRQMVKNLKKWNMCITPISTKGSAAVKKKSREVMNQIRAWEKEMIDENISVEHGL---
           + P      V+LG    L+ +D    E          P    +VK+          +++  + A + + ++    IR   +E  D+ +S +  L   
Subjt:  MILVQPQ-----VELGDTPSLAISDQQ--EGTGSEGEGDFPFFRQMVKNLKKWNMCITPISTKGSAAVKKKSREVMNQIRAWEKEMIDENISVEHGL---

Query:  -----------MKTRLNSM-NRRLVKSIWNAKRIGW--VAIDAVGSAGGVLV--------------------------LWKEDVILVVDSILGLCSDLWC
                     ++ NS+ N R++  +     +G   V+ D +  A  V +                           W+E     ++++  +C   W 
Subjt:  -----------MKTRLNSM-NRRLVKSIWNAKRIGW--VAIDAVGSAGGVLV--------------------------LWKEDVILVVDSILGLCSDLWC

Query:  LAGDFNVVRWISEKSKGGRITRNMKTFNSFIDRAELIDIPMKNGRFTWSDLREEPTATRIDRFLISHSWLTAFKDMSLQKLSRPTSDHFPLLLSLGSQAW
        L GDFNV+RW  E S     + +MK FN+FI    LID P+ N +FTWS+LR + T +R+DRFL S  W   F   + + L+R TSDHFP++L   S +W
Subjt:  LAGDFNVVRWISEKSKGGRITRNMKTFNSFIDRAELIDIPMKNGRFTWSDLREEPTATRIDRFLISHSWLTAFKDMSLQKLSRPTSDHFPLLLSLGSQAW

Query:  GPSPFRFENVWLEHPNF
        GPSPFRF N +L+ P++
Subjt:  GPSPFRFENVWLEHPNF

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein (e value: 3.1e-40; % identity: 22.7)
Query:  LWIQKIANKRGDFL--EVTKVQSSGGKRNLVIPADLEFNGWKSFR------------------------------SPEGEPSGGAHSRCLKSAKSSNFQD
        +W++K  NK    +  E+ ++ + G K ++++P   +  GWKSF                               S + + S  ++++ L  +   + + 
Subjt:  LWIQKIANKRGDFL--EVTKVQSSGGKRNLVIPADLEFNGWKSFR------------------------------SPEGEPSGGAHSRCLKSAKSSNFQD

Query:  LWTQEAGEKDVR-------------GINWNETLVITRRDFHEDWGRILDTIQNKLNRHTLLTPFFLTKPSLNA-PREIWLVCLQQTGDG-----------
         +   + +   R             G ++ +T++ITRR FH+DW RI+ +++ +        PF   K  L   P    L+C  +  +G           
Subjt:  LWTQEAGEKDVR-------------GINWNETLVITRRDFHEDWGRILDTIQNKLNRHTLLTPFFLTKPSLNA-PREIWLVCLQQTGDG-----------

Query:  ----------------------FRNIPLHLWSLATFKAIEDLFGGFIDYAKANSNLIECMEVAIKVRGNYCGFIPSEVRLID-RDQTFVAHVVSFENHNL
                              FR IPLHLW+  TF+ I    GGF+D AK    + + ++  IKVR NY GF+P+ + + D + + F+   V       
Subjt:  ----------------------FRNIPLHLWSLATFKAIEDLFGGFIDYAKANSNLIECMEVAIKVRGNYCGFIPSEVRLID-RDQTFVAHVVSFENHNL

Query:  LIGKEVGKHGGFSPEAARIFLVRKWATAQTLWTFGVFKM---------GFFVQWLTLCTPVPLWVTTIQKDPLPNENPTLKPGLHSKKKKKKGKAVAFEK
        L+ + V  HG F  +AA  F       A+T +T+  F+          G +    +    +       + +   +E       L S ++K+KGKA+    
Subjt:  LIGKEVGKHGGFSPEAARIFLVRKWATAQTLWTFGVFKM---------GFFVQWLTLCTPVPLWVTTIQKDPLPNENPTLKPGLHSKKKKKKGKAVAFEK

Query:  GSNSQPSMKRGQIFSTSLAE-------RHVENELEIGSEASFLSISSVED------CPTEDAEEPIV-----DKEDPPSEFYMCFR---EAESQDLMEGE
          +     KR +  S            +   +  EI ++   L IS++ D       P +  +  +      D ++   +  +  +   E   Q  +  +
Subjt:  GSNSQPSMKRGQIFSTSLAE-------RHVENELEIGSEASFLSISSVED------CPTEDAEEPIV-----DKEDPPSEFYMCFR---EAESQDLMEGE

Query:  ERAMILVQPQVELGDTPSLAISDQQEGTGSEGEGDFPFFRQMVKNLKK---WNMCITPISTKGSAAVKKKSREVMNQIRAWEKEMIDENISVEHGLMKTR
           +  ++  ++  +   L   + Q   G+    D    + +  ++K+    N   +  + +G++   K   E +   RA++++++      E  L    
Subjt:  ERAMILVQPQVELGDTPSLAISDQQEGTGSEGEGDFPFFRQMVKNLKK---WNMCITPISTKGSAAVKKKSREVMNQIRAWEKEMIDENISVEHGLMKTR

Query:  LNSM-NRRLVKSIWNAKRIGWVAIDAVGSAGGVLVLWKEDVILVVDSILG-----------------------------------------LCSDLWCLA
         N + +      I + + +       +G  GG+LVLW +    V D  +G                                         LC   W +A
Subjt:  LNSM-NRRLVKSIWNAKRIGWVAIDAVGSAGGVLVLWKEDVILVVDSILG-----------------------------------------LCSDLWCLA

Query:  GDFNVVRWISEKSKGGRITRNMKTFNSFIDRAELIDIPMKNGRFTWSDLREEPTATRIDRFLISHSWLTAFKDMSLQKLSRPTSDHFPLLLSLGSQAWGP
        GDFN+VRW  E +      RNM  FN+FI   ELID P  N  FTWS+LR  PT +R+DRFL+S  W  AF   + + L R  SDHFP+LL      WGP
Subjt:  GDFNVVRWISEKSKGGRITRNMKTFNSFIDRAELIDIPMKNGRFTWSDLREEPTATRIDRFLISHSWLTAFKDMSLQKLSRPTSDHFPLLLSLGSQAWGP

Query:  SPFRFENVWLEHPNF
         PFR  N  L    F
Subjt:  SPFRFENVWLEHPNF

A0A5D3BKT8 LINE-1 retrotransposable element ORF2 protein (e value: 8.1e-41; % identity: 22.62)
Query:  LWIQKIANKRGDFL--EVTKVQSSGGKRNLVIPADLEFNGWKSFR------------------------------SPEGEPSGGAHSRCLKSAKSSNFQD
        +W++K  NK    +  E+ ++ + G K ++++P   +  GWK F                               S + + S  ++++ L  +   + + 
Subjt:  LWIQKIANKRGDFL--EVTKVQSSGGKRNLVIPADLEFNGWKSFR------------------------------SPEGEPSGGAHSRCLKSAKSSNFQD

Query:  LWTQEAGEKDVR-------------GINWNETLVITRRDFHEDWGRILDTIQNKLNRHTLLTPFFLTKPSLNAPRE-IWLVCLQQTGDG-----------
         +   + +   R             G ++ +T++ITRR FH+DW RI+ +++ +        PF   K  L    +   L+C  +  +G           
Subjt:  LWTQEAGEKDVR-------------GINWNETLVITRRDFHEDWGRILDTIQNKLNRHTLLTPFFLTKPSLNAPRE-IWLVCLQQTGDG-----------

Query:  ----------------------FRNIPLHLWSLATFKAIEDLFGGFIDYAKANSNLIECMEVAIKVRGNYCGFIPSEVRLID-RDQTFVAHVVSFENHNL
                              FR IPLHLW+  TF+ I    GGF+D AK    + + ++  IKVR NY GF+P+ + + D + + F+   V       
Subjt:  ----------------------FRNIPLHLWSLATFKAIEDLFGGFIDYAKANSNLIECMEVAIKVRGNYCGFIPSEVRLID-RDQTFVAHVVSFENHNL

Query:  LIGKEVGKHGGFSPEAARIFLVRKWATAQTLWTFGVFKM---------GFFVQWLTLCTPVPLWVTTIQKDPLPNENPTLKPGLHSKKKKKKGKAVAFEK
        L+ + V  HG F  +AA  F       A+T +T+  F+          G +    +    +       + +   +E       L S ++K+KGKA+    
Subjt:  LIGKEVGKHGGFSPEAARIFLVRKWATAQTLWTFGVFKM---------GFFVQWLTLCTPVPLWVTTIQKDPLPNENPTLKPGLHSKKKKKKGKAVAFEK

Query:  GSNSQPSMKRGQIFSTSLAE-------RHVENELEIGSEASFLSISSVED------CPTEDAEEPIV-----DKEDPPSEFYMCFR---EAESQDLMEGE
          N     KR +  S            +   +  EI ++   L IS++ D       P +  +  +      D ++   +  +  +   E   Q  +  +
Subjt:  GSNSQPSMKRGQIFSTSLAE-------RHVENELEIGSEASFLSISSVED------CPTEDAEEPIV-----DKEDPPSEFYMCFR---EAESQDLMEGE

Query:  ERAMILVQPQVELGDTPSLAISDQQEGTGSEGEGDFPFFRQMVKNLKK---WNMCITPISTKGSAAVKKKSREVMNQIRAWEKEMI----DENISVEHGL
           +  ++  ++  +   L   + Q   G+    D    + +  ++K+    N   +  + +G++   K   E M   RA++++++    +  + +    
Subjt:  ERAMILVQPQVELGDTPSLAISDQQEGTGSEGEGDFPFFRQMVKNLKK---WNMCITPISTKGSAAVKKKSREVMNQIRAWEKEMI----DENISVEHGL

Query:  MKTRLNSMNRRLVKSIWNAKRIGWVAIDAVGSAGGVLVLWKEDVILVVDSILG-----------------------------------------LCSDLW
             +S +  ++ S  N    G      +G  GG+LVLW +    V D  +G                                         LC   W
Subjt:  MKTRLNSMNRRLVKSIWNAKRIGWVAIDAVGSAGGVLVLWKEDVILVVDSILG-----------------------------------------LCSDLW

Query:  CLAGDFNVVRWISEKSKGGRITRNMKTFNSFIDRAELIDIPMKNGRFTWSDLREEPTATRIDRFLISHSWLTAFKDMSLQKLSRPTSDHFPLLLSLGSQA
         +AGDFN+VRW  E +      RNM  FN+FI   ELID P+ N  FTWS+LR  PT +R+DRFL+S  W  AF   + + L R  SDHFP+LL      
Subjt:  CLAGDFNVVRWISEKSKGGRITRNMKTFNSFIDRAELIDIPMKNGRFTWSDLREEPTATRIDRFLISHSWLTAFKDMSLQKLSRPTSDHFPLLLSLGSQA

Query:  WGPSPFRFENVWLEHPNF
        WGP PFR  N  L   +F
Subjt:  WGPSPFRFENVWLEHPNF

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein (e value: 3.1e-40; % identity: 22.7)
Query:  LWIQKIANKRGDFL--EVTKVQSSGGKRNLVIPADLEFNGWKSFR------------------------------SPEGEPSGGAHSRCLKSAKSSNFQD
        +W++K  NK    +  E+ ++ + G K ++++P   +  GWKSF                               S + + S  ++++ L  +   + + 
Subjt:  LWIQKIANKRGDFL--EVTKVQSSGGKRNLVIPADLEFNGWKSFR------------------------------SPEGEPSGGAHSRCLKSAKSSNFQD

Query:  LWTQEAGEKDVR-------------GINWNETLVITRRDFHEDWGRILDTIQNKLNRHTLLTPFFLTKPSLNA-PREIWLVCLQQTGDG-----------
         +   + +   R             G ++ +T++ITRR FH+DW RI+ +++ +        PF   K  L   P    L+C  +  +G           
Subjt:  LWTQEAGEKDVR-------------GINWNETLVITRRDFHEDWGRILDTIQNKLNRHTLLTPFFLTKPSLNA-PREIWLVCLQQTGDG-----------

Query:  ----------------------FRNIPLHLWSLATFKAIEDLFGGFIDYAKANSNLIECMEVAIKVRGNYCGFIPSEVRLID-RDQTFVAHVVSFENHNL
                              FR IPLHLW+  TF+ I    GGF+D AK    + + ++  IKVR NY GF+P+ + + D + + F+   V       
Subjt:  ----------------------FRNIPLHLWSLATFKAIEDLFGGFIDYAKANSNLIECMEVAIKVRGNYCGFIPSEVRLID-RDQTFVAHVVSFENHNL

Query:  LIGKEVGKHGGFSPEAARIFLVRKWATAQTLWTFGVFKM---------GFFVQWLTLCTPVPLWVTTIQKDPLPNENPTLKPGLHSKKKKKKGKAVAFEK
        L+ + V  HG F  +AA  F       A+T +T+  F+          G +    +    +       + +   +E       L S ++K+KGKA+    
Subjt:  LIGKEVGKHGGFSPEAARIFLVRKWATAQTLWTFGVFKM---------GFFVQWLTLCTPVPLWVTTIQKDPLPNENPTLKPGLHSKKKKKKGKAVAFEK

Query:  GSNSQPSMKRGQIFSTSLAE-------RHVENELEIGSEASFLSISSVED------CPTEDAEEPIV-----DKEDPPSEFYMCFR---EAESQDLMEGE
          +     KR +  S            +   +  EI ++   L IS++ D       P +  +  +      D ++   +  +  +   E   Q  +  +
Subjt:  GSNSQPSMKRGQIFSTSLAE-------RHVENELEIGSEASFLSISSVED------CPTEDAEEPIV-----DKEDPPSEFYMCFR---EAESQDLMEGE

Query:  ERAMILVQPQVELGDTPSLAISDQQEGTGSEGEGDFPFFRQMVKNLKK---WNMCITPISTKGSAAVKKKSREVMNQIRAWEKEMIDENISVEHGLMKTR
           +  ++  ++  +   L   + Q   G+    D    + +  ++K+    N   +  + +G++   K   E +   RA++++++      E  L    
Subjt:  ERAMILVQPQVELGDTPSLAISDQQEGTGSEGEGDFPFFRQMVKNLKK---WNMCITPISTKGSAAVKKKSREVMNQIRAWEKEMIDENISVEHGLMKTR

Query:  LNSM-NRRLVKSIWNAKRIGWVAIDAVGSAGGVLVLWKEDVILVVDSILG-----------------------------------------LCSDLWCLA
         N + +      I + + +       +G  GG+LVLW +    V D  +G                                         LC   W +A
Subjt:  LNSM-NRRLVKSIWNAKRIGWVAIDAVGSAGGVLVLWKEDVILVVDSILG-----------------------------------------LCSDLWCLA

Query:  GDFNVVRWISEKSKGGRITRNMKTFNSFIDRAELIDIPMKNGRFTWSDLREEPTATRIDRFLISHSWLTAFKDMSLQKLSRPTSDHFPLLLSLGSQAWGP
        GDFN+VRW  E +      RNM  FN+FI   ELID P  N  FTWS+LR  PT +R+DRFL+S  W  AF   + + L R  SDHFP+LL      WGP
Subjt:  GDFNVVRWISEKSKGGRITRNMKTFNSFIDRAELIDIPMKNGRFTWSDLREEPTATRIDRFLISHSWLTAFKDMSLQKLSRPTSDHFPLLLSLGSQAWGP

Query:  SPFRFENVWLEHPNF
         PFR  N  L    F
Subjt:  SPFRFENVWLEHPNF

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein (e value: 3.7e-06; % identity: 22.67)
Query:  GGVLVLWKEDVILVVDSILGLCSDLWCLAGDFNVVRWISEKSKGGRIT---RNMKTFNSFIDRAELIDIPMKNGRFTWSDLREE-PTATRIDRFLISHSW
        G + ++W   V ++V         L  L GDF+ +   S+     + +   R ++ F + +  ++L+DIP +   +TWS+ +++ P   ++DR + +  W
Subjt:  GGVLVLWKEDVILVVDSILGLCSDLWCLAGDFNVVRWISEKSKGGRIT---RNMKTFNSFIDRAELIDIPMKNGRFTWSDLREE-PTATRIDRFLISHSW

Query:  LTAFKDMSLQKLSRPTSDHFPLLLSLGS-QAWGPSPFRFENVWLEHPNFM
         ++F            SDH P ++ L +        FR+ +    HP F+
Subjt:  LTAFKDMSLQKLSRPTSDHFPLLLSLGS-QAWGPSPFRFENVWLEHPNFM


Sequences
CDS sequence
ATGCAAAAATTATTTAGGAAAACGGACTGCAACGGAGGCTTCCTGTGGATTCAAAAAATAGCTAACAAAAGAGGTGATTTCTTAGAAGTGACGAAGGTTCAGAGCTCGGG
AGGAAAGCGAAATCTGGTCATCCCTGCAGACTTGGAATTCAATGGATGGAAGTCTTTTCGCAGCCCTGAGGGGGAACCCTCGGGGGGAGCCCACAGTAGATGCTTGAAGA
GCGCGAAATCATCGAACTTCCAGGACCTTTGGACGCAGGAAGCTGGTGAAAAGGATGTTCGGGGTATAAATTGGAATGAGACTTTGGTCATCACTAGAAGAGACTTCCAT
GAAGACTGGGGGAGGATTCTTGACACAATCCAAAACAAACTCAACAGACATACATTATTAACCCCTTTCTTCCTGACAAAGCCTTCCTTAAATGCCCCACGGGAGATATG
GCTCGTCTGCTTACAACAAACAGGGGATGGATTTAGGAACATTCCTCTGCATTTATGGAGCTTGGCCACATTTAAAGCGATTGAGGATTTGTTTGGGGGCTTCATTGATT
ATGCAAAAGCCAACTCGAATCTCATTGAATGTATGGAAGTGGCTATAAAGGTTCGAGGAAATTACTGTGGATTCATTCCCTCGGAAGTTCGTCTGATAGATAGGGACCAA
ACCTTCGTTGCCCATGTAGTGTCTTTTGAGAACCATAACCTTCTGATCGGAAAAGAGGTCGGGAAACATGGAGGATTCTCGCCAGAGGCAGCGAGGATTTTTTTGGTTCG
GAAATGGGCTACGGCCCAAACCCTGTGGACATTTGGCGTGTTCAAGATGGGATTTTTTGTCCAGTGGTTAACATTATGTACCCCTGTCCCACTTTGGGTGACTACAATCC
AAAAAGACCCTCTCCCCAACGAGAACCCCACCCTGAAACCCGGACTTCATTCGAAAAAGAAAAAGAAGAAAGGGAAAGCGGTGGCTTTTGAGAAAGGCTCCAACTCCCAA
CCTTCCATGAAAAGGGGGCAAATCTTCTCGACAAGTCTAGCTGAGAGACATGTAGAAAATGAGTTAGAAATCGGTTCAGAAGCTTCTTTCTTGAGCATCAGTAGTGTGGA
AGATTGTCCGACAGAAGATGCTGAAGAGCCCATTGTTGATAAGGAAGATCCTCCTTCTGAGTTCTACATGTGTTTTAGAGAAGCTGAGTCGCAGGATCTTATGGAGGGTG
AAGAGAGAGCCATGATTTTGGTTCAGCCTCAGGTAGAGTTAGGAGATACTCCCTCTTTAGCGATCAGTGACCAGCAGGAAGGAACCGGGTCAGAAGGGGAGGGGGATTTC
CCGTTCTTTAGGCAAATGGTGAAGAATCTAAAGAAATGGAACATGTGTATTACACCAATCTCTACCAAGGGAAGTGCGGCGGTTAAGAAAAAGTCGAGGGAGGTCATGAA
TCAGATTCGGGCTTGGGAGAAGGAGATGATTGATGAGAATATTAGCGTAGAGCATGGCCTCATGAAGACTAGATTGAACTCGATGAATAGGCGGCTTGTTAAATCTATTT
GGAATGCCAAGCGCATAGGGTGGGTGGCCATAGATGCTGTGGGCTCGGCTGGAGGCGTGCTGGTTTTGTGGAAAGAAGATGTCATCTTGGTTGTAGATTCAATTTTGGGT
CTCTGTTCGGATTTATGGTGCTTAGCCGGCGACTTCAATGTGGTCAGATGGATTTCGGAAAAATCAAAGGGAGGGAGAATTACTAGAAACATGAAGACTTTCAATTCCTT
CATAGACAGGGCCGAGCTCATAGATATTCCCATGAAAAACGGTAGATTCACCTGGTCTGACTTGAGGGAGGAGCCTACAGCCACTAGGATTGACAGGTTTTTGATATCCC
ACTCTTGGCTTACTGCTTTTAAAGATATGTCCCTCCAAAAGCTGTCGAGACCCACTTCTGATCACTTTCCTCTGCTTCTCTCCTTGGGTAGTCAAGCTTGGGGTCCCTCC
CCTTTCCGTTTTGAAAATGTGTGGCTCGAACATCCGAACTTCATGAATATTTGA
mRNA sequence
ATGCAAAAATTATTTAGGAAAACGGACTGCAACGGAGGCTTCCTGTGGATTCAAAAAATAGCTAACAAAAGAGGTGATTTCTTAGAAGTGACGAAGGTTCAGAGCTCGGG
AGGAAAGCGAAATCTGGTCATCCCTGCAGACTTGGAATTCAATGGATGGAAGTCTTTTCGCAGCCCTGAGGGGGAACCCTCGGGGGGAGCCCACAGTAGATGCTTGAAGA
GCGCGAAATCATCGAACTTCCAGGACCTTTGGACGCAGGAAGCTGGTGAAAAGGATGTTCGGGGTATAAATTGGAATGAGACTTTGGTCATCACTAGAAGAGACTTCCAT
GAAGACTGGGGGAGGATTCTTGACACAATCCAAAACAAACTCAACAGACATACATTATTAACCCCTTTCTTCCTGACAAAGCCTTCCTTAAATGCCCCACGGGAGATATG
GCTCGTCTGCTTACAACAAACAGGGGATGGATTTAGGAACATTCCTCTGCATTTATGGAGCTTGGCCACATTTAAAGCGATTGAGGATTTGTTTGGGGGCTTCATTGATT
ATGCAAAAGCCAACTCGAATCTCATTGAATGTATGGAAGTGGCTATAAAGGTTCGAGGAAATTACTGTGGATTCATTCCCTCGGAAGTTCGTCTGATAGATAGGGACCAA
ACCTTCGTTGCCCATGTAGTGTCTTTTGAGAACCATAACCTTCTGATCGGAAAAGAGGTCGGGAAACATGGAGGATTCTCGCCAGAGGCAGCGAGGATTTTTTTGGTTCG
GAAATGGGCTACGGCCCAAACCCTGTGGACATTTGGCGTGTTCAAGATGGGATTTTTTGTCCAGTGGTTAACATTATGTACCCCTGTCCCACTTTGGGTGACTACAATCC
AAAAAGACCCTCTCCCCAACGAGAACCCCACCCTGAAACCCGGACTTCATTCGAAAAAGAAAAAGAAGAAAGGGAAAGCGGTGGCTTTTGAGAAAGGCTCCAACTCCCAA
CCTTCCATGAAAAGGGGGCAAATCTTCTCGACAAGTCTAGCTGAGAGACATGTAGAAAATGAGTTAGAAATCGGTTCAGAAGCTTCTTTCTTGAGCATCAGTAGTGTGGA
AGATTGTCCGACAGAAGATGCTGAAGAGCCCATTGTTGATAAGGAAGATCCTCCTTCTGAGTTCTACATGTGTTTTAGAGAAGCTGAGTCGCAGGATCTTATGGAGGGTG
AAGAGAGAGCCATGATTTTGGTTCAGCCTCAGGTAGAGTTAGGAGATACTCCCTCTTTAGCGATCAGTGACCAGCAGGAAGGAACCGGGTCAGAAGGGGAGGGGGATTTC
CCGTTCTTTAGGCAAATGGTGAAGAATCTAAAGAAATGGAACATGTGTATTACACCAATCTCTACCAAGGGAAGTGCGGCGGTTAAGAAAAAGTCGAGGGAGGTCATGAA
TCAGATTCGGGCTTGGGAGAAGGAGATGATTGATGAGAATATTAGCGTAGAGCATGGCCTCATGAAGACTAGATTGAACTCGATGAATAGGCGGCTTGTTAAATCTATTT
GGAATGCCAAGCGCATAGGGTGGGTGGCCATAGATGCTGTGGGCTCGGCTGGAGGCGTGCTGGTTTTGTGGAAAGAAGATGTCATCTTGGTTGTAGATTCAATTTTGGGT
CTCTGTTCGGATTTATGGTGCTTAGCCGGCGACTTCAATGTGGTCAGATGGATTTCGGAAAAATCAAAGGGAGGGAGAATTACTAGAAACATGAAGACTTTCAATTCCTT
CATAGACAGGGCCGAGCTCATAGATATTCCCATGAAAAACGGTAGATTCACCTGGTCTGACTTGAGGGAGGAGCCTACAGCCACTAGGATTGACAGGTTTTTGATATCCC
ACTCTTGGCTTACTGCTTTTAAAGATATGTCCCTCCAAAAGCTGTCGAGACCCACTTCTGATCACTTTCCTCTGCTTCTCTCCTTGGGTAGTCAAGCTTGGGGTCCCTCC
CCTTTCCGTTTTGAAAATGTGTGGCTCGAACATCCGAACTTCATGAATATTTGA
Protein sequence
MQKLFRKTDCNGGFLWIQKIANKRGDFLEVTKVQSSGGKRNLVIPADLEFNGWKSFRSPEGEPSGGAHSRCLKSAKSSNFQDLWTQEAGEKDVRGINWNETLVITRRDFH
EDWGRILDTIQNKLNRHTLLTPFFLTKPSLNAPREIWLVCLQQTGDGFRNIPLHLWSLATFKAIEDLFGGFIDYAKANSNLIECMEVAIKVRGNYCGFIPSEVRLIDRDQ
TFVAHVVSFENHNLLIGKEVGKHGGFSPEAARIFLVRKWATAQTLWTFGVFKMGFFVQWLTLCTPVPLWVTTIQKDPLPNENPTLKPGLHSKKKKKKGKAVAFEKGSNSQ
PSMKRGQIFSTSLAERHVENELEIGSEASFLSISSVEDCPTEDAEEPIVDKEDPPSEFYMCFREAESQDLMEGEERAMILVQPQVELGDTPSLAISDQQEGTGSEGEGDF
PFFRQMVKNLKKWNMCITPISTKGSAAVKKKSREVMNQIRAWEKEMIDENISVEHGLMKTRLNSMNRRLVKSIWNAKRIGWVAIDAVGSAGGVLVLWKEDVILVVDSILG
LCSDLWCLAGDFNVVRWISEKSKGGRITRNMKTFNSFIDRAELIDIPMKNGRFTWSDLREEPTATRIDRFLISHSWLTAFKDMSLQKLSRPTSDHFPLLLSLGSQAWGPS
PFRFENVWLEHPNFMNI
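
The protein sequence above corresponds to a codon-by-codon translation of the CDS. The sketch below illustrates this on the first and last few codons only, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The `translate` function is a hypothetical helper written for illustration; it handles only unambiguous A/C/G/T bases and stops at the first stop codon.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A/C/G/T.
    """
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 18 nt of the CDS shown in this record.
print(translate("ATGCAAAAATTATTTAGGAAAACGGACTGC"))
print(translate("AACTTCATGAATATTTGA"))
```

The two fragments translate to the start (`MQKLFRKTDC`) and end (`NFMNI`) of the listed protein sequence, with the final `TGA` read as a stop codon.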