; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg019228 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg019228
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold1:46673550..46681570
RNA-Seq ExpressionSpg019228
SyntenySpg019228
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR040256 - Uncharacterized protein At4g02000-like
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_010673168.1 PREDICTED: uncharacterized protein LOC104889608 [Beta vulgaris subsp. vulgaris]4.4e-5525.54Show/hide
Query:  IVEIEDDDLEETDREFQSAIACKILTYRTINAEVFSVMMPRIWGIEGSVKVEKAGPNIFLCKFKRIRDKIRISKGGPWSYDDAILIFEEPKGNCCVEALE
        ++  E+   E  +     ++  K+LT R  N E     + +IW I  S         +F+ +F   RDK ++  G PWS+D  +++F E +GN     + 
Subjt:  IVEIEDDDLEETDREFQSAIACKILTYRTINAEVFSVMMPRIWGIEGSVKVEKAGPNIFLCKFKRIRDKIRISKGGPWSYDDAILIFEEPKGNCCVEALE

Query:  FNYVSFWVHFHNLPRVCFCRKYAIALGDSIGNFESAESDENGKMTGETLRVKVRLKVNEPLKRGTNVKIGSKAERTWIPITYEKLPDFCYSCGKLGHVYQ
         ++  FW+  +NLP           +G  +G     E D +G +  ++ RVKV + V++PL+R   ++         + + YE+LP+FCY CG LGH+ +
Subjt:  FNYVSFWVHFHNLPRVCFCRKYAIALGDSIGNFESAESDENGKMTGETLRVKVRLKVNEPLKRGTNVKIGSKAERTWIPITYEKLPDFCYSCGKLGHVYQ

Query:  ECAIQGSTDNQIKPFGIELRETKGSKGIYKIWKHDNREYRRWKPEFRETRPRGRGRGRAGRFARGSFNKSINTEEEGGNSNTQTREKEGMGNRKGGEKFG
        +C                           ++   D  E R W    R +  RGR +         S  +++N                 + N +      
Subjt:  ECAIQGSTDNQIKPFGIELRETKGSKGIYKIWKHDNREYRRWKPEFRETRPRGRGRGRAGRFARGSFNKSINTEEEGGNSNTQTREKEGMGNRKGGEKFG

Query:  KKKEEEGSTENFAKEGAQVEEMEVLQSEDFSPEKRPPEPSRRGMDIEFSGINQRERTSPTASSLPD----KKGKAKEYPI------MELEQQSTKVLGEQ
        ++K           E   V E ++ Q E  SP +    P    + I    +  +   SP     P     KK    + P+      + ++  +   L  +
Subjt:  KKKEEEGSTENFAKEGAQVEEMEVLQSEDFSPEKRPPEPSRRGMDIEFSGINQRERTSPTASSLPD----KKGKAKEYPI------MELEQQSTKVLGEQ

Query:  ESSKEINHVNVNI--------IPEENLNPPTEAIQKKETVTAGSKKKSFS-------DHNGPKREM-----HRVLDMDLDTNKQKEQKIEDHMGCNLENE
        +S   I   +V++        I +  +N  +    + E + A + K           D  G    +       +LD  L           ++  C     
Subjt:  ESSKEINHVNVNI--------IPEENLNPPTEAIQKKETVTAGSKKKSFS-------DHNGPKREM-----HRVLDMDLDTNKQKEQKIEDHMGCNLENE

Query:  QNKRKWKRRARMVNLEDGGEKSKIGEKRKS----------SGDFNELLSGDEKTGGSLKNQKMIDNFREAVGKCRLRDAGYSGNKYTWRRGKKASMAIRE
         N  KW R   +    + G K K  +  +S           GDFNE+LS  E  GG + +++ + +FRE V +  LRD G+SG  YTW RGK  +  IRE
Subjt:  QNKRKWKRRARMVNLEDGGEKSKIGEKRKS----------SGDFNELLSGDEKTGGSLKNQKMIDNFREAVGKCRLRDAGYSGNKYTWRRGKKASMAIRE

Query:  RLDRFLVNTEMDLRFKSIIVQHLHFHNSDHRAILADLGKDQRSNKRKRKKVLKFEESWAKNLEAKSIIERLWQNSPGVDMAAFERKVLHCLTKLSKWNRE
        RLDRFL + +    F  + V+H+  + SDH  I+  L   +R  K+++KK  +F  +W      +S++   W +S G+    FE ++      L  W+++
Subjt:  RLDRFLVNTEMDLRFKSIIVQHLHFHNSDHRAILADLGKDQRSNKRKRKKVLKFEESWAKNLEAKSIIERLWQNSPGVDMAAFERKVLHCLTKLSKWNRE

Query:  RLGGSINQAVDKKLEEIKILEEDQNNYPSPNLLKAEKELE-LLNEEEMYWKIRSREDWLKWGDRNTKWFHKKASHRKNRNEITGPLNSEGCWEDNDDKMG
         L   + + +    EEIK L+         +L++   +L+ LL ++E YW +RSR   +K GD+NTK+FH KAS RK RN I G  +    W D+D+ + 
Subjt:  RLGGSINQAVDKKLEEIKILEEDQNNYPSPNLLKAEKELE-LLNEEEMYWKIRSREDWLKWGDRNTKWFHKKASHRKNRNEITGPLNSEGCWEDNDDKMG

Query:  EIASNYFNNLFNSSNPSEEDMERVLEGIIP
         +   Y+ NLF SS PS+E +  VL+ ++P
Subjt:  EIASNYFNNLFNSSNPSEEDMERVLEGIIP

XP_035540109.1 uncharacterized protein LOC118344190 [Juglans regia]6.9e-5626Show/hide
Query:  GAEEMQLLLEKLKL-EEGNRIVEIEDDDLEETDREFQSAIACKILTYRTINAEVFSVMMPRIWGIEGSVKVEKAGPNIFLCKFKRIRDKIRISKGGPWSY
        G + +++  E L+L EE + ++EI  DD EE   +   +I  KI   R+I  +V S  M +IW +       + G N+F+  F+   DK+R+  G PW +
Subjt:  GAEEMQLLLEKLKL-EEGNRIVEIEDDDLEETDREFQSAIACKILTYRTINAEVFSVMMPRIWGIEGSVKVEKAGPNIFLCKFKRIRDKIRISKGGPWSY

Query:  DDAILIFEEPKGNCCVEALEFNYVSFWVHFHNLPRVCFCRKYAIALGDSIGNFESAESDENGKMTGETLRVKVRLKVNEPLKRGTNVKIGSKAERTWIPI
        D+ +   +   G        F+   FWV  HNLP VC  ++    +G+S+G     +   +    G+ LRV+V + + + + RG  +K+    +  WI +
Subjt:  DDAILIFEEPKGNCCVEALEFNYVSFWVHFHNLPRVCFCRKYAIALGDSIGNFESAESDENGKMTGETLRVKVRLKVNEPLKRGTNVKIGSKAERTWIPI

Query:  TYEKLPDFCYSCGKLGHVYQEC--AIQGSTDNQ--IKPFGIELRETKGSKGIYKIWKHDNRE-YRRWKPEFRETRPRGRGRGRAGRFARGSFNKSINTEE
         YEKLP  C+ CG++ H Y+ C   ++G+   +   + FG+ LR   G +  +   K    E   RW  +  +      G GR G      +++ +    
Subjt:  TYEKLPDFCYSCGKLGHVYQEC--AIQGSTDNQ--IKPFGIELRETKGSKGIYKIWKHDNRE-YRRWKPEFRETRPRGRGRGRAGRFARGSFNKSINTEE

Query:  EGGNSN--------------------TQTREKEGMGNRKGGEKFGKKKEEEGSTENFAKEG-AQVEEMEVLQSEDFSPEKRPPEPSRRGMDIEFSGINQR
        EGG                        ++ E+  MG +   +     + E G      K G  ++E++   +SE    +        +       GI + 
Subjt:  EGGNSN--------------------TQTREKEGMGNRKGGEKFGKKKEEEGSTENFAKEG-AQVEEMEVLQSEDFSPEKRPPEPSRRGMDIEFSGINQR

Query:  ERTSPTASSLPDKKGKAKEYPIMELEQQSTKVLGEQESSKEINHVNVNIIPEENLNPPTEAIQKK-------ETVTAGSKKKSFSDHNG--------PKR
        E      SS+ +K+G A E  +  ++ ++ K    ++  KE    NV I      + P    +K+       ET    +K ++     G        P  
Subjt:  ERTSPTASSLPDKKGKAKEYPIMELEQQSTKVLGEQESSKEINHVNVNIIPEENLNPPTEAIQKK-------ETVTAGSKKKSFSDHNG--------PKR

Query:  EMHRVLDMDLDTNKQKEQKIED-HMGCNLENEQNKRKW------------KRRARMVNLEDGGEKSKIGEKRKSSGDFNELLSGDEKTGGSLKNQKMIDN
            +L M  D ++ +       H+   + NE+N  +W            KR      L +    ++IG      GDFNE++S DEK GG  + +  ++ 
Subjt:  EMHRVLDMDLDTNKQKEQKIED-HMGCNLENEQNKRKW------------KRRARMVNLEDGGEKSKIGEKRKSSGDFNELLSGDEKTGGSLKNQKMIDN

Query:  FREAVGKCRLRDAGYSGNKYTWRRGKKASMAIRERLDRFLVNTEMDLRFKSIIVQHLHFHNSDHRAILADLGKDQRSNKRKRKKVLKFEESWAKNLEAKS
        FR A+    LRD G+ G KYTW  G      I+ERLDR + N E    F  I V+ +    SDH  IL    K    N R+ K+  ++E  W K+ E   
Subjt:  FREAVGKCRLRDAGYSGNKYTWRRGKKASMAIRERLDRFLVNTEMDLRFKSIIVQHLHFHNSDHRAILADLGKDQRSNKRKRKKVLKFEESWAKNLEAKS

Query:  IIERLWQNSPGVDMAAFERKVLHCLTKLSKWNR---ERLGGSINQAVDKKLEEIKILEEDQNNYPSPNLLKAEKELE-LLNEEEMYWKIRSREDWLKWGD
        +IER W+   G      + K++ C   L  W++   E+   SI +  DK    +KIL+E +       +      L  LL++E ++WK R++  WLK GD
Subjt:  IIERLWQNSPGVDMAAFERKVLHCLTKLSKWNR---ERLGGSINQAVDKKLEEIKILEEDQNNYPSPNLLKAEKELE-LLNEEEMYWKIRSREDWLKWGD

Query:  RNTKWFHKKASHRKNRNEITGPLNSEGCWEDNDDKMGEIASNYFNNLFNSSNPSEEDMERVLEGIIPSGCETPMPLKENLKG--------RHVYEILDER
        RNTK+FH  A+ R+ +N I    N++G  +     + E    YF  LF +++PS  ++E  +         T + + E+++G          V   L + 
Subjt:  RNTKWFHKKASHRKNRNEITGPLNSEGCWEDNDDKMGEIASNYFNNLFNSSNPSEEDMERVLEGIIPSGCETPMPLKENLKG--------RHVYEILDER

Query:  GCWK
          WK
Subjt:  GCWK

XP_042958006.1 uncharacterized protein LOC122293492 [Carya illinoinensis]7.6e-5526.54Show/hide
Query:  EEMQLLLEKLKL-EEGNRIVEIEDDDLEETDREFQSAIACKILTYRTINAEVFSVMMPRIWGIEGSVKVEKAGPNIFLCKFKRIRDKIRISKGGPWSYDD
        E+++   + LKL +E   ++EI+++   +   + Q  +  K  + R I+ EV    + ++W I    +  +  PNIF      I +K ++  G PW +D+
Subjt:  EEMQLLLEKLKL-EEGNRIVEIEDDDLEETDREFQSAIACKILTYRTINAEVFSVMMPRIWGIEGSVKVEKAGPNIFLCKFKRIRDKIRISKGGPWSYDD

Query:  AILIFEEPKGNCCVEALEFNYVSFWVHFHNLPRVCFCRKYAIALGDSIGNFESAESDENGKMTGETLRVKVRLKVNEPLKRGTNVKIGSKAERTWIPITY
         +L+ +E  G   ++ + FN  SFWV FHNLP  C        +G ++G  E  +  E+G   G+ LRV++ + +N+PL RG  + +  K +  WIP +Y
Subjt:  AILIFEEPKGNCCVEALEFNYVSFWVHFHNLPRVCFCRKYAIALGDSIGNFESAESDENGKMTGETLRVKVRLKVNEPLKRGTNVKIGSKAERTWIPITY

Query:  EKLPDFCYSCGKLGHVYQECAIQGSTDNQIKPFGIELRETKGSKGIY--KIWKHDNREYRRWKPEFRETRPRGRGRGRAGRFARGSFNKSINTEEEGGNS
        EK+P  C+ CG + H   EC   GS       +G  LR T+  +  +   + K      + WK   +E R +G G       +RGS NK     +  G+ 
Subjt:  EKLPDFCYSCGKLGHVYQECAIQGSTDNQIKPFGIELRETKGSKGIY--KIWKHDNREYRRWKPEFRETRPRGRGRGRAGRFARGSFNKSINTEEEGGNS

Query:  NTQTREKEGMGNRKGGEKFGKKKEEEGSTENFAKEGAQVEEMEVLQSEDFSPEKRPPEPSRRGMDIEF-SGINQRERTSPTASSLPDKKGKAKEYPIMEL
               EGMGN       GK++E         KEG    ++++    +   EK          + +  +G+ + E          D +   KE      
Subjt:  NTQTREKEGMGNRKGGEKFGKKKEEEGSTENFAKEGAQVEEMEVLQSEDFSPEKRPPEPSRRGMDIEF-SGINQRERTSPTASSLPDKKGKAKEYPIMEL

Query:  EQQSTKVLGEQESSKEINHVNVNIIPEENLNPPTEAIQKKETVTAGSKKKSFSDHNGPKREMHRVLDMDLDTNKQKEQKIEDHMGCNLENEQNKRKW---
          +  K++ +  S  E N  N+  I E  L        K E++      +         +    +L  D   N +     + H+   +  E+   KW   
Subjt:  EQQSTKVLGEQESSKEINHVNVNIIPEENLNPPTEAIQKKETVTAGSKKKSFSDHNGPKREMHRVLDMDLDTNKQKEQKIEDHMGCNLENEQNKRKW---

Query:  ---------KRRARMVNLEDGGEKSKIGEKRKSSGDFNELLSGDEKTGGSLKNQKMIDNFREAVGKCRLRDAGYSGNKYTWRRGKKASMAIRERLDRFLV
                 KR+     L+    K K  E     GDFNE+L+ DEK GG  + +  ++ FRE + +  L+D G+ G K+T       S   +ERLDR + 
Subjt:  ---------KRRARMVNLEDGGEKSKIGEKRKSSGDFNELLSGDEKTGGSLKNQKMIDNFREAVGKCRLRDAGYSGNKYTWRRGKKASMAIRERLDRFLV

Query:  NTEMDLRFKSIIVQHLHFHNSDHRAILADLGK-DQRSNKRKRKKV--LKFEESWAKNLEAKSIIERLWQNSPGV--DMAAFERKVLHCLTKLSKWNRERL
        N++    +    V  L    SDH+ +L  L K D+R + + R+K    K+E SWA   E + ++ R W N+  +  ++              SK  ++R 
Subjt:  NTEMDLRFKSIIVQHLHFHNSDHRAILADLGK-DQRSNKRKRKKV--LKFEESWAKNLEAKSIIERLWQNSPGV--DMAAFERKVLHCLTKLSKWNRERL

Query:  GGSINQAVDKKLEEIKILEEDQNNYPSPNLLKAEKELE-LLNEEEMYWKIRSREDWLKWGDRNTKWFHKKASHRKNR---NEITGPLNSEGCWEDNDDKM
        G    + V++K + ++ L+ ++++     + K   EL  LL +E+++WK R++ +W K GDRNTK+FH  A+ R+ R   NE+    N+  C      ++
Subjt:  GGSINQAVDKKLEEIKILEEDQNNYPSPNLLKAEKELE-LLNEEEMYWKIRSREDWLKWGDRNTKWFHKKASHRKNR---NEITGPLNSEGCWEDNDDKM

Query:  GEIASNYFNNLFNSSNPSEEDMERVLEGI
         E    YF N+F S+ PS+ ++E  L G+
Subjt:  GEIASNYFNNLFNSSNPSEEDMERVLEGI

XP_042962692.1 uncharacterized protein LOC122296963 [Carya illinoinensis]1.2e-5526.39Show/hide
Query:  EEMQLLLEKLKL-EEGNRIVEIEDDDLEETDREFQSAIACKILTYRTINAEVFSVMMPRIWGIEGSVKVEKAGPNIFLCKFKRIRDKIRISKGGPWSYDD
        E+++   ++L+L EE   ++EI+D+   +   + Q ++  K+ + R I+ EV    + +IW I    +  +  PN F   F  I DK ++  G PW +D+
Subjt:  EEMQLLLEKLKL-EEGNRIVEIEDDDLEETDREFQSAIACKILTYRTINAEVFSVMMPRIWGIEGSVKVEKAGPNIFLCKFKRIRDKIRISKGGPWSYDD

Query:  AILIFEEPKGNCCVEALEFNYVSFWVHFHNLPRVCFCRKYAIALGDSIGNFESAESDENGKMTGETLRVKVRLKVNEPLKRGTNVKIGSKAERTWIPITY
         +++ +E  G   ++ + F   SFWV FHNLP  C        +G ++G  +  + D +G   G+ LRV++ + +N+PL RG  + +  K    WIP +Y
Subjt:  AILIFEEPKGNCCVEALEFNYVSFWVHFHNLPRVCFCRKYAIALGDSIGNFESAESDENGKMTGETLRVKVRLKVNEPLKRGTNVKIGSKAERTWIPITY

Query:  EKLPDFCYSCGKLGHVYQECAIQGSTDNQIKPFGIELRETKGSKGIYKIWKHDNREYRRWKPEFRETRPRGRGRGRAGRFARGSFNKSINTEEEGGNSNT
        EK+P  C+SCG + H   EC   G        +G  LR  +  +  + +    N E   W  E +E  P   G G               + +EGG S  
Subjt:  EKLPDFCYSCGKLGHVYQECAIQGSTDNQIKPFGIELRETKGSKGIYKIWKHDNREYRRWKPEFRETRPRGRGRGRAGRFARGSFNKSINTEEEGGNSNT

Query:  QTREKEGMGNRKGGEKFGKKKEEEGSTENFAKEGAQVEEMEVLQSEDFSPEKRPPEPSRRGMDIEFSGINQRERTSPTASSLPDKKGKAKEYPIMELEQQ
        +    EG+   + G++ G K  E+       KEG    E+                    G      G+ +R+ ++   +    K+        + ++ +
Subjt:  QTREKEGMGNRKGGEKFGKKKEEEGSTENFAKEGAQVEEMEVLQSEDFSPEKRPPEPSRRGMDIEFSGINQRERTSPTASSLPDKKGKAKEYPIMELEQQ

Query:  ST-KVLGEQESSKEINHVNVNIIPEENLNPPTEAIQKKETVTAGSKKKSFSDHNGPKREMHRVLDMDLDTNKQKEQKIEDHMGCNLENEQNKRKW-----
        +  K +G +  SKE+                                       G + E  R   + LD     ++ I   +      E+   KW     
Subjt:  ST-KVLGEQESSKEINHVNVNIIPEENLNPPTEAIQKKETVTAGSKKKSFSDHNGPKREMHRVLDMDLDTNKQKEQKIEDHMGCNLENEQNKRKW-----

Query:  -------KRRARMVNLEDGGEKSKIGEKRKSSGDFNELLSGDEKTGGSLKNQKMIDNFREAVGKCRLRDAGYSGNKYTWRRGKKASMAIRERLDRFLVNT
               KR+   V L+    K K GE     GDFNE+L+ DEK GG ++    ++ FRE + +  L D G+ G+KYTW      S   ++RLDR + N 
Subjt:  -------KRRARMVNLEDGGEKSKIGEKRKSSGDFNELLSGDEKTGGSLKNQKMIDNFREAVGKCRLRDAGYSGNKYTWRRGKKASMAIRERLDRFLVNT

Query:  EMDLRFKSIIVQHLHFHNSDHRAILADLGK-DQR--SNKRKRKKVLKFEESWAKNLEAKSIIERLW-----QNSPGVDMAAFERKVLHCLTKLSKWNRER
        +    +    V+ L    SDH+ +L  L + DQR  +  R++K+  K+E  WA   E + ++ + W      N   + +    RK L      SK  R+R
Subjt:  EMDLRFKSIIVQHLHFHNSDHRAILADLGK-DQR--SNKRKRKKVLKFEESWAKNLEAKSIIERLW-----QNSPGVDMAAFERKVLHCLTKLSKWNRER

Query:  LGGSINQAVDKKLEEIKILEEDQNNYPSPNLLKAEKELELLNE-EEMYWKIRSREDWLKWGDRNTKWFHKKASHRKNRN---EITGPLNSEGCWEDNDDK
         G    + ++ K + +K L+ +++      + K   +L LL E E+++WK R++ +W K+GDRNTK+FH  A+ RK RN   EI  P N+  C      +
Subjt:  LGGSINQAVDKKLEEIKILEEDQNNYPSPNLLKAEKELELLNE-EEMYWKIRSREDWLKWGDRNTKWFHKKASHRKNRN---EITGPLNSEGCWEDNDDK

Query:  MGEIASNYFNNLFNSSNPSEEDMERVLEGI
        + E   NYF  +F S  PS+ ++E  L G+
Subjt:  MGEIASNYFNNLFNSSNPSEEDMERVLEGI

XP_042979975.1 uncharacterized protein LOC122310162 [Carya illinoinensis]2.1e-5726.44Show/hide
Query:  ENETGAEEMQLLLEKLKL-EEGNRIVEIEDDDLEETDREFQSAIACKILTYRTINAEVFSVMMPRIWGIEGSVKVEKAGPNIFLCKFKRIRDKIRISKGG
        E +   E+++   ++L+L EE   ++EI+D+   +   + Q ++  K+ + R I+ EV    + +IW I    +  +  PN F   F  I DK ++  G 
Subjt:  ENETGAEEMQLLLEKLKL-EEGNRIVEIEDDDLEETDREFQSAIACKILTYRTINAEVFSVMMPRIWGIEGSVKVEKAGPNIFLCKFKRIRDKIRISKGG

Query:  PWSYDDAILIFEEPKGNCCVEALEFNYVSFWVHFHNLPRVCFCRKYAIALGDSIGNFESAESDENGKMTGETLRVKVRLKVNEPLKRGTNVKIGSKAERT
        PW +D+ +++ +E  G   ++ + F   SFWV FHNLP  C        +G ++G  E  +   +G   G+ LRV++ + +N+PL RG  + +  K    
Subjt:  PWSYDDAILIFEEPKGNCCVEALEFNYVSFWVHFHNLPRVCFCRKYAIALGDSIGNFESAESDENGKMTGETLRVKVRLKVNEPLKRGTNVKIGSKAERT

Query:  WIPITYEKLPDFCYSCGKLGHVYQECAIQGSTDNQIKPFGIELRETKGSKGIYKIWKHDNREYRRWKPEFRETRPRGRGRGRAGRFARGSFNKSINTEEE
        WIP +YEK+P  C+SCG + H   EC   G        +G  LR  +  +  + +    N E   W  E +E  P   G G               + +E
Subjt:  WIPITYEKLPDFCYSCGKLGHVYQECAIQGSTDNQIKPFGIELRETKGSKGIYKIWKHDNREYRRWKPEFRETRPRGRGRGRAGRFARGSFNKSINTEEE

Query:  GGNSNTQTREKEGMGNRKGGEKFGKKKEEEGSTENFAKEGAQVEEMEVLQSEDFSPEKRPPEPSRRGMDIEFSGINQRERTSPTASSLPDKKGKAKEYPI
        GG S  +    EG+   + G++ G K  E+       KEG    E+                    G      G+ +R+ ++   +    K+        
Subjt:  GGNSNTQTREKEGMGNRKGGEKFGKKKEEEGSTENFAKEGAQVEEMEVLQSEDFSPEKRPPEPSRRGMDIEFSGINQRERTSPTASSLPDKKGKAKEYPI

Query:  MELEQQST-KVLGEQESSKEINHVNVNIIPEENLNPPTEAIQKKETVTAGSKKKSFSDHNGPKREMHRVLDMDLDTNKQKEQKIEDHMGCNLENEQNKRK
        + ++ ++  K +G +  SKE+                                       G + E  R   + LD     ++ I   +      E+   K
Subjt:  MELEQQST-KVLGEQESSKEINHVNVNIIPEENLNPPTEAIQKKETVTAGSKKKSFSDHNGPKREMHRVLDMDLDTNKQKEQKIEDHMGCNLENEQNKRK

Query:  W------------KRRARMVNLEDGGEKSKIGEKRKSSGDFNELLSGDEKTGGSLKNQKMIDNFREAVGKCRLRDAGYSGNKYTWRRGKKASMAIRERLD
        W            KR+   V L+    K K GE     GDFNE+L+ DEK GG ++    ++ FRE + +  L D G+ G+KYTW      S   +ERLD
Subjt:  W------------KRRARMVNLEDGGEKSKIGEKRKSSGDFNELLSGDEKTGGSLKNQKMIDNFREAVGKCRLRDAGYSGNKYTWRRGKKASMAIRERLD

Query:  RFLVNTEMDLRFKSIIVQHLHFHNSDHRAILADLGK-DQR--SNKRKRKKVLKFEESWAKNLEAKSIIERLW-----QNSPGVDMAAFERKVLHCLTKLS
        R + N +    +    V+ L    SDH+ +L  L + DQR  +  R++K+  K+E  WA   E + ++ + W      N   + +    RK L      S
Subjt:  RFLVNTEMDLRFKSIIVQHLHFHNSDHRAILADLGK-DQR--SNKRKRKKVLKFEESWAKNLEAKSIIERLW-----QNSPGVDMAAFERKVLHCLTKLS

Query:  KWNRERLGGSINQAVDKKLEEIKILEEDQNNYPSPNLLKAEKELELLNE-EEMYWKIRSREDWLKWGDRNTKWFHKKASHRKNRN---EITGPLNSEGCW
        K  R+R G    + +++K + +K L+ +++      + K   +L LL E E+++WK R++ +W K GDRNTK+FH  A+ RK RN   EI  P N+  C 
Subjt:  KWNRERLGGSINQAVDKKLEEIKILEEDQNNYPSPNLLKAEKELELLNE-EEMYWKIRSREDWLKWGDRNTKWFHKKASHRKNRN---EITGPLNSEGCW

Query:  EDNDDKMGEIASNYFNNLFNSSNPSEEDMERVLEGI
             ++ E   NYF  +F S  PS+ ++E  L G+
Subjt:  EDNDDKMGEIASNYFNNLFNSSNPSEEDMERVLEGI

TrEMBL top hitse value%identityAlignment
A0A2N9FN47 Uncharacterized protein7.9e-5826.22Show/hide
Query:  AEEMQLLLEKLKLEEGNRI-VEIEDDDLEETDREFQSAIACKILTYRTINAEVFSVMMPRIWGIEGSVKVEKAGPNIFLCKFKRIRDKIRISKGGPWSYD
        AEE+     K++L E  R  +++    ++++  E + ++  ++LT+R  N E F   +  +W  +G + V     N+FL  F R  D  RI    PW++D
Subjt:  AEEMQLLLEKLKLEEGNRI-VEIEDDDLEETDREFQSAIACKILTYRTINAEVFSVMMPRIWGIEGSVKVEKAGPNIFLCKFKRIRDKIRISKGGPWSYD

Query:  DAILIFEEPKGNCCVEALEFNYVSFWVHFHNLPRVCFCRKYAIALGDSIGNFESAESDENGKMTGETLRVKVRLKVNEPLKRGTNVKIGSKAERTWIPIT
          ++     + N     ++F +  FW+  +NLP +   ++    +G++IG     +  ENG   G  LR++V + V +PL RG  +    + +  W+   
Subjt:  DAILIFEEPKGNCCVEALEFNYVSFWVHFHNLPRVCFCRKYAIALGDSIGNFESAESDENGKMTGETLRVKVRLKVNEPLKRGTNVKIGSKAERTWIPIT

Query:  YEKLPDFCYSCGKLGHVYQECAIQG--STDNQI---KPFGIELRETKGSKGIYKIWKHDNREYRR--WKPEFRETRPRGRGRGR-AGRFARGSF------
        YE LP FCY CG++GH   EC ++G  S  +Q+     FG  LR      G        +R YR      +  E  P   G G   G    G        
Subjt:  YEKLPDFCYSCGKLGHVYQECAIQG--STDNQI---KPFGIELRETKGSKGIYKIWKHDNREYRR--WKPEFRETRPRGRGRGR-AGRFARGSF------

Query:  NKSINTEEEGG----------NSNTQTREKEGM----------GNRKGGEKF--GKKKEEEGSTENFAKEGAQVEEMEVLQSE--DFSPEKRPPEPSRRG
        N    TEEE              + +  E  G+           N    E F    +KEE G       + AQ+ E+ + + E  +F  E+   + S + 
Subjt:  NKSINTEEEGG----------NSNTQTREKEGM----------GNRKGGEKF--GKKKEEEGSTENFAKEGAQVEEMEVLQSE--DFSPEKRPPEPSRRG

Query:  M--DIEFSGINQRERTSPTASSLPDKKGKAKEYPIMELEQQSTKVLGEQESSKEINHVNVNII---PEENLNPPTEAIQKKETVTAGSK----KKSFSDH
            I+  G+ Q+  T    +S     G+ K   I+ + Q  T +    +S+  +N     ++   P+ ++ P      KK   T   +    +   +D 
Subjt:  M--DIEFSGINQRERTSPTASSLPDKKGKAKEYPIMELEQQSTKVLGEQESSKEINHVNVNII---PEENLNPPTEAIQKKETVTAGSK----KKSFSDH

Query:  NGPKR----EMHRVLDMDLDTNKQKEQKIEDHMGCNLENEQNKRKWKRRARMVNLE----DGGEKSKIGEKRKSS------GDFNELLSGDEKTGGSLKN
        +  KR     MH  L+ D +  K+ +   E H    LE        + R  +  LE      G     G  R+S       GDFNE+++ +EK G   ++
Subjt:  NGPKR----EMHRVLDMDLDTNKQKEQKIEDHMGCNLENEQNKRKWKRRARMVNLE----DGGEKSKIGEKRKSS------GDFNELLSGDEKTGGSLKN

Query:  QKMIDNFREAVGKCRLRDAGYSGNKYTWRRGKKASMAIRERLDRFLVNTEMDLRFKSIIVQHLHFHNSDHRAILADLGKDQRSNKRKRKKVLKFEESWAK
         + +  FREA+  C L D G++G ++TW   ++    +R RLDR + + E    F   +V+H+   +SDH  +L +L       ++K+ ++ +F+ +W  
Subjt:  QKMIDNFREAVGKCRLRDAGYSGNKYTWRRGKKASMAIRERLDRFLVNTEMDLRFKSIIVQHLHFHNSDHRAILADLGKDQRSNKRKRKKVLKFEESWAK

Query:  NLEAKSIIERLWQNS-PGVDMAAFERKVLHCLTKLSKWNRERLGGSINQAVDKKLEEIKILEEDQNNYPSPNLLKAEKEL-ELLNEEEMYWKIRSREDWL
          + + +I   W +S  G  M    +++ HC  KL +W++ ++  +      KK    ++  +   NY +  +    KEL  L+ +EE++WK RSR  WL
Subjt:  NLEAKSIIERLWQNS-PGVDMAAFERKVLHCLTKLSKWNRERLGGSINQAVDKKLEEIKILEEDQNNYPSPNLLKAEKEL-ELLNEEEMYWKIRSREDWL

Query:  KWGDRNTKWFHKKASHRKNRNEITGPLNSEGCWEDNDDKMGEIASNYFNNLFNSSNPS
          GD NT++FH+ AS RK  N + G  ++   W+ + D +  I   YF+NLF+SSNP+
Subjt:  KWGDRNTKWFHKKASHRKNRNEITGPLNSEGCWEDNDDKMGEIASNYFNNLFNSSNPS

A0A2N9GWG4 Uncharacterized protein4.2e-5926.34Show/hide
Query:  AEEMQLLLEKLKLEEGNRI-VEIEDDDLEETDREFQSAIACKILTYRTINAEVFSVMMPRIWGIEGSVKVEKAGPNIFLCKFKRIRDKIRISKGGPWSYD
        AEE+     K++L E  R  +++    ++++  E + ++  ++LT+R  N E F   +  +W  +G + V     N+FL  F R  D  RI    PW++D
Subjt:  AEEMQLLLEKLKLEEGNRI-VEIEDDDLEETDREFQSAIACKILTYRTINAEVFSVMMPRIWGIEGSVKVEKAGPNIFLCKFKRIRDKIRISKGGPWSYD

Query:  DAILIFEEPKGNCCVEALEFNYVSFWVHFHNLPRVCFCRKYAIALGDSIGNFESAESDENGKMTGETLRVKVRLKVNEPLKRGTNVKIGSKAERTWIPIT
          ++     + N     ++F +  FW+  +NLP +   ++    +G++IG     +  ENG   G  LR++V + V +PL RG  +    + +  W+   
Subjt:  DAILIFEEPKGNCCVEALEFNYVSFWVHFHNLPRVCFCRKYAIALGDSIGNFESAESDENGKMTGETLRVKVRLKVNEPLKRGTNVKIGSKAERTWIPIT

Query:  YEKLPDFCYSCGKLGHVYQECAIQG--STDNQI---KPFGIELRETKGSKGIYKIWKHDNREYRR--WKPEFRETRPRGRGRGR-AGRFARGSF------
        YE LP FCY CG++GH   EC ++G  S  +Q+     FG  LR      G        +R YR      +  E  P   G G   G    G        
Subjt:  YEKLPDFCYSCGKLGHVYQECAIQG--STDNQI---KPFGIELRETKGSKGIYKIWKHDNREYRR--WKPEFRETRPRGRGRGR-AGRFARGSF------

Query:  NKSINTEEEGG----------NSNTQTREKEGM----------GNRKGGEKF--GKKKEEEGSTENFAKEGAQVEEMEVLQSE--DFSPEKRPPEPSRRG
        N    TEEE              + +  E  G+           N    E F    +KEE G       + AQ+ E+ + + E  +F  E+   + S + 
Subjt:  NKSINTEEEGG----------NSNTQTREKEGM----------GNRKGGEKF--GKKKEEEGSTENFAKEGAQVEEMEVLQSE--DFSPEKRPPEPSRRG

Query:  M--DIEFSGINQRERTSPTASSLPDKKGKAKEYPIMELEQQSTKVLGEQESSKEINHVNVNII---PEENLNPPTEAIQKKETVTAGSK----KKSFSDH
            I+  G+ Q+  T P  +S     G+ K   I+ + Q  T +    +S+  +N     ++   P+ ++ P      KK   T   +    +   +D 
Subjt:  M--DIEFSGINQRERTSPTASSLPDKKGKAKEYPIMELEQQSTKVLGEQESSKEINHVNVNII---PEENLNPPTEAIQKKETVTAGSK----KKSFSDH

Query:  NGPKR----EMHRVLDMDLDTNKQKEQKIEDHMGCNLENEQNKRKWKRRARMVNLE----DGGEKSKIGEKRKSS------GDFNELLSGDEKTGGSLKN
        +  KR     MH  L+ D +  K+ +   E H    LE        + R  +  LE      G     G  R+S       GDFNE+++ +EK G   ++
Subjt:  NGPKR----EMHRVLDMDLDTNKQKEQKIEDHMGCNLENEQNKRKWKRRARMVNLE----DGGEKSKIGEKRKSS------GDFNELLSGDEKTGGSLKN

Query:  QKMIDNFREAVGKCRLRDAGYSGNKYTWRRGKKASMAIRERLDRFLVNTEMDLRFKSIIVQHLHFHNSDHRAILADLGKDQRSNKRKRKKVLKFEESWAK
         + +  FREA+  C L D G++G ++TW   ++    +R RLDR + + E    F   +V+H+   +SDH  +L +L       ++K+ ++ +F+ +W  
Subjt:  QKMIDNFREAVGKCRLRDAGYSGNKYTWRRGKKASMAIRERLDRFLVNTEMDLRFKSIIVQHLHFHNSDHRAILADLGKDQRSNKRKRKKVLKFEESWAK

Query:  NLEAKSIIERLWQNS-PGVDMAAFERKVLHCLTKLSKWNRERLGGSINQAVDKKLEEIKILEEDQNNYPSPNLLKAEKEL-ELLNEEEMYWKIRSREDWL
          + + +I   W +S  G  M    +++ HC  KL +W++ ++  +      KK    ++  +   NY +  +    KEL  L+ +EE++WK RSR  WL
Subjt:  NLEAKSIIERLWQNS-PGVDMAAFERKVLHCLTKLSKWNRERLGGSINQAVDKKLEEIKILEEDQNNYPSPNLLKAEKEL-ELLNEEEMYWKIRSREDWL

Query:  KWGDRNTKWFHKKASHRKNRNEITGPLNSEGCWEDNDDKMGEIASNYFNNLFNSSNPS
          GD NT++FH+ AS RK  N + G  ++   W+ + D +  I   YF+NLF+SSNP+
Subjt:  KWGDRNTKWFHKKASHRKNRNEITGPLNSEGCWEDNDDKMGEIASNYFNNLFNSSNPS

A0A2N9I921 Reverse transcriptase domain-containing protein5.1e-5727.39Show/hide
Query:  DLEETDREFQSAIACKILTYRTINAEVFSVMMPRIWGIEGSVKVEKAGPNIFLCKFKRIRDKIRISKGGPWSYDDAILIFEEPKGNCCVEALEFNYVSFW
        DLE+ + E    +A K +T R +N E  +     +W  +    V+  G N  L  F    D  R+    PW+YD  +++F+  + +  VE + F+ V  W
Subjt:  DLEETDREFQSAIACKILTYRTINAEVFSVMMPRIWGIEGSVKVEKAGPNIFLCKFKRIRDKIRISKGGPWSYDDAILIFEEPKGNCCVEALEFNYVSFW

Query:  VHFHNLPRVCFCRKYAIALGDSIGNFESAESDENGKMTGETLRVKVRLKVNEPLKRGTNVKIGSKAERTWIPITYEKLPDFCYSCGKLGHVYQECA----
        V  H LP  C  R+ A+ +G  IG      S E  +      R+KVRL + +PL RG  VK+G K    WI   YE+LP+FCY CG L H  ++C+    
Subjt:  VHFHNLPRVCFCRKYAIALGDSIGNFESAESDENGKMTGETLRVKVRLKVNEPLKRGTNVKIGSKAERTWIPITYEKLPDFCYSCGKLGHVYQECA----

Query:  --IQGSTDNQIKPFGIELR---ETKGSKGIYKIWKHDNREYRRWKPEFRETRPRGRGRGRAGRFARGSFNKSINTEEEGGNSNTQTREKEGMG------N
            GS+D     FG  LR   E    K    +    N   R  +P+     P           AR S  K   TEE    S+ +T E +          
Subjt:  --IQGSTDNQIKPFGIELR---ETKGSKGIYKIWKHDNREYRRWKPEFRETRPRGRGRGRAGRFARGSFNKSINTEEEGGNSNTQTREKEGMG------N

Query:  RKGGEKFGKKKEEEGSTENFAKEGAQVEEMEVLQSEDFSPEK----------RPPEPS---RRGMDIEFSGINQRE--RTSPTAS---------------
            + F +   E     NF    A ++ + V+  +  +             RP +P+        IE + I  R+   TSP                  
Subjt:  RKGGEKFGKKKEEEGSTENFAKEGAQVEEMEVLQSEDFSPEK----------RPPEPS---RRGMDIEFSGINQRE--RTSPTAS---------------

Query:  --------SLPDKKG------------KAKEYPIMELEQQSTKVLGEQESS-----------KEINHVNVNIIPEENLNPPTE------AIQKKETVTAG
                +LP K+             K  E   +E  Q+  K++ EQ+ S            ++  + V +     L  P        A+  K+T+   
Subjt:  --------SLPDKKG------------KAKEYPIMELEQQSTKVLGEQESS-----------KEINHVNVNIIPEENLNPPTE------AIQKKETVTAG

Query:  SKKKSFSDHNGPKREMHRVLDMDLDTNKQKEQKIEDHMGCNLENEQNKRKWKRRARMVNLEDGGEKSKIGEKRKSSGDFNELLSGDEKTGGSLKNQKMID
            S+S            +D  +D       +     G   E+ + +  W     +          +        GDFNE++  +EK G   K +  + 
Subjt:  SKKKSFSDHNGPKREMHRVLDMDLDTNKQKEQKIEDHMGCNLENEQNKRKWKRRARMVNLEDGGEKSKIGEKRKSSGDFNELLSGDEKTGGSLKNQKMID

Query:  NFREAVGKCRLRDAGYSGNKYTWRRGKKASMAIRERLDRFLVNTEMDLRFKSIIVQHLHFHNSDHRAILADLGKDQRSNKRKRKKVLKFEESWAKNLEAK
        +FREA+  C   D GY G  +TW   + +   + ERLDR + +T    RF    V HL +  SDH+ +         +  R   K  +FEE W  +    
Subjt:  NFREAVGKCRLRDAGYSGNKYTWRRGKKASMAIRERLDRFLVNTEMDLRFKSIIVQHLHFHNSDHRAILADLGKDQRSNKRKRKKVLKFEESWAKNLEAK

Query:  SIIERLWQN-SPGVDMAAFERKVLHCLTKLSKWNRERLGGSINQAVDKKLEEIKILEEDQNNYPSPNL---LKAEKELELLNEEEMYWKIRSREDWLKWG
          I   WQ+ S G  M     K+ HC  +L  W++    GS+ + + +K EE+K+ EE      SP+L   L+AE  + LL++EE  W+ RSR  WLK G
Subjt:  SIIERLWQN-SPGVDMAAFERKVLHCLTKLSKWNRERLGGSINQAVDKKLEEIKILEEDQNNYPSPNL---LKAEKELELLNEEEMYWKIRSREDWLKWG

Query:  DRNTKWFHKKASHRKNRNEITGPLNSEGCWEDNDDKMGEIASNYFNNLFNSSNPSEED
        DRNT +FH +A+HR+ RN I G  +S+G W+ + D++  I  +YF N+F SSNPS  D
Subjt:  DRNTKWFHKKASHRKNRNEITGPLNSEGCWEDNDDKMGEIASNYFNNLFNSSNPSEED

A0A2N9J7E4 Uncharacterized protein2.2e-5525.81Show/hide
Query:  DDDLEETDREFQSAIACKILTYRTINAEVFSVMMPRIWGIEGSVKVEKAGPNIFLCKFKRIRDKIRISKGGPWSYDDAILIFEEPKGNCCVEALEFNYVS
        D DL  T ++ ++ +A K LT R +N +  +     +W    S  V+  G N     F+   D  R+ +  PW+YD  +++F+  +G+  ++   F++ S
Subjt:  DDDLEETDREFQSAIACKILTYRTINAEVFSVMMPRIWGIEGSVKVEKAGPNIFLCKFKRIRDKIRISKGGPWSYDDAILIFEEPKGNCCVEALEFNYVS

Query:  FWVHFHNLPRVCFCRKYAIALGDSIGNFESAESDENGKMTGETLRVKVRLKVNEPLKRGTNVKIGSKAERTWIPITYEKLPDFCYSCGKLGHVYQECAIQ
        FWV  HNLP      + A ++G SIG  E   + E+ +     +RV++RL++N PL RG  VK   +  + W+   YE+LP+FCY CG L H  ++C + 
Subjt:  FWVHFHNLPRVCFCRKYAIALGDSIGNFESAESDENGKMTGETLRVKVRLKVNEPLKRGTNVKIGSKAERTWIPITYEKLPDFCYSCGKLGHVYQECAIQ

Query:  GSTDNQIKPFGIELRETKGSKGIYKIWKHDNREYRRWKPEFRETRPRGRGRGRAGRFARGSFNKSINTEEEGGNSNTQTREKEGMGNRKGGEKFGKKKEE
                  GI+ R+T          K    ++  W     +  P        G   + S +KS   +    + +  T E          E+  + + E
Subjt:  GSTDNQIKPFGIELRETKGSKGIYKIWKHDNREYRRWKPEFRETRPRGRGRGRAGRFARGSFNKSINTEEEGGNSNTQTREKEGMGNRKGGEKFGKKKEE

Query:  EGSTENFAKEGAQVEEMEVLQSEDFSPEKRPPEPSRRGMDIEFSGINQRERTSPTASSLPDKKGKAKEYPIMELEQQSTKVLGEQESSKEINHVNVNIIP
        +G T +  +E  +  EME+ Q+  F    R  + +    + +   I+      P   + PD+  +       +LE    K  G +E S   N  + +  P
Subjt:  EGSTENFAKEGAQVEEMEVLQSEDFSPEKRPPEPSRRGMDIEFSGINQRERTSPTASSLPDKKGKAKEYPIMELEQQSTKVLGEQESSKEINHVNVNIIP

Query:  EENL-NPPTEAIQKKETVTAGSK----KKSFSDHNGPKREMHRVLDMDLDTNKQKEQ---------KIEDHMGCNLENEQNKRKWKRRARMVNLED----
         +N+ N P    + K + T   K     K   D     + + R   +  D   Q++          K   ++  N     N+   +  A +V ++D    
Subjt:  EENL-NPPTEAIQKKETVTAGSK----KKSFSDHNGPKREMHRVLDMDLDTNKQKEQ---------KIEDHMGCNLENEQNKRKWKRRARMVNLED----

Query:  -----------------GGEKSKIGEKRKS---------------SGDFNELLSGDEKTGGSLKNQKMIDNFREAVGKCRLRDAGYSGNKYTWRRGKKAS
                         G  ++ + E   +                GDFNE++   EK+G   +++  +  FR  + +C   D G+ G  +TW   ++ +
Subjt:  -----------------GGEKSKIGEKRKS---------------SGDFNELLSGDEKTGGSLKNQKMIDNFREAVGKCRLRDAGYSGNKYTWRRGKKAS

Query:  MAIRERLDRFLVNTEMDLRFKSIIVQHLHFHNSDHRAILADLGKDQRSNKRKRKKVLKFEESWAKNLEAKSIIERLW-QNSPGVDMAAFERKVLHCLTKL
             RLDRF+   +  LRF S  V HL    SDH+ I   L        R R+K+ +FE+ W  + + + ++ + W   + G  +A  + K+  C  +L
Subjt:  MAIRERLDRFLVNTEMDLRFKSIIVQHLHFHNSDHRAILADLGKDQRSNKRKRKKVLKFEESWAKNLEAKSIIERLW-QNSPGVDMAAFERKVLHCLTKL

Query:  SKWNRERLGGSINQAVDKKLEEIKILEEDQN-NYPSPNLLKAEKEL-ELLNEEEMYWKIRSREDWLKWGDRNTKWFHKKASHRKNRNEITGPLNSEGCWE
        ++W+R +  G+I + + +K E ++  E D    +    ++   KE+ +LL +EE  WK RSR+ WLK GDRNTK+FH +ASHR+ RN I   +  +G   
Subjt:  SKWNRERLGGSINQAVDKKLEEIKILEEDQN-NYPSPNLLKAEKEL-ELLNEEEMYWKIRSREDWLKWGDRNTKWFHKKASHRKNRNEITGPLNSEGCWE

Query:  DNDDKMGEIASNYFNNLFNSSNPSEEDMERVLEGIIP
         +   +G   ++Y+  LF ++NP  ED+E VL+GI P
Subjt:  DNDDKMGEIASNYFNNLFNSSNPSEEDMERVLEGIIP

A0A6P9DXY5 uncharacterized protein LOC1183441903.3e-5626Show/hide
Query:  GAEEMQLLLEKLKL-EEGNRIVEIEDDDLEETDREFQSAIACKILTYRTINAEVFSVMMPRIWGIEGSVKVEKAGPNIFLCKFKRIRDKIRISKGGPWSY
        G + +++  E L+L EE + ++EI  DD EE   +   +I  KI   R+I  +V S  M +IW +       + G N+F+  F+   DK+R+  G PW +
Subjt:  GAEEMQLLLEKLKL-EEGNRIVEIEDDDLEETDREFQSAIACKILTYRTINAEVFSVMMPRIWGIEGSVKVEKAGPNIFLCKFKRIRDKIRISKGGPWSY

Query:  DDAILIFEEPKGNCCVEALEFNYVSFWVHFHNLPRVCFCRKYAIALGDSIGNFESAESDENGKMTGETLRVKVRLKVNEPLKRGTNVKIGSKAERTWIPI
        D+ +   +   G        F+   FWV  HNLP VC  ++    +G+S+G     +   +    G+ LRV+V + + + + RG  +K+    +  WI +
Subjt:  DDAILIFEEPKGNCCVEALEFNYVSFWVHFHNLPRVCFCRKYAIALGDSIGNFESAESDENGKMTGETLRVKVRLKVNEPLKRGTNVKIGSKAERTWIPI

Query:  TYEKLPDFCYSCGKLGHVYQEC--AIQGSTDNQ--IKPFGIELRETKGSKGIYKIWKHDNRE-YRRWKPEFRETRPRGRGRGRAGRFARGSFNKSINTEE
         YEKLP  C+ CG++ H Y+ C   ++G+   +   + FG+ LR   G +  +   K    E   RW  +  +      G GR G      +++ +    
Subjt:  TYEKLPDFCYSCGKLGHVYQEC--AIQGSTDNQ--IKPFGIELRETKGSKGIYKIWKHDNRE-YRRWKPEFRETRPRGRGRGRAGRFARGSFNKSINTEE

Query:  EGGNSN--------------------TQTREKEGMGNRKGGEKFGKKKEEEGSTENFAKEG-AQVEEMEVLQSEDFSPEKRPPEPSRRGMDIEFSGINQR
        EGG                        ++ E+  MG +   +     + E G      K G  ++E++   +SE    +        +       GI + 
Subjt:  EGGNSN--------------------TQTREKEGMGNRKGGEKFGKKKEEEGSTENFAKEG-AQVEEMEVLQSEDFSPEKRPPEPSRRGMDIEFSGINQR

Query:  ERTSPTASSLPDKKGKAKEYPIMELEQQSTKVLGEQESSKEINHVNVNIIPEENLNPPTEAIQKK-------ETVTAGSKKKSFSDHNG--------PKR
        E      SS+ +K+G A E  +  ++ ++ K    ++  KE    NV I      + P    +K+       ET    +K ++     G        P  
Subjt:  ERTSPTASSLPDKKGKAKEYPIMELEQQSTKVLGEQESSKEINHVNVNIIPEENLNPPTEAIQKK-------ETVTAGSKKKSFSDHNG--------PKR

Query:  EMHRVLDMDLDTNKQKEQKIED-HMGCNLENEQNKRKW------------KRRARMVNLEDGGEKSKIGEKRKSSGDFNELLSGDEKTGGSLKNQKMIDN
            +L M  D ++ +       H+   + NE+N  +W            KR      L +    ++IG      GDFNE++S DEK GG  + +  ++ 
Subjt:  EMHRVLDMDLDTNKQKEQKIED-HMGCNLENEQNKRKW------------KRRARMVNLEDGGEKSKIGEKRKSSGDFNELLSGDEKTGGSLKNQKMIDN

Query:  FREAVGKCRLRDAGYSGNKYTWRRGKKASMAIRERLDRFLVNTEMDLRFKSIIVQHLHFHNSDHRAILADLGKDQRSNKRKRKKVLKFEESWAKNLEAKS
        FR A+    LRD G+ G KYTW  G      I+ERLDR + N E    F  I V+ +    SDH  IL    K    N R+ K+  ++E  W K+ E   
Subjt:  FREAVGKCRLRDAGYSGNKYTWRRGKKASMAIRERLDRFLVNTEMDLRFKSIIVQHLHFHNSDHRAILADLGKDQRSNKRKRKKVLKFEESWAKNLEAKS

Query:  IIERLWQNSPGVDMAAFERKVLHCLTKLSKWNR---ERLGGSINQAVDKKLEEIKILEEDQNNYPSPNLLKAEKELE-LLNEEEMYWKIRSREDWLKWGD
        +IER W+   G      + K++ C   L  W++   E+   SI +  DK    +KIL+E +       +      L  LL++E ++WK R++  WLK GD
Subjt:  IIERLWQNSPGVDMAAFERKVLHCLTKLSKWNR---ERLGGSINQAVDKKLEEIKILEEDQNNYPSPNLLKAEKELE-LLNEEEMYWKIRSREDWLKWGD

Query:  RNTKWFHKKASHRKNRNEITGPLNSEGCWEDNDDKMGEIASNYFNNLFNSSNPSEEDMERVLEGIIPSGCETPMPLKENLKG--------RHVYEILDER
        RNTK+FH  A+ R+ +N I    N++G  +     + E    YF  LF +++PS  ++E  +         T + + E+++G          V   L + 
Subjt:  RNTKWFHKKASHRKNRNEITGPLNSEGCWEDNDDKMGEIASNYFNNLFNSSNPSEEDMERVLEGIIPSGCETPMPLKENLKG--------RHVYEILDER

Query:  GCWK
          WK
Subjt:  GCWK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.0e-0829.28Show/hide
Query:  APEM---SKEDFQEAHLLPIM--KAPYARLENQKSQGRWSPPESGEWKLNVDASWSDSQNRGGVGWILRDSLGSSICMGFKRINKSWSVKYLEMKAIQEG
        APE+   + EDF+E      +  KA   ++E   S  +W  P     K N DA+W     R G+GWILR+  G  + MG + + ++ +V   E++A++  
Subjt:  APEM---SKEDFQEAHLLPIM--KAPYARLENQKSQGRWSPPESGEWKLNVDASWSDSQNRGGVGWILRDSLGSSICMGFKRINKSWSVKYLEMKAIQEG

Query:  ILSLSNLQASHLGSSLPPIAVESDAAGVMKILNKEEEDLTEISFLAEEILQLSSSLGIFSFLFCPREYNSAAHSLAQIATS
        +L++S             I  ESDA  ++ +LN ++   T +    E+I QL        F F PR  N  A  +A+ + S
Subjt:  ILSLSNLQASHLGSSLPPIAVESDAAGVMKILNKEEEDLTEISFLAEEILQLSSSLGIFSFLFCPREYNSAAHSLAQIATS

AT3G42140.1 zinc ion binding;nucleic acid binding3.4e-0522.07Show/hide
Query:  FKRIRDKIRISKGGPWSYDDAILIFEEPKGNCCVEALEFNYVSFWVHFHNLPRVCFCRKYAIALGDSIGNFESAESDENGKMTGETLRVKVRLKVNEPLK
        F+       I + GPWS++D + + +  +        EF  + FW+    +P      +   ++G+ +G F                     L+ N    
Subjt:  FKRIRDKIRISKGGPWSYDDAILIFEEPKGNCCVEALEFNYVSFWVHFHNLPRVCFCRKYAIALGDSIGNFESAESDENGKMTGETLRVKVRLKVNEPLK

Query:  RGTNVKIGSKAERTWIPITYEKLPDFCYSCGKLGHVYQECAIQGS
         G +V +        +   YEKL +FC +CG L H   EC   G+
Subjt:  RGTNVKIGSKAERTWIPITYEKLPDFCYSCGKLGHVYQECAIQGS

AT4G29090.1 Ribonuclease H-like superfamily protein3.4e-1332.67Show/hide
Query:  NQKSQGRWSPPESGEWKLNVDASWSDSQNRGGVGWILRDSLGSSICMGFKRINKSWSVKYLEMKAIQEGILSLSNLQASHLGSSLPPIAVESDAAGVMKI
        N+ S GRW PP     K N DA+W+    R G+GW+LR+  G    MG + + K  SV   E++A++  +LSLS  Q ++       +  ESD+  +++I
Subjt:  NQKSQGRWSPPESGEWKLNVDASWSDSQNRGGVGWILRDSLGSSICMGFKRINKSWSVKYLEMKAIQEGILSLSNLQASHLGSSLPPIAVESDAAGVMKI

Query:  LNKEEEDLTEISFLAEEILQLSSSLGIFSFLFCPREYNSAAHSLAQIATS
        LN  +E    +    +++ +L S      F+F PRE N+ A  +A+ + S
Subjt:  LNKEEEDLTEISFLAEEILQLSSSLGIFSFLFCPREYNSAAHSLAQIATS

AT5G36228.1 nucleic acid binding;zinc ion binding3.2e-1123.33Show/hide
Query:  KILTYRTINAEVFSVMMPRIWGIEGSVKVEKAGPNIFLCKFKRIRDKIRISKGGPWSYDDAILIFEEPKGNCCVEALEFNYVSFWVHFHNLPRVCFCRKY
        +IL  +T + E   + +P  WG+   V         F  +F+   D +   +  PW +++  +  +  +     + L F  +  WVH   +P      + 
Subjt:  KILTYRTINAEVFSVMMPRIWGIEGSVKVEKAGPNIFLCKFKRIRDKIRISKGGPWSYDDAILIFEEPKGNCCVEALEFNYVSFWVHFHNLPRVCFCRKY

Query:  AIALGDSIGNFESAESDENGKMTGETLRVKVRLKVNEPLKRGTNVKIGSKAERTWIPITYEKLPDFCYSCGKLGHVYQEC
           +  ++G   + + +E        +RVKVR+   EPL+    V+  S+ ER  I   YEKL   C +C ++ H    C
Subjt:  AIALGDSIGNFESAESDENGKMTGETLRVKVRLKVNEPLKRGTNVKIGSKAERTWIPITYEKLPDFCYSCGKLGHVYQEC

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.9e-0426.76Show/hide
Query:  RWSPPESGEWKLNVDASWSDSQNRGGVGWILRDSLGSSICMGFKRINKSWSVKYLEMKAIQEGILSLSNLQASHLGSSLPPIAVESDAAGVMKILNKEEE
        +WSPP   + K N DAS  +     G+GWILR+S G+ I  G  +     + +  E   +      +  +QAS+ G     +  E D   + +++N +  
Subjt:  RWSPPESGEWKLNVDASWSDSQNRGGVGWILRDSLGSSICMGFKRINKSWSVKYLEMKAIQEGILSLSNLQASHLGSSLPPIAVESDAAGVMKILNKEEE

Query:  DLTEISFLAEEILQLSSSLGIFSFLFCPREYNSAAHSLAQIA
        +   +    + I     S     F F  RE N  A  LA+ A
Subjt:  DLTEISFLAEEILQLSSSLGIFSFLFCPREYNSAAHSLAQIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGTAAGGGGCAACCATGAGGTTTGCGTTCAGGGGAAGGCGAGCAACATCAGCGTCAAAGCTGTGAGCAAGGACCCATCCATGGAAGAAGGAGAAGGATCACTGGA
GAGGTGTACTAAACGTAGCACAAGCAAAAATACGTACAACACCGAGAACGAAAACCAATCTATGGAGAATGAAACAGGGGCAGAGGAAATGCAACTGCTTCTGGAGAAAC
TAAAGCTGGAAGAAGGAAATAGAATCGTGGAAATAGAGGACGACGACTTGGAGGAAACAGACAGAGAATTCCAAAGTGCTATAGCCTGCAAAATCCTAACTTACAGGACG
ATAAACGCAGAGGTTTTTTCGGTGATGATGCCCAGAATCTGGGGAATAGAAGGATCTGTGAAAGTGGAAAAGGCGGGACCAAATATTTTCTTATGCAAGTTCAAAAGAAT
AAGAGACAAAATCAGAATCTCAAAAGGCGGCCCTTGGTCTTATGACGATGCCATTCTGATTTTTGAAGAACCAAAGGGGAACTGCTGCGTGGAAGCCCTAGAATTTAATT
ATGTTTCTTTTTGGGTTCATTTCCACAACTTACCTAGGGTGTGTTTTTGCAGGAAGTATGCCATAGCCCTGGGGGACTCCATTGGAAACTTTGAATCGGCTGAATCAGAC
GAGAATGGGAAAATGACGGGCGAAACTCTGAGGGTTAAAGTCAGATTAAAAGTTAACGAGCCATTGAAGAGAGGAACCAATGTGAAAATCGGATCAAAAGCAGAGAGAAC
CTGGATTCCTATAACGTACGAGAAATTACCGGACTTCTGTTATAGCTGCGGAAAATTGGGCCATGTGTACCAAGAATGTGCCATTCAGGGATCGACTGACAATCAGATCA
AACCATTTGGCATAGAGCTTAGAGAGACAAAAGGCAGCAAAGGAATTTACAAAATTTGGAAACACGACAACAGAGAATACAGAAGATGGAAACCAGAGTTCAGAGAAACA
CGTCCCCGAGGAAGAGGCAGAGGAAGAGCTGGAAGATTTGCCAGGGGAAGTTTCAACAAGTCCATAAACACAGAAGAGGAAGGCGGGAATTCTAACACACAAACGAGAGA
AAAAGAAGGAATGGGCAATAGGAAAGGAGGCGAAAAATTTGGGAAAAAGAAGGAGGAAGAGGGTTCGACGGAGAACTTTGCAAAAGAAGGGGCTCAAGTGGAGGAAATGG
AAGTCTTGCAATCGGAGGACTTCTCGCCAGAGAAACGACCGCCGGAACCCAGTCGGAGGGGGATGGATATCGAATTCTCCGGTATCAATCAAAGGGAGAGAACATCCCCA
ACGGCTAGTTCCTTGCCAGACAAAAAAGGAAAAGCAAAAGAATATCCAATCATGGAGTTAGAACAGCAGAGTACGAAAGTTTTAGGGGAACAGGAAAGTAGTAAGGAAAT
TAATCATGTCAATGTCAACATCATTCCAGAGGAAAATCTGAATCCACCGACAGAGGCAATTCAGAAAAAGGAAACAGTGACAGCTGGATCCAAAAAGAAAAGCTTCAGTG
ATCATAATGGGCCGAAAAGAGAAATGCACAGGGTTTTAGATATGGACCTTGATACCAATAAACAGAAAGAACAAAAGATTGAAGATCATATGGGCTGCAACCTAGAGAAT
GAACAAAACAAAAGGAAATGGAAAAGGAGAGCCAGAATGGTTAATTTAGAAGATGGGGGAGAAAAATCCAAGATTGGGGAAAAAAGGAAAAGCAGTGGAGATTTCAATGA
ATTGCTTTCGGGTGACGAAAAAACAGGAGGATCCCTCAAAAACCAAAAGATGATAGATAATTTCCGTGAAGCTGTTGGTAAATGTAGGCTAAGAGATGCGGGATATAGCG
GCAACAAATATACTTGGAGAAGGGGCAAGAAAGCTTCTATGGCCATCAGGGAGAGGCTTGACAGATTCTTAGTGAATACTGAAATGGACTTAAGGTTTAAAAGCATAATA
GTGCAGCACCTCCATTTCCATAATTCAGATCATCGGGCCATACTGGCTGATCTTGGGAAGGATCAGCGCAGTAACAAGAGAAAGAGGAAAAAAGTGTTAAAGTTTGAGGA
ATCTTGGGCCAAAAATCTGGAAGCTAAGTCCATCATTGAAAGATTGTGGCAAAATTCCCCTGGTGTTGATATGGCAGCCTTCGAAAGGAAAGTTCTTCACTGTCTAACAA
AACTTTCAAAATGGAACAGGGAAAGACTTGGAGGATCCATTAATCAGGCCGTGGACAAGAAATTGGAAGAGATCAAGATCTTGGAGGAGGATCAAAATAACTACCCATCG
CCGAACCTCTTAAAAGCCGAAAAAGAGCTGGAGTTGTTGAACGAAGAAGAGATGTATTGGAAAATTAGGTCTAGAGAGGATTGGTTAAAATGGGGAGATAGGAACACCAA
GTGGTTCCATAAAAAAGCCTCTCATAGAAAAAACAGAAATGAGATCACAGGCCCATTGAATTCTGAGGGTTGTTGGGAAGATAACGATGACAAAATGGGAGAGATTGCTT
CCAACTACTTCAATAATCTTTTCAATTCTTCTAATCCCTCGGAGGAAGATATGGAAAGGGTGTTAGAAGGAATCATCCCATCTGGATGTGAAACTCCTATGCCTCTAAAA
GAGAATTTGAAAGGTAGACATGTCTATGAAATCCTAGATGAAAGAGGTTGCTGGAAAGAAAGAGTCATAAAGGATAGTTTCTCTTTTATGGATTCTTCGACAATCCTTAA
TTCAACTTCGGGAGGCCCTCAGATTAAGGACGAAATAATATGGAACAGAGACAAGAAAGGTATCTTCTTGAAGGATTGGAATTCCCACTATGGCTACATCGGTATGACTC
TGAGACTTCTAGAGGCAGGAGACTGGTGGGAGTCCAGAGGAAGCCTTGAGGGGAATTCTCTGAGAAATATTTCTGTGTCCACGGATATCGACCAATGCAGAGTTCCTCTC
GAGCCAGGAGAGGACGGCGCGCCTTTGTTCAAGCCCCGGAATCAGCCCTTAAGGGAACACACATCTACTTACCCCAATAGGGGAAGAAGTGAATTCCATCTTGTACTGTT
ATGTTCCCAGCCCCCATTCGGTCTTGCCCCTGAAATGTCCAAGGAGGATTTTCAGGAAGCGCACCTGCTGCCCATTATGAAAGCCCCCTATGCGCGACTGGAGAACCAAA
AGAGTCAAGGTAGATGGTCGCCGCCGGAGTCCGGAGAGTGGAAGCTGAACGTGGATGCATCTTGGAGCGACTCTCAAAACAGAGGAGGCGTGGGTTGGATTCTCCGTGAC
TCTTTGGGTTCTTCAATTTGCATGGGCTTCAAGAGGATCAACAAAAGCTGGTCCGTTAAGTATCTTGAAATGAAAGCGATTCAGGAAGGTATTCTAAGTTTATCTAACCT
GCAAGCGTCTCATCTTGGCTCGTCTCTTCCTCCCATAGCAGTCGAGTCGGATGCAGCTGGAGTCATGAAAATCCTCAACAAGGAAGAGGAAGATCTTACCGAAATTTCTT
TCCTAGCCGAAGAGATTCTGCAGCTGAGTTCTTCGTTAGGCATTTTTTCTTTTCTTTTTTGCCCGCGAGAGTATAATTCCGCAGCCCACAGTTTGGCGCAAATTGCAACC
TCTCCTATTCCCCCCTCTTTTTTGTCGTCCGGTATCTCTTCCAATTCGGAAGAAGCTGTTGGTTTTTGGTTCGGGCCTCCCCCTTCGTGGGTTGGTGAACTTTTATTTGG
GGTTGTTGGTTCTGTTGGCGTTCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGACGTAAGGGGCAACCATGAGGTTTGCGTTCAGGGGAAGGCGAGCAACATCAGCGTCAAAGCTGTGAGCAAGGACCCATCCATGGAAGAAGGAGAAGGATCACTGGA
GAGGTGTACTAAACGTAGCACAAGCAAAAATACGTACAACACCGAGAACGAAAACCAATCTATGGAGAATGAAACAGGGGCAGAGGAAATGCAACTGCTTCTGGAGAAAC
TAAAGCTGGAAGAAGGAAATAGAATCGTGGAAATAGAGGACGACGACTTGGAGGAAACAGACAGAGAATTCCAAAGTGCTATAGCCTGCAAAATCCTAACTTACAGGACG
ATAAACGCAGAGGTTTTTTCGGTGATGATGCCCAGAATCTGGGGAATAGAAGGATCTGTGAAAGTGGAAAAGGCGGGACCAAATATTTTCTTATGCAAGTTCAAAAGAAT
AAGAGACAAAATCAGAATCTCAAAAGGCGGCCCTTGGTCTTATGACGATGCCATTCTGATTTTTGAAGAACCAAAGGGGAACTGCTGCGTGGAAGCCCTAGAATTTAATT
ATGTTTCTTTTTGGGTTCATTTCCACAACTTACCTAGGGTGTGTTTTTGCAGGAAGTATGCCATAGCCCTGGGGGACTCCATTGGAAACTTTGAATCGGCTGAATCAGAC
GAGAATGGGAAAATGACGGGCGAAACTCTGAGGGTTAAAGTCAGATTAAAAGTTAACGAGCCATTGAAGAGAGGAACCAATGTGAAAATCGGATCAAAAGCAGAGAGAAC
CTGGATTCCTATAACGTACGAGAAATTACCGGACTTCTGTTATAGCTGCGGAAAATTGGGCCATGTGTACCAAGAATGTGCCATTCAGGGATCGACTGACAATCAGATCA
AACCATTTGGCATAGAGCTTAGAGAGACAAAAGGCAGCAAAGGAATTTACAAAATTTGGAAACACGACAACAGAGAATACAGAAGATGGAAACCAGAGTTCAGAGAAACA
CGTCCCCGAGGAAGAGGCAGAGGAAGAGCTGGAAGATTTGCCAGGGGAAGTTTCAACAAGTCCATAAACACAGAAGAGGAAGGCGGGAATTCTAACACACAAACGAGAGA
AAAAGAAGGAATGGGCAATAGGAAAGGAGGCGAAAAATTTGGGAAAAAGAAGGAGGAAGAGGGTTCGACGGAGAACTTTGCAAAAGAAGGGGCTCAAGTGGAGGAAATGG
AAGTCTTGCAATCGGAGGACTTCTCGCCAGAGAAACGACCGCCGGAACCCAGTCGGAGGGGGATGGATATCGAATTCTCCGGTATCAATCAAAGGGAGAGAACATCCCCA
ACGGCTAGTTCCTTGCCAGACAAAAAAGGAAAAGCAAAAGAATATCCAATCATGGAGTTAGAACAGCAGAGTACGAAAGTTTTAGGGGAACAGGAAAGTAGTAAGGAAAT
TAATCATGTCAATGTCAACATCATTCCAGAGGAAAATCTGAATCCACCGACAGAGGCAATTCAGAAAAAGGAAACAGTGACAGCTGGATCCAAAAAGAAAAGCTTCAGTG
ATCATAATGGGCCGAAAAGAGAAATGCACAGGGTTTTAGATATGGACCTTGATACCAATAAACAGAAAGAACAAAAGATTGAAGATCATATGGGCTGCAACCTAGAGAAT
GAACAAAACAAAAGGAAATGGAAAAGGAGAGCCAGAATGGTTAATTTAGAAGATGGGGGAGAAAAATCCAAGATTGGGGAAAAAAGGAAAAGCAGTGGAGATTTCAATGA
ATTGCTTTCGGGTGACGAAAAAACAGGAGGATCCCTCAAAAACCAAAAGATGATAGATAATTTCCGTGAAGCTGTTGGTAAATGTAGGCTAAGAGATGCGGGATATAGCG
GCAACAAATATACTTGGAGAAGGGGCAAGAAAGCTTCTATGGCCATCAGGGAGAGGCTTGACAGATTCTTAGTGAATACTGAAATGGACTTAAGGTTTAAAAGCATAATA
GTGCAGCACCTCCATTTCCATAATTCAGATCATCGGGCCATACTGGCTGATCTTGGGAAGGATCAGCGCAGTAACAAGAGAAAGAGGAAAAAAGTGTTAAAGTTTGAGGA
ATCTTGGGCCAAAAATCTGGAAGCTAAGTCCATCATTGAAAGATTGTGGCAAAATTCCCCTGGTGTTGATATGGCAGCCTTCGAAAGGAAAGTTCTTCACTGTCTAACAA
AACTTTCAAAATGGAACAGGGAAAGACTTGGAGGATCCATTAATCAGGCCGTGGACAAGAAATTGGAAGAGATCAAGATCTTGGAGGAGGATCAAAATAACTACCCATCG
CCGAACCTCTTAAAAGCCGAAAAAGAGCTGGAGTTGTTGAACGAAGAAGAGATGTATTGGAAAATTAGGTCTAGAGAGGATTGGTTAAAATGGGGAGATAGGAACACCAA
GTGGTTCCATAAAAAAGCCTCTCATAGAAAAAACAGAAATGAGATCACAGGCCCATTGAATTCTGAGGGTTGTTGGGAAGATAACGATGACAAAATGGGAGAGATTGCTT
CCAACTACTTCAATAATCTTTTCAATTCTTCTAATCCCTCGGAGGAAGATATGGAAAGGGTGTTAGAAGGAATCATCCCATCTGGATGTGAAACTCCTATGCCTCTAAAA
GAGAATTTGAAAGGTAGACATGTCTATGAAATCCTAGATGAAAGAGGTTGCTGGAAAGAAAGAGTCATAAAGGATAGTTTCTCTTTTATGGATTCTTCGACAATCCTTAA
TTCAACTTCGGGAGGCCCTCAGATTAAGGACGAAATAATATGGAACAGAGACAAGAAAGGTATCTTCTTGAAGGATTGGAATTCCCACTATGGCTACATCGGTATGACTC
TGAGACTTCTAGAGGCAGGAGACTGGTGGGAGTCCAGAGGAAGCCTTGAGGGGAATTCTCTGAGAAATATTTCTGTGTCCACGGATATCGACCAATGCAGAGTTCCTCTC
GAGCCAGGAGAGGACGGCGCGCCTTTGTTCAAGCCCCGGAATCAGCCCTTAAGGGAACACACATCTACTTACCCCAATAGGGGAAGAAGTGAATTCCATCTTGTACTGTT
ATGTTCCCAGCCCCCATTCGGTCTTGCCCCTGAAATGTCCAAGGAGGATTTTCAGGAAGCGCACCTGCTGCCCATTATGAAAGCCCCCTATGCGCGACTGGAGAACCAAA
AGAGTCAAGGTAGATGGTCGCCGCCGGAGTCCGGAGAGTGGAAGCTGAACGTGGATGCATCTTGGAGCGACTCTCAAAACAGAGGAGGCGTGGGTTGGATTCTCCGTGAC
TCTTTGGGTTCTTCAATTTGCATGGGCTTCAAGAGGATCAACAAAAGCTGGTCCGTTAAGTATCTTGAAATGAAAGCGATTCAGGAAGGTATTCTAAGTTTATCTAACCT
GCAAGCGTCTCATCTTGGCTCGTCTCTTCCTCCCATAGCAGTCGAGTCGGATGCAGCTGGAGTCATGAAAATCCTCAACAAGGAAGAGGAAGATCTTACCGAAATTTCTT
TCCTAGCCGAAGAGATTCTGCAGCTGAGTTCTTCGTTAGGCATTTTTTCTTTTCTTTTTTGCCCGCGAGAGTATAATTCCGCAGCCCACAGTTTGGCGCAAATTGCAACC
TCTCCTATTCCCCCCTCTTTTTTGTCGTCCGGTATCTCTTCCAATTCGGAAGAAGCTGTTGGTTTTTGGTTCGGGCCTCCCCCTTCGTGGGTTGGTGAACTTTTATTTGG
GGTTGTTGGTTCTGTTGGCGTTCTTTAA
Protein sequenceShow/hide protein sequence
MDVRGNHEVCVQGKASNISVKAVSKDPSMEEGEGSLERCTKRSTSKNTYNTENENQSMENETGAEEMQLLLEKLKLEEGNRIVEIEDDDLEETDREFQSAIACKILTYRT
INAEVFSVMMPRIWGIEGSVKVEKAGPNIFLCKFKRIRDKIRISKGGPWSYDDAILIFEEPKGNCCVEALEFNYVSFWVHFHNLPRVCFCRKYAIALGDSIGNFESAESD
ENGKMTGETLRVKVRLKVNEPLKRGTNVKIGSKAERTWIPITYEKLPDFCYSCGKLGHVYQECAIQGSTDNQIKPFGIELRETKGSKGIYKIWKHDNREYRRWKPEFRET
RPRGRGRGRAGRFARGSFNKSINTEEEGGNSNTQTREKEGMGNRKGGEKFGKKKEEEGSTENFAKEGAQVEEMEVLQSEDFSPEKRPPEPSRRGMDIEFSGINQRERTSP
TASSLPDKKGKAKEYPIMELEQQSTKVLGEQESSKEINHVNVNIIPEENLNPPTEAIQKKETVTAGSKKKSFSDHNGPKREMHRVLDMDLDTNKQKEQKIEDHMGCNLEN
EQNKRKWKRRARMVNLEDGGEKSKIGEKRKSSGDFNELLSGDEKTGGSLKNQKMIDNFREAVGKCRLRDAGYSGNKYTWRRGKKASMAIRERLDRFLVNTEMDLRFKSII
VQHLHFHNSDHRAILADLGKDQRSNKRKRKKVLKFEESWAKNLEAKSIIERLWQNSPGVDMAAFERKVLHCLTKLSKWNRERLGGSINQAVDKKLEEIKILEEDQNNYPS
PNLLKAEKELELLNEEEMYWKIRSREDWLKWGDRNTKWFHKKASHRKNRNEITGPLNSEGCWEDNDDKMGEIASNYFNNLFNSSNPSEEDMERVLEGIIPSGCETPMPLK
ENLKGRHVYEILDERGCWKERVIKDSFSFMDSSTILNSTSGGPQIKDEIIWNRDKKGIFLKDWNSHYGYIGMTLRLLEAGDWWESRGSLEGNSLRNISVSTDIDQCRVPL
EPGEDGAPLFKPRNQPLREHTSTYPNRGRSEFHLVLLCSQPPFGLAPEMSKEDFQEAHLLPIMKAPYARLENQKSQGRWSPPESGEWKLNVDASWSDSQNRGGVGWILRD
SLGSSICMGFKRINKSWSVKYLEMKAIQEGILSLSNLQASHLGSSLPPIAVESDAAGVMKILNKEEEDLTEISFLAEEILQLSSSLGIFSFLFCPREYNSAAHSLAQIAT
SPIPPSFLSSGISSNSEEAVGFWFGPPPSWVGELLFGVVGSVGVL