CuGenDBv2

Spg010765 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg010765
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: DUF4283 domain-containing protein
Genome location: scaffold10:5079115..5101071
RNA-Seq Expression: Spg010765
Synteny: Spg010765
Gene Ontology terms: NA
InterPro domains:
IPR025558 - Domain of unknown function DUF4283
IPR026960 - Reverse transcriptase zinc-binding domain
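
The genome location above is written as `scaffold:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the string could be parsed and the span computed as follows. `Location` and `parse_location` are hypothetical helper names, not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings such as "scaffold10:5079115..5101071".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return Location(match["seqid"], start, end)


loc = parse_location("scaffold10:5079115..5101071")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the locus spans 21957 bp on scaffold10.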


Homology
GenBank top hits (e value, % identity, alignment)
KAA0039949.1 hypothetical protein E6C27_scaffold122G002290 [Cucumis melo var. makuwa] | e value: 3.9e-57 | % identity: 34.28
Query:  KKFSLSAEEIVIVWISDSIEDLLLAPATHKFFRKVDCNNGFIWIQKISNKRGSFLEITKVNNSGGKHNLVVPAGVEYRGWKNFSDLL-------KKFLNG
        K FS++     + W+  + + LL  P T +FF +    +  +W+QKI N+RG   EI +V++ G K  ++VP G++  GW  F+D+L       KK    
Subjt:  KKFSLSAEEIVIVWISDSIEDLLLAPATHKFFRKVDCNNGFIWIQKISNKRGSFLEITKVNNSGGKHNLVVPAGVEYRGWKNFSDLL-------KKFLNG

Query:  EDDYLED------CKEVETEVRRKGKGKSFADIVKRSPSNSISMKTPQGSRGKERKVAPEVRGSLGGTEWKDIPVCRGCSEEVRK------INWEEVIVI
           Y +D       +  ++    +   K++ + V  S  +SIS  +   S   + K  P ++        K +P  +    E+RK      I+WE+ I++
Subjt:  EDDYLED------CKEVETEVRRKGKGKSFADIVKRSPSNSISMKTPQGSRGKERKVAPEVRGSLGGTEWKDIPVCRGCSEEVRK------INWEEVIVI

Query:  TKRDFHDDWGRILEVMQQQLMEP---LIINPFHPDKALIKCHSRELADLLTKNQGWVSFGPIILKVEKWNKNAHSRISCIPSYGGWVKIRNLPLHLWNMQ
        ++R FHDDW +I++ ++ Q  +        PFH DKAL+    +ELA+LL KN GW + GP  +K EKW+KNAH+    IPSYGGW + R +PLH+WN+ 
Subjt:  TKRDFHDDWGRILEVMQQQLMEP---LIINPFHPDKALIKCHSRELADLLTKNQGWVSFGPIILKVEKWNKNAHSRISCIPSYGGWVKIRNLPLHLWNMQ

Query:  TFKAIGDNLGGFIEYEEDNSLLIECVEVKIKIKENYWGFIPAEVRI--EDGEDLYIVQIVTFQDGSLLINQVAGIHGTFSSTTIHVFH
        TF  IG+  GGFI+   ++   +E  E  IK+KENY GF+PA ++I  E+G+D +IVQ +T  +G  L  +   +HG+F+ +    F+
Subjt:  TFKAIGDNLGGFIEYEEDNSLLIECVEVKIKIKENYWGFIPAEVRI--EDGEDLYIVQIVTFQDGSLLINQVAGIHGTFSSTTIHVFH

KAA0040039.1 hypothetical protein E6C27_scaffold366G00060 [Cucumis melo var. makuwa] | e value: 1.3e-55 | % identity: 33.01
Query:  KKFSLSAEEIVIVWISDSIEDLLLAPATHKFFRKVDCNNGFIWIQKISNKRGSFLEITKVNNSGGKHNLVVPAGVEYRGWKNFSDLL--KKFLNGEDDYL
        K FS++     + W+  S   LL  P T +FF +       +W+QK  N++G   EI +V++ G K  ++VP G E  GW +F  LL  KK  + + +Y 
Subjt:  KKFSLSAEEIVIVWISDSIEDLLLAPATHKFFRKVDCNNGFIWIQKISNKRGSFLEITKVNNSGGKHNLVVPAGVEYRGWKNFSDLL--KKFLNGEDDYL

Query:  EDCKEVETEVRRKGKGKSFADIVKRSPSNSISMKTPQGSRGKERKVAPEVRGSLGGTEWKDIPVCRGCSEEVRKINWEEVIVITKRDFHDDWGRILEVMQ
             + T++R    G S +D  + +   S +    +GS   E             +  ++I    G S      +WE   V+T+R FHDDW RI+E + 
Subjt:  EDCKEVETEVRRKGKGKSFADIVKRSPSNSISMKTPQGSRGKERKVAPEVRGSLGGTEWKDIPVCRGCSEEVRKINWEEVIVITKRDFHDDWGRILEVMQ

Query:  QQLMEPLIINPFHPDKALIKCHSRELADLLTKNQGWVSFGPIILKVEKWNKNAHSRISCIPSYGGWVKIRNLPLHLWNMQTFKAIGDNLGGFIEYEEDNS
        +QL   +   PFH DKALI   + E A LL KN+GW + G   +K E+W++  H+    IPSYGGW+K+R +PLH WN+++F  IGD  GGF+E  ++  
Subjt:  QQLMEPLIINPFHPDKALIKCHSRELADLLTKNQGWVSFGPIILKVEKWNKNAHSRISCIPSYGGWVKIRNLPLHLWNMQTFKAIGDNLGGFIEYEEDNS

Query:  LLIECVEVKIKIKENYWGFIPAEVRIEDGED-LYIVQIVTFQDGSLLINQVAGIHGTFSSTTIHVFHRGPMDPLFCTTDIWRIENGLNYPVVKVLQSVEE
         L +  E  IKIK+NY GFIPA +++ D E+  +IVQ++   +G     +   IHGTF+      F     D     ++ +  E+       K +  V  
Subjt:  LLIECVEVKIKIKENYWGFIPAEVRIEDGED-LYIVQIVTFQDGSLLINQVAGIHGTFSSTTIHVFHRGPMDPLFCTTDIWRIENGLNYPVVKVLQSVEE

Query:  PRISGGQLDGNK
         + +G  L+ NK
Subjt:  PRISGGQLDGNK

KAA0041398.1 hypothetical protein E6C27_scaffold206G00440 [Cucumis melo var. makuwa] | e value: 6.0e-58 | % identity: 34.38
Query:  KKFSLSAEEIVIVWISDSIEDLLLAPATHKFFRKVDCNNGFIWIQKISNKRGSFLEITKVNNSGGKHNLVVPAGVEYRGWKNFSDLL-------KKFLNG
        K FS++     + W+  + + LL  P T +FF +    +  +W+QKI N+RG   EI +V++ G K  ++VP G++  GW  FSD+L       KK ++ 
Subjt:  KKFSLSAEEIVIVWISDSIEDLLLAPATHKFFRKVDCNNGFIWIQKISNKRGSFLEITKVNNSGGKHNLVVPAGVEYRGWKNFSDLL-------KKFLNG

Query:  EDDYLED------CKEVETEVRRKGKGKSFADIVKRSPSNSISMKTPQGSRGKERKVAPEVRGSLGGTEWKDIPVCRGCSEEVRKINWEEVIVITKRDFH
           Y +D       K  ++    +   K++A++V  S S+S S  +   +    ++  P ++  +   E K+             I+WE+ I++++R FH
Subjt:  EDDYLED------CKEVETEVRRKGKGKSFADIVKRSPSNSISMKTPQGSRGKERKVAPEVRGSLGGTEWKDIPVCRGCSEEVRKINWEEVIVITKRDFH

Query:  DDWGRILEVMQQQLMEP---LIINPFHPDKALIKCHSRELADLLTKNQGWVSFGPIILKVEKWNKNAHSRISCIPSYGGWVKIRNLPLHLWNMQTFKAIG
        DDW +I++ +++Q  +        PFH DKAL+    ++LA LL KN GW + GP  +K EKW+K+ H+    IPSYGGW + R +PLH+WN+ TF  IG
Subjt:  DDWGRILEVMQQQLMEP---LIINPFHPDKALIKCHSRELADLLTKNQGWVSFGPIILKVEKWNKNAHSRISCIPSYGGWVKIRNLPLHLWNMQTFKAIG

Query:  DNLGGFIEYEEDNSLLIECVEVKIKIKENYWGFIPAEVRIEDGE-DLYIVQIVTFQDGSLLINQVAGIHGTFSSTTIHVFH
        +  GGFI+   ++   IE  E  IK+KENY GF+PA ++I D E  ++I+QIVT  +G  L  +   IHG+F  T    F+
Subjt:  DNLGGFIEYEEDNSLLIECVEVKIKIKENYWGFIPAEVRIEDGE-DLYIVQIVTFQDGSLLINQVAGIHGTFSSTTIHVFH

TYK10355.1 hypothetical protein E5676_scaffold367G00330 [Cucumis melo var. makuwa] | e value: 1.5e-56 | % identity: 34.82
Query:  KKFSLSAEEIVIVWISDSIEDLLLAPATHKFFRKVDCNNGFIWIQKISNKRGSFLEITKVNNSGGKHNLVVPAGVEYRGWKNFSDLL--KKFLNGEDDYL
        K FS++     + W+  S   LL  P T +FF +    +  +W+QK  N++G   EI +V++ G K  ++VP G E  GW  F  LL  KK  + + +Y 
Subjt:  KKFSLSAEEIVIVWISDSIEDLLLAPATHKFFRKVDCNNGFIWIQKISNKRGSFLEITKVNNSGGKHNLVVPAGVEYRGWKNFSDLL--KKFLNGEDDYL

Query:  EDCKEVETEVRRKGKGKSFADIVKRSPSNSISMKTPQGSRGKERKVAPEVRGSLGGTEWKDIPVCRGCSEEVRKINWEEVIVITKRDFHDDWGRILEVMQ
             + T+++ +       D  KR    S +    +GS   E               W       G S       WE  +V+T+R FHDDW +I+E + 
Subjt:  EDCKEVETEVRRKGKGKSFADIVKRSPSNSISMKTPQGSRGKERKVAPEVRGSLGGTEWKDIPVCRGCSEEVRKINWEEVIVITKRDFHDDWGRILEVMQ

Query:  QQLMEPLIINPFHPDKALIKCHSRELADLLTKNQGWVSFGPIILKVEKWNKNAHSRISCIPSYGGWVKIRNLPLHLWNMQTFKAIGDNLGGFIEYEEDNS
        +QL   +   PFH DKALI   + E A+L+ KN+GW + G   +K E+WN+ AH+    IPSYGGW+K+R +PLH WN+++F  IGD  GGFIE  ++  
Subjt:  QQLMEPLIINPFHPDKALIKCHSRELADLLTKNQGWVSFGPIILKVEKWNKNAHSRISCIPSYGGWVKIRNLPLHLWNMQTFKAIGDNLGGFIEYEEDNS

Query:  LLIECVEVKIKIKENYWGFIPAEVRIEDGED-LYIVQIVTFQDGSLLINQVAGIHGTFS
         L + +E  I+IK+NY GFIPA +++ D E+  +I+Q++   +G   + +   IHGTF+
Subjt:  LLIECVEVKIKIKENYWGFIPAEVRIEDGED-LYIVQIVTFQDGSLLINQVAGIHGTFS

XP_022149859.1 uncharacterized protein LOC111018186 [Momordica charantia] | e value: 1.5e-69 | % identity: 49.81
Query:  SEEVRKINWEEVIVITKRDFHDDWGRILEVMQQQLMEPLIINPFHPDKALIKCHSRELADLLTKNQGWVSFGPIILKVEKWNKNAHSRISCIPSYGGWVK
        +EEVR++NWEE IVIT+RDFHDDW RIL  +++Q     IINPF  DKAL+KC S++LA LL  N+GWV+FGP+ +K+E WN   H R    PSYG WVK
Subjt:  SEEVRKINWEEVIVITKRDFHDDWGRILEVMQQQLMEPLIINPFHPDKALIKCHSRELADLLTKNQGWVSFGPIILKVEKWNKNAHSRISCIPSYGGWVK

Query:  IRNLPLHLWNMQTFKAIGDNLGGFIEYEEDNSLLIECVEVKIKIKENYWGFIPAEVRIEDGEDLYIVQIVTFQDGSLLINQVAGIHGTFSSTTIHVFHRG
        IRN+PLHLW++ TFKAIG+ LGGFI+Y+++NS  IEC +V IK+K NY GFIPAE+   DG   +  ++V+F+D   L  +  GIHG FSS     FH+G
Subjt:  IRNLPLHLWNMQTFKAIGDNLGGFIEYEEDNSLLIECVEVKIKIKENYWGFIPAEVRIEDGEDLYIVQIVTFQDGSLLINQVAGIHGTFSSTTIHVFHRG

Query:  PMDPLFCTTDIWRIENGLNYPVVKVLQSVEEPRISGGQLDGNKTDFEISR--LRNKGKNKE
          +    + D WR+ENG NYP V +      P     +  G+K  F  +R  L    K +E
Subjt:  PMDPLFCTTDIWRIENGLNYPVVKVLQSVEEPRISGGQLDGNKTDFEISR--LRNKGKNKE

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TEP0 DUF4283 domain-containing protein | e value: 2.9e-58 | % identity: 34.38
Query:  KKFSLSAEEIVIVWISDSIEDLLLAPATHKFFRKVDCNNGFIWIQKISNKRGSFLEITKVNNSGGKHNLVVPAGVEYRGWKNFSDLL-------KKFLNG
        K FS++     + W+  + + LL  P T +FF +    +  +W+QKI N+RG   EI +V++ G K  ++VP G++  GW  FSD+L       KK ++ 
Subjt:  KKFSLSAEEIVIVWISDSIEDLLLAPATHKFFRKVDCNNGFIWIQKISNKRGSFLEITKVNNSGGKHNLVVPAGVEYRGWKNFSDLL-------KKFLNG

Query:  EDDYLED------CKEVETEVRRKGKGKSFADIVKRSPSNSISMKTPQGSRGKERKVAPEVRGSLGGTEWKDIPVCRGCSEEVRKINWEEVIVITKRDFH
           Y +D       K  ++    +   K++A++V  S S+S S  +   +    ++  P ++  +   E K+             I+WE+ I++++R FH
Subjt:  EDDYLED------CKEVETEVRRKGKGKSFADIVKRSPSNSISMKTPQGSRGKERKVAPEVRGSLGGTEWKDIPVCRGCSEEVRKINWEEVIVITKRDFH

Query:  DDWGRILEVMQQQLMEP---LIINPFHPDKALIKCHSRELADLLTKNQGWVSFGPIILKVEKWNKNAHSRISCIPSYGGWVKIRNLPLHLWNMQTFKAIG
        DDW +I++ +++Q  +        PFH DKAL+    ++LA LL KN GW + GP  +K EKW+K+ H+    IPSYGGW + R +PLH+WN+ TF  IG
Subjt:  DDWGRILEVMQQQLMEP---LIINPFHPDKALIKCHSRELADLLTKNQGWVSFGPIILKVEKWNKNAHSRISCIPSYGGWVKIRNLPLHLWNMQTFKAIG

Query:  DNLGGFIEYEEDNSLLIECVEVKIKIKENYWGFIPAEVRIEDGE-DLYIVQIVTFQDGSLLINQVAGIHGTFSSTTIHVFH
        +  GGFI+   ++   IE  E  IK+KENY GF+PA ++I D E  ++I+QIVT  +G  L  +   IHG+F  T    F+
Subjt:  DNLGGFIEYEEDNSLLIECVEVKIKIKENYWGFIPAEVRIEDGE-DLYIVQIVTFQDGSLLINQVAGIHGTFSSTTIHVFH

A0A5A7TFK7 DUF4283 domain-containing protein | e value: 6.1e-56 | % identity: 33.01
Query:  KKFSLSAEEIVIVWISDSIEDLLLAPATHKFFRKVDCNNGFIWIQKISNKRGSFLEITKVNNSGGKHNLVVPAGVEYRGWKNFSDLL--KKFLNGEDDYL
        K FS++     + W+  S   LL  P T +FF +       +W+QK  N++G   EI +V++ G K  ++VP G E  GW +F  LL  KK  + + +Y 
Subjt:  KKFSLSAEEIVIVWISDSIEDLLLAPATHKFFRKVDCNNGFIWIQKISNKRGSFLEITKVNNSGGKHNLVVPAGVEYRGWKNFSDLL--KKFLNGEDDYL

Query:  EDCKEVETEVRRKGKGKSFADIVKRSPSNSISMKTPQGSRGKERKVAPEVRGSLGGTEWKDIPVCRGCSEEVRKINWEEVIVITKRDFHDDWGRILEVMQ
             + T++R    G S +D  + +   S +    +GS   E             +  ++I    G S      +WE   V+T+R FHDDW RI+E + 
Subjt:  EDCKEVETEVRRKGKGKSFADIVKRSPSNSISMKTPQGSRGKERKVAPEVRGSLGGTEWKDIPVCRGCSEEVRKINWEEVIVITKRDFHDDWGRILEVMQ

Query:  QQLMEPLIINPFHPDKALIKCHSRELADLLTKNQGWVSFGPIILKVEKWNKNAHSRISCIPSYGGWVKIRNLPLHLWNMQTFKAIGDNLGGFIEYEEDNS
        +QL   +   PFH DKALI   + E A LL KN+GW + G   +K E+W++  H+    IPSYGGW+K+R +PLH WN+++F  IGD  GGF+E  ++  
Subjt:  QQLMEPLIINPFHPDKALIKCHSRELADLLTKNQGWVSFGPIILKVEKWNKNAHSRISCIPSYGGWVKIRNLPLHLWNMQTFKAIGDNLGGFIEYEEDNS

Query:  LLIECVEVKIKIKENYWGFIPAEVRIEDGED-LYIVQIVTFQDGSLLINQVAGIHGTFSSTTIHVFHRGPMDPLFCTTDIWRIENGLNYPVVKVLQSVEE
         L +  E  IKIK+NY GFIPA +++ D E+  +IVQ++   +G     +   IHGTF+      F     D     ++ +  E+       K +  V  
Subjt:  LLIECVEVKIKIKENYWGFIPAEVRIEDGED-LYIVQIVTFQDGSLLINQVAGIHGTFSSTTIHVFHRGPMDPLFCTTDIWRIENGLNYPVVKVLQSVEE

Query:  PRISGGQLDGNK
         + +G  L+ NK
Subjt:  PRISGGQLDGNK

A0A5D3CFS8 DUF4283 domain-containing protein | e value: 7.2e-57 | % identity: 34.82
Query:  KKFSLSAEEIVIVWISDSIEDLLLAPATHKFFRKVDCNNGFIWIQKISNKRGSFLEITKVNNSGGKHNLVVPAGVEYRGWKNFSDLL--KKFLNGEDDYL
        K FS++     + W+  S   LL  P T +FF +    +  +W+QK  N++G   EI +V++ G K  ++VP G E  GW  F  LL  KK  + + +Y 
Subjt:  KKFSLSAEEIVIVWISDSIEDLLLAPATHKFFRKVDCNNGFIWIQKISNKRGSFLEITKVNNSGGKHNLVVPAGVEYRGWKNFSDLL--KKFLNGEDDYL

Query:  EDCKEVETEVRRKGKGKSFADIVKRSPSNSISMKTPQGSRGKERKVAPEVRGSLGGTEWKDIPVCRGCSEEVRKINWEEVIVITKRDFHDDWGRILEVMQ
             + T+++ +       D  KR    S +    +GS   E               W       G S       WE  +V+T+R FHDDW +I+E + 
Subjt:  EDCKEVETEVRRKGKGKSFADIVKRSPSNSISMKTPQGSRGKERKVAPEVRGSLGGTEWKDIPVCRGCSEEVRKINWEEVIVITKRDFHDDWGRILEVMQ

Query:  QQLMEPLIINPFHPDKALIKCHSRELADLLTKNQGWVSFGPIILKVEKWNKNAHSRISCIPSYGGWVKIRNLPLHLWNMQTFKAIGDNLGGFIEYEEDNS
        +QL   +   PFH DKALI   + E A+L+ KN+GW + G   +K E+WN+ AH+    IPSYGGW+K+R +PLH WN+++F  IGD  GGFIE  ++  
Subjt:  QQLMEPLIINPFHPDKALIKCHSRELADLLTKNQGWVSFGPIILKVEKWNKNAHSRISCIPSYGGWVKIRNLPLHLWNMQTFKAIGDNLGGFIEYEEDNS

Query:  LLIECVEVKIKIKENYWGFIPAEVRIEDGED-LYIVQIVTFQDGSLLINQVAGIHGTFS
         L + +E  I+IK+NY GFIPA +++ D E+  +I+Q++   +G   + +   IHGTF+
Subjt:  LLIECVEVKIKIKENYWGFIPAEVRIEDGED-LYIVQIVTFQDGSLLINQVAGIHGTFS

A0A5D3DLT1 DUF4283 domain-containing protein | e value: 1.9e-57 | % identity: 34.28
Query:  KKFSLSAEEIVIVWISDSIEDLLLAPATHKFFRKVDCNNGFIWIQKISNKRGSFLEITKVNNSGGKHNLVVPAGVEYRGWKNFSDLL-------KKFLNG
        K FS++     + W+  + + LL  P T +FF +    +  +W+QKI N+RG   EI +V++ G K  ++VP G++  GW  F+D+L       KK    
Subjt:  KKFSLSAEEIVIVWISDSIEDLLLAPATHKFFRKVDCNNGFIWIQKISNKRGSFLEITKVNNSGGKHNLVVPAGVEYRGWKNFSDLL-------KKFLNG

Query:  EDDYLED------CKEVETEVRRKGKGKSFADIVKRSPSNSISMKTPQGSRGKERKVAPEVRGSLGGTEWKDIPVCRGCSEEVRK------INWEEVIVI
           Y +D       +  ++    +   K++ + V  S  +SIS  +   S   + K  P ++        K +P  +    E+RK      I+WE+ I++
Subjt:  EDDYLED------CKEVETEVRRKGKGKSFADIVKRSPSNSISMKTPQGSRGKERKVAPEVRGSLGGTEWKDIPVCRGCSEEVRK------INWEEVIVI

Query:  TKRDFHDDWGRILEVMQQQLMEP---LIINPFHPDKALIKCHSRELADLLTKNQGWVSFGPIILKVEKWNKNAHSRISCIPSYGGWVKIRNLPLHLWNMQ
        ++R FHDDW +I++ ++ Q  +        PFH DKAL+    +ELA+LL KN GW + GP  +K EKW+KNAH+    IPSYGGW + R +PLH+WN+ 
Subjt:  TKRDFHDDWGRILEVMQQQLMEP---LIINPFHPDKALIKCHSRELADLLTKNQGWVSFGPIILKVEKWNKNAHSRISCIPSYGGWVKIRNLPLHLWNMQ

Query:  TFKAIGDNLGGFIEYEEDNSLLIECVEVKIKIKENYWGFIPAEVRI--EDGEDLYIVQIVTFQDGSLLINQVAGIHGTFSSTTIHVFH
        TF  IG+  GGFI+   ++   +E  E  IK+KENY GF+PA ++I  E+G+D +IVQ +T  +G  L  +   +HG+F+ +    F+
Subjt:  TFKAIGDNLGGFIEYEEDNSLLIECVEVKIKIKENYWGFIPAEVRI--EDGEDLYIVQIVTFQDGSLLINQVAGIHGTFSSTTIHVFH

A0A6J1D6X4 uncharacterized protein LOC111018186 | e value: 7.4e-70 | % identity: 49.81
Query:  SEEVRKINWEEVIVITKRDFHDDWGRILEVMQQQLMEPLIINPFHPDKALIKCHSRELADLLTKNQGWVSFGPIILKVEKWNKNAHSRISCIPSYGGWVK
        +EEVR++NWEE IVIT+RDFHDDW RIL  +++Q     IINPF  DKAL+KC S++LA LL  N+GWV+FGP+ +K+E WN   H R    PSYG WVK
Subjt:  SEEVRKINWEEVIVITKRDFHDDWGRILEVMQQQLMEPLIINPFHPDKALIKCHSRELADLLTKNQGWVSFGPIILKVEKWNKNAHSRISCIPSYGGWVK

Query:  IRNLPLHLWNMQTFKAIGDNLGGFIEYEEDNSLLIECVEVKIKIKENYWGFIPAEVRIEDGEDLYIVQIVTFQDGSLLINQVAGIHGTFSSTTIHVFHRG
        IRN+PLHLW++ TFKAIG+ LGGFI+Y+++NS  IEC +V IK+K NY GFIPAE+   DG   +  ++V+F+D   L  +  GIHG FSS     FH+G
Subjt:  IRNLPLHLWNMQTFKAIGDNLGGFIEYEEDNSLLIECVEVKIKIKENYWGFIPAEVRIEDGEDLYIVQIVTFQDGSLLINQVAGIHGTFSSTTIHVFHRG

Query:  PMDPLFCTTDIWRIENGLNYPVVKVLQSVEEPRISGGQLDGNKTDFEISR--LRNKGKNKE
          +    + D WR+ENG NYP V +      P     +  G+K  F  +R  L    K +E
Subjt:  PMDPLFCTTDIWRIENGLNYPVVKVLQSVEEPRISGGQLDGNKTDFEISR--LRNKGKNKE

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G24090.1 RNase H family protein | e value: 3.1e-12 | % identity: 30.08
Query:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD
        ++++KD ++VV+KGDV G Y+   + +AQ G                           +FD   ++YKG  L K+ E+YL+S GL+   YS+ A+++  D
Subjt:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD

Query:  LFGKLVACPHEQPSATRGKMAEE
        +FG L  C  ++P+    K++E+
Subjt:  LFGKLVACPHEQPSATRGKMAEE

AT1G24090.1 RNase H family protein | e value: 9.7e-06 | % identity: 65.71
Query:  ASVSKDTYFLEFDGASKGNPGLAGAGAVLRANDGT
        A +S +T F+EFDGASKGNPGL+GA AVL+  DG+
Subjt:  ASVSKDTYFLEFDGASKGNPGLAGAGAVLRANDGT

AT3G01410.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | e value: 1.6e-16 | % identity: 33.81
Query:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD
        ME++KD +Y+V+KGD+ G YRS  E + QAG                           +  P  ++YKG    K AE  L+S G+++A +S++A++V  D
Subjt:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD

Query:  LFGKLVACPHEQPSATRGKMAEENPRVSRQQVLENTESG
         FGKL+ CP +QPS+++G+   ++    R Q + + ESG
Subjt:  LFGKLVACPHEQPSATRGKMAEENPRVSRQQVLENTESG

AT3G01410.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | e value: 1.3e-05 | % identity: 43.9
Query:  GDNRSVDPSPFAGMPQLTLGRNNGVFRVCERSWEEERSLVRLNASIRASVSKDTYFLEFDGASKGNPGLAGAGAVLRANDGT
        G++ S  PSP    PQ  L   N + R    S    R+ +R N         D+  +EFDGASKGNPG AGAGAVLRA+D +
Subjt:  GDNRSVDPSPFAGMPQLTLGRNNGVFRVCERSWEEERSLVRLNASIRASVSKDTYFLEFDGASKGNPGLAGAGAVLRANDGT

AT3G01410.2 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | e value: 1.6e-16 | % identity: 33.81
Query:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD
        ME++KD +Y+V+KGD+ G YRS  E + QAG                           +  P  ++YKG    K AE  L+S G+++A +S++A++V  D
Subjt:  MEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKD

Query:  LFGKLVACPHEQPSATRGKMAEENPRVSRQQVLENTESG
         FGKL+ CP +QPS+++G+   ++    R Q + + ESG
Subjt:  LFGKLVACPHEQPSATRGKMAEENPRVSRQQVLENTESG

AT3G01410.2 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | e value: 1.3e-05 | % identity: 43.9
Query:  GDNRSVDPSPFAGMPQLTLGRNNGVFRVCERSWEEERSLVRLNASIRASVSKDTYFLEFDGASKGNPGLAGAGAVLRANDGT
        G++ S  PSP    PQ  L   N + R    S    R+ +R N         D+  +EFDGASKGNPG AGAGAVLRA+D +
Subjt:  GDNRSVDPSPFAGMPQLTLGRNNGVFRVCERSWEEERSLVRLNASIRASVSKDTYFLEFDGASKGNPGLAGAGAVLRANDGT

AT5G51080.1 RNase H family protein | e value: 7.2e-09 | % identity: 28.91
Query:  EEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKDL
        +++KD ++VV+KGD+ G Y+   + +AQ G                           ++DP  ++YKG  L K+ E+ L++ GL+   Y   A ++ +D+
Subjt:  EEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKDL

Query:  FGKLVAC--PHEQPSATRG--KMAEENP
        FG L  C    + PSA+    K+AE  P
Subjt:  FGKLVAC--PHEQPSATRG--KMAEENP

AT5G51080.1 RNase H family protein | e value: 2.8e-05 | % identity: 41.89
Query:  ASVSKDTYFLEFDGASKGNPGLAGAGAVLRANDGT------------TEQNSDADALANCAIHLRVKGYTKSSV
        A  S +T  +EFDGASKGNPGL+GA AVL+  DG+            T   ++   L     H   KGYTK  V
Subjt:  ASVSKDTYFLEFDGASKGNPGLAGAGAVLRANDGT------------TEQNSDADALANCAIHLRVKGYTKSSV

AT5G51080.2 RNase H family protein | e value: 7.2e-09 | % identity: 28.91
Query:  EEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKDL
        +++KD ++VV+KGD+ G Y+   + +AQ G                           ++DP  ++YKG  L K+ E+ L++ GL+   Y   A ++ +D+
Subjt:  EEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLASRGLQSATYSISAANVTKDL

Query:  FGKLVAC--PHEQPSATRG--KMAEENP
        FG L  C    + PSA+    K+AE  P
Subjt:  FGKLVAC--PHEQPSATRG--KMAEENP

AT5G51080.2 RNase H family protein | e value: 2.8e-05 | % identity: 41.89
Query:  ASVSKDTYFLEFDGASKGNPGLAGAGAVLRANDGT------------TEQNSDADALANCAIHLRVKGYTKSSV
        A  S +T  +EFDGASKGNPGL+GA AVL+  DG+            T   ++   L     H   KGYTK  V
Subjt:  ASVSKDTYFLEFDGASKGNPGLAGAGAVLRANDGT------------TEQNSDADALANCAIHLRVKGYTKSSV


Sequences
CDS sequence
ATGGTAGTTCTAGGCCACGATCAAGAAGCTAGACTAGAACTCGACATGTTTATGAGTGCTCTGTTTCACTTATTTGTACTCTGGATGGAGGAAGATAAAGATACCTATTA
CGTTGTACAGAAAGGGGATGTTTTTGGATTTTACAGGAGTTGGAAGGAGTTCGAGGCTCAAGCTGGATGTTTTGTGATATTGTGGCGATTTTTCTCATCTGTCGTTGGAG
TTGCTCCTGACCATGGACAGTCACTGTCTATATGCTGTATATTTGATCCTAATGCAACGATCTACAAAGGGTGTCACTTATCTAAAGAAGCAGAGCAATACCTTGCATCA
CGTGGACTTCAGAGTGCAACTTACTCTATAAGTGCTGCCAATGTGACAAAGGATCTATTCGGAAAACTAGTTGCTTGCCCTCATGAGCAACCATCTGCCACCAGAGGAAA
AATGGCTGAAGAGAACCCCAGAGTTAGTAGACAACAAGTCCTTGAGAATACTGAATCTGGTTTTGTAGGTGCCAACTGGGTCTCAACAGATTCTCCAAAGGAAGAAATTA
TTTGGGATCACGGCTTTGAAGCCGTACCTGCTTCTTCGAGTTGTGAGGGCGTTGAAGAGGGGAGAGGGGTGCATCTCGTTAGTTGGGAGGTGGTTGGGAAACTTGTGGGT
CTTGGTGGGCTTGAGCTGGGTAACCCAAGGGCTCGGAATAGAGCCTTGTTAACTAAATGGTTGTGGTGTTTTCCCCTTGAGTCTACATCTTTAGGGCATCAGATCATCGC
TAGTAAATATGGTTTTCACCCTCATGAGTGGGTGTCGGGAGGGGTTAAGGGTGGGGGATGGAAAAGGATTAAAATTCCGAAGAAAGTTAGGTTCTTTACTTGGCAAGTTA
TCCATGGTAGAGTCAATACGATAGATAAACTTGCGAGGAAGATGACTTCTATGGTTGATCCTTTTTGTTGCATTCTATGTCGAAGGGCGGAGGAAGATCTGGACCATATT
TTGTGGACCTGCGAGTTTGCTAGATCGGTGTGGAGCTCTTTTTTCAACGCTTTCGGGCTTCAGGTTAGGAGGTTTAACGATTACAGAGAGATGATCCAGGAGTTCCTCCT
TCATCCGCTGTTTCGTGATAAGGGGAGGTTTTTGTGGCTTGTTGGGGTGTGTGCTTTATTGTGGAGATTATGGGGCGAGAGGAATAATAGAGTGTTTCAGAGTATTGAGA
GGGACTCTTCTGATGTGTGGACCCTCTCTAGATTCTGTGTTTCTCTTTGGGCTTCAAGTGTTGCTTACTTTCAACCATCTGTTTCTAGAGGAAAAAGGGCTGAAGAGTAC
TCTGAAGCTAAGAGACCACAAGTCCATGAGACTATTGAAAGTGGTTATATAGGTGGCTGTTATTGGACCTCAACACCTTTACCAATAGTTGCTTCAAGGCATTTGGATAC
TGGTTCTAAAACCATAGCTCCTTCATCTACTTGCGGAGTGCCGGGTGTTTTGGGCGCTGGAGACCTTTCCAGTGGGAAGGGTGGGGTTTTTTGTGGGTTGTCATCGGAAG
AGTTATGGAGGAGAAGTGCCATTAGGTTAGCTGATCTTCATGGCGTTTCAGAGAGGATGCAAAATCAACTCAGCTTGATTAAAATCAATGTTGGTGAGGGAGTGCCGGGT
GTTTTGGGTGCTGGAGACCTTTCCAGTGGGAAGGGTGGGGTTTTTTGTGGGTTGTCATCGGAAGAGTTATGGAGGAGAAGTGCCATTAGAAACTACAAGGGGAGAATGAT
TAGAATACAAGAAGACCACCTTAGGAAAAAGTTTTCCTTATCTGCGGAAGAGATAGTTATTGTTTGGATCTCAGACTCGATCGAGGACTTGCTCCTTGCTCCAGCCACCC
ACAAGTTTTTCAGAAAAGTTGATTGCAATAATGGATTCATATGGATTCAGAAAATTTCGAACAAAAGAGGAAGCTTCTTGGAAATCACGAAAGTTAACAACTCAGGAGGT
AAGCATAATTTGGTTGTGCCTGCAGGAGTAGAATATAGGGGATGGAAAAACTTCTCTGACCTTCTAAAAAAGTTCCTTAACGGAGAGGACGATTATTTGGAGGATTGTAA
GGAAGTAGAAACAGAGGTCAGGAGAAAAGGGAAAGGGAAGTCGTTTGCCGACATTGTTAAAAGATCCCCGAGTAATAGCATATCTATGAAAACTCCCCAAGGATCAAGAG
GAAAAGAGAGGAAAGTTGCTCCAGAGGTCAGGGGGAGTCTTGGTGGAACAGAATGGAAAGACATACCTGTGTGCAGAGGATGTTCTGAAGAAGTTAGAAAGATTAATTGG
GAAGAAGTCATAGTTATTACTAAAAGAGATTTTCATGATGATTGGGGAAGAATTCTTGAAGTGATGCAACAACAGTTGATGGAACCCCTTATTATTAACCCCTTTCACCC
GGACAAGGCCCTGATTAAATGCCATTCCAGAGAGCTTGCTGACCTGTTAACTAAAAATCAGGGGTGGGTAAGTTTTGGCCCGATTATCTTAAAGGTTGAGAAATGGAACA
AAAATGCTCATAGCAGAATTAGTTGTATTCCGAGCTACGGGGGGTGGGTCAAGATTCGAAACCTTCCCTTGCATTTATGGAATATGCAGACATTTAAAGCCATAGGGGAC
AACCTTGGAGGGTTTATTGAATATGAAGAAGATAATTCCCTGCTCATTGAATGCGTGGAAGTCAAAATAAAAATCAAGGAGAACTACTGGGGATTTATTCCGGCTGAAGT
TCGAATTGAAGATGGGGAAGACTTGTATATCGTGCAAATCGTCACCTTCCAAGATGGTTCTTTGCTGATAAATCAAGTGGCCGGAATTCATGGCACCTTCTCATCGACGA
CAATTCATGTCTTCCACAGAGGTCCTATGGACCCTCTCTTTTGCACAACAGATATTTGGAGAATCGAAAATGGCTTGAATTATCCAGTGGTCAAAGTCCTGCAGTCGGTA
GAAGAGCCCAGGATAAGTGGAGGACAGCTGGATGGCAATAAAACAGATTTTGAAATTTCCCGCCTAAGAAATAAAGGAAAAAACAAAGAGGAAGCTGGACCCTCAAAGAG
TGCGGGTATTTCTGTCCGAGTCGAAGCCCAACCCATTTTCGTCCAGCCGACTGGCCCCTCTGAGAGCGCGGAAGTCCCAACCCAAATCGAAGCCCAGTCTGATTTAGATC
ATGGAGATTGGATAGTGAAAAGATCTAAAAAGGAAAAGAGAAAAGGTGTCTCGTTTGCCAACGAGACCCAGATCACTTTGTTTAAACAGGGGGAGGTCCATCACACCCAA
AAAATTACCAGTCCTTTTTCGAAAGAAGGAGCGGCCGATGCTTGGAATGGAGAGCAAGGCCTCGAATCAGATTTGTCTTTCTCGAGTCCAGTAAGCTTAGACGGTGAAGG
CGGGGAAGAAAACGGTCTTAGCAGTAGGAAGGCCTCCGATAGTGATATCCCAGAAGGCTATCAGTTATGCTTCTACACTGATACCGATCAAGATATAAGCTCGATACCTC
TTAGTGTGGAGGAATCAGTAGACTTGGCGGAAGAAGCGCCTAGTACTAGAAAGGAGGATTTGATCCAGAGTCCGAAAGCTCCCCCAGAGTGCAGAACCACCGATGAAGAT
ATGCACGAAGAAAACCATTTGCTCTATCCAAATGCTTTAGCTATAATCCCTTCAGGTAAAGCTGACAGAGTTGAACCAGATACCGAAAATGGGGAGCTGCTCTTAAGTAA
AGAGCTAATTCTCACCCTTAGAAGGAATAACTTATGCATTAGGCCGATTGTTGGCTCCAGTGCTAAGAAAGGTAATTCTACCAAGAAAAAGCGCAGCAGGGAAGTTACCA
ACCTTTTTAGAACGTGGGAGAAAGAAGTAGAGCCTATCATAGAAGAGGAAAATGGCAATGAATCCAGGTTATCCGCCATTGATAGTCGCATTGTGAAGTCTGTTTGGAGC
TCTAGGCACATTGCTTGGGTGGCCTTGGATCCGGTGGCCTCGGGTGGGGGCATTATTATTCTTTGGAAGGAAGATAGAGTAGAGGTTGTTGACACTGTTTTGGGGGCTTT
TTTGATATCCATCAAATTTATTCTCCTCGACAAGTCAGAAGATTGTTGGATAGCTGAGCATCATTCATGGGACATTTCCCTTAGGAGGGGGCTCCTTGATCGAGAATTGT
CTAGTTGGTCGGCGCTGGTTGAGAAAATTCATATGATTAACTTGGTGGAAGGGCAAGATATGTTAAATTGGACCCTTGGTGGTTCAGGTAAGTTTTCGACGAAGTCTATG
TTCCAAAAGCTCTTGGAGTTGCCTCCCAGAATTCCCCCTCATATATGTAATCAAGTATTATGCCTCAAGGATGAAGAGAGGTTAGATCATTTGTTTTTGCATTGTGGCTT
TGCGGGTAAAGTGTGGAGCTTTATTGCCAAGAGTTTGGGATTTCTCATAAAGCCTGTGGATTTGGAGGTGGGGGCTATGTTGATGCAAGAACATCTAGTAAGATTCTACA
GTTGGGGTCTTATCCAACCAAGCATGGGAGACAACAGAAGTGTTGACCCTTCTCCTTTTGCTGGGATGCCTCAGTTGACCCTAGGGAGGAATAATGGAGTTTTTAGGGTT
TGTGAGAGATCTTGGGAAGAGGAGAGGTCCCTTGTTAGGCTTAATGCCTCTATTCGGGCTTCAGTCTCGAAGGATACCTATTTTCTTGAGTTTGATGGTGCCTCAAAGGG
AAATCCTGGGCTAGCAGGTGCTGGAGCTGTTTTGCGTGCTAACGATGGAACTACGGAACAAAATTCTGATGCCGATGCCCTAGCGAACTGTGCCATACATCTTCGAGTCA
AAGGATACACTAAATCCTCAGTGATGTCCGTCATGATGGCTGATGTAGACCAAGATGAAAGAATGGCAGAGATGGAAAGAAAACTCAATCTCCTAATGAAGGCAGGGAAG
GCTATCGTCCATGAGGACCAGCCACAACACTCGGCGTCAGTTGCCTCTCTATCTGTCCAACAACTACAGGACATGATCATGAATTCCATCAGGACTCAATATGGTGGACC
CACTCAAAGTTCCCTCTTGTATTCCAAACCTTATACAAAGAGGATTGACAATTTGAGATTGCCCGCTGGGTATCAGCCTCCCAAGTTCCAACAGTTTGATGGAAAGGGCA
ATCCAAAACAACACATCGCTCACTTCGTTGAAACCAGCGAGAATGCTGGAACTCGGGGAGACTTGCTTGTCAAACAGTTTGTCTGA
mRNA sequence
ATGGTAGTTCTAGGCCACGATCAAGAAGCTAGACTAGAACTCGACATGTTTATGAGTGCTCTGTTTCACTTATTTGTACTCTGGATGGAGGAAGATAAAGATACCTATTA
CGTTGTACAGAAAGGGGATGTTTTTGGATTTTACAGGAGTTGGAAGGAGTTCGAGGCTCAAGCTGGATGTTTTGTGATATTGTGGCGATTTTTCTCATCTGTCGTTGGAG
TTGCTCCTGACCATGGACAGTCACTGTCTATATGCTGTATATTTGATCCTAATGCAACGATCTACAAAGGGTGTCACTTATCTAAAGAAGCAGAGCAATACCTTGCATCA
CGTGGACTTCAGAGTGCAACTTACTCTATAAGTGCTGCCAATGTGACAAAGGATCTATTCGGAAAACTAGTTGCTTGCCCTCATGAGCAACCATCTGCCACCAGAGGAAA
AATGGCTGAAGAGAACCCCAGAGTTAGTAGACAACAAGTCCTTGAGAATACTGAATCTGGTTTTGTAGGTGCCAACTGGGTCTCAACAGATTCTCCAAAGGAAGAAATTA
TTTGGGATCACGGCTTTGAAGCCGTACCTGCTTCTTCGAGTTGTGAGGGCGTTGAAGAGGGGAGAGGGGTGCATCTCGTTAGTTGGGAGGTGGTTGGGAAACTTGTGGGT
CTTGGTGGGCTTGAGCTGGGTAACCCAAGGGCTCGGAATAGAGCCTTGTTAACTAAATGGTTGTGGTGTTTTCCCCTTGAGTCTACATCTTTAGGGCATCAGATCATCGC
TAGTAAATATGGTTTTCACCCTCATGAGTGGGTGTCGGGAGGGGTTAAGGGTGGGGGATGGAAAAGGATTAAAATTCCGAAGAAAGTTAGGTTCTTTACTTGGCAAGTTA
TCCATGGTAGAGTCAATACGATAGATAAACTTGCGAGGAAGATGACTTCTATGGTTGATCCTTTTTGTTGCATTCTATGTCGAAGGGCGGAGGAAGATCTGGACCATATT
TTGTGGACCTGCGAGTTTGCTAGATCGGTGTGGAGCTCTTTTTTCAACGCTTTCGGGCTTCAGGTTAGGAGGTTTAACGATTACAGAGAGATGATCCAGGAGTTCCTCCT
TCATCCGCTGTTTCGTGATAAGGGGAGGTTTTTGTGGCTTGTTGGGGTGTGTGCTTTATTGTGGAGATTATGGGGCGAGAGGAATAATAGAGTGTTTCAGAGTATTGAGA
GGGACTCTTCTGATGTGTGGACCCTCTCTAGATTCTGTGTTTCTCTTTGGGCTTCAAGTGTTGCTTACTTTCAACCATCTGTTTCTAGAGGAAAAAGGGCTGAAGAGTAC
TCTGAAGCTAAGAGACCACAAGTCCATGAGACTATTGAAAGTGGTTATATAGGTGGCTGTTATTGGACCTCAACACCTTTACCAATAGTTGCTTCAAGGCATTTGGATAC
TGGTTCTAAAACCATAGCTCCTTCATCTACTTGCGGAGTGCCGGGTGTTTTGGGCGCTGGAGACCTTTCCAGTGGGAAGGGTGGGGTTTTTTGTGGGTTGTCATCGGAAG
AGTTATGGAGGAGAAGTGCCATTAGGTTAGCTGATCTTCATGGCGTTTCAGAGAGGATGCAAAATCAACTCAGCTTGATTAAAATCAATGTTGGTGAGGGAGTGCCGGGT
GTTTTGGGTGCTGGAGACCTTTCCAGTGGGAAGGGTGGGGTTTTTTGTGGGTTGTCATCGGAAGAGTTATGGAGGAGAAGTGCCATTAGAAACTACAAGGGGAGAATGAT
TAGAATACAAGAAGACCACCTTAGGAAAAAGTTTTCCTTATCTGCGGAAGAGATAGTTATTGTTTGGATCTCAGACTCGATCGAGGACTTGCTCCTTGCTCCAGCCACCC
ACAAGTTTTTCAGAAAAGTTGATTGCAATAATGGATTCATATGGATTCAGAAAATTTCGAACAAAAGAGGAAGCTTCTTGGAAATCACGAAAGTTAACAACTCAGGAGGT
AAGCATAATTTGGTTGTGCCTGCAGGAGTAGAATATAGGGGATGGAAAAACTTCTCTGACCTTCTAAAAAAGTTCCTTAACGGAGAGGACGATTATTTGGAGGATTGTAA
GGAAGTAGAAACAGAGGTCAGGAGAAAAGGGAAAGGGAAGTCGTTTGCCGACATTGTTAAAAGATCCCCGAGTAATAGCATATCTATGAAAACTCCCCAAGGATCAAGAG
GAAAAGAGAGGAAAGTTGCTCCAGAGGTCAGGGGGAGTCTTGGTGGAACAGAATGGAAAGACATACCTGTGTGCAGAGGATGTTCTGAAGAAGTTAGAAAGATTAATTGG
GAAGAAGTCATAGTTATTACTAAAAGAGATTTTCATGATGATTGGGGAAGAATTCTTGAAGTGATGCAACAACAGTTGATGGAACCCCTTATTATTAACCCCTTTCACCC
GGACAAGGCCCTGATTAAATGCCATTCCAGAGAGCTTGCTGACCTGTTAACTAAAAATCAGGGGTGGGTAAGTTTTGGCCCGATTATCTTAAAGGTTGAGAAATGGAACA
AAAATGCTCATAGCAGAATTAGTTGTATTCCGAGCTACGGGGGGTGGGTCAAGATTCGAAACCTTCCCTTGCATTTATGGAATATGCAGACATTTAAAGCCATAGGGGAC
AACCTTGGAGGGTTTATTGAATATGAAGAAGATAATTCCCTGCTCATTGAATGCGTGGAAGTCAAAATAAAAATCAAGGAGAACTACTGGGGATTTATTCCGGCTGAAGT
TCGAATTGAAGATGGGGAAGACTTGTATATCGTGCAAATCGTCACCTTCCAAGATGGTTCTTTGCTGATAAATCAAGTGGCCGGAATTCATGGCACCTTCTCATCGACGA
CAATTCATGTCTTCCACAGAGGTCCTATGGACCCTCTCTTTTGCACAACAGATATTTGGAGAATCGAAAATGGCTTGAATTATCCAGTGGTCAAAGTCCTGCAGTCGGTA
GAAGAGCCCAGGATAAGTGGAGGACAGCTGGATGGCAATAAAACAGATTTTGAAATTTCCCGCCTAAGAAATAAAGGAAAAAACAAAGAGGAAGCTGGACCCTCAAAGAG
TGCGGGTATTTCTGTCCGAGTCGAAGCCCAACCCATTTTCGTCCAGCCGACTGGCCCCTCTGAGAGCGCGGAAGTCCCAACCCAAATCGAAGCCCAGTCTGATTTAGATC
ATGGAGATTGGATAGTGAAAAGATCTAAAAAGGAAAAGAGAAAAGGTGTCTCGTTTGCCAACGAGACCCAGATCACTTTGTTTAAACAGGGGGAGGTCCATCACACCCAA
AAAATTACCAGTCCTTTTTCGAAAGAAGGAGCGGCCGATGCTTGGAATGGAGAGCAAGGCCTCGAATCAGATTTGTCTTTCTCGAGTCCAGTAAGCTTAGACGGTGAAGG
CGGGGAAGAAAACGGTCTTAGCAGTAGGAAGGCCTCCGATAGTGATATCCCAGAAGGCTATCAGTTATGCTTCTACACTGATACCGATCAAGATATAAGCTCGATACCTC
TTAGTGTGGAGGAATCAGTAGACTTGGCGGAAGAAGCGCCTAGTACTAGAAAGGAGGATTTGATCCAGAGTCCGAAAGCTCCCCCAGAGTGCAGAACCACCGATGAAGAT
ATGCACGAAGAAAACCATTTGCTCTATCCAAATGCTTTAGCTATAATCCCTTCAGGTAAAGCTGACAGAGTTGAACCAGATACCGAAAATGGGGAGCTGCTCTTAAGTAA
AGAGCTAATTCTCACCCTTAGAAGGAATAACTTATGCATTAGGCCGATTGTTGGCTCCAGTGCTAAGAAAGGTAATTCTACCAAGAAAAAGCGCAGCAGGGAAGTTACCA
ACCTTTTTAGAACGTGGGAGAAAGAAGTAGAGCCTATCATAGAAGAGGAAAATGGCAATGAATCCAGGTTATCCGCCATTGATAGTCGCATTGTGAAGTCTGTTTGGAGC
TCTAGGCACATTGCTTGGGTGGCCTTGGATCCGGTGGCCTCGGGTGGGGGCATTATTATTCTTTGGAAGGAAGATAGAGTAGAGGTTGTTGACACTGTTTTGGGGGCTTT
TTTGATATCCATCAAATTTATTCTCCTCGACAAGTCAGAAGATTGTTGGATAGCTGAGCATCATTCATGGGACATTTCCCTTAGGAGGGGGCTCCTTGATCGAGAATTGT
CTAGTTGGTCGGCGCTGGTTGAGAAAATTCATATGATTAACTTGGTGGAAGGGCAAGATATGTTAAATTGGACCCTTGGTGGTTCAGGTAAGTTTTCGACGAAGTCTATG
TTCCAAAAGCTCTTGGAGTTGCCTCCCAGAATTCCCCCTCATATATGTAATCAAGTATTATGCCTCAAGGATGAAGAGAGGTTAGATCATTTGTTTTTGCATTGTGGCTT
TGCGGGTAAAGTGTGGAGCTTTATTGCCAAGAGTTTGGGATTTCTCATAAAGCCTGTGGATTTGGAGGTGGGGGCTATGTTGATGCAAGAACATCTAGTAAGATTCTACA
GTTGGGGTCTTATCCAACCAAGCATGGGAGACAACAGAAGTGTTGACCCTTCTCCTTTTGCTGGGATGCCTCAGTTGACCCTAGGGAGGAATAATGGAGTTTTTAGGGTT
TGTGAGAGATCTTGGGAAGAGGAGAGGTCCCTTGTTAGGCTTAATGCCTCTATTCGGGCTTCAGTCTCGAAGGATACCTATTTTCTTGAGTTTGATGGTGCCTCAAAGGG
AAATCCTGGGCTAGCAGGTGCTGGAGCTGTTTTGCGTGCTAACGATGGAACTACGGAACAAAATTCTGATGCCGATGCCCTAGCGAACTGTGCCATACATCTTCGAGTCA
AAGGATACACTAAATCCTCAGTGATGTCCGTCATGATGGCTGATGTAGACCAAGATGAAAGAATGGCAGAGATGGAAAGAAAACTCAATCTCCTAATGAAGGCAGGGAAG
GCTATCGTCCATGAGGACCAGCCACAACACTCGGCGTCAGTTGCCTCTCTATCTGTCCAACAACTACAGGACATGATCATGAATTCCATCAGGACTCAATATGGTGGACC
CACTCAAAGTTCCCTCTTGTATTCCAAACCTTATACAAAGAGGATTGACAATTTGAGATTGCCCGCTGGGTATCAGCCTCCCAAGTTCCAACAGTTTGATGGAAAGGGCA
ATCCAAAACAACACATCGCTCACTTCGTTGAAACCAGCGAGAATGCTGGAACTCGGGGAGACTTGCTTGTCAAACAGTTTGTCTGA
Protein sequence
MVVLGHDQEARLELDMFMSALFHLFVLWMEEDKDTYYVVQKGDVFGFYRSWKEFEAQAGCFVILWRFFSSVVGVAPDHGQSLSICCIFDPNATIYKGCHLSKEAEQYLAS
RGLQSATYSISAANVTKDLFGKLVACPHEQPSATRGKMAEENPRVSRQQVLENTESGFVGANWVSTDSPKEEIIWDHGFEAVPASSSCEGVEEGRGVHLVSWEVVGKLVG
LGGLELGNPRARNRALLTKWLWCFPLESTSLGHQIIASKYGFHPHEWVSGGVKGGGWKRIKIPKKVRFFTWQVIHGRVNTIDKLARKMTSMVDPFCCILCRRAEEDLDHI
LWTCEFARSVWSSFFNAFGLQVRRFNDYREMIQEFLLHPLFRDKGRFLWLVGVCALLWRLWGERNNRVFQSIERDSSDVWTLSRFCVSLWASSVAYFQPSVSRGKRAEEY
SEAKRPQVHETIESGYIGGCYWTSTPLPIVASRHLDTGSKTIAPSSTCGVPGVLGAGDLSSGKGGVFCGLSSEELWRRSAIRLADLHGVSERMQNQLSLIKINVGEGVPG
VLGAGDLSSGKGGVFCGLSSEELWRRSAIRNYKGRMIRIQEDHLRKKFSLSAEEIVIVWISDSIEDLLLAPATHKFFRKVDCNNGFIWIQKISNKRGSFLEITKVNNSGG
KHNLVVPAGVEYRGWKNFSDLLKKFLNGEDDYLEDCKEVETEVRRKGKGKSFADIVKRSPSNSISMKTPQGSRGKERKVAPEVRGSLGGTEWKDIPVCRGCSEEVRKINW
EEVIVITKRDFHDDWGRILEVMQQQLMEPLIINPFHPDKALIKCHSRELADLLTKNQGWVSFGPIILKVEKWNKNAHSRISCIPSYGGWVKIRNLPLHLWNMQTFKAIGD
NLGGFIEYEEDNSLLIECVEVKIKIKENYWGFIPAEVRIEDGEDLYIVQIVTFQDGSLLINQVAGIHGTFSSTTIHVFHRGPMDPLFCTTDIWRIENGLNYPVVKVLQSV
EEPRISGGQLDGNKTDFEISRLRNKGKNKEEAGPSKSAGISVRVEAQPIFVQPTGPSESAEVPTQIEAQSDLDHGDWIVKRSKKEKRKGVSFANETQITLFKQGEVHHTQ
KITSPFSKEGAADAWNGEQGLESDLSFSSPVSLDGEGGEENGLSSRKASDSDIPEGYQLCFYTDTDQDISSIPLSVEESVDLAEEAPSTRKEDLIQSPKAPPECRTTDED
MHEENHLLYPNALAIIPSGKADRVEPDTENGELLLSKELILTLRRNNLCIRPIVGSSAKKGNSTKKKRSREVTNLFRTWEKEVEPIIEEENGNESRLSAIDSRIVKSVWS
SRHIAWVALDPVASGGGIIILWKEDRVEVVDTVLGAFLISIKFILLDKSEDCWIAEHHSWDISLRRGLLDRELSSWSALVEKIHMINLVEGQDMLNWTLGGSGKFSTKSM
FQKLLELPPRIPPHICNQVLCLKDEERLDHLFLHCGFAGKVWSFIAKSLGFLIKPVDLEVGAMLMQEHLVRFYSWGLIQPSMGDNRSVDPSPFAGMPQLTLGRNNGVFRV
CERSWEEERSLVRLNASIRASVSKDTYFLEFDGASKGNPGLAGAGAVLRANDGTTEQNSDADALANCAIHLRVKGYTKSSVMSVMMADVDQDERMAEMERKLNLLMKAGK
AIVHEDQPQHSASVASLSVQQLQDMIMNSIRTQYGGPTQSSLLYSKPYTKRIDNLRLPAGYQPPKFQQFDGKGNPKQHIAHFVETSENAGTRGDLLVKQFV
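
The first and last codons of the CDS above match the first and last residues of the protein sequence under the standard genetic code. As a minimal sketch, assuming the standard code applies to this nuclear gene, a translation check could look like the following. `translate` is a hypothetical helper, not a CuGenDB tool, and it stops at the first stop codon.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 nucleotides of the CDS sequence listed above.
print(translate("ATGGTAGTTCTAGGCCACGAT"))
```

Passing the full CDS, with its line breaks, to `translate` and comparing the result with the listed protein sequence would be one way to confirm that the two records agree.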