; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g31930 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g31930
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionCACTA en-spm transposon protein
Genome locationchr6:24006703..24010439
RNA-Seq ExpressionMoc06g31930
SyntenyMoc06g31930
Gene Ontology termsNA
InterPro domainsIPR004252 - Probable transposase, Ptta/En/Spm, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040145.1 transposon protein, putative, CACTA, En/Spm sub-class [Cucumis melo var. makuwa]3.5e-8033.03Show/hide
Query:  DGDAILQQVRTLNFPDFGASTSKTLSVVERKVAHWYVLNNCAEIESYRDQHLRVIRLPRDNVSALYERHSTQFPDWFKAQLLTLLEQGELDDDLYSLALG
        D D   Q VR +     G S  +TLS  E+++ HWYVLNNC +IESYR +HL +I    ++V  L+ RH  +FP+WF+  + +L E+GE  DDLYS+ALG
Subjt:  DGDAILQQVRTLNFPDFGASTSKTLSVVERKVAHWYVLNNCAEIESYRDQHLRVIRLPRDNVSALYERHSTQFPDWFKAQLLTLLEQGELDDDLYSLALG

Query:  PSLTARSYSGCIVNGLRFHNTIRDSRRTTQNSEVMVCG-GEGDDMNFYGVLDDVWELDYANRRRVVLFKCKWFNTDSKKNKIRVDL--------------
        P    R+YSGC VNGLRFH+  RD+R TTQN+ +M  G  + D+ N+YGVL +V +L+Y   RRV LFKC WF+TD KKNK   DL              
Subjt:  PSLTARSYSGCIVNGLRFHNTIRDSRRTTQNSEVMVCG-GEGDDMNFYGVLDDVWELDYANRRRVVLFKCKWFNTDSKKNKIRVDL--------------

Query:  ---------------------------------------EQEDVGNDDRISLLEVGPRIEVNESTQDSSLRRDGVDPVVVEGQEQSNRSIHTRDVDDFIN
                                               E E+V  DD+  LLE    I V+ES  D    R  V+P VV+ +E  N+   +R  DDFIN
Subjt:  ---------------------------------------EQEDVGNDDRISLLEVGPRIEVNESTQDSSLRRDGVDPVVVEGQEQSNRSIHTRDVDDFIN

Query:  DESDNVLDASTDNSIDDVDDEYNQAGQSQEHIQGGDHPPANSPVPVLPPVRT-----DVRGYTRLIDLENQVRANNTKVRIRCDPTVNKPTCPMATTFSN
        DE++ +  + +D          +Q   +   ++  + P  +   P    + T      VRGY R I+L+  V  +  K++I  +    KP    A   + 
Subjt:  DESDNVLDASTDNSIDDVDDEYNQAGQSQEHIQGGDHPPANSPVPVLPPVRT-----DVRGYTRLIDLENQVRANNTKVRIRCDPTVNKPTCPMATTFSN

Query:  AIGVHTRECVSLKRSKWSEVEQSEKMAILTHLKGHFDFDESDPIMMSFIEHEMKQTHKAYRSKLHKYYLSQHTPEIARNTPHERLLARPNDWNMLCDRWE
         IG   R  + L    W  V    +  ++  L+ HF+FD ++ ++  ++E +M+   + +R+ LHKYY        AR  P  R+     DWNM+CDRWE
Subjt:  AIGVHTRECVSLKRSKWSEVEQSEKMAILTHLKGHFDFDESDPIMMSFIEHEMKQTHKAYRSKLHKYYLSQHTPEIARNTPHERLLARPNDWNMLCDRWE

Query:  SDEWKTNKRNRAKMSFNHRARPKAFAIIAKEKKEEDEGDNFSEIDLFRETRYSESKGWV-EGAETAYLDMMRIREESMQDGGEPMLDPEVLETVLGYRSG
        +D WK                               +G +  EI++F ET + E +GW+ + A+ AYL+M RI  ES + G + +   +  E VLG RS 
Subjt:  SDEWKTNKRNRAKMSFNHRARPKAFAIIAKEKKEEDEGDNFSEIDLFRETRYSESKGWV-EGAETAYLDMMRIREESMQDGGEPMLDPEVLETVLGYRSG

Query:  YIKNAGWDPKPKSRRGSASSVHSSEREQELSQEISTLREEFEDRLTRKNNEFEE
           N      P+S     S+V SS RE+E ++   T  +E  ++LT +  ++E+
Subjt:  YIKNAGWDPKPKSRRGSASSVHSSEREQELSQEISTLREEFEDRLTRKNNEFEE

KAA0045598.1 uncharacterized protein E6C27_scaffold243G001040 [Cucumis melo var. makuwa]2.6e-10436.11Show/hide
Query:  MGHRRYLPQNHSWRGSRLHDGKLERRPPPVLMDGDAILQQVRTLNFP-------------------------DF----------GASTSKTLSVVERKVA
        MGHRRYL  +H WR SR HDGK ERRPPPV+M+GD ILQQV ++NFP                         DF          G S  +TLS  E+++ 
Subjt:  MGHRRYLPQNHSWRGSRLHDGKLERRPPPVLMDGDAILQQVRTLNFP-------------------------DF----------GASTSKTLSVVERKVA

Query:  HWYVLNNCAEIESYRDQHLRVIRLPRDNVSALYERHSTQFPDWFKAQLLTLLEQGELDDDLYSLALGPSLTARSYSGCIVNGLRFHNTIRDSRRTTQNSE
        HWYVLNNC +IESYR +HL +I    ++V  L+ RH  +FP+WF+  + +L E+GE  DDLYS+ALGP    R+YSGC VNGLRFH+  RD+RRTTQNS 
Subjt:  HWYVLNNCAEIESYRDQHLRVIRLPRDNVSALYERHSTQFPDWFKAQLLTLLEQGELDDDLYSLALGPSLTARSYSGCIVNGLRFHNTIRDSRRTTQNSE

Query:  VMVCG-GEGDDMNFYGVLDDVWELDYANRRRVVLFKCKWFNTDSKKNKIRVDL-----------------------------------------------
        +M  G  + D+ N+YGVL +V +L+Y   RRV LFKC WF+TD KKNK R DL                                               
Subjt:  VMVCG-GEGDDMNFYGVLDDVWELDYANRRRVVLFKCKWFNTDSKKNKIRVDL-----------------------------------------------

Query:  ------EQEDVGNDDRISLLEVGPRIEVNESTQDSSLRRDGVDPVVVEGQEQSNRSIHTRDVDDFINDESDNVLDASTDNSIDDVDDEYNQAGQSQEHIQ
              E E+V  DDR  LLE    I V+ES  D    R  V+P VV+ +E  N+   +R  DDFINDE++ +  + +D                     
Subjt:  ------EQEDVGNDDRISLLEVGPRIEVNESTQDSSLRRDGVDPVVVEGQEQSNRSIHTRDVDDFINDESDNVLDASTDNSIDDVDDEYNQAGQSQEHIQ

Query:  GGDHPPANSPVPVLPPVRTDVRGYTRLIDLENQVRANNTKVRIRCDPTVNKPTCPMATTFSNAIGVHTRECVSLKRSKWSEVEQSEKMAILTHLKGHFDF
                            VRGY R I+L+  V  +  K++I  +    KP    A   +  IG   R  + L    W  V    +  ++  L+ HF+F
Subjt:  GGDHPPANSPVPVLPPVRTDVRGYTRLIDLENQVRANNTKVRIRCDPTVNKPTCPMATTFSNAIGVHTRECVSLKRSKWSEVEQSEKMAILTHLKGHFDF

Query:  DESDPIMMSFIEHEMKQTHKAYRSKLHKYYLSQHTPEIARNTPHERLLARPNDWNMLCDRWESDEWK----TNKRNRAKMSFNHRARPKAFAIIAKEKKE
        D ++ ++  ++E +M+ T + +R+ LHKYY        AR  P  R+     DWNM+CDRWE+D WK    TNKR+ + + FNH    K+F  +  E K+
Subjt:  DESDPIMMSFIEHEMKQTHKAYRSKLHKYYLSQHTPEIARNTPHERLLARPNDWNMLCDRWESDEWK----TNKRNRAKMSFNHRARPKAFAIIAKEKKE

Query:  EDEGDNFSEIDLFRETRYSESKGW-VEGAETAYLDMMRIREESMQDGGEPMLDPEVLETVLGYRSGYIKNAGWDPKPKSRRGSASSVHSSEREQELSQEI
        + +G +  EI++F ET + E +GW  + A+ AYL+M RI  ES + G + +   +  E VLG RS    N      P+S     S+V SS RE+E   E+
Subjt:  EDEGDNFSEIDLFRETRYSESKGW-VEGAETAYLDMMRIREESMQDGGEPMLDPEVLETVLGYRSGYIKNAGWDPKPKSRRGSASSVHSSEREQELSQEI

Query:  STLREEFEDRLTRKNNEFEE
        + L+E   ++LT +  ++E+
Subjt:  STLREEFEDRLTRKNNEFEE

KAA0056494.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]2.3e-8433.58Show/hide
Query:  MGHRRYLPQNHSWRGSRLHDGKLERRPPPVLMDGDAILQQVRTLNFP--------DF----------GASTSKTLSVVERKVAHWYVLNNCAEIESYRDQ
        MGHRRYLP+NH W  SRLHDGK+E + PPV+M+G  IL+Q+  L FP        DF          GAS+ + +S  E+++ HWY+LNN  EI  YR  
Subjt:  MGHRRYLPQNHSWRGSRLHDGKLERRPPPVLMDGDAILQQVRTLNFP--------DF----------GASTSKTLSVVERKVAHWYVLNNCAEIESYRDQ

Query:  HLRVIRLPRDNVSALYERHSTQFPDWFKAQLLTLLEQGELDDDLYSLALGPSLTARSYSGCIVNGLRFHNTIRDSRRTTQNSEVMVCG-----GEGDDMN
                         +H   FP+WF+AQ+L L E   L +D +SLA+GPS   R Y+GCIV+G+RFH    DSRRTTQNS +M+ G     G GD+ +
Subjt:  HLRVIRLPRDNVSALYERHSTQFPDWFKAQLLTLLEQGELDDDLYSLALGPSLTARSYSGCIVNGLRFHNTIRDSRRTTQNSEVMVCG-----GEGDDMN

Query:  FYGVLDDVWELDYANRRRVVLFKCKWFNTD-SKKNKIRVDLEQEDVGN------DDRISLL--------------EVGPRIEVNESTQDSSLRRDGVDPV
        FYGVLD+V  + Y   R V LFKC+W++TD +K  +  V++  + +        ++ + L               E+    +V++  +D +L R+ VDP 
Subjt:  FYGVLDDVWELDYANRRRVVLFKCKWFNTD-SKKNKIRVDLEQEDVGN------DDRISLL--------------EVGPRIEVNESTQDSSLRRDGVDPV

Query:  VVEGQEQSNRSIHTRDVDDFINDESDNVLDASTDNS--------------IDDVDDEYNQAGQSQEHIQGGDHPPANSPVPVLPPVRTDVRGYTRLIDLE
        +VE      R I     D FI++  +++  A+  +S              ++  DD  N AG S     G +   ++S  P  P +R  V+  +RL++LE
Subjt:  VVEGQEQSNRSIHTRDVDDFINDESDNVLDASTDNS--------------IDDVDDEYNQAGQSQEHIQGGDHPPANSPVPVLPPVRTDVRGYTRLIDLE

Query:  NQVRANNTKVRIRCDPTVNKPTCPMATTFSNAIGVHTRECVSLKRSKWSEVEQSEKMAILTHLKGHFDFDESDPIMMSFIEHEMKQTHKAYRSKLHKYYL
          V A N ++ +   P V KP  P A  F  AIGV  R+   +   KW++V +     +   L+  F  D +D  M  F+EH+M  T K +    H+++ 
Subjt:  NQVRANNTKVRIRCDPTVNKPTCPMATTFSNAIGVHTRECVSLKRSKWSEVEQSEKMAILTHLKGHFDFDESDPIMMSFIEHEMKQTHKAYRSKLHKYYL

Query:  SQHTPEIARNTPHERLLARPNDWNMLCDRWES----DEWKTNKRNRAKMSFNHRARPKAFAIIAKEKKEEDEGDNFSEIDLFRETRYSESKGWVEGAETA
            PE AR  P   L+ R  DW+ L D + S    ++ +TNK  R K  +NH +  K+F +  + +  E +G+  + ++LF+ET         + AE A
Subjt:  SQHTPEIARNTPHERLLARPNDWNMLCDRWES----DEWKTNKRNRAKMSFNHRARPKAFAIIAKEKKEEDEGDNFSEIDLFRETRYSESKGWVEGAETA

Query:  YLDMMRIREESMQDGGEPMLDPEVLETVLGYRSGYIKNAGWDPKPKSRR------GSASSVHSSEREQELSQEISTLREEFE
        +  M+ ++ +    G +P+ + E+ + VLG R GY K  GW  KPK+RR       S S   S+E+E EL  +++   E  E
Subjt:  YLDMMRIREESMQDGGEPMLDPEVLETVLGYRSGYIKNAGWDPKPKSRR------GSASSVHSSEREQELSQEISTLREEFE

TYJ95853.1 uncharacterized protein E5676_scaffold110G001450 [Cucumis melo var. makuwa]3.7e-6633.02Show/hide
Query:  VAHWYVLNNCAEIESYRDQHLRVIRLPRDNVSALYERHSTQFPDWFKAQLLTLLEQGELDDDLYSLALGPSLTARSYSGCIVNGLRFHNTIRDSRRTTQN
        + HWYVLNNC++IESYR +HLR+I    ++   L+ RH  +FPDWF+A + +L E+GE  DDLYS+AL P    R+YSGC VNGLRFH   RD+ RTTQN
Subjt:  VAHWYVLNNCAEIESYRDQHLRVIRLPRDNVSALYERHSTQFPDWFKAQLLTLLEQGELDDDLYSLALGPSLTARSYSGCIVNGLRFHNTIRDSRRTTQN

Query:  SEVMVCG-GEGDDMNFYGVLDDVWELDYANRRRVVLFKCKWFNTDSKKNKIRVDL---------------------------------------------
        S +M  G  + D+ N+YGVL +V +L+Y   RRV LFKC WF+TD KKNK R DL                                             
Subjt:  SEVMVCG-GEGDDMNFYGVLDDVWELDYANRRRVVLFKCKWFNTDSKKNKIRVDL---------------------------------------------

Query:  --------EQEDVGNDDRISLLEVGPRIEVNESTQDSSLRRDGVDPVVVEGQEQSNRSIHTRDVDDFINDESDNVLDASTDNSIDDVDDEYNQAGQSQEH
                E E++  DDR+ LLE    IEV+ES  D    R  V+P VV+  E  N+     D DDFINDE +  L++S       V    +QA  +   
Subjt:  --------EQEDVGNDDRISLLEVGPRIEVNESTQDSSLRRDGVDPVVVEGQEQSNRSIHTRDVDDFINDESDNVLDASTDNSIDDVDDEYNQAGQSQEH

Query:  IQGGDHPPANSPVPVLPPVRTDVRGYTRLIDLENQVRANNTKVRIRCDPTVNKPTCPMATTFSNAIGVHTRECVSLKRSKWSEVEQSEKMAILTHLKGHF
        ++  + PP +   P          GY R I+L+  V  +  KV+I       KP    A T +  IG   R  + L       V    K  ++  +    
Subjt:  IQGGDHPPANSPVPVLPPVRTDVRGYTRLIDLENQVRANNTKVRIRCDPTVNKPTCPMATTFSNAIGVHTRECVSLKRSKWSEVEQSEKMAILTHLKGHF

Query:  DFDESDPIMMSFIEHEMKQTHKAYRSKLHKYYLSQHTPEIARNTPHERLLARPNDWNMLCDRWESDEWK----TNKRNRAKMSFNHRARPKAFAIIAKEK
                           TH+                                DWNM+CDRWE+D WK    TNK++R+ + FNH  R K+F  +  E 
Subjt:  DFDESDPIMMSFIEHEMKQTHKAYRSKLHKYYLSQHTPEIARNTPHERLLARPNDWNMLCDRWESDEWK----TNKRNRAKMSFNHRARPKAFAIIAKEK

Query:  KEEDEGDNFSEIDLFRETRYSESKGWV-EGAETAYLDMMRIREESMQDGGEPMLDPEVLETVLGYRSGYIKNAGWDPKPKSRRGSASSVHSSEREQELSQ
        K++ +G +  E+++F ET + E +GW+ + A+ AY    RI  ES + G + +   +  E VLG RS    N      PKS     S++ SS RE+E ++
Subjt:  KEEDEGDNFSEIDLFRETRYSESKGWV-EGAETAYLDMMRIREESMQDGGEPMLDPEVLETVLGYRSGYIKNAGWDPKPKSRRGSASSVHSSEREQELSQ

Query:  EISTLREEFEDRLTRKNNEFEECLTQKDKE
           T  +E  ++LT +  ++E+  T   K+
Subjt:  EISTLREEFEDRLTRKNNEFEECLTQKDKE

TYK29029.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]1.9e-7831.24Show/hide
Query:  MGHRRYLPQNHSWRGSRLHDGKLERRPPPVLMDGDAILQQVRTLNFP-----------------------------------------------------
        MGHRRYLP+NH W  SRLHDGK+E + PPV+M+G  IL+Q+  L FP                                                     
Subjt:  MGHRRYLPQNHSWRGSRLHDGKLERRPPPVLMDGDAILQQVRTLNFP-----------------------------------------------------

Query:  ------DF----------GASTSKTLSVVERKVAHWYVLNNCAEIESYRDQHLRVIRLPRDNVSALYERHSTQFPDWFKAQLLTLLEQGELDDDLYSLAL
              DF          GAS+ + +S  E+++ HWY+LNN  EI  YR                   +H   FP+WF+AQ+L L E   L +D +SLA+
Subjt:  ------DF----------GASTSKTLSVVERKVAHWYVLNNCAEIESYRDQHLRVIRLPRDNVSALYERHSTQFPDWFKAQLLTLLEQGELDDDLYSLAL

Query:  GPSLTARSYSGCIVNGLRFHNTIRDSRRTTQNSEVMVCG-----GEGDDMNFYGVLDDVWELDYANRRRVVLFKCKWFNTD-SKKNKIRVDLEQEDVGN-
        GPS   R Y+GCIV+G+RFH    DSRRTTQNS +M+ G     G GD+ +FYGVLD+V  + Y   R V LFKC+W++TD +K  +  V++  + +   
Subjt:  GPSLTARSYSGCIVNGLRFHNTIRDSRRTTQNSEVMVCG-----GEGDDMNFYGVLDDVWELDYANRRRVVLFKCKWFNTD-SKKNKIRVDLEQEDVGN-

Query:  -----DDRISLL--------------EVGPRIEVNESTQDSSLRRDGVDPVVVEGQEQSNRSIHTRDVDDFINDESDNVLDASTDNS-------------
             ++ + L               E+    +V++  +D +L R+ VDP +VE      R I     D FI++  +++  A+  +S             
Subjt:  -----DDRISLL--------------EVGPRIEVNESTQDSSLRRDGVDPVVVEGQEQSNRSIHTRDVDDFINDESDNVLDASTDNS-------------

Query:  -IDDVDDEYNQAGQSQEHIQGGDHPPANSPVPVLPPVRTDVRGYTRLIDLENQVRANNTKVRIRCDPTVNKPTCPMATTFSNAIGVHTRECVSLKRSKWS
         ++  DD  N AG S     G +   ++S  P  P +R  V+  +RL++LE  V A N ++ +   P V KP  P A  F  AIGV  R+   +   KW+
Subjt:  -IDDVDDEYNQAGQSQEHIQGGDHPPANSPVPVLPPVRTDVRGYTRLIDLENQVRANNTKVRIRCDPTVNKPTCPMATTFSNAIGVHTRECVSLKRSKWS

Query:  EVEQSEKMAILTHLKGHFDFDESDPIMMSFIEHEMKQTHKAYRSKLHKYYLSQHTPEIARNTPHERLLARPNDWNMLCDRWES----DEWKTNKRNRAKM
        +V +     +   L+  F  D +D  M  F+EH+M  T K +    H+++     PE AR  P   L+ R  DW+ L D + S    ++ +TNK  R K 
Subjt:  EVEQSEKMAILTHLKGHFDFDESDPIMMSFIEHEMKQTHKAYRSKLHKYYLSQHTPEIARNTPHERLLARPNDWNMLCDRWES----DEWKTNKRNRAKM

Query:  SFNHRARPKAFAIIAKEKKEEDEGDNFSEIDLFRETRYSESKGWVEGAETAYLDMMRIREESMQDGGEPMLDPEVLETVLGYRSGYIKNAGWDPKPKSRR
         +NH +  K+F +  + +  E +G+  + ++LF+ET         + AE A+  M+ ++ +    G +P+ + E+ + VLG R GY K  GW  KPK+RR
Subjt:  SFNHRARPKAFAIIAKEKKEEDEGDNFSEIDLFRETRYSESKGWVEGAETAYLDMMRIREESMQDGGEPMLDPEVLETVLGYRSGYIKNAGWDPKPKSRR

Query:  ------GSASSVHSSEREQELSQEISTLREEFE
               S S   S+E+E EL  +++   E  E
Subjt:  ------GSASSVHSSEREQELSQEISTLREEFE

TrEMBL top hitse value%identityAlignment
A0A5A7TFG0 Transposon protein, putative, CACTA, En/Spm sub-class1.7e-8033.03Show/hide
Query:  DGDAILQQVRTLNFPDFGASTSKTLSVVERKVAHWYVLNNCAEIESYRDQHLRVIRLPRDNVSALYERHSTQFPDWFKAQLLTLLEQGELDDDLYSLALG
        D D   Q VR +     G S  +TLS  E+++ HWYVLNNC +IESYR +HL +I    ++V  L+ RH  +FP+WF+  + +L E+GE  DDLYS+ALG
Subjt:  DGDAILQQVRTLNFPDFGASTSKTLSVVERKVAHWYVLNNCAEIESYRDQHLRVIRLPRDNVSALYERHSTQFPDWFKAQLLTLLEQGELDDDLYSLALG

Query:  PSLTARSYSGCIVNGLRFHNTIRDSRRTTQNSEVMVCG-GEGDDMNFYGVLDDVWELDYANRRRVVLFKCKWFNTDSKKNKIRVDL--------------
        P    R+YSGC VNGLRFH+  RD+R TTQN+ +M  G  + D+ N+YGVL +V +L+Y   RRV LFKC WF+TD KKNK   DL              
Subjt:  PSLTARSYSGCIVNGLRFHNTIRDSRRTTQNSEVMVCG-GEGDDMNFYGVLDDVWELDYANRRRVVLFKCKWFNTDSKKNKIRVDL--------------

Query:  ---------------------------------------EQEDVGNDDRISLLEVGPRIEVNESTQDSSLRRDGVDPVVVEGQEQSNRSIHTRDVDDFIN
                                               E E+V  DD+  LLE    I V+ES  D    R  V+P VV+ +E  N+   +R  DDFIN
Subjt:  ---------------------------------------EQEDVGNDDRISLLEVGPRIEVNESTQDSSLRRDGVDPVVVEGQEQSNRSIHTRDVDDFIN

Query:  DESDNVLDASTDNSIDDVDDEYNQAGQSQEHIQGGDHPPANSPVPVLPPVRT-----DVRGYTRLIDLENQVRANNTKVRIRCDPTVNKPTCPMATTFSN
        DE++ +  + +D          +Q   +   ++  + P  +   P    + T      VRGY R I+L+  V  +  K++I  +    KP    A   + 
Subjt:  DESDNVLDASTDNSIDDVDDEYNQAGQSQEHIQGGDHPPANSPVPVLPPVRT-----DVRGYTRLIDLENQVRANNTKVRIRCDPTVNKPTCPMATTFSN

Query:  AIGVHTRECVSLKRSKWSEVEQSEKMAILTHLKGHFDFDESDPIMMSFIEHEMKQTHKAYRSKLHKYYLSQHTPEIARNTPHERLLARPNDWNMLCDRWE
         IG   R  + L    W  V    +  ++  L+ HF+FD ++ ++  ++E +M+   + +R+ LHKYY        AR  P  R+     DWNM+CDRWE
Subjt:  AIGVHTRECVSLKRSKWSEVEQSEKMAILTHLKGHFDFDESDPIMMSFIEHEMKQTHKAYRSKLHKYYLSQHTPEIARNTPHERLLARPNDWNMLCDRWE

Query:  SDEWKTNKRNRAKMSFNHRARPKAFAIIAKEKKEEDEGDNFSEIDLFRETRYSESKGWV-EGAETAYLDMMRIREESMQDGGEPMLDPEVLETVLGYRSG
        +D WK                               +G +  EI++F ET + E +GW+ + A+ AYL+M RI  ES + G + +   +  E VLG RS 
Subjt:  SDEWKTNKRNRAKMSFNHRARPKAFAIIAKEKKEEDEGDNFSEIDLFRETRYSESKGWV-EGAETAYLDMMRIREESMQDGGEPMLDPEVLETVLGYRSG

Query:  YIKNAGWDPKPKSRRGSASSVHSSEREQELSQEISTLREEFEDRLTRKNNEFEE
           N      P+S     S+V SS RE+E ++   T  +E  ++LT +  ++E+
Subjt:  YIKNAGWDPKPKSRRGSASSVHSSEREQELSQEISTLREEFEDRLTRKNNEFEE

A0A5A7TRX4 DUF4216 domain-containing protein1.3e-10436.11Show/hide
Query:  MGHRRYLPQNHSWRGSRLHDGKLERRPPPVLMDGDAILQQVRTLNFP-------------------------DF----------GASTSKTLSVVERKVA
        MGHRRYL  +H WR SR HDGK ERRPPPV+M+GD ILQQV ++NFP                         DF          G S  +TLS  E+++ 
Subjt:  MGHRRYLPQNHSWRGSRLHDGKLERRPPPVLMDGDAILQQVRTLNFP-------------------------DF----------GASTSKTLSVVERKVA

Query:  HWYVLNNCAEIESYRDQHLRVIRLPRDNVSALYERHSTQFPDWFKAQLLTLLEQGELDDDLYSLALGPSLTARSYSGCIVNGLRFHNTIRDSRRTTQNSE
        HWYVLNNC +IESYR +HL +I    ++V  L+ RH  +FP+WF+  + +L E+GE  DDLYS+ALGP    R+YSGC VNGLRFH+  RD+RRTTQNS 
Subjt:  HWYVLNNCAEIESYRDQHLRVIRLPRDNVSALYERHSTQFPDWFKAQLLTLLEQGELDDDLYSLALGPSLTARSYSGCIVNGLRFHNTIRDSRRTTQNSE

Query:  VMVCG-GEGDDMNFYGVLDDVWELDYANRRRVVLFKCKWFNTDSKKNKIRVDL-----------------------------------------------
        +M  G  + D+ N+YGVL +V +L+Y   RRV LFKC WF+TD KKNK R DL                                               
Subjt:  VMVCG-GEGDDMNFYGVLDDVWELDYANRRRVVLFKCKWFNTDSKKNKIRVDL-----------------------------------------------

Query:  ------EQEDVGNDDRISLLEVGPRIEVNESTQDSSLRRDGVDPVVVEGQEQSNRSIHTRDVDDFINDESDNVLDASTDNSIDDVDDEYNQAGQSQEHIQ
              E E+V  DDR  LLE    I V+ES  D    R  V+P VV+ +E  N+   +R  DDFINDE++ +  + +D                     
Subjt:  ------EQEDVGNDDRISLLEVGPRIEVNESTQDSSLRRDGVDPVVVEGQEQSNRSIHTRDVDDFINDESDNVLDASTDNSIDDVDDEYNQAGQSQEHIQ

Query:  GGDHPPANSPVPVLPPVRTDVRGYTRLIDLENQVRANNTKVRIRCDPTVNKPTCPMATTFSNAIGVHTRECVSLKRSKWSEVEQSEKMAILTHLKGHFDF
                            VRGY R I+L+  V  +  K++I  +    KP    A   +  IG   R  + L    W  V    +  ++  L+ HF+F
Subjt:  GGDHPPANSPVPVLPPVRTDVRGYTRLIDLENQVRANNTKVRIRCDPTVNKPTCPMATTFSNAIGVHTRECVSLKRSKWSEVEQSEKMAILTHLKGHFDF

Query:  DESDPIMMSFIEHEMKQTHKAYRSKLHKYYLSQHTPEIARNTPHERLLARPNDWNMLCDRWESDEWK----TNKRNRAKMSFNHRARPKAFAIIAKEKKE
        D ++ ++  ++E +M+ T + +R+ LHKYY        AR  P  R+     DWNM+CDRWE+D WK    TNKR+ + + FNH    K+F  +  E K+
Subjt:  DESDPIMMSFIEHEMKQTHKAYRSKLHKYYLSQHTPEIARNTPHERLLARPNDWNMLCDRWESDEWK----TNKRNRAKMSFNHRARPKAFAIIAKEKKE

Query:  EDEGDNFSEIDLFRETRYSESKGW-VEGAETAYLDMMRIREESMQDGGEPMLDPEVLETVLGYRSGYIKNAGWDPKPKSRRGSASSVHSSEREQELSQEI
        + +G +  EI++F ET + E +GW  + A+ AYL+M RI  ES + G + +   +  E VLG RS    N      P+S     S+V SS RE+E   E+
Subjt:  EDEGDNFSEIDLFRETRYSESKGW-VEGAETAYLDMMRIREESMQDGGEPMLDPEVLETVLGYRSGYIKNAGWDPKPKSRRGSASSVHSSEREQELSQEI

Query:  STLREEFEDRLTRKNNEFEE
        + L+E   ++LT +  ++E+
Subjt:  STLREEFEDRLTRKNNEFEE

A0A5A7UMS8 CACTA en-spm transposon protein1.1e-8433.58Show/hide
Query:  MGHRRYLPQNHSWRGSRLHDGKLERRPPPVLMDGDAILQQVRTLNFP--------DF----------GASTSKTLSVVERKVAHWYVLNNCAEIESYRDQ
        MGHRRYLP+NH W  SRLHDGK+E + PPV+M+G  IL+Q+  L FP        DF          GAS+ + +S  E+++ HWY+LNN  EI  YR  
Subjt:  MGHRRYLPQNHSWRGSRLHDGKLERRPPPVLMDGDAILQQVRTLNFP--------DF----------GASTSKTLSVVERKVAHWYVLNNCAEIESYRDQ

Query:  HLRVIRLPRDNVSALYERHSTQFPDWFKAQLLTLLEQGELDDDLYSLALGPSLTARSYSGCIVNGLRFHNTIRDSRRTTQNSEVMVCG-----GEGDDMN
                         +H   FP+WF+AQ+L L E   L +D +SLA+GPS   R Y+GCIV+G+RFH    DSRRTTQNS +M+ G     G GD+ +
Subjt:  HLRVIRLPRDNVSALYERHSTQFPDWFKAQLLTLLEQGELDDDLYSLALGPSLTARSYSGCIVNGLRFHNTIRDSRRTTQNSEVMVCG-----GEGDDMN

Query:  FYGVLDDVWELDYANRRRVVLFKCKWFNTD-SKKNKIRVDLEQEDVGN------DDRISLL--------------EVGPRIEVNESTQDSSLRRDGVDPV
        FYGVLD+V  + Y   R V LFKC+W++TD +K  +  V++  + +        ++ + L               E+    +V++  +D +L R+ VDP 
Subjt:  FYGVLDDVWELDYANRRRVVLFKCKWFNTD-SKKNKIRVDLEQEDVGN------DDRISLL--------------EVGPRIEVNESTQDSSLRRDGVDPV

Query:  VVEGQEQSNRSIHTRDVDDFINDESDNVLDASTDNS--------------IDDVDDEYNQAGQSQEHIQGGDHPPANSPVPVLPPVRTDVRGYTRLIDLE
        +VE      R I     D FI++  +++  A+  +S              ++  DD  N AG S     G +   ++S  P  P +R  V+  +RL++LE
Subjt:  VVEGQEQSNRSIHTRDVDDFINDESDNVLDASTDNS--------------IDDVDDEYNQAGQSQEHIQGGDHPPANSPVPVLPPVRTDVRGYTRLIDLE

Query:  NQVRANNTKVRIRCDPTVNKPTCPMATTFSNAIGVHTRECVSLKRSKWSEVEQSEKMAILTHLKGHFDFDESDPIMMSFIEHEMKQTHKAYRSKLHKYYL
          V A N ++ +   P V KP  P A  F  AIGV  R+   +   KW++V +     +   L+  F  D +D  M  F+EH+M  T K +    H+++ 
Subjt:  NQVRANNTKVRIRCDPTVNKPTCPMATTFSNAIGVHTRECVSLKRSKWSEVEQSEKMAILTHLKGHFDFDESDPIMMSFIEHEMKQTHKAYRSKLHKYYL

Query:  SQHTPEIARNTPHERLLARPNDWNMLCDRWES----DEWKTNKRNRAKMSFNHRARPKAFAIIAKEKKEEDEGDNFSEIDLFRETRYSESKGWVEGAETA
            PE AR  P   L+ R  DW+ L D + S    ++ +TNK  R K  +NH +  K+F +  + +  E +G+  + ++LF+ET         + AE A
Subjt:  SQHTPEIARNTPHERLLARPNDWNMLCDRWES----DEWKTNKRNRAKMSFNHRARPKAFAIIAKEKKEEDEGDNFSEIDLFRETRYSESKGWVEGAETA

Query:  YLDMMRIREESMQDGGEPMLDPEVLETVLGYRSGYIKNAGWDPKPKSRR------GSASSVHSSEREQELSQEISTLREEFE
        +  M+ ++ +    G +P+ + E+ + VLG R GY K  GW  KPK+RR       S S   S+E+E EL  +++   E  E
Subjt:  YLDMMRIREESMQDGGEPMLDPEVLETVLGYRSGYIKNAGWDPKPKSRR------GSASSVHSSEREQELSQEISTLREEFE

A0A5D3B974 DUF4216 domain-containing protein1.8e-6633.02Show/hide
Query:  VAHWYVLNNCAEIESYRDQHLRVIRLPRDNVSALYERHSTQFPDWFKAQLLTLLEQGELDDDLYSLALGPSLTARSYSGCIVNGLRFHNTIRDSRRTTQN
        + HWYVLNNC++IESYR +HLR+I    ++   L+ RH  +FPDWF+A + +L E+GE  DDLYS+AL P    R+YSGC VNGLRFH   RD+ RTTQN
Subjt:  VAHWYVLNNCAEIESYRDQHLRVIRLPRDNVSALYERHSTQFPDWFKAQLLTLLEQGELDDDLYSLALGPSLTARSYSGCIVNGLRFHNTIRDSRRTTQN

Query:  SEVMVCG-GEGDDMNFYGVLDDVWELDYANRRRVVLFKCKWFNTDSKKNKIRVDL---------------------------------------------
        S +M  G  + D+ N+YGVL +V +L+Y   RRV LFKC WF+TD KKNK R DL                                             
Subjt:  SEVMVCG-GEGDDMNFYGVLDDVWELDYANRRRVVLFKCKWFNTDSKKNKIRVDL---------------------------------------------

Query:  --------EQEDVGNDDRISLLEVGPRIEVNESTQDSSLRRDGVDPVVVEGQEQSNRSIHTRDVDDFINDESDNVLDASTDNSIDDVDDEYNQAGQSQEH
                E E++  DDR+ LLE    IEV+ES  D    R  V+P VV+  E  N+     D DDFINDE +  L++S       V    +QA  +   
Subjt:  --------EQEDVGNDDRISLLEVGPRIEVNESTQDSSLRRDGVDPVVVEGQEQSNRSIHTRDVDDFINDESDNVLDASTDNSIDDVDDEYNQAGQSQEH

Query:  IQGGDHPPANSPVPVLPPVRTDVRGYTRLIDLENQVRANNTKVRIRCDPTVNKPTCPMATTFSNAIGVHTRECVSLKRSKWSEVEQSEKMAILTHLKGHF
        ++  + PP +   P          GY R I+L+  V  +  KV+I       KP    A T +  IG   R  + L       V    K  ++  +    
Subjt:  IQGGDHPPANSPVPVLPPVRTDVRGYTRLIDLENQVRANNTKVRIRCDPTVNKPTCPMATTFSNAIGVHTRECVSLKRSKWSEVEQSEKMAILTHLKGHF

Query:  DFDESDPIMMSFIEHEMKQTHKAYRSKLHKYYLSQHTPEIARNTPHERLLARPNDWNMLCDRWESDEWK----TNKRNRAKMSFNHRARPKAFAIIAKEK
                           TH+                                DWNM+CDRWE+D WK    TNK++R+ + FNH  R K+F  +  E 
Subjt:  DFDESDPIMMSFIEHEMKQTHKAYRSKLHKYYLSQHTPEIARNTPHERLLARPNDWNMLCDRWESDEWK----TNKRNRAKMSFNHRARPKAFAIIAKEK

Query:  KEEDEGDNFSEIDLFRETRYSESKGWV-EGAETAYLDMMRIREESMQDGGEPMLDPEVLETVLGYRSGYIKNAGWDPKPKSRRGSASSVHSSEREQELSQ
        K++ +G +  E+++F ET + E +GW+ + A+ AY    RI  ES + G + +   +  E VLG RS    N      PKS     S++ SS RE+E ++
Subjt:  KEEDEGDNFSEIDLFRETRYSESKGWV-EGAETAYLDMMRIREESMQDGGEPMLDPEVLETVLGYRSGYIKNAGWDPKPKSRRGSASSVHSSEREQELSQ

Query:  EISTLREEFEDRLTRKNNEFEECLTQKDKE
           T  +E  ++LT +  ++E+  T   K+
Subjt:  EISTLREEFEDRLTRKNNEFEECLTQKDKE

A0A5D3E035 CACTA en-spm transposon protein9.2e-7931.24Show/hide
Query:  MGHRRYLPQNHSWRGSRLHDGKLERRPPPVLMDGDAILQQVRTLNFP-----------------------------------------------------
        MGHRRYLP+NH W  SRLHDGK+E + PPV+M+G  IL+Q+  L FP                                                     
Subjt:  MGHRRYLPQNHSWRGSRLHDGKLERRPPPVLMDGDAILQQVRTLNFP-----------------------------------------------------

Query:  ------DF----------GASTSKTLSVVERKVAHWYVLNNCAEIESYRDQHLRVIRLPRDNVSALYERHSTQFPDWFKAQLLTLLEQGELDDDLYSLAL
              DF          GAS+ + +S  E+++ HWY+LNN  EI  YR                   +H   FP+WF+AQ+L L E   L +D +SLA+
Subjt:  ------DF----------GASTSKTLSVVERKVAHWYVLNNCAEIESYRDQHLRVIRLPRDNVSALYERHSTQFPDWFKAQLLTLLEQGELDDDLYSLAL

Query:  GPSLTARSYSGCIVNGLRFHNTIRDSRRTTQNSEVMVCG-----GEGDDMNFYGVLDDVWELDYANRRRVVLFKCKWFNTD-SKKNKIRVDLEQEDVGN-
        GPS   R Y+GCIV+G+RFH    DSRRTTQNS +M+ G     G GD+ +FYGVLD+V  + Y   R V LFKC+W++TD +K  +  V++  + +   
Subjt:  GPSLTARSYSGCIVNGLRFHNTIRDSRRTTQNSEVMVCG-----GEGDDMNFYGVLDDVWELDYANRRRVVLFKCKWFNTD-SKKNKIRVDLEQEDVGN-

Query:  -----DDRISLL--------------EVGPRIEVNESTQDSSLRRDGVDPVVVEGQEQSNRSIHTRDVDDFINDESDNVLDASTDNS-------------
             ++ + L               E+    +V++  +D +L R+ VDP +VE      R I     D FI++  +++  A+  +S             
Subjt:  -----DDRISLL--------------EVGPRIEVNESTQDSSLRRDGVDPVVVEGQEQSNRSIHTRDVDDFINDESDNVLDASTDNS-------------

Query:  -IDDVDDEYNQAGQSQEHIQGGDHPPANSPVPVLPPVRTDVRGYTRLIDLENQVRANNTKVRIRCDPTVNKPTCPMATTFSNAIGVHTRECVSLKRSKWS
         ++  DD  N AG S     G +   ++S  P  P +R  V+  +RL++LE  V A N ++ +   P V KP  P A  F  AIGV  R+   +   KW+
Subjt:  -IDDVDDEYNQAGQSQEHIQGGDHPPANSPVPVLPPVRTDVRGYTRLIDLENQVRANNTKVRIRCDPTVNKPTCPMATTFSNAIGVHTRECVSLKRSKWS

Query:  EVEQSEKMAILTHLKGHFDFDESDPIMMSFIEHEMKQTHKAYRSKLHKYYLSQHTPEIARNTPHERLLARPNDWNMLCDRWES----DEWKTNKRNRAKM
        +V +     +   L+  F  D +D  M  F+EH+M  T K +    H+++     PE AR  P   L+ R  DW+ L D + S    ++ +TNK  R K 
Subjt:  EVEQSEKMAILTHLKGHFDFDESDPIMMSFIEHEMKQTHKAYRSKLHKYYLSQHTPEIARNTPHERLLARPNDWNMLCDRWES----DEWKTNKRNRAKM

Query:  SFNHRARPKAFAIIAKEKKEEDEGDNFSEIDLFRETRYSESKGWVEGAETAYLDMMRIREESMQDGGEPMLDPEVLETVLGYRSGYIKNAGWDPKPKSRR
         +NH +  K+F +  + +  E +G+  + ++LF+ET         + AE A+  M+ ++ +    G +P+ + E+ + VLG R GY K  GW  KPK+RR
Subjt:  SFNHRARPKAFAIIAKEKKEEDEGDNFSEIDLFRETRYSESKGWVEGAETAYLDMMRIREESMQDGGEPMLDPEVLETVLGYRSGYIKNAGWDPKPKSRR

Query:  ------GSASSVHSSEREQELSQEISTLREEFE
               S S   S+E+E EL  +++   E  E
Subjt:  ------GSASSVHSSEREQELSQEISTLREEFE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGACATCGACGTTATCTTCCTCAAAATCATAGTTGGCGTGGGAGTAGACTGCATGATGGAAAGCTAGAACGACGACCCCCACCAGTGTTAATGGATGGAGAT
GCGATTCTTCAACAAGTACGTACGCTAAATTTTCCAGATTTTGGGGCATCAACTTCAAAGACATTATCTGTCGTTGAAAGAAAAGTTGCACATTGGTATGTACTG
AACAACTGTGCAGAGATAGAGTCGTATCGAGATCAACATTTAAGAGTAATTCGACTTCCAAGAGACAATGTTTCTGCCTTATATGAACGACACAGTACTCAATTT
CCTGATTGGTTCAAAGCACAGCTTTTGACATTACTGGAACAAGGCGAATTAGATGACGATCTTTATTCACTTGCATTAGGGCCCTCACTAACTGCACGCTCATAT
AGCGGTTGTATTGTAAACGGATTGCGTTTTCATAATACCATTCGAGATAGTCGTCGAACGACACAAAATAGCGAGGTCATGGTATGTGGAGGAGAAGGGGATGAC
ATGAATTTTTACGGTGTCTTGGATGATGTTTGGGAGTTAGATTATGCTAATAGGAGACGTGTCGTACTTTTTAAGTGCAAGTGGTTTAATACAGACTCTAAGAAG
AATAAAATTCGAGTTGACTTAGAACAAGAAGACGTTGGAAATGATGATCGTATTTCATTGTTAGAAGTAGGACCCAGGATAGAAGTAAATGAATCCACTCAAGAT
AGTTCATTGCGTCGAGATGGTGTTGATCCTGTTGTCGTTGAGGGTCAAGAACAATCAAATAGAAGCATACATACGCGCGATGTGGATGATTTTATAAATGACGAA
TCAGACAATGTACTGGACGCGTCAACTGACAATAGTATTGATGATGTTGATGATGAATACAACCAAGCAGGTCAGTCACAAGAACACATACAGGGTGGTGATCAT
CCACCAGCGAACTCGCCTGTTCCAGTTCTTCCTCCAGTTAGAACAGACGTGCGAGGATACACCCGATTGATAGATCTAGAAAATCAAGTCCGTGCAAATAATACA
AAAGTAAGAATCAGATGTGACCCCACAGTGAATAAACCAACTTGTCCTATGGCCACCACGTTTTCTAACGCAATCGGGGTACACACGCGCGAATGTGTTAGCCTA
AAACGTAGTAAATGGAGCGAGGTCGAACAAAGTGAGAAGATGGCAATTCTAACCCATCTCAAGGGTCATTTCGATTTCGACGAGTCAGACCCAATTATGATGTCT
TTCATCGAGCATGAAATGAAACAGACACATAAGGCGTATAGATCAAAATTGCACAAATACTACCTAAGTCAGCACACGCCTGAGATTGCTCGCAACACTCCGCAC
GAGCGGTTGTTGGCAAGACCAAATGATTGGAATATGTTATGCGATAGATGGGAGTCAGATGAATGGAAGACCAACAAACGCAACAGGGCGAAGATGTCATTCAAC
CATCGTGCGAGGCCGAAGGCGTTTGCTATAATTGCCAAGGAGAAGAAAGAGGAAGATGAAGGTGATAACTTCAGTGAAATCGATTTGTTCAGAGAGACGAGATAC
TCCGAATCTAAAGGTTGGGTTGAAGGCGCGGAGACAGCTTATCTAGATATGATGCGCATTAGAGAAGAATCTATGCAGGACGGGGGCGAACCCATGCTAGACCCA
GAAGTACTGGAAACAGTTCTCGGTTATCGATCAGGATACATTAAAAATGCTGGTTGGGACCCGAAACCTAAATCTCGCAGAGGTTCTGCTTCAAGTGTACATTCT
TCAGAGAGGGAACAAGAGTTGTCTCAAGAAATAAGTACACTAAGAGAAGAATTTGAGGACCGTCTCACACGAAAGAACAACGAGTTTGAGGAGTGCCTTACACAA
AAAGACAAAGAGTTTGCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGACATCGACGTTATCTTCCTCAAAATCATAGTTGGCGTGGGAGTAGACTGCATGATGGAAAGCTAGAACGACGACCCCCACCAGTGTTAATGGATGGAGAT
GCGATTCTTCAACAAGTACGTACGCTAAATTTTCCAGATTTTGGGGCATCAACTTCAAAGACATTATCTGTCGTTGAAAGAAAAGTTGCACATTGGTATGTACTG
AACAACTGTGCAGAGATAGAGTCGTATCGAGATCAACATTTAAGAGTAATTCGACTTCCAAGAGACAATGTTTCTGCCTTATATGAACGACACAGTACTCAATTT
CCTGATTGGTTCAAAGCACAGCTTTTGACATTACTGGAACAAGGCGAATTAGATGACGATCTTTATTCACTTGCATTAGGGCCCTCACTAACTGCACGCTCATAT
AGCGGTTGTATTGTAAACGGATTGCGTTTTCATAATACCATTCGAGATAGTCGTCGAACGACACAAAATAGCGAGGTCATGGTATGTGGAGGAGAAGGGGATGAC
ATGAATTTTTACGGTGTCTTGGATGATGTTTGGGAGTTAGATTATGCTAATAGGAGACGTGTCGTACTTTTTAAGTGCAAGTGGTTTAATACAGACTCTAAGAAG
AATAAAATTCGAGTTGACTTAGAACAAGAAGACGTTGGAAATGATGATCGTATTTCATTGTTAGAAGTAGGACCCAGGATAGAAGTAAATGAATCCACTCAAGAT
AGTTCATTGCGTCGAGATGGTGTTGATCCTGTTGTCGTTGAGGGTCAAGAACAATCAAATAGAAGCATACATACGCGCGATGTGGATGATTTTATAAATGACGAA
TCAGACAATGTACTGGACGCGTCAACTGACAATAGTATTGATGATGTTGATGATGAATACAACCAAGCAGGTCAGTCACAAGAACACATACAGGGTGGTGATCAT
CCACCAGCGAACTCGCCTGTTCCAGTTCTTCCTCCAGTTAGAACAGACGTGCGAGGATACACCCGATTGATAGATCTAGAAAATCAAGTCCGTGCAAATAATACA
AAAGTAAGAATCAGATGTGACCCCACAGTGAATAAACCAACTTGTCCTATGGCCACCACGTTTTCTAACGCAATCGGGGTACACACGCGCGAATGTGTTAGCCTA
AAACGTAGTAAATGGAGCGAGGTCGAACAAAGTGAGAAGATGGCAATTCTAACCCATCTCAAGGGTCATTTCGATTTCGACGAGTCAGACCCAATTATGATGTCT
TTCATCGAGCATGAAATGAAACAGACACATAAGGCGTATAGATCAAAATTGCACAAATACTACCTAAGTCAGCACACGCCTGAGATTGCTCGCAACACTCCGCAC
GAGCGGTTGTTGGCAAGACCAAATGATTGGAATATGTTATGCGATAGATGGGAGTCAGATGAATGGAAGACCAACAAACGCAACAGGGCGAAGATGTCATTCAAC
CATCGTGCGAGGCCGAAGGCGTTTGCTATAATTGCCAAGGAGAAGAAAGAGGAAGATGAAGGTGATAACTTCAGTGAAATCGATTTGTTCAGAGAGACGAGATAC
TCCGAATCTAAAGGTTGGGTTGAAGGCGCGGAGACAGCTTATCTAGATATGATGCGCATTAGAGAAGAATCTATGCAGGACGGGGGCGAACCCATGCTAGACCCA
GAAGTACTGGAAACAGTTCTCGGTTATCGATCAGGATACATTAAAAATGCTGGTTGGGACCCGAAACCTAAATCTCGCAGAGGTTCTGCTTCAAGTGTACATTCT
TCAGAGAGGGAACAAGAGTTGTCTCAAGAAATAAGTACACTAAGAGAAGAATTTGAGGACCGTCTCACACGAAAGAACAACGAGTTTGAGGAGTGCCTTACACAA
AAAGACAAAGAGTTTGCGTAG
Protein sequenceShow/hide protein sequence
MGHRRYLPQNHSWRGSRLHDGKLERRPPPVLMDGDAILQQVRTLNFPDFGASTSKTLSVVERKVAHWYVLNNCAEIESYRDQHLRVIRLPRDNVSALYERHSTQF
PDWFKAQLLTLLEQGELDDDLYSLALGPSLTARSYSGCIVNGLRFHNTIRDSRRTTQNSEVMVCGGEGDDMNFYGVLDDVWELDYANRRRVVLFKCKWFNTDSKK
NKIRVDLEQEDVGNDDRISLLEVGPRIEVNESTQDSSLRRDGVDPVVVEGQEQSNRSIHTRDVDDFINDESDNVLDASTDNSIDDVDDEYNQAGQSQEHIQGGDH
PPANSPVPVLPPVRTDVRGYTRLIDLENQVRANNTKVRIRCDPTVNKPTCPMATTFSNAIGVHTRECVSLKRSKWSEVEQSEKMAILTHLKGHFDFDESDPIMMS
FIEHEMKQTHKAYRSKLHKYYLSQHTPEIARNTPHERLLARPNDWNMLCDRWESDEWKTNKRNRAKMSFNHRARPKAFAIIAKEKKEEDEGDNFSEIDLFRETRY
SESKGWVEGAETAYLDMMRIREESMQDGGEPMLDPEVLETVLGYRSGYIKNAGWDPKPKSRRGSASSVHSSEREQELSQEISTLREEFEDRLTRKNNEFEECLTQ
KDKEFA