; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS002114 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS002114
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionUnknown protein
Genome locationscaffold30:2461651..2462340
RNA-Seq ExpressionMS002114
SyntenyMS002114
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587646.1 hypothetical protein SDJN03_16211, partial [Cucurbita argyrosperma subsp. sororia]1.3e-6966.09Show/hide
Query:  MEGCIESRKRIRDDSNDSLFNFVGSKIGRVDSVDLDFDSPDMEDTPICSVSSDEKSIHSEQI-RNRADDLVLESLQAHAIQEDLLKILDEADASIDRDSA
        ME CI+SRKR+RD+SN+SLFNFVGSKI R DS + +F SPD++D P+ SVSSD +SI S+Q     ++D  L+S Q   IQEDLLKILDEAD SIDR+ A
Subjt:  MEGCIESRKRIRDDSNDSLFNFVGSKIGRVDSVDLDFDSPDMEDTPICSVSSDEKSIHSEQI-RNRADDLVLESLQAHAIQEDLLKILDEADASIDRDSA

Query:  IQDLDSVIRSFEKEIHVPAPVQSGTEVVSQPELGYLLEASDDDLGLPPAAGPSEEGEIEAVKFAAQLSGTGGMKGFLGFEDEVTSYCWLDSLSSENESNQ
        I DLDSVI SFEKEI+VP P         QPELGYLLEASDD+LGLPPAA    +GE+E V F  + SG+ GMKGFLGFEDEV +YCWL++LSSE E N+
Subjt:  IQDLDSVIRSFEKEIHVPAPVQSGTEVVSQPELGYLLEASDDDLGLPPAAGPSEEGEIEAVKFAAQLSGTGGMKGFLGFEDEVTSYCWLDSLSSENESNQ

Query:  GEEEVLALGGLFDH-TDGTVE-PPYRSETMFCL
         EEEV ALGGL DH TDG VE PPYRSETMFCL
Subjt:  GEEEVLALGGLFDH-TDGTVE-PPYRSETMFCL

KAG6589552.1 hypothetical protein SDJN03_14975, partial [Cucurbita argyrosperma subsp. sororia]8.4e-6161.54Show/hide
Query:  MEGCIESRKRIRDDSNDSLFNFVG--SKIGRVDSVDLDFDSPDMEDTPICSVSSDEKSIHSEQIRNRADDLVLESLQAHAIQEDLLKILDEADASIDRDS
        ME C++SRKR+RD+SNDSLFNF+G  SK  R+DS +LD D     D PI SVSSD KSI         D   L+S QA+ IQ+DLLKILD+ DA IDR+ 
Subjt:  MEGCIESRKRIRDDSNDSLFNFVG--SKIGRVDSVDLDFDSPDMEDTPICSVSSDEKSIHSEQIRNRADDLVLESLQAHAIQEDLLKILDEADASIDRDS

Query:  AIQDLDSVIRSFEKEIHVPAPVQSGTEVVSQPELGYLLEASDDDLGLPPAAGPSEEGEIEAVKFAAQLSGTGGMKGFLGFEDE-VTSYCWLDSLSSENES
         IQDLDSVIRSFEKEI VP P         QPELG+LLEASDD+LGLPPA    E+ E EAV FAA+  G+GGMKG LG EDE V +YCWL++L SENE 
Subjt:  AIQDLDSVIRSFEKEIHVPAPVQSGTEVVSQPELGYLLEASDDDLGLPPAAGPSEEGEIEAVKFAAQLSGTGGMKGFLGFEDE-VTSYCWLDSLSSENES

Query:  N-QGEEEVLALGGLFDHTDGTVEPPYRSETMFCL
        N + EEEV+ LGGLFDHTD   E  YRSETM CL
Subjt:  N-QGEEEVLALGGLFDHTDGTVEPPYRSETMFCL

KAG7021606.1 hypothetical protein SDJN02_15332, partial [Cucurbita argyrosperma subsp. argyrosperma]3.8e-6965.67Show/hide
Query:  MEGCIESRKRIRDDSNDSLFNFVGSKIGRVDSVDLDFDSPDMEDTPICSVSSDEKSIHSEQI-RNRADDLVLESLQAHAIQEDLLKILDEADASIDRDSA
        ME CI+SRKR+RD+SN+SLFNFVGSKI R DS + +F SPD++D P+ SVSSD +SI S+Q     ++D  L+S Q   IQEDLLKIL+EAD SIDR+ A
Subjt:  MEGCIESRKRIRDDSNDSLFNFVGSKIGRVDSVDLDFDSPDMEDTPICSVSSDEKSIHSEQI-RNRADDLVLESLQAHAIQEDLLKILDEADASIDRDSA

Query:  IQDLDSVIRSFEKEIHVPAPVQSGTEVVSQPELGYLLEASDDDLGLPPAAGPSEEGEIEAVKFAAQLSGTGGMKGFLGFEDEVTSYCWLDSLSSENESNQ
        I DLDSVI SFEKEI+VP P         QPELGYLLEASDD+LGLPPAA    +GE+E V F  + SG+ GMKGFLGFEDEV +YCWL++LSSE E N+
Subjt:  IQDLDSVIRSFEKEIHVPAPVQSGTEVVSQPELGYLLEASDDDLGLPPAAGPSEEGEIEAVKFAAQLSGTGGMKGFLGFEDEVTSYCWLDSLSSENESNQ

Query:  GEEEVLALGGLFDH-TDGTVE-PPYRSETMFCL
         EEEV ALGGL DH TDG VE PPYRSETMFCL
Subjt:  GEEEVLALGGLFDH-TDGTVE-PPYRSETMFCL

XP_023516749.1 uncharacterized protein LOC111780554 [Cucurbita pepo subsp. pepo]8.4e-6161.54Show/hide
Query:  MEGCIESRKRIRDDSNDSLFNFVG--SKIGRVDSVDLDFDSPDMEDTPICSVSSDEKSIHSEQIRNRADDLVLESLQAHAIQEDLLKILDEADASIDRDS
        ME C++SRKR+RD+SNDSLFNF+G  SK  R+DS +LD D     D P  SVSSD KSI         D   L+S QA+ IQ+DLLKILD+ DA IDR+S
Subjt:  MEGCIESRKRIRDDSNDSLFNFVG--SKIGRVDSVDLDFDSPDMEDTPICSVSSDEKSIHSEQIRNRADDLVLESLQAHAIQEDLLKILDEADASIDRDS

Query:  AIQDLDSVIRSFEKEIHVPAPVQSGTEVVSQPELGYLLEASDDDLGLPPAAGPSEEGEIEAVKFAAQLSGTGGMKGFLGFEDE-VTSYCWLDSLSSENE-
         IQDLDSVIRSFEKEI VP P         QPELG+LLEASDD+LGLPPA    E+ E EAV FAA+  G+GGMKG LG EDE V +YCWL++L SENE 
Subjt:  AIQDLDSVIRSFEKEIHVPAPVQSGTEVVSQPELGYLLEASDDDLGLPPAAGPSEEGEIEAVKFAAQLSGTGGMKGFLGFEDE-VTSYCWLDSLSSENE-

Query:  SNQGEEEVLALGGLFDHTDGTVEPPYRSETMFCL
        S + EEEV+ LGGLFDHTD   E  YRSETM CL
Subjt:  SNQGEEEVLALGGLFDHTDGTVEPPYRSETMFCL

XP_023531746.1 uncharacterized protein LOC111793909 [Cucurbita pepo subsp. pepo]1.4e-6865.24Show/hide
Query:  MEGCIESRKRIRDDSNDSLFNFVGSKIGRVDSVDLDFDSPDMEDTPICSVSSDEKSIHSEQI-RNRADDLVLESLQAHAIQEDLLKILDEADASIDRDSA
        ME CI+SRKR+RD+SN+SLFNFVGSKI R DS + +F SPD +D P+ SVSSD +SI S+Q     ++D  L+S Q   I+EDLLKILDEAD SIDR+ A
Subjt:  MEGCIESRKRIRDDSNDSLFNFVGSKIGRVDSVDLDFDSPDMEDTPICSVSSDEKSIHSEQI-RNRADDLVLESLQAHAIQEDLLKILDEADASIDRDSA

Query:  IQDLDSVIRSFEKEIHVPAPVQSGTEVVSQPELGYLLEASDDDLGLPPAAGPSEEGEIEAVKFAAQLSGTGGMKGFLGFEDEVTSYCWLDSLSSENESNQ
        I DLDSVI SFEKEI+VP P         QPELGYLLEASDD+LGLPPAA    +GE+E V F  + SG  GMKGFLGFEDEV +YCWL++LSSE E N+
Subjt:  IQDLDSVIRSFEKEIHVPAPVQSGTEVVSQPELGYLLEASDDDLGLPPAAGPSEEGEIEAVKFAAQLSGTGGMKGFLGFEDEVTSYCWLDSLSSENESNQ

Query:  GEEEVLALGGLFDH-TDGTVE-PPYRSETMFCL
         E+EV ALGGL DH TDG VE PPYRSETMFCL
Subjt:  GEEEVLALGGLFDH-TDGTVE-PPYRSETMFCL

TrEMBL top hitse value%identityAlignment
A0A0A0LS21 Uncharacterized protein3.6e-4955.46Show/hide
Query:  MEGCIESRKRIRDDSNDSLFNFVG--SKIGRVDS-VDLDFDSPDMEDTPICSVSSDEKSIHSEQIRNRADDLVLESLQAHAIQEDLLKILDEADASIDRD
        ME  ++SRKR+RDDSNDSLFN +G  SK  R+++    +FD+P         VS    S HS                 H IQEDLLKILD+ DASIDR+
Subjt:  MEGCIESRKRIRDDSNDSLFNFVG--SKIGRVDS-VDLDFDSPDMEDTPICSVSSDEKSIHSEQIRNRADDLVLESLQAHAIQEDLLKILDEADASIDRD

Query:  SAIQDLDSVIRSFEKEIHVPAPVQSGTEVVSQPELGYLLEASDDDLGLPPAAGPSEEGEIEAVKFAAQLSGTGGMKGFLGFEDEVTS-YCWLDSLSSENE
        + IQDLDSVIRSFEKEI VP  V   T  V QPELG+LLEASDD+LGLPPAAG  EE E       A+ SG+GG+KG LGFEDE+ S YCW D+L  E +
Subjt:  SAIQDLDSVIRSFEKEIHVPAPVQSGTEVVSQPELGYLLEASDDDLGLPPAAGPSEEGEIEAVKFAAQLSGTGGMKGFLGFEDEVTS-YCWLDSLSSENE

Query:  --SNQGEEEVLALGGLFDHTDGTVEPP--YRSETMFCL
          S + EEEV+ALGGLFDHTD   E P  YRSE M CL
Subjt:  --SNQGEEEVLALGGLFDHTDGTVEPP--YRSETMFCL

A0A1S3BWV1 uncharacterized protein LOC1034943338.8e-4854.39Show/hide
Query:  MEGCIESRKRIRDDSN-DSLFNFVG--SKIGRVDS-VDLDFDSPDMEDTPICSVSSDEKSIHSEQIRNRADDLVLESLQAHAIQEDLLKILDEADASIDR
        ME C+++RKR+RDDSN DSLFN +G  SK  R+++  D +FD+P        S S+D    H                  H IQEDLLKILD+ DASIDR
Subjt:  MEGCIESRKRIRDDSN-DSLFNFVG--SKIGRVDS-VDLDFDSPDMEDTPICSVSSDEKSIHSEQIRNRADDLVLESLQAHAIQEDLLKILDEADASIDR

Query:  DSAIQDLDSVIRSFEKEIHVPAPVQSGTEVVSQPELGYLLEASDDDLGLPPAAGPSEEGEIEAVKFAAQLSGTGGMKGFLGFEDEVTS-YCWLDSLSSEN
        ++AIQDLDSVIRSFEKEI VP P       V QPELG+LLEASDD+LGLPPA    E+ EIE  +F    SG+GG+KG LGFEDE+ S YCW D+L  E 
Subjt:  DSAIQDLDSVIRSFEKEIHVPAPVQSGTEVVSQPELGYLLEASDDDLGLPPAAGPSEEGEIEAVKFAAQLSGTGGMKGFLGFEDEVTS-YCWLDSLSSEN

Query:  ES--NQGEEEVLALGGLFDHTDGTVEPP--YRSETMFCL
        +    + EEEV+ALGGLFDHTD   E P  YRSE M CL
Subjt:  ES--NQGEEEVLALGGLFDHTDGTVEPP--YRSETMFCL

A0A5A7UZW3 Uncharacterized protein8.8e-4854.39Show/hide
Query:  MEGCIESRKRIRDDSN-DSLFNFVG--SKIGRVDS-VDLDFDSPDMEDTPICSVSSDEKSIHSEQIRNRADDLVLESLQAHAIQEDLLKILDEADASIDR
        ME C+++RKR+RDDSN DSLFN +G  SK  R+++  D +FD+P        S S+D    H                  H IQEDLLKILD+ DASIDR
Subjt:  MEGCIESRKRIRDDSN-DSLFNFVG--SKIGRVDS-VDLDFDSPDMEDTPICSVSSDEKSIHSEQIRNRADDLVLESLQAHAIQEDLLKILDEADASIDR

Query:  DSAIQDLDSVIRSFEKEIHVPAPVQSGTEVVSQPELGYLLEASDDDLGLPPAAGPSEEGEIEAVKFAAQLSGTGGMKGFLGFEDEVTS-YCWLDSLSSEN
        ++AIQDLDSVIRSFEKEI VP P       V QPELG+LLEASDD+LGLPPA    E+ EIE  +F    SG+GG+KG LGFEDE+ S YCW D+L  E 
Subjt:  DSAIQDLDSVIRSFEKEIHVPAPVQSGTEVVSQPELGYLLEASDDDLGLPPAAGPSEEGEIEAVKFAAQLSGTGGMKGFLGFEDEVTS-YCWLDSLSSEN

Query:  ES--NQGEEEVLALGGLFDHTDGTVEPP--YRSETMFCL
        +    + EEEV+ALGGLFDHTD   E P  YRSE M CL
Subjt:  ES--NQGEEEVLALGGLFDHTDGTVEPP--YRSETMFCL

A0A5D3DXI3 Uncharacterized protein3.0e-4854.81Show/hide
Query:  MEGCIESRKRIRDDSN-DSLFNFVG--SKIGRVDS-VDLDFDSPDMEDTPICSVSSDEKSIHSEQIRNRADDLVLESLQAHAIQEDLLKILDEADASIDR
        ME C+++RKR+RDDSN DSLFN +G  SK  R+++  D +FD+P        S S+D    H                  H IQEDLLKILD+ DASIDR
Subjt:  MEGCIESRKRIRDDSN-DSLFNFVG--SKIGRVDS-VDLDFDSPDMEDTPICSVSSDEKSIHSEQIRNRADDLVLESLQAHAIQEDLLKILDEADASIDR

Query:  DSAIQDLDSVIRSFEKEIHVPAPVQSGTEVVSQPELGYLLEASDDDLGLPPAAGPSEEGEIEAVKFAAQLSGTGGMKGFLGFEDEVTS-YCWLDSLSSEN
        ++AIQDLDSVIRSFEKEI VP P       V QPELG+LLEASDD+LGLPPA    E+ EIE  +F    SG+GG+KG LGFEDE+ S YCW D+L  E 
Subjt:  DSAIQDLDSVIRSFEKEIHVPAPVQSGTEVVSQPELGYLLEASDDDLGLPPAAGPSEEGEIEAVKFAAQLSGTGGMKGFLGFEDEVTS-YCWLDSLSSEN

Query:  E--SNQGEEEVLALGGLFDHTDGTVEPP--YRSETMFCL
        +  S + EEEV+ALGGLFDHTD   E P  YRSE M CL
Subjt:  E--SNQGEEEVLALGGLFDHTDGTVEPP--YRSETMFCL

A0A6J1E5H5 uncharacterized protein LOC1114296742.6e-6061.54Show/hide
Query:  MEGCIESRKRIRDDSNDSLFNFVG--SKIGRVDSVDLDFDSPDMEDTPICSVSSDEKSIHSEQIRNRADDLVLESLQAHAIQEDLLKILDEADASIDRDS
        ME C++SRKR+RD+SNDSLFNF+G  SK  R+DS +LD D     D PI SVSSD KSI              +S QA+ IQ+DLLKILD+ DA IDR+S
Subjt:  MEGCIESRKRIRDDSNDSLFNFVG--SKIGRVDSVDLDFDSPDMEDTPICSVSSDEKSIHSEQIRNRADDLVLESLQAHAIQEDLLKILDEADASIDRDS

Query:  AIQDLDSVIRSFEKEIHVPAPVQSGTEVVSQPELGYLLEASDDDLGLPPAAGPSEEGEIEAVKFAAQLSGTGGMKGFLGFEDE-VTSYCWLDSLSSENES
         IQDLDSVIRSFEKEI VP P  SG     QPELG+LLEASDD+LGLPPA    E+ E EAV FAA+  G+G MKG LG EDE V +YCWL++L SENE 
Subjt:  AIQDLDSVIRSFEKEIHVPAPVQSGTEVVSQPELGYLLEASDDDLGLPPAAGPSEEGEIEAVKFAAQLSGTGGMKGFLGFEDE-VTSYCWLDSLSSENES

Query:  N-QGEEEVLALGGLFDHTDGTVEPPYRSETMFCL
        N + EEEV+ LGGLFDHTD   E  YRSETM CL
Subjt:  N-QGEEEVLALGGLFDHTDGTVEPPYRSETMFCL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13360.1 unknown protein2.3e-1634.03Show/hide
Query:  SSDEKSIHSEQIRNRADDLVLESLQAHAIQEDLLKILDEADASIDRDSAIQDLDSVIRSFEKEIHVPAPVQ---SGTEVVSQPELGYLLEASDDDLGLPP
        S+++K +  E      D+ VL+S +   +++DL  +LD++D     +   QDLDSV++SFE E+          S T   +QP+LGYLLEASDD+LGLPP
Subjt:  SSDEKSIHSEQIRNRADDLVLESLQAHAIQEDLLKILDEADASIDRDSAIQDLDSVIRSFEKEIHVPAPVQ---SGTEVVSQPELGYLLEASDDDLGLPP

Query:  AAGPS------EEGEIEAVKFAAQLSG-TGGMKGFLGFEDEVTSYCWLDSLSSENESNQGEEEVLALGGLFDHTDGTVEP----PYRSETM
            S      EE   E V    + S  + G+    GFED V++Y  LD  S   +      + +A+ GLF+ +D   +      +RSE++
Subjt:  AAGPS------EEGEIEAVKFAAQLSG-TGGMKGFLGFEDEVTSYCWLDSLSSENESNQGEEEVLALGGLFDHTDGTVEP----PYRSETM

AT1G13360.2 unknown protein5.7e-1535.06Show/hide
Query:  SSDEKSIHSEQIRNRADDLVLESLQAHAIQEDLLKILDEADASIDRDSAIQDLDSVIRSFEKEIHVPAPVQ---SGTEVVSQPELGYLLEASDDDLGLPP
        S+++K +  E      D+ VL+S +   +++DL  +LD++D     +   QDLDSV++SFE E+          S T   +QP+LGYLLEASDD+LGLPP
Subjt:  SSDEKSIHSEQIRNRADDLVLESLQAHAIQEDLLKILDEADASIDRDSAIQDLDSVIRSFEKEIHVPAPVQ---SGTEVVSQPELGYLLEASDDDLGLPP

Query:  AAGPS------EEGEIEAVKFAAQLSG-TGGMKGFLGFEDEVTSYCWLDSLSSENESNQGEEEVLALGGLFDHT
            S      EE   E V    + S  + G+    GFED V++Y  LD  S   +      + +A+ G F +T
Subjt:  AAGPS------EEGEIEAVKFAAQLSG-TGGMKGFLGFEDEVTSYCWLDSLSSENESNQGEEEVLALGGLFDHT

AT1G13360.3 unknown protein1.7e-1437.58Show/hide
Query:  SSDEKSIHSEQIRNRADDLVLESLQAHAIQEDLLKILDEADASIDRDSAIQDLDSVIRSFEKEIHVPAPVQ---SGTEVVSQPELGYLLEASDDDLGLPP
        S+++K +  E      D+ VL+S +   +++DL  +LD++D     +   QDLDSV++SFE E+          S T   +QP+LGYLLEASDD+LGLPP
Subjt:  SSDEKSIHSEQIRNRADDLVLESLQAHAIQEDLLKILDEADASIDRDSAIQDLDSVIRSFEKEIHVPAPVQ---SGTEVVSQPELGYLLEASDDDLGLPP

Query:  AAGPS------EEGEIEAVKFAAQLSG-TGGMKGFLGFEDEVTSYCWLD
            S      EE   E V    + S  + G+    GFED V++Y  LD
Subjt:  AAGPS------EEGEIEAVKFAAQLSG-TGGMKGFLGFEDEVTSYCWLD

AT3G25870.1 unknown protein7.7e-1233.33Show/hide
Query:  DEKSIHSEQIRNR--ADDLVLESLQAHAIQEDLLKILDEADASIDRDSAIQDLDSVIRSFEKEIHVPAPVQSGTEVVSQPELGYLLEASDDDLGLPPAAG
        +E+   ++ + N+   D L L+S     +++DL       D+ +D  S  QDLDSV++SFE E+       S  E  +QP+LGYL EASDD+LGLPP   
Subjt:  DEKSIHSEQIRNR--ADDLVLESLQAHAIQEDLLKILDEADASIDRDSAIQDLDSVIRSFEKEIHVPAPVQSGTEVVSQPELGYLLEASDDDLGLPPAAG

Query:  P--------SEEGEIEAVKFAAQLSGTGGMKGFLGFEDEVTSYCWLDSLSSENESNQGEEEVLALGGLFDHTDGTVE
        P         EE   E V+ ++  S  G +    GFED VT +   D               L   GLF++ DG ++
Subjt:  P--------SEEGEIEAVKFAAQLSGTGGMKGFLGFEDEVTSYCWLDSLSSENESNQGEEEVLALGGLFDHTDGTVE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGCTGCATTGAGAGCAGGAAGCGGATTCGCGACGACTCCAATGATTCTTTATTCAATTTTGTCGGATCGAAGATCGGCCGAGTCGATTCGGTTGATTTGGACTT
CGATTCGCCGGATATGGAAGATACGCCGATCTGTTCCGTTTCGTCGGATGAAAAATCGATCCACTCTGAACAGATTCGGAATCGCGCTGATGATTTAGTCCTGGAATCGC
TTCAGGCACATGCAATTCAGGAAGACCTGCTGAAGATTCTCGACGAAGCCGACGCTTCGATTGATCGTGACTCTGCGATTCAAGATCTCGACTCGGTGATCAGAAGCTTC
GAGAAGGAAATTCATGTCCCGGCTCCTGTTCAATCAGGCACTGAGGTTGTGTCGCAGCCTGAACTCGGATACCTTCTAGAAGCGTCGGACGATGACCTAGGTCTTCCGCC
GGCCGCCGGCCCCAGCGAGGAAGGGGAGATCGAGGCTGTTAAATTTGCCGCGCAATTGTCAGGTACCGGCGGTATGAAGGGGTTTTTAGGGTTTGAGGATGAAGTTACGA
GTTACTGTTGGCTGGATAGCTTGAGCAGTGAGAATGAATCGAATCAGGGGGAAGAAGAGGTGTTGGCGTTGGGTGGGTTGTTCGATCATACGGACGGCACGGTGGAGCCG
CCATATCGATCGGAGACCATGTTCTGTTTG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGGCTGCATTGAGAGCAGGAAGCGGATTCGCGACGACTCCAATGATTCTTTATTCAATTTTGTCGGATCGAAGATCGGCCGAGTCGATTCGGTTGATTTGGACTT
CGATTCGCCGGATATGGAAGATACGCCGATCTGTTCCGTTTCGTCGGATGAAAAATCGATCCACTCTGAACAGATTCGGAATCGCGCTGATGATTTAGTCCTGGAATCGC
TTCAGGCACATGCAATTCAGGAAGACCTGCTGAAGATTCTCGACGAAGCCGACGCTTCGATTGATCGTGACTCTGCGATTCAAGATCTCGACTCGGTGATCAGAAGCTTC
GAGAAGGAAATTCATGTCCCGGCTCCTGTTCAATCAGGCACTGAGGTTGTGTCGCAGCCTGAACTCGGATACCTTCTAGAAGCGTCGGACGATGACCTAGGTCTTCCGCC
GGCCGCCGGCCCCAGCGAGGAAGGGGAGATCGAGGCTGTTAAATTTGCCGCGCAATTGTCAGGTACCGGCGGTATGAAGGGGTTTTTAGGGTTTGAGGATGAAGTTACGA
GTTACTGTTGGCTGGATAGCTTGAGCAGTGAGAATGAATCGAATCAGGGGGAAGAAGAGGTGTTGGCGTTGGGTGGGTTGTTCGATCATACGGACGGCACGGTGGAGCCG
CCATATCGATCGGAGACCATGTTCTGTTTG
Protein sequenceShow/hide protein sequence
MEGCIESRKRIRDDSNDSLFNFVGSKIGRVDSVDLDFDSPDMEDTPICSVSSDEKSIHSEQIRNRADDLVLESLQAHAIQEDLLKILDEADASIDRDSAIQDLDSVIRSF
EKEIHVPAPVQSGTEVVSQPELGYLLEASDDDLGLPPAAGPSEEGEIEAVKFAAQLSGTGGMKGFLGFEDEVTSYCWLDSLSSENESNQGEEEVLALGGLFDHTDGTVEP
PYRSETMFCL