CuGenDBv2

Sgr001803 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr001803
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Protein of unknown function (DUF789)
Genome location: tig00001138:4870..8041
RNA-Seq Expression: Sgr001803
Synteny: Sgr001803
Gene Ontology terms: NA
InterPro domains: IPR008507 - Protein of unknown function DUF789
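
The genome location field above uses a `contig:start..end` form. The following is a minimal Python sketch for splitting such a string into its parts. The helper name `parse_location` and the `Location` type are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a parsed genome location."""
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings such as "tig00001138:4870..8041".
_LOCATION_RE = re.compile(r"^(?P<contig>[^:\s]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Split a 'contig:start..end' string into contig, start, and end."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match["contig"], int(match["start"]), int(match["end"]))


loc = parse_location("tig00001138:4870..8041")
print(loc.contig, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the gene spans 3172 positions on contig tig00001138.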


Homology
GenBank top hits (E-value, % identity, alignment)
KAG7016733.1 hypothetical protein SDJN02_21843, partial [Cucurbita argyrosperma subsp. argyrosperma] | E-value: 1.2e-211 | Identity: 88.83%
Query:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLRSARSDVSAGQSPAVKPSTVSSVFREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP
        MLGAG+QFGR CGDDRFYNPTKARR HQGRQNDQLR A+SDVSA Q P +KP+TVSSV RE E G GC E  PKS+A+SAFEPVVSSLSNLERFLQSI P
Subjt:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLRSARSDVSAGQSPAVKPSTVSSVFREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP

Query:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD
        SVPAQY SKTTM+GWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD
Subjt:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD

Query:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW
        SEPERALKYMG QLNH HLSSEL RRM+R+SLRDQLIGLQEDCSSDEAES NSQGQLLFEHLERDLPYSREPLADK+SDL+F+FPELKTLRSCDLLPSSW
Subjt:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW

Query:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ
        FSVAWYPIYRIPTGPTLRDLDACFLTFH L TPMGGARS+QGPVVTYP++IDGIP+MSLPVFGLASYKFRGSLWTPNGG+EWQLAN LLQ AEDWLR   
Subjt:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ

Query:  VNHPDFIFFSGR
        VNHPDFIFFS R
Subjt:  VNHPDFIFFSGR

TYK00266.1 DUF789 domain-containing protein [Cucumis melo var. makuwa] | E-value: 1.2e-211 | Identity: 89.56%
Query:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLRSARSDVSAGQSPAVKPSTVSSVFREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP
        MLGAG+QFGR CGDDRFYNPTKARRVHQGRQ DQLR A+SDVSAGQS  VKPS VSSV RE E G GC+E  PKS+A+S FEPVVSSLSNLERFLQSI P
Subjt:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLRSARSDVSAGQSPAVKPSTVSSVFREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP

Query:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD
        SVPAQYLSKTTM+GWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSD
Subjt:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD

Query:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW
        SEPERALKYMGKQLNH HLSSEL RRMD IS RDQLIGLQEDCSSDEAES NSQGQLLFEHLERDLPYSREPLADKISDLAFQFP+LKTLRSCDLLPSSW
Subjt:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW

Query:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ
        FSVAWYPIYRIPTGPTL+DLDACFLTFH L +P GGARS+Q PVVTYP+EIDGIPKMSLPVFGLASYKFRGSLWTPNGG+EWQLAN LL  AEDWLR  Q
Subjt:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ

Query:  VNHPDFIFFSGR
        VNHPDFIFFS R
Subjt:  VNHPDFIFFSGR

XP_022152290.1 uncharacterized protein LOC111020043 [Momordica charantia] | E-value: 1.3e-221 | Identity: 92.49%
Query:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLRSARSDVSAGQSPAVKPSTVSSVFREPENGAGCDEQP-PKSMAVSAFEPVVSSLSNLERFLQSIT
        MLGAG+QF R CGDDRFYNPTKARR HQGRQND+LR A+SDVSAGQSPAVKPSTVS+V RE ENG+GC+EQ  PKS+ VSAFEPVVSSLSNLERFLQSIT
Subjt:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLRSARSDVSAGQSPAVKPSTVSSVFREPENGAGCDEQP-PKSMAVSAFEPVVSSLSNLERFLQSIT

Query:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSS
        PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWE+FKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSS KSRQPGEDSDSDFRDSSSDGSS
Subjt:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSS

Query:  DSEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS
        DSEPERALKYMGK LNH H+S ELPRRM+RISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS
Subjt:  DSEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS

Query:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRL
        WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTP+GGARS+QGPVVTYP+EIDGIPKM LPVFGLASYKFRGSLWTPNGGFEWQLAN LLQSAE+WLR L
Subjt:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRL

Query:  QVNHPDFIFFSGR
        QVNHPDFIFFS R
Subjt:  QVNHPDFIFFSGR

XP_023550272.1 uncharacterized protein LOC111808496 [Cucurbita pepo subsp. pepo] | E-value: 2.7e-211 | Identity: 88.59%
Query:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLRSARSDVSAGQSPAVKPSTVSSVFREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP
        MLGAG+QFGR CGDDRFYNPTKARR HQGRQNDQLR A+SDVSA QSP +KP+TVSSV RE E G GC+E  PKS+A+SAFEPVVSSLSNLERFLQSI P
Subjt:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLRSARSDVSAGQSPAVKPSTVSSVFREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP

Query:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD
        SVPAQY SKTT++GWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD
Subjt:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD

Query:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW
        SEPER LKY G QLNH HLSSEL RRM+R+SLRDQLIGLQEDCSSDEAES NSQGQLLFEHLERDLPYSREPLADK+SDLAF+FPELKTLRSCDLLPSSW
Subjt:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW

Query:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ
        FSVAWYPIYRIPTGPTLRDLDACFLTFH L TPMGGARS+QGPVVTYP++IDGIP+MSLPVFGLASYKFRGSLWTPNGG+EWQLAN LLQ AEDWLR   
Subjt:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ

Query:  VNHPDFIFFSGR
        VNHPDFIFFS R
Subjt:  VNHPDFIFFSGR

XP_038874258.1 uncharacterized protein LOC120066989 isoform X1 [Benincasa hispida] | E-value: 4.8e-216 | Identity: 91.26%
Query:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLRSARSDVSAGQSPAVKPSTVSSVFREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP
        MLGAG+QF R CGDDRFYNPTKARR HQGRQNDQLR A+SDVSAGQSP VKP  VSSV RE E G GC+E  PKS+A+SAFEPVVSSLSNLERFLQSI P
Subjt:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLRSARSDVSAGQSPAVKPSTVSSVFREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP

Query:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD
        SVPAQYLSKTTM+GWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD
Subjt:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD

Query:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW
        SEPERALKYMGKQLNH HLSSEL RRMDRIS RDQLIGLQEDCSSDEAES NSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW
Subjt:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW

Query:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ
        FSVAWYPIYRIPTGPTLRDLDACFLTFH L +PMGGARS+QGPVVTYP+EIDGIPKMSLPVFGLASYKFRGSLWTPNGG+EWQLAN LLQ AE+WLR  Q
Subjt:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ

Query:  VNHPDFIFFSGR
        VNHPDFIFFS R
Subjt:  VNHPDFIFFSGR

TrEMBL top hits (E-value, % identity, alignment)
A0A1S3CT52 uncharacterized protein LOC103504597 | E-value: 6.5e-211 | Identity: 89.08%
Query:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLRSARSDVSAGQSPAVKPSTVSSVFREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP
        MLGAG+QFGR CGD RFYNPTKARRVHQGRQ DQLR A+SDVSAGQS  VKPS VSSV RE E G GC+E  PKS+A+S FEPVVSSLSNLERFLQSI P
Subjt:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLRSARSDVSAGQSPAVKPSTVSSVFREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP

Query:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD
        SVPAQYLSKTTM+GWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSD
Subjt:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD

Query:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW
        SEPERALKYMGKQLNH HLSSEL RRMD IS RDQLIGLQEDCSSDEAES NSQGQLLFEHLERDLPYSREPLADKISDLAFQFP+LKT+RSCDLLPSSW
Subjt:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW

Query:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ
        FSVAWYPIYRIPTGPTL+DLDACFLTFH L +P GGARS+Q PVVTYP+EIDGIPKMSLPVFGLASYKFRGSLWTPNGG+EWQLAN LL  AEDWLR  Q
Subjt:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ

Query:  VNHPDFIFFSGR
        VNHPDFIFFS R
Subjt:  VNHPDFIFFSGR

A0A5D3BPU4 DUF789 domain-containing protein | E-value: 5.9e-212 | Identity: 89.56%
Query:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLRSARSDVSAGQSPAVKPSTVSSVFREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP
        MLGAG+QFGR CGDDRFYNPTKARRVHQGRQ DQLR A+SDVSAGQS  VKPS VSSV RE E G GC+E  PKS+A+S FEPVVSSLSNLERFLQSI P
Subjt:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLRSARSDVSAGQSPAVKPSTVSSVFREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP

Query:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD
        SVPAQYLSKTTM+GWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSD
Subjt:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD

Query:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW
        SEPERALKYMGKQLNH HLSSEL RRMD IS RDQLIGLQEDCSSDEAES NSQGQLLFEHLERDLPYSREPLADKISDLAFQFP+LKTLRSCDLLPSSW
Subjt:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW

Query:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ
        FSVAWYPIYRIPTGPTL+DLDACFLTFH L +P GGARS+Q PVVTYP+EIDGIPKMSLPVFGLASYKFRGSLWTPNGG+EWQLAN LL  AEDWLR  Q
Subjt:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ

Query:  VNHPDFIFFSGR
        VNHPDFIFFS R
Subjt:  VNHPDFIFFSGR

A0A6J1DDJ0 uncharacterized protein LOC111020043 | E-value: 6.3e-222 | Identity: 92.49%
Query:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLRSARSDVSAGQSPAVKPSTVSSVFREPENGAGCDEQP-PKSMAVSAFEPVVSSLSNLERFLQSIT
        MLGAG+QF R CGDDRFYNPTKARR HQGRQND+LR A+SDVSAGQSPAVKPSTVS+V RE ENG+GC+EQ  PKS+ VSAFEPVVSSLSNLERFLQSIT
Subjt:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLRSARSDVSAGQSPAVKPSTVSSVFREPENGAGCDEQP-PKSMAVSAFEPVVSSLSNLERFLQSIT

Query:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSS
        PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWE+FKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSS KSRQPGEDSDSDFRDSSSDGSS
Subjt:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSS

Query:  DSEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS
        DSEPERALKYMGK LNH H+S ELPRRM+RISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS
Subjt:  DSEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS

Query:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRL
        WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTP+GGARS+QGPVVTYP+EIDGIPKM LPVFGLASYKFRGSLWTPNGGFEWQLAN LLQSAE+WLR L
Subjt:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRL

Query:  QVNHPDFIFFSGR
        QVNHPDFIFFS R
Subjt:  QVNHPDFIFFSGR

A0A6J1FG46 uncharacterized protein LOC111445107 | E-value: 1.9e-210 | Identity: 88.59%
Query:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLRSARSDVSAGQSPAVKPSTVSSVFREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP
        MLGAG+QFGR CGDDRFYNPTKARR HQGRQNDQLR A+SDVSA QSP +KP+TVSSV RE E G GC+E  P S+A+SAFEPVVSSLSNLERFLQSI P
Subjt:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLRSARSDVSAGQSPAVKPSTVSSVFREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP

Query:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD
        SVPAQY SKTTM+GWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD
Subjt:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD

Query:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW
        SEPERALKYMG QLNH HLSSEL RR +R+SLRDQLIGLQEDC SDEAES NSQGQLLFEHLERDLPYSREPLADK+SDLAF+FPELKTLRSCDLLPSSW
Subjt:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW

Query:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ
        FSVAWYPIYRIPTGPTLRDLDACFLTFH L TPMGGARS+QGPVVTYP++IDGIP+MSLPVFGLASYKFRGSLWTPNGG EWQLAN LLQ AEDWLR   
Subjt:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ

Query:  VNHPDFIFFSGR
        VNHPDFIFFS R
Subjt:  VNHPDFIFFSGR

A0A6J1JV26 uncharacterized protein LOC111489147 | E-value: 4.2e-210 | Identity: 87.62%
Query:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLRSARSDVSAGQSPAVKPSTVSSVFREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP
        M GAG+QFGR CGDDRFYNPTKARR HQGRQNDQLR  +SDVSA +SP +KP+TVSS+ RE E G GC+E  PKS+A+SAFEPVVSSLSNLERFLQSI P
Subjt:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLRSARSDVSAGQSPAVKPSTVSSVFREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP

Query:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD
        SVPAQY SKTT++GWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD
Subjt:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD

Query:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW
        SEPERA+KYMG QLNH HLSSEL RRM+R+SLRDQLIGLQEDCSSDEAES NSQGQLLFEHLERDLPYSREPLADK+SDLAF+FPELKTLRSCDLLPSSW
Subjt:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW

Query:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ
        FSVAWYPIYRIPTGPTLRDLDACFLTFH L TPMGGARS+QGPVVTYP++IDGIP+MSLPVFGLASYKFRGSLWTPNGG+EWQLAN LLQ A+DWLR   
Subjt:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ

Query:  VNHPDFIFFSGR
        VNHPDFIFFS R
Subjt:  VNHPDFIFFSGR

SwissProt top hits (E-value, % identity, alignment)
No hits found

Arabidopsis top hits (E-value, % identity, alignment)
AT1G15030.1 Protein of unknown function (DUF789) | E-value: 1.4e-109 | Identity: 58.33%
Query:  QLRSARSDVSAGQSPAVKPSTVSSVFREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITPSVPAQYLSKTTMRGWRTCDVEFQ-PYFVLGDLW
        QL+ A+ DVS G          SS  ++ ENG+          A+       +S SN+ERFL S+TPSVPA YLSKT +R     DVE Q PYF+LGD+W
Subjt:  QLRSARSDVSAGQSPAVKPSTVSSVFREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITPSVPAQYLSKTTMRGWRTCDVEFQ-PYFVLGDLW

Query:  ESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYG--ESLKSSAKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLNHLHLSSELPRRMDR
        ESF EWSAYG GVPL LN++ D V QYYVP LSGIQ+Y   ++L SS ++R+ GE+S+SDFRDSSS+GSS SE ER L Y  +Q++          RMD+
Subjt:  ESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYG--ESLKSSAKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLNHLHLSSELPRRMDR

Query:  ISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHS
        +SLR +    QED SSD+ E  +SQG+L+FE+LERDLPY REP ADK+SDLA +FPELKTLRSCDLLPSSWFSVAWYPIY+IPTGPTL+DLDACFLT+HS
Subjt:  ISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHS

Query:  LYTPMGGARSIQGPV-VTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQVNHPDFIFFSGR
        L+TP  G     G + V  P E   + KM LPVFGLASYK RGS+WT  GG   QLAN L Q+A++WLR  QVNHPDFIFF  R
Subjt:  LYTPMGGARSIQGPV-VTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQVNHPDFIFFSGR

AT2G01260.1 Protein of unknown function (DUF789) | E-value: 6.4e-118 | Identity: 58.37%
Query:  MLGAGVQFGR-RCGDDRFYNPTKARRVHQGRQNDQLRSARSDVSAGQSPAVKPSTVSSVFREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSIT
        MLGAG Q  R R GDD FY   K RR +Q  + DQLR A+SDVS   S A  P                             EP   S SNL+RFL+S+T
Subjt:  MLGAGVQFGR-RCGDDRFYNPTKARRVHQGRQNDQLRSARSDVSAGQSPAVKPSTVSSVFREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSIT

Query:  PSVPAQYLSKTTMRGWRTCD--VEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSS
        PSVPAQ+LSKT +R  R  D   +  PYFVLGD+W+SF EWSAYG GVPLVLN++ D V+QYYVP LS IQIY  S  L SS KSR+PG+ SDSDFRDSS
Subjt:  PSVPAQYLSKTTMRGWRTCD--VEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSS

Query:  SDGSSDSEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCD
        SD SSDS+ ER                 +  R+D ISLRDQ    QED SSD+ E   SQG+L+FE+LERDLPY REP ADK+ DLA QFPEL TLRSCD
Subjt:  SDGSSDSEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCD

Query:  LLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAED
        LL SSWFSVAWYPIYRIPTGPTL+DLDACFLT+HSL+T  GG  S Q   +T P E +   KMSLPVFGLASYKFRGSLWTP GG E QL N L Q+A+ 
Subjt:  LLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAED

Query:  WLRRLQVNHPDFIFFSGR
        WL    V+HPDF+FF  R
Subjt:  WLRRLQVNHPDFIFFSGR

AT2G01260.2 Protein of unknown function (DUF789) | E-value: 7.9e-92 | Identity: 58.19%
Query:  MLGAGVQFGR-RCGDDRFYNPTKARRVHQGRQNDQLRSARSDVSAGQSPAVKPSTVSSVFREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSIT
        MLGAG Q  R R GDD FY   K RR +Q  + DQLR A+SDVS   S A  P                             EP   S SNL+RFL+S+T
Subjt:  MLGAGVQFGR-RCGDDRFYNPTKARRVHQGRQNDQLRSARSDVSAGQSPAVKPSTVSSVFREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSIT

Query:  PSVPAQYLSKTTMRGWRTCD--VEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSS
        PSVPAQ+LSKT +R  R  D   +  PYFVLGD+W+SF EWSAYG GVPLVLN++ D V+QYYVP LS IQIY  S  L SS KSR+PG+ SDSDFRDSS
Subjt:  PSVPAQYLSKTTMRGWRTCD--VEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSS

Query:  SDGSSDSEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCD
        SD SSDS+ ER                 +  R+D ISLRDQ    QED SSD+ E   SQG+L+FE+LERDLPY REP ADK+ DLA QFPEL TLRSCD
Subjt:  SDGSSDSEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCD

Query:  LLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGG
        LL SSWFSVAWYPIYRIPTGPTL+DLDACFLT+HSL+T  GG
Subjt:  LLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGG

AT4G16100.1 Protein of unknown function (DUF789) | E-value: 2.0e-87 | Identity: 46.53%
Query:  GDDRFYNPTKARRVHQGRQNDQLRSARSDVSAGQSPAV---KPSTVSSVFREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITPSVPAQYLSK
        G++RFYNP   R++ Q R+  +L +   +    ++  +   K        ++PE  +  D   P S   S      ++ SNL RFL   TP V  Q+L  
Subjt:  GDDRFYNPTKARRVHQGRQNDQLRSARSDVSAGQSPAV---KPSTVSSVFREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITPSVPAQYLSK

Query:  TTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDF-RDSSSDGSSDSEPERALK
        T+ +GWRT + E++PYF+L DLW+SF+EWSAYG GVPL+LN  DSVVQYYVPYLSGIQ+Y +  ++    R+ GE+SD D  RD SSDGS+D        
Subjt:  TTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDF-RDSSSDGSSDSEPERALK

Query:  YMGKQLNHLHLSSELPRRMDRISLRDQ-LIGLQEDCSSDEAE-SFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWFSVAWY
                     EL + + R SL ++  IG     SSDE+E S NS G+L+FE+LE  +P+ REPL DKIS+L+ QFP L+T RSCDL PSSW SVAWY
Subjt:  YMGKQLNHLHLSSELPRRMDRISLRDQ-LIGLQEDCSSDEAE-SFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWFSVAWY

Query:  PIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGF-EWQLANLLLQSAEDWLRRLQVNHPD
        PIYRIP G +L++LDACFLTFHSL TP  G  + +G      ++     K+ LP FGLASYKF+ S W+P     E Q    LL++AE+WLRRL+V  PD
Subjt:  PIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGF-EWQLANLLLQSAEDWLRRLQVNHPD

Query:  FIFF
        F  F
Subjt:  FIFF

AT5G49220.1 Protein of unknown function (DUF789) | E-value: 1.5e-79 | Identity: 44.39%
Query:  GVQFGRRC--GDDRFYNPTKARRVHQGRQNDQL---RSARSDVSAGQSPAVKPSTVSSVFREPENGAGCDEQPPK------------SMAVSAFEPVVSS
        GV   R    G++RFYNP   RR+ Q  Q  Q    +  R D         +    +   R    G G  E   +            S + S    V+S 
Subjt:  GVQFGRRC--GDDRFYNPTKARRVHQGRQNDQL---RSARSDVSAGQSPAVKPSTVSSVFREPENGAGCDEQPPK------------SMAVSAFEPVVSS

Query:  LSNLERFLQSITPSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGV-----PLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQP
         SNL+RFL+  TP VPA+     +    +T + +   YFVL DLWESF EWSAYGAGV     PL ++ +DS VQYYVPYLSGIQ+Y + LK   K R P
Subjt:  LSNLERFLQSITPSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGV-----PLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQP

Query:  GEDSDSDFRDSSSDGSSDSEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAF
          D+     + SS+GSS+S        +G+              ++RISL+DQ   +    SS EAE  N QG+LLFE+LE + P+ REPLA+KISDLA 
Subjt:  GEDSDSDFRDSSSDGSSDSEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAF

Query:  QFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEW
        + PEL T RSCDLLPSSW SV+WYPIYRIP GPTL++LDACFLTFHSL T     +S  G   + P+      K+ LP FGLASYK + S+W  N   E 
Subjt:  QFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEW

Query:  QLANLLLQSAEDWLRRLQVNHPDFIFFS
        Q    LLQ+A+ WL+RLQV+HPD+ FF+
Subjt:  QLANLLLQSAEDWLRRLQVNHPDFIFFS
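
In the alignment blocks above, the unlabeled middle line marks each column: a letter where query and hit are identical, `+` for a conservative substitution, and a space otherwise. The following Python sketch shows how a percent-identity figure could be derived from a query line and its middle line. The function name `percent_identity` is hypothetical, and this is not necessarily how the database computed its reported values, which cover the full alignment rather than a single block.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percentage of alignment columns whose midline repeats the query residue.

    Assumes BLAST-style pairwise text output: identical columns carry the
    residue letter in the midline, '+' marks a conservative substitution,
    and a space marks a mismatch or gap.
    """
    if not query:
        raise ValueError("empty alignment")
    if len(midline) > len(query):
        raise ValueError("midline is longer than the query line")
    # Trailing spaces are often stripped from midlines; pad them back.
    midline = midline.ljust(len(query))
    identical = sum(
        1 for q, m in zip(query, midline) if q == m and q != "-"
    )
    return 100.0 * identical / len(query)


# Final alignment block of the first GenBank hit (12 columns, 11 identical).
print(round(percent_identity("VNHPDFIFFSGR", "VNHPDFIFFS R"), 2))
```

To approximate the overall figure for a hit, the query and middle lines of all its blocks would first be concatenated, keeping the leading label column out of both strings.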


Sequences
CDS sequence
ATGTTGGGTGCAGGGGTGCAGTTTGGTCGTCGTTGTGGAGACGACAGGTTTTACAATCCGACGAAAGCTCGTAGGGTGCATCAGGGTCGCCAAAATGATCAGCTCCGGAG
CGCTCGTAGCGACGTTTCTGCTGGCCAATCCCCTGCGGTTAAACCAAGCACGGTGTCCTCGGTGTTTAGAGAACCCGAAAACGGCGCTGGGTGTGACGAGCAGCCCCCAA
AATCCATGGCGGTGTCGGCTTTTGAGCCAGTGGTGTCGTCGCTGAGTAATCTGGAGCGGTTCTTGCAGTCTATCACGCCATCTGTACCTGCACAATACCTCTCAAAGACA
ACGATGAGGGGTTGGAGAACTTGTGACGTGGAATTTCAACCATACTTTGTCCTTGGTGATTTGTGGGAGTCTTTCAAGGAATGGAGTGCTTATGGTGCAGGTGTGCCTCT
TGTATTAAACGACAGTGACAGTGTTGTCCAGTATTATGTACCATATTTATCCGGTATACAGATATATGGAGAATCCTTGAAGTCCTCTGCAAAGTCAAGGCAACCAGGTG
AGGACAGTGATAGTGATTTCAGAGATTCAAGTAGTGATGGTAGTAGTGATTCAGAACCTGAACGAGCTCTAAAATACATGGGGAAACAACTCAATCATCTCCATTTATCT
TCTGAGCTTCCTCGTAGAATGGATAGGATATCTTTGCGGGACCAGCTGATTGGACTTCAAGAAGACTGTTCTAGTGACGAGGCTGAATCTTTTAATTCTCAAGGTCAGCT
GCTATTTGAGCATCTTGAACGTGACTTGCCTTATAGTCGCGAACCTTTGGCAGATAAGATATCAGATCTTGCCTTTCAGTTCCCTGAGCTCAAGACATTACGAAGTTGTG
ATCTATTGCCATCCAGCTGGTTTTCTGTGGCATGGTATCCAATTTACAGGATACCGACTGGACCAACGTTAAGGGATCTGGATGCCTGCTTCCTCACCTTTCATTCATTG
TATACGCCAATGGGAGGGGCACGGAGCATTCAAGGGCCTGTGGTGACGTATCCTAATGAGATAGATGGTATCCCTAAGATGTCCCTACCAGTTTTTGGTCTAGCTTCATA
CAAGTTTAGAGGGTCTTTATGGACTCCAAATGGCGGGTTCGAGTGGCAATTGGCAAATTTGCTGTTGCAGTCAGCTGAGGATTGGTTAAGACGACTTCAAGTCAATCACC
CTGACTTCATCTTCTTCAGCGGACGGTGA
mRNA sequence
ATGTTGGGTGCAGGGGTGCAGTTTGGTCGTCGTTGTGGAGACGACAGGTTTTACAATCCGACGAAAGCTCGTAGGGTGCATCAGGGTCGCCAAAATGATCAGCTCCGGAG
CGCTCGTAGCGACGTTTCTGCTGGCCAATCCCCTGCGGTTAAACCAAGCACGGTGTCCTCGGTGTTTAGAGAACCCGAAAACGGCGCTGGGTGTGACGAGCAGCCCCCAA
AATCCATGGCGGTGTCGGCTTTTGAGCCAGTGGTGTCGTCGCTGAGTAATCTGGAGCGGTTCTTGCAGTCTATCACGCCATCTGTACCTGCACAATACCTCTCAAAGACA
ACGATGAGGGGTTGGAGAACTTGTGACGTGGAATTTCAACCATACTTTGTCCTTGGTGATTTGTGGGAGTCTTTCAAGGAATGGAGTGCTTATGGTGCAGGTGTGCCTCT
TGTATTAAACGACAGTGACAGTGTTGTCCAGTATTATGTACCATATTTATCCGGTATACAGATATATGGAGAATCCTTGAAGTCCTCTGCAAAGTCAAGGCAACCAGGTG
AGGACAGTGATAGTGATTTCAGAGATTCAAGTAGTGATGGTAGTAGTGATTCAGAACCTGAACGAGCTCTAAAATACATGGGGAAACAACTCAATCATCTCCATTTATCT
TCTGAGCTTCCTCGTAGAATGGATAGGATATCTTTGCGGGACCAGCTGATTGGACTTCAAGAAGACTGTTCTAGTGACGAGGCTGAATCTTTTAATTCTCAAGGTCAGCT
GCTATTTGAGCATCTTGAACGTGACTTGCCTTATAGTCGCGAACCTTTGGCAGATAAGATATCAGATCTTGCCTTTCAGTTCCCTGAGCTCAAGACATTACGAAGTTGTG
ATCTATTGCCATCCAGCTGGTTTTCTGTGGCATGGTATCCAATTTACAGGATACCGACTGGACCAACGTTAAGGGATCTGGATGCCTGCTTCCTCACCTTTCATTCATTG
TATACGCCAATGGGAGGGGCACGGAGCATTCAAGGGCCTGTGGTGACGTATCCTAATGAGATAGATGGTATCCCTAAGATGTCCCTACCAGTTTTTGGTCTAGCTTCATA
CAAGTTTAGAGGGTCTTTATGGACTCCAAATGGCGGGTTCGAGTGGCAATTGGCAAATTTGCTGTTGCAGTCAGCTGAGGATTGGTTAAGACGACTTCAAGTCAATCACC
CTGACTTCATCTTCTTCAGCGGACGGTGA
Protein sequence
MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLRSARSDVSAGQSPAVKPSTVSSVFREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITPSVPAQYLSKT
TMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLNHLHLS
SELPRRMDRISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHSL
YTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQVNHPDFIFFSGR
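
The CDS and protein sequences above are related by codon translation. The following Python sketch translates a CDS string, assuming the standard nuclear genetic code (the record does not name a translation table). The `translate` helper and `CODON_TABLE` are hypothetical names for illustration, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
_BASES = "TCAG"
_AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a CDS into one-letter amino acids.

    Whitespace and line breaks are ignored. Codons containing characters
    other than T, C, A, G become 'X'. With to_stop=True, translation ends
    at the first stop codon, which is not included in the result.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First twelve codons of the CDS listed above.
print(translate("ATGTTGGGTGCAGGGGTGCAGTTTGGTCGTCGTTGT"))  # MLGAGVQFGRRC
```

Passing the full CDS with its line breaks intact and comparing the result to the protein sequence is a quick consistency check on a copied record.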