CuGenDBv2

Sgr025961 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr025961
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Protein of unknown function (DUF789)
Genome location: tig00153017:1855222..1858383
RNA-Seq Expression: Sgr025961
Synteny: Sgr025961
Gene Ontology terms: NA
InterPro domains: IPR008507 - Protein of unknown function DUF789
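The genome location field follows a `contig:start..end` pattern. As a minimal sketch, the snippet below parses that string and computes the length of the genomic span. The function names `parse_location` and `span_length` are hypothetical, and the calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end = parse_location("tig00153017:1855222..1858383")
print(contig, span_length(start, end))
```

Under that coordinate assumption the span is 3162 bp, which is longer than the 1230 nt CDS listed further down.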


Homology
GenBank top hits (e value, % identity, alignment)
KAG7016733.1 hypothetical protein SDJN02_21843, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 5.6e-209; % identity: 88.35)
Query:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLR---SDVSAGQSPAVKPSTVSSVIREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP
        MLGAG+QFGR CGDDRFYNPTKARR HQGRQNDQLR   SDVSA Q P +KP+TVSSVIRE E G GC E  PKS+A+SAFEPVVSSLSNLERFLQSI P
Subjt:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLR---SDVSAGQSPAVKPSTVSSVIREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP

Query:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNHSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD
        SVPAQY SKTTM+GWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLN SDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD
Subjt:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNHSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD

Query:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGFQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW
        SEPERALKYMG QLNH HLSSEL RRM+R+SLRDQLIG QEDCSSDEAES NSQGQLLFEHLERDLPYSREPLADK+SDL+F+FPELKTLRSCDLLPSSW
Subjt:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGFQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW

Query:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ
        FSVAWYPIYRIPTGPTLRDLDACFLTFH L TPMGGARS+QGPVVTYP++IDGIP+MSLPVFGLASYKFRGSLWTPNGG+EWQLAN LLQ AEDWLR   
Subjt:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ

Query:  VNHPDFIFFSGR
        VNHPDFIFFS R
Subjt:  VNHPDFIFFSGR

TYK00266.1 DUF789 domain-containing protein [Cucumis melo var. makuwa] (e value: 4.3e-209; % identity: 89.08)
Query:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLR---SDVSAGQSPAVKPSTVSSVIREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP
        MLGAG+QFGR CGDDRFYNPTKARRVHQGRQ DQLR   SDVSAGQS  VKPS VSSVIRE E G GC+E  PKS+A+S FEPVVSSLSNLERFLQSI P
Subjt:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLR---SDVSAGQSPAVKPSTVSSVIREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP

Query:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNHSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD
        SVPAQYLSKTTM+GWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLN SDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSD
Subjt:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNHSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD

Query:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGFQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW
        SEPERALKYMGKQLNH HLSSEL RRMD IS RDQLIG QEDCSSDEAES NSQGQLLFEHLERDLPYSREPLADKISDLAFQFP+LKTLRSCDLLPSSW
Subjt:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGFQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW

Query:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ
        FSVAWYPIYRIPTGPTL+DLDACFLTFH L +P GGARS+Q PVVTYP+EIDGIPKMSLPVFGLASYKFRGSLWTPNGG+EWQLAN LL  AEDWLR  Q
Subjt:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ

Query:  VNHPDFIFFSGR
        VNHPDFIFFS R
Subjt:  VNHPDFIFFSGR

XP_022152290.1 uncharacterized protein LOC111020043 [Momordica charantia] (e value: 6.0e-219; % identity: 92.01)
Query:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLR---SDVSAGQSPAVKPSTVSSVIREPENGAGCDEQP-PKSMAVSAFEPVVSSLSNLERFLQSIT
        MLGAG+QF R CGDDRFYNPTKARR HQGRQND+LR   SDVSAGQSPAVKPSTVS+VIRE ENG+GC+EQ  PKS+ VSAFEPVVSSLSNLERFLQSIT
Subjt:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLR---SDVSAGQSPAVKPSTVSSVIREPENGAGCDEQP-PKSMAVSAFEPVVSSLSNLERFLQSIT

Query:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNHSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSS
        PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWE+FKEWSAYGAGVPLVLN SDSVVQYYVPYLSGIQIYGESLKSS KSRQPGEDSDSDFRDSSSDGSS
Subjt:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNHSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSS

Query:  DSEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGFQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS
        DSEPERALKYMGK LNH H+S ELPRRM+RISLRDQLIG QEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS
Subjt:  DSEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGFQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS

Query:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRL
        WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTP+GGARS+QGPVVTYP+EIDGIPKM LPVFGLASYKFRGSLWTPNGGFEWQLAN LLQSAE+WLR L
Subjt:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRL

Query:  QVNHPDFIFFSGR
        QVNHPDFIFFS R
Subjt:  QVNHPDFIFFSGR

XP_023550272.1 uncharacterized protein LOC111808496 [Cucurbita pepo subsp. pepo] (e value: 9.6e-209; % identity: 88.11)
Query:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLR---SDVSAGQSPAVKPSTVSSVIREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP
        MLGAG+QFGR CGDDRFYNPTKARR HQGRQNDQLR   SDVSA QSP +KP+TVSSVIRE E G GC+E  PKS+A+SAFEPVVSSLSNLERFLQSI P
Subjt:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLR---SDVSAGQSPAVKPSTVSSVIREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP

Query:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNHSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD
        SVPAQY SKTT++GWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLN SDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD
Subjt:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNHSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD

Query:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGFQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW
        SEPER LKY G QLNH HLSSEL RRM+R+SLRDQLIG QEDCSSDEAES NSQGQLLFEHLERDLPYSREPLADK+SDLAF+FPELKTLRSCDLLPSSW
Subjt:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGFQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW

Query:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ
        FSVAWYPIYRIPTGPTLRDLDACFLTFH L TPMGGARS+QGPVVTYP++IDGIP+MSLPVFGLASYKFRGSLWTPNGG+EWQLAN LLQ AEDWLR   
Subjt:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ

Query:  VNHPDFIFFSGR
        VNHPDFIFFS R
Subjt:  VNHPDFIFFSGR

XP_038874258.1 uncharacterized protein LOC120066989 isoform X1 [Benincasa hispida] (e value: 1.7e-213; % identity: 90.78)
Query:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLR---SDVSAGQSPAVKPSTVSSVIREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP
        MLGAG+QF R CGDDRFYNPTKARR HQGRQNDQLR   SDVSAGQSP VKP  VSSVIRE E G GC+E  PKS+A+SAFEPVVSSLSNLERFLQSI P
Subjt:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLR---SDVSAGQSPAVKPSTVSSVIREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP

Query:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNHSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD
        SVPAQYLSKTTM+GWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLN SDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD
Subjt:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNHSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD

Query:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGFQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW
        SEPERALKYMGKQLNH HLSSEL RRMDRIS RDQLIG QEDCSSDEAES NSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW
Subjt:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGFQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW

Query:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ
        FSVAWYPIYRIPTGPTLRDLDACFLTFH L +PMGGARS+QGPVVTYP+EIDGIPKMSLPVFGLASYKFRGSLWTPNGG+EWQLAN LLQ AE+WLR  Q
Subjt:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ

Query:  VNHPDFIFFSGR
        VNHPDFIFFS R
Subjt:  VNHPDFIFFSGR
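The alignments above use a BLAST-style layout in which the middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. As a rough sketch of how a percent identity could be derived from such a block, the hypothetical helper `midline_identity` below counts identical columns against the query row length. Whether the database computes its reported % identity in exactly this way is an assumption.

```python
from typing import Tuple


def midline_identity(query: str, midline: str) -> Tuple[int, int]:
    """Return (identical_columns, aligned_columns) for one alignment block.

    The query row (including gap characters) defines the number of aligned
    columns, since trailing spaces of the midline may be lost in plain text.
    """
    columns = len(query)
    # Letters in the midline mark identical residues; '+' and ' ' do not count.
    identical = sum(1 for ch in midline[:columns] if ch.isalpha())
    return identical, columns


# Last block of the alignments above.
identical, columns = midline_identity("VNHPDFIFFSGR", "VNHPDFIFFS R")
print(f"{identical}/{columns} = {100 * identical / columns:.2f}%")
```

Summing the two counts over all blocks of one hit, rather than averaging per-block percentages, would give the identity over the whole alignment.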

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CT52 uncharacterized protein LOC103504597 (e value: 2.3e-208; % identity: 88.59)
Query:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLR---SDVSAGQSPAVKPSTVSSVIREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP
        MLGAG+QFGR CGD RFYNPTKARRVHQGRQ DQLR   SDVSAGQS  VKPS VSSVIRE E G GC+E  PKS+A+S FEPVVSSLSNLERFLQSI P
Subjt:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLR---SDVSAGQSPAVKPSTVSSVIREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP

Query:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNHSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD
        SVPAQYLSKTTM+GWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLN SDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSD
Subjt:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNHSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD

Query:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGFQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW
        SEPERALKYMGKQLNH HLSSEL RRMD IS RDQLIG QEDCSSDEAES NSQGQLLFEHLERDLPYSREPLADKISDLAFQFP+LKT+RSCDLLPSSW
Subjt:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGFQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW

Query:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ
        FSVAWYPIYRIPTGPTL+DLDACFLTFH L +P GGARS+Q PVVTYP+EIDGIPKMSLPVFGLASYKFRGSLWTPNGG+EWQLAN LL  AEDWLR  Q
Subjt:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ

Query:  VNHPDFIFFSGR
        VNHPDFIFFS R
Subjt:  VNHPDFIFFSGR

A0A5D3BPU4 DUF789 domain-containing protein (e value: 2.1e-209; % identity: 89.08)
Query:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLR---SDVSAGQSPAVKPSTVSSVIREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP
        MLGAG+QFGR CGDDRFYNPTKARRVHQGRQ DQLR   SDVSAGQS  VKPS VSSVIRE E G GC+E  PKS+A+S FEPVVSSLSNLERFLQSI P
Subjt:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLR---SDVSAGQSPAVKPSTVSSVIREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP

Query:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNHSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD
        SVPAQYLSKTTM+GWRTCD+EFQPYFVLGDLWESFKEWSAYGAGVPLVLN SDSVVQYYVPYLSGIQIYGES KSSAKSRQPGEDSDSDFRDSSSDGSSD
Subjt:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNHSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD

Query:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGFQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW
        SEPERALKYMGKQLNH HLSSEL RRMD IS RDQLIG QEDCSSDEAES NSQGQLLFEHLERDLPYSREPLADKISDLAFQFP+LKTLRSCDLLPSSW
Subjt:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGFQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW

Query:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ
        FSVAWYPIYRIPTGPTL+DLDACFLTFH L +P GGARS+Q PVVTYP+EIDGIPKMSLPVFGLASYKFRGSLWTPNGG+EWQLAN LL  AEDWLR  Q
Subjt:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ

Query:  VNHPDFIFFSGR
        VNHPDFIFFS R
Subjt:  VNHPDFIFFSGR

A0A6J1DDJ0 uncharacterized protein LOC111020043 (e value: 2.9e-219; % identity: 92.01)
Query:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLR---SDVSAGQSPAVKPSTVSSVIREPENGAGCDEQP-PKSMAVSAFEPVVSSLSNLERFLQSIT
        MLGAG+QF R CGDDRFYNPTKARR HQGRQND+LR   SDVSAGQSPAVKPSTVS+VIRE ENG+GC+EQ  PKS+ VSAFEPVVSSLSNLERFLQSIT
Subjt:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLR---SDVSAGQSPAVKPSTVSSVIREPENGAGCDEQP-PKSMAVSAFEPVVSSLSNLERFLQSIT

Query:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNHSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSS
        PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWE+FKEWSAYGAGVPLVLN SDSVVQYYVPYLSGIQIYGESLKSS KSRQPGEDSDSDFRDSSSDGSS
Subjt:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNHSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSS

Query:  DSEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGFQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS
        DSEPERALKYMGK LNH H+S ELPRRM+RISLRDQLIG QEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS
Subjt:  DSEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGFQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS

Query:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRL
        WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTP+GGARS+QGPVVTYP+EIDGIPKM LPVFGLASYKFRGSLWTPNGGFEWQLAN LLQSAE+WLR L
Subjt:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRL

Query:  QVNHPDFIFFSGR
        QVNHPDFIFFS R
Subjt:  QVNHPDFIFFSGR

A0A6J1FG46 uncharacterized protein LOC111445107 (e value: 6.7e-208; % identity: 88.11)
Query:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLR---SDVSAGQSPAVKPSTVSSVIREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP
        MLGAG+QFGR CGDDRFYNPTKARR HQGRQNDQLR   SDVSA QSP +KP+TVSSVIRE E G GC+E  P S+A+SAFEPVVSSLSNLERFLQSI P
Subjt:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLR---SDVSAGQSPAVKPSTVSSVIREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP

Query:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNHSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD
        SVPAQY SKTTM+GWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLN SDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD
Subjt:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNHSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD

Query:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGFQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW
        SEPERALKYMG QLNH HLSSEL RR +R+SLRDQLIG QEDC SDEAES NSQGQLLFEHLERDLPYSREPLADK+SDLAF+FPELKTLRSCDLLPSSW
Subjt:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGFQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW

Query:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ
        FSVAWYPIYRIPTGPTLRDLDACFLTFH L TPMGGARS+QGPVVTYP++IDGIP+MSLPVFGLASYKFRGSLWTPNGG EWQLAN LLQ AEDWLR   
Subjt:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ

Query:  VNHPDFIFFSGR
        VNHPDFIFFS R
Subjt:  VNHPDFIFFSGR

A0A6J1JV26 uncharacterized protein LOC111489147 (e value: 5.1e-208; % identity: 87.38)
Query:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLR---SDVSAGQSPAVKPSTVSSVIREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP
        M GAG+QFGR CGDDRFYNPTKARR HQGRQNDQLR   SDVSA +SP +KP+TVSS+IRE E G GC+E  PKS+A+SAFEPVVSSLSNLERFLQSI P
Subjt:  MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLR---SDVSAGQSPAVKPSTVSSVIREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITP

Query:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNHSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD
        SVPAQY SKTT++GWRTCD E QPYFVLGDLWESFKEWSAYGAGVPLVLN SDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD
Subjt:  SVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNHSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSD

Query:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGFQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW
        SEPERA+KYMG QLNH HLSSEL RRM+R+SLRDQLIG QEDCSSDEAES NSQGQLLFEHLERDLPYSREPLADK+SDLAF+FPELKTLRSCDLLPSSW
Subjt:  SEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGFQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSW

Query:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ
        FSVAWYPIYRIPTGPTLRDLDACFLTFH L TPMGGARS+QGPVVTYP++IDGIP+MSLPVFGLASYKFRGSLWTPNGG+EWQLAN LLQ A+DWLR   
Subjt:  FSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQ

Query:  VNHPDFIFFSGR
        VNHPDFIFFS R
Subjt:  VNHPDFIFFSGR

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G15030.1 Protein of unknown function (DUF789) (e value: 1.4e-109; % identity: 63.55)
Query:  SSLSNLERFLQSITPSVPAQYLSKTTMRGWRTCDVEFQ-PYFVLGDLWESFKEWSAYGAGVPLVLNHS-DSVVQYYVPYLSGIQIYG--ESLKSSAKSRQ
        +S SN+ERFL S+TPSVPA YLSKT +R     DVE Q PYF+LGD+WESF EWSAYG GVPL LN++ D V QYYVP LSGIQ+Y   ++L SS ++R+
Subjt:  SSLSNLERFLQSITPSVPAQYLSKTTMRGWRTCDVEFQ-PYFVLGDLWESFKEWSAYGAGVPLVLNHS-DSVVQYYVPYLSGIQIYG--ESLKSSAKSRQ

Query:  PGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGFQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLA
         GE+S+SDFRDSSS+GSS SE ER L Y  +Q++          RMD++SLR +    QED SSD+ E  +SQG+L+FE+LERDLPY REP ADK+SDLA
Subjt:  PGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGFQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLA

Query:  FQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPV-VTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGF
         +FPELKTLRSCDLLPSSWFSVAWYPIY+IPTGPTL+DLDACFLT+HSL+TP  G     G + V  P E   + KM LPVFGLASYK RGS+WT  GG 
Subjt:  FQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPV-VTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGF

Query:  EWQLANLLLQSAEDWLRRLQVNHPDFIFFSGR
          QLAN L Q+A++WLR  QVNHPDFIFF  R
Subjt:  EWQLANLLLQSAEDWLRRLQVNHPDFIFFSGR

AT2G01260.1 Protein of unknown function (DUF789) (e value: 2.1e-116; % identity: 58.13)
Query:  MLGAGVQFGR-RCGDDRFYNPTKARRVHQGRQNDQLR---SDVSAGQSPAVKPSTVSSVIREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSIT
        MLGAG Q  R R GDD FY   K RR +Q  + DQLR   SDVS   S A  P                             EP   S SNL+RFL+S+T
Subjt:  MLGAGVQFGR-RCGDDRFYNPTKARRVHQGRQNDQLR---SDVSAGQSPAVKPSTVSSVIREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSIT

Query:  PSVPAQYLSKTTMRGWRTCD--VEFQPYFVLGDLWESFKEWSAYGAGVPLVLNHS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSS
        PSVPAQ+LSKT +R  R  D   +  PYFVLGD+W+SF EWSAYG GVPLVLN++ D V+QYYVP LS IQIY  S  L SS KSR+PG+ SDSDFRDSS
Subjt:  PSVPAQYLSKTTMRGWRTCD--VEFQPYFVLGDLWESFKEWSAYGAGVPLVLNHS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSS

Query:  SDGSSDSEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGFQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCD
        SD SSDS+ ER                 +  R+D ISLRDQ    QED SSD+ E   SQG+L+FE+LERDLPY REP ADK+ DLA QFPEL TLRSCD
Subjt:  SDGSSDSEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGFQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCD

Query:  LLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAED
        LL SSWFSVAWYPIYRIPTGPTL+DLDACFLT+HSL+T  GG  S Q   +T P E +   KMSLPVFGLASYKFRGSLWTP GG E QL N L Q+A+ 
Subjt:  LLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAED

Query:  WLRRLQVNHPDFIFFSGR
        WL    V+HPDF+FF  R
Subjt:  WLRRLQVNHPDFIFFSGR

AT2G01260.2 Protein of unknown function (DUF789) (e value: 2.5e-90; % identity: 57.89)
Query:  MLGAGVQFGR-RCGDDRFYNPTKARRVHQGRQNDQLR---SDVSAGQSPAVKPSTVSSVIREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSIT
        MLGAG Q  R R GDD FY   K RR +Q  + DQLR   SDVS   S A  P                             EP   S SNL+RFL+S+T
Subjt:  MLGAGVQFGR-RCGDDRFYNPTKARRVHQGRQNDQLR---SDVSAGQSPAVKPSTVSSVIREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSIT

Query:  PSVPAQYLSKTTMRGWRTCD--VEFQPYFVLGDLWESFKEWSAYGAGVPLVLNHS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSS
        PSVPAQ+LSKT +R  R  D   +  PYFVLGD+W+SF EWSAYG GVPLVLN++ D V+QYYVP LS IQIY  S  L SS KSR+PG+ SDSDFRDSS
Subjt:  PSVPAQYLSKTTMRGWRTCD--VEFQPYFVLGDLWESFKEWSAYGAGVPLVLNHS-DSVVQYYVPYLSGIQIYGES--LKSSAKSRQPGEDSDSDFRDSS

Query:  SDGSSDSEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGFQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCD
        SD SSDS+ ER                 +  R+D ISLRDQ    QED SSD+ E   SQG+L+FE+LERDLPY REP ADK+ DLA QFPEL TLRSCD
Subjt:  SDGSSDSEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGFQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCD

Query:  LLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGG
        LL SSWFSVAWYPIYRIPTGPTL+DLDACFLT+HSL+T  GG
Subjt:  LLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGG

AT4G16100.1 Protein of unknown function (DUF789) (e value: 1.2e-87; % identity: 46.78)
Query:  GDDRFYNPTKARRVHQGRQNDQLRSDVSAGQSPAVKP------STVSSVIREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITPSVPAQYLSK
        G++RFYNP   R++ Q R+  +L ++    +    K             I++PE  +  D   P S   S      ++ SNL RFL   TP V  Q+L  
Subjt:  GDDRFYNPTKARRVHQGRQNDQLRSDVSAGQSPAVKP------STVSSVIREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITPSVPAQYLSK

Query:  TTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNHSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDF-RDSSSDGSSDSEPERALK
        T+ +GWRT + E++PYF+L DLW+SF+EWSAYG GVPL+LN  DSVVQYYVPYLSGIQ+Y +  ++    R+ GE+SD D  RD SSDGS+D        
Subjt:  TTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNHSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDF-RDSSSDGSSDSEPERALK

Query:  YMGKQLNHLHLSSELPRRMDRISLRDQ-LIGFQEDCSSDEAE-SFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWFSVAWY
                     EL + + R SL ++  IG     SSDE+E S NS G+L+FE+LE  +P+ REPL DKIS+L+ QFP L+T RSCDL PSSW SVAWY
Subjt:  YMGKQLNHLHLSSELPRRMDRISLRDQ-LIGFQEDCSSDEAE-SFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWFSVAWY

Query:  PIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGF-EWQLANLLLQSAEDWLRRLQVNHPD
        PIYRIP G +L++LDACFLTFHSL TP  G  + +G      ++     K+ LP FGLASYKF+ S W+P     E Q    LL++AE+WLRRL+V  PD
Subjt:  PIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGF-EWQLANLLLQSAEDWLRRLQVNHPD

Query:  FIFF
        F  F
Subjt:  FIFF

AT5G49220.1 Protein of unknown function (DUF789) (e value: 3.1e-80; % identity: 44.63)
Query:  GVQFGRRC--GDDRFYNPTKARRVHQGRQ-------------NDQLRSDVSAGQSPAVKPSTVSSVIREPENG-----AGCDEQPPKSMAVSAFEPVVSS
        GV   R    G++RFYNP   RR+ Q  Q              D++  D    ++  V P T    +   E+      +G +     S + S    V+S 
Subjt:  GVQFGRRC--GDDRFYNPTKARRVHQGRQ-------------NDQLRSDVSAGQSPAVKPSTVSSVIREPENG-----AGCDEQPPKSMAVSAFEPVVSS

Query:  LSNLERFLQSITPSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGV-----PLVLNHSDSVVQYYVPYLSGIQIYGESLKSSAKSRQP
         SNL+RFL+  TP VPA+     +    +T + +   YFVL DLWESF EWSAYGAGV     PL ++ +DS VQYYVPYLSGIQ+Y + LK   K R P
Subjt:  LSNLERFLQSITPSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWESFKEWSAYGAGV-----PLVLNHSDSVVQYYVPYLSGIQIYGESLKSSAKSRQP

Query:  GEDSDSDFRDSSSDGSSDSEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGFQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAF
          D+     + SS+GSS+S        +G+              ++RISL+DQ I      SS EAE  N QG+LLFE+LE + P+ REPLA+KISDLA 
Subjt:  GEDSDSDFRDSSSDGSSDSEPERALKYMGKQLNHLHLSSELPRRMDRISLRDQLIGFQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAF

Query:  QFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEW
        + PEL T RSCDLLPSSW SV+WYPIYRIP GPTL++LDACFLTFHSL T     +S  G   + P+      K+ LP FGLASYK + S+W  N   E 
Subjt:  QFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPMGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEW

Query:  QLANLLLQSAEDWLRRLQVNHPDFIFFS
        Q    LLQ+A+ WL+RLQV+HPD+ FF+
Subjt:  QLANLLLQSAEDWLRRLQVNHPDFIFFS


Sequences
CDS sequence
ATGTTGGGTGCAGGGGTGCAGTTTGGTCGTCGTTGTGGAGACGACAGGTTTTACAATCCGACGAAAGCTCGTAGGGTGCATCAGGGTCGCCAAAATGATCAGCTCCGGAG
CGACGTTTCTGCTGGCCAATCCCCTGCGGTTAAACCAAGCACGGTGTCCTCGGTGATTAGAGAACCCGAAAACGGCGCTGGGTGTGACGAGCAGCCCCCAAAATCCATGG
CGGTGTCGGCTTTTGAGCCAGTGGTGTCGTCGCTGAGTAATCTGGAGCGGTTCTTGCAGTCTATCACGCCATCTGTACCTGCACAATACCTCTCAAAGACAACGATGAGG
GGTTGGAGAACTTGTGACGTGGAATTTCAACCATACTTTGTCCTTGGTGATTTGTGGGAGTCTTTCAAGGAATGGAGTGCTTATGGTGCAGGTGTGCCTCTTGTATTAAA
CCACAGTGACAGTGTTGTCCAGTATTATGTACCATATTTATCCGGTATACAGATATATGGGGAATCCTTGAAGTCCTCTGCAAAGTCAAGGCAACCAGGTGAGGACAGTG
ATAGTGATTTCAGAGATTCAAGTAGTGATGGTAGTAGTGATTCAGAACCTGAACGAGCTCTAAAATACATGGGGAAACAACTCAATCATCTCCATTTATCTTCTGAGCTT
CCTCGTAGAATGGATAGGATATCTTTGCGGGACCAGCTGATTGGATTTCAAGAAGACTGTTCTAGTGACGAGGCTGAATCTTTTAATTCTCAAGGTCAGCTGCTATTTGA
GCATCTTGAACGTGACTTGCCTTATAGTCGTGAACCTTTGGCAGATAAGATATCAGATCTTGCCTTTCAGTTCCCTGAGCTCAAGACATTACGAAGTTGTGATCTATTGC
CGTCCAGCTGGTTTTCTGTGGCATGGTATCCAATTTACAGGATACCGACTGGACCAACGTTAAGGGATCTGGATGCCTGCTTCCTCACCTTTCATTCATTGTATACGCCA
ATGGGAGGGGCACGGAGCATTCAAGGGCCTGTGGTGACGTATCCTAATGAGATAGATGGTATCCCTAAGATGTCCCTACCAGTTTTTGGTCTAGCTTCATACAAGTTTAG
AGGGTCTTTATGGACTCCAAATGGCGGGTTCGAGTGGCAATTGGCAAATTTGCTGTTGCAGTCAGCTGAGGATTGGTTAAGACGACTTCAAGTCAATCACCCTGACTTCA
TCTTCTTCAGCGGACGGTGA
mRNA sequence
ATGTTGGGTGCAGGGGTGCAGTTTGGTCGTCGTTGTGGAGACGACAGGTTTTACAATCCGACGAAAGCTCGTAGGGTGCATCAGGGTCGCCAAAATGATCAGCTCCGGAG
CGACGTTTCTGCTGGCCAATCCCCTGCGGTTAAACCAAGCACGGTGTCCTCGGTGATTAGAGAACCCGAAAACGGCGCTGGGTGTGACGAGCAGCCCCCAAAATCCATGG
CGGTGTCGGCTTTTGAGCCAGTGGTGTCGTCGCTGAGTAATCTGGAGCGGTTCTTGCAGTCTATCACGCCATCTGTACCTGCACAATACCTCTCAAAGACAACGATGAGG
GGTTGGAGAACTTGTGACGTGGAATTTCAACCATACTTTGTCCTTGGTGATTTGTGGGAGTCTTTCAAGGAATGGAGTGCTTATGGTGCAGGTGTGCCTCTTGTATTAAA
CCACAGTGACAGTGTTGTCCAGTATTATGTACCATATTTATCCGGTATACAGATATATGGGGAATCCTTGAAGTCCTCTGCAAAGTCAAGGCAACCAGGTGAGGACAGTG
ATAGTGATTTCAGAGATTCAAGTAGTGATGGTAGTAGTGATTCAGAACCTGAACGAGCTCTAAAATACATGGGGAAACAACTCAATCATCTCCATTTATCTTCTGAGCTT
CCTCGTAGAATGGATAGGATATCTTTGCGGGACCAGCTGATTGGATTTCAAGAAGACTGTTCTAGTGACGAGGCTGAATCTTTTAATTCTCAAGGTCAGCTGCTATTTGA
GCATCTTGAACGTGACTTGCCTTATAGTCGTGAACCTTTGGCAGATAAGATATCAGATCTTGCCTTTCAGTTCCCTGAGCTCAAGACATTACGAAGTTGTGATCTATTGC
CGTCCAGCTGGTTTTCTGTGGCATGGTATCCAATTTACAGGATACCGACTGGACCAACGTTAAGGGATCTGGATGCCTGCTTCCTCACCTTTCATTCATTGTATACGCCA
ATGGGAGGGGCACGGAGCATTCAAGGGCCTGTGGTGACGTATCCTAATGAGATAGATGGTATCCCTAAGATGTCCCTACCAGTTTTTGGTCTAGCTTCATACAAGTTTAG
AGGGTCTTTATGGACTCCAAATGGCGGGTTCGAGTGGCAATTGGCAAATTTGCTGTTGCAGTCAGCTGAGGATTGGTTAAGACGACTTCAAGTCAATCACCCTGACTTCA
TCTTCTTCAGCGGACGGTGA
Protein sequence
MLGAGVQFGRRCGDDRFYNPTKARRVHQGRQNDQLRSDVSAGQSPAVKPSTVSSVIREPENGAGCDEQPPKSMAVSAFEPVVSSLSNLERFLQSITPSVPAQYLSKTTMR
GWRTCDVEFQPYFVLGDLWESFKEWSAYGAGVPLVLNHSDSVVQYYVPYLSGIQIYGESLKSSAKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKQLNHLHLSSEL
PRRMDRISLRDQLIGFQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTP
MGGARSIQGPVVTYPNEIDGIPKMSLPVFGLASYKFRGSLWTPNGGFEWQLANLLLQSAEDWLRRLQVNHPDFIFFSGR
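The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the `translate` helper is hypothetical and not part of the database. It translates the first and last codons of the CDS listed above for comparison with the start and end of the protein sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in the conventional T, C, A, G order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence into one-letter amino acids."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First nine codons and last four codons of the CDS shown above.
print(translate("ATGTTGGGTGCAGGGGTGCAGTTTGGT"))
print(translate("AGCGGACGGTGA"))
```

To check the full record, the CDS lines would first need to be joined into a single string with line breaks removed; the trailing `*` in the translation corresponds to the stop codon, which is not part of the protein sequence.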