CuGenDBv2

Moc04g26870 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g26870
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Protein of unknown function (DUF789)
Genome location: chr4:19749417..19752474
RNA-Seq Expression: Moc04g26870
Synteny: Moc04g26870
Gene Ontology terms: NA
InterPro domains: IPR008507 - Protein of unknown function DUF789
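
As a rough illustration of how the genome location field can be read programmatically, the sketch below splits a `chrom:start..end` string into its parts and reports the span length. The helper name `parse_location` and the `Location` type are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive (not stated on the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string (hypothetical helper)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("chr4:19749417..19752474")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under that assumption the gene spans 3058 bp on chr4, which is longer than the 1242-nt CDS listed under Sequences.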


Homology
GenBank top hits (e value, % identity, alignment)
KAG7016733.1 hypothetical protein SDJN02_21843, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value 2.2e-213; identity 89.1%
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT
        MLGAGLQF RGCGDDRFYNPTKARRAHQGRQND+LRRAQSDVSA Q P +KP+TVS+VIRE+E G GCE  +LPKSI +SAFEPVVSSLSNLERFLQSI 
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT

Query:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS
        PSVPAQY SKTTM+GWRTCD E QPYFVLGDLWE+FKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSS KSRQPGEDSDSDFRDSSSDGSS
Subjt:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS

Query:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS
        DSEPERALKYMG  LNHHH+S EL RRMER+SLRDQLIGLQEDCSSDEAES NSQGQLLFEHLERDLPYSREPLADK+SDL+F+FPELKTLRSCDLLPSS
Subjt:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS

Query:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL
        WFSVAWYPIYRIPTGPTLRDLDACFLTFH L TP+GGARSVQGPVVTYPS+IDGIP+M LPVFGLASYKFRGSLWTPNGG+EWQLANSLLQ AE+WLR  
Subjt:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL

Query:  QVNHPDFIFFSRR
         VNHPDFIFFSRR
Subjt:  QVNHPDFIFFSRR

TYK00266.1 DUF789 domain-containing protein [Cucumis melo var. makuwa]; e value 5.5e-212; identity 89.35%
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT
        MLGAGLQF RGCGDDRFYNPTKARR HQGRQ D+LRRAQSDVSAGQS  VKPS VS+VIRE+E G GCE  ELPKSI +S FEPVVSSLSNLERFLQSI 
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT

Query:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS
        PSVPAQYLSKTTM+GWRTCD+EFQPYFVLGDLWE+FKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSS KSRQPGEDSDSDFRDSSSDGSS
Subjt:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS

Query:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS
        DSEPERALKYMGK LNHHH+S EL RRM+ IS RDQLIGLQEDCSSDEAES NSQGQLLFEHLERDLPYSREPLADKISDLAFQFP+LKTLRSCDLLPSS
Subjt:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS

Query:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL
        WFSVAWYPIYRIPTGPTL+DLDACFLTFH L +P GGARSVQ PVVTYPSEIDGIPKM LPVFGLASYKFRGSLWTPNGG+EWQLANSLL  AE+WLR  
Subjt:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL

Query:  QVNHPDFIFFSRR
        QVNHPDFIFFSRR
Subjt:  QVNHPDFIFFSRR

XP_022152290.1 uncharacterized protein LOC111020043 [Momordica charantia]; e value 1.4e-239; identity 100%
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT
        MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT

Query:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS
        PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS
Subjt:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS

Query:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS
        DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS
Subjt:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS

Query:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL
        WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL
Subjt:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL

Query:  QVNHPDFIFFSRR
        QVNHPDFIFFSRR
Subjt:  QVNHPDFIFFSRR

XP_023550272.1 uncharacterized protein LOC111808496 [Cucurbita pepo subsp. pepo]; e value 5.0e-213; identity 89.1%
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT
        MLGAGLQF RGCGDDRFYNPTKARRAHQGRQND+LRRAQSDVSA QSP +KP+TVS+VIRE+E G GCE  ELPKSI +SAFEPVVSSLSNLERFLQSI 
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT

Query:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS
        PSVPAQY SKTT++GWRTCD E QPYFVLGDLWE+FKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSS KSRQPGEDSDSDFRDSSSDGSS
Subjt:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS

Query:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS
        DSEPER LKY G  LNHHH+S EL RRMER+SLRDQLIGLQEDCSSDEAES NSQGQLLFEHLERDLPYSREPLADK+SDLAF+FPELKTLRSCDLLPSS
Subjt:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS

Query:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL
        WFSVAWYPIYRIPTGPTLRDLDACFLTFH L TP+GGARSVQGPVVTYPS+IDGIP+M LPVFGLASYKFRGSLWTPNGG+EWQLANSLLQ AE+WLR  
Subjt:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL

Query:  QVNHPDFIFFSRR
         VNHPDFIFFSRR
Subjt:  QVNHPDFIFFSRR

XP_038874258.1 uncharacterized protein LOC120066989 isoform X1 [Benincasa hispida]; e value 1.4e-218; identity 92.01%
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT
        MLGAGLQFARGCGDDRFYNPTKARRAHQGRQND+LRRAQSDVSAGQSP VKP  VS+VIRE+E G GCE  ELPKSI +SAFEPVVSSLSNLERFLQSI 
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT

Query:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS
        PSVPAQYLSKTTM+GWRTCDVEFQPYFVLGDLWE+FKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSS KSRQPGEDSDSDFRDSSSDGSS
Subjt:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS

Query:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS
        DSEPERALKYMGK LNHHH+S EL RRM+RIS RDQLIGLQEDCSSDEAES NSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS
Subjt:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS

Query:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL
        WFSVAWYPIYRIPTGPTLRDLDACFLTFH L +P+GGARSVQGPVVTYPSEIDGIPKM LPVFGLASYKFRGSLWTPNGG+EWQLANSLLQ AE WLR  
Subjt:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL

Query:  QVNHPDFIFFSRR
        QVNHPDFIFFSRR
Subjt:  QVNHPDFIFFSRR

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CT52 uncharacterized protein LOC103504597; e value 2.9e-211; identity 88.86%
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT
        MLGAGLQF RGCGD RFYNPTKARR HQGRQ D+LRRAQSDVSAGQS  VKPS VS+VIRE+E G GCE  ELPKSI +S FEPVVSSLSNLERFLQSI 
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT

Query:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS
        PSVPAQYLSKTTM+GWRTCD+EFQPYFVLGDLWE+FKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSS KSRQPGEDSDSDFRDSSSDGSS
Subjt:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS

Query:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS
        DSEPERALKYMGK LNHHH+S EL RRM+ IS RDQLIGLQEDCSSDEAES NSQGQLLFEHLERDLPYSREPLADKISDLAFQFP+LKT+RSCDLLPSS
Subjt:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS

Query:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL
        WFSVAWYPIYRIPTGPTL+DLDACFLTFH L +P GGARSVQ PVVTYPSEIDGIPKM LPVFGLASYKFRGSLWTPNGG+EWQLANSLL  AE+WLR  
Subjt:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL

Query:  QVNHPDFIFFSRR
        QVNHPDFIFFSRR
Subjt:  QVNHPDFIFFSRR

A0A5D3BPU4 DUF789 domain-containing protein; e value 2.7e-212; identity 89.35%
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT
        MLGAGLQF RGCGDDRFYNPTKARR HQGRQ D+LRRAQSDVSAGQS  VKPS VS+VIRE+E G GCE  ELPKSI +S FEPVVSSLSNLERFLQSI 
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT

Query:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS
        PSVPAQYLSKTTM+GWRTCD+EFQPYFVLGDLWE+FKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSS KSRQPGEDSDSDFRDSSSDGSS
Subjt:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS

Query:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS
        DSEPERALKYMGK LNHHH+S EL RRM+ IS RDQLIGLQEDCSSDEAES NSQGQLLFEHLERDLPYSREPLADKISDLAFQFP+LKTLRSCDLLPSS
Subjt:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS

Query:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL
        WFSVAWYPIYRIPTGPTL+DLDACFLTFH L +P GGARSVQ PVVTYPSEIDGIPKM LPVFGLASYKFRGSLWTPNGG+EWQLANSLL  AE+WLR  
Subjt:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL

Query:  QVNHPDFIFFSRR
        QVNHPDFIFFSRR
Subjt:  QVNHPDFIFFSRR

A0A6J1DDJ0 uncharacterized protein LOC111020043; e value 6.7e-240; identity 100%
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT
        MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT

Query:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS
        PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS
Subjt:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS

Query:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS
        DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS
Subjt:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS

Query:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL
        WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL
Subjt:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL

Query:  QVNHPDFIFFSRR
        QVNHPDFIFFSRR
Subjt:  QVNHPDFIFFSRR

A0A6J1FG46 uncharacterized protein LOC111445107; e value 3.5e-212; identity 89.1%
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT
        MLGAGLQF RGCGDDRFYNPTKARRAHQGRQND+LRRAQSDVSA QSP +KP+TVS+VIRE+E G GCE  ELP SI +SAFEPVVSSLSNLERFLQSI 
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT

Query:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS
        PSVPAQY SKTTM+GWRTCD E QPYFVLGDLWE+FKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSS KSRQPGEDSDSDFRDSSSDGSS
Subjt:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS

Query:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS
        DSEPERALKYMG  LNHHH+S EL RR ER+SLRDQLIGLQEDC SDEAES NSQGQLLFEHLERDLPYSREPLADK+SDLAF+FPELKTLRSCDLLPSS
Subjt:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS

Query:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL
        WFSVAWYPIYRIPTGPTLRDLDACFLTFH L TP+GGARSVQGPVVTYPS+IDGIP+M LPVFGLASYKFRGSLWTPNGG EWQLANSLLQ AE+WLR  
Subjt:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL

Query:  QVNHPDFIFFSRR
         VNHPDFIFFSRR
Subjt:  QVNHPDFIFFSRR

A0A6J1JV26 uncharacterized protein LOC111489147; e value 1.0e-211; identity 87.89%
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT
        M GAGLQF RGCGDDRFYNPTKARR+HQGRQND+LRR QSDVSA +SP +KP+TVS++IRE+E G GCE  ELPKSI +SAFEPVVSSLSNLERFLQSI 
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT

Query:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS
        PSVPAQY SKTT++GWRTCD E QPYFVLGDLWE+FKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSS KSRQPGEDSDSDFRDSSSDGSS
Subjt:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS

Query:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS
        DSEPERA+KYMG  LNHHH+S EL RRMER+SLRDQLIGLQEDCSSDEAES NSQGQLLFEHLERDLPYSREPLADK+SDLAF+FPELKTLRSCDLLPSS
Subjt:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSS

Query:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL
        WFSVAWYPIYRIPTGPTLRDLDACFLTFH L TP+GGARSVQGPVVTYPS+IDGIP+M LPVFGLASYKFRGSLWTPNGG+EWQLANSLLQ A++WLR  
Subjt:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL

Query:  QVNHPDFIFFSRR
         VNHPDFIFFSRR
Subjt:  QVNHPDFIFFSRR

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G15030.1 Protein of unknown function (DUF789); e value 1.2e-111; identity 58.44%
Query:  ELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSITPSVPAQYLSKTTMRGWRTCDVEFQ-PYFVLGDL
        +L+RAQ DVS G          S+  ++ ENGS   +  + +           +S SN+ERFL S+TPSVPA YLSKT +R     DVE Q PYF+LGD+
Subjt:  ELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSITPSVPAQYLSKTTMRGWRTCDVEFQ-PYFVLGDL

Query:  WEAFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYG--ESLKSSTKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKPLNHHHISLELPRRME
        WE+F EWSAYG GVPL LN++ D V QYYVP LSGIQ+Y   ++L SS ++R+ GE+S+SDFRDSSS+GSS SE ER L Y  +         ++  RM+
Subjt:  WEAFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYG--ESLKSSTKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKPLNHHHISLELPRRME

Query:  RISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFH
        ++SLR +    QED SSD+ E  +SQG+L+FE+LERDLPY REP ADK+SDLA +FPELKTLRSCDLLPSSWFSVAWYPIY+IPTGPTL+DLDACFLT+H
Subjt:  RISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFH

Query:  SLYTPIGGARSVQGPV-VTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLLQVNHPDFIFFSRR
        SL+TP  G     G + V  P E   + KM LPVFGLASYK RGS+WT  GG   QLANSL Q+A+NWLRL QVNHPDFIFF RR
Subjt:  SLYTPIGGARSVQGPV-VTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLLQVNHPDFIFFSRR

AT2G01260.1 Protein of unknown function (DUF789); e value 1.5e-119; identity 58.95%
Query:  MLGAGLQFARG-CGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSI
        MLGAG Q  RG  GDD FY   K RRA+Q  + D+LRRAQSDVS   S A  P                 +Q+L         EP   S SNL+RFL+S+
Subjt:  MLGAGLQFARG-CGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSI

Query:  TPSVPAQYLSKTTMRGWRTCD--VEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSTKSRQPGEDSDSDFRDS
        TPSVPAQ+LSKT +R  R  D   +  PYFVLGD+W++F EWSAYG GVPLVLN++ D V+QYYVP LS IQIY  S  L SS KSR+PG+ SDSDFRDS
Subjt:  TPSVPAQYLSKTTMRGWRTCD--VEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSTKSRQPGEDSDSDFRDS

Query:  SSDGSSDSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSC
        SSD SSDS+ ER                 +  R++ ISLRDQ    QED SSD+ E   SQG+L+FE+LERDLPY REP ADK+ DLA QFPEL TLRSC
Subjt:  SSDGSSDSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSC

Query:  DLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAE
        DLL SSWFSVAWYPIYRIPTGPTL+DLDACFLT+HSL+T  GG  S Q   +T P E +   KM LPVFGLASYKFRGSLWTP GG E QL NSL Q+A+
Subjt:  DLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAE

Query:  NWLRLLQVNHPDFIFFSRR
         WL    V+HPDF+FF RR
Subjt:  NWLRLLQVNHPDFIFFSRR

AT2G01260.2 Protein of unknown function (DUF789); e value 4.2e-93; identity 58.6%
Query:  MLGAGLQFARG-CGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSI
        MLGAG Q  RG  GDD FY   K RRA+Q  + D+LRRAQSDVS   S A  P                 +Q+L         EP   S SNL+RFL+S+
Subjt:  MLGAGLQFARG-CGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSI

Query:  TPSVPAQYLSKTTMRGWRTCD--VEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSTKSRQPGEDSDSDFRDS
        TPSVPAQ+LSKT +R  R  D   +  PYFVLGD+W++F EWSAYG GVPLVLN++ D V+QYYVP LS IQIY  S  L SS KSR+PG+ SDSDFRDS
Subjt:  TPSVPAQYLSKTTMRGWRTCD--VEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSTKSRQPGEDSDSDFRDS

Query:  SSDGSSDSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSC
        SSD SSDS+ ER                 +  R++ ISLRDQ    QED SSD+ E   SQG+L+FE+LERDLPY REP ADK+ DLA QFPEL TLRSC
Subjt:  SSDGSSDSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSC

Query:  DLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGG
        DLL SSWFSVAWYPIYRIPTGPTL+DLDACFLT+HSL+T  GG
Subjt:  DLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGG

AT4G16100.1 Protein of unknown function (DUF789); e value 1.2e-87; identity 46.15%
Query:  GDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVV-SSLSNLERFLQSITPSVPAQYLSKT
        G++RFYNP   R+  Q R+   L   + +    ++  +    +    +E +    C   +      VS+      ++ SNL RFL   TP V  Q+L  T
Subjt:  GDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVV-SSLSNLERFLQSITPSVPAQYLSKT

Query:  TMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDF-RDSSSDGSSDSEPERALKY
        + +GWRT + E++PYF+L DLW++F+EWSAYG GVPL+LN  DSVVQYYVPYLSGIQ+Y +  ++ T  R+ GE+SD D  RD SSDGS+D         
Subjt:  TMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDF-RDSSSDGSSDSEPERALKY

Query:  MGKPLNHHHISLELPRRMERISLRDQ-LIGLQEDCSSDEAE-SFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWFSVAWYP
                    EL + + R SL ++  IG     SSDE+E S NS G+L+FE+LE  +P+ REPL DKIS+L+ QFP L+T RSCDL PSSW SVAWYP
Subjt:  MGKPLNHHHISLELPRRMERISLRDQ-LIGLQEDCSSDEAE-SFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWFSVAWYP

Query:  IYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGF-EWQLANSLLQSAENWLRLLQVNHPDF
        IYRIP G +L++LDACFLTFHSL TP  G  + +G      S+     K+PLP FGLASYKF+ S W+P     E Q   +LL++AE WLR L+V  PDF
Subjt:  IYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGF-EWQLANSLLQSAENWLRLLQVNHPDF

Query:  IFF
          F
Subjt:  IFF

AT5G49220.1 Protein of unknown function (DUF789); e value 1.8e-80; identity 45.14%
Query:  GLQFARGC--GDDRFYNPTKARRAHQGRQ-----NDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFE-------------P
        G+  AR    G++RFYNP   RR  Q  Q      ++ RR   D         K +TV+   R +  G G  E +    + VS  E              
Subjt:  GLQFARGC--GDDRFYNPTKARRAHQGRQ-----NDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFE-------------P

Query:  VVSSLSNLERFLQSITPSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGV-----PLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTK
        V+S  SNL+RFL+  TP VPA+     +    +T + +   YFVL DLWE+F EWSAYGAGV     PL ++ +DS VQYYVPYLSGIQ+Y + LK   K
Subjt:  VVSSLSNLERFLQSITPSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGV-----PLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTK

Query:  SRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKIS
         R P  D+     + SS+GSS+S        +G+              + RISL+DQ   +    SS EAE  N QG+LLFE+LE + P+ REPLA+KIS
Subjt:  SRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKIS

Query:  DLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNG
        DLA + PEL T RSCDLLPSSW SV+WYPIYRIP GPTL++LDACFLTFHSL T     +S  G   + PS      K+PLP FGLASYK + S+W  N 
Subjt:  DLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNG

Query:  GFEWQLANSLLQSAENWLRLLQVNHPDFIFFS
          E Q   SLLQ+A+ WL+ LQV+HPD+ FF+
Subjt:  GFEWQLANSLLQSAENWLRLLQVNHPDFIFFS
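
The % identity values listed with each hit can be related to the alignment blocks above: in BLAST-style output, a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts identities for a single block; the function name `identity_from_midline` is hypothetical, and the block length is passed in explicitly because trailing spaces on the middle line may have been lost in extraction.

```python
def identity_from_midline(midline: str, aligned_length: int) -> float:
    """Percent identity for one alignment block (hypothetical helper).

    Letters in the midline are identical residues; '+' and spaces are
    not counted. aligned_length is the number of aligned columns.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Final block of the KAG7016733.1 alignment: 12 of 13 columns identical.
query = "QVNHPDFIFFSRR"
midline = " VNHPDFIFFSRR"
print(round(identity_from_midline(midline, len(query)), 2))
```

The figure reported for a hit covers the whole alignment, so a full calculation would sum identical residues and column counts over all blocks before dividing.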


Sequences
CDS sequence
ATGTTGGGTGCAGGGTTGCAGTTTGCCCGTGGTTGTGGAGACGATAGGTTTTACAATCCGACGAAGGCTCGTAGGGCGCATCAGGGCCGCCAAAATGACGAGCTCCGGAG
AGCTCAGAGCGATGTTTCTGCCGGCCAATCCCCTGCGGTTAAACCGAGCACGGTGTCCGCCGTGATTAGAGAATCCGAAAATGGGTCTGGGTGTGAAGAGCAAGAGCTCC
CAAAATCGATTCCGGTGTCGGCTTTTGAGCCAGTGGTGTCGTCGCTGAGTAATCTGGAGCGGTTCTTGCAGTCTATCACGCCATCTGTTCCTGCGCAGTACCTCTCAAAG
ACAACGATGAGGGGGTGGAGGACTTGTGATGTGGAATTTCAACCATACTTTGTTCTTGGTGATTTGTGGGAGGCTTTTAAGGAATGGAGTGCTTATGGTGCAGGTGTGCC
TCTTGTGTTAAACGACAGTGACAGTGTTGTCCAGTATTATGTACCATATCTATCCGGTATACAGATATATGGTGAATCCTTGAAGTCCTCTACAAAGTCAAGGCAACCAG
GTGAGGACAGTGATAGTGATTTCAGAGATTCAAGTAGTGATGGTAGTAGTGATTCAGAACCTGAACGGGCACTAAAATACATGGGGAAACCACTCAATCATCACCATATA
TCGTTGGAGCTTCCTCGTAGAATGGAAAGGATATCGTTGCGGGACCAGCTGATTGGACTTCAAGAAGACTGTTCCAGTGATGAGGCGGAATCTTTTAATTCTCAAGGCCA
GCTGCTATTTGAGCATCTTGAACGTGACTTGCCTTATAGTCGTGAACCTTTGGCAGATAAGATATCAGATCTTGCCTTTCAGTTCCCTGAGCTCAAAACGTTACGAAGTT
GTGATCTATTGCCTTCCAGCTGGTTTTCTGTGGCATGGTATCCAATTTATAGGATACCAACTGGACCAACATTAAGGGATCTAGATGCCTGTTTCCTCACCTTTCATTCT
TTGTATACGCCAATTGGAGGGGCACGTAGTGTTCAAGGCCCTGTAGTAACATATCCTAGTGAGATAGATGGTATCCCTAAGATGCCCCTACCAGTTTTCGGTCTAGCTTC
ATACAAGTTTAGAGGGTCTTTGTGGACTCCAAATGGCGGATTTGAGTGGCAATTGGCAAACTCACTTTTGCAGTCTGCTGAGAATTGGTTAAGACTGCTTCAAGTAAATC
ACCCTGACTTCATCTTCTTCAGCCGGCGGTGA
mRNA sequence
ATGTTGGGTGCAGGGTTGCAGTTTGCCCGTGGTTGTGGAGACGATAGGTTTTACAATCCGACGAAGGCTCGTAGGGCGCATCAGGGCCGCCAAAATGACGAGCTCCGGAG
AGCTCAGAGCGATGTTTCTGCCGGCCAATCCCCTGCGGTTAAACCGAGCACGGTGTCCGCCGTGATTAGAGAATCCGAAAATGGGTCTGGGTGTGAAGAGCAAGAGCTCC
CAAAATCGATTCCGGTGTCGGCTTTTGAGCCAGTGGTGTCGTCGCTGAGTAATCTGGAGCGGTTCTTGCAGTCTATCACGCCATCTGTTCCTGCGCAGTACCTCTCAAAG
ACAACGATGAGGGGGTGGAGGACTTGTGATGTGGAATTTCAACCATACTTTGTTCTTGGTGATTTGTGGGAGGCTTTTAAGGAATGGAGTGCTTATGGTGCAGGTGTGCC
TCTTGTGTTAAACGACAGTGACAGTGTTGTCCAGTATTATGTACCATATCTATCCGGTATACAGATATATGGTGAATCCTTGAAGTCCTCTACAAAGTCAAGGCAACCAG
GTGAGGACAGTGATAGTGATTTCAGAGATTCAAGTAGTGATGGTAGTAGTGATTCAGAACCTGAACGGGCACTAAAATACATGGGGAAACCACTCAATCATCACCATATA
TCGTTGGAGCTTCCTCGTAGAATGGAAAGGATATCGTTGCGGGACCAGCTGATTGGACTTCAAGAAGACTGTTCCAGTGATGAGGCGGAATCTTTTAATTCTCAAGGCCA
GCTGCTATTTGAGCATCTTGAACGTGACTTGCCTTATAGTCGTGAACCTTTGGCAGATAAGATATCAGATCTTGCCTTTCAGTTCCCTGAGCTCAAAACGTTACGAAGTT
GTGATCTATTGCCTTCCAGCTGGTTTTCTGTGGCATGGTATCCAATTTATAGGATACCAACTGGACCAACATTAAGGGATCTAGATGCCTGTTTCCTCACCTTTCATTCT
TTGTATACGCCAATTGGAGGGGCACGTAGTGTTCAAGGCCCTGTAGTAACATATCCTAGTGAGATAGATGGTATCCCTAAGATGCCCCTACCAGTTTTCGGTCTAGCTTC
ATACAAGTTTAGAGGGTCTTTGTGGACTCCAAATGGCGGATTTGAGTGGCAATTGGCAAACTCACTTTTGCAGTCTGCTGAGAATTGGTTAAGACTGCTTCAAGTAAATC
ACCCTGACTTCATCTTCTTCAGCCGGCGGTGA
Protein sequence
MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSITPSVPAQYLSK
TTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKPLNHHHI
SLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKISDLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHS
LYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLLQVNHPDFIFFSRR
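
The CDS and protein sequences above should be related by translation with the standard genetic code. A minimal sketch of that check follows, using only the Python standard library; the function name `translate` is hypothetical, and the code assumes an unambiguous, in-frame DNA sequence (a codon containing `N` or another ambiguity code would raise `KeyError`).

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last five codons of the CDS listed above.
print(translate("ATGTTGGGTGCAGGGTTGCAGTTTGCCCGT"))
print(translate("TTCAGCCGGCGGTGA"))
```

Passing the full CDS text (line breaks included) to `translate` and comparing the result with the listed protein sequence would be one way to confirm the two records agree.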