; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g0927 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g0927
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionProtein of unknown function (DUF789)
Genome locationMC04:16977088..16982155
RNA-Seq ExpressionMC04g0927
SyntenyMC04g0927
Gene Ontology termsNA
InterPro domainsIPR008507 - Protein of unknown function DUF789


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7016733.1 hypothetical protein SDJN02_21843, partial [Cucurbita argyrosperma subsp. argyrosperma]2.07e-26888.62Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT
        MLGAGLQF RGCGDDRFYNPTKARRAHQGRQND+LRRAQSDVSA Q P +KP+TVS+VIRE+E G GCE+  LPKSI +SAFEPVVSSLSNLERFLQSI 
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT

Query:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS
        PSVPAQY SKTTM+GWRTCD E QPYFVLGDLWE+FKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSS KSRQPGEDSDSDFRDSSSDGSS
Subjt:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS

Query:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSS
        DSEPERALKYMG  LNHHH+S EL RRMER+SLRDQLIGLQEDCSSDEAES NSQGQLLFEHLERDLPYSREPLADK ++L+F+FPELKTLRSCDLLPSS
Subjt:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSS

Query:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL
        WFSVAWYPIYRIPTGPTLRDLDACFLTFH L TP+GGARSVQGPVVTYPS+IDGIP+M LPVFGLASYKFRGSLWTPNGG+EWQLANSLLQ AE+WLR  
Subjt:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL

Query:  QVNHPDFIFFSRR
         VNHPDFIFFSRR
Subjt:  QVNHPDFIFFSRR

TYK00266.1 DUF789 domain-containing protein [Cucumis melo var. makuwa]1.98e-26688.62Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT
        MLGAGLQF RGCGDDRFYNPTKARR HQGRQ D+LRRAQSDVSAGQS  VKPS VS+VIRE+E G GCEE  LPKSI +S FEPVVSSLSNLERFLQSI 
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT

Query:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS
        PSVPAQYLSKTTM+GWRTCD+EFQPYFVLGDLWE+FKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSS KSRQPGEDSDSDFRDSSSDGSS
Subjt:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS

Query:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSS
        DSEPERALKYMGK LNHHH+S EL RRM+ IS RDQLIGLQEDCSSDEAES NSQGQLLFEHLERDLPYSREPLADK ++LAFQFP+LKTLRSCDLLPSS
Subjt:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSS

Query:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL
        WFSVAWYPIYRIPTGPTL+DLDACFLTFH L +P GGARSVQ PVVTYPSEIDGIPKM LPVFGLASYKFRGSLWTPNGG+EWQLANSLL  AE+WLR  
Subjt:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL

Query:  QVNHPDFIFFSRR
        QVNHPDFIFFSRR
Subjt:  QVNHPDFIFFSRR

XP_022152290.1 uncharacterized protein LOC111020043 [Momordica charantia]1.14e-30299.27Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT
        MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT

Query:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS
        PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS
Subjt:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS

Query:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSS
        DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADK ++LAFQFPELKTLRSCDLLPSS
Subjt:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSS

Query:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL
        WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL
Subjt:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL

Query:  QVNHPDFIFFSRR
        QVNHPDFIFFSRR
Subjt:  QVNHPDFIFFSRR

XP_023550272.1 uncharacterized protein LOC111808496 [Cucurbita pepo subsp. pepo]5.93e-26888.62Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT
        MLGAGLQF RGCGDDRFYNPTKARRAHQGRQND+LRRAQSDVSA QSP +KP+TVS+VIRE+E G GCEE  LPKSI +SAFEPVVSSLSNLERFLQSI 
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT

Query:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS
        PSVPAQY SKTT++GWRTCD E QPYFVLGDLWE+FKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSS KSRQPGEDSDSDFRDSSSDGSS
Subjt:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS

Query:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSS
        DSEPER LKY G  LNHHH+S EL RRMER+SLRDQLIGLQEDCSSDEAES NSQGQLLFEHLERDLPYSREPLADK ++LAF+FPELKTLRSCDLLPSS
Subjt:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSS

Query:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL
        WFSVAWYPIYRIPTGPTLRDLDACFLTFH L TP+GGARSVQGPVVTYPS+IDGIP+M LPVFGLASYKFRGSLWTPNGG+EWQLANSLLQ AE+WLR  
Subjt:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL

Query:  QVNHPDFIFFSRR
         VNHPDFIFFSRR
Subjt:  QVNHPDFIFFSRR

XP_038874258.1 uncharacterized protein LOC120066989 isoform X1 [Benincasa hispida]4.09e-27591.28Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT
        MLGAGLQFARGCGDDRFYNPTKARRAHQGRQND+LRRAQSDVSAGQSP VKP  VS+VIRE+E G GCEE  LPKSI +SAFEPVVSSLSNLERFLQSI 
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT

Query:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS
        PSVPAQYLSKTTM+GWRTCDVEFQPYFVLGDLWE+FKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSS KSRQPGEDSDSDFRDSSSDGSS
Subjt:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS

Query:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSS
        DSEPERALKYMGK LNHHH+S EL RRM+RIS RDQLIGLQEDCSSDEAES NSQGQLLFEHLERDLPYSREPLADK ++LAFQFPELKTLRSCDLLPSS
Subjt:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSS

Query:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL
        WFSVAWYPIYRIPTGPTLRDLDACFLTFH L +P+GGARSVQGPVVTYPSEIDGIPKM LPVFGLASYKFRGSLWTPNGG+EWQLANSLLQ AE WLR  
Subjt:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL

Query:  QVNHPDFIFFSRR
        QVNHPDFIFFSRR
Subjt:  QVNHPDFIFFSRR

TrEMBL top hitse value%identityAlignment
A0A1S3CT52 uncharacterized protein LOC1035045972.25e-26588.14Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT
        MLGAGLQF RGCGD RFYNPTKARR HQGRQ D+LRRAQSDVSAGQS  VKPS VS+VIRE+E G GCEE  LPKSI +S FEPVVSSLSNLERFLQSI 
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT

Query:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS
        PSVPAQYLSKTTM+GWRTCD+EFQPYFVLGDLWE+FKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSS KSRQPGEDSDSDFRDSSSDGSS
Subjt:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS

Query:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSS
        DSEPERALKYMGK LNHHH+S EL RRM+ IS RDQLIGLQEDCSSDEAES NSQGQLLFEHLERDLPYSREPLADK ++LAFQFP+LKT+RSCDLLPSS
Subjt:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSS

Query:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL
        WFSVAWYPIYRIPTGPTL+DLDACFLTFH L +P GGARSVQ PVVTYPSEIDGIPKM LPVFGLASYKFRGSLWTPNGG+EWQLANSLL  AE+WLR  
Subjt:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL

Query:  QVNHPDFIFFSRR
        QVNHPDFIFFSRR
Subjt:  QVNHPDFIFFSRR

A0A5D3BPU4 DUF789 domain-containing protein9.59e-26788.62Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT
        MLGAGLQF RGCGDDRFYNPTKARR HQGRQ D+LRRAQSDVSAGQS  VKPS VS+VIRE+E G GCEE  LPKSI +S FEPVVSSLSNLERFLQSI 
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT

Query:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS
        PSVPAQYLSKTTM+GWRTCD+EFQPYFVLGDLWE+FKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGES KSS KSRQPGEDSDSDFRDSSSDGSS
Subjt:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS

Query:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSS
        DSEPERALKYMGK LNHHH+S EL RRM+ IS RDQLIGLQEDCSSDEAES NSQGQLLFEHLERDLPYSREPLADK ++LAFQFP+LKTLRSCDLLPSS
Subjt:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSS

Query:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL
        WFSVAWYPIYRIPTGPTL+DLDACFLTFH L +P GGARSVQ PVVTYPSEIDGIPKM LPVFGLASYKFRGSLWTPNGG+EWQLANSLL  AE+WLR  
Subjt:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL

Query:  QVNHPDFIFFSRR
        QVNHPDFIFFSRR
Subjt:  QVNHPDFIFFSRR

A0A6J1DDJ0 uncharacterized protein LOC1110200435.54e-30399.27Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT
        MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT

Query:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS
        PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS
Subjt:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS

Query:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSS
        DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADK ++LAFQFPELKTLRSCDLLPSS
Subjt:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSS

Query:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL
        WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL
Subjt:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL

Query:  QVNHPDFIFFSRR
        QVNHPDFIFFSRR
Subjt:  QVNHPDFIFFSRR

A0A6J1FG46 uncharacterized protein LOC1114451079.59e-26788.62Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT
        MLGAGLQF RGCGDDRFYNPTKARRAHQGRQND+LRRAQSDVSA QSP +KP+TVS+VIRE+E G GCEE  LP SI +SAFEPVVSSLSNLERFLQSI 
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT

Query:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS
        PSVPAQY SKTTM+GWRTCD E QPYFVLGDLWE+FKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSS KSRQPGEDSDSDFRDSSSDGSS
Subjt:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS

Query:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSS
        DSEPERALKYMG  LNHHH+S EL RR ER+SLRDQLIGLQEDC SDEAES NSQGQLLFEHLERDLPYSREPLADK ++LAF+FPELKTLRSCDLLPSS
Subjt:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSS

Query:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL
        WFSVAWYPIYRIPTGPTLRDLDACFLTFH L TP+GGARSVQGPVVTYPS+IDGIP+M LPVFGLASYKFRGSLWTPNGG EWQLANSLLQ AE+WLR  
Subjt:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL

Query:  QVNHPDFIFFSRR
         VNHPDFIFFSRR
Subjt:  QVNHPDFIFFSRR

A0A6J1JV26 uncharacterized protein LOC1114891473.90e-26687.41Show/hide
Query:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT
        M GAGLQF RGCGDDRFYNPTKARR+HQGRQND+LRR QSDVSA +SP +KP+TVS++IRE+E G GCEE  LPKSI +SAFEPVVSSLSNLERFLQSI 
Subjt:  MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSIT

Query:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS
        PSVPAQY SKTT++GWRTCD E QPYFVLGDLWE+FKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSS KSRQPGEDSDSDFRDSSSDGSS
Subjt:  PSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSS

Query:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSS
        DSEPERA+KYMG  LNHHH+S EL RRMER+SLRDQLIGLQEDCSSDEAES NSQGQLLFEHLERDLPYSREPLADK ++LAF+FPELKTLRSCDLLPSS
Subjt:  DSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSS

Query:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL
        WFSVAWYPIYRIPTGPTLRDLDACFLTFH L TP+GGARSVQGPVVTYPS+IDGIP+M LPVFGLASYKFRGSLWTPNGG+EWQLANSLLQ A++WLR  
Subjt:  WFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLL

Query:  QVNHPDFIFFSRR
         VNHPDFIFFSRR
Subjt:  QVNHPDFIFFSRR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15030.1 Protein of unknown function (DUF789)3.2e-10957.92Show/hide
Query:  ELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSITPSVPAQYLSKTTMRGWRTCDVEFQ-PYFVLGDL
        +L+RAQ DVS G          S+  ++ ENGS   +  + +           +S SN+ERFL S+TPSVPA YLSKT +R     DVE Q PYF+LGD+
Subjt:  ELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSITPSVPAQYLSKTTMRGWRTCDVEFQ-PYFVLGDL

Query:  WEAFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYG--ESLKSSTKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKPLNHHHISLELPRRME
        WE+F EWSAYG GVPL LN++ D V QYYVP LSGIQ+Y   ++L SS ++R+ GE+S+SDFRDSSS+GSS SE ER L Y  +         ++  RM+
Subjt:  WEAFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYG--ESLKSSTKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKPLNHHHISLELPRRME

Query:  RISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFH
        ++SLR +    QED SSD+ E  +SQG+L+FE+LERDLPY REP ADK ++LA +FPELKTLRSCDLLPSSWFSVAWYPIY+IPTGPTL+DLDACFLT+H
Subjt:  RISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFH

Query:  SLYTPIGGARSVQGPV-VTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLLQVNHPDFIFFSRR
        SL+TP  G     G + V  P E   + KM LPVFGLASYK RGS+WT  GG   QLANSL Q+A+NWLRL QVNHPDFIFF RR
Subjt:  SLYTPIGGARSVQGPV-VTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLLQVNHPDFIFFSRR

AT2G01260.1 Protein of unknown function (DUF789)1.4e-11758.71Show/hide
Query:  MLGAGLQFARG-CGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSI
        MLGAG Q  RG  GDD FY   K RRA+Q  + D+LRRAQSDVS   S A  P                 +Q+L         EP   S SNL+RFL+S+
Subjt:  MLGAGLQFARG-CGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSI

Query:  TPSVPAQYLSKTTMRGWRTCD--VEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSTKSRQPGEDSDSDFRDS
        TPSVPAQ+LSKT +R  R  D   +  PYFVLGD+W++F EWSAYG GVPLVLN++ D V+QYYVP LS IQIY  S  L SS KSR+PG+ SDSDFRDS
Subjt:  TPSVPAQYLSKTTMRGWRTCD--VEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSTKSRQPGEDSDSDFRDS

Query:  SSDGSSDSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKA-NLAFQFPELKTLRSC
        SSD SSDS+ ER                 +  R++ ISLRDQ    QED SSD+ E   SQG+L+FE+LERDLPY REP ADK  +LA QFPEL TLRSC
Subjt:  SSDGSSDSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKA-NLAFQFPELKTLRSC

Query:  DLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAE
        DLL SSWFSVAWYPIYRIPTGPTL+DLDACFLT+HSL+T  GG  S Q   +T P E +   KM LPVFGLASYKFRGSLWTP GG E QL NSL Q+A+
Subjt:  DLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAE

Query:  NWLRLLQVNHPDFIFFSRR
         WL    V+HPDF+FF RR
Subjt:  NWLRLLQVNHPDFIFFSRR

AT2G01260.2 Protein of unknown function (DUF789)5.1e-9158.31Show/hide
Query:  MLGAGLQFARG-CGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSI
        MLGAG Q  RG  GDD FY   K RRA+Q  + D+LRRAQSDVS   S A  P                 +Q+L         EP   S SNL+RFL+S+
Subjt:  MLGAGLQFARG-CGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSI

Query:  TPSVPAQYLSKTTMRGWRTCD--VEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSTKSRQPGEDSDSDFRDS
        TPSVPAQ+LSKT +R  R  D   +  PYFVLGD+W++F EWSAYG GVPLVLN++ D V+QYYVP LS IQIY  S  L SS KSR+PG+ SDSDFRDS
Subjt:  TPSVPAQYLSKTTMRGWRTCD--VEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDS-DSVVQYYVPYLSGIQIYGES--LKSSTKSRQPGEDSDSDFRDS

Query:  SSDGSSDSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKA-NLAFQFPELKTLRSC
        SSD SSDS+ ER                 +  R++ ISLRDQ    QED SSD+ E   SQG+L+FE+LERDLPY REP ADK  +LA QFPEL TLRSC
Subjt:  SSDGSSDSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKA-NLAFQFPELKTLRSC

Query:  DLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGG
        DLL SSWFSVAWYPIYRIPTGPTL+DLDACFLT+HSL+T  GG
Subjt:  DLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGG

AT4G16100.1 Protein of unknown function (DUF789)3.8e-8645.91Show/hide
Query:  GDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVV-SSLSNLERFLQSITPSVPAQYLSKT
        G++RFYNP   R+  Q R+   L   + +    ++  +    +    +E +    C   +      VS+      ++ SNL RFL   TP V  Q+L  T
Subjt:  GDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVV-SSLSNLERFLQSITPSVPAQYLSKT

Query:  TMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDF-RDSSSDGSSDSEPERALKY
        + +GWRT + E++PYF+L DLW++F+EWSAYG GVPL+LN  DSVVQYYVPYLSGIQ+Y +  ++ T  R+ GE+SD D  RD SSDGS+D         
Subjt:  TMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDF-RDSSSDGSSDSEPERALKY

Query:  MGKPLNHHHISLELPRRMERISLRDQ-LIGLQEDCSSDEAE-SFNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYP
                    EL + + R SL ++  IG     SSDE+E S NS G+L+FE+LE  +P+ REPL DK +NL+ QFP L+T RSCDL PSSW SVAWYP
Subjt:  MGKPLNHHHISLELPRRMERISLRDQ-LIGLQEDCSSDEAE-SFNSQGQLLFEHLERDLPYSREPLADK-ANLAFQFPELKTLRSCDLLPSSWFSVAWYP

Query:  IYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGF-EWQLANSLLQSAENWLRLLQVNHPDF
        IYRIP G +L++LDACFLTFHSL TP  G  + +G      S+     K+PLP FGLASYKF+ S W+P     E Q   +LL++AE WLR L+V  PDF
Subjt:  IYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGF-EWQLANSLLQSAENWLRLLQVNHPDF

Query:  IFF
          F
Subjt:  IFF

AT5G49220.1 Protein of unknown function (DUF789)8.5e-7844.44Show/hide
Query:  GLQFARGC--GDDRFYNPTKARRAHQGRQ-----NDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFE-------------P
        G+  AR    G++RFYNP   RR  Q  Q      ++ RR   D         K +TV+   R +  G G  E +    + VS  E              
Subjt:  GLQFARGC--GDDRFYNPTKARRAHQGRQ-----NDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFE-------------P

Query:  VVSSLSNLERFLQSITPSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGV-----PLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTK
        V+S  SNL+RFL+  TP VPA+     +    +T + +   YFVL DLWE+F EWSAYGAGV     PL ++ +DS VQYYVPYLSGIQ+Y + LK   K
Subjt:  VVSSLSNLERFLQSITPSVPAQYLSKTTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGV-----PLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTK

Query:  SRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADK-A
         R P  D+     + SS+GSS+S        +G+              + RISL+DQ   +    SS EAE  N QG+LLFE+LE + P+ REPLA+K +
Subjt:  SRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKPLNHHHISLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADK-A

Query:  NLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNG
        +LA + PEL T RSCDLLPSSW SV+WYPIYRIP GPTL++LDACFLTFHSL T     +S  G   + PS      K+PLP FGLASYK + S+W  N 
Subjt:  NLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHSLYTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNG

Query:  GFEWQLANSLLQSAENWLRLLQVNHPDFIFFS
          E Q   SLLQ+A+ WL+ LQV+HPD+ FF+
Subjt:  GFEWQLANSLLQSAENWLRLLQVNHPDFIFFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGGGTGCAGGGTTGCAGTTTGCCCGTGGTTGTGGAGACGATAGGTTTTACAATCCGACGAAGGCTCGTAGGGCGCATCAGGGCCGCCAAAATGACGAGCTCCGGAG
AGCTCAGAGCGATGTTTCTGCCGGCCAATCCCCTGCGGTTAAACCGAGCACGGTGTCCGCCGTGATTAGAGAATCCGAAAATGGGTCTGGGTGTGAAGAGCAAGAGCTCC
CAAAATCGATTCCGGTGTCGGCTTTTGAGCCAGTGGTGTCGTCGCTGAGTAATCTGGAGCGGTTCTTGCAGTCTATCACGCCATCTGTTCCTGCGCAGTACCTCTCAAAG
ACAACGATGAGGGGGTGGAGGACTTGTGATGTGGAATTTCAACCATACTTTGTTCTTGGTGATTTGTGGGAGGCTTTTAAGGAATGGAGTGCTTATGGTGCAGGTGTGCC
TCTTGTGTTAAACGACAGTGACAGTGTTGTCCAGTATTATGTACCATATCTATCCGGTATACAGATATATGGTGAATCCTTGAAGTCCTCTACAAAGTCAAGGCAACCAG
GTGAGGACAGTGATAGTGATTTCAGAGATTCAAGTAGTGATGGTAGTAGTGATTCAGAACCTGAACGGGCACTAAAATACATGGGGAAACCACTCAATCATCACCATATA
TCGTTGGAGCTTCCTCGTAGAATGGAAAGGATATCGTTGCGGGACCAGCTGATTGGACTTCAAGAAGACTGTTCCAGTGATGAGGCGGAATCTTTTAATTCTCAAGGCCA
GCTGCTATTTGAGCATCTTGAACGTGACTTGCCTTATAGTCGTGAACCTTTGGCAGATAAGGCAAATCTTGCCTTTCAGTTCCCTGAGCTCAAAACGTTACGAAGTTGTG
ATCTATTGCCTTCCAGCTGGTTTTCTGTGGCATGGTATCCAATTTATAGGATACCAACTGGACCAACATTAAGGGATCTAGATGCCTGTTTCCTCACCTTTCATTCTTTG
TATACGCCAATTGGAGGGGCACGTAGTGTTCAAGGCCCTGTAGTAACATATCCTAGTGAGATAGATGGTATCCCTAAGATGCCCCTACCAGTTTTCGGTCTAGCTTCATA
CAAGTTTAGAGGGTCTTTGTGGACTCCAAATGGCGGATTTGAGTGGCAATTGGCAAACTCACTTTTGCAGTCTGCTGAGAATTGGTTAAGACTGCTTCAAGTAAATCACC
CTGACTTCATCTTCTTCAGCCGGCGGTGA
mRNA sequenceShow/hide mRNA sequence
AAAAAAACAAAAAGATAAAACAAGTATGTAACACTTTTATGGCAGGGGTATAATTGTCTTTCCATATAAAAGCAGCTAGTCCCAAATTCCCAATTTAATTAGGGCGCCTC
TTCTGACTGCTTGAAGACAGTTGGTGAAGACCATGATATTATCCATTAAATATATATCTTTTTCTCCAAAAAGATAAAAATAAAAATAAAATGATATTAAATTAATTAGT
GGCTATTTTGTTCCTTGCAAAAACCAGCGCACTGGTTCTTCTCCATCGCTCTCTCTCTCCCTGAATCGCTTCGTTTTTCTTCCATTGCTCTCTCTCTCTTTATCTCTTTC
TCTCTCCTCTGTCTAATTCGACCCTTTTGAAACAAACCCTTTTGCTTGTTTAGATCCATTGCGTCTCAATCTGTGTGAGATTTCGCGTTATTCAAATCATTTTCTCCCAT
TCGATTCACCGATTTTGAAAACCTACAGCGGGCTCTGTTTTCGTTTTCTTCTCTTGCGATTTGGCAGCGATTGTCTGTCGAATTCAATCCTGCAATCGCCGTTTCTGTGC
GCAGGGATTTGATATTGTTCATCCGATCAGTGTCCGGCGATTTCGTTGACTTTTATTGAGGATTGTAGGACTTTCGATTGAGATTTCTGAGGCTGTTTTCGGTTTGGTTT
GGGAAATAGAATGTTGGGTGCAGGGTTGCAGTTTGCCCGTGGTTGTGGAGACGATAGGTTTTACAATCCGACGAAGGCTCGTAGGGCGCATCAGGGCCGCCAAAATGACG
AGCTCCGGAGAGCTCAGAGCGATGTTTCTGCCGGCCAATCCCCTGCGGTTAAACCGAGCACGGTGTCCGCCGTGATTAGAGAATCCGAAAATGGGTCTGGGTGTGAAGAG
CAAGAGCTCCCAAAATCGATTCCGGTGTCGGCTTTTGAGCCAGTGGTGTCGTCGCTGAGTAATCTGGAGCGGTTCTTGCAGTCTATCACGCCATCTGTTCCTGCGCAGTA
CCTCTCAAAGACAACGATGAGGGGGTGGAGGACTTGTGATGTGGAATTTCAACCATACTTTGTTCTTGGTGATTTGTGGGAGGCTTTTAAGGAATGGAGTGCTTATGGTG
CAGGTGTGCCTCTTGTGTTAAACGACAGTGACAGTGTTGTCCAGTATTATGTACCATATCTATCCGGTATACAGATATATGGTGAATCCTTGAAGTCCTCTACAAAGTCA
AGGCAACCAGGTGAGGACAGTGATAGTGATTTCAGAGATTCAAGTAGTGATGGTAGTAGTGATTCAGAACCTGAACGGGCACTAAAATACATGGGGAAACCACTCAATCA
TCACCATATATCGTTGGAGCTTCCTCGTAGAATGGAAAGGATATCGTTGCGGGACCAGCTGATTGGACTTCAAGAAGACTGTTCCAGTGATGAGGCGGAATCTTTTAATT
CTCAAGGCCAGCTGCTATTTGAGCATCTTGAACGTGACTTGCCTTATAGTCGTGAACCTTTGGCAGATAAGGCAAATCTTGCCTTTCAGTTCCCTGAGCTCAAAACGTTA
CGAAGTTGTGATCTATTGCCTTCCAGCTGGTTTTCTGTGGCATGGTATCCAATTTATAGGATACCAACTGGACCAACATTAAGGGATCTAGATGCCTGTTTCCTCACCTT
TCATTCTTTGTATACGCCAATTGGAGGGGCACGTAGTGTTCAAGGCCCTGTAGTAACATATCCTAGTGAGATAGATGGTATCCCTAAGATGCCCCTACCAGTTTTCGGTC
TAGCTTCATACAAGTTTAGAGGGTCTTTGTGGACTCCAAATGGCGGATTTGAGTGGCAATTGGCAAACTCACTTTTGCAGTCTGCTGAGAATTGGTTAAGACTGCTTCAA
GTAAATCACCCTGACTTCATCTTCTTCAGCCGGCGGTGAAGTCCTTGCAACATCTATGAAGCCTAAAGGTGGGAATGAGGATATCGTAGCTCGGTACCGTGTTGTAATTT
GCTCTTTTGGGGCGCTGTTCTACTTTCAGAAAATTGAAAGGAAAAAGAGGGAAAAAAAGGAAAAATGGGGAAAAAGTTTGTGATAGGACGGGTGAGGTGGCGGAGGGCGA
CTTAAAACAAGACTCAAGCAGAAGTCAAAAGGGGAAAAAGCAATAAAGGAAAGGCAAAAGATACAGTTGGAGGAAGAATGTCGAAGTTGTAGCCAAAACAGCCCAAAGTT
GGTTCAACTTGCTGCTTCTGCTTGCATTGCAGGTTTTATCCTTTTTTATCTTTTTCCTTTTAATTATTGAAAGTTCTTTTTATAAATTATTTACTGTAGGTGAAATGCAA
GAAAAAAAAGAAAAACAGGCCCTTTTTTACTGTCCTGATGATGTTAGAATGTAAACTATGTGAGTGAGGGGAATTTTGATAGGAAGGTTATGCTTAGGAGTAAGTTATGG
ATCCCCGCCAAAAAAATATATATATTTGAATGCGTGTTATGCTCTGTAGTTGCTACGAAAACTGTATAATTCGTTACGATGAGTGTCGTTATCAATGAACTGTCATTCAA
GCTCTATCATTCTGTTGCTTCTTTTAGTCTGAATGGGGAGGGGTTGAAATGTTTTGTGCAGATTTGATACATTTTTTAGGTGTTTCAAACTACTTTGTATCAAATGACTG
GTAGATTGGATCATTAACAAGATTACCTGCACCATTCTTAATACATTTCTTTGATGTTCTTTTTTTTTTTATTTGAATACAATATTTGGGGTCGAACCTACATATATGCT
TTAACCAGTTAATCTATGTTTATGTTGATATTTCTACGGTGTTTCTTGAATGTGAAAGGATAGTCTATGTGAAGGGAGATTAAATAGAAATCATGCTCACGTTACACTTT
TCCCAGATAGGAAAGGAAAAATGTTTCATAAATGTATTGCTTAAATCATTTTAACCTTGCATTTAAAGCTATGGGGGCAAAAAGGTCATTTTGCTGAAAAATATGAGACT
CGGAGTGTAGATAATTTCTTTAGAATTGTTGTAGAAATTTTCAGCCTGATTCAAGAGTTCTGGATTTGAATTAGCATTTGCATTTGCATTTGCATTTGAATTTGAATTTG
TCAGGTAGATTAAAGTCTCAACATTTGAGGAACCGTTTTATATTTTAGAGAAAAGGTTGAAAAATAGTTTTCGTACTCTAAATCCAGGAAAAATAAGTTTGGATAATTAT
TTTTTAGAAGCGGAGATTCATTCGTCTACGACTATATTATAGTTCATCCAGTCGGAAAG
Protein sequenceShow/hide protein sequence
MLGAGLQFARGCGDDRFYNPTKARRAHQGRQNDELRRAQSDVSAGQSPAVKPSTVSAVIRESENGSGCEEQELPKSIPVSAFEPVVSSLSNLERFLQSITPSVPAQYLSK
TTMRGWRTCDVEFQPYFVLGDLWEAFKEWSAYGAGVPLVLNDSDSVVQYYVPYLSGIQIYGESLKSSTKSRQPGEDSDSDFRDSSSDGSSDSEPERALKYMGKPLNHHHI
SLELPRRMERISLRDQLIGLQEDCSSDEAESFNSQGQLLFEHLERDLPYSREPLADKANLAFQFPELKTLRSCDLLPSSWFSVAWYPIYRIPTGPTLRDLDACFLTFHSL
YTPIGGARSVQGPVVTYPSEIDGIPKMPLPVFGLASYKFRGSLWTPNGGFEWQLANSLLQSAENWLRLLQVNHPDFIFFSRR