; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC05g1573 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC05g1573
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionUPF0098 protein
Genome locationMC05:19501084..19505162
RNA-Seq ExpressionMC05g1573
SyntenyMC05g1573
Gene Ontology termsNA
InterPro domainsIPR005247 - YbhB/YbcL
IPR008914 - Phosphatidylethanolamine-binding protein
IPR036610 - PEBP-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004137699.1 uncharacterized protein LOC101207571 [Cucumis sativus]2.23e-10988.62Show/hide
Query:  MADSEDFRLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADY
        M D+ +FRLVSSAIDHEGRLPRKYTSEGQG QKNKSPPLEWYN+PKGTK+LALVVQDIDAPDP GPIVPWTVWVV+NIPPTLKGLPEDFSG  + LG DY
Subjt:  MADSEDFRLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADY

Query:  AAIQEGNNDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF
        A IQEGNNDEK+PGWRAPTLPSHGHRFEFKLYALDD LNLG+KATKDKLL+AIEGHVLGEAVLMAVF
Subjt:  AAIQEGNNDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF

XP_008442384.1 PREDICTED: UPF0098 protein TC_0109 [Cucumis melo]1.76e-10686.23Show/hide
Query:  MADSEDFRLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADY
        M D+ +FRL SSAIDHEGRLPRKYTSEGQG QKNKSPPLEWYN+P+GTK+LALVVQDIDAPDP GPIVPWTVWVV+NIP TLKGLPEDFSG  + LG DY
Subjt:  MADSEDFRLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADY

Query:  AAIQEGNNDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF
        AAIQEGNND+K+PGWRAPTLPSHGHRFEFKLYALDD LNLG+KATK+KLL+AIEGHVLGEAVLMAVF
Subjt:  AAIQEGNNDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF

XP_022145739.1 uncharacterized protein LOC111015122 [Momordica charantia]6.95e-122100Show/hide
Query:  MADSEDFRLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADY
        MADSEDFRLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADY
Subjt:  MADSEDFRLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADY

Query:  AAIQEGNNDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF
        AAIQEGNNDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF
Subjt:  AAIQEGNNDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF

XP_023529183.1 uncharacterized protein LOC111791902 [Cucurbita pepo subsp. pepo]4.15e-10585.03Show/hide
Query:  MADSEDFRLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADY
        MADSE+FRL SSAIDHEGRLPRKYTSEGQG QKN+SPPLEWYNVP+GTK+LALVVQDIDAP+P GPIVPWTVWVV+NIPP+LKGLPEDFSG  + LG DY
Subjt:  MADSEDFRLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADY

Query:  AAIQEGNNDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF
        AAIQEGNND+KIPGWR PTLPS GHRFEFKLYALDD++NLG+K TK+KLLD IEGHVLGEAVLMA+F
Subjt:  AAIQEGNNDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF

XP_038904153.1 UPF0098 protein TC_0109 [Benincasa hispida]1.83e-10888.62Show/hide
Query:  MADSEDFRLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADY
        MADS  FRL SSAIDHEGRLPRKYTSEGQG QKNKSPPLEWYN+P+GTK+LALVVQDIDAPDP GPIVPWTVWVV+NIPPTLKGLPEDFSG  + LG DY
Subjt:  MADSEDFRLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADY

Query:  AAIQEGNNDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF
        AAIQEGNNDEK+PGWRAPTLPSHGHRFEFKLYALDD LNLG+K TK+KLLDAIEGHVLGEAVLMAVF
Subjt:  AAIQEGNNDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF

TrEMBL top hitse value%identityAlignment
A0A0A0L9T7 Uncharacterized protein1.08e-10988.62Show/hide
Query:  MADSEDFRLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADY
        M D+ +FRLVSSAIDHEGRLPRKYTSEGQG QKNKSPPLEWYN+PKGTK+LALVVQDIDAPDP GPIVPWTVWVV+NIPPTLKGLPEDFSG  + LG DY
Subjt:  MADSEDFRLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADY

Query:  AAIQEGNNDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF
        A IQEGNNDEK+PGWRAPTLPSHGHRFEFKLYALDD LNLG+KATKDKLL+AIEGHVLGEAVLMAVF
Subjt:  AAIQEGNNDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF

A0A1S3B537 UPF0098 protein TC_01098.52e-10786.23Show/hide
Query:  MADSEDFRLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADY
        M D+ +FRL SSAIDHEGRLPRKYTSEGQG QKNKSPPLEWYN+P+GTK+LALVVQDIDAPDP GPIVPWTVWVV+NIP TLKGLPEDFSG  + LG DY
Subjt:  MADSEDFRLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADY

Query:  AAIQEGNNDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF
        AAIQEGNND+K+PGWRAPTLPSHGHRFEFKLYALDD LNLG+KATK+KLL+AIEGHVLGEAVLMAVF
Subjt:  AAIQEGNNDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF

A0A5A7TL06 UPF0098 protein8.52e-10786.23Show/hide
Query:  MADSEDFRLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADY
        M D+ +FRL SSAIDHEGRLPRKYTSEGQG QKNKSPPLEWYN+P+GTK+LALVVQDIDAPDP GPIVPWTVWVV+NIP TLKGLPEDFSG  + LG DY
Subjt:  MADSEDFRLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADY

Query:  AAIQEGNNDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF
        AAIQEGNND+K+PGWRAPTLPSHGHRFEFKLYALDD LNLG+KATK+KLL+AIEGHVLGEAVLMAVF
Subjt:  AAIQEGNNDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF

A0A6J1CW59 uncharacterized protein LOC1110151223.37e-122100Show/hide
Query:  MADSEDFRLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADY
        MADSEDFRLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADY
Subjt:  MADSEDFRLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADY

Query:  AAIQEGNNDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF
        AAIQEGNNDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF
Subjt:  AAIQEGNNDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF

A0A6J1IX32 uncharacterized protein LOC1114814411.36e-10383.83Show/hide
Query:  MADSEDFRLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADY
        MADS++FRL SSAIDHEGRLPRKYTSEGQG QKNKSPPLEWYNVP+GTK+LALVVQDIDAP+P  PIVPWTVWVV+NIPP+LKGLPEDFSG  + LG DY
Subjt:  MADSEDFRLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADY

Query:  AAIQEGNNDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF
        AAIQEGNND+KIPGWR PTLPS GHRFEFKLYALDD++NLG+K +K+KLLD IEGHVLGEAVLMA+F
Subjt:  AAIQEGNNDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF

SwissProt top hitse value%identityAlignment
O26373 UPF0098 protein MTH_2731.1e-2543.4Show/hide
Query:  LVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADYAAIQEGNN
        L + A +  GR+P +YT +G+    N SPPL W  VP   KSLAL+  D DAP        WT WV+ NIPP   GL E+     +A      ++Q G N
Subjt:  LVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADYAAIQEGNN

Query:  DEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF
        D    G+R P  PS  HR+ F+LYALD  L+L   A+K+ +L+A+EGHVLGEA L+ ++
Subjt:  DEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF

O84741 UPF0098 protein CT_7366.6e-1835Show/hide
Query:  RLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADYAAIQEGN
        +L S A  +   +P+KY+ +G G     SPPL + +VP+  KSL L+V+D D P        W  W+V N+ P +  L E         GA   A+Q  N
Subjt:  RLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADYAAIQEGN

Query:  NDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF
           +I G+  P  P   HR+ F  YALD  L+     TK++LL+A++GH++  A LM  +
Subjt:  NDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF

Q9PLJ0 UPF0098 protein TC_01095.1e-1836.25Show/hide
Query:  RLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADYAAIQEGN
        +L S A  +   +P+KY+ +G G     SPPL + ++P   KSLAL+V+D D P        W  W+V N+ P +  L E         GA   A+Q  N
Subjt:  RLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADYAAIQEGN

Query:  NDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF
           +I G+  P  P   HR+ F  YALD  L      TK++LL+A+EGH+L  A LM  +
Subjt:  NDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF

Q9UZJ3 UPF0098 protein PYRAB115301.2e-1432.73Show/hide
Query:  EDFRLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADYAAIQ
        E+   VSS   ++  +P KYT EG     + +PPL    + +  KSL ++V      DP  P+  +T W+  NIPP ++ +PE    + E     +  + 
Subjt:  EDFRLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADYAAIQ

Query:  EGNNDEKIPGWRAPTLP-SHG-HRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF
        +G ND    G+  P  P  HG H + FK+Y LD  LNL   AT+++L  A+EGH++    L+ ++
Subjt:  EGNNDEKIPGWRAPTLP-SHG-HRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF

Q9Z729 UPF0098 protein CPn_0877/CP_0992/CPj0877/CpB09061.1e-1735Show/hide
Query:  RLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADYAAIQEGN
        +L+S A  +   +P+KYT +G G     SPPL + +VP   +SLAL+V+D D P        W  W+V N+  T+  L E         GA+  A+Q G 
Subjt:  RLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADYAAIQEGN

Query:  NDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF
        N    P +  P  P   HR+ F L+ALD  L      T+D+L +A+E H++ +A LM  +
Subjt:  NDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF

Arabidopsis top hitse value%identityAlignment
AT5G01300.1 PEBP (phosphatidylethanolamine-binding protein) family protein1.4e-6368.12Show/hide
Query:  SEDFRLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADYAAI
        SE+ RLVS  ID++G+LPRKYT  GQG +K+ SPPLEWYNVP+GTK+LALVV+DIDAPDP GP+VPWTVWVV++IPP +KGLPE +SG  +        I
Subjt:  SEDFRLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADYAAI

Query:  QEGNNDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVL
        +EGNND KIPGWR P LPSHGHRF+FKL+ALDD+  +GH  TK++LL AIEGHVLGEA+L
Subjt:  QEGNNDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVL

AT5G01300.2 PEBP (phosphatidylethanolamine-binding protein) family protein1.1e-4768.85Show/hide
Query:  YNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADYAAIQEGNNDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLG
        YNVP+GTK+LALVV+DIDAPDP GP+VPWTVWVV++IPP +KGLPE +SG  +        I+EGNND KIPGWR P LPSHGHRF+FKL+ALDD+  +G
Subjt:  YNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADYAAIQEGNNDEKIPGWRAPTLPSHGHRFEFKLYALDDQLNLG

Query:  HKATKDKLLDAIEGHVLGEAVL
        H  TK++LL AIEGHVLGEA+L
Subjt:  HKATKDKLLDAIEGHVLGEAVL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGACAGCGAGGATTTCAGGCTGGTGTCATCGGCGATAGATCACGAAGGGAGGTTGCCGAGAAAGTACACGTCGGAAGGGCAAGGGACGCAGAAGAATAAATCTCC
GCCGTTGGAATGGTACAATGTGCCCAAGGGGACCAAGAGCCTTGCCCTGGTGGTGCAGGACATCGACGCGCCGGACCCGCAGGGTCCGATTGTGCCGTGGACCGTGTGGG
TGGTCCTCAACATTCCGCCCACATTGAAGGGCCTGCCCGAAGATTTCTCGGGCAAACACGAAGCGCTCGGCGCTGATTATGCCGCTATCCAAGAGGGGAATAACGACGAG
AAAATCCCTGGTTGGCGCGCCCCCACTTTGCCCTCCCACGGCCACCGCTTCGAGTTCAAGCTCTACGCTTTGGACGACCAGTTGAACCTCGGCCACAAGGCAACAAAGGA
CAAGCTGTTAGACGCAATAGAAGGGCACGTGCTGGGAGAAGCAGTACTAATGGCGGTCTTCTGA
mRNA sequenceShow/hide mRNA sequence
CAGATCTCCAGTCCGCGGAGGACACTCTAAGTCTGCCACGTCAAGAGTGACGTGTCGTGAGCTTCTTGAACCCTCTCCTTTTGTTGCCAACTTGAAGTTTCCTGACCAAA
TGTCGCTGAAGATATGGGCAAGAACCAAGGGCGCAGTGTATAAATCAATTCCAATGACGCAACAAAACCAACGACCGATACCGAAGAAGAAATTAAAGCCTAATTAGTGT
TAATTTTTGAAGGCATGGCCGACAGCGAGGATTTCAGGCTGGTGTCATCGGCGATAGATCACGAAGGGAGGTTGCCGAGAAAGTACACGTCGGAAGGGCAAGGGACGCAG
AAGAATAAATCTCCGCCGTTGGAATGGTACAATGTGCCCAAGGGGACCAAGAGCCTTGCCCTGGTGGTGCAGGACATCGACGCGCCGGACCCGCAGGGTCCGATTGTGCC
GTGGACCGTGTGGGTGGTCCTCAACATTCCGCCCACATTGAAGGGCCTGCCCGAAGATTTCTCGGGCAAACACGAAGCGCTCGGCGCTGATTATGCCGCTATCCAAGAGG
GGAATAACGACGAGAAAATCCCTGGTTGGCGCGCCCCCACTTTGCCCTCCCACGGCCACCGCTTCGAGTTCAAGCTCTACGCTTTGGACGACCAGTTGAACCTCGGCCAC
AAGGCAACAAAGGACAAGCTGTTAGACGCAATAGAAGGGCACGTGCTGGGAGAAGCAGTACTAATGGCGGTCTTCTGATCATGAGAGACATTAAACTTTTTGTACTATTT
TATCTATATATCTTATCTGATATCGCAAAAAGACAGCAGTTTGTAAATTAGATTAACTATTTCCTTTCTTAATAAGAGAGATTTCGCTTTTTTCCACGCTTGAAATCCTC
GGCCACACCAGAGAAAGCATTCATTAGAAAAAGGAAGAACACCCAAGCAATTCCAAAACACTATTATACACCAAGACGCAGAAGAAACAGAGTAAAAGAAACTTTTTTTT
TTCTTTTTTAATTTTATTTTTACAAATTGATGGTCACTCCTCTGGCACCGTTGATATAAGGCGACGCCGCTCTCTCTTCCGTGCTCGCCGTCGCATTGCCGTGCCTTCTC
GGATCGAACTTCTCGCCGTCGCCCCAAAGAGCGTAAGCCTCCTCGCCGTGCACGGCGTCGTCTCCGATCATGAGGTCCTCGTCCGGCATCCTCAACGGCACGAACAGCCG
AATAAAGAGGAGTATAATCGTCGTCGACACAACGTTCCATCCTATTATAAACATGGCTCCGGCCAGTTGTTTAAGAAACTGAATGCCGCCGGTTCCGCCGTAGAAGGCAC
CGCGCGTGTCGGGGACCGGTAGATACAGATCGCATAGATCGGGCTCCGCCAAAAGCCCAGTTAAAAGCCCGCCCAACATTCCCGCCACCGCGTGCGTGTGAAATACGCCA
AGCGTGTCGTCAGCCTATTATTACATTTTACATTATTTCACAAATTACGGATATTGGATCTCAAAGACAAATATCTTTAATATTATGGAGGAAAAAATGTAAATTTGTAG
AACTAAAAAGAATATGTAAATTTTGTAAATATATATATTTTTTTGGGAGAATTAACGGGCTTTGATCTTGCTCCTTAATAAGGCAAAATATTATCAATTTAAATATTTGT
AATGGATTTTTTTATTACTATACTTTCGTCAGAAATACCTATCGATTATATTTCTACAAAATTAAGGGATGAAATTATTGATCTGTTGAAGATAATTATACACGAAAGAT
GTGGGATCGGGAAGAATATTCAAATATGACGGAAAGTGACGTTCGGAAGAAAATTCCACAAACAACAAAACAGTAATCTGTAAGGACAGAGGGCCCGGACCATTGACTAT
GATTTGCCAAAGACTAGCTAGAAACTAGCATTAATTTGGCAAATATTTAATTTTATGTCATTTTCTGAGGAAAAAAAACCTGATAAACAATGACAAAATTATATGTTATT
CATATTACCAATAAATTTATAACTATTGATTACAAAATAGCTTTCGTGATGTTAGAGTTACTATAGTTTGTTAACCATAGTTTGTTATTTACAGTCTCCTGTATTTATGT
ACAACGAGAGAGAATTACTCATTAATAAAATTTCTTCTCTCAATTCTTGTTTTACGCGTAACACGTGAGCTCTTGAAAGTAAATTTTAAAAGTGAGAGAAAATAAATTTA
AAAAATGATTAAATAAACAATTTTTTAAATTAAAATTTTGTTTTAAAAAGTAGGTGAGAATTTTTAGAAACAGTTGGAAGATTTAAGGACATACCTTCTGAAGGAAACTC
AATTTTTTGTGGAGGACCATCATGGACACCCACGGAATGATTCCTGCTAGAATCCCCATTATGATAGCCGCCCACGTTTGCACAACTCCTGTTTTAAATAATTTGTTAAA
AATCTTAATATTTTGAGAAGGAACCAAGATTGCAATCAATAACGACCCTAAATTCCTAATATATTTTATGAACATTTTGATAATGAATTTTAGGAAAATAACAAATATTA
GGAATTTTATTTTTTTAAAAGTTTAATAACTAATATTAAATACTAATTAATGTGGAGGCGGAATTACCTGCGCCGGGCGTAACGCAAGCCAAGCCGGTCATCATACCCTG
AACGGCTCCGATCACCGATGGCTTCCCGAAGAAAATGACATCCAAAGTGGTCCACACGAGCAGGCTGGTCGCCGCGCTGACGTTCGTGTTCAGCACCGCAATCGAGGCCA
CCACGTTGGCGGCGTGGGGCGCGCCGCCGTTAAATCCCGACCACCCCATCCACAGCAGCCCCGCTCCGGCGAGCATCAGCAGCACGTTATTGGGCGGAAATCTCTCTCTA
TCACTCTTAATTCTCGGACCAACCTGACCACAAAAAAAGAGTAAAATTTAATCAATTAACGCTCTTCCGATTTCCGATTGCGAATTTGGTCGGAGAGTGGGCGAAATTTG
TTGGGGTTTTTTTTGTTTTTTTGCAATTACCCAGTAAGCAGCGGTTAAGCCGGAGATTCCCGAAGAAAGATGGATGACGTAGCCGCCGGAGTAATCGATGACGCCCCAGT
GGAAGAGAAACCCTCCGCCCCATATAGTGTATGCGCCGACGGTGTATGAGAAAATGACCCAAAGAGGAACAAAGGCCATCCAGGCCTTGATATTCATTCGAGCAAGAACA
GAGCCAGCAAGTAAAATCACAGTAATCGCAGCGAAAGTGAATTGGAAGTAGACGAGCGAGGCCATGGGAACGCTGGGTTCAATTCTGGGCGTCACCACCGTGCCGTCGCG
GCGGCGGTGGATGCTCTCGGGTATATTGGCTTGGCCGATTAGGTAACTTTGGCCGAGAGCAGGAATGCCTCTGCCCCACAAGGGAAGGAGCTGCTCGCCGAAGGCCATGC
GATAGCCGAAGAGAACCCAGCAGATTAGGACGGCGGCGAAGGCGTAAAGGGCCATAAATGCAGAATTGACGGCCCATTTCTTCTTGACGATGCTGGCGTAGAGAATCACC
AGGCCCGGCATGCTCTGAAGTCCGACGAGAGTGGAGGCGGTTATTTGCCACGCGTTGTCGCCGCTGTTCAGCCACGGCGGAGATCCGGAGGCCATATTTCCCGGCGGTGC
GGGGTTGTTCATCCTTATTCTTCTTCTTTTTTCAGTTGCCCAATTTCTGAGACTTGGAAGAAGATGGGACTTTCGAAAATGAAGATTCTAAAGAGGAATTTCGGCCGAGG
AAGGAAACTGTGAGGGAAGGACAAGTGGGTTTCTACGAATTGATGTGTTAGATGGTGAAATTTCGATCATTCGTTAAGGAATGTGTAGAGAAATTGGGGATTGACG
Protein sequenceShow/hide protein sequence
MADSEDFRLVSSAIDHEGRLPRKYTSEGQGTQKNKSPPLEWYNVPKGTKSLALVVQDIDAPDPQGPIVPWTVWVVLNIPPTLKGLPEDFSGKHEALGADYAAIQEGNNDE
KIPGWRAPTLPSHGHRFEFKLYALDDQLNLGHKATKDKLLDAIEGHVLGEAVLMAVF