; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC08g0285 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC08g0285
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionEpidermal patterning factor-like protein
Genome locationMC08:2169725..2177075
RNA-Seq ExpressionMC08g0285
SyntenyMC08g0285
Gene Ontology termsGO:0010052 - guard cell differentiation (biological process)
GO:0005576 - extracellular region (cellular component)
InterPro domainsIPR039455 - EPIDERMAL PATTERNING FACTOR-like protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022131875.1 EPIDERMAL PATTERNING FACTOR-like protein 2 [Momordica charantia]1.69e-108100Show/hide
Query:  MGSLQNWHRNKQHISISLLLLSVSIFVDVISMAEGRGIPTPLMEGRKVEAAGSWPELPEKVGMIRSQIGSRPPSCRRRCRECGGHCEAIQVPVALHDSQQ
        MGSLQNWHRNKQHISISLLLLSVSIFVDVISMAEGRGIPTPLMEGRKVEAAGSWPELPEKVGMIRSQIGSRPPSCRRRCRECGGHCEAIQVPVALHDSQQ
Subjt:  MGSLQNWHRNKQHISISLLLLSVSIFVDVISMAEGRGIPTPLMEGRKVEAAGSWPELPEKVGMIRSQIGSRPPSCRRRCRECGGHCEAIQVPVALHDSQQ

Query:  NQRKKRSSHFPAAAATSAGASNKDMALSSEDETSNYKPISWKCKCGNFIFNP
        NQRKKRSSHFPAAAATSAGASNKDMALSSEDETSNYKPISWKCKCGNFIFNP
Subjt:  NQRKKRSSHFPAAAATSAGASNKDMALSSEDETSNYKPISWKCKCGNFIFNP

XP_022951797.1 EPIDERMAL PATTERNING FACTOR-like protein 2 [Cucurbita moschata]5.30e-5864.38Show/hide
Query:  MGSLQNWHRNKQHISISLLL-LSVSIFVDVISMAEGRGIPTPLMEGRKVEAAGSWPELPEKVGMIR-SQIGSRPPSCRRRCRECGGHCEAIQVPVALHDS
        MGSLQ WHRNK H+SI  L  LSV +F+ + S+AEGRG  TPLME RKVE   S  EL   V M R SQIGS+PPSCRRRCRECGG CEAIQVPVA+HDS
Subjt:  MGSLQNWHRNKQHISISLLL-LSVSIFVDVISMAEGRGIPTPLMEGRKVEAAGSWPELPEKVGMIR-SQIGSRPPSCRRRCRECGGHCEAIQVPVALHDS

Query:  QQNQRKKRS-----SHFPAAAATSAGASNKD-MALSSEDETSNYKPISWKCKCGNFIFNP
           ++++R+     SHF + A++S+ +S  + +ALSSEDETSNYKPISWKCKCGNFIFNP
Subjt:  QQNQRKKRS-----SHFPAAAATSAGASNKD-MALSSEDETSNYKPISWKCKCGNFIFNP

XP_023002540.1 EPIDERMAL PATTERNING FACTOR-like protein 2 [Cucurbita maxima]5.70e-5963.92Show/hide
Query:  MGSLQNWHRNKQHISISLLL-LSVSIFVDVISMAEGRGIPTPLMEGRKVEAAGSWPELPEKVGMIRSQIGSRPPSCRRRCRECGGHCEAIQVPVALHDSQ
        MGSLQ WHRNK H+SI  L  LSV +F+ + S+AEGRG  TPLME RKVE   S  EL   V M RSQIGS+PPSCRRRCRECGG CEA+QVPVA+HDS 
Subjt:  MGSLQNWHRNKQHISISLLL-LSVSIFVDVISMAEGRGIPTPLMEGRKVEAAGSWPELPEKVGMIRSQIGSRPPSCRRRCRECGGHCEAIQVPVALHDSQ

Query:  QNQRKKRS-----SHFPAAAATSAGASNKDMALSSEDETSNYKPISWKCKCGNFIFNP
          ++++R+     SHF + A++S+   +  +ALSSEDETSNYKPISWKCKCGNFIFNP
Subjt:  QNQRKKRS-----SHFPAAAATSAGASNKDMALSSEDETSNYKPISWKCKCGNFIFNP

XP_023538508.1 EPIDERMAL PATTERNING FACTOR-like protein 2 [Cucurbita pepo subsp. pepo]8.46e-6063.58Show/hide
Query:  MGSLQNWHRNKQHISISLLL-LSVSIFVDVISMAEGRGIPTPLMEGRKVEAAGSWPELPEKVGMIRSQIGSRPPSCRRRCRECGGHCEAIQVPVALHDSQ
        MGSLQ WHRNK H+SI  L  LSV +F+ + SMAEGRG  TPLME RKVE   S  EL   V M RSQIGS+PPSCRRRCRECGG CEA+QVPVA+HDS 
Subjt:  MGSLQNWHRNKQHISISLLL-LSVSIFVDVISMAEGRGIPTPLMEGRKVEAAGSWPELPEKVGMIRSQIGSRPPSCRRRCRECGGHCEAIQVPVALHDSQ

Query:  QNQRKKRS------SHFPAAAATSAGASNKD---MALSSEDETSNYKPISWKCKCGNFIFNP
          ++++R+      SHF + A++S+ +S+     +ALSSEDETSNYKPISWKCKCGNFIFNP
Subjt:  QNQRKKRS------SHFPAAAATSAGASNKD---MALSSEDETSNYKPISWKCKCGNFIFNP

XP_038886514.1 EPIDERMAL PATTERNING FACTOR-like protein 2 [Benincasa hispida]1.03e-6068.15Show/hide
Query:  MGSLQNWHRNKQHISISLLLLSVSIFVDVISMAEGRGIPTPLMEGRKVEAAGSWPELPEKVGMI-RSQIGSRPPSCRRRCRECGGHCEAIQVPVALHD-S
        MGS QNWHR K H+SISLL LSVSI + V S+ EGRGI   L  GR      +  E  EK GM+ R+QIGSRPPSCRRRCRECGGHCEA+QVPVALHD S
Subjt:  MGSLQNWHRNKQHISISLLLLSVSIFVDVISMAEGRGIPTPLMEGRKVEAAGSWPELPEKVGMI-RSQIGSRPPSCRRRCRECGGHCEAIQVPVALHD-S

Query:  QQNQRKKR---SSHFPAAAATSAGASNKDMALSSEDETSNYKPISWKCKCGNFIFNP
         QNQRKKR   SSHF          S  D+ALSSEDETSNYKPISWKCKCGNFIFNP
Subjt:  QQNQRKKR---SSHFPAAAATSAGASNKDMALSSEDETSNYKPISWKCKCGNFIFNP

TrEMBL top hitse value%identityAlignment
A0A0A0LPQ0 Epidermal patterning factor-like protein3.92e-5664.94Show/hide
Query:  MGSLQNWHRNK-QHISISLLLLSVSIFVDVISMAEGRGIPTPLMEGRKVEAAGSWPELPEKVGMI-RSQIGSRPPSCRRRCRECGGHCEAIQVPVALHDS
        MGS Q WHRN+ +HISI LL LSV I + V S+ EGRGI    MEG +  A  +  E  EK+GM+ R+QIGSRPPSCRR+CRECGGHCEA+QVPVALHDS
Subjt:  MGSLQNWHRNK-QHISISLLLLSVSIFVDVISMAEGRGIPTPLMEGRKVEAAGSWPELPEKVGMI-RSQIGSRPPSCRRRCRECGGHCEAIQVPVALHDS

Query:  QQNQRKKRSSHFPAAAATSAGASNKDMALSSEDETSNYKPISWKCKCGNFIFNP
         QNQRK R     ++   S  + + D+ALSSEDETSNYKPISWKCKCGNFIFNP
Subjt:  QQNQRKKRSSHFPAAAATSAGASNKDMALSSEDETSNYKPISWKCKCGNFIFNP

A0A1S3BBM2 Epidermal patterning factor-like protein8.52e-5463.06Show/hide
Query:  MGSLQNWHRNK-QHISISLLLLSVSIFVDVISMAEGRGIPTPLMEGRKVEA---AGSWPELPEKVGMI-RSQIGSRPPSCRRRCRECGGHCEAIQVPVAL
        MGS Q WHRN+ +H+SI LL LSV I + V S+ EGRGI    MEG +  A   A +  E  EK+GM+ R+QIGSRPPSCRR+C ECGGHCEA+QVPVAL
Subjt:  MGSLQNWHRNK-QHISISLLLLSVSIFVDVISMAEGRGIPTPLMEGRKVEA---AGSWPELPEKVGMI-RSQIGSRPPSCRRRCRECGGHCEAIQVPVAL

Query:  HDSQQNQRKKRSSHFPAAAATSAGASNKDMALSSEDETSNYKPISWKCKCGNFIFNP
        HDS QNQ+K R     +    S   S  D+ALSSEDETSNYKPISWKCKCGNFIFNP
Subjt:  HDSQQNQRKKRSSHFPAAAATSAGASNKDMALSSEDETSNYKPISWKCKCGNFIFNP

A0A6J1BQW9 Epidermal patterning factor-like protein8.18e-109100Show/hide
Query:  MGSLQNWHRNKQHISISLLLLSVSIFVDVISMAEGRGIPTPLMEGRKVEAAGSWPELPEKVGMIRSQIGSRPPSCRRRCRECGGHCEAIQVPVALHDSQQ
        MGSLQNWHRNKQHISISLLLLSVSIFVDVISMAEGRGIPTPLMEGRKVEAAGSWPELPEKVGMIRSQIGSRPPSCRRRCRECGGHCEAIQVPVALHDSQQ
Subjt:  MGSLQNWHRNKQHISISLLLLSVSIFVDVISMAEGRGIPTPLMEGRKVEAAGSWPELPEKVGMIRSQIGSRPPSCRRRCRECGGHCEAIQVPVALHDSQQ

Query:  NQRKKRSSHFPAAAATSAGASNKDMALSSEDETSNYKPISWKCKCGNFIFNP
        NQRKKRSSHFPAAAATSAGASNKDMALSSEDETSNYKPISWKCKCGNFIFNP
Subjt:  NQRKKRSSHFPAAAATSAGASNKDMALSSEDETSNYKPISWKCKCGNFIFNP

A0A6J1GIN2 Epidermal patterning factor-like protein2.57e-5864.38Show/hide
Query:  MGSLQNWHRNKQHISISLLL-LSVSIFVDVISMAEGRGIPTPLMEGRKVEAAGSWPELPEKVGMIR-SQIGSRPPSCRRRCRECGGHCEAIQVPVALHDS
        MGSLQ WHRNK H+SI  L  LSV +F+ + S+AEGRG  TPLME RKVE   S  EL   V M R SQIGS+PPSCRRRCRECGG CEAIQVPVA+HDS
Subjt:  MGSLQNWHRNKQHISISLLL-LSVSIFVDVISMAEGRGIPTPLMEGRKVEAAGSWPELPEKVGMIR-SQIGSRPPSCRRRCRECGGHCEAIQVPVALHDS

Query:  QQNQRKKRS-----SHFPAAAATSAGASNKD-MALSSEDETSNYKPISWKCKCGNFIFNP
           ++++R+     SHF + A++S+ +S  + +ALSSEDETSNYKPISWKCKCGNFIFNP
Subjt:  QQNQRKKRS-----SHFPAAAATSAGASNKD-MALSSEDETSNYKPISWKCKCGNFIFNP

A0A6J1KP85 Epidermal patterning factor-like protein2.76e-5963.92Show/hide
Query:  MGSLQNWHRNKQHISISLLL-LSVSIFVDVISMAEGRGIPTPLMEGRKVEAAGSWPELPEKVGMIRSQIGSRPPSCRRRCRECGGHCEAIQVPVALHDSQ
        MGSLQ WHRNK H+SI  L  LSV +F+ + S+AEGRG  TPLME RKVE   S  EL   V M RSQIGS+PPSCRRRCRECGG CEA+QVPVA+HDS 
Subjt:  MGSLQNWHRNKQHISISLLL-LSVSIFVDVISMAEGRGIPTPLMEGRKVEAAGSWPELPEKVGMIRSQIGSRPPSCRRRCRECGGHCEAIQVPVALHDSQ

Query:  QNQRKKRS-----SHFPAAAATSAGASNKDMALSSEDETSNYKPISWKCKCGNFIFNP
          ++++R+     SHF + A++S+   +  +ALSSEDETSNYKPISWKCKCGNFIFNP
Subjt:  QNQRKKRS-----SHFPAAAATSAGASNKDMALSSEDETSNYKPISWKCKCGNFIFNP

SwissProt top hitse value%identityAlignment
Q9LFT5 EPIDERMAL PATTERNING FACTOR-like protein 12.2e-0734.65Show/hide
Query:  PELPEKVGMI--RSQIGSRPPSCRRRCRECGGHCEAIQVPVALHDSQQNQRKKRSSHFPAAAATSAGASNKDMALSSE-DETSNYKPISWKCKCGNFIFN
        P +  +V +I  ++++GS PPSC  RC  C   C AIQVP               S F      S G      +L++  D+ SNYKP+ WKC C    +N
Subjt:  PELPEKVGMI--RSQIGSRPPSCRRRCRECGGHCEAIQVPVALHDSQQNQRKKRSSHFPAAAATSAGASNKDMALSSE-DETSNYKPISWKCKCGNFIFN

Query:  P
        P
Subjt:  P

Q9T068 EPIDERMAL PATTERNING FACTOR-like protein 21.3e-1538.1Show/hide
Query:  WHRNKQHISISLLLLSVSIFVDVISMAEGRGIPTPLMEGRKVEAAGSWPELPEKVGMIRSQIGSRPPSCRR-RCRECGGHCEAIQVPVALHDSQQNQRKK
        W  N     + LL+L+ + F     MA GR  P        VE   S  +  +   M+R  IGSRPP C R RCR C GHCEAIQVP            +
Subjt:  WHRNKQHISISLLLLSVSIFVDVISMAEGRGIPTPLMEGRKVEAAGSWPELPEKVGMIRSQIGSRPPSCRR-RCRECGGHCEAIQVPVALHDSQQNQRKK

Query:  RSSHFPAAAATSAGASNKDMALSSEDETSNYKPISWKCKCGNFIFNP
           H P   ++S+ +    +  +  D+++NYKP+SWKCKCGN I+NP
Subjt:  RSSHFPAAAATSAGASNKDMALSSEDETSNYKPISWKCKCGNFIFNP

Arabidopsis top hitse value%identityAlignment
AT4G37810.1 unknown protein9.0e-1738.1Show/hide
Query:  WHRNKQHISISLLLLSVSIFVDVISMAEGRGIPTPLMEGRKVEAAGSWPELPEKVGMIRSQIGSRPPSCRR-RCRECGGHCEAIQVPVALHDSQQNQRKK
        W  N     + LL+L+ + F     MA GR  P        VE   S  +  +   M+R  IGSRPP C R RCR C GHCEAIQVP            +
Subjt:  WHRNKQHISISLLLLSVSIFVDVISMAEGRGIPTPLMEGRKVEAAGSWPELPEKVGMIRSQIGSRPPSCRR-RCRECGGHCEAIQVPVALHDSQQNQRKK

Query:  RSSHFPAAAATSAGASNKDMALSSEDETSNYKPISWKCKCGNFIFNP
           H P   ++S+ +    +  +  D+++NYKP+SWKCKCGN I+NP
Subjt:  RSSHFPAAAATSAGASNKDMALSSEDETSNYKPISWKCKCGNFIFNP

AT5G10310.1 unknown protein1.5e-0834.65Show/hide
Query:  PELPEKVGMI--RSQIGSRPPSCRRRCRECGGHCEAIQVPVALHDSQQNQRKKRSSHFPAAAATSAGASNKDMALSSE-DETSNYKPISWKCKCGNFIFN
        P +  +V +I  ++++GS PPSC  RC  C   C AIQVP               S F      S G      +L++  D+ SNYKP+ WKC C    +N
Subjt:  PELPEKVGMI--RSQIGSRPPSCRRRCRECGGHCEAIQVPVALHDSQQNQRKKRSSHFPAAAATSAGASNKDMALSSE-DETSNYKPISWKCKCGNFIFN

Query:  P
        P
Subjt:  P


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCAGCCTTCAAAATTGGCACAGAAACAAACAACACATTTCAATTTCCTTGCTCCTCCTCTCTGTTTCCATCTTTGTTGATGTCATATCCATGGCTGAAGGCAGAGG
AATTCCAACTCCATTAATGGAGGGCAGGAAGGTGGAGGCAGCAGGCAGCTGGCCGGAGCTGCCGGAGAAAGTGGGGATGATTAGGAGTCAAATAGGGTCACGGCCGCCGA
GCTGCCGGAGAAGGTGCAGGGAATGCGGCGGACATTGTGAGGCAATTCAAGTACCAGTGGCACTGCATGACTCACAACAGAATCAAAGGAAAAAAAGAAGCAGCCATTTC
CCTGCTGCAGCGGCAACATCAGCAGGAGCTTCAAACAAGGATATGGCTCTCTCTAGTGAAGATGAGACCTCTAATTACAAACCCATAAGCTGGAAATGCAAGTGTGGGAA
CTTCATCTTCAACCCATGA
mRNA sequenceShow/hide mRNA sequence
ATTAAATATGAAGTGGCAGACAAGTTTTGAGGAACAAACAAGTGCACAAGAGAGATTCATGGCTTAAACAAGGTTGTTTGTGGAAGTGGATAAATGATTAGGCCATTAAC
GAGCTGAGAAGAAAGAAAAAGAGGCTCCAATGGCATCTAATGAAGTTTGGCCATTGCAGAGAAGTCGCTTTCCATTCCATTGCAGCCAAAGCCATGTTCAGAACCTTCAG
AAAAGAGTTTACGTCTCAACCAAAACTTTCATCCCAATGATTCCTCTATAATCACCAATGATCAATCAAAGTATAAGGAAAAAGCAAACTAAAAAGAAAAAAAAGAGAAA
CTGATATCATCTAAAACTGTTTTCACAAAATGAAAGAGAGAGGGACCAAATAATTTGGCGTGTAAAAGTGGAACTGCCCAAGATTCTATGCACTAAAAAAGAAAAGAAAA
ATGGGAATGGGAGACAAAATGAAGGAAAACGAAAAGGCATTGAGACCCAGCTGGAGATTTGAGATTTATATTCTTCACCACTCTCCAAAAATGTAAATTTTGGCCTTTTT
TTCCCCCTCTTCTGAATTATCTGCAATGGAACAGTAGAGTCAGTTCATTTTCTTTCTCGTGTAAAAGAACAGAACGTTGGCTCGTTAAACGAGGGTGATGTCCTTCCCCT
TTCTTCTTTGGATTATTGCTTGTACTCGCTTGCTTTTAACTTCCATCCTCCATAAAAGAGAACTACAGAGAGAGAGAAAGAGGTAAGAAGAAGAACGATGGGTCGATTTT
GAATTATAGAGTGGGCACTCAATCTCTGCTGACTGCAGCTGATACCCTTTTCTTTTGGTACACTTATAATATTCAGGATTCACAGTTTCAATTTTTCTGCTACACTTTTT
CTCTGTCACTCTTCACCCAAATTCTCCCCCTCCCCACCCTTCCAATTCAATCTACAAATTCTTCTCTCTTAAATCTTCCCCTTATTTAAACAACTTCTTCTTCTTCTTCT
TTTTCTTCTTCTTCCATACTTACCTTTACTCTTTAACCTTTGAACTCCTCTTCTTTCTACCTTTTCAAGTCTCTGAAAAGATGGGCAGCCTTCAAAATTGGCACAGAAAC
AAACAACACATTTCAATTTCCTTGCTCCTCCTCTCTGTTTCCATCTTTGTTGATGTCATATCCATGGCTGAAGGCAGAGGAATTCCAACTCCATTAATGGAGGGCAGGAA
GGTGGAGGCAGCAGGCAGCTGGCCGGAGCTGCCGGAGAAAGTGGGGATGATTAGGAGTCAAATAGGGTCACGGCCGCCGAGCTGCCGGAGAAGGTGCAGGGAATGCGGCG
GACATTGTGAGGCAATTCAAGTACCAGTGGCACTGCATGACTCACAACAGAATCAAAGGAAAAAAAGAAGCAGCCATTTCCCTGCTGCAGCGGCAACATCAGCAGGAGCT
TCAAACAAGGATATGGCTCTCTCTAGTGAAGATGAGACCTCTAATTACAAACCCATAAGCTGGAAATGCAAGTGTGGGAACTTCATCTTCAACCCATGATTTTTTTTGTT
CCTTCTTTCTTCTCTGAAAATTTTCTTGAATAAAAACACCAACTTGTATAAAGATCAGATCAAAAGCCGGAAATGACCATCTTGTCCAATCATAGTTTCTACCTCCCTTC
CCTGTCGCTGTATTTTGGGGAACTTTTTGTTGGGAATGCTTTAATTTCTGTGTTTTCCAAGACAGTAAGAAAGGGGACGTCAGCTTGAGGAAAATTGCAGAGATTTGGTA
ATTGTAAAGGGAAATGGGGTAAATGGTAATGAGATTTTGAAAGTTCAGAAGAGAGAGAGTTATGATGACAGAGAGGGAGTGGGGGAGAGAGAGAGAGGATTGTAAAGAAA
AACGAGGAAGTGTTGTGTCAGGTAAGATTTCAGACAGTTCTGGTTTTCGTATACTACAAGATTTCTAGATCTGGAAAAATATTCAACAAACCCAATCAGTAACTTCTACA
ATTTACTAGCTTGGTAATTTTGATCACCTTGTGTTTGAATCATAATTAAAAAGTGAGTTTTGATCTTTGTTTGATCAGTTTAACACAGAGAAGCAAGCAAAGGGATGATC
TTTGAAATTGGGTTGGCCTTGGCCATGAGTTCATCAGTATAACTAGAAGCTCATGTAAATCACCTGTTAATACAAAGTTCCATGATCTCTGTTGGGTCTTATTCATTTGG
CTGAACCTTAAGTTTAACGATTGAGCATTGAAGGAAAATGGGGTAGAATCTTCTATTTGAAATATTTCGATAACCCTTGAAGGAAGAAAGCTGTTATAACGTTAATTAAT
TCCTCTAAAACAAAATTTACATTCAACCCTTTAACACTTCCCTTTTCTACCTTGGAAATATTTACAGTCATTCTAGTAACTCCCAACAGAAGCAGCAGGGGATGGAGAGA
AAATTAGTTCAGATTTTGGGCTAATTGATGCAAAGTTAAAAAGGCATATTTACACCTTCCAACCGCAAACTCGTCAACTTTGTGATCGAGAATAACATTAGATAATGGCT
AGAAACCAAAGAACAGAATCAGGTAGTCAATTTCATACGAAACCGCACCTATACTTGGCTTCTCCCCCGCCGCCCTTGGCCAAGTAAGATTGTATGAGTTGCCAGAAGTA
AAGCACCTGCTTGGAAGCCTTGCCCTTGTGCGCTTATTCGAGAGGTTTGATATTTTCTGGAGAGTACTGACCGTGACCGTTTGAGGATCAAAGAATTGTCTAGTAAGATG
TTCTCTTCTTGTACTTGCTCTGCACCAGGCCCAACAGGAAGTCTGTGAATATGTGCTCATATTCCGGCATTTTCCCATACCCTATACAAATGACAACCAGTTTTAAAATC
ACAAGCAGAATTATGGGGGCGACAGAATAGCAAAACTGCACAATACTTGGATTCTGATTGACGAAAAATTGATTTCATGTTCATAAAAGATAATCACATACCAGGGAAGT
AGTTGATATCGATGACATAGAACTGATCTCTAGTCCCATACTCTCGAATGATATCCAGGTTGAATAAACGAAGCCCCTGAACAGAACAGAGATAACAGGACCATATGAGC
ACAATCTATTTTTCCCAGCACATGATTTGCCACAAGAAAGATGAGGAAAATGGGCAAGCCTACCAGTCTACGCCTGAGTTCCTTTGCTAGTCTCTCTAACAGAGGTCGTG
GAGGAAGTTCTACAGGAGAAAGTGAACACCAAGTTAAAGGTCACACTAAGAAATTGCGATTTTCCCAACAAAAAGATATTTTGCGTCCCAACGTTGGATATTGATTAGGG
TTCACACCTTAAACCTTTCCTTTAAAGTATTCTATCACACACATAATACACTTACCATGAACACAGTAACTGATCCTGATCTCTTGAAATAAAAGGGAGAAGAAGAAAAA
CAGATCAAAGGTGTTAAAGGACGATTCTTCAAATTTGATAACTTACCAGCAATACAAGGATCCAGATCGGCATCTTCAGCAGATGCCGCAGCATGAGAAACTCTTGGAAA
ACGATAAATACCAGCATTTTTCAAGACCTCCCACATGCTGACATTTGGTAAAGAAAAACGTCTGACCACCTTTATAGCTTCACCAACGATGAAAACCTTAAACATGACAC
CTCCTAGCACCATGAGTAATAGCAATTCAGAAATAAAATTTACATGACTCAAAACTATGGATGTAAGTTTGTAACATGGAATGATGGGAATAAAGGTTAATATTACTCAC
CATGGTTGACAAATTCCTGGAGGACAAGAGGAGGCTCGAGCTTTTTGAGGGAGTAATTGTCATAAGCTAAAGATAATTCATGGGACTTTTCACTTCCATCAGCAACCAAT
GGTTTTGCAACTGATACCAATATAAGAAAAAATGTAAGGATAAAGCGTAAATAAGAGTTACAATCGCAGGATATATTTTTGACTAGTTACAACAAGTCTTGGAGAGCATT
AAAAATAAGAATAAAGTTTTTTAGACACTTTATGACATTTCTTCATAGCCGAAAGAAGTATACATCGGCCTGTAGTAAACCACTCCTATCGAAGAATAGATAAAAGTATA
TCTAACAGAGATCACTTTACATTTTCTAAAAGGTGTCAGCTATAAGGTTCATTATCATGTAGATATACATACCAAGAGGAAGTTTTAGCCCGTCATTCACCACTGCATCA
GAAATGGAAGATGCATCTTTCATTATGACTAATTGTTTAGGAACATCTACTCTGCCTGTGAAGCAAAAAAAAGTCCATTTTCCTTCAGTACATTCCAGAGAGTAATATCC
CATATATATATGAGACTCAAGAAACATATTCTCAATAATTACCATAGGAGTCAGATAATTCCATGTCAGCAACTGCCTGAAGCATAGACTGGCGATTGTGTAAATGCTGT
ATCGCATCCGGAGGATCTAAAACAGTCACTTCTGGATGTGCTTGCCTATATTCCTACAACATATGCAAAGAGGGTAAGAGAAAATAATTAGAGTCTATGAGTTAGGAAAT
ATGTCAGAAATTTAGTACATCATGGTCTAATGGGCACCAAAAAGGCCGGGATGCAAGAACACAAGAAAGATGAATTTATGGCTACTGCAAAACAAAAGAACAACATAATT
CTTCCATTCCTTTGGTACACATCAAAAAAGTAGTTTCACTTGATTGAAACACATAACATACTGAATCTACCAAAGACATGATATGTAGTAATTAACAAAAGATGCTAGGA
ATGGATTCACGTAGACCAGAAATACAAACTACATATAAAAAATTAAATGCTAAATCACAAACCATCATCTGACCAGAATCAATTAAAATAACATTCATCATTCACAACCC
ACAATCGCTTAAAAGTCTCTAGTTACTCATCATGGGAATAAAGGAATTTGAAGTTAACACAATAACTCAGAAATAGAATAAAAATTAAAACTTTTGTTATTTTCAGACAT
TAGACATCCTTGCTATATACTTTGTGAATATTTAGGAATGGATGAGTTTGAGACCTTCAAACACATCATGCATAATTAAATAGAAGACGACAAGTTAAACTGGTTGCATA
TTTTCGGTCAAACTTGGATAAATCATAAAAAGTACAATTAGAGATATCAATAAAAAAATAACATTCATGATCCACAAATTTTAATAACTTTTTGACATCCTAGCTATACT
CATTGAGAAAATCTGGAGATGTATCAAAATGAGACTCTTCAAACAGACACAACTAAATTGTCAGATGATAAGTTCAACAGCTCGTGCATCTTCCAATAAAATGATGGTGA
AGTTAGAAGACCCTCAAACTTTCAGGAAGCTGAGAATTGTTCAGCACCTATAACCCGGCACAATCTTTTCCTTTTTCACAAATCATATGACCAGAACATGTGAGTTACTG
GAAGCTTACAGTTGAGATAAACGACAAGGAGGAGGAGGTGACTTACAGAACCTAATCCTATAATTGGAAGTTCTACAAAAATCTTAAATTAATCGGTTAGGCAACTCTGA
AGCTATCTGATGTTAATATTCAGATGGGTAACCTTATACCGTGGAGAGTGGAACCCATAAAAGTTAAAGCAAATCAGAAGTTGCCATTTTTTGTTTTGGCAAAAATTTCC
AAGAAAACCTGAAGAAATCAGCCATGTGGCAAGAAGTGGAGACATCCTCATTTGTTGGAGCCTTAAGGCCTCGACCTCAACCTCAACCTCAACTACTGAGTCATGATTCA
TACGTAAGAGAACCAGTCTGCACATTACGCCAGATGTGGATGAGCTTTATAATTCAAAACTAGTTAGCCGATTAGCCAATGTCAGTGATCATTCGACTCCAACAGCCATA
TGATATTAGATGTCGCATAGTACACAAGTAAGGCACACGATACTACATTCCAGGAGGTTCCAGTGTAATTTGTACACAATAATAAGAAGTCAAACAGAAACTAAATTTGA
ATAGTCAATAGAGGACCAACCTCGAGAATCTGACGCCACTCTTTTCCAGACAACTGTATGTAAGAAAAACAGAGAGAGAAAGGGGCTGTCAGAAAGATTTGAAGACCCAT
AAGCAATCGTACAAACTACTGGGACCCACAAAACGTAAAAAAATATATATATAAGTGGATGCAAGTAATCCACATGTTGGTGGAAGAAGGATTATGAACCCCACTTTGTG
ACAAAGACGTAGATAGTATTCCTACCAAAATTATCAGATTTCTTTCGAGGAAATTCTTCGATGGAATAGAGACTAACCTTGTGAAGGACGATGTCAAACGGCCCCTGATC
CGACAGAGGTCTGTTCTGATCAATTGCAACAAATAGAATCCCCTTATTCCTGCATTACCCAGGAACAAATTATAAGGTTAAGCACAATCACTTATTAATCTTAGAGTAAT
GGAGCTCATTTGGTCTTATTTTCATAACCCAATCTTAATTGTAGCCTCATATTCCACAGAAGACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGTT
GATTATTTTAATTTTTCCCTCTTTCCCCCCTCAAACGAAAAGGACGTAAGCGGCCCAGGAATATTCATCAACAGTCGGAAAACAGGCAGCCCCTTCCCTCCGGGGAAACA
AAAACAAAAAACCCAAATAATAAAAACGATTAGTTAGGGTTAAAAAAGAAAAGGCGATGGTGGGGTTGGTACCTAGCTAATCTTTCGAGCTTGGGTTTCATGAAGCTCTT
GGTCTTCTTGGAGGTGAGAGCATAGCCAACAACAACGAGGTTCCTGTGGTGGGGAAACCTGGATTGGTCCATTTCAGCCCCCCCTTCGTTGGCGTAATTTGGCAAATCCT
CCTTCAACCTCATGGTTGTTCGTTCCCCTGCTTGAGAAATCGGACGAGAAGGAAGACGGAGAGAAGGGGGGAAAATGCAGAGCAAGAAATGGGAAGGAGAGAGGATCTAG
AATAGAAAGAGAAACAACAAAAAATGGATGAGAGATGAGAGAGGAGAGAGGAGAGAGGAGAAACGAACGGTTGGCGTTGAGAAGCAGAGCAGAGGAAGAAGAGAGGCGCA
GCGCAAAATCACGAGAAGCAGAGATCGAAGGAACAACCCAGGAGATGAGGTGGGAACTGGAGAAGTCTTATTTATACACACCTTTTTTTTATTTTCCTTTTCCTTTTTCT
TCTTTCAGTTTCTCGATATATAAAATAAATAGAAAAAAAGGGAATGGTTTCTAGCTGTACTGTTATTTC
Protein sequenceShow/hide protein sequence
MGSLQNWHRNKQHISISLLLLSVSIFVDVISMAEGRGIPTPLMEGRKVEAAGSWPELPEKVGMIRSQIGSRPPSCRRRCRECGGHCEAIQVPVALHDSQQNQRKKRSSHF
PAAAATSAGASNKDMALSSEDETSNYKPISWKCKCGNFIFNP