CuGenDBv2

MS010105 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS010105
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Protein of unknown function (DUF1997)
Genome location: scaffold779:683424..686170
RNA-Seq Expression: MS010105
Synteny: MS010105
Gene Ontology terms: NA
InterPro domains: IPR018971 - Protein of unknown function DUF1997


Homology
GenBank top hits (e value, % identity, alignment)
XP_008456550.1 PREDICTED: uncharacterized protein SYNPCC7002_A1590 isoform X1 [Cucumis melo] (e value: 1.2e-113; % identity: 88.02)
Query:  MALSSCSPGSIPLPYKN---PRKFAHRPY-VLASSADDSSRPSLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF
        MA SSCSP SI L YKN   P  F H+P+ +LASSA+DS+RPSL  S NSNPKARF+ARRSES TVRQL RPLNEYMSLPASQYSVLDAERIER+DDCTF
Subjt:  MALSSCSPGSIPLPYKN---PRKFAHRPY-VLASSADDSSRPSLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF

Query:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
        RCYVYRFKFFAFEVCPVL+VRVE QPNGCCIKLLSCKLEGSPIV AQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
Subjt:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA

Query:  GTQVLEQILKIMLPRFTAQLVKDYQAWASGDTSRQPLGTGKI
        GTQVLEQILK+MLPRF AQLVKDYQAWASGDTSRQPLGTG+I
Subjt:  GTQVLEQILKIMLPRFTAQLVKDYQAWASGDTSRQPLGTGKI

XP_022133875.1 uncharacterized protein LOC111006320 [Momordica charantia] (e value: 1.3e-128; % identity: 99.16)
Query:  MALSSCSPGSIPLPYKNPRKFAHRPYVLASSADDSSRPSLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTFRCYV
        MALSSCSPGSIPLPYKNPRKFAHRPYVLASSADDSSRP LRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTFRCYV
Subjt:  MALSSCSPGSIPLPYKNPRKFAHRPYVLASSADDSSRPSLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTFRCYV

Query:  YRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESAGTQV
        YRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESAGTQV
Subjt:  YRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESAGTQV

Query:  LEQILKIMLPRFTAQLVKDYQAWASGDTSRQPLGTGKI
        LEQILKIMLPRFT QLVKDYQAWASGDTSRQPLGTGKI
Subjt:  LEQILKIMLPRFTAQLVKDYQAWASGDTSRQPLGTGKI

XP_022939662.1 uncharacterized protein LOC111445487 isoform X2 [Cucurbita moschata] (e value: 3.4e-113; % identity: 89.26)
Query:  MALSSCSPGSIPLPYKNPR---KFAHRPYV-LASSADDSSRPSLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF
        MALSSCSP SI L  ++PR       RP+V LASSADDS RPSLR SANSNPKARF+ARRSES TVRQLARPLNEYMSLPASQYSVLDAERIER+DDCTF
Subjt:  MALSSCSPGSIPLPYKNPR---KFAHRPYV-LASSADDSSRPSLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF

Query:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
        RCYVYRFKFFAFEVCPVLIVRVE QPNGCCIKLLSCKLEGSPIV AQNDKFDA MVNQISYDVNRG+SPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
Subjt:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA

Query:  GTQVLEQILKIMLPRFTAQLVKDYQAWASGDTSRQPLGTGKI
        GTQVLEQILK+MLPRFTAQLVKDYQAWASGDTSRQPLGTG+I
Subjt:  GTQVLEQILKIMLPRFTAQLVKDYQAWASGDTSRQPLGTGKI

XP_022992978.1 uncharacterized protein LOC111489142 isoform X2 [Cucurbita maxima] (e value: 4.4e-113; % identity: 89.26)
Query:  MALSSCSPGSIPLPYKNPR---KFAHRPYV-LASSADDSSRPSLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF
        MALSSCSP SI L  ++PR       RP+V LASSADDS RPSLR SANSNPKARF+ARRSES TVRQLARPLNEYMSLPASQYSVLDAERIER+DDCTF
Subjt:  MALSSCSPGSIPLPYKNPR---KFAHRPYV-LASSADDSSRPSLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF

Query:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
        RCYVYRFKFFAFEVCPVLIVRVE QPNGCCIKLLSCKLEGSPIV AQNDKFDA MVNQISYDVNRG+SPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
Subjt:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA

Query:  GTQVLEQILKIMLPRFTAQLVKDYQAWASGDTSRQPLGTGKI
        GTQVLEQILK+MLPRFTAQLVKDYQAWASGDTSRQPLGTG+I
Subjt:  GTQVLEQILKIMLPRFTAQLVKDYQAWASGDTSRQPLGTGKI

XP_038883978.1 uncharacterized protein SYNPCC7002_A1590 isoform X1 [Benincasa hispida] (e value: 1.4e-114; % identity: 90.08)
Query:  MALSSCSPGSIPLPYKNPR---KFAHRPY-VLASSADDSSRPSLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF
        MALSSCSP SI L  KNPR      HRP+ +LASSADDS RPSLR S NSNPKARFIARRSES TVRQLARPLNEYMSLPASQYSVLDAERIERVDD TF
Subjt:  MALSSCSPGSIPLPYKNPR---KFAHRPY-VLASSADDSSRPSLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF

Query:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
        RCYVYRFKFFAFEVCPVLIVRVE QPNGCCIKLLSCKLEGSPIV AQNDKFDAYMVNQISYDVNRGNSPLQKLTS+TVIEVNIEIPFAFRAIP+QAIESA
Subjt:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA

Query:  GTQVLEQILKIMLPRFTAQLVKDYQAWASGDTSRQPLGTGKI
        GTQVLEQILK+MLPRFTAQLVKDYQAWASGDTSRQPLGTG+I
Subjt:  GTQVLEQILKIMLPRFTAQLVKDYQAWASGDTSRQPLGTGKI

TrEMBL top hits (e value, % identity, alignment)
A0A1S3C468 uncharacterized protein SYNPCC7002_A1590 isoform X1 (e value: 5.6e-114; % identity: 88.02)
Query:  MALSSCSPGSIPLPYKN---PRKFAHRPY-VLASSADDSSRPSLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF
        MA SSCSP SI L YKN   P  F H+P+ +LASSA+DS+RPSL  S NSNPKARF+ARRSES TVRQL RPLNEYMSLPASQYSVLDAERIER+DDCTF
Subjt:  MALSSCSPGSIPLPYKN---PRKFAHRPY-VLASSADDSSRPSLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF

Query:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
        RCYVYRFKFFAFEVCPVL+VRVE QPNGCCIKLLSCKLEGSPIV AQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
Subjt:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA

Query:  GTQVLEQILKIMLPRFTAQLVKDYQAWASGDTSRQPLGTGKI
        GTQVLEQILK+MLPRF AQLVKDYQAWASGDTSRQPLGTG+I
Subjt:  GTQVLEQILKIMLPRFTAQLVKDYQAWASGDTSRQPLGTGKI

A0A5D3BDM7 Uncharacterized protein (e value: 5.6e-114; % identity: 88.02)
Query:  MALSSCSPGSIPLPYKN---PRKFAHRPY-VLASSADDSSRPSLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF
        MA SSCSP SI L YKN   P  F H+P+ +LASSA+DS+RPSL  S NSNPKARF+ARRSES TVRQL RPLNEYMSLPASQYSVLDAERIER+DDCTF
Subjt:  MALSSCSPGSIPLPYKN---PRKFAHRPY-VLASSADDSSRPSLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF

Query:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
        RCYVYRFKFFAFEVCPVL+VRVE QPNGCCIKLLSCKLEGSPIV AQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
Subjt:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA

Query:  GTQVLEQILKIMLPRFTAQLVKDYQAWASGDTSRQPLGTGKI
        GTQVLEQILK+MLPRF AQLVKDYQAWASGDTSRQPLGTG+I
Subjt:  GTQVLEQILKIMLPRFTAQLVKDYQAWASGDTSRQPLGTGKI

A0A6J1BWG8 uncharacterized protein LOC111006320 (e value: 6.2e-129; % identity: 99.16)
Query:  MALSSCSPGSIPLPYKNPRKFAHRPYVLASSADDSSRPSLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTFRCYV
        MALSSCSPGSIPLPYKNPRKFAHRPYVLASSADDSSRP LRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTFRCYV
Subjt:  MALSSCSPGSIPLPYKNPRKFAHRPYVLASSADDSSRPSLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTFRCYV

Query:  YRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESAGTQV
        YRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESAGTQV
Subjt:  YRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESAGTQV

Query:  LEQILKIMLPRFTAQLVKDYQAWASGDTSRQPLGTGKI
        LEQILKIMLPRFT QLVKDYQAWASGDTSRQPLGTGKI
Subjt:  LEQILKIMLPRFTAQLVKDYQAWASGDTSRQPLGTGKI

A0A6J1FHF9 uncharacterized protein LOC111445487 isoform X2 (e value: 1.6e-113; % identity: 89.26)
Query:  MALSSCSPGSIPLPYKNPR---KFAHRPYV-LASSADDSSRPSLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF
        MALSSCSP SI L  ++PR       RP+V LASSADDS RPSLR SANSNPKARF+ARRSES TVRQLARPLNEYMSLPASQYSVLDAERIER+DDCTF
Subjt:  MALSSCSPGSIPLPYKNPR---KFAHRPYV-LASSADDSSRPSLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF

Query:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
        RCYVYRFKFFAFEVCPVLIVRVE QPNGCCIKLLSCKLEGSPIV AQNDKFDA MVNQISYDVNRG+SPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
Subjt:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA

Query:  GTQVLEQILKIMLPRFTAQLVKDYQAWASGDTSRQPLGTGKI
        GTQVLEQILK+MLPRFTAQLVKDYQAWASGDTSRQPLGTG+I
Subjt:  GTQVLEQILKIMLPRFTAQLVKDYQAWASGDTSRQPLGTGKI

A0A6J1K0U6 uncharacterized protein LOC111489142 isoform X2 (e value: 2.1e-113; % identity: 89.26)
Query:  MALSSCSPGSIPLPYKNPR---KFAHRPYV-LASSADDSSRPSLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF
        MALSSCSP SI L  ++PR       RP+V LASSADDS RPSLR SANSNPKARF+ARRSES TVRQLARPLNEYMSLPASQYSVLDAERIER+DDCTF
Subjt:  MALSSCSPGSIPLPYKNPR---KFAHRPYV-LASSADDSSRPSLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF

Query:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
        RCYVYRFKFFAFEVCPVLIVRVE QPNGCCIKLLSCKLEGSPIV AQNDKFDA MVNQISYDVNRG+SPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
Subjt:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA

Query:  GTQVLEQILKIMLPRFTAQLVKDYQAWASGDTSRQPLGTGKI
        GTQVLEQILK+MLPRFTAQLVKDYQAWASGDTSRQPLGTG+I
Subjt:  GTQVLEQILKIMLPRFTAQLVKDYQAWASGDTSRQPLGTGKI

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT4G31115.1 Protein of unknown function (DUF1997) (e value: 1.6e-20; % identity: 33.12)
Query:  NEYMSLPASQYSVLDAERIER---VDDC--TFRCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNS
        +E++  P+   +V++A+ ++    VDD   T+RC + + +  +FEV PVL++RV      C ++LLSCKLEGS ++  Q+++F A M N +++++     
Subjt:  NEYMSLPASQYSVLDAERIER---VDDC--TFRCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNS

Query:  PLQKLTSDTVIEVNIEIPF-AFRAIPVQAIESAGTQVLEQILKIMLPRFTAQLVKDYQAW
        P   L  D  + V +EI    F  +PV A+E+ G  V++ ++  ++P    QL+KDY  W
Subjt:  PLQKLTSDTVIEVNIEIPF-AFRAIPVQAIESAGTQVLEQILKIMLPRFTAQLVKDYQAW

AT4G31115.2 Protein of unknown function (DUF1997) (e value: 1.6e-20; % identity: 33.12)
Query:  NEYMSLPASQYSVLDAERIER---VDDC--TFRCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNS
        +E++  P+   +V++A+ ++    VDD   T+RC + + +  +FEV PVL++RV      C ++LLSCKLEGS ++  Q+++F A M N +++++     
Subjt:  NEYMSLPASQYSVLDAERIER---VDDC--TFRCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNS

Query:  PLQKLTSDTVIEVNIEIPF-AFRAIPVQAIESAGTQVLEQILKIMLPRFTAQLVKDYQAW
        P   L  D  + V +EI    F  +PV A+E+ G  V++ ++  ++P    QL+KDY  W
Subjt:  PLQKLTSDTVIEVNIEIPF-AFRAIPVQAIESAGTQVLEQILKIMLPRFTAQLVKDYQAW

AT5G04440.1 Protein of unknown function (DUF1997) (e value: 7.5e-87; % identity: 67.06)
Query:  MALSSCSPGSIPLPY------KNPRKFAHRPYVLASSA-DDSSRPS----------LRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVL
        MALSS +     L +      +NPR  A    + +SS+ D+S +PS          +R S++S PKARFIAR+ +S +VRQL RPL EYMSLPASQYSVL
Subjt:  MALSSCSPGSIPLPY------KNPRKFAHRPYVLASSA-DDSSRPS----------LRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVL

Query:  DAERIERVDDCTFRCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPF
        DAERIERVDD TFRCYVY FKFF FEVCPVL+VRVEEQPNGCCIKLLSCKLEGSP+V AQNDKFDA MVN++S D  +  S  Q++TSD VIEVNIEIPF
Subjt:  DAERIERVDDCTFRCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPF

Query:  AFRAIPVQAIESAGTQVLEQILKIMLPRFTAQLVKDYQAWASGDTSRQPLGTGKI
        AFR  PV AIE+ GTQVL+QILK+MLPRF +QL KDY AWASGDTSRQPLGTG+I
Subjt:  AFRAIPVQAIESAGTQVLEQILKIMLPRFTAQLVKDYQAWASGDTSRQPLGTGKI
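
The % identity values listed with each hit can be related to the alignment blocks: in the middle line of each block, a letter marks an identical residue, "+" a similar one, and a space a mismatch or gap. The sketch below is only an illustration of that arithmetic, not the database's own method. It assumes identity is counted as identical columns divided by all alignment columns (gaps included); the function name `percent_identity` is hypothetical.

```python
def percent_identity(query_blocks, midline_blocks):
    """Identical columns / total alignment columns * 100.

    query_blocks   -- the residue strings from the "Query:" lines
    midline_blocks -- the matching middle lines (letters = identical)
    Assumption: gap columns ("-") count toward the alignment length.
    """
    columns = sum(len(q) for q in query_blocks)
    identical = sum(ch.isalpha() for m in midline_blocks for ch in m)
    return 100.0 * identical / columns


# Last block of the XP_022133875.1 alignment: one mismatch (the space).
query = ["LEQILKIMLPRFTAQLVKDYQAWASGDTSRQPLGTGKI"]
midline = ["LEQILKIMLPRFT QLVKDYQAWASGDTSRQPLGTGKI"]
print(round(percent_identity(query, midline), 2))
```

Counted this way over all three blocks of the XP_022133875.1 alignment, 236 of 238 columns are identical, which rounds to the 99.16 reported for that hit.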


Sequences
CDS sequence
ATGGCGTTGAGTTCCTGCTCGCCAGGTTCCATTCCACTCCCCTACAAAAACCCTAGAAAGTTTGCGCACAGACCATATGTTCTCGCTTCTTCTGCGGACGATTCTTCCAG
GCCGTCGCTTCGCACCTCCGCCAATTCGAATCCAAAAGCGCGCTTCATTGCCCGGAGAAGCGAATCCGCCACCGTTCGGCAGCTGGCGCGGCCTCTAAATGAGTATATGA
GCTTGCCGGCTAGTCAGTACTCGGTGTTGGATGCAGAGAGGATCGAGCGGGTTGACGATTGCACCTTTAGGTGCTACGTTTATAGATTTAAGTTCTTTGCGTTTGAGGTT
TGCCCTGTTTTGATTGTTCGAGTTGAAGAGCAGCCCAATGGGTGTTGCATCAAGCTGCTGTCATGCAAGCTTGAGGGCTCACCGATTGTGGCTGCACAGAATGACAAATT
TGACGCTTATATGGTGAACCAGATTTCTTATGATGTCAATCGAGGCAACTCACCCTTGCAGAAACTCACGTCAGATACTGTCATTGAGGTTAACATCGAGATTCCTTTCG
CCTTCCGTGCAATTCCTGTACAAGCAATTGAATCAGCTGGGACCCAAGTCCTCGAACAAATATTGAAGATCATGCTTCCTCGCTTCACAGCCCAGCTCGTGAAGGATTAT
CAAGCATGGGCCTCTGGTGATACATCAAGGCAACCTCTTGGAACAGGTAAGATC
mRNA sequence
ATGGCGTTGAGTTCCTGCTCGCCAGGTTCCATTCCACTCCCCTACAAAAACCCTAGAAAGTTTGCGCACAGACCATATGTTCTCGCTTCTTCTGCGGACGATTCTTCCAG
GCCGTCGCTTCGCACCTCCGCCAATTCGAATCCAAAAGCGCGCTTCATTGCCCGGAGAAGCGAATCCGCCACCGTTCGGCAGCTGGCGCGGCCTCTAAATGAGTATATGA
GCTTGCCGGCTAGTCAGTACTCGGTGTTGGATGCAGAGAGGATCGAGCGGGTTGACGATTGCACCTTTAGGTGCTACGTTTATAGATTTAAGTTCTTTGCGTTTGAGGTT
TGCCCTGTTTTGATTGTTCGAGTTGAAGAGCAGCCCAATGGGTGTTGCATCAAGCTGCTGTCATGCAAGCTTGAGGGCTCACCGATTGTGGCTGCACAGAATGACAAATT
TGACGCTTATATGGTGAACCAGATTTCTTATGATGTCAATCGAGGCAACTCACCCTTGCAGAAACTCACGTCAGATACTGTCATTGAGGTTAACATCGAGATTCCTTTCG
CCTTCCGTGCAATTCCTGTACAAGCAATTGAATCAGCTGGGACCCAAGTCCTCGAACAAATATTGAAGATCATGCTTCCTCGCTTCACAGCCCAGCTCGTGAAGGATTAT
CAAGCATGGGCCTCTGGTGATACATCAAGGCAACCTCTTGGAACAGGTAAGATC
Protein sequence
MALSSCSPGSIPLPYKNPRKFAHRPYVLASSADDSSRPSLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTFRCYVYRFKFFAFEV
CPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESAGTQVLEQILKIMLPRFTAQLVKDY
QAWASGDTSRQPLGTGKI
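
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard genetic code; the helper name `translate` and the choice to mark unknown codons as "X" are illustrative and not part of the database record.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varying fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a DNA coding sequence in frame 1.

    Whitespace is removed, trailing bases that do not fill a codon
    are ignored, and unrecognised codons are reported as "X".
    """
    cds = "".join(cds.split()).upper()
    usable = len(cds) - len(cds) % 3
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 27 bases of the CDS listed above.
cds_start = "ATGGCGTTGAGTTCCTGCTCGCCAGGT"
print(translate(cds_start))
```

Passing the full CDS, with its lines joined, to `translate` and comparing the result with the protein sequence is a quick consistency check on the record.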