CuGenDBv2

Moc04g33530 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g33530
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Protein of unknown function (DUF1997)
Genome location: chr4:25254052..25258317
RNA-Seq Expression: Moc04g33530
Synteny: Moc04g33530
Gene Ontology terms: NA
InterPro domains: IPR018971 - Protein of unknown function DUF1997


Homology
GenBank top hits (e value, % identity, alignment)
KAG6578788.1 hypothetical protein SDJN03_23236, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.6e-111 | % identity: 86.94
Query:  MALSSCSPGSIPLPYKNPR---KFAHRPYV-LASSADDSSRPLLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF
        MALSSCSP SI L  ++PR       RP+V LASSADDS RP LR SANSNPKARF+ARRSES TVRQL+RPLNEYMSLPASQYSVLDAERIER+DDCTF
Subjt:  MALSSCSPGSIPLPYKNPR---KFAHRPYV-LASSADDSSRPLLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF

Query:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
        RCYVYRFKFFAFEVCPVLIVRVE QPNGCCIKLLSCKLEGSPIV AQNDKFDA MVNQISYDVNRG+SPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
Subjt:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA

Query:  GTQVLEQILKIMLPRFTTQLVKDYQAWASGDTSRQPLGTVHHSSN
        GTQVLEQILK+MLPRFT QLVKDYQAWASGDTSRQPLGT + S+N
Subjt:  GTQVLEQILKIMLPRFTTQLVKDYQAWASGDTSRQPLGTVHHSSN

XP_022133875.1 uncharacterized protein LOC111006320 [Momordica charantia] | e value: 3.2e-128 | % identity: 100
Query:  MALSSCSPGSIPLPYKNPRKFAHRPYVLASSADDSSRPLLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTFRCYV
        MALSSCSPGSIPLPYKNPRKFAHRPYVLASSADDSSRPLLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTFRCYV
Subjt:  MALSSCSPGSIPLPYKNPRKFAHRPYVLASSADDSSRPLLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTFRCYV

Query:  YRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESAGTQV
        YRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESAGTQV
Subjt:  YRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESAGTQV

Query:  LEQILKIMLPRFTTQLVKDYQAWASGDTSRQPLGT
        LEQILKIMLPRFTTQLVKDYQAWASGDTSRQPLGT
Subjt:  LEQILKIMLPRFTTQLVKDYQAWASGDTSRQPLGT

XP_022939661.1 uncharacterized protein LOC111445487 isoform X1 [Cucurbita moschata] | e value: 5.5e-112 | % identity: 87.35
Query:  MALSSCSPGSIPLPYKNPR---KFAHRPYV-LASSADDSSRPLLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF
        MALSSCSP SI L  ++PR       RP+V LASSADDS RP LR SANSNPKARF+ARRSES TVRQLARPLNEYMSLPASQYSVLDAERIER+DDCTF
Subjt:  MALSSCSPGSIPLPYKNPR---KFAHRPYV-LASSADDSSRPLLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF

Query:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
        RCYVYRFKFFAFEVCPVLIVRVE QPNGCCIKLLSCKLEGSPIV AQNDKFDA MVNQISYDVNRG+SPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
Subjt:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA

Query:  GTQVLEQILKIMLPRFTTQLVKDYQAWASGDTSRQPLGTVHHSSN
        GTQVLEQILK+MLPRFT QLVKDYQAWASGDTSRQPLGT + S+N
Subjt:  GTQVLEQILKIMLPRFTTQLVKDYQAWASGDTSRQPLGTVHHSSN

XP_022992977.1 uncharacterized protein LOC111489142 isoform X1 [Cucurbita maxima] | e value: 9.4e-112 | % identity: 87.35
Query:  MALSSCSPGSIPLPYKNPR---KFAHRPYV-LASSADDSSRPLLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF
        MALSSCSP SI L  ++PR       RP+V LASSADDS RP LR SANSNPKARF+ARRSES TVRQLARPLNEYMSLPASQYSVLDAERIER+DDCTF
Subjt:  MALSSCSPGSIPLPYKNPR---KFAHRPYV-LASSADDSSRPLLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF

Query:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
        RCYVYRFKFFAFEVCPVLIVRVE QPNGCCIKLLSCKLEGSPIV AQNDKFDA MVNQISYDVNRG+SPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
Subjt:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA

Query:  GTQVLEQILKIMLPRFTTQLVKDYQAWASGDTSRQPLGTVHHSSN
        GTQVLEQILK+MLPRFT QLVKDYQAWASGDTSRQPLGT + S+N
Subjt:  GTQVLEQILKIMLPRFTTQLVKDYQAWASGDTSRQPLGTVHHSSN

XP_038883978.1 uncharacterized protein SYNPCC7002_A1590 isoform X1 [Benincasa hispida] | e value: 3.2e-112 | % identity: 89.54
Query:  MALSSCSPGSIPLPYKNPR---KFAHRPY-VLASSADDSSRPLLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF
        MALSSCSP SI L  KNPR      HRP+ +LASSADDS RP LR S NSNPKARFIARRSES TVRQLARPLNEYMSLPASQYSVLDAERIERVDD TF
Subjt:  MALSSCSPGSIPLPYKNPR---KFAHRPY-VLASSADDSSRPLLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF

Query:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
        RCYVYRFKFFAFEVCPVLIVRVE QPNGCCIKLLSCKLEGSPIV AQNDKFDAYMVNQISYDVNRGNSPLQKLTS+TVIEVNIEIPFAFRAIP+QAIESA
Subjt:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA

Query:  GTQVLEQILKIMLPRFTTQLVKDYQAWASGDTSRQPLGT
        GTQVLEQILK+MLPRFT QLVKDYQAWASGDTSRQPLGT
Subjt:  GTQVLEQILKIMLPRFTTQLVKDYQAWASGDTSRQPLGT

TrEMBL top hits (e value, % identity, alignment)
A0A1S3C468 uncharacterized protein SYNPCC7002_A1590 isoform X1 | e value: 1.3e-111 | % identity: 87.45
Query:  MALSSCSPGSIPLPYKN---PRKFAHRPY-VLASSADDSSRPLLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF
        MA SSCSP SI L YKN   P  F H+P+ +LASSA+DS+RP L  S NSNPKARF+ARRSES TVRQL RPLNEYMSLPASQYSVLDAERIER+DDCTF
Subjt:  MALSSCSPGSIPLPYKN---PRKFAHRPY-VLASSADDSSRPLLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF

Query:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
        RCYVYRFKFFAFEVCPVL+VRVE QPNGCCIKLLSCKLEGSPIV AQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
Subjt:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA

Query:  GTQVLEQILKIMLPRFTTQLVKDYQAWASGDTSRQPLGT
        GTQVLEQILK+MLPRF  QLVKDYQAWASGDTSRQPLGT
Subjt:  GTQVLEQILKIMLPRFTTQLVKDYQAWASGDTSRQPLGT

A0A5D3BDM7 Uncharacterized protein | e value: 1.3e-111 | % identity: 87.45
Query:  MALSSCSPGSIPLPYKN---PRKFAHRPY-VLASSADDSSRPLLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF
        MA SSCSP SI L YKN   P  F H+P+ +LASSA+DS+RP L  S NSNPKARF+ARRSES TVRQL RPLNEYMSLPASQYSVLDAERIER+DDCTF
Subjt:  MALSSCSPGSIPLPYKN---PRKFAHRPY-VLASSADDSSRPLLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF

Query:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
        RCYVYRFKFFAFEVCPVL+VRVE QPNGCCIKLLSCKLEGSPIV AQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
Subjt:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA

Query:  GTQVLEQILKIMLPRFTTQLVKDYQAWASGDTSRQPLGT
        GTQVLEQILK+MLPRF  QLVKDYQAWASGDTSRQPLGT
Subjt:  GTQVLEQILKIMLPRFTTQLVKDYQAWASGDTSRQPLGT

A0A6J1BWG8 uncharacterized protein LOC111006320 | e value: 1.6e-128 | % identity: 100
Query:  MALSSCSPGSIPLPYKNPRKFAHRPYVLASSADDSSRPLLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTFRCYV
        MALSSCSPGSIPLPYKNPRKFAHRPYVLASSADDSSRPLLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTFRCYV
Subjt:  MALSSCSPGSIPLPYKNPRKFAHRPYVLASSADDSSRPLLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTFRCYV

Query:  YRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESAGTQV
        YRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESAGTQV
Subjt:  YRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESAGTQV

Query:  LEQILKIMLPRFTTQLVKDYQAWASGDTSRQPLGT
        LEQILKIMLPRFTTQLVKDYQAWASGDTSRQPLGT
Subjt:  LEQILKIMLPRFTTQLVKDYQAWASGDTSRQPLGT

A0A6J1FGJ8 uncharacterized protein LOC111445487 isoform X1 | e value: 2.7e-112 | % identity: 87.35
Query:  MALSSCSPGSIPLPYKNPR---KFAHRPYV-LASSADDSSRPLLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF
        MALSSCSP SI L  ++PR       RP+V LASSADDS RP LR SANSNPKARF+ARRSES TVRQLARPLNEYMSLPASQYSVLDAERIER+DDCTF
Subjt:  MALSSCSPGSIPLPYKNPR---KFAHRPYV-LASSADDSSRPLLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF

Query:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
        RCYVYRFKFFAFEVCPVLIVRVE QPNGCCIKLLSCKLEGSPIV AQNDKFDA MVNQISYDVNRG+SPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
Subjt:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA

Query:  GTQVLEQILKIMLPRFTTQLVKDYQAWASGDTSRQPLGTVHHSSN
        GTQVLEQILK+MLPRFT QLVKDYQAWASGDTSRQPLGT + S+N
Subjt:  GTQVLEQILKIMLPRFTTQLVKDYQAWASGDTSRQPLGTVHHSSN

A0A6J1JRG4 uncharacterized protein LOC111489142 isoform X1 | e value: 4.6e-112 | % identity: 87.35
Query:  MALSSCSPGSIPLPYKNPR---KFAHRPYV-LASSADDSSRPLLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF
        MALSSCSP SI L  ++PR       RP+V LASSADDS RP LR SANSNPKARF+ARRSES TVRQLARPLNEYMSLPASQYSVLDAERIER+DDCTF
Subjt:  MALSSCSPGSIPLPYKNPR---KFAHRPYV-LASSADDSSRPLLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTF

Query:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
        RCYVYRFKFFAFEVCPVLIVRVE QPNGCCIKLLSCKLEGSPIV AQNDKFDA MVNQISYDVNRG+SPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA
Subjt:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESA

Query:  GTQVLEQILKIMLPRFTTQLVKDYQAWASGDTSRQPLGTVHHSSN
        GTQVLEQILK+MLPRFT QLVKDYQAWASGDTSRQPLGT + S+N
Subjt:  GTQVLEQILKIMLPRFTTQLVKDYQAWASGDTSRQPLGTVHHSSN

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT4G31115.1 Protein of unknown function (DUF1997) | e value: 1.8e-20 | % identity: 33.12
Query:  NEYMSLPASQYSVLDAERIER---VDDC--TFRCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNS
        +E++  P+   +V++A+ ++    VDD   T+RC + + +  +FEV PVL++RV      C ++LLSCKLEGS ++  Q+++F A M N +++++     
Subjt:  NEYMSLPASQYSVLDAERIER---VDDC--TFRCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNS

Query:  PLQKLTSDTVIEVNIEIPF-AFRAIPVQAIESAGTQVLEQILKIMLPRFTTQLVKDYQAW
        P   L  D  + V +EI    F  +PV A+E+ G  V++ ++  ++P    QL+KDY  W
Subjt:  PLQKLTSDTVIEVNIEIPF-AFRAIPVQAIESAGTQVLEQILKIMLPRFTTQLVKDYQAW

AT4G31115.2 Protein of unknown function (DUF1997) | e value: 1.8e-20 | % identity: 33.12
Query:  NEYMSLPASQYSVLDAERIER---VDDC--TFRCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNS
        +E++  P+   +V++A+ ++    VDD   T+RC + + +  +FEV PVL++RV      C ++LLSCKLEGS ++  Q+++F A M N +++++     
Subjt:  NEYMSLPASQYSVLDAERIER---VDDC--TFRCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNS

Query:  PLQKLTSDTVIEVNIEIPF-AFRAIPVQAIESAGTQVLEQILKIMLPRFTTQLVKDYQAW
        P   L  D  + V +EI    F  +PV A+E+ G  V++ ++  ++P    QL+KDY  W
Subjt:  PLQKLTSDTVIEVNIEIPF-AFRAIPVQAIESAGTQVLEQILKIMLPRFTTQLVKDYQAW

AT5G04440.1 Protein of unknown function (DUF1997) | e value: 8.0e-85 | % identity: 75.85
Query:  ASSADDSSRPLLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTFRCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIK
        +S A  SS   +R S++S PKARFIAR+ +S +VRQL RPL EYMSLPASQYSVLDAERIERVDD TFRCYVY FKFF FEVCPVL+VRVEEQPNGCCIK
Subjt:  ASSADDSSRPLLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTFRCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIK

Query:  LLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESAGTQVLEQILKIMLPRFTTQLVKDYQAWASGDT
        LLSCKLEGSP+V AQNDKFDA MVN++S D  +  S  Q++TSD VIEVNIEIPFAFR  PV AIE+ GTQVL+QILK+MLPRF +QL KDY AWASGDT
Subjt:  LLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESAGTQVLEQILKIMLPRFTTQLVKDYQAWASGDT

Query:  SRQPLGT
        SRQPLGT
Subjt:  SRQPLGT


Sequences
CDS sequence
ATGGCGTTGAGTTCCTGCTCGCCAGGTTCCATTCCACTCCCCTACAAAAACCCTAGAAAGTTTGCGCACAGACCATATGTTCTCGCTTCTTCTGCGGACGATTCTTCCAG
GCCGTTGCTTCGCACCTCCGCCAATTCGAATCCAAAAGCGCGCTTCATTGCCCGGAGAAGCGAATCCGCCACCGTTCGGCAGCTGGCGCGGCCTCTAAATGAGTATATGA
GCTTGCCGGCTAGTCAGTACTCGGTGTTGGATGCAGAGAGGATCGAGCGGGTTGACGATTGCACCTTTAGGTGCTACGTTTATAGATTTAAGTTCTTTGCGTTTGAGGTT
TGCCCTGTTTTGATTGTTCGAGTTGAAGAGCAGCCCAATGGGTGTTGCATCAAGCTGCTGTCATGCAAGCTTGAGGGCTCACCGATTGTGGCTGCACAGAATGACAAATT
TGACGCTTATATGGTGAACCAGATTTCTTATGATGTCAATCGAGGCAACTCACCCTTGCAGAAACTCACGTCAGATACTGTCATTGAGGTTAACATCGAGATTCCTTTCG
CCTTCCGTGCAATTCCCGTACAAGCAATCGAATCAGCTGGGACCCAAGTCCTCGAACAAATATTGAAGATCATGCTTCCTCGCTTCACAACCCAGCTCGTGAAGGATTAT
CAAGCATGGGCCTCTGGTGATACATCAAGGCAACCTCTTGGAACAGTTCATCATTCATCAAACTGTCGTGTAATTGCGGGGAAGAAGAAAGCCAATGTAGATTTTGTATT
TGCTGCAGCCGATGGGATGAGGGGAGCCGGAGGAAGAAGGTGA
mRNA sequence
ATGGCGTTGAGTTCCTGCTCGCCAGGTTCCATTCCACTCCCCTACAAAAACCCTAGAAAGTTTGCGCACAGACCATATGTTCTCGCTTCTTCTGCGGACGATTCTTCCAG
GCCGTTGCTTCGCACCTCCGCCAATTCGAATCCAAAAGCGCGCTTCATTGCCCGGAGAAGCGAATCCGCCACCGTTCGGCAGCTGGCGCGGCCTCTAAATGAGTATATGA
GCTTGCCGGCTAGTCAGTACTCGGTGTTGGATGCAGAGAGGATCGAGCGGGTTGACGATTGCACCTTTAGGTGCTACGTTTATAGATTTAAGTTCTTTGCGTTTGAGGTT
TGCCCTGTTTTGATTGTTCGAGTTGAAGAGCAGCCCAATGGGTGTTGCATCAAGCTGCTGTCATGCAAGCTTGAGGGCTCACCGATTGTGGCTGCACAGAATGACAAATT
TGACGCTTATATGGTGAACCAGATTTCTTATGATGTCAATCGAGGCAACTCACCCTTGCAGAAACTCACGTCAGATACTGTCATTGAGGTTAACATCGAGATTCCTTTCG
CCTTCCGTGCAATTCCCGTACAAGCAATCGAATCAGCTGGGACCCAAGTCCTCGAACAAATATTGAAGATCATGCTTCCTCGCTTCACAACCCAGCTCGTGAAGGATTAT
CAAGCATGGGCCTCTGGTGATACATCAAGGCAACCTCTTGGAACAGTTCATCATTCATCAAACTGTCGTGTAATTGCGGGGAAGAAGAAAGCCAATGTAGATTTTGTATT
TGCTGCAGCCGATGGGATGAGGGGAGCCGGAGGAAGAAGGTGA
Protein sequence
MALSSCSPGSIPLPYKNPRKFAHRPYVLASSADDSSRPLLRTSANSNPKARFIARRSESATVRQLARPLNEYMSLPASQYSVLDAERIERVDDCTFRCYVYRFKFFAFEV
CPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRAIPVQAIESAGTQVLEQILKIMLPRFTTQLVKDY
QAWASGDTSRQPLGTVHHSSNCRVIAGKKKANVDFVFAAADGMRGAGGRR