; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr022659 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr022659
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function (DUF1997)
Genome locationtig00000289:2071436..2077173
RNA-Seq ExpressionSgr022659
SyntenySgr022659
Gene Ontology termsNA
InterPro domainsIPR018971 - Protein of unknown function DUF1997


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578788.1 hypothetical protein SDJN03_23236, partial [Cucurbita argyrosperma subsp. sororia]5.6e-11792.89Show/hide
Query:  MALSSCSPGSIPLHCKSPRASFSLRRRPYV-LASSADDSPRPSLRFSANSNPKARFVARRSESVTVRQLSRPLNEYMSLPASQYSVLDAERIERVDDSTF
        MALSSCSP SI LH +SPR SFS+ RRP+V LASSADDSPRPSLR SANSNPKARFVARRSESVTVRQLSRPLNEYMSLPASQYSVLDAERIER+DD TF
Subjt:  MALSSCSPGSIPLHCKSPRASFSLRRRPYV-LASSADDSPRPSLRFSANSNPKARFVARRSESVTVRQLSRPLNEYMSLPASQYSVLDAERIERVDDSTF

Query:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRPIPVQAIESA
        RCYVYRFKFFAFEVCPVLIVRVE QPNGCCIKLLSCKLEGSPIV AQNDKFDA MVNQISYDVNRG+SPLQKLTSDTVIEVNIEIPFAFR IPVQAIESA
Subjt:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRPIPVQAIESA

Query:  GTQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT
        GTQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT
Subjt:  GTQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT

XP_022939662.1 uncharacterized protein LOC111445487 isoform X2 [Cucurbita moschata]1.2e-11692.47Show/hide
Query:  MALSSCSPGSIPLHCKSPRASFSLRRRPYV-LASSADDSPRPSLRFSANSNPKARFVARRSESVTVRQLSRPLNEYMSLPASQYSVLDAERIERVDDSTF
        MALSSCSP SI LH +SPR SFS+ RRP+V LASSADDSPRPSLR SANSNPKARFVARRSESVTVRQL+RPLNEYMSLPASQYSVLDAERIER+DD TF
Subjt:  MALSSCSPGSIPLHCKSPRASFSLRRRPYV-LASSADDSPRPSLRFSANSNPKARFVARRSESVTVRQLSRPLNEYMSLPASQYSVLDAERIERVDDSTF

Query:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRPIPVQAIESA
        RCYVYRFKFFAFEVCPVLIVRVE QPNGCCIKLLSCKLEGSPIV AQNDKFDA MVNQISYDVNRG+SPLQKLTSDTVIEVNIEIPFAFR IPVQAIESA
Subjt:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRPIPVQAIESA

Query:  GTQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT
        GTQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT
Subjt:  GTQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT

XP_022992977.1 uncharacterized protein LOC111489142 isoform X1 [Cucurbita maxima]5.6e-11792.47Show/hide
Query:  MALSSCSPGSIPLHCKSPRASFSLRRRPYV-LASSADDSPRPSLRFSANSNPKARFVARRSESVTVRQLSRPLNEYMSLPASQYSVLDAERIERVDDSTF
        MALSSCSP SI LH +SPRASFS+ RRP+V LASSADDSPRPSLR SANSNPKARFVARRSES+TVRQL+RPLNEYMSLPASQYSVLDAERIER+DD TF
Subjt:  MALSSCSPGSIPLHCKSPRASFSLRRRPYV-LASSADDSPRPSLRFSANSNPKARFVARRSESVTVRQLSRPLNEYMSLPASQYSVLDAERIERVDDSTF

Query:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRPIPVQAIESA
        RCYVYRFKFFAFEVCPVLIVRVE QPNGCCIKLLSCKLEGSPIV AQNDKFDA MVNQISYDVNRG+SPLQKLTSDTVIEVNIEIPFAFR IPVQAIESA
Subjt:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRPIPVQAIESA

Query:  GTQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT
        GTQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT
Subjt:  GTQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT

XP_022992978.1 uncharacterized protein LOC111489142 isoform X2 [Cucurbita maxima]5.6e-11792.47Show/hide
Query:  MALSSCSPGSIPLHCKSPRASFSLRRRPYV-LASSADDSPRPSLRFSANSNPKARFVARRSESVTVRQLSRPLNEYMSLPASQYSVLDAERIERVDDSTF
        MALSSCSP SI LH +SPRASFS+ RRP+V LASSADDSPRPSLR SANSNPKARFVARRSES+TVRQL+RPLNEYMSLPASQYSVLDAERIER+DD TF
Subjt:  MALSSCSPGSIPLHCKSPRASFSLRRRPYV-LASSADDSPRPSLRFSANSNPKARFVARRSESVTVRQLSRPLNEYMSLPASQYSVLDAERIERVDDSTF

Query:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRPIPVQAIESA
        RCYVYRFKFFAFEVCPVLIVRVE QPNGCCIKLLSCKLEGSPIV AQNDKFDA MVNQISYDVNRG+SPLQKLTSDTVIEVNIEIPFAFR IPVQAIESA
Subjt:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRPIPVQAIESA

Query:  GTQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT
        GTQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT
Subjt:  GTQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT

XP_038883978.1 uncharacterized protein SYNPCC7002_A1590 isoform X1 [Benincasa hispida]9.5e-11791.21Show/hide
Query:  MALSSCSPGSIPLHCKSPRASFSLRRRPY-VLASSADDSPRPSLRFSANSNPKARFVARRSESVTVRQLSRPLNEYMSLPASQYSVLDAERIERVDDSTF
        MALSSCSP SI L CK+PR  FSL  RP+ +LASSADDSPRPSLR S NSNPKARF+ARRSESVTVRQL+RPLNEYMSLPASQYSVLDAERIERVDD+TF
Subjt:  MALSSCSPGSIPLHCKSPRASFSLRRRPY-VLASSADDSPRPSLRFSANSNPKARFVARRSESVTVRQLSRPLNEYMSLPASQYSVLDAERIERVDDSTF

Query:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRPIPVQAIESA
        RCYVYRFKFFAFEVCPVLIVRVE QPNGCCIKLLSCKLEGSPIV AQNDKFDAYMVNQISYDVNRGNSPLQKLTS+TVIEVNIEIPFAFR IP+QAIESA
Subjt:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRPIPVQAIESA

Query:  GTQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT
        GTQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT
Subjt:  GTQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT

TrEMBL top hitse value%identityAlignment
A0A6J1BWG8 uncharacterized protein LOC1110063203.3e-11591.6Show/hide
Query:  MALSSCSPGSIPLHCKSPRASFSLRRRPYVLASSADDSPRPSLRFSANSNPKARFVARRSESVTVRQLSRPLNEYMSLPASQYSVLDAERIERVDDSTFR
        MALSSCSPGSIPL  K+PR       RPYVLASSADDS RP LR SANSNPKARF+ARRSES TVRQL+RPLNEYMSLPASQYSVLDAERIERVDD TFR
Subjt:  MALSSCSPGSIPLHCKSPRASFSLRRRPYVLASSADDSPRPSLRFSANSNPKARFVARRSESVTVRQLSRPLNEYMSLPASQYSVLDAERIERVDDSTFR

Query:  CYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRPIPVQAIESAG
        CYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFR IPVQAIESAG
Subjt:  CYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRPIPVQAIESAG

Query:  TQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT
        TQVLEQILK+MLPRFT QLVKDYQAWASGDTSRQPLGT
Subjt:  TQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT

A0A6J1FGJ8 uncharacterized protein LOC111445487 isoform X16.0e-11792.47Show/hide
Query:  MALSSCSPGSIPLHCKSPRASFSLRRRPYV-LASSADDSPRPSLRFSANSNPKARFVARRSESVTVRQLSRPLNEYMSLPASQYSVLDAERIERVDDSTF
        MALSSCSP SI LH +SPR SFS+ RRP+V LASSADDSPRPSLR SANSNPKARFVARRSESVTVRQL+RPLNEYMSLPASQYSVLDAERIER+DD TF
Subjt:  MALSSCSPGSIPLHCKSPRASFSLRRRPYV-LASSADDSPRPSLRFSANSNPKARFVARRSESVTVRQLSRPLNEYMSLPASQYSVLDAERIERVDDSTF

Query:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRPIPVQAIESA
        RCYVYRFKFFAFEVCPVLIVRVE QPNGCCIKLLSCKLEGSPIV AQNDKFDA MVNQISYDVNRG+SPLQKLTSDTVIEVNIEIPFAFR IPVQAIESA
Subjt:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRPIPVQAIESA

Query:  GTQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT
        GTQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT
Subjt:  GTQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT

A0A6J1FHF9 uncharacterized protein LOC111445487 isoform X26.0e-11792.47Show/hide
Query:  MALSSCSPGSIPLHCKSPRASFSLRRRPYV-LASSADDSPRPSLRFSANSNPKARFVARRSESVTVRQLSRPLNEYMSLPASQYSVLDAERIERVDDSTF
        MALSSCSP SI LH +SPR SFS+ RRP+V LASSADDSPRPSLR SANSNPKARFVARRSESVTVRQL+RPLNEYMSLPASQYSVLDAERIER+DD TF
Subjt:  MALSSCSPGSIPLHCKSPRASFSLRRRPYV-LASSADDSPRPSLRFSANSNPKARFVARRSESVTVRQLSRPLNEYMSLPASQYSVLDAERIERVDDSTF

Query:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRPIPVQAIESA
        RCYVYRFKFFAFEVCPVLIVRVE QPNGCCIKLLSCKLEGSPIV AQNDKFDA MVNQISYDVNRG+SPLQKLTSDTVIEVNIEIPFAFR IPVQAIESA
Subjt:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRPIPVQAIESA

Query:  GTQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT
        GTQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT
Subjt:  GTQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT

A0A6J1JRG4 uncharacterized protein LOC111489142 isoform X12.7e-11792.47Show/hide
Query:  MALSSCSPGSIPLHCKSPRASFSLRRRPYV-LASSADDSPRPSLRFSANSNPKARFVARRSESVTVRQLSRPLNEYMSLPASQYSVLDAERIERVDDSTF
        MALSSCSP SI LH +SPRASFS+ RRP+V LASSADDSPRPSLR SANSNPKARFVARRSES+TVRQL+RPLNEYMSLPASQYSVLDAERIER+DD TF
Subjt:  MALSSCSPGSIPLHCKSPRASFSLRRRPYV-LASSADDSPRPSLRFSANSNPKARFVARRSESVTVRQLSRPLNEYMSLPASQYSVLDAERIERVDDSTF

Query:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRPIPVQAIESA
        RCYVYRFKFFAFEVCPVLIVRVE QPNGCCIKLLSCKLEGSPIV AQNDKFDA MVNQISYDVNRG+SPLQKLTSDTVIEVNIEIPFAFR IPVQAIESA
Subjt:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRPIPVQAIESA

Query:  GTQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT
        GTQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT
Subjt:  GTQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT

A0A6J1K0U6 uncharacterized protein LOC111489142 isoform X22.7e-11792.47Show/hide
Query:  MALSSCSPGSIPLHCKSPRASFSLRRRPYV-LASSADDSPRPSLRFSANSNPKARFVARRSESVTVRQLSRPLNEYMSLPASQYSVLDAERIERVDDSTF
        MALSSCSP SI LH +SPRASFS+ RRP+V LASSADDSPRPSLR SANSNPKARFVARRSES+TVRQL+RPLNEYMSLPASQYSVLDAERIER+DD TF
Subjt:  MALSSCSPGSIPLHCKSPRASFSLRRRPYV-LASSADDSPRPSLRFSANSNPKARFVARRSESVTVRQLSRPLNEYMSLPASQYSVLDAERIERVDDSTF

Query:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRPIPVQAIESA
        RCYVYRFKFFAFEVCPVLIVRVE QPNGCCIKLLSCKLEGSPIV AQNDKFDA MVNQISYDVNRG+SPLQKLTSDTVIEVNIEIPFAFR IPVQAIESA
Subjt:  RCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRPIPVQAIESA

Query:  GTQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT
        GTQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT
Subjt:  GTQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G31115.1 Protein of unknown function (DUF1997)8.0e-2131.38Show/hide
Query:  NSNPKARFVARRSESVTVR---QLSRPLNEYMSLPASQYSVLDAERIER---VDDS--TFRCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEG
        +S  KA   A R + + +    +     +E++  P+   +V++A+ ++    VDDS  T+RC + + +  +FEV PVL++RV      C ++LLSCKLEG
Subjt:  NSNPKARFVARRSESVTVR---QLSRPLNEYMSLPASQYSVLDAERIER---VDDS--TFRCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEG

Query:  SPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPF-AFRPIPVQAIESAGTQVLEQILKLMLPRFTAQLVKDYQAW
        S ++  Q+++F A M N +++++     P   L  D  + V +EI    F  +PV A+E+ G  V++ ++  ++P    QL+KDY  W
Subjt:  SPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPF-AFRPIPVQAIESAGTQVLEQILKLMLPRFTAQLVKDYQAW

AT4G31115.2 Protein of unknown function (DUF1997)8.0e-2131.38Show/hide
Query:  NSNPKARFVARRSESVTVR---QLSRPLNEYMSLPASQYSVLDAERIER---VDDS--TFRCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEG
        +S  KA   A R + + +    +     +E++  P+   +V++A+ ++    VDDS  T+RC + + +  +FEV PVL++RV      C ++LLSCKLEG
Subjt:  NSNPKARFVARRSESVTVR---QLSRPLNEYMSLPASQYSVLDAERIER---VDDS--TFRCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEG

Query:  SPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPF-AFRPIPVQAIESAGTQVLEQILKLMLPRFTAQLVKDYQAW
        S ++  Q+++F A M N +++++     P   L  D  + V +EI    F  +PV A+E+ G  V++ ++  ++P    QL+KDY  W
Subjt:  SPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPF-AFRPIPVQAIESAGTQVLEQILKLMLPRFTAQLVKDYQAW

AT5G04440.1 Protein of unknown function (DUF1997)6.4e-8770.39Show/hide
Query:  KSPRASFSLRRRPYVLASSADDSPRPS----------LRFSANSNPKARFVARRSESVTVRQLSRPLNEYMSLPASQYSVLDAERIERVDDSTFRCYVYR
        ++P  SF++       +SS D+SP+PS          +R S++S PKARF+AR+ +SV+VRQL RPL EYMSLPASQYSVLDAERIERVDD+TFRCYVY 
Subjt:  KSPRASFSLRRRPYVLASSADDSPRPS----------LRFSANSNPKARFVARRSESVTVRQLSRPLNEYMSLPASQYSVLDAERIERVDDSTFRCYVYR

Query:  FKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRPIPVQAIESAGTQVLE
        FKFF FEVCPVL+VRVEEQPNGCCIKLLSCKLEGSP+V AQNDKFDA MVN++S D  +  S  Q++TSD VIEVNIEIPFAFR  PV AIE+ GTQVL+
Subjt:  FKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTSDTVIEVNIEIPFAFRPIPVQAIESAGTQVLE

Query:  QILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT
        QILKLMLPRF +QL KDY AWASGDTSRQPLGT
Subjt:  QILKLMLPRFTAQLVKDYQAWASGDTSRQPLGT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAACGATTGAAAAACGCTGTGCCAGTTGCTATTGTCTTATCCATCTCCCACAGTCCCCCTGCTTTCTTGCTACACCGAAGCCTCAAAAGTCGATATCCATCAGAATT
GCTCTCAGGAAGAACCTCGAACGCTTTAATGGCGTTGAGTTCTTGCTCTCCGGGCTCCATTCCACTCCACTGCAAAAGCCCTAGAGCTTCTTTTTCTCTCAGGCGCAGAC
CTTACGTGCTCGCTTCCTCTGCGGACGATTCTCCAAGGCCTTCGCTTCGCTTCTCCGCGAATTCGAATCCAAAAGCACGGTTCGTTGCCCGGAGAAGCGAGTCGGTTACT
GTTCGGCAGTTGTCGCGGCCCCTAAATGAGTATATGAGCTTGCCGGCTAGTCAGTACTCGGTGTTGGATGCGGAGAGGATCGAGCGGGTAGATGATAGTACCTTTAGGTG
CTATGTCTATAGATTTAAGTTCTTCGCTTTTGAGGTTTGCCCTGTTCTGATTGTTAGAGTTGAAGAGCAGCCCAATGGGTGTTGTATCAAGCTGCTGTCCTGCAAGCTCG
AGGGCTCGCCAATCGTGGCTGCACAGAATGATAAATTTGACGCTTATATGGTGAACCAGATTTCTTATGATGTCAATCGAGGCAATTCACCCTTGCAGAAACTCACATCA
GATACTGTCATTGAGGTTAACATTGAGATTCCTTTCGCCTTCCGTCCAATTCCTGTACAAGCAATTGAATCAGCTGGGACCCAAGTCCTCGAACAAATATTGAAGCTTAT
GCTTCCCCGCTTCACAGCCCAGCTCGTGAAGGACTATCAAGCATGGGCCTCTGGTGATACATCAAGGCAACCTCTCGGAACAGCAGGTTTTGTTCTACCAGCCTCCGAAA
AGGATCAAAACTCTTGCGACACATCAGCTTCCACGAAATCTCCCACTTGGGTTTATCAAGACCGTGCAGTTGCTGCAGGCGGTCATGAGACTCCTCAGTTTCGCCGGAGG
GAGAAGCCTGTAACTCAATCCATAGAGGAGAAGACTTATAAGCATCCAAATGCTCATTCTTATCGTTTAGCAATGAACTATGCACAAGAAGCTGTTAGCTGCCAATTTAT
TCCCAAGAAACTGGATCTTCAACAAGGGATCAGGTTCTTTGACTTGGGAACTTCCATTGAAACAGTAAATTTGTTTGGCATCATTCTGAAACACAATAAGATCCAGATAC
CTTTAAGCTCTGTGTATTATCACAGGTTGAACCGCATGTGTCATAGAACTCTGAATCATGAGAAAGTTCCCCATGGACGGACTCTTCATGATTCTGCTTTTCTTCTTTTT
GCTGTCTTAGCTCCTCTTCAAGCACTTGAACTCTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAACGATTGAAAAACGCTGTGCCAGTTGCTATTGTCTTATCCATCTCCCACAGTCCCCCTGCTTTCTTGCTACACCGAAGCCTCAAAAGTCGATATCCATCAGAATT
GCTCTCAGGAAGAACCTCGAACGCTTTAATGGCGTTGAGTTCTTGCTCTCCGGGCTCCATTCCACTCCACTGCAAAAGCCCTAGAGCTTCTTTTTCTCTCAGGCGCAGAC
CTTACGTGCTCGCTTCCTCTGCGGACGATTCTCCAAGGCCTTCGCTTCGCTTCTCCGCGAATTCGAATCCAAAAGCACGGTTCGTTGCCCGGAGAAGCGAGTCGGTTACT
GTTCGGCAGTTGTCGCGGCCCCTAAATGAGTATATGAGCTTGCCGGCTAGTCAGTACTCGGTGTTGGATGCGGAGAGGATCGAGCGGGTAGATGATAGTACCTTTAGGTG
CTATGTCTATAGATTTAAGTTCTTCGCTTTTGAGGTTTGCCCTGTTCTGATTGTTAGAGTTGAAGAGCAGCCCAATGGGTGTTGTATCAAGCTGCTGTCCTGCAAGCTCG
AGGGCTCGCCAATCGTGGCTGCACAGAATGATAAATTTGACGCTTATATGGTGAACCAGATTTCTTATGATGTCAATCGAGGCAATTCACCCTTGCAGAAACTCACATCA
GATACTGTCATTGAGGTTAACATTGAGATTCCTTTCGCCTTCCGTCCAATTCCTGTACAAGCAATTGAATCAGCTGGGACCCAAGTCCTCGAACAAATATTGAAGCTTAT
GCTTCCCCGCTTCACAGCCCAGCTCGTGAAGGACTATCAAGCATGGGCCTCTGGTGATACATCAAGGCAACCTCTCGGAACAGCAGGTTTTGTTCTACCAGCCTCCGAAA
AGGATCAAAACTCTTGCGACACATCAGCTTCCACGAAATCTCCCACTTGGGTTTATCAAGACCGTGCAGTTGCTGCAGGCGGTCATGAGACTCCTCAGTTTCGCCGGAGG
GAGAAGCCTGTAACTCAATCCATAGAGGAGAAGACTTATAAGCATCCAAATGCTCATTCTTATCGTTTAGCAATGAACTATGCACAAGAAGCTGTTAGCTGCCAATTTAT
TCCCAAGAAACTGGATCTTCAACAAGGGATCAGGTTCTTTGACTTGGGAACTTCCATTGAAACAGTAAATTTGTTTGGCATCATTCTGAAACACAATAAGATCCAGATAC
CTTTAAGCTCTGTGTATTATCACAGGTTGAACCGCATGTGTCATAGAACTCTGAATCATGAGAAAGTTCCCCATGGACGGACTCTTCATGATTCTGCTTTTCTTCTTTTT
GCTGTCTTAGCTCCTCTTCAAGCACTTGAACTCTCTTGA
Protein sequenceShow/hide protein sequence
MQRLKNAVPVAIVLSISHSPPAFLLHRSLKSRYPSELLSGRTSNALMALSSCSPGSIPLHCKSPRASFSLRRRPYVLASSADDSPRPSLRFSANSNPKARFVARRSESVT
VRQLSRPLNEYMSLPASQYSVLDAERIERVDDSTFRCYVYRFKFFAFEVCPVLIVRVEEQPNGCCIKLLSCKLEGSPIVAAQNDKFDAYMVNQISYDVNRGNSPLQKLTS
DTVIEVNIEIPFAFRPIPVQAIESAGTQVLEQILKLMLPRFTAQLVKDYQAWASGDTSRQPLGTAGFVLPASEKDQNSCDTSASTKSPTWVYQDRAVAAGGHETPQFRRR
EKPVTQSIEEKTYKHPNAHSYRLAMNYAQEAVSCQFIPKKLDLQQGIRFFDLGTSIETVNLFGIILKHNKIQIPLSSVYYHRLNRMCHRTLNHEKVPHGRTLHDSAFLLF
AVLAPLQALELS