; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G9387 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G9387
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionUsp domain-containing protein
Genome locationctg1672:64954..66690
RNA-Seq ExpressionCucsat.G9387
SyntenyCucsat.G9387
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004150443.1 uncharacterized protein LOC101206721 [Cucumis sativus]3.92e-15399.12Show/hide
Query:  MPSTESFLRQISSRRGEGSSRSTSRRWGGEFRRTEGEERVSEGSFWHQNMEGGGVNSMHGIDNGGMSRRKRVMVVVDHTSQSNHATMWALTHLANKGDVL
        MPSTESFLRQISSRRGEGSSRSTSRRWGGEFRRTEGEERVSEGSFWHQ ME GGVNSMHGIDNGGMSRRKRVMVVVDHTSQSNHATMWALTHLANKGDVL
Subjt:  MPSTESFLRQISSRRGEGSSRSTSRRWGGEFRRTEGEERVSEGSFWHQNMEGGGVNSMHGIDNGGMSRRKRVMVVVDHTSQSNHATMWALTHLANKGDVL

Query:  TLLHVITNSSTDSSSAADSASSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVVGQRRPSLFSCFCGSGGAGDLVEQCINNAECLT
        TLLHVITNSSTDSSSAADSASSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVVGQRRPSLFSCFCGSGGAGDLVEQCINNAECLT
Subjt:  TLLHVITNSSTDSSSAADSASSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVVGQRRPSLFSCFCGSGGAGDLVEQCINNAECLT

Query:  IGVRKQSRDMGGYVINTRWQKNFWLLA
        IGVRKQSRDMGGYVINTRWQKNFWLLA
Subjt:  IGVRKQSRDMGGYVINTRWQKNFWLLA

XP_008466707.1 PREDICTED: uncharacterized protein LOC103504053 [Cucumis melo]1.77e-14796.49Show/hide
Query:  MPSTESFLRQISSRRGEGSSRSTSRRWGGEFRRTEGEERVSEGSFWHQNMEGGGVNSMHGIDNGGMSRRKRVMVVVDHTSQSNHATMWALTHLANKGDVL
        MPSTESFLRQISSRRGEGSSRSTSRRWGGEFR +EGEERVSEGSFW+Q MEGGGVNSM+GIDNGGMSRRKRVMVVVDHTSQS+HATMWALTHLANKGDVL
Subjt:  MPSTESFLRQISSRRGEGSSRSTSRRWGGEFRRTEGEERVSEGSFWHQNMEGGGVNSMHGIDNGGMSRRKRVMVVVDHTSQSNHATMWALTHLANKGDVL

Query:  TLLHVITNSSTDSSS-AADSASSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVVGQRRPSLFSCFCGSGGAGDLVEQCINNAECL
        TLLHVITNSSTDSSS AADS+SSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVVGQRRPSLFSCFCGSGGAGDLVEQCINNAECL
Subjt:  TLLHVITNSSTDSSS-AADSASSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVVGQRRPSLFSCFCGSGGAGDLVEQCINNAECL

Query:  TIGVRKQSRDMGGYVINTRWQKNFWLLA
        TIGVRKQSRDMGGYVINTRWQKNFWLLA
Subjt:  TIGVRKQSRDMGGYVINTRWQKNFWLLA

XP_022922915.1 uncharacterized protein LOC111430748 [Cucurbita moschata]1.52e-12485.09Show/hide
Query:  MPSTESFLRQISSRRGEGSSRSTSRRWGGEFRRTEG-EERVSEGSFWHQNMEGGGVNSMHGIDNGGMSRRKRVMVVVDHTSQSNHATMWALTHLANKGDV
        MP  ESFLRQIS  RGEG SRSTS+RWGGEFRR+   EERVSEGS W++ MEGG VN M+G+D+GG+SRRKRVMVVVDHTSQSNHATMWALTH+ANKGDV
Subjt:  MPSTESFLRQISSRRGEGSSRSTSRRWGGEFRRTEG-EERVSEGSFWHQNMEGGGVNSMHGIDNGGMSRRKRVMVVVDHTSQSNHATMWALTHLANKGDV

Query:  LTLLHVITNSSTDSSSAADSASSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVVGQRRPSLFSCFCGSGGAGDLVEQCINNAECL
        LTLLH+ITN+STDSSS++ S+S FCA+SLGSLCKASRPEVEVEVLVIEGP+L+TVMNQVKKLEVSVLVVGQRRPS  SCFCGSGGAGDLVEQCINNAECL
Subjt:  LTLLHVITNSSTDSSSAADSASSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVVGQRRPSLFSCFCGSGGAGDLVEQCINNAECL

Query:  TIGVRKQSRDMGGYVINTRWQKNFWLLA
        TIGVRKQSRDMGGYVINTRWQ+NFWLLA
Subjt:  TIGVRKQSRDMGGYVINTRWQKNFWLLA

XP_023550879.1 uncharacterized protein LOC111808882 [Cucurbita pepo subsp. pepo]1.06e-12585.28Show/hide
Query:  MPSTESFLRQISSRRGEGSSRSTSRRWGGEFRRTEGEERVSEGSFWHQNMEGGGVNSMHGIDNGGMSRRKRVMVVVDHTSQSNHATMWALTHLANKGDVL
        MPSTESFLRQIS R G   SRS SRRWGGEFRR+ GEE VSEG  W+Q MEGG VN+M+GIDNGGMSR+KRVMVVVD TSQSNHATMWALTH+ANKGDVL
Subjt:  MPSTESFLRQISSRRGEGSSRSTSRRWGGEFRRTEGEERVSEGSFWHQNMEGGGVNSMHGIDNGGMSRRKRVMVVVDHTSQSNHATMWALTHLANKGDVL

Query:  TLLHVITNSST---DSSSAADSASSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVVGQRRPS-LFSCFCGSGGAGDLVEQCINNA
        TLLHVIT+S++   DSSS++ S+SSFCA+SLGSLCKASRPEVEVEVLV+EGPKLATVMNQVKKLEVSVLV+GQRRPS  FSCFCGSGGAGDLVEQCINNA
Subjt:  TLLHVITNSST---DSSSAADSASSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVVGQRRPS-LFSCFCGSGGAGDLVEQCINNA

Query:  ECLTIGVRKQSRDMGGYVINTRWQKNFWLLA
        ECLTIGVRKQSRDMGGYVINTRWQKNFWLLA
Subjt:  ECLTIGVRKQSRDMGGYVINTRWQKNFWLLA

XP_038884412.1 uncharacterized protein LOC120075267 [Benincasa hispida]5.79e-13290Show/hide
Query:  MPSTESFLRQISSRRGEGSSRSTSRRWGGEFRRTEGEERVSEGSFWHQNMEGGGVNSMHGIDNGGMSRRKRVMVVVDHTSQSNHATMWALTHLANKGDVL
        MPSTESF+RQIS RRGEG SRSTSRRWGGEFRR+EGEERVSEG+ W+Q MEGG VN M+GIDNGGMSRRKRVMVVVD+TSQSNHATMWALTH+ANKGDVL
Subjt:  MPSTESFLRQISSRRGEGSSRSTSRRWGGEFRRTEGEERVSEGSFWHQNMEGGGVNSMHGIDNGGMSRRKRVMVVVDHTSQSNHATMWALTHLANKGDVL

Query:  TLLHVITNSSTDSSSAADSASS---FCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVVGQRRPSLFSCFCGSGGAGDLVEQCINNAE
        TLLHVITNSSTDSSSAADS+SS   FCA+SLGSLCKASRPEVEVEVLVIEGPKLATV+NQVKKLEVSVLVVGQR+PS  SCFCGSGGAGDLVEQCINN E
Subjt:  TLLHVITNSSTDSSSAADSASS---FCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVVGQRRPSLFSCFCGSGGAGDLVEQCINNAE

Query:  CLTIGVRKQSRDMGGYVINTRWQKNFWLLA
        CLTIGVRKQSRDMGGYVINTRWQKNFWLLA
Subjt:  CLTIGVRKQSRDMGGYVINTRWQKNFWLLA

TrEMBL top hitse value%identityAlignment
A0A0A0KDD8 Usp domain-containing protein1.90e-15399.12Show/hide
Query:  MPSTESFLRQISSRRGEGSSRSTSRRWGGEFRRTEGEERVSEGSFWHQNMEGGGVNSMHGIDNGGMSRRKRVMVVVDHTSQSNHATMWALTHLANKGDVL
        MPSTESFLRQISSRRGEGSSRSTSRRWGGEFRRTEGEERVSEGSFWHQ ME GGVNSMHGIDNGGMSRRKRVMVVVDHTSQSNHATMWALTHLANKGDVL
Subjt:  MPSTESFLRQISSRRGEGSSRSTSRRWGGEFRRTEGEERVSEGSFWHQNMEGGGVNSMHGIDNGGMSRRKRVMVVVDHTSQSNHATMWALTHLANKGDVL

Query:  TLLHVITNSSTDSSSAADSASSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVVGQRRPSLFSCFCGSGGAGDLVEQCINNAECLT
        TLLHVITNSSTDSSSAADSASSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVVGQRRPSLFSCFCGSGGAGDLVEQCINNAECLT
Subjt:  TLLHVITNSSTDSSSAADSASSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVVGQRRPSLFSCFCGSGGAGDLVEQCINNAECLT

Query:  IGVRKQSRDMGGYVINTRWQKNFWLLA
        IGVRKQSRDMGGYVINTRWQKNFWLLA
Subjt:  IGVRKQSRDMGGYVINTRWQKNFWLLA

A0A1S3CRX0 uncharacterized protein LOC1035040538.59e-14896.49Show/hide
Query:  MPSTESFLRQISSRRGEGSSRSTSRRWGGEFRRTEGEERVSEGSFWHQNMEGGGVNSMHGIDNGGMSRRKRVMVVVDHTSQSNHATMWALTHLANKGDVL
        MPSTESFLRQISSRRGEGSSRSTSRRWGGEFR +EGEERVSEGSFW+Q MEGGGVNSM+GIDNGGMSRRKRVMVVVDHTSQS+HATMWALTHLANKGDVL
Subjt:  MPSTESFLRQISSRRGEGSSRSTSRRWGGEFRRTEGEERVSEGSFWHQNMEGGGVNSMHGIDNGGMSRRKRVMVVVDHTSQSNHATMWALTHLANKGDVL

Query:  TLLHVITNSSTDSSS-AADSASSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVVGQRRPSLFSCFCGSGGAGDLVEQCINNAECL
        TLLHVITNSSTDSSS AADS+SSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVVGQRRPSLFSCFCGSGGAGDLVEQCINNAECL
Subjt:  TLLHVITNSSTDSSS-AADSASSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVVGQRRPSLFSCFCGSGGAGDLVEQCINNAECL

Query:  TIGVRKQSRDMGGYVINTRWQKNFWLLA
        TIGVRKQSRDMGGYVINTRWQKNFWLLA
Subjt:  TIGVRKQSRDMGGYVINTRWQKNFWLLA

A0A6J1EA58 uncharacterized protein LOC1114307487.35e-12585.09Show/hide
Query:  MPSTESFLRQISSRRGEGSSRSTSRRWGGEFRRTEG-EERVSEGSFWHQNMEGGGVNSMHGIDNGGMSRRKRVMVVVDHTSQSNHATMWALTHLANKGDV
        MP  ESFLRQIS  RGEG SRSTS+RWGGEFRR+   EERVSEGS W++ MEGG VN M+G+D+GG+SRRKRVMVVVDHTSQSNHATMWALTH+ANKGDV
Subjt:  MPSTESFLRQISSRRGEGSSRSTSRRWGGEFRRTEG-EERVSEGSFWHQNMEGGGVNSMHGIDNGGMSRRKRVMVVVDHTSQSNHATMWALTHLANKGDV

Query:  LTLLHVITNSSTDSSSAADSASSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVVGQRRPSLFSCFCGSGGAGDLVEQCINNAECL
        LTLLH+ITN+STDSSS++ S+S FCA+SLGSLCKASRPEVEVEVLVIEGP+L+TVMNQVKKLEVSVLVVGQRRPS  SCFCGSGGAGDLVEQCINNAECL
Subjt:  LTLLHVITNSSTDSSSAADSASSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVVGQRRPSLFSCFCGSGGAGDLVEQCINNAECL

Query:  TIGVRKQSRDMGGYVINTRWQKNFWLLA
        TIGVRKQSRDMGGYVINTRWQ+NFWLLA
Subjt:  TIGVRKQSRDMGGYVINTRWQKNFWLLA

A0A6J1FKL3 uncharacterized protein LOC1114448695.12e-12383.77Show/hide
Query:  MPSTESFLRQISSRRGEGSSRSTSRRWGGEFRRTEGEERVSEGSFWHQNMEGGGVNSMHGIDNGGMSRRKRVMVVVDHTSQSNHATMWALTHLANKGDVL
        M STESFLRQIS R G   SRS SRRWGGEFRR+ GEE VSEG  W Q MEGG VN+M+GIDNGGMSR+KRVMVVVD TSQSNHATMWALTH+ANKGDVL
Subjt:  MPSTESFLRQISSRRGEGSSRSTSRRWGGEFRRTEGEERVSEGSFWHQNMEGGGVNSMHGIDNGGMSRRKRVMVVVDHTSQSNHATMWALTHLANKGDVL

Query:  TLLHVITNSSTDSSSAADSASSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVVGQRRPS-LFSCFCGSGGAGDLVEQCINNAECL
        TLLHVIT+S++  + ++ S+SSFCA+SLGS+CKASRPEVEVEVLV+EGPKLATVMNQVKKLEVSVLV+GQRRPS  FSCFCGSGGAGDLVEQCINNAECL
Subjt:  TLLHVITNSSTDSSSAADSASSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVVGQRRPS-LFSCFCGSGGAGDLVEQCINNAECL

Query:  TIGVRKQSRDMGGYVINTRWQKNFWLLA
        TIGVRKQSRDMGGYVINTRWQKNFWLLA
Subjt:  TIGVRKQSRDMGGYVINTRWQKNFWLLA

A0A6J1JPP1 uncharacterized protein LOC1114877673.25e-12385.96Show/hide
Query:  MPSTESFLRQISSRRGEGSSRSTSRRWGGEFRRTEG-EERVSEGSFWHQNMEGGGVNSMHGIDNGGMSRRKRVMVVVDHTSQSNHATMWALTHLANKGDV
        MP  ESFLRQIS  RGEG SRSTS+RWGGEFRR+   EERVSEGS W++ MEGG VN M+G+D+GG+SRRKRVMVVVDHTSQSNHATMWALTH+ANKGDV
Subjt:  MPSTESFLRQISSRRGEGSSRSTSRRWGGEFRRTEG-EERVSEGSFWHQNMEGGGVNSMHGIDNGGMSRRKRVMVVVDHTSQSNHATMWALTHLANKGDV

Query:  LTLLHVITNSSTDSSSAADSASSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVVGQRRPSLFSCFCGSGGAGDLVEQCINNAECL
        LTLLHVITNSSTDSSS+  S+S FCA+SLGSLCKASRPEVEVEVLVIEGPKL TVMNQVKKLEVSVLV+GQRRPS  SCFCGSGGAGDLVEQCINNAECL
Subjt:  LTLLHVITNSSTDSSSAADSASSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVVGQRRPSLFSCFCGSGGAGDLVEQCINNAECL

Query:  TIGVRKQSRDMGGYVINTRWQKNFWLLA
        TIGVRKQSRDMGGYVINTRWQ+NFWLLA
Subjt:  TIGVRKQSRDMGGYVINTRWQKNFWLLA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G44760.1 Adenine nucleotide alpha hydrolases-like superfamily protein4.5e-5754.98Show/hide
Query:  STESFLRQISSRRGEGSSRSTSRRWGGEFRRTEGEERVSEGSFWHQNMEGGGVNSM---HGIDNGG--MSRRKRVMVVVDHTSQSNHATMWALTHLANKG
        S  S LRQ+S + G    RS S+RW      T G+   +    ++ +  GGG +SM   +G+ +GG   +R KRVMVVVD +S+S HA MWALTHL NKG
Subjt:  STESFLRQISSRRGEGSSRSTSRRWGGEFRRTEGEERVSEGSFWHQNMEGGGVNSM---HGIDNGG--MSRRKRVMVVVDHTSQSNHATMWALTHLANKG

Query:  DVLTLLHVITNSSTDSSSAADSASSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVVGQRRPS-LFSCFCGSGGAGDLVEQCINNA
        D++TLLHV+        S  D A+   A SLGSLCKA +PEV+VE LVI+GPKLATV++QVKKLEVSVLV+GQ++ + L SC CG   + +LV +CIN A
Subjt:  DVLTLLHVITNSSTDSSSAADSASSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVVGQRRPS-LFSCFCGSGGAGDLVEQCINNA

Query:  ECLTIGVRKQSRDMGGYVINTRWQKNFWLLA
        +CLTIGVRKQ + +GGY+INTRWQKNFWLLA
Subjt:  ECLTIGVRKQSRDMGGYVINTRWQKNFWLLA

AT1G69080.1 Adenine nucleotide alpha hydrolases-like superfamily protein1.1e-2132.78Show/hide
Query:  KRVMVVVDHTSQSNHATMWALTHLANKGDVLTLLHVI---TNSSTDSSSAADSASSFC-----------ASSLGSLCKASRPEVEVEVLVIEG-PKLATV
        +R++VVVD  S++ +A +W L+H A   D + LLH +   T+ S D ++  +     C            S+L ++C+  RPEV+ EV+ ++G  K  T+
Subjt:  KRVMVVVDHTSQSNHATMWALTHLANKGDVLTLLHVI---TNSSTDSSSAADSASSFC-----------ASSLGSLCKASRPEVEVEVLVIEG-PKLATV

Query:  MNQVKKLEVSVLVVGQRRP-------SLFSCFCGSGGAGDLVEQCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA
        + + ++ E S+LV+GQ++         +++         D VE CINN+ C+ I VRK+ + +GGY + T+  K+FWLLA
Subjt:  MNQVKKLEVSVLVVGQRRP-------SLFSCFCGSGGAGDLVEQCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA

AT1G69080.2 Adenine nucleotide alpha hydrolases-like superfamily protein2.4e-1831.33Show/hide
Query:  KRVMVVVDHTSQSNHATMWALTHLANKGDVLTLLHVITNSSTDSSSAADSASSFCASSLGSLCKASRPEVEVEVLVIEG-PKLATVMNQVKKLEVSVLVV
        +R++VVVD  S++ +A +W L+H A   D + LLH +   ++ S   A+       S        +  +V+ EV+ ++G  K  T++ + ++ E S+LV+
Subjt:  KRVMVVVDHTSQSNHATMWALTHLANKGDVLTLLHVITNSSTDSSSAADSASSFCASSLGSLCKASRPEVEVEVLVIEG-PKLATVMNQVKKLEVSVLVV

Query:  GQRRP-------SLFSCFCGSGGAGDLVEQCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA
        GQ++         +++         D VE CINN+ C+ I VRK+ + +GGY + T+  K+FWLLA
Subjt:  GQRRP-------SLFSCFCGSGGAGDLVEQCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA

AT2G03720.1 Adenine nucleotide alpha hydrolases-like superfamily protein2.4e-2137.35Show/hide
Query:  MVVVDHTSQSNHATMWALTHLANKGDVLTLLHV----ITNSSTDSSSAADSASSFCASSLGSLCKASRPEVEVEVLVIE--GPKLATVMNQVKKLEVSVL
        MVVVD TSQ+ +A  WALTH     D +TLLHV    +  +  ++    +S +      L + C+  +P V+ E++V+E    K  T++ + KK    VL
Subjt:  MVVVDHTSQSNHATMWALTHLANKGDVLTLLHV----ITNSSTDSSSAADSASSFCASSLGSLCKASRPEVEVEVLVIE--GPKLATVMNQVKKLEVSVL

Query:  VVGQRRPS-----LFSCFCGSGGAGDLVEQCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA
        V+GQR+ +     ++      G  G +VE CI+N++C+ I VRK+S + GGY+I T+  K+FWLLA
Subjt:  VVGQRRPS-----LFSCFCGSGGAGDLVEQCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA

AT5G17390.1 Adenine nucleotide alpha hydrolases-like superfamily protein1.1e-1835.71Show/hide
Query:  RVMVVVDHTSQSNHATMWALTHLANKGDVLTLLHVIT--NSSTDSSSAADSASSFCASSLGSLCKASRPEVEVEVLVIEG---PKLATVMNQVKKLEVSV
        RVMVVVD    S  A  WA+TH     D L LL+       S   +   +  +     +L  LC+  RP +EVE+  +EG    K   ++ + KK +VS+
Subjt:  RVMVVVDHTSQSNHATMWALTHLANKGDVLTLLHVIT--NSSTDSSSAADSASSFCASSLGSLCKASRPEVEVEVLVIEG---PKLATVMNQVKKLEVSV

Query:  LVVGQ-RRPSLFS-----CFCGSGGAGDLVEQCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA
        LVVGQ ++P ++       +    G   +++ C+ NA C+TI V+ ++R +GGY+I T+  KNFWLLA
Subjt:  LVVGQ-RRPSLFS-----CFCGSGGAGDLVEQCINNAECLTIGVRKQSRDMGGYVINTRWQKNFWLLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAAGTACGGAATCGTTTCTGAGGCAGATAAGTAGTAGAAGGGGGGAGGGATCTTCTAGATCAACTTCCAGGAGGTGGGGTGGGGAGTTTAGGAGAACTGAAGGTGA
AGAGCGTGTCAGTGAAGGGAGTTTCTGGCATCAGAACATGGAGGGCGGTGGTGTCAACAGTATGCATGGGATTGATAATGGTGGGATGTCAAGAAGGAAGAGGGTGATGG
TTGTGGTGGATCATACTTCACAATCTAACCATGCAACCATGTGGGCTCTCACTCATCTCGCTAACAAGGGGGATGTTCTTACTCTTCTTCATGTCATTACAAACTCCTCT
ACCGACTCTTCTTCTGCTGCAGACTCTGCTTCCTCTTTTTGTGCTAGCTCTCTTGGTTCTCTCTGTAAGGCTTCTAGACCTGAGGTCGAGGTGGAAGTGCTGGTAATTGA
GGGGCCGAAGCTAGCCACAGTGATGAACCAAGTCAAGAAACTAGAGGTGTCGGTGCTGGTTGTGGGACAGAGAAGGCCATCCTTATTCAGCTGCTTTTGTGGGAGTGGTG
GGGCGGGCGATTTGGTGGAACAGTGCATAAACAATGCAGAGTGTTTGACGATTGGTGTAAGGAAGCAGAGTAGAGACATGGGTGGGTATGTAATCAACACTAGATGGCAG
AAAAATTTCTGGCTCCTTGCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCAAGTACGGAATCGTTTCTGAGGCAGATAAGTAGTAGAAGGGGGGAGGGATCTTCTAGATCAACTTCCAGGAGGTGGGGTGGGGAGTTTAGGAGAACTGAAGGTGA
AGAGCGTGTCAGTGAAGGGAGTTTCTGGCATCAGAACATGGAGGGCGGTGGTGTCAACAGTATGCATGGGATTGATAATGGTGGGATGTCAAGAAGGAAGAGGGTGATGG
TTGTGGTGGATCATACTTCACAATCTAACCATGCAACCATGTGGGCTCTCACTCATCTCGCTAACAAGGGGGATGTTCTTACTCTTCTTCATGTCATTACAAACTCCTCT
ACCGACTCTTCTTCTGCTGCAGACTCTGCTTCCTCTTTTTGTGCTAGCTCTCTTGGTTCTCTCTGTAAGGCTTCTAGACCTGAGGTCGAGGTGGAAGTGCTGGTAATTGA
GGGGCCGAAGCTAGCCACAGTGATGAACCAAGTCAAGAAACTAGAGGTGTCGGTGCTGGTTGTGGGACAGAGAAGGCCATCCTTATTCAGCTGCTTTTGTGGGAGTGGTG
GGGCGGGCGATTTGGTGGAACAGTGCATAAACAATGCAGAGTGTTTGACGATTGGTGTAAGGAAGCAGAGTAGAGACATGGGTGGGTATGTAATCAACACTAGATGGCAG
AAAAATTTCTGGCTCCTTGCTTAA
Protein sequenceShow/hide protein sequence
MPSTESFLRQISSRRGEGSSRSTSRRWGGEFRRTEGEERVSEGSFWHQNMEGGGVNSMHGIDNGGMSRRKRVMVVVDHTSQSNHATMWALTHLANKGDVLTLLHVITNSS
TDSSSAADSASSFCASSLGSLCKASRPEVEVEVLVIEGPKLATVMNQVKKLEVSVLVVGQRRPSLFSCFCGSGGAGDLVEQCINNAECLTIGVRKQSRDMGGYVINTRWQ
KNFWLLA