; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg031547 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg031547
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUsp domain-containing protein
Genome locationscaffold11:40985396..40988518
RNA-Seq ExpressionSpg031547
SyntenySpg031547
Gene Ontology termsNA
InterPro domainsIPR006016 - UspA
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597331.1 hypothetical protein SDJN03_10511, partial [Cucurbita argyrosperma subsp. sororia]9.2e-8682.49Show/hide
Query:  MGGGR---NITTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGAA
        MGGGR   N TT S T PKKVMVVVDPTRESAAALQY+LSHAVID DEVILLHVDNPN+WKNAITTFLKRPNGG AN+HA AT T  AAS  G GGGG  
Subjt:  MGGGR---NITTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGAA

Query:  EVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQN
        +VDFL+EMKK   VARP+LKVRT RVELEGKDKAAMIMAQTK+L IDLLVIGQRRSLSTAILGY+R+G     AKMLDTAEYLIENS CTCVAVQKKGQN
Subjt:  EVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQN

Query:  AGYLLNTKTHRNFWLLA
        AGYLLNTKTHRNFWLLA
Subjt:  AGYLLNTKTHRNFWLLA

XP_004134033.1 uncharacterized protein LOC101222608 [Cucumis sativus]4.0e-8982.19Show/hide
Query:  MGGGRNITTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWKNAITTFLKRPNGGSA-----NAHAPATATATAASDGGGGGGG
        MGGGR    TST   +KVMVVVDPTRESAAALQYALSHA++DND+VILLH+DNPNSW+NAI+TFLKRPNGG +     N +  A ATATAASDGG GGG 
Subjt:  MGGGRNITTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWKNAITTFLKRPNGGSA-----NAHAPATATATAASDGGGGGGG

Query:  AAEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKG
         AEVDFLEEMKKA K A PKL+V T+RVELEGKDKA+MIMAQTKSLG+DLLVIGQRRSLSTAILGYRR+GG MKGAKMLDTAEYLIENSKCTCVAVQKKG
Subjt:  AAEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKG

Query:  QNAGYLLNTKTHRNFWLLA
        QNAGYLLNTKTHRNFWLLA
Subjt:  QNAGYLLNTKTHRNFWLLA

XP_008438448.1 PREDICTED: uncharacterized protein LOC103483538 [Cucumis melo]3.1e-8982.19Show/hide
Query:  MGGGRNITTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWKNAITTFLKRPNGGSA-----NAHAPATATATAASDGGGGGGG
        MGGGR     ST   +KVMVVVDPTRESAAALQYA+SHAV+DND+VILLHVDNPNSW+NAI+TFLKRPNGG +     N +  A ATATAASDGG GGG 
Subjt:  MGGGRNITTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWKNAITTFLKRPNGGSA-----NAHAPATATATAASDGGGGGGG

Query:  AAEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKG
         A+VDFLEEMKKA KVA PK+KV T+RVELEGKDKA+MIMAQTKSLG+DLLVIGQRRSLSTAILGYRR+GG MKGAKMLDTAEYLIENSKCTCVAVQKKG
Subjt:  AAEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKG

Query:  QNAGYLLNTKTHRNFWLLA
        QNAGYLLNTKTHRNFWLLA
Subjt:  QNAGYLLNTKTHRNFWLLA

XP_022938646.1 uncharacterized protein LOC111444812 [Cucurbita moschata]4.6e-8582.03Show/hide
Query:  MGGGR---NITTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGAA
        MGGGR   N TT S T PKKVMVVVDPTRESAAALQY+LSHAVID DEVILLHVDNPN+WKNAITTFLKRPNGG AN+HA AT T  AAS  G GGGG  
Subjt:  MGGGR---NITTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGAA

Query:  EVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQN
        +VDFL+EMKK   VARP+LKVR  RVELEGKDKAAMIMAQTK+L IDLLVIGQRRSLSTAILGY+R+G     AKMLDTAEYLIENS CTCVAVQKKGQN
Subjt:  EVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQN

Query:  AGYLLNTKTHRNFWLLA
        AGYLLNTKTHRNFWLLA
Subjt:  AGYLLNTKTHRNFWLLA

XP_038881793.1 homeobox protein 5 [Benincasa hispida]5.0e-9283.33Show/hide
Query:  MGGGRNITTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWKNAITTFLKRPNGGSA--------------NAHAPATATATAA
        MGGGR    TST   +KVMVVVDPTRESAAALQYALSHAVIDND+VILLHVDNPNSWKNAITTFLKRPNGGSA              NA+A A A ATAA
Subjt:  MGGGRNITTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWKNAITTFLKRPNGGSA--------------NAHAPATATATAA

Query:  SDGGGGGGGAAEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKC
        SDGG GGG  AEVDFLEEMKKA K A PKLKV T+RVELEGKDKA+MIMAQTK+LGIDLLVIGQRRSLSTAILGYRRSGGPMK AKMLDTAEYLIENSKC
Subjt:  SDGGGGGGGAAEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKC

Query:  TCVAVQKKGQNAGYLLNTKTHRNFWLLA
        TCVAVQKKGQNAGYLLNTKTHRNFWLLA
Subjt:  TCVAVQKKGQNAGYLLNTKTHRNFWLLA

TrEMBL top hitse value%identityAlignment
A0A0A0L6Y4 Uncharacterized protein1.9e-8982.19Show/hide
Query:  MGGGRNITTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWKNAITTFLKRPNGGSA-----NAHAPATATATAASDGGGGGGG
        MGGGR    TST   +KVMVVVDPTRESAAALQYALSHA++DND+VILLH+DNPNSW+NAI+TFLKRPNGG +     N +  A ATATAASDGG GGG 
Subjt:  MGGGRNITTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWKNAITTFLKRPNGGSA-----NAHAPATATATAASDGGGGGGG

Query:  AAEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKG
         AEVDFLEEMKKA K A PKL+V T+RVELEGKDKA+MIMAQTKSLG+DLLVIGQRRSLSTAILGYRR+GG MKGAKMLDTAEYLIENSKCTCVAVQKKG
Subjt:  AAEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKG

Query:  QNAGYLLNTKTHRNFWLLA
        QNAGYLLNTKTHRNFWLLA
Subjt:  QNAGYLLNTKTHRNFWLLA

A0A1S3AWE0 uncharacterized protein LOC1034835381.5e-8982.19Show/hide
Query:  MGGGRNITTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWKNAITTFLKRPNGGSA-----NAHAPATATATAASDGGGGGGG
        MGGGR     ST   +KVMVVVDPTRESAAALQYA+SHAV+DND+VILLHVDNPNSW+NAI+TFLKRPNGG +     N +  A ATATAASDGG GGG 
Subjt:  MGGGRNITTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWKNAITTFLKRPNGGSA-----NAHAPATATATAASDGGGGGGG

Query:  AAEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKG
         A+VDFLEEMKKA KVA PK+KV T+RVELEGKDKA+MIMAQTKSLG+DLLVIGQRRSLSTAILGYRR+GG MKGAKMLDTAEYLIENSKCTCVAVQKKG
Subjt:  AAEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKG

Query:  QNAGYLLNTKTHRNFWLLA
        QNAGYLLNTKTHRNFWLLA
Subjt:  QNAGYLLNTKTHRNFWLLA

A0A5A7U1M3 Uncharacterized protein1.5e-8982.19Show/hide
Query:  MGGGRNITTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWKNAITTFLKRPNGGSA-----NAHAPATATATAASDGGGGGGG
        MGGGR     ST   +KVMVVVDPTRESAAALQYA+SHAV+DND+VILLHVDNPNSW+NAI+TFLKRPNGG +     N +  A ATATAASDGG GGG 
Subjt:  MGGGRNITTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWKNAITTFLKRPNGGSA-----NAHAPATATATAASDGGGGGGG

Query:  AAEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKG
         A+VDFLEEMKKA KVA PK+KV T+RVELEGKDKA+MIMAQTKSLG+DLLVIGQRRSLSTAILGYRR+GG MKGAKMLDTAEYLIENSKCTCVAVQKKG
Subjt:  AAEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKG

Query:  QNAGYLLNTKTHRNFWLLA
        QNAGYLLNTKTHRNFWLLA
Subjt:  QNAGYLLNTKTHRNFWLLA

A0A6J1FDQ7 uncharacterized protein LOC1114448122.2e-8582.03Show/hide
Query:  MGGGR---NITTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGAA
        MGGGR   N TT S T PKKVMVVVDPTRESAAALQY+LSHAVID DEVILLHVDNPN+WKNAITTFLKRPNGG AN+HA AT T  AAS  G GGGG  
Subjt:  MGGGR---NITTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGAA

Query:  EVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQN
        +VDFL+EMKK   VARP+LKVR  RVELEGKDKAAMIMAQTK+L IDLLVIGQRRSLSTAILGY+R+G     AKMLDTAEYLIENS CTCVAVQKKGQN
Subjt:  EVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQN

Query:  AGYLLNTKTHRNFWLLA
        AGYLLNTKTHRNFWLLA
Subjt:  AGYLLNTKTHRNFWLLA

A0A6J1IE79 uncharacterized protein LOC111473211 isoform X12.4e-8481.11Show/hide
Query:  MGGGR---NITTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGAA
        MGG R   N TT S T PKKVMVVVDPTRESAAALQY+LSHAVID DEVILLHVDNPNSWKNAITTFLKRPNGG AN+HA AT    AASD   GGGG  
Subjt:  MGGGR---NITTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGAA

Query:  EVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQN
        +VDFL+EMKK   VARP+LKVRT RVE+EGKDKAAMIMAQTK+L IDLLVIGQRRSLSTAILGY+R+G      KMLDTAEYLIENS CTCVAVQKKGQN
Subjt:  EVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQN

Query:  AGYLLNTKTHRNFWLLA
        AGYLLNTKTHRNFWLLA
Subjt:  AGYLLNTKTHRNFWLLA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G69080.1 Adenine nucleotide alpha hydrolases-like superfamily protein5.0e-1327.62Show/hide
Query:  TTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGAAEVDFLEEMKK
        TT+     ++++VVVD   E+  AL + LSH     D ++LLH     + ++       +  G   +   P T+ A                  +  +K 
Subjt:  TTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGAAEVDFLEEMKK

Query:  ASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLST--AILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQN-AGYLLNT
          ++ RP  +V+T  V ++G +K   I+ + +     LLV+GQ++  +T   ++ +     P+      D  EY I NS C  +AV+K+G+   GY L T
Subjt:  ASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLST--AILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQN-AGYLLNT

Query:  KTHRNFWLLA
        K H++FWLLA
Subjt:  KTHRNFWLLA

AT2G03720.1 Adenine nucleotide alpha hydrolases-like superfamily protein7.9e-1932.83Show/hide
Query:  MVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGAAEVDFLEEMKKASKVARPKLKV
        MVVVD T ++  ALQ+AL+H V D D + LLHV      +    T  +R    ++ AH                       + +  +K   ++ +P +K 
Subjt:  MVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGAAEVDFLEEMKKASKVARPKLKV

Query:  RTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLS--TAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYLLNTKTHRNFWLLA
          + VE   ++K   I+ ++K  G  +LV+GQR+  S    I  +R  GG   G       EY I NS C  +AV+KK  N GYL+ TK H++FWLLA
Subjt:  RTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLS--TAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYLLNTKTHRNFWLLA

AT4G13450.1 Adenine nucleotide alpha hydrolases-like superfamily protein2.6e-5451.64Show/hide
Query:  TTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNP-NSWKNAITTFLKRP------NGGSANAHAPATATATAASDGGGGGGGAAEVDF
        ++ST   +K+MV+ DPTRESAAALQYALSHAV++ DE+IL+H++N   SWKNA ++FL+ P      + GS+ A    T  + AA++      G  + +F
Subjt:  TTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNP-NSWKNAITTFLKRP------NGGSANAHAPATATATAASDGGGGGGGAAEVDF

Query:  LEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYL
        LE+MK+  ++A+PK++V T  + ++G  KA  I+     LG+D+++IGQRR++S+++LG RR GG ++G+K +DTAEYLIENSKCTCV V KKGQN GY+
Subjt:  LEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYL

Query:  LNTKTHRNFWLLA
        LNTKTH+NFWLLA
Subjt:  LNTKTHRNFWLLA

AT4G13450.2 Adenine nucleotide alpha hydrolases-like superfamily protein3.2e-2843.75Show/hide
Query:  TTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNP-NSWKNAITTFLKRP------NGGSANAHAPATATATAASDGGGGGGGAAEVDF
        ++ST   +K+MV+ DPTRESAAALQYALSHAV++ DE+IL+H++N   SWKNA ++FL+ P      + GS+ A    T  + AA++      G  + +F
Subjt:  TTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNP-NSWKNAITTFLKRP------NGGSANAHAPATATATAASDGGGGGGGAAEVDF

Query:  LEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGY
        LE+MK+  ++A+PK++V T  + ++G  KA  I+     LG+D+++IGQRR++S+++LGY
Subjt:  LEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGY

AT5G17390.1 Adenine nucleotide alpha hydrolases-like superfamily protein1.0e-1329.15Show/hide
Query:  KVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGAAEVDFLEEMKKASKVARPKL
        +VMVVVD    S  AL++A++H +   D + LL+   P  ++ +     KR N                 +D           + +  +KK  +  RP +
Subjt:  KVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGAAEVDFLEEMKKASKVARPKL

Query:  KVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQN-AGYLLNTKTHRNFWLLA
        +V   R+E + KDK   I+ ++K   + LLV+GQ +      L  R +    +G +     +Y +EN+ C  +AV+ K +   GYL+ TK H+NFWLLA
Subjt:  KVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQN-AGYLLNTKTHRNFWLLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGGAGGGCGGAATATTACCACAACGTCCACCACGGCACCAAAGAAGGTCATGGTCGTCGTAGACCCCACCCGAGAGTCCGCCGCCGCGCTGCAGTACGCGCTTTC
GCATGCCGTCATTGATAACGACGAGGTCATTCTCCTTCATGTCGATAACCCTAATTCTTGGAAGAATGCCATTACTACGTTCCTTAAGAGGCCGAACGGCGGCTCCGCCA
ATGCTCATGCACCTGCCACTGCCACTGCCACCGCCGCGTCTGATGGCGGCGGAGGAGGAGGAGGAGCGGCGGAGGTGGATTTTCTTGAGGAGATGAAGAAGGCCAGCAAG
GTTGCTCGTCCGAAACTGAAAGTGCGGACGATGAGGGTTGAATTGGAAGGCAAAGACAAAGCCGCCATGATTATGGCTCAAACCAAGTCTCTCGGTATTGATCTGCTAGT
CATAGGGCAGAGGCGAAGTCTCTCCACAGCAATCTTAGGATATAGACGGTCAGGAGGGCCAATGAAAGGGGCAAAAATGTTGGACACAGCAGAGTATTTGATTGAGAACA
GCAAATGCACTTGTGTTGCTGTACAAAAGAAAGGTCAAAATGCAGGCTATCTTTTGAACACCAAAACCCACAGAAACTTCTGGCTCTTGGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGGAGGGCGGAATATTACCACAACGTCCACCACGGCACCAAAGAAGGTCATGGTCGTCGTAGACCCCACCCGAGAGTCCGCCGCCGCGCTGCAGTACGCGCTTTC
GCATGCCGTCATTGATAACGACGAGGTCATTCTCCTTCATGTCGATAACCCTAATTCTTGGAAGAATGCCATTACTACGTTCCTTAAGAGGCCGAACGGCGGCTCCGCCA
ATGCTCATGCACCTGCCACTGCCACTGCCACCGCCGCGTCTGATGGCGGCGGAGGAGGAGGAGGAGCGGCGGAGGTGGATTTTCTTGAGGAGATGAAGAAGGCCAGCAAG
GTTGCTCGTCCGAAACTGAAAGTGCGGACGATGAGGGTTGAATTGGAAGGCAAAGACAAAGCCGCCATGATTATGGCTCAAACCAAGTCTCTCGGTATTGATCTGCTAGT
CATAGGGCAGAGGCGAAGTCTCTCCACAGCAATCTTAGGATATAGACGGTCAGGAGGGCCAATGAAAGGGGCAAAAATGTTGGACACAGCAGAGTATTTGATTGAGAACA
GCAAATGCACTTGTGTTGCTGTACAAAAGAAAGGTCAAAATGCAGGCTATCTTTTGAACACCAAAACCCACAGAAACTTCTGGCTCTTGGCTTGA
Protein sequenceShow/hide protein sequence
MGGGRNITTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGAAEVDFLEEMKKASK
VARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYLLNTKTHRNFWLLA