; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021045 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021045
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUsp domain-containing protein
Genome locationchr7:4213723..4215290
RNA-Seq ExpressionLag0021045
SyntenyLag0021045
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR006016 - UspA
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597331.1 hypothetical protein SDJN03_10511, partial [Cucurbita argyrosperma subsp. sororia]9.3e-8682.11Show/hide
Query:  MGGGRNITNTT--STTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGGA
        MGGGR  TNTT  S T PKKVMVVVDPTRESAAALQY+LSHAVID DEV+LLHVDNPN+WKNAITTFLKRPNGG AN+HA AT T  AAS   G GGGG 
Subjt:  MGGGRNITNTT--STTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGGA

Query:  AEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQ
         +VDFL+EMKK   VARP+LKVRT RVELEGKDKAAMIMAQTK+L IDLLVIGQRRSLSTAILGY+R+G     AKMLDTAEYLIENS CTCVAVQKKGQ
Subjt:  AEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQ

Query:  NAGYLLNTKTHRNFWLLA
        NAGYLLNTKTHRNFWLLA
Subjt:  NAGYLLNTKTHRNFWLLA

XP_004134033.1 uncharacterized protein LOC101222608 [Cucumis sativus]2.2e-8781Show/hide
Query:  MGGGRNITNTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNPNSWKNAITTFLKRPNGGSA-----NAHAPATATATAASDGGGGGG
        MGGGR     TST   +KVMVVVDPTRESAAALQYALSHA++DND+V+LLH+DNPNSW+NAI+TFLKRPNGG +     N +  A ATATAASD GG GG
Subjt:  MGGGRNITNTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNPNSWKNAITTFLKRPNGGSA-----NAHAPATATATAASDGGGGGG

Query:  GGAAEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQK
        G  AEVDFLEEMKKA K A PKL+V T+RVELEGKDKA+MIMAQTKSLG+DLLVIGQRRSLSTAILGYRR+GG MKGAKMLDTAEYLIENSKCTCVAVQK
Subjt:  GGAAEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQK

Query:  KGQNAGYLLNTKTHRNFWLLA
        KGQNAGYLLNTKTHRNFWLLA
Subjt:  KGQNAGYLLNTKTHRNFWLLA

XP_008438448.1 PREDICTED: uncharacterized protein LOC103483538 [Cucumis melo]1.7e-8781Show/hide
Query:  MGGGRNITNTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNPNSWKNAITTFLKRPNGGSA-----NAHAPATATATAASDGGGGGG
        MGGGR      ST   +KVMVVVDPTRESAAALQYA+SHAV+DND+V+LLHVDNPNSW+NAI+TFLKRPNGG +     N +  A ATATAASD GG GG
Subjt:  MGGGRNITNTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNPNSWKNAITTFLKRPNGGSA-----NAHAPATATATAASDGGGGGG

Query:  GGAAEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQK
        G  A+VDFLEEMKKA KVA PK+KV T+RVELEGKDKA+MIMAQTKSLG+DLLVIGQRRSLSTAILGYRR+GG MKGAKMLDTAEYLIENSKCTCVAVQK
Subjt:  GGAAEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQK

Query:  KGQNAGYLLNTKTHRNFWLLA
        KGQNAGYLLNTKTHRNFWLLA
Subjt:  KGQNAGYLLNTKTHRNFWLLA

XP_022938646.1 uncharacterized protein LOC111444812 [Cucurbita moschata]4.6e-8581.65Show/hide
Query:  MGGGRNITNTT--STTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGGA
        MGGGR  TNTT  S T PKKVMVVVDPTRESAAALQY+LSHAVID DEV+LLHVDNPN+WKNAITTFLKRPNGG AN+HA AT T  AAS   G GGGG 
Subjt:  MGGGRNITNTT--STTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGGA

Query:  AEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQ
         +VDFL+EMKK   VARP+LKVR  RVELEGKDKAAMIMAQTK+L IDLLVIGQRRSLSTAILGY+R+G     AKMLDTAEYLIENS CTCVAVQKKGQ
Subjt:  AEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQ

Query:  NAGYLLNTKTHRNFWLLA
        NAGYLLNTKTHRNFWLLA
Subjt:  NAGYLLNTKTHRNFWLLA

XP_038881793.1 homeobox protein 5 [Benincasa hispida]2.8e-9082.17Show/hide
Query:  MGGGRNITNTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNPNSWKNAITTFLKRPNGGSA--------------NAHAPATATATA
        MGGGR     TST   +KVMVVVDPTRESAAALQYALSHAVIDND+V+LLHVDNPNSWKNAITTFLKRPNGGSA              NA+A A A ATA
Subjt:  MGGGRNITNTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNPNSWKNAITTFLKRPNGGSA--------------NAHAPATATATA

Query:  ASDGGGGGGGGAAEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENS
        ASD GG GGG  AEVDFLEEMKKA K A PKLKV T+RVELEGKDKA+MIMAQTK+LGIDLLVIGQRRSLSTAILGYRRSGGPMK AKMLDTAEYLIENS
Subjt:  ASDGGGGGGGGAAEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENS

Query:  KCTCVAVQKKGQNAGYLLNTKTHRNFWLLA
        KCTCVAVQKKGQNAGYLLNTKTHRNFWLLA
Subjt:  KCTCVAVQKKGQNAGYLLNTKTHRNFWLLA

TrEMBL top hitse value%identityAlignment
A0A0A0L6Y4 Uncharacterized protein1.1e-8781Show/hide
Query:  MGGGRNITNTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNPNSWKNAITTFLKRPNGGSA-----NAHAPATATATAASDGGGGGG
        MGGGR     TST   +KVMVVVDPTRESAAALQYALSHA++DND+V+LLH+DNPNSW+NAI+TFLKRPNGG +     N +  A ATATAASD GG GG
Subjt:  MGGGRNITNTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNPNSWKNAITTFLKRPNGGSA-----NAHAPATATATAASDGGGGGG

Query:  GGAAEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQK
        G  AEVDFLEEMKKA K A PKL+V T+RVELEGKDKA+MIMAQTKSLG+DLLVIGQRRSLSTAILGYRR+GG MKGAKMLDTAEYLIENSKCTCVAVQK
Subjt:  GGAAEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQK

Query:  KGQNAGYLLNTKTHRNFWLLA
        KGQNAGYLLNTKTHRNFWLLA
Subjt:  KGQNAGYLLNTKTHRNFWLLA

A0A1S3AWE0 uncharacterized protein LOC1034835388.2e-8881Show/hide
Query:  MGGGRNITNTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNPNSWKNAITTFLKRPNGGSA-----NAHAPATATATAASDGGGGGG
        MGGGR      ST   +KVMVVVDPTRESAAALQYA+SHAV+DND+V+LLHVDNPNSW+NAI+TFLKRPNGG +     N +  A ATATAASD GG GG
Subjt:  MGGGRNITNTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNPNSWKNAITTFLKRPNGGSA-----NAHAPATATATAASDGGGGGG

Query:  GGAAEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQK
        G  A+VDFLEEMKKA KVA PK+KV T+RVELEGKDKA+MIMAQTKSLG+DLLVIGQRRSLSTAILGYRR+GG MKGAKMLDTAEYLIENSKCTCVAVQK
Subjt:  GGAAEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQK

Query:  KGQNAGYLLNTKTHRNFWLLA
        KGQNAGYLLNTKTHRNFWLLA
Subjt:  KGQNAGYLLNTKTHRNFWLLA

A0A5A7U1M3 Uncharacterized protein8.2e-8881Show/hide
Query:  MGGGRNITNTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNPNSWKNAITTFLKRPNGGSA-----NAHAPATATATAASDGGGGGG
        MGGGR      ST   +KVMVVVDPTRESAAALQYA+SHAV+DND+V+LLHVDNPNSW+NAI+TFLKRPNGG +     N +  A ATATAASD GG GG
Subjt:  MGGGRNITNTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNPNSWKNAITTFLKRPNGGSA-----NAHAPATATATAASDGGGGGG

Query:  GGAAEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQK
        G  A+VDFLEEMKKA KVA PK+KV T+RVELEGKDKA+MIMAQTKSLG+DLLVIGQRRSLSTAILGYRR+GG MKGAKMLDTAEYLIENSKCTCVAVQK
Subjt:  GGAAEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQK

Query:  KGQNAGYLLNTKTHRNFWLLA
        KGQNAGYLLNTKTHRNFWLLA
Subjt:  KGQNAGYLLNTKTHRNFWLLA

A0A6J1FDQ7 uncharacterized protein LOC1114448122.2e-8581.65Show/hide
Query:  MGGGRNITNTT--STTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGGA
        MGGGR  TNTT  S T PKKVMVVVDPTRESAAALQY+LSHAVID DEV+LLHVDNPN+WKNAITTFLKRPNGG AN+HA AT T  AAS   G GGGG 
Subjt:  MGGGRNITNTT--STTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGGA

Query:  AEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQ
         +VDFL+EMKK   VARP+LKVR  RVELEGKDKAAMIMAQTK+L IDLLVIGQRRSLSTAILGY+R+G     AKMLDTAEYLIENS CTCVAVQKKGQ
Subjt:  AEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQ

Query:  NAGYLLNTKTHRNFWLLA
        NAGYLLNTKTHRNFWLLA
Subjt:  NAGYLLNTKTHRNFWLLA

A0A6J1IE79 uncharacterized protein LOC111473211 isoform X12.5e-8480.73Show/hide
Query:  MGGGRNITNTT--STTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGGA
        MGG R  TNTT  S T PKKVMVVVDPTRESAAALQY+LSHAVID DEV+LLHVDNPNSWKNAITTFLKRPNGG AN+HA AT    AASD    GGGG 
Subjt:  MGGGRNITNTT--STTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGGA

Query:  AEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQ
         +VDFL+EMKK   VARP+LKVRT RVE+EGKDKAAMIMAQTK+L IDLLVIGQRRSLSTAILGY+R+G      KMLDTAEYLIENS CTCVAVQKKGQ
Subjt:  AEVDFLEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQ

Query:  NAGYLLNTKTHRNFWLLA
        NAGYLLNTKTHRNFWLLA
Subjt:  NAGYLLNTKTHRNFWLLA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G69080.1 Adenine nucleotide alpha hydrolases-like superfamily protein6.5e-1327.59Show/hide
Query:  KKVMVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGGAAEVDFLEEMKKASKVARP
        ++++VVVD   E+  AL + LSH     D ++LLH     + ++       +  G   +   P T+ A                   +  +K   ++ RP
Subjt:  KKVMVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGGAAEVDFLEEMKKASKVARP

Query:  KLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLST--AILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQN-AGYLLNTKTHRNFW
          +V+T  V ++G +K   I+ + +     LLV+GQ++  +T   ++ +     P+      D  EY I NS C  +AV+K+G+   GY L TK H++FW
Subjt:  KLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLST--AILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQN-AGYLLNTKTHRNFW

Query:  LLA
        LLA
Subjt:  LLA

AT2G03720.1 Adenine nucleotide alpha hydrolases-like superfamily protein8.0e-1932.66Show/hide
Query:  MVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGGAAEVDFLEEMKKASKVARPKLK
        MVVVD T ++  ALQ+AL+H V D D + LLHV      +    T  +R    ++ AH                        + +  +K   ++ +P +K
Subjt:  MVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGGAAEVDFLEEMKKASKVARPKLK

Query:  VRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLS--TAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYLLNTKTHRNFWLLA
           + VE   ++K   I+ ++K  G  +LV+GQR+  S    I  +R  GG   G       EY I NS C  +AV+KK  N GYL+ TK H++FWLLA
Subjt:  VRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLS--TAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYLLNTKTHRNFWLLA

AT4G13450.1 Adenine nucleotide alpha hydrolases-like superfamily protein5.8e-5451.17Show/hide
Query:  TTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNP-NSWKNAITTFLKRPN--GGSANAHAPATATATAASDGGGGGGG---GAAEVDF
        ++ST   +K+MV+ DPTRESAAALQYALSHAV++ DE++L+H++N   SWKNA ++FL+ P+    S++  +PA+   T AS+          G  + +F
Subjt:  TTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNP-NSWKNAITTFLKRPN--GGSANAHAPATATATAASDGGGGGGG---GAAEVDF

Query:  LEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYL
        LE+MK+  ++A+PK++V T  + ++G  KA  I+     LG+D+++IGQRR++S+++LG RR GG ++G+K +DTAEYLIENSKCTCV V KKGQN GY+
Subjt:  LEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYL

Query:  LNTKTHRNFWLLA
        LNTKTH+NFWLLA
Subjt:  LNTKTHRNFWLLA

AT4G13450.2 Adenine nucleotide alpha hydrolases-like superfamily protein7.2e-2843.12Show/hide
Query:  TTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNP-NSWKNAITTFLKRPN--GGSANAHAPATATATAASDGGGGGGG---GAAEVDF
        ++ST   +K+MV+ DPTRESAAALQYALSHAV++ DE++L+H++N   SWKNA ++FL+ P+    S++  +PA+   T AS+          G  + +F
Subjt:  TTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNP-NSWKNAITTFLKRPN--GGSANAHAPATATATAASDGGGGGGG---GAAEVDF

Query:  LEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGY
        LE+MK+  ++A+PK++V T  + ++G  KA  I+     LG+D+++IGQRR++S+++LGY
Subjt:  LEEMKKASKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGY

AT5G17390.1 Adenine nucleotide alpha hydrolases-like superfamily protein1.7e-1329Show/hide
Query:  KVMVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGGAAEVDFLEEMKKASKVARPK
        +VMVVVD    S  AL++A++H +   D + LL+   P  ++ +     KR N                 +D            + +  +KK  +  RP 
Subjt:  KVMVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGGAAEVDFLEEMKKASKVARPK

Query:  LKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQN-AGYLLNTKTHRNFWLLA
        ++V   R+E + KDK   I+ ++K   + LLV+GQ +      L  R +    +G +     +Y +EN+ C  +AV+ K +   GYL+ TK H+NFWLLA
Subjt:  LKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQN-AGYLLNTKTHRNFWLLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGGAGGGCGGAATATTACCAACACAACGTCCACCACGGCACCAAAGAAGGTCATGGTCGTCGTAGACCCCACCCGAGAGTCCGCCGCCGCGCTGCAGTACGCTCT
TTCGCATGCCGTCATTGATAACGACGAGGTCGTTCTCCTTCATGTCGATAACCCTAATTCTTGGAAGAATGCCATTACTACGTTCCTTAAGAGGCCGAACGGCGGCTCCG
CCAATGCTCATGCACCTGCCACTGCCACTGCCACCGCCGCGTCTGATGGCGGCGGAGGAGGAGGAGGAGGAGCGGCGGAGGTGGATTTTCTTGAGGAGATGAAGAAGGCC
AGCAAGGTTGCTCGTCCGAAACTGAAAGTGCGGACGATGAGGGTTGAATTGGAAGGCAAAGACAAAGCCGCCATGATTATGGCTCAAACCAAGTCTCTCGGTATTGATCT
GCTAGTCATAGGGCAGAGGCGAAGTCTCTCCACAGCAATCTTAGGATATAGACGGTCAGGAGGGCCAATGAAAGGGGCAAAAATGTTGGACACAGCAGAGTATTTGATTG
AGAACAGCAAATGCACTTGTGTTGCTGTACAAAAGAAAGGTCAAAATGCAGGCTATCTTTTGAACACCAAAACCCACAGAAACTTCTGGCTCTTGGCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGGAGGGCGGAATATTACCAACACAACGTCCACCACGGCACCAAAGAAGGTCATGGTCGTCGTAGACCCCACCCGAGAGTCCGCCGCCGCGCTGCAGTACGCTCT
TTCGCATGCCGTCATTGATAACGACGAGGTCGTTCTCCTTCATGTCGATAACCCTAATTCTTGGAAGAATGCCATTACTACGTTCCTTAAGAGGCCGAACGGCGGCTCCG
CCAATGCTCATGCACCTGCCACTGCCACTGCCACCGCCGCGTCTGATGGCGGCGGAGGAGGAGGAGGAGGAGCGGCGGAGGTGGATTTTCTTGAGGAGATGAAGAAGGCC
AGCAAGGTTGCTCGTCCGAAACTGAAAGTGCGGACGATGAGGGTTGAATTGGAAGGCAAAGACAAAGCCGCCATGATTATGGCTCAAACCAAGTCTCTCGGTATTGATCT
GCTAGTCATAGGGCAGAGGCGAAGTCTCTCCACAGCAATCTTAGGATATAGACGGTCAGGAGGGCCAATGAAAGGGGCAAAAATGTTGGACACAGCAGAGTATTTGATTG
AGAACAGCAAATGCACTTGTGTTGCTGTACAAAAGAAAGGTCAAAATGCAGGCTATCTTTTGAACACCAAAACCCACAGAAACTTCTGGCTCTTGGCATGA
Protein sequenceShow/hide protein sequence
MGGGRNITNTTSTTAPKKVMVVVDPTRESAAALQYALSHAVIDNDEVVLLHVDNPNSWKNAITTFLKRPNGGSANAHAPATATATAASDGGGGGGGGAAEVDFLEEMKKA
SKVARPKLKVRTMRVELEGKDKAAMIMAQTKSLGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYLLNTKTHRNFWLLA