CuGenDBv2

Lag0014423 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0014423
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Filamentous hemagglutinin transporter
Genome location: chr12:608393..609207
RNA-Seq Expression: Lag0014423
Synteny: Lag0014423
Gene Ontology terms: NA
InterPro domains: NA
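
The genome location in this record uses a `seqid:start..end` form. A minimal Python sketch for splitting it into its parts follows. It assumes the coordinates are 1-based and inclusive, which the record itself does not state, and `parse_location` is a hypothetical helper name, not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)

seqid, start, end = parse_location("chr12:608393..609207")
# Span length under the assumed 1-based, inclusive convention.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 815 bp on chr12.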


Homology
GenBank top hits: e-value, % identity, alignment
KAG7012065.1 hypothetical protein SDJN02_26973, partial [Cucurbita argyrosperma subsp. argyrosperma] (e-value: 3.7e-99, identity: 81.13%)
Query:  MAADVTSLVRVLAG-----------DDNRTALGNRQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQ
        MAA V++L+RVLAG           +DNRT LGN QE T LVTRDLLGQSSNLA SQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSK HQ
Subjt:  MAADVTSLVRVLAG-----------DDNRTALGNRQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQ

Query:  MMINQTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTAASPSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAK----
         M NQTESKFQDLNFPPSPSKRTLNLFNETSLDLKLT +SPSPSPS S+NYASVCTLDKVKSALERADKELVKKRS+LWKS+SSPSYSSSSS+AAK    
Subjt:  MMINQTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTAASPSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAK----

Query:  EEEAAEIRN-------AAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI
        EEEA E RN        AAPMAVGC GCLSYVLV KNNPRCPRCNSVVPL S +KKPRIDLNMSI
Subjt:  EEEAAEIRN-------AAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI

XP_004139833.1 uncharacterized protein LOC101214550 [Cucumis sativus] (e-value: 1.3e-104, identity: 87.25%)
Query:  MAADVTSLVRVLAG---DDNRTALGNRQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMMINQTES
        MAA+V+SL+RVLAG   DDNRTALGN Q+STALVTRDLLGQSSNL DSQELDLDLQVP+GWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQ Q MINQTES
Subjt:  MAADVTSLVRVLAG---DDNRTALGNRQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMMINQTES

Query:  KFQDLNFPPSPSKRTLNLFNETSLDLKLTAASPSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAA-----KEEEAAEI
        KFQDLNFPPSPSKRTLNLFNETSLDLKLT      S  SS+NYASVCTLDKVKSALERADKELVKKRS+LWKS+SSPSYSSSSSSAA     +EEEAAEI
Subjt:  KFQDLNFPPSPSKRTLNLFNETSLDLKLTAASPSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAA-----KEEEAAEI

Query:  RNAAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI
        RN+AAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLP+ IKKPRIDLNMSI
Subjt:  RNAAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI

XP_008447115.1 PREDICTED: uncharacterized protein LOC103489640 [Cucumis melo] (e-value: 6.9e-106, identity: 88%)
Query:  MAADVTSLVRVLAG---DDNRTALGNRQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMMINQTES
        MAA+V+SL+RVLAG   DDNRTALGN Q+STALVTRDLLGQSSNL DSQELDLDLQVP+GWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQ Q MINQTES
Subjt:  MAADVTSLVRVLAG---DDNRTALGNRQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMMINQTES

Query:  KFQDLNFPPSPSKRTLNLFNETSLDLKLTAASPSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAA----KEEEAAEIR
        KFQDLNFPPSPSKRTLNLFNETSLDLKLT+     SPS S+NYASVCTLDKVKSALERADKELVKKRS+LWKS+SSPSYSSSSSSAA    +EEEAAEIR
Subjt:  KFQDLNFPPSPSKRTLNLFNETSLDLKLTAASPSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAA----KEEEAAEIR

Query:  NAAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI
        N+AAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLP+ IKKPRIDLNMSI
Subjt:  NAAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI

XP_022952111.1 uncharacterized protein LOC111454876 [Cucurbita moschata] (e-value: 7.4e-100, identity: 83.01%)
Query:  MAADVTSLVRVLAG--------DDNRTALGNRQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMMI
        MAA V++L+RVLAG        +DNRT LGN QE T LVTRDLLGQSSNLA SQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSK HQ M 
Subjt:  MAADVTSLVRVLAG--------DDNRTALGNRQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMMI

Query:  NQTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTAASPSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAK----EEE
        NQTESKFQDLNFPPSPSKRTLNLFNETSLDLKLT +SPSPSPS S+NYASVCTLDKVKSALERADKELVKKRS+LWKS+SSPSYSSSSS+AAK    EEE
Subjt:  NQTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTAASPSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAK----EEE

Query:  AAEIRN----AAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI
        A E RN     AAPMAVGC GCLSYVLV KNNPRCPRCNSVVPL S +KKPRIDLNMSI
Subjt:  AAEIRN----AAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI

XP_022969452.1 uncharacterized protein LOC111468450 [Cucurbita maxima] (e-value: 2.3e-101, identity: 85.77%)
Query:  MAADVTSLVRVLAG-----DDNRTALGNRQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMMINQT
        MAA V+SL+RVLAG     +DNRT LGN QE T LVTRDLLGQSSNLA SQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQ M NQT
Subjt:  MAADVTSLVRVLAG-----DDNRTALGNRQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMMINQT

Query:  ESKFQDLNFPPSPSKRTLNLFNETSLDLKLTAASPSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAK----EEEAAE
        ESKFQDLNFPPSPSKRTLNLFNETSLDLKLT++SPS SPS S+NYASVCTLDKVKSALERADKELVKKRS+LWKS+SSPSYSSSSS+AAK    EEEA E
Subjt:  ESKFQDLNFPPSPSKRTLNLFNETSLDLKLTAASPSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAK----EEEAAE

Query:  IRN-AAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI
         RN AAAPMAVGC GCLSYVLV KNNPRCPRCNSVVPL S +KKPRIDLNMSI
Subjt:  IRN-AAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI

TrEMBL top hits: e-value, % identity, alignment
A0A0A0K3J9 Uncharacterized protein (e-value: 6.3e-105, identity: 87.25%)
Query:  MAADVTSLVRVLAG---DDNRTALGNRQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMMINQTES
        MAA+V+SL+RVLAG   DDNRTALGN Q+STALVTRDLLGQSSNL DSQELDLDLQVP+GWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQ Q MINQTES
Subjt:  MAADVTSLVRVLAG---DDNRTALGNRQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMMINQTES

Query:  KFQDLNFPPSPSKRTLNLFNETSLDLKLTAASPSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAA-----KEEEAAEI
        KFQDLNFPPSPSKRTLNLFNETSLDLKLT      S  SS+NYASVCTLDKVKSALERADKELVKKRS+LWKS+SSPSYSSSSSSAA     +EEEAAEI
Subjt:  KFQDLNFPPSPSKRTLNLFNETSLDLKLTAASPSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAA-----KEEEAAEI

Query:  RNAAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI
        RN+AAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLP+ IKKPRIDLNMSI
Subjt:  RNAAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI

A0A1S3BGM6 uncharacterized protein LOC103489640 (e-value: 3.4e-106, identity: 88%)
Query:  MAADVTSLVRVLAG---DDNRTALGNRQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMMINQTES
        MAA+V+SL+RVLAG   DDNRTALGN Q+STALVTRDLLGQSSNL DSQELDLDLQVP+GWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQ Q MINQTES
Subjt:  MAADVTSLVRVLAG---DDNRTALGNRQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMMINQTES

Query:  KFQDLNFPPSPSKRTLNLFNETSLDLKLTAASPSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAA----KEEEAAEIR
        KFQDLNFPPSPSKRTLNLFNETSLDLKLT+     SPS S+NYASVCTLDKVKSALERADKELVKKRS+LWKS+SSPSYSSSSSSAA    +EEEAAEIR
Subjt:  KFQDLNFPPSPSKRTLNLFNETSLDLKLTAASPSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAA----KEEEAAEIR

Query:  NAAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI
        N+AAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLP+ IKKPRIDLNMSI
Subjt:  NAAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI

A0A6J1EXD9 uncharacterized protein LOC111437042 (e-value: 9.5e-93, identity: 81.3%)
Query:  MAADVTSLVRVLAG---DDNRTALGNRQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMMINQTES
        MAA V+SL+RVLAG    DNR AL   +ESTAL TRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQ QM  NQTES
Subjt:  MAADVTSLVRVLAG---DDNRTALGNRQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMMINQTES

Query:  KFQDLNFPPSPSKRTLNLFNETSLDLKLTAASPSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAKEEEAAEIRNAAA
        K QDLNFPPSPSKRTLNLFNETSLDL LT         SS+NYASVCTLDKVKSALERADKEL+KKRSTLWKS SSP    S++   +EEEAAE R +AA
Subjt:  KFQDLNFPPSPSKRTLNLFNETSLDLKLTAASPSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAKEEEAAEIRNAAA

Query:  PMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI
        PMAVGCPGCLSYVLV KNNPRCPRCNSVVPLPS IKKPRIDLN+SI
Subjt:  PMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI

A0A6J1GJC7 uncharacterized protein LOC111454876 (e-value: 3.6e-100, identity: 83.01%)
Query:  MAADVTSLVRVLAG--------DDNRTALGNRQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMMI
        MAA V++L+RVLAG        +DNRT LGN QE T LVTRDLLGQSSNLA SQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSK HQ M 
Subjt:  MAADVTSLVRVLAG--------DDNRTALGNRQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMMI

Query:  NQTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTAASPSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAK----EEE
        NQTESKFQDLNFPPSPSKRTLNLFNETSLDLKLT +SPSPSPS S+NYASVCTLDKVKSALERADKELVKKRS+LWKS+SSPSYSSSSS+AAK    EEE
Subjt:  NQTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTAASPSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAK----EEE

Query:  AAEIRN----AAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI
        A E RN     AAPMAVGC GCLSYVLV KNNPRCPRCNSVVPL S +KKPRIDLNMSI
Subjt:  AAEIRN----AAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI

A0A6J1HWE1 uncharacterized protein LOC111468450 (e-value: 1.1e-101, identity: 85.77%)
Query:  MAADVTSLVRVLAG-----DDNRTALGNRQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMMINQT
        MAA V+SL+RVLAG     +DNRT LGN QE T LVTRDLLGQSSNLA SQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQ M NQT
Subjt:  MAADVTSLVRVLAG-----DDNRTALGNRQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMMINQT

Query:  ESKFQDLNFPPSPSKRTLNLFNETSLDLKLTAASPSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAK----EEEAAE
        ESKFQDLNFPPSPSKRTLNLFNETSLDLKLT++SPS SPS S+NYASVCTLDKVKSALERADKELVKKRS+LWKS+SSPSYSSSSS+AAK    EEEA E
Subjt:  ESKFQDLNFPPSPSKRTLNLFNETSLDLKLTAASPSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAK----EEEAAE

Query:  IRN-AAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI
         RN AAAPMAVGC GCLSYVLV KNNPRCPRCNSVVPL S +KKPRIDLNMSI
Subjt:  IRN-AAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI

SwissProt top hits: e-value, % identity, alignment
No hits found
Arabidopsis top hits: e-value, % identity, alignment
AT1G16500.1 unknown protein (e-value: 1.1e-45, identity: 47.08%)
Query:  MAADVTSLVRVLAG-DDNRTALGNR---QESTALVTRDLLGQSSNLA-------DSQELDLDLQVPSGWEKRLDLKSGKVYIQR---SQTPDSPLNSDSK
        MAADV+SLVR+L+   D+RT + +    + + AL+TRDLLG    +         S ELDLD+QVP+GWEKRLDLKSGKVY+Q+   S +  S  +    
Subjt:  MAADVTSLVRVLAG-DDNRTALGNR---QESTALVTRDLLGQSSNLA-------DSQELDLDLQVPSGWEKRLDLKSGKVYIQR---SQTPDSPLNSDSK

Query:  QHQMMINQTESKFQDLNFPP----SPSKRTLNLF---NETSLDLKLTAAS---PSPSPSSS-------SNYASVCTLDKVKSALERADKELVKKRSTLWK
         H+   NQT  +FQDLN PP     P+K  L+LF   ++TSL+LKL  +S   P P P SS       S  +SVCTLDKVK ALERA+K+  K++S    
Subjt:  QHQMMINQTESKFQDLNFPP----SPSKRTLNLF---NETSLDLKLTAAS---PSPSPSSS-------SNYASVCTLDKVKSALERADKELVKKRSTLWK

Query:  SSSSPSYSSSSSSAAKEEEAAEIRNAAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI
              Y  ++S+            AA+ +A GCPGCLSYV V KNNP+CPRC+S VPLP+ +KKP+IDLN+S+
Subjt:  SSSSPSYSSSSSSAAKEEEAAEIRNAAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI

AT1G79160.1 unknown protein (e-value: 9.2e-48, identity: 50.2%)
Query:  MAADVTSLVRVLAG-DDNRTAL---GNRQESTALVTRDLLGQSSNLAD--SQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMMINQ
        MAADV+SLVR+L+G  D+R  +      + S AL+TRDLLG         S ELDLDLQVP+G+EKRLDLKSGKVY+QR  +  S   +++ Q     NQ
Subjt:  MAADVTSLVRVLAG-DDNRTAL---GNRQESTALVTRDLLGQSSNLAD--SQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMMINQ

Query:  TESKFQDLNFPPSPSKRT--LNLFNETSLDLKLTAASPSPSPSSSSNYASVCTLDKVKSALERADKE--LVKKRSTLWKSSSSPSYSSSSSSAAKEEEAA
        T   FQDLNFPP     +  LNLF++T+ +LKL  +S S  P ++SN  SVCTLDKVKSALERA+++  + KKR    +S     Y    + A       
Subjt:  TESKFQDLNFPPSPSKRT--LNLFNETSLDLKLTAASPSPSPSSSSNYASVCTLDKVKSALERADKE--LVKKRSTLWKSSSSPSYSSSSSSAAKEEEAA

Query:  EIRNAAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPS--IIKKPRIDLNMSI
             A+P+  GCPGCLSYVLVM NNP+CPRC+++VPLP+  + KKP+IDLN+SI
Subjt:  EIRNAAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPS--IIKKPRIDLNMSI

AT5G22270.1 unknown protein (e-value: 7.4e-05, identity: 46.55%)
Query:  SSPSYSSSSSSAAKEEEAAEIRNAAAPMAVGCPGCLSYVLV-MKNNPRCPRCNSVVPL
        SS    SSSSS   EE       A + + VGCP C+ Y++  ++N+PRCPRCNS V L
Subjt:  SSPSYSSSSSSAAKEEEAAEIRNAAAPMAVGCPGCLSYVLV-MKNNPRCPRCNSVVPL
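
The percent identity reported for a hit can be recomputed from the alignment rows. The sketch below assumes the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and divides identical columns by the total number of alignment columns, gaps included. `percent_identity` is a hypothetical helper name; the two strings are the Query and midline rows of the AT5G22270.1 alignment.

```python
def percent_identity(query_row, midline_row):
    """Identical columns divided by alignment columns, as a percentage."""
    columns = len(query_row)  # gap characters ('-') count as columns
    # In a BLAST midline only identical positions are shown as letters.
    identical = sum(1 for ch in midline_row if ch.isalpha())
    return 100.0 * identical / columns

query = "SSPSYSSSSSSAAKEEEAAEIRNAAAPMAVGCPGCLSYVLV-MKNNPRCPRCNSVVPL"
midline = "SS    SSSSS   EE       A + + VGCP C+ Y++  ++N+PRCPRCNS V L"
print(round(percent_identity(query, midline), 2))
```

For this alignment the midline has 27 identical positions over 58 columns, which gives the 46.55 listed for AT5G22270.1.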


Sequences
CDS sequence
ATGGCCGCCGATGTTACCAGTCTGGTTCGAGTTCTCGCCGGCGACGATAATCGGACGGCCCTCGGTAATCGCCAGGAATCAACGGCTCTCGTTACTCGCGATTTGCTCGG
TCAATCCTCCAACCTCGCCGACTCTCAGGAACTCGACCTCGATTTGCAGGTCCCCTCCGGCTGGGAAAAACGCCTCGACTTGAAGTCGGGAAAAGTTTACATACAGAGAA
GTCAAACGCCAGATTCTCCGCTGAATTCAGATTCAAAGCAACACCAAATGATGATCAATCAAACGGAATCCAAATTCCAGGACTTGAATTTCCCTCCATCTCCGTCTAAA
CGGACGTTAAATCTCTTCAACGAAACCAGCTTGGATTTGAAACTGACGGCGGCGTCGCCGTCGCCGTCGCCGTCGTCGTCGAGCAATTACGCCAGCGTGTGCACTCTGGA
TAAGGTGAAATCTGCTCTGGAAAGGGCCGACAAGGAGCTGGTTAAGAAACGGTCCACGCTTTGGAAATCGTCGTCGTCGCCGTCGTACTCGTCGTCGTCGTCGTCGGCGG
CGAAGGAAGAAGAGGCGGCGGAGATTAGAAACGCGGCGGCGCCGATGGCGGTGGGTTGTCCTGGATGTTTGTCTTATGTATTGGTAATGAAAAACAACCCCAGATGTCCT
CGTTGCAATTCTGTTGTTCCATTGCCGAGCATCATCAAGAAACCTCGGATTGATCTAAATATGTCGATATGA
mRNA sequence
ATGGCCGCCGATGTTACCAGTCTGGTTCGAGTTCTCGCCGGCGACGATAATCGGACGGCCCTCGGTAATCGCCAGGAATCAACGGCTCTCGTTACTCGCGATTTGCTCGG
TCAATCCTCCAACCTCGCCGACTCTCAGGAACTCGACCTCGATTTGCAGGTCCCCTCCGGCTGGGAAAAACGCCTCGACTTGAAGTCGGGAAAAGTTTACATACAGAGAA
GTCAAACGCCAGATTCTCCGCTGAATTCAGATTCAAAGCAACACCAAATGATGATCAATCAAACGGAATCCAAATTCCAGGACTTGAATTTCCCTCCATCTCCGTCTAAA
CGGACGTTAAATCTCTTCAACGAAACCAGCTTGGATTTGAAACTGACGGCGGCGTCGCCGTCGCCGTCGCCGTCGTCGTCGAGCAATTACGCCAGCGTGTGCACTCTGGA
TAAGGTGAAATCTGCTCTGGAAAGGGCCGACAAGGAGCTGGTTAAGAAACGGTCCACGCTTTGGAAATCGTCGTCGTCGCCGTCGTACTCGTCGTCGTCGTCGTCGGCGG
CGAAGGAAGAAGAGGCGGCGGAGATTAGAAACGCGGCGGCGCCGATGGCGGTGGGTTGTCCTGGATGTTTGTCTTATGTATTGGTAATGAAAAACAACCCCAGATGTCCT
CGTTGCAATTCTGTTGTTCCATTGCCGAGCATCATCAAGAAACCTCGGATTGATCTAAATATGTCGATATGA
Protein sequence
MAADVTSLVRVLAGDDNRTALGNRQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQHQMMINQTESKFQDLNFPPSPSK
RTLNLFNETSLDLKLTAASPSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAKEEEAAEIRNAAAPMAVGCPGCLSYVLVMKNNPRCP
RCNSVVPLPSIIKKPRIDLNMSI
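
The protein sequence corresponds to the CDS read codon by codon. The following is a minimal Python sketch of that translation, assuming the standard genetic code (NCBI translation table 1). `translate` and `CODON_TABLE` are illustrative names, and the two inputs are the first 42 and the last 15 nucleotides of the CDS above rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First 14 codons of the CDS.
print(translate("ATGGCCGCCGATGTTACCAGTCTGGTTCGAGTTCTCGCCGGC"))
# Last 5 codons of the CDS, ending in the TGA stop.
print(translate("AATATGTCGATATGA"))
```

The two calls print `MAADVTSLVRVLAG` and `NMSI`, matching the start and end of the protein sequence listed above.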