; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg004086 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg004086
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionFilamentous hemagglutinin transporter
Genome locationscaffold4:934099..935285
RNA-Seq ExpressionSpg004086
SyntenySpg004086
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7012065.1 hypothetical protein SDJN02_26973, partial [Cucurbita argyrosperma subsp. argyrosperma]5.7e-10081.82Show/hide
Query:  MAADVTSLVRVLA--------GYNDDDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQQQ
        MAA V++L+RVLA        GYND+DNRT LGNGQE T LVTRDLLGQSSNLA SQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSK  Q
Subjt:  MAADVTSLVRVLA--------GYNDDDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQQQ

Query:  MMINQTESKFQDLNFPPSPSKRTLNLFSETSLDLKLTAS-SPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAK---EE
         M NQTESKFQDLNFPPSPSKRTLNLF+ETSLDLKLT+S SPSPS S+NYASVCTLDKVKSALERADKELVKKRS+LWKS+SSPSYSSSSS+AAK   EE
Subjt:  MMINQTESKFQDLNFPPSPSKRTLNLFSETSLDLKLTAS-SPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAK---EE

Query:  EAAAEIRNS-------AAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI
        E A E RN        AAPMAVGC GCLSYVLV KNNPRCPRCNSVVPL S +KKPRIDLNMSI
Subjt:  EAAAEIRNS-------AAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI

XP_004139833.1 uncharacterized protein LOC101214550 [Cucumis sativus]7.5e-10888.76Show/hide
Query:  MAADVTSLVRVLAGYNDDDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQQQMMINQTES
        MAA+V+SL+RVLAGY DDDNRTALGNGQ+STALVTRDLLGQSSNL DSQELDLDLQVP+GWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQ Q MINQTES
Subjt:  MAADVTSLVRVLAGYNDDDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQQQMMINQTES

Query:  KFQDLNFPPSPSKRTLNLFSETSLDLKLTASSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAK----EEEAAAEIRN
        KFQDLNFPPSPSKRTLNLF+ETSLDLKLT+S     SS+NYASVCTLDKVKSALERADKELVKKRS+LWKS+SSPSYSSSSSSAA     +EE AAEIRN
Subjt:  KFQDLNFPPSPSKRTLNLFSETSLDLKLTASSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAK----EEEAAAEIRN

Query:  SAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI
        SAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLP+ IKKPRIDLNMSI
Subjt:  SAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI

XP_008447115.1 PREDICTED: uncharacterized protein LOC103489640 [Cucumis melo]2.0e-10889.11Show/hide
Query:  MAADVTSLVRVLAGYNDDDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQQQMMINQTES
        MAA+V+SL+RVLAGY +DDNRTALGNGQ+STALVTRDLLGQSSNL DSQELDLDLQVP+GWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQ Q MINQTES
Subjt:  MAADVTSLVRVLAGYNDDDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQQQMMINQTES

Query:  KFQDLNFPPSPSKRTLNLFSETSLDLKLTASSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAK---EEEAAAEIRNS
        KFQDLNFPPSPSKRTLNLF+ETSLDLKLT+   SPS S+NYASVCTLDKVKSALERADKELVKKRS+LWKS+SSPSYSSSSSSAA    +EE AAEIRNS
Subjt:  KFQDLNFPPSPSKRTLNLFSETSLDLKLTASSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAK---EEEAAAEIRNS

Query:  AAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI
        AAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLP+ IKKPRIDLNMSI
Subjt:  AAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI

XP_022952111.1 uncharacterized protein LOC111454876 [Cucurbita moschata]1.2e-10083.72Show/hide
Query:  MAADVTSLVRVLA-----GYNDDDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQQQMMI
        MAA V++L+RVLA     GYND+DNRT LGNGQE T LVTRDLLGQSSNLA SQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSK  Q M 
Subjt:  MAADVTSLVRVLA-----GYNDDDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQQQMMI

Query:  NQTESKFQDLNFPPSPSKRTLNLFSETSLDLKLTAS-SPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAK---EEEAA
        NQTESKFQDLNFPPSPSKRTLNLF+ETSLDLKLT+S SPSPS S+NYASVCTLDKVKSALERADKELVKKRS+LWKS+SSPSYSSSSS+AAK   EEE A
Subjt:  NQTESKFQDLNFPPSPSKRTLNLFSETSLDLKLTAS-SPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAK---EEEAA

Query:  AEIRNS----AAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI
         E RN     AAPMAVGC GCLSYVLV KNNPRCPRCNSVVPL S +KKPRIDLNMSI
Subjt:  AEIRNS----AAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI

XP_022969452.1 uncharacterized protein LOC111468450 [Cucurbita maxima]2.1e-10286.51Show/hide
Query:  MAADVTSLVRVL--AGYNDDDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQQQMMINQT
        MAA V+SL+RVL  AGYND+DNRT LGNGQE T LVTRDLLGQSSNLA SQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQ Q M NQT
Subjt:  MAADVTSLVRVL--AGYNDDDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQQQMMINQT

Query:  ESKFQDLNFPPSPSKRTLNLFSETSLDLKLTASSP-SPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAK---EEEAAAEI
        ESKFQDLNFPPSPSKRTLNLF+ETSLDLKLT+SSP SPS S+NYASVCTLDKVKSALERADKELVKKRS+LWKS+SSPSYSSSSS+AAK   EEE A E 
Subjt:  ESKFQDLNFPPSPSKRTLNLFSETSLDLKLTASSP-SPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAK---EEEAAAEI

Query:  RN-SAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI
        RN +AAPMAVGC GCLSYVLV KNNPRCPRCNSVVPL S +KKPRIDLNMSI
Subjt:  RN-SAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI

TrEMBL top hitse value%identityAlignment
A0A0A0K3J9 Uncharacterized protein3.6e-10888.76Show/hide
Query:  MAADVTSLVRVLAGYNDDDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQQQMMINQTES
        MAA+V+SL+RVLAGY DDDNRTALGNGQ+STALVTRDLLGQSSNL DSQELDLDLQVP+GWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQ Q MINQTES
Subjt:  MAADVTSLVRVLAGYNDDDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQQQMMINQTES

Query:  KFQDLNFPPSPSKRTLNLFSETSLDLKLTASSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAK----EEEAAAEIRN
        KFQDLNFPPSPSKRTLNLF+ETSLDLKLT+S     SS+NYASVCTLDKVKSALERADKELVKKRS+LWKS+SSPSYSSSSSSAA     +EE AAEIRN
Subjt:  KFQDLNFPPSPSKRTLNLFSETSLDLKLTASSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAK----EEEAAAEIRN

Query:  SAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI
        SAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLP+ IKKPRIDLNMSI
Subjt:  SAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI

A0A1S3BGM6 uncharacterized protein LOC1034896409.5e-10989.11Show/hide
Query:  MAADVTSLVRVLAGYNDDDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQQQMMINQTES
        MAA+V+SL+RVLAGY +DDNRTALGNGQ+STALVTRDLLGQSSNL DSQELDLDLQVP+GWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQ Q MINQTES
Subjt:  MAADVTSLVRVLAGYNDDDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQQQMMINQTES

Query:  KFQDLNFPPSPSKRTLNLFSETSLDLKLTASSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAK---EEEAAAEIRNS
        KFQDLNFPPSPSKRTLNLF+ETSLDLKLT+   SPS S+NYASVCTLDKVKSALERADKELVKKRS+LWKS+SSPSYSSSSSSAA    +EE AAEIRNS
Subjt:  KFQDLNFPPSPSKRTLNLFSETSLDLKLTASSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAK---EEEAAAEIRNS

Query:  AAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI
        AAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLP+ IKKPRIDLNMSI
Subjt:  AAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI

A0A6J1EXD9 uncharacterized protein LOC1114370422.1e-9583Show/hide
Query:  MAADVTSLVRVLAGYNDDDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQQQMMINQTES
        MAA V+SL+RVLAGY D DNR AL   +ESTAL TRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQ+QM  NQTES
Subjt:  MAADVTSLVRVLAGYNDDDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQQQMMINQTES

Query:  KFQDLNFPPSPSKRTLNLFSETSLDLKLTASSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAKE--EEAAAEIRNSA
        K QDLNFPPSPSKRTLNLF+ETSLDL LT       SS+NYASVCTLDKVKSALERADKEL+KKRSTLWKS SSP       SAAKE  EE AAE R SA
Subjt:  KFQDLNFPPSPSKRTLNLFSETSLDLKLTASSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAKE--EEAAAEIRNSA

Query:  APMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI
        APMAVGCPGCLSYVLV KNNPRCPRCNSVVPLPS IKKPRIDLN+SI
Subjt:  APMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI

A0A6J1GJC7 uncharacterized protein LOC1114548765.6e-10183.72Show/hide
Query:  MAADVTSLVRVLA-----GYNDDDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQQQMMI
        MAA V++L+RVLA     GYND+DNRT LGNGQE T LVTRDLLGQSSNLA SQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSK  Q M 
Subjt:  MAADVTSLVRVLA-----GYNDDDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQQQMMI

Query:  NQTESKFQDLNFPPSPSKRTLNLFSETSLDLKLTAS-SPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAK---EEEAA
        NQTESKFQDLNFPPSPSKRTLNLF+ETSLDLKLT+S SPSPS S+NYASVCTLDKVKSALERADKELVKKRS+LWKS+SSPSYSSSSS+AAK   EEE A
Subjt:  NQTESKFQDLNFPPSPSKRTLNLFSETSLDLKLTAS-SPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAK---EEEAA

Query:  AEIRNS----AAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI
         E RN     AAPMAVGC GCLSYVLV KNNPRCPRCNSVVPL S +KKPRIDLNMSI
Subjt:  AEIRNS----AAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI

A0A6J1HWE1 uncharacterized protein LOC1114684501.0e-10286.51Show/hide
Query:  MAADVTSLVRVL--AGYNDDDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQQQMMINQT
        MAA V+SL+RVL  AGYND+DNRT LGNGQE T LVTRDLLGQSSNLA SQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQ Q M NQT
Subjt:  MAADVTSLVRVL--AGYNDDDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQQQMMINQT

Query:  ESKFQDLNFPPSPSKRTLNLFSETSLDLKLTASSP-SPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAK---EEEAAAEI
        ESKFQDLNFPPSPSKRTLNLF+ETSLDLKLT+SSP SPS S+NYASVCTLDKVKSALERADKELVKKRS+LWKS+SSPSYSSSSS+AAK   EEE A E 
Subjt:  ESKFQDLNFPPSPSKRTLNLFSETSLDLKLTASSP-SPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAK---EEEAAAEI

Query:  RN-SAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI
        RN +AAPMAVGC GCLSYVLV KNNPRCPRCNSVVPL S +KKPRIDLNMSI
Subjt:  RN-SAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G16500.1 unknown protein3.3e-4546.91Show/hide
Query:  MAADVTSLVRVLAGYNDDDNRTALGNGQEST-ALVTRDLLGQSSNLA-------DSQELDLDLQVPSGWEKRLDLKSGKVYIQR---SQTPDSPLNSDSK
        MAADV+SLVR+L+ + DD        G  ST AL+TRDLLG    +         S ELDLD+QVP+GWEKRLDLKSGKVY+Q+   S +  S  +    
Subjt:  MAADVTSLVRVLAGYNDDDNRTALGNGQEST-ALVTRDLLGQSSNLA-------DSQELDLDLQVPSGWEKRLDLKSGKVYIQR---SQTPDSPLNSDSK

Query:  QQQMMINQTESKFQDLNFPP----SPSKRTLNLF---SETSLDLKLTASS-----PSPSSS-------SNYASVCTLDKVKSALERADKELVKKRSTLWK
          +   NQT  +FQDLN PP     P+K  L+LF    +TSL+LKL  SS     P P SS       S  +SVCTLDKVK ALERA+K+  K++S    
Subjt:  QQQMMINQTESKFQDLNFPP----SPSKRTLNLF---SETSLDLKLTASS-----PSPSSS-------SNYASVCTLDKVKSALERADKELVKKRSTLWK

Query:  SSSSPSYSSSSSSAAKEEEAAAEIRNSAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI
              Y  ++S+             +A+ +A GCPGCLSYV V KNNP+CPRC+S VPLP+ +KKP+IDLN+S+
Subjt:  SSSSPSYSSSSSSAAKEEEAAAEIRNSAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSIIKKPRIDLNMSI

AT1G79160.1 unknown protein2.4e-4850.98Show/hide
Query:  MAADVTSLVRVLAGYNDDDNRTALGNG-QESTALVTRDLLGQSSNLAD--SQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQQQMMINQ
        MAADV+SLVR+L+GY DD        G + S AL+TRDLLG         S ELDLDLQVP+G+EKRLDLKSGKVY+QR  +  S   +++ Q     NQ
Subjt:  MAADVTSLVRVLAGYNDDDNRTALGNG-QESTALVTRDLLGQSSNLAD--SQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQQQMMINQ

Query:  TESKFQDLNFPPSPSKRT--LNLFSETSLDLKLTASS-PSPSSSSNYASVCTLDKVKSALERADKE--LVKKRSTLWKSSSSPSYSSSSSSAAKEEEAAA
        T   FQDLNFPP     +  LNLF +T+ +LKL  SS  S  ++SN  SVCTLDKVKSALERA+++  + KKR    +S     Y    + A        
Subjt:  TESKFQDLNFPPSPSKRT--LNLFSETSLDLKLTASS-PSPSSSSNYASVCTLDKVKSALERADKE--LVKKRSTLWKSSSSPSYSSSSSSAAKEEEAAA

Query:  EIRNSAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPS--IIKKPRIDLNMSI
             A+P+  GCPGCLSYVLVM NNP+CPRC+++VPLP+  + KKP+IDLN+SI
Subjt:  EIRNSAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPS--IIKKPRIDLNMSI

AT5G22270.1 unknown protein4.8e-0443.28Show/hide
Query:  KSSSSPSYSS-----SSSSAAKEEEAAAEIRNSAAPMAVGCPGCLSYVLV-MKNNPRCPRCNSVVPL
        K + SPS  S     SSSS+   EEA      + + + VGCP C+ Y++  ++N+PRCPRCNS V L
Subjt:  KSSSSPSYSS-----SSSSAAKEEEAAAEIRNSAAPMAVGCPGCLSYVLV-MKNNPRCPRCNSVVPL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCCGATGTTACCAGTCTGGTTCGAGTCCTCGCCGGCTACAACGACGACGATAATCGGACGGCCCTCGGTAATGGCCAGGAATCGACGGCTCTCGTTACTCGCGA
TTTGCTCGGTCAATCTTCCAACCTCGCCGACTCTCAGGAACTCGACCTCGATTTGCAGGTTCCCTCCGGCTGGGAAAAACGCCTCGACTTGAAGTCGGGAAAAGTTTACA
TACAGAGAAGTCAAACTCCAGATTCTCCGCTGAATTCAGATTCGAAGCAACAGCAAATGATGATCAATCAAACGGAATCCAAATTCCAGGACTTGAATTTCCCTCCATCT
CCGTCTAAACGGACGTTAAATCTCTTCAGCGAAACCAGCTTGGATTTGAAACTGACGGCGTCGTCGCCGTCGCCGTCGTCGTCGAGCAATTACGCCAGCGTGTGCACTCT
GGATAAGGTGAAATCTGCTTTGGAAAGGGCCGACAAGGAGCTGGTTAAGAAACGGTCGACGCTTTGGAAATCGTCGTCGTCGCCGTCGTACTCCTCGTCGTCGTCGTCGG
CGGCGAAGGAAGAAGAGGCGGCGGCGGAAATTAGAAACTCGGCGGCGCCGATGGCGGTGGGTTGTCCTGGATGTTTGTCTTATGTATTGGTAATGAAAAACAACCCCAGA
TGTCCTCGTTGCAATTCTGTTGTTCCATTGCCGAGCATCATCAAGAAACCTCGGATTGATCTAAATATGTCGATATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGCCGATGTTACCAGTCTGGTTCGAGTCCTCGCCGGCTACAACGACGACGATAATCGGACGGCCCTCGGTAATGGCCAGGAATCGACGGCTCTCGTTACTCGCGA
TTTGCTCGGTCAATCTTCCAACCTCGCCGACTCTCAGGAACTCGACCTCGATTTGCAGGTTCCCTCCGGCTGGGAAAAACGCCTCGACTTGAAGTCGGGAAAAGTTTACA
TACAGAGAAGTCAAACTCCAGATTCTCCGCTGAATTCAGATTCGAAGCAACAGCAAATGATGATCAATCAAACGGAATCCAAATTCCAGGACTTGAATTTCCCTCCATCT
CCGTCTAAACGGACGTTAAATCTCTTCAGCGAAACCAGCTTGGATTTGAAACTGACGGCGTCGTCGCCGTCGCCGTCGTCGTCGAGCAATTACGCCAGCGTGTGCACTCT
GGATAAGGTGAAATCTGCTTTGGAAAGGGCCGACAAGGAGCTGGTTAAGAAACGGTCGACGCTTTGGAAATCGTCGTCGTCGCCGTCGTACTCCTCGTCGTCGTCGTCGG
CGGCGAAGGAAGAAGAGGCGGCGGCGGAAATTAGAAACTCGGCGGCGCCGATGGCGGTGGGTTGTCCTGGATGTTTGTCTTATGTATTGGTAATGAAAAACAACCCCAGA
TGTCCTCGTTGCAATTCTGTTGTTCCATTGCCGAGCATCATCAAGAAACCTCGGATTGATCTAAATATGTCGATATGA
Protein sequenceShow/hide protein sequence
MAADVTSLVRVLAGYNDDDNRTALGNGQESTALVTRDLLGQSSNLADSQELDLDLQVPSGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQQQMMINQTESKFQDLNFPPS
PSKRTLNLFSETSLDLKLTASSPSPSSSSNYASVCTLDKVKSALERADKELVKKRSTLWKSSSSPSYSSSSSSAAKEEEAAAEIRNSAAPMAVGCPGCLSYVLVMKNNPR
CPRCNSVVPLPSIIKKPRIDLNMSI