; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg005248 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg005248
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUnknown protein
Genome locationscaffold11:33908123..33914791
RNA-Seq ExpressionSpg005248
SyntenySpg005248
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582051.1 hypothetical protein SDJN03_22053, partial [Cucurbita argyrosperma subsp. sororia]4.4e-11672.17Show/hide
Query:  GDMAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIA
        GDM  +IPEIQSRC+END CLAD ANASAGS KTSPEVF SAIEFYVWSD+GINL+VDLDS PLDW ERLKNEVY CES+YRDKCLQQN CWF GHKE A
Subjt:  GDMAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIA

Query:  KSFQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLD
        + FQWNN PG+ KDDYLQKE PSSSNLM  +C  IDQL+EA+GSVI+SA+TSRAINADASE +DEDQ  ISSE DFDVQNQK A SEICT EDNR MNLD
Subjt:  KSFQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLD

Query:  LDINNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSL
         D+NNAL+K    DP+SGGPSSL+T +HQN +LE++I   STLQNSC ILNLSV+NPGS AAG M+M+SSDIEQCP D                VSDYSL
Subjt:  LDINNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSL

Query:  QICAEKSVR
        Q  AEK  R
Subjt:  QICAEKSVR

KAG7018483.1 hypothetical protein SDJN02_20352, partial [Cucurbita argyrosperma subsp. argyrosperma]1.3e-11572.4Show/hide
Query:  DMAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAK
        DM  +IPEIQSRC+END CLAD ANASAGS KTSPEVF SAIEFYVWSD+GINL+VDLDS PLDW ERLKNEVY CES+YRDKCLQQN CWF GHKE AK
Subjt:  DMAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAK

Query:  SFQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDL
        SFQWNN PG+ KDDYLQKE PSSSNLM  +C   DQL+EA+GSVI+SA+TSRAINADASE++DEDQ  ISSE DFDVQNQK A SEICT EDNR MNLD 
Subjt:  SFQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDL

Query:  DINNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQ
        D+NNAL+K    DP+SGGPSSL+T +HQN +LES+I   STLQNSC ILNLSV+NPGS AAG M+++SSDIEQCP D                VSDYSLQ
Subjt:  DINNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQ

Query:  ICAEKSVR
          AEK  R
Subjt:  ICAEKSVR

XP_022955760.1 uncharacterized protein LOC111457658 isoform X1 [Cucurbita moschata]1.9e-11471.99Show/hide
Query:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS
        M  +IPEIQSRC+END CLAD ANASAGS KTSPEVF SAIEFYVWSD+GINL+VDLDS PLDW ERLKNEVY CES+YRDKCLQQN CWF GHKE A+ 
Subjt:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS

Query:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD
        FQWNN PG+ KDDYLQKE PSSSNLM  +C  IDQL+EA+GSVI+SA+TSRAINADASE +DEDQ  ISSE DFDVQNQK A SEICT EDNR MNLD D
Subjt:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD

Query:  INNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQI
        +NNAL+K    DP+SGGPSSL+T +HQN +LE++I   STLQNSC ILNLSV+NPGS AAG M+M+SSDIEQCP D                VSDYSLQ 
Subjt:  INNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQI

Query:  CAEKSVR
         AEK  R
Subjt:  CAEKSVR

XP_022955766.1 uncharacterized protein LOC111457658 isoform X2 [Cucurbita moschata]3.8e-11572.31Show/hide
Query:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS
        M  +IPEIQSRC+END CLAD ANASAGS KTSPEVF SAIEFYVWSD+GINL+VDLDS PLDW ERLKNEVY CES+YRDKCLQQN CWF GHKE A+ 
Subjt:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS

Query:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD
        FQWNN PG+ KDDYLQKE PSSSNLM  +C  IDQL+EA+GSVI+SA+TSRAINADASE +DEDQ  ISSE DFDVQNQK A SEICT EDNR MNLD D
Subjt:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD

Query:  INNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQI
        +NNAL+K    DP+SGGPSSL+T +HQN +LE++I   STLQNSC ILNLSV+NPGS AAG M+M+SSDIEQCP D                VSDYSLQ 
Subjt:  INNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQI

Query:  CAEKSVR
         AEK VR
Subjt:  CAEKSVR

XP_038905854.1 uncharacterized protein LOC120091797 [Benincasa hispida]2.4e-11471.57Show/hide
Query:  MAEHIPEIQSRCMENDACLADANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKSF
        M  +IPEIQSR +E+     + NASAGS KTSPEVF SAIEFYVWSD+GINLYVDL+S PLDW ERLKNEVY CES+YRDKCLQQN CWF GHKE AKSF
Subjt:  MAEHIPEIQSRCMENDACLADANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKSF

Query:  QWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLDI
        QWNN   + K DY++KETPS SNLM  +CTE D+L+EADG V++SA+TS A NAD SENLDEDQ  ISSE D D+QNQK A SE C  EDNRA NLD +I
Subjt:  QWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLDI

Query:  NNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQIC
        +N LQKK + DPISGG S+L+TL HQN  LESE+   STLQ SCSILN  VENPGSSAAGSM+M+SSDI+QC KDVSCSPCR LP RDS NVSDYSLQ  
Subjt:  NNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQIC

Query:  AEKSVR
        AEKS R
Subjt:  AEKSVR

TrEMBL top hitse value%identityAlignment
A0A0A0KXF4 Uncharacterized protein4.9e-11371.75Show/hide
Query:  GDMAEHIPEIQSRCMENDACLADANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAK
        GDM  +IPE+QSR +E+     + NASA S KTS EVF SAIEFYVWSD+GINLYVDL+S PLDW ERLKNEVY CES+YRDK LQQN CWF GHKE AK
Subjt:  GDMAEHIPEIQSRCMENDACLADANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAK

Query:  SFQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDL
        SFQWNN  G+ K  YLQKETPS SNLMI + TE  +L+EADGSVI+S +TS AINADASENLDE+Q  ISSE DFD QNQK A SE C  EDNRA +LD 
Subjt:  SFQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDL

Query:  DINNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQ
        +I+N LQKK + DPISGG S L+ L HQN  LESE+   STLQNSCS LNLSVENPGSSAAGSM+M+SSDIEQC KDVSCSPCRALP  DS NVSDY LQ
Subjt:  DINNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQ

Query:  ICAEKSVR
          AEKS R
Subjt:  ICAEKSVR

A0A6J1GUR5 uncharacterized protein LOC111457658 isoform X19.0e-11571.99Show/hide
Query:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS
        M  +IPEIQSRC+END CLAD ANASAGS KTSPEVF SAIEFYVWSD+GINL+VDLDS PLDW ERLKNEVY CES+YRDKCLQQN CWF GHKE A+ 
Subjt:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS

Query:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD
        FQWNN PG+ KDDYLQKE PSSSNLM  +C  IDQL+EA+GSVI+SA+TSRAINADASE +DEDQ  ISSE DFDVQNQK A SEICT EDNR MNLD D
Subjt:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD

Query:  INNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQI
        +NNAL+K    DP+SGGPSSL+T +HQN +LE++I   STLQNSC ILNLSV+NPGS AAG M+M+SSDIEQCP D                VSDYSLQ 
Subjt:  INNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQI

Query:  CAEKSVR
         AEK  R
Subjt:  CAEKSVR

A0A6J1GUR9 uncharacterized protein LOC111457658 isoform X21.8e-11572.31Show/hide
Query:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS
        M  +IPEIQSRC+END CLAD ANASAGS KTSPEVF SAIEFYVWSD+GINL+VDLDS PLDW ERLKNEVY CES+YRDKCLQQN CWF GHKE A+ 
Subjt:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS

Query:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD
        FQWNN PG+ KDDYLQKE PSSSNLM  +C  IDQL+EA+GSVI+SA+TSRAINADASE +DEDQ  ISSE DFDVQNQK A SEICT EDNR MNLD D
Subjt:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD

Query:  INNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQI
        +NNAL+K    DP+SGGPSSL+T +HQN +LE++I   STLQNSC ILNLSV+NPGS AAG M+M+SSDIEQCP D                VSDYSLQ 
Subjt:  INNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQI

Query:  CAEKSVR
         AEK VR
Subjt:  CAEKSVR

A0A6J1IPS1 uncharacterized protein LOC111479409 isoform X26.5e-11372.13Show/hide
Query:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS
        M  +IPEIQSRC+END CLAD ANASAGS KTSPEVF SAIEFYVWSD+GINL+VDL+S PLDW ERLKNEVY CES+YRDKCLQQN CWF GHKE AKS
Subjt:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS

Query:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD
        FQWNN PG+ KDDYLQKE PSSSNLM  +C   DQL+EA+GSVI+SA+TSRAINADASE++DEDQ  ISSE  FDVQNQK A SEICT EDNR MNLD D
Subjt:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD

Query:  INNALQKKASCDPISGGPSSLATLEHQNSVLE-SEISTLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQICA
        +NNAL+K    DP SGGPSSL+T +HQN + E  E STLQNSC ILNLSV+NPGS AAGSM+M+SSDIEQCP D                VSDYSLQ   
Subjt:  INNALQKKASCDPISGGPSSLATLEHQNSVLE-SEISTLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQICA

Query:  EKSVR
        EK VR
Subjt:  EKSVR

A0A6J1IXP1 uncharacterized protein LOC111479409 isoform X46.5e-11372.13Show/hide
Query:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS
        M  +IPEIQSRC+END CLAD ANASAGS KTSPEVF SAIEFYVWSD+GINL+VDL+S PLDW ERLKNEVY CES+YRDKCLQQN CWF GHKE AKS
Subjt:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS

Query:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD
        FQWNN PG+ KDDYLQKE PSSSNLM  +C   DQL+EA+GSVI+SA+TSRAINADASE++DEDQ  ISSE  FDVQNQK A SEICT EDNR MNLD D
Subjt:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD

Query:  INNALQKKASCDPISGGPSSLATLEHQNSVLE-SEISTLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQICA
        +NNAL+K    DP SGGPSSL+T +HQN + E  E STLQNSC ILNLSV+NPGS AAGSM+M+SSDIEQCP D                VSDYSLQ   
Subjt:  INNALQKKASCDPISGGPSSLATLEHQNSVLE-SEISTLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQICA

Query:  EKSVR
        EK VR
Subjt:  EKSVR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G59430.1 unknown protein9.8e-0544.44Show/hide
Query:  SSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDK
        SS+ EF+V  ++GI+L VDL+  P DW   +++EV  C+S+ R K
Subjt:  SSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDK

AT3G59430.2 unknown protein9.8e-0544.44Show/hide
Query:  SSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDK
        SS+ EF+V  ++GI+L VDL+  P DW   +++EV  C+S+ R K
Subjt:  SSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDK

AT3G59430.3 unknown protein9.8e-0544.44Show/hide
Query:  SSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDK
        SS+ EF+V  ++GI+L VDL+  P DW   +++EV  C+S+ R K
Subjt:  SSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDK

AT4G24110.1 unknown protein8.3e-0442.11Show/hide
Query:  EGKVVQGIFRRSDGREWRLECRCVLESEIAESIAATME
        E +V++GIFR+ +G +W+LEC C +E E A  +    E
Subjt:  EGKVVQGIFRRSDGREWRLECRCVLESEIAESIAATME


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGAAGGCGGTGGCGGCGACAACAATCATCTTCTCACCAGCGGAGAGGAGGGCAAGGGGAAGGGGCAAGGGGAAGGGGAAAGGGAAGGAAGGGGGAAGGGGAGGGGA
TCAAGGGGAGGGCAAAGTCGTGCAGGGGATATTCAGGAGGAGTGATGGTAGAGAATGGAGATTGGAGTGTAGATGCGTGCTTGAATCTGAGATCGCCGAATCCATAGCAG
CGACGATGGAGATCAAGAATTTTGAGTACATCGACTTCATCTTGAAGAAACTTTACTTAGTACGTGAATTCAGTAACCTGCAGTTCAGAACTTTATACGGTGATATGGCT
GAACATATCCCTGAGATCCAAAGTAGATGTATGGAGAATGATGCGTGCCTAGCTGATGCAAATGCTTCTGCTGGTTCTCTTAAAACTTCTCCAGAAGTCTTCTCATCTGC
TATTGAGTTCTATGTCTGGTCTGATGACGGGATTAATCTTTACGTGGATTTAGACTCTAGGCCTTTGGACTGGGCTGAAAGATTAAAAAATGAGGTTTACTTCTGTGAGA
GCGTCTACAGAGACAAATGTTTGCAGCAGAACTTCTGTTGGTTTAATGGTCATAAAGAGATTGCAAAATCTTTTCAGTGGAATAACCAACCTGGCGTAACCAAGGATGAC
TATTTACAGAAAGAAACTCCCTCCAGCTCAAACCTGATGATAAAAGACTGCACGGAGATTGACCAACTAAATGAAGCTGATGGATCTGTAATCTACTCTGCAATGACATC
ACGTGCCATTAATGCAGATGCTTCTGAGAATTTAGATGAAGACCAGGCAACCATTTCTTCTGAAATTGATTTTGATGTGCAAAACCAGAAACCTGCTGAGTCTGAAATTT
GTACTACAGAGGATAATCGTGCAATGAATCTTGATTTAGATATTAATAATGCTTTACAGAAAAAGGCAAGTTGTGATCCAATTTCTGGTGGTCCATCTAGTCTTGCCACA
TTAGAACATCAAAATTCTGTGCTTGAAAGTGAAATTTCAACACTACAGAACAGCTGCAGTATTTTAAACCTCTCTGTCGAGAATCCTGGAAGCTCAGCAGCTGGTTCCAT
GGAGATGCAATCATCAGATATTGAACAATGCCCTAAAGACGTCTCTTGTTCACCTTGTAGAGCATTGCCACTGAGAGACTCAAAGAATGTTTCTGATTACAGCTTGCAAA
TATGTGCAGAGAAGTCGGTAAGGTGTTGTTTCGTTGCCAATTTCCTTGTGTTCTGCAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGCAGAAGGCGGTGGCGGCGACAACAATCATCTTCTCACCAGCGGAGAGGAGGGCAAGGGGAAGGGGCAAGGGGAAGGGGAAAGGGAAGGAAGGGGGAAGGGGAGGGGA
TCAAGGGGAGGGCAAAGTCGTGCAGGGGATATTCAGGAGGAGTGATGGTAGAGAATGGAGATTGGAGTGTAGATGCGTGCTTGAATCTGAGATCGCCGAATCCATAGCAG
CGACGATGGAGATCAAGAATTTTGAGTACATCGACTTCATCTTGAAGAAACTTTACTTAGTACGTGAATTCAGTAACCTGCAGTTCAGAACTTTATACGGTGATATGGCT
GAACATATCCCTGAGATCCAAAGTAGATGTATGGAGAATGATGCGTGCCTAGCTGATGCAAATGCTTCTGCTGGTTCTCTTAAAACTTCTCCAGAAGTCTTCTCATCTGC
TATTGAGTTCTATGTCTGGTCTGATGACGGGATTAATCTTTACGTGGATTTAGACTCTAGGCCTTTGGACTGGGCTGAAAGATTAAAAAATGAGGTTTACTTCTGTGAGA
GCGTCTACAGAGACAAATGTTTGCAGCAGAACTTCTGTTGGTTTAATGGTCATAAAGAGATTGCAAAATCTTTTCAGTGGAATAACCAACCTGGCGTAACCAAGGATGAC
TATTTACAGAAAGAAACTCCCTCCAGCTCAAACCTGATGATAAAAGACTGCACGGAGATTGACCAACTAAATGAAGCTGATGGATCTGTAATCTACTCTGCAATGACATC
ACGTGCCATTAATGCAGATGCTTCTGAGAATTTAGATGAAGACCAGGCAACCATTTCTTCTGAAATTGATTTTGATGTGCAAAACCAGAAACCTGCTGAGTCTGAAATTT
GTACTACAGAGGATAATCGTGCAATGAATCTTGATTTAGATATTAATAATGCTTTACAGAAAAAGGCAAGTTGTGATCCAATTTCTGGTGGTCCATCTAGTCTTGCCACA
TTAGAACATCAAAATTCTGTGCTTGAAAGTGAAATTTCAACACTACAGAACAGCTGCAGTATTTTAAACCTCTCTGTCGAGAATCCTGGAAGCTCAGCAGCTGGTTCCAT
GGAGATGCAATCATCAGATATTGAACAATGCCCTAAAGACGTCTCTTGTTCACCTTGTAGAGCATTGCCACTGAGAGACTCAAAGAATGTTTCTGATTACAGCTTGCAAA
TATGTGCAGAGAAGTCGGTAAGGTGTTGTTTCGTTGCCAATTTCCTTGTGTTCTGCAATTAA
Protein sequenceShow/hide protein sequence
MQKAVAATTIIFSPAERRARGRGKGKGKGKEGGRGGDQGEGKVVQGIFRRSDGREWRLECRCVLESEIAESIAATMEIKNFEYIDFILKKLYLVREFSNLQFRTLYGDMA
EHIPEIQSRCMENDACLADANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKSFQWNNQPGVTKDD
YLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLDINNALQKKASCDPISGGPSSLAT
LEHQNSVLESEISTLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQICAEKSVRCCFVANFLVFCN