; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy10g009100 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy10g009100
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionUnknown protein
Genome locationChr10:35548793..35554466
RNA-Seq ExpressionLcy10g009100
SyntenyLcy10g009100
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582051.1 hypothetical protein SDJN03_22053, partial [Cucurbita argyrosperma subsp. sororia]1.8e-11471.99Show/hide
Query:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS
        M  +IPEIQSRC+END CLAD ANASAGS KTSPEVF SAIEFYVWSD+GINL+VDLDS PLDW ERLKNEVY CES+YRDKCLQQN CWF GHKE A+ 
Subjt:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS

Query:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD
        FQWNN PG+ KDDYLQKE PSSSNLM  +C  IDQL+EA+GSVI+SA+TSRAINADASE +DEDQ  ISSE DFDVQNQK A SEICT EDNR MNLD D
Subjt:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD

Query:  INNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQI
        +NNAL+K    DP+SGGPSSL+T +HQN +LE++I   STLQNSC ILNLSV+NPGS AAG M+M+SSDIEQCP D                VSDYSLQ 
Subjt:  INNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQI

Query:  CAEKSVR
         AEK  R
Subjt:  CAEKSVR

KAG7018483.1 hypothetical protein SDJN02_20352, partial [Cucurbita argyrosperma subsp. argyrosperma]7.9e-11572.31Show/hide
Query:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS
        M  +IPEIQSRC+END CLAD ANASAGS KTSPEVF SAIEFYVWSD+GINL+VDLDS PLDW ERLKNEVY CES+YRDKCLQQN CWF GHKE AKS
Subjt:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS

Query:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD
        FQWNN PG+ KDDYLQKE PSSSNLM  +C   DQL+EA+GSVI+SA+TSRAINADASE++DEDQ  ISSE DFDVQNQK A SEICT EDNR MNLD D
Subjt:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD

Query:  INNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQI
        +NNAL+K    DP+SGGPSSL+T +HQN +LES+I   STLQNSC ILNLSV+NPGS AAG M+++SSDIEQCP D                VSDYSLQ 
Subjt:  INNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQI

Query:  CAEKSVR
         AEK  R
Subjt:  CAEKSVR

XP_022955760.1 uncharacterized protein LOC111457658 isoform X1 [Cucurbita moschata]1.8e-11471.99Show/hide
Query:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS
        M  +IPEIQSRC+END CLAD ANASAGS KTSPEVF SAIEFYVWSD+GINL+VDLDS PLDW ERLKNEVY CES+YRDKCLQQN CWF GHKE A+ 
Subjt:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS

Query:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD
        FQWNN PG+ KDDYLQKE PSSSNLM  +C  IDQL+EA+GSVI+SA+TSRAINADASE +DEDQ  ISSE DFDVQNQK A SEICT EDNR MNLD D
Subjt:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD

Query:  INNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQI
        +NNAL+K    DP+SGGPSSL+T +HQN +LE++I   STLQNSC ILNLSV+NPGS AAG M+M+SSDIEQCP D                VSDYSLQ 
Subjt:  INNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQI

Query:  CAEKSVR
         AEK  R
Subjt:  CAEKSVR

XP_022955766.1 uncharacterized protein LOC111457658 isoform X2 [Cucurbita moschata]3.6e-11571.61Show/hide
Query:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS
        M  +IPEIQSRC+END CLAD ANASAGS KTSPEVF SAIEFYVWSD+GINL+VDLDS PLDW ERLKNEVY CES+YRDKCLQQN CWF GHKE A+ 
Subjt:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS

Query:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD
        FQWNN PG+ KDDYLQKE PSSSNLM  +C  IDQL+EA+GSVI+SA+TSRAINADASE +DEDQ  ISSE DFDVQNQK A SEICT EDNR MNLD D
Subjt:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD

Query:  INNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQI
        +NNAL+K    DP+SGGPSSL+T +HQN +LE++I   STLQNSC ILNLSV+NPGS AAG M+M+SSDIEQCP D                VSDYSLQ 
Subjt:  INNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQI

Query:  CAEKSVRRGR
         AEK VR  +
Subjt:  CAEKSVRRGR

XP_038905854.1 uncharacterized protein LOC120091797 [Benincasa hispida]3.0e-11471.57Show/hide
Query:  MAEHIPEIQSRCMENDACLADANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKSF
        M  +IPEIQSR +E+     + NASAGS KTSPEVF SAIEFYVWSD+GINLYVDL+S PLDW ERLKNEVY CES+YRDKCLQQN CWF GHKE AKSF
Subjt:  MAEHIPEIQSRCMENDACLADANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKSF

Query:  QWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLDI
        QWNN   + K DY++KETPS SNLM  +CTE D+L+EADG V++SA+TS A NAD SENLDEDQ  ISSE D D+QNQK A SE C  EDNRA NLD +I
Subjt:  QWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLDI

Query:  NNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQIC
        +N LQKK + DPISGG S+L+TL HQN  LESE+   STLQ SCSILN  VENPGSSAAGSM+M+SSDI+QC KDVSCSPCR LP RDS NVSDYSLQ  
Subjt:  NNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQIC

Query:  AEKSVR
        AEKS R
Subjt:  AEKSVR

TrEMBL top hitse value%identityAlignment
A0A6J1GUR5 uncharacterized protein LOC111457658 isoform X18.5e-11571.99Show/hide
Query:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS
        M  +IPEIQSRC+END CLAD ANASAGS KTSPEVF SAIEFYVWSD+GINL+VDLDS PLDW ERLKNEVY CES+YRDKCLQQN CWF GHKE A+ 
Subjt:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS

Query:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD
        FQWNN PG+ KDDYLQKE PSSSNLM  +C  IDQL+EA+GSVI+SA+TSRAINADASE +DEDQ  ISSE DFDVQNQK A SEICT EDNR MNLD D
Subjt:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD

Query:  INNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQI
        +NNAL+K    DP+SGGPSSL+T +HQN +LE++I   STLQNSC ILNLSV+NPGS AAG M+M+SSDIEQCP D                VSDYSLQ 
Subjt:  INNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQI

Query:  CAEKSVR
         AEK  R
Subjt:  CAEKSVR

A0A6J1GUR9 uncharacterized protein LOC111457658 isoform X21.7e-11571.61Show/hide
Query:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS
        M  +IPEIQSRC+END CLAD ANASAGS KTSPEVF SAIEFYVWSD+GINL+VDLDS PLDW ERLKNEVY CES+YRDKCLQQN CWF GHKE A+ 
Subjt:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS

Query:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD
        FQWNN PG+ KDDYLQKE PSSSNLM  +C  IDQL+EA+GSVI+SA+TSRAINADASE +DEDQ  ISSE DFDVQNQK A SEICT EDNR MNLD D
Subjt:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD

Query:  INNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQI
        +NNAL+K    DP+SGGPSSL+T +HQN +LE++I   STLQNSC ILNLSV+NPGS AAG M+M+SSDIEQCP D                VSDYSLQ 
Subjt:  INNALQKKASCDPISGGPSSLATLEHQNSVLESEI---STLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQI

Query:  CAEKSVRRGR
         AEK VR  +
Subjt:  CAEKSVRRGR

A0A6J1IPS1 uncharacterized protein LOC111479409 isoform X26.1e-11372.13Show/hide
Query:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS
        M  +IPEIQSRC+END CLAD ANASAGS KTSPEVF SAIEFYVWSD+GINL+VDL+S PLDW ERLKNEVY CES+YRDKCLQQN CWF GHKE AKS
Subjt:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS

Query:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD
        FQWNN PG+ KDDYLQKE PSSSNLM  +C   DQL+EA+GSVI+SA+TSRAINADASE++DEDQ  ISSE  FDVQNQK A SEICT EDNR MNLD D
Subjt:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD

Query:  INNALQKKASCDPISGGPSSLATLEHQNSVLE-SEISTLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQICA
        +NNAL+K    DP SGGPSSL+T +HQN + E  E STLQNSC ILNLSV+NPGS AAGSM+M+SSDIEQCP D                VSDYSLQ   
Subjt:  INNALQKKASCDPISGGPSSLATLEHQNSVLE-SEISTLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQICA

Query:  EKSVR
        EK VR
Subjt:  EKSVR

A0A6J1IUF8 uncharacterized protein LOC111479409 isoform X13.0e-11271.8Show/hide
Query:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS
        M  +IPEIQSRC+END CLAD ANASAGS KTSPEVF SAIEFYVWSD+GINL+VDL+S PLDW ERLKNEVY CES+YRDKCLQQN CWF GHKE AKS
Subjt:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS

Query:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD
        FQWNN PG+ KDDYLQKE PSSSNLM  +C   DQL+EA+GSVI+SA+TSRAINADASE++DEDQ  ISSE  FDVQNQK A SEICT EDNR MNLD D
Subjt:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD

Query:  INNALQKKASCDPISGGPSSLATLEHQNSVLE-SEISTLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQICA
        +NNAL+K    DP SGGPSSL+T +HQN + E  E STLQNSC ILNLSV+NPGS AAGSM+M+SSDIEQCP D                VSDYSLQ   
Subjt:  INNALQKKASCDPISGGPSSLATLEHQNSVLE-SEISTLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQICA

Query:  EKSVR
        EK  R
Subjt:  EKSVR

A0A6J1IXP1 uncharacterized protein LOC111479409 isoform X41.5e-11472.17Show/hide
Query:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS
        M  +IPEIQSRC+END CLAD ANASAGS KTSPEVF SAIEFYVWSD+GINL+VDL+S PLDW ERLKNEVY CES+YRDKCLQQN CWF GHKE AKS
Subjt:  MAEHIPEIQSRCMENDACLAD-ANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKS

Query:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD
        FQWNN PG+ KDDYLQKE PSSSNLM  +C   DQL+EA+GSVI+SA+TSRAINADASE++DEDQ  ISSE  FDVQNQK A SEICT EDNR MNLD D
Subjt:  FQWNNQPGVTKDDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLD

Query:  INNALQKKASCDPISGGPSSLATLEHQNSVLE-SEISTLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQICA
        +NNAL+K    DP SGGPSSL+T +HQN + E  E STLQNSC ILNLSV+NPGS AAGSM+M+SSDIEQCP D                VSDYSLQ   
Subjt:  INNALQKKASCDPISGGPSSLATLEHQNSVLE-SEISTLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQICA

Query:  EKSVRRGRT
        EK VRRG T
Subjt:  EKSVRRGRT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G59430.1 unknown protein7.1e-0544.44Show/hide
Query:  SSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDK
        SS+ EF+V  ++GI+L VDL+  P DW   +++EV  C+S+ R K
Subjt:  SSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDK

AT3G59430.2 unknown protein7.1e-0544.44Show/hide
Query:  SSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDK
        SS+ EF+V  ++GI+L VDL+  P DW   +++EV  C+S+ R K
Subjt:  SSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDK

AT3G59430.3 unknown protein7.1e-0544.44Show/hide
Query:  SSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDK
        SS+ EF+V  ++GI+L VDL+  P DW   +++EV  C+S+ R K
Subjt:  SSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGAACATATCCCTGAGATCCAAAGTAGATGTATGGAGAATGATGCGTGCCTAGCTGATGCAAATGCTTCTGCTGGTTCTCTTAAAACTTCTCCAGAAGTCTTCTC
ATCTGCTATTGAGTTCTATGTCTGGTCTGATGACGGGATTAATCTTTACGTGGATTTAGACTCTAGGCCTTTGGACTGGGCTGAAAGATTAAAAAATGAGGTTTACTTCT
GTGAGAGCGTCTACAGAGACAAATGTTTGCAGCAGAACTTCTGTTGGTTTAATGGTCATAAAGAGATTGCAAAATCTTTTCAGTGGAATAACCAACCTGGCGTAACCAAG
GATGACTATTTACAGAAAGAAACTCCCTCCAGCTCAAACCTGATGATAAAAGACTGCACGGAGATTGACCAACTAAATGAAGCTGATGGATCTGTAATCTACTCTGCAAT
GACATCACGTGCCATTAATGCAGATGCTTCTGAGAATTTAGATGAAGACCAGGCAACCATTTCTTCTGAAATTGATTTTGATGTGCAAAACCAGAAACCTGCTGAGTCTG
AAATTTGTACTACAGAGGATAATCGTGCAATGAATCTTGATTTAGATATTAATAATGCTTTACAGAAAAAGGCAAGTTGTGATCCAATTTCTGGTGGTCCATCTAGTCTT
GCCACATTAGAACATCAAAATTCTGTGCTTGAAAGTGAAATTTCAACACTACAGAACAGCTGCAGTATTTTAAACCTCTCTGTCGAGAATCCTGGAAGCTCAGCAGCTGG
TTCCATGGAGATGCAATCATCAGATATTGAACAATGCCCTAAAGACGTCTCTTGTTCACCTTGTAGAGCATTGCCACTGAGAGACTCAAAGAATGTTTCTGATTACAGCT
TGCAAATATGTGCAGAGAAGTCGGTAAGGAGAGGAAGAACTTAA
mRNA sequenceShow/hide mRNA sequence
TGGAGAACAGTTTCAAAGTTCTTAAGCCACTCCTTAATCGACTCTCCGCAATGTAAAAGTTCGAACAAATCGTTACTATCTTCTCGATTCTTTCTACAATTTCCAATCAG
GTGATATGGCTGAACATATCCCTGAGATCCAAAGTAGATGTATGGAGAATGATGCGTGCCTAGCTGATGCAAATGCTTCTGCTGGTTCTCTTAAAACTTCTCCAGAAGTC
TTCTCATCTGCTATTGAGTTCTATGTCTGGTCTGATGACGGGATTAATCTTTACGTGGATTTAGACTCTAGGCCTTTGGACTGGGCTGAAAGATTAAAAAATGAGGTTTA
CTTCTGTGAGAGCGTCTACAGAGACAAATGTTTGCAGCAGAACTTCTGTTGGTTTAATGGTCATAAAGAGATTGCAAAATCTTTTCAGTGGAATAACCAACCTGGCGTAA
CCAAGGATGACTATTTACAGAAAGAAACTCCCTCCAGCTCAAACCTGATGATAAAAGACTGCACGGAGATTGACCAACTAAATGAAGCTGATGGATCTGTAATCTACTCT
GCAATGACATCACGTGCCATTAATGCAGATGCTTCTGAGAATTTAGATGAAGACCAGGCAACCATTTCTTCTGAAATTGATTTTGATGTGCAAAACCAGAAACCTGCTGA
GTCTGAAATTTGTACTACAGAGGATAATCGTGCAATGAATCTTGATTTAGATATTAATAATGCTTTACAGAAAAAGGCAAGTTGTGATCCAATTTCTGGTGGTCCATCTA
GTCTTGCCACATTAGAACATCAAAATTCTGTGCTTGAAAGTGAAATTTCAACACTACAGAACAGCTGCAGTATTTTAAACCTCTCTGTCGAGAATCCTGGAAGCTCAGCA
GCTGGTTCCATGGAGATGCAATCATCAGATATTGAACAATGCCCTAAAGACGTCTCTTGTTCACCTTGTAGAGCATTGCCACTGAGAGACTCAAAGAATGTTTCTGATTA
CAGCTTGCAAATATGTGCAGAGAAGTCGGTAAGGAGAGGAAGAACTTAAGTGTTGCAATGGAGAGTTCAGAATGCTCTCAGTTCCCTGACTCTTTGGAGAAGACGTTGCC
TGTATCTCATATTATCGAATCTAATGGAGCACATAAGAGAAAGAGGAAACTCACTAAGAATGAAACAAGATGTCATTATAGTGAACCTGATAGGAGAGTTTTAAGAAGCC
TAATGAAAAATGCTAGGCGAGTGCTACCTAGAAGATCCCAGCGGCTAAATTTAAAGACTGTGAGTTCGTAAAGTGCATTGGGACATGATGAAACATGGAAGGATCAATTT
CAAGTTTCACATCGTCTGTGGCCTAAGGAAGTATTACATCAACAGTACGTATGAATATCTTTATCAAATGTTTGCCCATGGAAAAGCTGTTCGTTTTGAGAAGACCAACA
TTTTCCAAGATGAAAATCAATCAGCTTATTTTAGGAATGTATCACTCCAACAATGATTCTTATTCAAGAACTTGAATTGGTTAACCAGTTCATCATCTTACAGATGGTTA
CCATAAATGTAGAATCAGAAAACTTTACAATTTTATAATGCTCACAAGGCATCCAGATTTGAGATTTTAGATCTCATACCATTAAAAGTAGTGGCAAACGTGATTCAGAA
TCATTAAAATTGTGGATTGTTGTTGCTGTTCTCGTGGGATTGTTTCCATCTGAAGAAAATGCTTACGAGCATCAATGTTTGAAGATAGCATAGAATTCAAATGCAAATTT
TGATTCTTACGAATATTCAGAATCAATTCACCCAATCTAGTTCATCTCATTCATGATGCTTTAGACAAAACTACAATGCATAAATGTTTCAATTGAAATTGAGCTGTCAC
CATCTTTGACCCCTCATAACATGTACCAGATTGTTTTTCTATCACACATTTGTTTCCCTATGGTTTTCTTTCAAAATATAGCCTCTCCCTAAGGTCATTCTCTACCTAAA
GAGGTAGATCCCTAGCACCCCTTCCTTACGATGTTGTTTGTTCTTTAAATCTAAAAGGACAGTTTTTCTTTTCTTCGCTCCATTAAATGCTGATAAAGCTTTTTGATTTT
TTATGATAATTCTATCTCAAAACCAATTGGCAATGAGTGGGGTGACCCTTCTACTTTATAATTTGTGAGGTCTCCACTAATTTTTTCAAGGTGGGATCCTCAACAAAATG
CTTCTTACCGGCCCCAAGCAGGATTTGACATTTTGTGTCTCTAATGATTCTAACTTGTGCAGGTTGCCACGTTATAAATCGACATTTTTTCCAGGGAAAGTACAGGCTTG
AATCCAGGTTTCTTCTCAGTTGGATGGGTAAGAACTCTTGTGATTTTATGGAAGAAATCAAGTGTGTGTAATGCTGATAAGTTTCATACGAAGGAATTTGTATAAACAAG
ATTCTTTAGTAACACATCTCTGATCATTATATGAAATTCACTTTAATAAAAACGACTTTCAGTATGGATCAGAGCTTGTGTAGTTTCTGCTGTTGGGAGAAATTTTGTCA
TTTTTTTCTTGTTGGATTGCACAAATTGGAACTAGTCCTAGCTTAAAATTTTTCATTTGATGTTTC
Protein sequenceShow/hide protein sequence
MAEHIPEIQSRCMENDACLADANASAGSLKTSPEVFSSAIEFYVWSDDGINLYVDLDSRPLDWAERLKNEVYFCESVYRDKCLQQNFCWFNGHKEIAKSFQWNNQPGVTK
DDYLQKETPSSSNLMIKDCTEIDQLNEADGSVIYSAMTSRAINADASENLDEDQATISSEIDFDVQNQKPAESEICTTEDNRAMNLDLDINNALQKKASCDPISGGPSSL
ATLEHQNSVLESEISTLQNSCSILNLSVENPGSSAAGSMEMQSSDIEQCPKDVSCSPCRALPLRDSKNVSDYSLQICAEKSVRRGRT