CuGenDBv2

Spg008116 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg008116
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Unknown protein
Genome location: scaffold1:53232822..53238312
RNA-Seq Expression: Spg008116
Synteny: Spg008116
Gene Ontology terms: NA
InterPro domains: IPR040320 - Uncharacterized protein At4g37920-like
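
The genome location above follows a `scaffold:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the span covered by the gene could be computed as below. `parse_location` is a hypothetical helper written for illustration, not part of CuGenDB.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)

scaffold, start, end = parse_location("scaffold1:53232822..53238312")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(scaffold, start, end, span)
```

Under that assumption the gene spans 5,491 bp on scaffold1.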


Homology
GenBank top hits | e value | % identity | alignment
KAG6603165.1 hypothetical protein SDJN03_03774, partial [Cucurbita argyrosperma subsp. sororia] | e value: 3.0e-104 | identity: 92.66%
Query:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM
        VARLAARCLSAVSAYDRTLE+VETLDSAQVKFDDILNSPSL+VACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMY LYKATKS LRSM
Subjt:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM

Query:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE
        APKEIKLLKHLLNI+DPEERFSALATAFAPGDGSE KDPNA+YTTPKELHKWIKIMLDSYHLNQEDTDIREAR M QP+VIQRLFILKDTIETEYLEQNE
Subjt:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE

Query:  FQNSQTNPNHVSEDAVSI
         QN+Q+ PNHVS +AVSI
Subjt:  FQNSQTNPNHVSEDAVSI
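
The unlabeled middle line of each alignment block appears to follow the usual BLAST convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch. The page itself does not document this, so treat it as an assumption. A minimal sketch that tallies one midline segment under that reading (`midline_stats` is a hypothetical helper):

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one midline segment."""
    # Letters mark identical residues (assumed BLAST convention).
    identities = sum(ch.isalpha() for ch in midline)
    # '+' marks conservative substitutions; positives include identities.
    positives = identities + midline.count("+")
    return identities, positives, len(midline)

# Midline of the last block above, with the 8-character "Query:  " indent removed.
midline = " QN+Q+ PNHVS +AVSI"
identities, positives, length = midline_stats(midline)
print(f"{identities}/{length} identical, {positives}/{length} positive")
```

For this 18-residue segment the tally is 12 identical and 15 positive positions. The % identity reported on the hit line covers the whole alignment, not a single segment.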

XP_023544083.1 uncharacterized protein At4g37920 [Cucurbita pepo subsp. pepo] | e value: 1.6e-105 | identity: 93.58%
Query:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM
        VARLAARCLSAVSAYDRTLE+VETLDSAQVKFDDILNSPSL+VACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMY LYKATKS LRSM
Subjt:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM

Query:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE
        APKEIKLLKHLLNI+DPEERFSALATAFAPGDGSEAKDPNA+YTTPKELHKWIKIMLDSYHLNQEDTDIREAR M QP+VIQRLFILKDTIETEYLEQNE
Subjt:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE

Query:  FQNSQTNPNHVSEDAVSI
         QN+QT PNHVS +AVSI
Subjt:  FQNSQTNPNHVSEDAVSI

XP_038883874.1 uncharacterized protein At4g37920 isoform X1 [Benincasa hispida] | e value: 2.1e-105 | identity: 93.12%
Query:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM
        VARLAARCL+AVSAYDRTLENVETLDSAQ KFDDIL SPSL+VACEKIASLAKAKELDSSLILLINSAWA+AKESTTMKNEVKEIMY LYKATKS+LRSM
Subjt:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM

Query:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE
        APKEIKLLKHLLNI+DPEERFSALATAFAPGDGSE KDP A+YTTPKELHKWIKIMLDSYHLNQEDTDIREAR+MTQPVVIQRLFILKDTIETEYLEQNE
Subjt:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE

Query:  FQNSQTNPNHVSEDAVSI
        FQN Q+ PNHVSEDAVSI
Subjt:  FQNSQTNPNHVSEDAVSI

XP_038883875.1 uncharacterized protein At4g37920 isoform X2 [Benincasa hispida] | e value: 2.1e-105 | identity: 93.12%
Query:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM
        VARLAARCL+AVSAYDRTLENVETLDSAQ KFDDIL SPSL+VACEKIASLAKAKELDSSLILLINSAWA+AKESTTMKNEVKEIMY LYKATKS+LRSM
Subjt:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM

Query:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE
        APKEIKLLKHLLNI+DPEERFSALATAFAPGDGSE KDP A+YTTPKELHKWIKIMLDSYHLNQEDTDIREAR+MTQPVVIQRLFILKDTIETEYLEQNE
Subjt:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE

Query:  FQNSQTNPNHVSEDAVSI
        FQN Q+ PNHVSEDAVSI
Subjt:  FQNSQTNPNHVSEDAVSI

XP_038883876.1 uncharacterized protein At4g37920 isoform X3 [Benincasa hispida] | e value: 2.1e-105 | identity: 93.12%
Query:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM
        VARLAARCL+AVSAYDRTLENVETLDSAQ KFDDIL SPSL+VACEKIASLAKAKELDSSLILLINSAWA+AKESTTMKNEVKEIMY LYKATKS+LRSM
Subjt:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM

Query:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE
        APKEIKLLKHLLNI+DPEERFSALATAFAPGDGSE KDP A+YTTPKELHKWIKIMLDSYHLNQEDTDIREAR+MTQPVVIQRLFILKDTIETEYLEQNE
Subjt:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE

Query:  FQNSQTNPNHVSEDAVSI
        FQN Q+ PNHVSEDAVSI
Subjt:  FQNSQTNPNHVSEDAVSI

TrEMBL top hits | e value | % identity | alignment
A0A0A0L3X1 Uncharacterized protein | e value: 4.7e-103 | identity: 90.45%
Query:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM
        VARLAARCL+AVSAY+RTLENVETLDSAQVKFD+ILNSPSL+VACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMY LYKATKS+LRSM
Subjt:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM

Query:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE
        APKEIKLLKHLLNI+DPEERFSALAT F+PGDGSE KDPNA+YTTPKELHKWIKIMLDSYHLNQEDTDIREAR+MTQP+VIQRLFILKDTIETEYLEQN+
Subjt:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE

Query:  FQNSQTNP--NHVSEDAVSI
        FQN Q+ P  NH SEDA+SI
Subjt:  FQNSQTNP--NHVSEDAVSI

A0A1S3B4W5 uncharacterized protein At4g37920, chloroplastic isoform X1 | e value: 1.2e-103 | identity: 90.91%
Query:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM
        VARLAARCL+AVSAYDRTLENVETLDSAQ KFD+ILNSPSL+VACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMY LYKATKS+LRSM
Subjt:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM

Query:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE
        APKEIKLLKHLLNI+DPEERFSALATAF+PGDGSE KDPNA+YTTPKELHKWIKIMLDSYHLNQEDTDIREAR+MTQP+VIQRLFILKDTIETEYLEQN+
Subjt:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE

Query:  FQNSQTNP--NHVSEDAVSI
        FQN Q+ P  NH SEDA+SI
Subjt:  FQNSQTNP--NHVSEDAVSI

A0A6J1DBT6 uncharacterized protein At4g37920, chloroplastic isoform X1 | e value: 9.5e-96 | identity: 92.04%
Query:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM
        VARLAARCLSAVSAYDRTLE V+TLD AQ KFDDILNSPSL+VACEKI SLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMY+LY+ATKS+LRSM
Subjt:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM

Query:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE
        APKEIKLLKHLLNI+DPEERFSALATAFAPGDGSEA+DPNAMYTTPKELHKWIKIMLDSYHLNQEDT++REAR+M QPVVIQRLFILKDTIETEYLEQ E
Subjt:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE

Query:  F
        F
Subjt:  F

A0A6J1F3Z5 uncharacterized protein At4g37920 | e value: 1.5e-104 | identity: 92.66%
Query:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM
        VARLAARCLSAVSAYDRTLE+VETLDSAQVKFDDILNSPSL+VACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMY LYKATKS LRSM
Subjt:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM

Query:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE
        APKEIKLLKHLLNI+DPEERFSALATAFAPGDGSE KDPNA+YTTPKELHKWIKIMLDSYHLNQEDTDIREAR M QP+VIQRLFILKDTIETEYLEQNE
Subjt:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE

Query:  FQNSQTNPNHVSEDAVSI
         QN+Q+ PNHVS +AVSI
Subjt:  FQNSQTNPNHVSEDAVSI

A0A6J1HRT8 uncharacterized protein At4g37920 | e value: 2.5e-104 | identity: 92.2%
Query:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM
        VARLAARCLSAVSAYDRTLE+VETLDSAQVKFDDILNSP+L+VACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMY+LYKATKS LRSM
Subjt:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM

Query:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE
        APKEIKLLKHLLNI+DPEERFSALATAFAPGDGSE KDPNA+YTTPKELHKWIKIMLDSYHLNQEDTDIREAR M QP+VIQRLFILKDTIETEYLEQNE
Subjt:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE

Query:  FQNSQTNPNHVSEDAVSI
         QN Q+ PNHVS +AVSI
Subjt:  FQNSQTNPNHVSEDAVSI

SwissProt top hits | e value | % identity | alignment
Q84WN0 Uncharacterized protein At4g37920 | e value: 2.1e-84 | identity: 73.78%
Query:  EANDGDVVVTTVARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQL
        E  DG      VARLA RCLSAVSAYD TLE+VETLD+AQ KF+DILNSPS++ ACEKI SLAKAKELDSSLILLINSA+A+AKES T+ NE K+IMY L
Subjt:  EANDGDVVVTTVARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQL

Query:  YKATKSNLRSMAPKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKD
        YKATKS+LRS+ PKEIKLLK+LLNI DPEERFSALATAF+PGD  EAKDP A+YTTPKELHKWIKIMLD+YHLN+E+TDI+EA+ M+QP+VIQRLFILKD
Subjt:  YKATKSNLRSMAPKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKD

Query:  TIETEYLEQNEFQNSQTNPNHVSED
        TIE EYL++      +T P    ED
Subjt:  TIETEYLEQNEFQNSQTNPNHVSED

Arabidopsis top hits | e value | % identity | alignment
AT1G36320.1 unknown protein | e value: 6.9e-46 | identity: 44.39%
Query:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM
        +A L    ++AV AYD + E+++ L++A++K  DI+NSPSL+ AC KI SLA+  +LDS+L+L+I  AW++AKES  MK EVK+I+Y LY   + NL+ +
Subjt:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM

Query:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYL
         PKE+++LK+LL+I DP+E+ SAL  AF PGD  E  D + +YTTP+ L   +K +L++YH ++E + ++EA+ +  P +I ++  LK  +E +Y+
Subjt:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYL

AT4G37920.1 unknown protein | e value: 1.5e-85 | identity: 73.78%
Query:  EANDGDVVVTTVARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQL
        E  DG      VARLA RCLSAVSAYD TLE+VETLD+AQ KF+DILNSPS++ ACEKI SLAKAKELDSSLILLINSA+A+AKES T+ NE K+IMY L
Subjt:  EANDGDVVVTTVARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQL

Query:  YKATKSNLRSMAPKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKD
        YKATKS+LRS+ PKEIKLLK+LLNI DPEERFSALATAF+PGD  EAKDP A+YTTPKELHKWIKIMLD+YHLN+E+TDI+EA+ M+QP+VIQRLFILKD
Subjt:  YKATKSNLRSMAPKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKD

Query:  TIETEYLEQNEFQNSQTNPNHVSED
        TIE EYL++      +T P    ED
Subjt:  TIETEYLEQNEFQNSQTNPNHVSED


Sequences
CDS sequence
ATGCAAAATTGCGTGTCCGACACGTGTTCGGAGCGTGTCCGGACGAATCCGTGTCCCACACCAGAATGGAAAATCAAGAGGTTCATTAAAGGCCTCCATGATGAGATCTG
CAGTTCCGTAGTCCTAAAAGGGTCCACGACTTTTGCCGATGCGCTCAAAGACGCATTAATCATGGATAAGAATGGGGCGAAGAAGTGCCTGACTAAGAGTGAAACAAATG
CAGAGAAGCCAGCCTCAAAAGTCCTACCAGTGCAAGCTCAAGGTGGAAATCAGAGGGCACGCGTCTTTGCCCTAACTAAAGAAGAAGCAAATGATGGGGATGTCGTGGTT
ACAACGGTGGCTCGGCTGGCGGCTAGATGTCTGTCTGCAGTTAGTGCTTATGATAGAACATTAGAAAATGTGGAGACATTGGATTCTGCACAGGTCAAATTTGATGATAT
ACTGAATTCTCCCTCGTTGGAAGTGGCTTGTGAAAAGATTGCAAGTCTTGCAAAGGCAAAGGAACTTGATTCATCATTGATCCTTTTGATAAACAGTGCTTGGGCTTCTG
CAAAAGAATCCACAACCATGAAGAACGAGGTGAAAGAGATAATGTATCAATTATACAAAGCCACAAAAAGCAATCTTAGAAGCATGGCCCCTAAAGAAATAAAGCTGTTA
AAGCATTTGCTAAACATAATAGATCCTGAGGAACGATTTTCTGCTTTAGCAACGGCCTTCGCCCCAGGTGATGGAAGTGAAGCCAAAGATCCGAATGCTATGTACACAAC
TCCAAAAGAGCTGCATAAGTGGATAAAGATCATGCTTGATTCATACCATCTGAACCAGGAAGATACAGACATCAGAGAAGCAAGGCATATGACTCAGCCTGTTGTTATAC
AAAGGCTATTCATCCTCAAGGATACTATTGAAACTGAGTATTTGGAACAAAATGAGTTTCAGAATTCTCAAACAAATCCAAATCATGTCTCTGAAGATGCAGTTTCCATA
TAG
mRNA sequence
ATGCAAAATTGCGTGTCCGACACGTGTTCGGAGCGTGTCCGGACGAATCCGTGTCCCACACCAGAATGGAAAATCAAGAGGTTCATTAAAGGCCTCCATGATGAGATCTG
CAGTTCCGTAGTCCTAAAAGGGTCCACGACTTTTGCCGATGCGCTCAAAGACGCATTAATCATGGATAAGAATGGGGCGAAGAAGTGCCTGACTAAGAGTGAAACAAATG
CAGAGAAGCCAGCCTCAAAAGTCCTACCAGTGCAAGCTCAAGGTGGAAATCAGAGGGCACGCGTCTTTGCCCTAACTAAAGAAGAAGCAAATGATGGGGATGTCGTGGTT
ACAACGGTGGCTCGGCTGGCGGCTAGATGTCTGTCTGCAGTTAGTGCTTATGATAGAACATTAGAAAATGTGGAGACATTGGATTCTGCACAGGTCAAATTTGATGATAT
ACTGAATTCTCCCTCGTTGGAAGTGGCTTGTGAAAAGATTGCAAGTCTTGCAAAGGCAAAGGAACTTGATTCATCATTGATCCTTTTGATAAACAGTGCTTGGGCTTCTG
CAAAAGAATCCACAACCATGAAGAACGAGGTGAAAGAGATAATGTATCAATTATACAAAGCCACAAAAAGCAATCTTAGAAGCATGGCCCCTAAAGAAATAAAGCTGTTA
AAGCATTTGCTAAACATAATAGATCCTGAGGAACGATTTTCTGCTTTAGCAACGGCCTTCGCCCCAGGTGATGGAAGTGAAGCCAAAGATCCGAATGCTATGTACACAAC
TCCAAAAGAGCTGCATAAGTGGATAAAGATCATGCTTGATTCATACCATCTGAACCAGGAAGATACAGACATCAGAGAAGCAAGGCATATGACTCAGCCTGTTGTTATAC
AAAGGCTATTCATCCTCAAGGATACTATTGAAACTGAGTATTTGGAACAAAATGAGTTTCAGAATTCTCAAACAAATCCAAATCATGTCTCTGAAGATGCAGTTTCCATA
TAG
Protein sequence
MQNCVSDTCSERVRTNPCPTPEWKIKRFIKGLHDEICSSVVLKGSTTFADALKDALIMDKNGAKKCLTKSETNAEKPASKVLPVQAQGGNQRARVFALTKEEANDGDVVV
TTVARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSMAPKEIKLL
KHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNEFQNSQTNPNHVSEDAVSI
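
The protein sequence should correspond to a translation of the CDS. The sketch below shows one way to check that, assuming the standard nuclear genetic code (NCBI translation table 1); the page does not say which table was used, and `translate` is a hypothetical helper rather than anything provided by CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and normalize case
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons with characters other than A, C, G, T raise KeyError.
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons of the CDS listed above.
print(translate("ATGCAAAATTGCGTGTCCGACACG"))  # prints MQNCVSDT
```

The printed prefix matches the start of the protein sequence above. Passing the full CDS, with its line breaks, would translate the whole record the same way.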