; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0010745 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0010745
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUnknown protein
Genome locationchr1:5120484..5125459
RNA-Seq ExpressionLag0010745
SyntenyLag0010745
Gene Ontology termsNA
InterPro domainsIPR040320 - Uncharacterized protein At4g37920-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6603165.1 hypothetical protein SDJN03_03774, partial [Cucurbita argyrosperma subsp. sororia]2.5e-10490.18Show/hide
Query:  LTNFRTVARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATK
        L +   VARLAARCLSAVSAYDRTLE+VETLDSAQVKFDDILNSPSL+VACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMY LYKATK
Subjt:  LTNFRTVARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATK

Query:  SNLRSMAPKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETE
        S LRSMAPKEIKLLKHLLNI+DPEERFSALATAFAPGDGSE KDPNA+YTTPKELHKWIKIMLDSYHLNQEDTDIREAR M QP+VIQRLFILKDTIETE
Subjt:  SNLRSMAPKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETE

Query:  YLEQNEFQNSQTNPNHVSEDAISI
        YLEQNE QN+Q+ PNHVS +A+SI
Subjt:  YLEQNEFQNSQTNPNHVSEDAISI

XP_023544083.1 uncharacterized protein At4g37920 [Cucurbita pepo subsp. pepo]1.3e-10591.07Show/hide
Query:  LTNFRTVARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATK
        L +   VARLAARCLSAVSAYDRTLE+VETLDSAQVKFDDILNSPSL+VACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMY LYKATK
Subjt:  LTNFRTVARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATK

Query:  SNLRSMAPKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETE
        S LRSMAPKEIKLLKHLLNI+DPEERFSALATAFAPGDGSEAKDPNA+YTTPKELHKWIKIMLDSYHLNQEDTDIREAR M QP+VIQRLFILKDTIETE
Subjt:  SNLRSMAPKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETE

Query:  YLEQNEFQNSQTNPNHVSEDAISI
        YLEQNE QN+QT PNHVS +A+SI
Subjt:  YLEQNEFQNSQTNPNHVSEDAISI

XP_038883874.1 uncharacterized protein At4g37920 isoform X1 [Benincasa hispida]2.3e-10592.66Show/hide
Query:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM
        VARLAARCL+AVSAYDRTLENVETLDSAQ KFDDIL SPSL+VACEKIASLAKAKELDSSLILLINSAWA+AKESTTMKNEVKEIMY LYKATKS+LRSM
Subjt:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM

Query:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE
        APKEIKLLKHLLNI+DPEERFSALATAFAPGDGSE KDP A+YTTPKELHKWIKIMLDSYHLNQEDTDIREAR+MTQPVVIQRLFILKDTIETEYLEQNE
Subjt:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE

Query:  FQNSQTNPNHVSEDAISI
        FQN Q+ PNHVSEDA+SI
Subjt:  FQNSQTNPNHVSEDAISI

XP_038883875.1 uncharacterized protein At4g37920 isoform X2 [Benincasa hispida]2.3e-10592.66Show/hide
Query:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM
        VARLAARCL+AVSAYDRTLENVETLDSAQ KFDDIL SPSL+VACEKIASLAKAKELDSSLILLINSAWA+AKESTTMKNEVKEIMY LYKATKS+LRSM
Subjt:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM

Query:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE
        APKEIKLLKHLLNI+DPEERFSALATAFAPGDGSE KDP A+YTTPKELHKWIKIMLDSYHLNQEDTDIREAR+MTQPVVIQRLFILKDTIETEYLEQNE
Subjt:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE

Query:  FQNSQTNPNHVSEDAISI
        FQN Q+ PNHVSEDA+SI
Subjt:  FQNSQTNPNHVSEDAISI

XP_038883876.1 uncharacterized protein At4g37920 isoform X3 [Benincasa hispida]2.3e-10592.66Show/hide
Query:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM
        VARLAARCL+AVSAYDRTLENVETLDSAQ KFDDIL SPSL+VACEKIASLAKAKELDSSLILLINSAWA+AKESTTMKNEVKEIMY LYKATKS+LRSM
Subjt:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM

Query:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE
        APKEIKLLKHLLNI+DPEERFSALATAFAPGDGSE KDP A+YTTPKELHKWIKIMLDSYHLNQEDTDIREAR+MTQPVVIQRLFILKDTIETEYLEQNE
Subjt:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE

Query:  FQNSQTNPNHVSEDAISI
        FQN Q+ PNHVSEDA+SI
Subjt:  FQNSQTNPNHVSEDAISI

TrEMBL top hitse value%identityAlignment
A0A0A0L3X1 Uncharacterized protein3.0e-10390.91Show/hide
Query:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM
        VARLAARCL+AVSAY+RTLENVETLDSAQVKFD+ILNSPSL+VACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMY LYKATKS+LRSM
Subjt:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM

Query:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE
        APKEIKLLKHLLNI+DPEERFSALAT F+PGDGSE KDPNA+YTTPKELHKWIKIMLDSYHLNQEDTDIREAR+MTQP+VIQRLFILKDTIETEYLEQN+
Subjt:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE

Query:  FQNSQTNP--NHVSEDAISI
        FQN Q+ P  NH SEDAISI
Subjt:  FQNSQTNP--NHVSEDAISI

A0A1S3B4W5 uncharacterized protein At4g37920, chloroplastic isoform X17.9e-10491.36Show/hide
Query:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM
        VARLAARCL+AVSAYDRTLENVETLDSAQ KFD+ILNSPSL+VACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMY LYKATKS+LRSM
Subjt:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM

Query:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE
        APKEIKLLKHLLNI+DPEERFSALATAF+PGDGSE KDPNA+YTTPKELHKWIKIMLDSYHLNQEDTDIREAR+MTQP+VIQRLFILKDTIETEYLEQN+
Subjt:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE

Query:  FQNSQTNP--NHVSEDAISI
        FQN Q+ P  NH SEDAISI
Subjt:  FQNSQTNP--NHVSEDAISI

A0A6J1DBT6 uncharacterized protein At4g37920, chloroplastic isoform X11.0e-9592.04Show/hide
Query:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM
        VARLAARCLSAVSAYDRTLE V+TLD AQ KFDDILNSPSL+VACEKI SLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMY+LY+ATKS+LRSM
Subjt:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM

Query:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE
        APKEIKLLKHLLNI+DPEERFSALATAFAPGDGSEA+DPNAMYTTPKELHKWIKIMLDSYHLNQEDT++REAR+M QPVVIQRLFILKDTIETEYLEQ E
Subjt:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE

Query:  F
        F
Subjt:  F

A0A6J1F3Z5 uncharacterized protein At4g379201.2e-10490.18Show/hide
Query:  LTNFRTVARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATK
        L +   VARLAARCLSAVSAYDRTLE+VETLDSAQVKFDDILNSPSL+VACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMY LYKATK
Subjt:  LTNFRTVARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATK

Query:  SNLRSMAPKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETE
        S LRSMAPKEIKLLKHLLNI+DPEERFSALATAFAPGDGSE KDPNA+YTTPKELHKWIKIMLDSYHLNQEDTDIREAR M QP+VIQRLFILKDTIETE
Subjt:  SNLRSMAPKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETE

Query:  YLEQNEFQNSQTNPNHVSEDAISI
        YLEQNE QN+Q+ PNHVS +A+SI
Subjt:  YLEQNEFQNSQTNPNHVSEDAISI

A0A6J1HRT8 uncharacterized protein At4g379202.1e-10489.73Show/hide
Query:  LTNFRTVARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATK
        L +   VARLAARCLSAVSAYDRTLE+VETLDSAQVKFDDILNSP+L+VACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMY+LYKATK
Subjt:  LTNFRTVARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATK

Query:  SNLRSMAPKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETE
        S LRSMAPKEIKLLKHLLNI+DPEERFSALATAFAPGDGSE KDPNA+YTTPKELHKWIKIMLDSYHLNQEDTDIREAR M QP+VIQRLFILKDTIETE
Subjt:  SNLRSMAPKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETE

Query:  YLEQNEFQNSQTNPNHVSEDAISI
        YLEQNE QN Q+ PNHVS +A+SI
Subjt:  YLEQNEFQNSQTNPNHVSEDAISI

SwissProt top hitse value%identityAlignment
Q84WN0 Uncharacterized protein At4g379202.3e-8476.17Show/hide
Query:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM
        VARLA RCLSAVSAYD TLE+VETLD+AQ KF+DILNSPS++ ACEKI SLAKAKELDSSLILLINSA+A+AKES T+ NE K+IMY LYKATKS+LRS+
Subjt:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM

Query:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE
         PKEIKLLK+LLNI DPEERFSALATAF+PGD  EAKDP A+YTTPKELHKWIKIMLD+YHLN+E+TDI+EA+ M+QP+VIQRLFILKDTIE EYL++  
Subjt:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE

Query:  FQNSQTNPNHVSED
            +T P    ED
Subjt:  FQNSQTNPNHVSED

Arabidopsis top hitse value%identityAlignment
AT1G36320.1 unknown protein5.7e-4644.39Show/hide
Query:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM
        +A L    ++AV AYD + E+++ L++A++K  DI+NSPSL+ AC KI SLA+  +LDS+L+L+I  AW++AKES  MK EVK+I+Y LY   + NL+ +
Subjt:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM

Query:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYL
         PKE+++LK+LL+I DP+E+ SAL  AF PGD  E  D + +YTTP+ L   +K +L++YH ++E + ++EA+ +  P +I ++  LK  +E +Y+
Subjt:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYL

AT4G37920.1 unknown protein1.6e-8576.17Show/hide
Query:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM
        VARLA RCLSAVSAYD TLE+VETLD+AQ KF+DILNSPS++ ACEKI SLAKAKELDSSLILLINSA+A+AKES T+ NE K+IMY LYKATKS+LRS+
Subjt:  VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSM

Query:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE
         PKEIKLLK+LLNI DPEERFSALATAF+PGD  EAKDP A+YTTPKELHKWIKIMLD+YHLN+E+TDI+EA+ M+QP+VIQRLFILKDTIE EYL++  
Subjt:  APKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNE

Query:  FQNSQTNPNHVSED
            +T P    ED
Subjt:  FQNSQTNPNHVSED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGCGAAGAAGGTACAACCTTGATGGGGAGAAGCTTCGTCGTTGGAAGTCAAGAGGAAACCTCCTCCTATGTTTGCATGGCAGCCATCCAAGGCCCTTCGTTAGTAG
CCACAGAGGTCGGCTCTTGCCCTCCTTCTGTGCACTACATGTAAAAGGCACAACTCTGGACAATGCTGGACAGTCGAAGAAGCCAGCCTCAAAAGTCCTACCAGTGCAAG
CTCAAGGTGGAAATCAGAGGGCACGCGTCTTTGCCCTAACTAAAGAAGAAGCAAATGATGGGGATGTCGTGGTTACAAATACGCTCACTCTAAACTTCGGCTTTTGCTTG
ATTGTTCACCTTATTCGTGCCACTAGGTTAGGCTTAAAATTTGGGGCATTACAATATTTATCTTATCCTAATCTTTTAACCAACTTCAGAACGGTGGCTCGGCTGGCGGC
TAGATGTCTGTCTGCAGTTAGTGCTTATGATAGAACATTAGAAAATGTGGAGACATTGGATTCTGCACAGGTCAAATTTGATGATATACTGAATTCTCCCTCGTTGGAAG
TGGCTTGTGAAAAGATTGCAAGTCTTGCAAAGGCAAAGGAACTTGATTCATCATTGATCCTTTTGATAAACAGTGCTTGGGCTTCTGCAAAAGAATCCACAACCATGAAG
AACGAGGTGAAAGAGATAATGTATCAATTATACAAAGCCACAAAAAGCAATCTTAGAAGCATGGCCCCTAAAGAAATAAAGCTGTTAAAGCATTTGCTAAACATAATAGA
TCCTGAGGAACGATTTTCTGCTTTAGCAACGGCCTTCGCCCCAGGTGATGGAAGTGAAGCCAAAGATCCGAATGCTATGTACACAACTCCAAAAGAGCTGCATAAGTGGA
TAAAGATCATGCTTGATTCATACCATCTGAACCAGGAAGATACAGACATCAGAGAAGCAAGGCATATGACTCAGCCTGTTGTTATACAAAGGCTATTCATCCTCAAGGAT
ACTATTGAAACTGAGTATTTGGAACAAAATGAGTTTCAGAATTCTCAAACAAATCCAAATCATGTCTCTGAAGATGCAATTTCCATATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGCGAAGAAGGTACAACCTTGATGGGGAGAAGCTTCGTCGTTGGAAGTCAAGAGGAAACCTCCTCCTATGTTTGCATGGCAGCCATCCAAGGCCCTTCGTTAGTAG
CCACAGAGGTCGGCTCTTGCCCTCCTTCTGTGCACTACATGTAAAAGGCACAACTCTGGACAATGCTGGACAGTCGAAGAAGCCAGCCTCAAAAGTCCTACCAGTGCAAG
CTCAAGGTGGAAATCAGAGGGCACGCGTCTTTGCCCTAACTAAAGAAGAAGCAAATGATGGGGATGTCGTGGTTACAAATACGCTCACTCTAAACTTCGGCTTTTGCTTG
ATTGTTCACCTTATTCGTGCCACTAGGTTAGGCTTAAAATTTGGGGCATTACAATATTTATCTTATCCTAATCTTTTAACCAACTTCAGAACGGTGGCTCGGCTGGCGGC
TAGATGTCTGTCTGCAGTTAGTGCTTATGATAGAACATTAGAAAATGTGGAGACATTGGATTCTGCACAGGTCAAATTTGATGATATACTGAATTCTCCCTCGTTGGAAG
TGGCTTGTGAAAAGATTGCAAGTCTTGCAAAGGCAAAGGAACTTGATTCATCATTGATCCTTTTGATAAACAGTGCTTGGGCTTCTGCAAAAGAATCCACAACCATGAAG
AACGAGGTGAAAGAGATAATGTATCAATTATACAAAGCCACAAAAAGCAATCTTAGAAGCATGGCCCCTAAAGAAATAAAGCTGTTAAAGCATTTGCTAAACATAATAGA
TCCTGAGGAACGATTTTCTGCTTTAGCAACGGCCTTCGCCCCAGGTGATGGAAGTGAAGCCAAAGATCCGAATGCTATGTACACAACTCCAAAAGAGCTGCATAAGTGGA
TAAAGATCATGCTTGATTCATACCATCTGAACCAGGAAGATACAGACATCAGAGAAGCAAGGCATATGACTCAGCCTGTTGTTATACAAAGGCTATTCATCCTCAAGGAT
ACTATTGAAACTGAGTATTTGGAACAAAATGAGTTTCAGAATTCTCAAACAAATCCAAATCATGTCTCTGAAGATGCAATTTCCATATAG
Protein sequenceShow/hide protein sequence
MGRRRYNLDGEKLRRWKSRGNLLLCLHGSHPRPFVSSHRGRLLPSFCALHVKGTTLDNAGQSKKPASKVLPVQAQGGNQRARVFALTKEEANDGDVVVTNTLTLNFGFCL
IVHLIRATRLGLKFGALQYLSYPNLLTNFRTVARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMK
NEVKEIMYQLYKATKSNLRSMAPKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKD
TIETEYLEQNEFQNSQTNPNHVSEDAISI