; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg005797 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg005797
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionDUF3353 domain-containing protein
Genome locationscaffold11:1250716..1257834
RNA-Seq ExpressionSpg005797
SyntenySpg005797
Gene Ontology termsGO:0006631 - fatty acid metabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0031969 - chloroplast membrane (cellular component)
InterPro domainsIPR021788 - Protein CHAPERONE-LIKE PROTEIN OF POR1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031803.1 uncharacterized protein E6C27_scaffold848G00460 [Cucumis melo var. makuwa]1.3e-14392.66Show/hide
Query:  MALAVSNIFHCPKIRLSQRQLHSKFSVLQLQSSSIRFREITRERRVVICSAASAAGSSSPDSDFNPYE-VLGVNPIEGFDMVKAAYTKKRKEAERIGDEA
        MALAVSNIFHCPK+RLSQRQ HSKFSVLQL+SSSIR REI+RERR++ICSAASAAGSS+PDSDFNPYE VLGVNPIEGFDMVKAAYTKKR+EAERIGDEA
Subjt:  MALAVSNIFHCPKIRLSQRQLHSKFSVLQLQSSSIRFREITRERRVVICSAASAAGSSSPDSDFNPYE-VLGVNPIEGFDMVKAAYTKKRKEAERIGDEA

Query:  TAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKA
        TAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSP+DMQINMAISAVFTAWVLIKR+AEYKPLQFLAF FVYRIFEKLKA
Subjt:  TAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKA

Query:  FEPAVSPSFTEDGEDSGRGVRMGKRLLRSLALVFGCIAISSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSAVMLYIMASYYR
        FEPAVSPSFTEDGEDSGRG+RMGKRLLRSLALVFGCIA+ SLAYTG+LNFIEF+G YIPVFLYNNQELL+TSSSA+MLYIMASYYR
Subjt:  FEPAVSPSFTEDGEDSGRGVRMGKRLLRSLALVFGCIAISSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSAVMLYIMASYYR

KAG6607296.1 hypothetical protein SDJN03_00638, partial [Cucurbita argyrosperma subsp. sororia]4.7e-13890.18Show/hide
Query:  MALAVSNIFHCPKIRLSQRQLHSKFSVLQLQSSSIRFREITRERRVVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT
        MALA S IFHCPK+RLSQ Q+HSK  VLQL SSSIRFREIT+ERR+VICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAER GDEA 
Subjt:  MALAVSNIFHCPKIRLSQRQLHSKFSVLQLQSSSIRFREITRERRVVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKG+TFGSVKVSKDIKYADNQPIVPWGPR SKSSP+DMQINMAISAVFTAW L    AEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGVRMGKRLLRSLALVFGCIAISSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSAVMLYIMASYYR
        EPAVSPSFTEDGEDSGRG+RMGKR+LRSLALVFGCIA+SSLAYTGILNFIEF+GSYIPV LYN+QELLITSSSA+MLY MASYYR
Subjt:  EPAVSPSFTEDGEDSGRGVRMGKRLLRSLALVFGCIAISSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSAVMLYIMASYYR

XP_004150624.1 uncharacterized protein LOC101213790 [Cucumis sativus]9.8e-14493.33Show/hide
Query:  MALAVSNIFHCPKIRLSQRQLHSKFSVLQLQSSSIRFREITRERRVVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT
        MALAVSNIFHCPK+RLSQRQ HSKFSVLQL SSSIR REITRERR+VICSAASAAGSS+PDSD NPYEVLGVNPIEGFDMVKAAYTKKR+EAERIGDEAT
Subjt:  MALAVSNIFHCPKIRLSQRQLHSKFSVLQLQSSSIRFREITRERRVVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSP+DMQINMAISAVFTAWVLIK SAEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGVRMGKRLLRSLALVFGCIAISSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSAVMLYIMASYYR
        EPAVSPSFTEDGEDSGRG+RMGKRLLRSLALVFGCIA+ SL YTGILNFIEF+G+YIP FLYNNQELL+TSSSA+MLYIMASYYR
Subjt:  EPAVSPSFTEDGEDSGRGVRMGKRLLRSLALVFGCIAISSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSAVMLYIMASYYR

XP_022150965.1 uncharacterized protein LOC111018989 [Momordica charantia]1.1e-14292.98Show/hide
Query:  MALAVSNIFHCPKIRLSQRQLHSKFSVLQLQSSSIRFREITRERRVVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT
        MALAVSNIFHCPKIR+SQRQ HSK SV  LQSSSIRFREITRERR VI +AASAAGSSSP+SDFNPYEV+ VNPIEGFDM+KAAYTKKRKEAER+GDEAT
Subjt:  MALAVSNIFHCPKIRLSQRQLHSKFSVLQLQSSSIRFREITRERRVVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGVRMGKRLLRSLALVFGCIAISSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSAVMLYIMASYYR
        EP VSPSFTEDGEDSGRG+RMGKRLLRSLALVFGCIA+SSLAYTGILNFIEF+GSYIP FLYNNQELLITS+SA+MLYIMASYYR
Subjt:  EPAVSPSFTEDGEDSGRGVRMGKRLLRSLALVFGCIAISSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSAVMLYIMASYYR

XP_038893848.1 uncharacterized protein LOC120082659 [Benincasa hispida]6.2e-14695.44Show/hide
Query:  MALAVSNIFHCPKIRLSQRQLHSKFSVLQLQSSSIRFREITRERRVVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT
        MALAVSNIF+CPK+RLS+RQLH KFSVLQLQSSSIR REITRERR VICSAASAAGSSS DSDFNPYEVLGVNPIEGFDMVKAAYTKKR+EAERIGDEAT
Subjt:  MALAVSNIFHCPKIRLSQRQLHSKFSVLQLQSSSIRFREITRERRVVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSK SPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGVRMGKRLLRSLALVFGCIAISSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSAVMLYIMASYYR
        EPAVSPSFTEDGEDSGRG+RMGKRLLRSLALVFGCIA+SSLAYTGILNFIEF+G YIPVFLYNNQELLITSSSAVMLYIMASYYR
Subjt:  EPAVSPSFTEDGEDSGRGVRMGKRLLRSLALVFGCIAISSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSAVMLYIMASYYR

TrEMBL top hitse value%identityAlignment
A0A0A0M0Q8 Uncharacterized protein4.8e-14493.33Show/hide
Query:  MALAVSNIFHCPKIRLSQRQLHSKFSVLQLQSSSIRFREITRERRVVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT
        MALAVSNIFHCPK+RLSQRQ HSKFSVLQL SSSIR REITRERR+VICSAASAAGSS+PDSD NPYEVLGVNPIEGFDMVKAAYTKKR+EAERIGDEAT
Subjt:  MALAVSNIFHCPKIRLSQRQLHSKFSVLQLQSSSIRFREITRERRVVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSP+DMQINMAISAVFTAWVLIK SAEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGVRMGKRLLRSLALVFGCIAISSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSAVMLYIMASYYR
        EPAVSPSFTEDGEDSGRG+RMGKRLLRSLALVFGCIA+ SL YTGILNFIEF+G+YIP FLYNNQELL+TSSSA+MLYIMASYYR
Subjt:  EPAVSPSFTEDGEDSGRGVRMGKRLLRSLALVFGCIAISSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSAVMLYIMASYYR

A0A5A7SNI3 Uncharacterized protein6.2e-14492.66Show/hide
Query:  MALAVSNIFHCPKIRLSQRQLHSKFSVLQLQSSSIRFREITRERRVVICSAASAAGSSSPDSDFNPYE-VLGVNPIEGFDMVKAAYTKKRKEAERIGDEA
        MALAVSNIFHCPK+RLSQRQ HSKFSVLQL+SSSIR REI+RERR++ICSAASAAGSS+PDSDFNPYE VLGVNPIEGFDMVKAAYTKKR+EAERIGDEA
Subjt:  MALAVSNIFHCPKIRLSQRQLHSKFSVLQLQSSSIRFREITRERRVVICSAASAAGSSSPDSDFNPYE-VLGVNPIEGFDMVKAAYTKKRKEAERIGDEA

Query:  TAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKA
        TAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSP+DMQINMAISAVFTAWVLIKR+AEYKPLQFLAF FVYRIFEKLKA
Subjt:  TAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKA

Query:  FEPAVSPSFTEDGEDSGRGVRMGKRLLRSLALVFGCIAISSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSAVMLYIMASYYR
        FEPAVSPSFTEDGEDSGRG+RMGKRLLRSLALVFGCIA+ SLAYTG+LNFIEF+G YIPVFLYNNQELL+TSSSA+MLYIMASYYR
Subjt:  FEPAVSPSFTEDGEDSGRGVRMGKRLLRSLALVFGCIAISSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSAVMLYIMASYYR

A0A6J1D9X7 uncharacterized protein LOC1110189895.3e-14392.98Show/hide
Query:  MALAVSNIFHCPKIRLSQRQLHSKFSVLQLQSSSIRFREITRERRVVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT
        MALAVSNIFHCPKIR+SQRQ HSK SV  LQSSSIRFREITRERR VI +AASAAGSSSP+SDFNPYEV+ VNPIEGFDM+KAAYTKKRKEAER+GDEAT
Subjt:  MALAVSNIFHCPKIRLSQRQLHSKFSVLQLQSSSIRFREITRERRVVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGVRMGKRLLRSLALVFGCIAISSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSAVMLYIMASYYR
        EP VSPSFTEDGEDSGRG+RMGKRLLRSLALVFGCIA+SSLAYTGILNFIEF+GSYIP FLYNNQELLITS+SA+MLYIMASYYR
Subjt:  EPAVSPSFTEDGEDSGRGVRMGKRLLRSLALVFGCIAISSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSAVMLYIMASYYR

A0A6J1GBN1 uncharacterized protein LOC1114526983.9e-13890.18Show/hide
Query:  MALAVSNIFHCPKIRLSQRQLHSKFSVLQLQSSSIRFREITRERRVVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT
        MALA S IFHCPK+RLSQ Q HSK  VLQL SSSIRFREIT+ERR+VICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAER GDEA 
Subjt:  MALAVSNIFHCPKIRLSQRQLHSKFSVLQLQSSSIRFREITRERRVVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKG+TFGSVKVSKDIKYADNQPIVPWGPR SKSSP+DMQINMAISAVFTAW L    AEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGVRMGKRLLRSLALVFGCIAISSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSAVMLYIMASYYR
        EPAVSPSFTEDGEDSGRG+RMGKR+LRSLALVFGCIA+SSLAYTGILNFIEF+GSYIPV LYN+QELLITSSSA+MLY MASYYR
Subjt:  EPAVSPSFTEDGEDSGRGVRMGKRLLRSLALVFGCIAISSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSAVMLYIMASYYR

A0A6J1KH23 uncharacterized protein LOC1114931475.1e-13890.53Show/hide
Query:  MALAVSNIFHCPKIRLSQRQLHSKFSVLQLQSSSIRFREITRERRVVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT
        MALA S IFHCPK+RLSQ+  HSK  VLQL SSSIR REITRERR+VICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAER GDEA 
Subjt:  MALAVSNIFHCPKIRLSQRQLHSKFSVLQLQSSSIRFREITRERRVVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKG+TFGSVKVSKDIKYADNQPIVPWGPR SKSSP+DMQINMAISAVFTAW LI   AEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGVRMGKRLLRSLALVFGCIAISSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSAVMLYIMASYYR
        EPAVSPSFTEDGEDSGRG+RMGKR+LRSLALVFGCIA+SSLAYTGILNFIEF+GSYIPV LYN+QELLITSSSAVMLY MASYYR
Subjt:  EPAVSPSFTEDGEDSGRGVRMGKRLLRSLALVFGCIAISSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSAVMLYIMASYYR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G08640.1 Chloroplast J-like domain 17.6e-9462.5Show/hide
Query:  MALAVSNI--FHCPKIRLSQRQLHSKF-SVLQLQSSSIRFREITRERRVVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGD
        MA + SN   +H P+I L       K  S ++L    +       + R+VI +A+SAAG+   D+DFNPYEVLGVNPIEGFD +K  Y +K K+A+R GD
Subjt:  MALAVSNI--FHCPKIRLSQRQLHSKF-SVLQLQSSSIRFREITRERRVVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGD

Query:  EATAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKL
        EATAA LEKAYDK+M AQ  NRKKGVTFGS KVSKDIKYAD QPI+PWGPR S+SS  DM IN+AIS VF+AW+ IKR+ EYKPLQF++F FVYRIFEKL
Subjt:  EATAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKL

Query:  KAFEPAVSPSFTEDGEDSGRGVRMGKRLLRSLALVFGCIAISSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSAVMLYIMASYYR
        K+FE   SP + E+GE+SGRG+RMGKRLLRSL+LVFG I ++SLAYTG LN IE++G  IP+ LYNNQEL++T+SSA MLY++AS+YR
Subjt:  KAFEPAVSPSFTEDGEDSGRGVRMGKRLLRSLALVFGCIAISSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSAVMLYIMASYYR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTTAGCAGTCTCCAACATCTTCCACTGCCCTAAAATTCGGCTTTCTCAGAGGCAATTACATTCCAAATTCTCCGTTTTGCAGTTACAGTCGTCGTCGATCAGATT
TAGAGAAATAACACGAGAGAGAAGGGTGGTTATTTGCTCAGCAGCTTCTGCGGCAGGAAGTTCTAGTCCAGACAGTGACTTCAACCCGTATGAGGTTCTAGGTGTAAACC
CAATTGAGGGATTTGACATGGTCAAAGCAGCATATACTAAAAAGCGTAAGGAGGCTGAAAGGATAGGTGATGAAGCAACTGCAGCCAGACTGGAGAAAGCTTATGACAAA
GTCATGATGGCACAATTCACAAATCGGAAGAAGGGTGTCACTTTTGGCTCAGTGAAGGTTTCTAAGGACATCAAGTATGCTGACAACCAGCCAATTGTACCATGGGGGCC
AAGGTCTTCCAAGTCCAGCCCAAAAGATATGCAAATTAATATGGCAATATCTGCTGTATTTACTGCTTGGGTCCTTATCAAACGCAGTGCTGAATATAAACCTCTTCAGT
TCTTGGCATTTGCTTTTGTTTATCGGATTTTTGAAAAGCTGAAAGCTTTTGAACCAGCCGTATCACCCTCATTTACTGAAGACGGTGAAGATTCAGGAAGAGGTGTACGG
ATGGGAAAGCGGTTGCTTCGATCTCTTGCTTTAGTGTTTGGATGTATTGCTATTTCCTCTTTGGCATATACTGGTATTTTGAATTTCATTGAGTTCATCGGCAGCTATAT
TCCAGTTTTTCTCTACAATAACCAGGAACTATTGATCACCAGTTCATCGGCTGTCATGCTTTACATCATGGCATCTTACTACAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTTAGCAGTCTCCAACATCTTCCACTGCCCTAAAATTCGGCTTTCTCAGAGGCAATTACATTCCAAATTCTCCGTTTTGCAGTTACAGTCGTCGTCGATCAGATT
TAGAGAAATAACACGAGAGAGAAGGGTGGTTATTTGCTCAGCAGCTTCTGCGGCAGGAAGTTCTAGTCCAGACAGTGACTTCAACCCGTATGAGGTTCTAGGTGTAAACC
CAATTGAGGGATTTGACATGGTCAAAGCAGCATATACTAAAAAGCGTAAGGAGGCTGAAAGGATAGGTGATGAAGCAACTGCAGCCAGACTGGAGAAAGCTTATGACAAA
GTCATGATGGCACAATTCACAAATCGGAAGAAGGGTGTCACTTTTGGCTCAGTGAAGGTTTCTAAGGACATCAAGTATGCTGACAACCAGCCAATTGTACCATGGGGGCC
AAGGTCTTCCAAGTCCAGCCCAAAAGATATGCAAATTAATATGGCAATATCTGCTGTATTTACTGCTTGGGTCCTTATCAAACGCAGTGCTGAATATAAACCTCTTCAGT
TCTTGGCATTTGCTTTTGTTTATCGGATTTTTGAAAAGCTGAAAGCTTTTGAACCAGCCGTATCACCCTCATTTACTGAAGACGGTGAAGATTCAGGAAGAGGTGTACGG
ATGGGAAAGCGGTTGCTTCGATCTCTTGCTTTAGTGTTTGGATGTATTGCTATTTCCTCTTTGGCATATACTGGTATTTTGAATTTCATTGAGTTCATCGGCAGCTATAT
TCCAGTTTTTCTCTACAATAACCAGGAACTATTGATCACCAGTTCATCGGCTGTCATGCTTTACATCATGGCATCTTACTACAGATGA
Protein sequenceShow/hide protein sequence
MALAVSNIFHCPKIRLSQRQLHSKFSVLQLQSSSIRFREITRERRVVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERIGDEATAARLEKAYDK
VMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAFEPAVSPSFTEDGEDSGRGVR
MGKRLLRSLALVFGCIAISSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSAVMLYIMASYYR