; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G23720 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G23720
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionNifU domain-containing protein
Genome locationChr1:19209639..19216636
RNA-Seq ExpressionCSPI01G23720
SyntenyCSPI01G23720
Gene Ontology termsGO:0006631 - fatty acid metabolic process (biological process)
GO:0016226 - iron-sulfur cluster assembly (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0031969 - chloroplast membrane (cellular component)
GO:0005506 - iron ion binding (molecular function)
GO:0051536 - iron-sulfur cluster binding (molecular function)
InterPro domainsIPR021788 - Protein CHAPERONE-LIKE PROTEIN OF POR1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031803.1 uncharacterized protein E6C27_scaffold848G00460 [Cucumis melo var. makuwa]3.6e-14695.45Show/hide
Query:  MALAVSNIFHCPKLRLSQRQFHSKFSVLQLHSSSIRLREITRERRMVICSAASAAGSSNPDSDSNPYE-VLGVNPIEGFDMVKAAYTKKRREAERIGDEA
        MALAVSNIFHCPKLRLSQRQFHSKFSVLQL SSSIRLREI+RERRM+ICSAASAAGSSNPDSD NPYE VLGVNPIEGFDMVKAAYTKKRREAERIGDEA
Subjt:  MALAVSNIFHCPKLRLSQRQFHSKFSVLQLHSSSIRLREITRERRMVICSAASAAGSSNPDSDSNPYE-VLGVNPIEGFDMVKAAYTKKRREAERIGDEA

Query:  TAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPRDMQINMAISAVFTAWVLIKGSAEYKPLQFLAFAFVYRIFEKLKA
        TAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPRDMQINMAISAVFTAWVLIK +AEYKPLQFLAF FVYRIFEKLKA
Subjt:  TAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPRDMQINMAISAVFTAWVLIKGSAEYKPLQFLAFAFVYRIFEKLKA

Query:  FEPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVISLGYTGILNFIEFLGNYIPEFLYNNQELLVTSSSALMLYIMASYYR
        FEPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVISL YTG+LNFIEF+G YIP FLYNNQELLVTSSSALMLYIMASYYR
Subjt:  FEPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVISLGYTGILNFIEFLGNYIPEFLYNNQELLVTSSSALMLYIMASYYR

XP_004150624.1 uncharacterized protein LOC101213790 [Cucumis sativus]4.7e-154100Show/hide
Query:  MALAVSNIFHCPKLRLSQRQFHSKFSVLQLHSSSIRLREITRERRMVICSAASAAGSSNPDSDSNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT
        MALAVSNIFHCPKLRLSQRQFHSKFSVLQLHSSSIRLREITRERRMVICSAASAAGSSNPDSDSNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT
Subjt:  MALAVSNIFHCPKLRLSQRQFHSKFSVLQLHSSSIRLREITRERRMVICSAASAAGSSNPDSDSNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPRDMQINMAISAVFTAWVLIKGSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPRDMQINMAISAVFTAWVLIKGSAEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPRDMQINMAISAVFTAWVLIKGSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVISLGYTGILNFIEFLGNYIPEFLYNNQELLVTSSSALMLYIMASYYR
        EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVISLGYTGILNFIEFLGNYIPEFLYNNQELLVTSSSALMLYIMASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVISLGYTGILNFIEFLGNYIPEFLYNNQELLVTSSSALMLYIMASYYR

XP_022150965.1 uncharacterized protein LOC111018989 [Momordica charantia]1.6e-13890.18Show/hide
Query:  MALAVSNIFHCPKLRLSQRQFHSKFSVLQLHSSSIRLREITRERRMVICSAASAAGSSNPDSDSNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT
        MALAVSNIFHCPK+R+SQRQFHSK SV  L SSSIR REITRERR VI +AASAAGSS+P+SD NPYEV+ VNPIEGFDM+KAAYTKKR+EAER+GDEAT
Subjt:  MALAVSNIFHCPKLRLSQRQFHSKFSVLQLHSSSIRLREITRERRMVICSAASAAGSSNPDSDSNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPRDMQINMAISAVFTAWVLIKGSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSP+DMQINMAISAVFTAWVLIK SAEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPRDMQINMAISAVFTAWVLIKGSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVISLGYTGILNFIEFLGNYIPEFLYNNQELLVTSSSALMLYIMASYYR
        EP VSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAV SL YTGILNFIEF+G+YIP FLYNNQELL+TS+SALMLYIMASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVISLGYTGILNFIEFLGNYIPEFLYNNQELLVTSSSALMLYIMASYYR

XP_022998548.1 uncharacterized protein LOC111493147 [Cucurbita maxima]2.8e-13890.18Show/hide
Query:  MALAVSNIFHCPKLRLSQRQFHSKFSVLQLHSSSIRLREITRERRMVICSAASAAGSSNPDSDSNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT
        MALA S IFHCPKLRLSQ+ FHSK  VLQLHSSSIR REITRERRMVICSAASAAGSS+PDSD NPYEVLGVNPIEGFDMVKAAYTKKR+EAER GDEA 
Subjt:  MALAVSNIFHCPKLRLSQRQFHSKFSVLQLHSSSIRLREITRERRMVICSAASAAGSSNPDSDSNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPRDMQINMAISAVFTAWVLIKGSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKG+TFGSVKVSKDIKYADNQPIVPWGPR SKSSPRDMQINMAISAVFTAW LI G AEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPRDMQINMAISAVFTAWVLIKGSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVISLGYTGILNFIEFLGNYIPEFLYNNQELLVTSSSALMLYIMASYYR
        EPAVSPSFTEDGEDSGRGIRMGKR+LRSLALVFGCIAV SL YTGILNFIEF+G+YIP  LYN+QELL+TSSSA+MLY MASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVISLGYTGILNFIEFLGNYIPEFLYNNQELLVTSSSALMLYIMASYYR

XP_038893848.1 uncharacterized protein LOC120082659 [Benincasa hispida]2.4e-14293.33Show/hide
Query:  MALAVSNIFHCPKLRLSQRQFHSKFSVLQLHSSSIRLREITRERRMVICSAASAAGSSNPDSDSNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT
        MALAVSNIF+CPKLRLS+RQ H KFSVLQL SSSIRLREITRERR VICSAASAAGSS+ DSD NPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT
Subjt:  MALAVSNIFHCPKLRLSQRQFHSKFSVLQLHSSSIRLREITRERRMVICSAASAAGSSNPDSDSNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPRDMQINMAISAVFTAWVLIKGSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSK SP+DMQINMAISAVFTAWVLIK SAEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPRDMQINMAISAVFTAWVLIKGSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVISLGYTGILNFIEFLGNYIPEFLYNNQELLVTSSSALMLYIMASYYR
        EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAV SL YTGILNFIEF+G YIP FLYNNQELL+TSSSA+MLYIMASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVISLGYTGILNFIEFLGNYIPEFLYNNQELLVTSSSALMLYIMASYYR

TrEMBL top hitse value%identityAlignment
A0A0A0M0Q8 Uncharacterized protein2.3e-154100Show/hide
Query:  MALAVSNIFHCPKLRLSQRQFHSKFSVLQLHSSSIRLREITRERRMVICSAASAAGSSNPDSDSNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT
        MALAVSNIFHCPKLRLSQRQFHSKFSVLQLHSSSIRLREITRERRMVICSAASAAGSSNPDSDSNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT
Subjt:  MALAVSNIFHCPKLRLSQRQFHSKFSVLQLHSSSIRLREITRERRMVICSAASAAGSSNPDSDSNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPRDMQINMAISAVFTAWVLIKGSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPRDMQINMAISAVFTAWVLIKGSAEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPRDMQINMAISAVFTAWVLIKGSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVISLGYTGILNFIEFLGNYIPEFLYNNQELLVTSSSALMLYIMASYYR
        EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVISLGYTGILNFIEFLGNYIPEFLYNNQELLVTSSSALMLYIMASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVISLGYTGILNFIEFLGNYIPEFLYNNQELLVTSSSALMLYIMASYYR

A0A5A7SNI3 Uncharacterized protein1.8e-14695.45Show/hide
Query:  MALAVSNIFHCPKLRLSQRQFHSKFSVLQLHSSSIRLREITRERRMVICSAASAAGSSNPDSDSNPYE-VLGVNPIEGFDMVKAAYTKKRREAERIGDEA
        MALAVSNIFHCPKLRLSQRQFHSKFSVLQL SSSIRLREI+RERRM+ICSAASAAGSSNPDSD NPYE VLGVNPIEGFDMVKAAYTKKRREAERIGDEA
Subjt:  MALAVSNIFHCPKLRLSQRQFHSKFSVLQLHSSSIRLREITRERRMVICSAASAAGSSNPDSDSNPYE-VLGVNPIEGFDMVKAAYTKKRREAERIGDEA

Query:  TAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPRDMQINMAISAVFTAWVLIKGSAEYKPLQFLAFAFVYRIFEKLKA
        TAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPRDMQINMAISAVFTAWVLIK +AEYKPLQFLAF FVYRIFEKLKA
Subjt:  TAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPRDMQINMAISAVFTAWVLIKGSAEYKPLQFLAFAFVYRIFEKLKA

Query:  FEPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVISLGYTGILNFIEFLGNYIPEFLYNNQELLVTSSSALMLYIMASYYR
        FEPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVISL YTG+LNFIEF+G YIP FLYNNQELLVTSSSALMLYIMASYYR
Subjt:  FEPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVISLGYTGILNFIEFLGNYIPEFLYNNQELLVTSSSALMLYIMASYYR

A0A6J1D9X7 uncharacterized protein LOC1110189897.9e-13990.18Show/hide
Query:  MALAVSNIFHCPKLRLSQRQFHSKFSVLQLHSSSIRLREITRERRMVICSAASAAGSSNPDSDSNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT
        MALAVSNIFHCPK+R+SQRQFHSK SV  L SSSIR REITRERR VI +AASAAGSS+P+SD NPYEV+ VNPIEGFDM+KAAYTKKR+EAER+GDEAT
Subjt:  MALAVSNIFHCPKLRLSQRQFHSKFSVLQLHSSSIRLREITRERRMVICSAASAAGSSNPDSDSNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPRDMQINMAISAVFTAWVLIKGSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSP+DMQINMAISAVFTAWVLIK SAEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPRDMQINMAISAVFTAWVLIKGSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVISLGYTGILNFIEFLGNYIPEFLYNNQELLVTSSSALMLYIMASYYR
        EP VSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAV SL YTGILNFIEF+G+YIP FLYNNQELL+TS+SALMLYIMASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVISLGYTGILNFIEFLGNYIPEFLYNNQELLVTSSSALMLYIMASYYR

A0A6J1GBN1 uncharacterized protein LOC1114526983.0e-13889.82Show/hide
Query:  MALAVSNIFHCPKLRLSQRQFHSKFSVLQLHSSSIRLREITRERRMVICSAASAAGSSNPDSDSNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT
        MALA S IFHCPKLRLSQ QFHSK  VLQLHSSSIR REIT+ERRMVICSAASAAGSS+PDSD NPYEVLGVNPIEGFDMVKAAYTKKR+EAER GDEA 
Subjt:  MALAVSNIFHCPKLRLSQRQFHSKFSVLQLHSSSIRLREITRERRMVICSAASAAGSSNPDSDSNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPRDMQINMAISAVFTAWVLIKGSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKG+TFGSVKVSKDIKYADNQPIVPWGPR SKSSPRDMQINMAISAVFTAW L  G AEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPRDMQINMAISAVFTAWVLIKGSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVISLGYTGILNFIEFLGNYIPEFLYNNQELLVTSSSALMLYIMASYYR
        EPAVSPSFTEDGEDSGRGIRMGKR+LRSLALVFGCIAV SL YTGILNFIEF+G+YIP  LYN+QELL+TSSSA+MLY MASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVISLGYTGILNFIEFLGNYIPEFLYNNQELLVTSSSALMLYIMASYYR

A0A6J1KH23 uncharacterized protein LOC1114931471.3e-13890.18Show/hide
Query:  MALAVSNIFHCPKLRLSQRQFHSKFSVLQLHSSSIRLREITRERRMVICSAASAAGSSNPDSDSNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT
        MALA S IFHCPKLRLSQ+ FHSK  VLQLHSSSIR REITRERRMVICSAASAAGSS+PDSD NPYEVLGVNPIEGFDMVKAAYTKKR+EAER GDEA 
Subjt:  MALAVSNIFHCPKLRLSQRQFHSKFSVLQLHSSSIRLREITRERRMVICSAASAAGSSNPDSDSNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPRDMQINMAISAVFTAWVLIKGSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKG+TFGSVKVSKDIKYADNQPIVPWGPR SKSSPRDMQINMAISAVFTAW LI G AEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPRDMQINMAISAVFTAWVLIKGSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVISLGYTGILNFIEFLGNYIPEFLYNNQELLVTSSSALMLYIMASYYR
        EPAVSPSFTEDGEDSGRGIRMGKR+LRSLALVFGCIAV SL YTGILNFIEF+G+YIP  LYN+QELL+TSSSA+MLY MASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVISLGYTGILNFIEFLGNYIPEFLYNNQELLVTSSSALMLYIMASYYR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G08640.1 Chloroplast J-like domain 17.2e-9268.31Show/hide
Query:  ERRMVICSAASAAGSSNPDSDSNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEATAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPI
        + R+VI +A+SAAG+   D+D NPYEVLGVNPIEGFD +K  Y +K ++A+R GDEATAA LEKAYDK+M AQ  NRKKGVTFGS KVSKDIKYAD QPI
Subjt:  ERRMVICSAASAAGSSNPDSDSNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEATAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPI

Query:  VPWGPRSSKSSPRDMQINMAISAVFTAWVLIKGSAEYKPLQFLAFAFVYRIFEKLKAFEPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVISLG
        +PWGPR S+SS  DM IN+AIS VF+AW+ IK + EYKPLQF++F FVYRIFEKLK+FE   SP + E+GE+SGRG+RMGKRLLRSL+LVFG I + SL 
Subjt:  VPWGPRSSKSSPRDMQINMAISAVFTAWVLIKGSAEYKPLQFLAFAFVYRIFEKLKAFEPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVISLG

Query:  YTGILNFIEFLGNYIPEFLYNNQELLVTSSSALMLYIMASYYR
        YTG LN IE++G  IP  LYNNQEL+VT+SSA MLY++AS+YR
Subjt:  YTGILNFIEFLGNYIPEFLYNNQELLVTSSSALMLYIMASYYR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTTAGCAGTCTCCAACATCTTCCACTGCCCTAAACTTCGGCTTTCTCAGAGGCAGTTTCATTCCAAATTCTCCGTCTTGCAGCTACACTCATCGTCAATCAGATT
AAGAGAGATAACGCGAGAGAGAAGGATGGTTATTTGTTCAGCGGCTTCTGCGGCAGGAAGTTCTAATCCAGACAGTGACTCCAACCCGTATGAGGTTCTAGGTGTAAACC
CAATTGAGGGGTTTGACATGGTCAAAGCAGCGTATACTAAAAAACGCAGGGAGGCTGAGCGGATAGGTGATGAAGCAACTGCAGCTAGACTCGAGAAGGCTTATGACAAA
GTCATGATGGCACAATTCACAAATCGGAAGAAGGGTGTCACTTTTGGCTCAGTGAAGGTTTCTAAGGACATCAAGTATGCTGACAACCAACCAATTGTGCCATGGGGGCC
AAGGTCTTCCAAGTCCAGCCCAAGAGATATGCAAATCAACATGGCAATATCTGCTGTATTTACTGCTTGGGTCCTTATCAAAGGCAGTGCTGAATACAAACCTCTACAGT
TCTTGGCATTTGCTTTTGTTTATCGGATTTTTGAAAAGCTGAAAGCTTTTGAACCAGCTGTATCACCTTCATTTACAGAAGATGGTGAAGATTCAGGACGAGGTATACGG
ATGGGAAAGCGGTTGCTTCGTTCTCTTGCGTTAGTGTTTGGATGTATTGCTGTCATCTCTTTGGGATATACTGGTATCTTGAATTTCATTGAGTTCTTGGGCAACTATAT
TCCAGAATTTTTGTACAATAACCAGGAATTATTGGTCACTAGTTCATCGGCTCTCATGCTGTACATCATGGCATCTTACTACAGATGA
mRNA sequenceShow/hide mRNA sequence
CACAGTTTCGAACCCCTTCGTCTCTCTCGCTCCGTCTTCGACACAGAGCTCAACAATGGCGTTAGCAGTCTCCAACATCTTCCACTGCCCTAAACTTCGGCTTTCTCAGA
GGCAGTTTCATTCCAAATTCTCCGTCTTGCAGCTACACTCATCGTCAATCAGATTAAGAGAGATAACGCGAGAGAGAAGGATGGTTATTTGTTCAGCGGCTTCTGCGGCA
GGAAGTTCTAATCCAGACAGTGACTCCAACCCGTATGAGGTTCTAGGTGTAAACCCAATTGAGGGGTTTGACATGGTCAAAGCAGCGTATACTAAAAAACGCAGGGAGGC
TGAGCGGATAGGTGATGAAGCAACTGCAGCTAGACTCGAGAAGGCTTATGACAAAGTCATGATGGCACAATTCACAAATCGGAAGAAGGGTGTCACTTTTGGCTCAGTGA
AGGTTTCTAAGGACATCAAGTATGCTGACAACCAACCAATTGTGCCATGGGGGCCAAGGTCTTCCAAGTCCAGCCCAAGAGATATGCAAATCAACATGGCAATATCTGCT
GTATTTACTGCTTGGGTCCTTATCAAAGGCAGTGCTGAATACAAACCTCTACAGTTCTTGGCATTTGCTTTTGTTTATCGGATTTTTGAAAAGCTGAAAGCTTTTGAACC
AGCTGTATCACCTTCATTTACAGAAGATGGTGAAGATTCAGGACGAGGTATACGGATGGGAAAGCGGTTGCTTCGTTCTCTTGCGTTAGTGTTTGGATGTATTGCTGTCA
TCTCTTTGGGATATACTGGTATCTTGAATTTCATTGAGTTCTTGGGCAACTATATTCCAGAATTTTTGTACAATAACCAGGAATTATTGGTCACTAGTTCATCGGCTCTC
ATGCTGTACATCATGGCATCTTACTACAGATGATGTTAAACCAGCATGTACCATTCAATCTTGCACTCTACTTTTGAAGGAAACGATTTTTGATATATTATTGATTTGGT
AATCTAGTTTTGTAAAATGAGGAAACAATACTGATGTTGTTATTGAGGAAATGTAGATGTAGCTTCAGAAATTTTTCGTGCAATGGGATATATGTTAGCTATCATCTCAT
ATATAAAAGAGGTTGCGATCTCAATTAGCAAATGTGATATTTATTGAGTTGAGGTGGAAAACAAAGAACACAGTTGATGAAGTTGTAGTTATCTCAATTGGTTAATTTTG
GCTTCTTTGATATTTTCATTTAATGCACACA
Protein sequenceShow/hide protein sequence
MALAVSNIFHCPKLRLSQRQFHSKFSVLQLHSSSIRLREITRERRMVICSAASAAGSSNPDSDSNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEATAARLEKAYDK
VMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPRDMQINMAISAVFTAWVLIKGSAEYKPLQFLAFAFVYRIFEKLKAFEPAVSPSFTEDGEDSGRGIR
MGKRLLRSLALVFGCIAVISLGYTGILNFIEFLGNYIPEFLYNNQELLVTSSSALMLYIMASYYR