; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh01G006500 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh01G006500
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionNifU domain-containing protein
Genome locationCma_Chr01:3386687..3393239
RNA-Seq ExpressionCmaCh01G006500
SyntenyCmaCh01G006500
Gene Ontology termsGO:0006631 - fatty acid metabolic process (biological process)
GO:0016226 - iron-sulfur cluster assembly (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0031969 - chloroplast membrane (cellular component)
GO:0005506 - iron ion binding (molecular function)
GO:0051536 - iron-sulfur cluster binding (molecular function)
InterPro domainsIPR021788 - Protein CHAPERONE-LIKE PROTEIN OF POR1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607296.1 hypothetical protein SDJN03_00638, partial [Cucurbita argyrosperma subsp. sororia]6.0e-14997.19Show/hide
Query:  MALAASYIFHCPKLRLSQQMFHSKSPVLQLHSSSIRSREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERNGDEAA
        MALAASYIFHCPKLRLSQ   HSK PVLQLHSSSIR REIT+ERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERNGDEAA
Subjt:  MALAASYIFHCPKLRLSQQMFHSKSPVLQLHSSSIRSREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERNGDEAA

Query:  AARLEKAYDKVMMAQFTNRKKGLTFGSVKVSKDIKYADNQPIVPWGPRFSKSSPRDMQINMAISAVFTAWALISGGAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKGLTFGSVKVSKDIKYADNQPIVPWGPRFSKSSPRDMQINMAISAVFTAWAL SGGAEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGLTFGSVKVSKDIKYADNQPIVPWGPRFSKSSPRDMQINMAISAVFTAWALISGGAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRILRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVVLYNSQELLITSSSAVMLYFMASYYR
        EPAVSPSFTEDGEDSGRGIRMGKRILRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVVLYNSQELLITSSSA+MLYFMASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRILRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVVLYNSQELLITSSSAVMLYFMASYYR

XP_004150624.1 uncharacterized protein LOC101213790 [Cucumis sativus]1.8e-13790.18Show/hide
Query:  MALAASYIFHCPKLRLSQQMFHSKSPVLQLHSSSIRSREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERNGDEAA
        MALA S IFHCPKLRLSQ+ FHSK  VLQLHSSSIR REITRERRMVICSAASAAGSS+PDSD NPYEVLGVNPIEGFDMVKAAYTKKR+EAER GDEA 
Subjt:  MALAASYIFHCPKLRLSQQMFHSKSPVLQLHSSSIRSREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERNGDEAA

Query:  AARLEKAYDKVMMAQFTNRKKGLTFGSVKVSKDIKYADNQPIVPWGPRFSKSSPRDMQINMAISAVFTAWALISGGAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKG+TFGSVKVSKDIKYADNQPIVPWGPR SKSSPRDMQINMAISAVFTAW LI G AEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGLTFGSVKVSKDIKYADNQPIVPWGPRFSKSSPRDMQINMAISAVFTAWALISGGAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRILRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVVLYNSQELLITSSSAVMLYFMASYYR
        EPAVSPSFTEDGEDSGRGIRMGKR+LRSLALVFGCIAV SL YTGILNFIEF+G+YIP  LYN+QELL+TSSSA+MLY MASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRILRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVVLYNSQELLITSSSAVMLYFMASYYR

XP_022949308.1 uncharacterized protein LOC111452698 [Cucurbita moschata]1.2e-14997.54Show/hide
Query:  MALAASYIFHCPKLRLSQQMFHSKSPVLQLHSSSIRSREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERNGDEAA
        MALAASYIFHCPKLRLSQ  FHSK PVLQLHSSSIR REIT+ERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERNGDEAA
Subjt:  MALAASYIFHCPKLRLSQQMFHSKSPVLQLHSSSIRSREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERNGDEAA

Query:  AARLEKAYDKVMMAQFTNRKKGLTFGSVKVSKDIKYADNQPIVPWGPRFSKSSPRDMQINMAISAVFTAWALISGGAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKGLTFGSVKVSKDIKYADNQPIVPWGPRFSKSSPRDMQINMAISAVFTAWAL SGGAEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGLTFGSVKVSKDIKYADNQPIVPWGPRFSKSSPRDMQINMAISAVFTAWALISGGAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRILRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVVLYNSQELLITSSSAVMLYFMASYYR
        EPAVSPSFTEDGEDSGRGIRMGKRILRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVVLYNSQELLITSSSA+MLYFMASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRILRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVVLYNSQELLITSSSAVMLYFMASYYR

XP_022998548.1 uncharacterized protein LOC111493147 [Cucurbita maxima]3.1e-153100Show/hide
Query:  MALAASYIFHCPKLRLSQQMFHSKSPVLQLHSSSIRSREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERNGDEAA
        MALAASYIFHCPKLRLSQQMFHSKSPVLQLHSSSIRSREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERNGDEAA
Subjt:  MALAASYIFHCPKLRLSQQMFHSKSPVLQLHSSSIRSREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERNGDEAA

Query:  AARLEKAYDKVMMAQFTNRKKGLTFGSVKVSKDIKYADNQPIVPWGPRFSKSSPRDMQINMAISAVFTAWALISGGAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKGLTFGSVKVSKDIKYADNQPIVPWGPRFSKSSPRDMQINMAISAVFTAWALISGGAEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGLTFGSVKVSKDIKYADNQPIVPWGPRFSKSSPRDMQINMAISAVFTAWALISGGAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRILRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVVLYNSQELLITSSSAVMLYFMASYYR
        EPAVSPSFTEDGEDSGRGIRMGKRILRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVVLYNSQELLITSSSAVMLYFMASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRILRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVVLYNSQELLITSSSAVMLYFMASYYR

XP_023523316.1 uncharacterized protein LOC111787550 [Cucurbita pepo subsp. pepo]1.6e-14997.54Show/hide
Query:  MALAASYIFHCPKLRLSQQMFHSKSPVLQLHSSSIRSREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERNGDEAA
        MALAASYIFHCPKLRLSQQ FHSK PVLQLHSSSIR REITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERNGDEAA
Subjt:  MALAASYIFHCPKLRLSQQMFHSKSPVLQLHSSSIRSREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERNGDEAA

Query:  AARLEKAYDKVMMAQFTNRKKGLTFGSVKVSKDIKYADNQPIVPWGPRFSKSSPRDMQINMAISAVFTAWALISGGAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKGLTFGSVKVSKDIKYADNQPIVPWGPRFSKSSPRDMQINMAISAVFTAWAL+SGGAEYKPLQFLAFAFVYRIFEKLKA 
Subjt:  AARLEKAYDKVMMAQFTNRKKGLTFGSVKVSKDIKYADNQPIVPWGPRFSKSSPRDMQINMAISAVFTAWALISGGAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRILRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVVLYNSQELLITSSSAVMLYFMASYYR
        EPAVSPSFTEDGEDSGRGIRMGKRILRSLALVFGCIAVSSLAYTGILNFIEF+GSYIPVVLYNSQELLITSSSA+MLYFMASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRILRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVVLYNSQELLITSSSAVMLYFMASYYR

TrEMBL top hitse value%identityAlignment
A0A0A0M0Q8 Uncharacterized protein8.7e-13890.18Show/hide
Query:  MALAASYIFHCPKLRLSQQMFHSKSPVLQLHSSSIRSREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERNGDEAA
        MALA S IFHCPKLRLSQ+ FHSK  VLQLHSSSIR REITRERRMVICSAASAAGSS+PDSD NPYEVLGVNPIEGFDMVKAAYTKKR+EAER GDEA 
Subjt:  MALAASYIFHCPKLRLSQQMFHSKSPVLQLHSSSIRSREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERNGDEAA

Query:  AARLEKAYDKVMMAQFTNRKKGLTFGSVKVSKDIKYADNQPIVPWGPRFSKSSPRDMQINMAISAVFTAWALISGGAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKG+TFGSVKVSKDIKYADNQPIVPWGPR SKSSPRDMQINMAISAVFTAW LI G AEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGLTFGSVKVSKDIKYADNQPIVPWGPRFSKSSPRDMQINMAISAVFTAWALISGGAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRILRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVVLYNSQELLITSSSAVMLYFMASYYR
        EPAVSPSFTEDGEDSGRGIRMGKR+LRSLALVFGCIAV SL YTGILNFIEF+G+YIP  LYN+QELL+TSSSA+MLY MASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRILRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVVLYNSQELLITSSSAVMLYFMASYYR

A0A5A7SNI3 Uncharacterized protein1.1e-13589.16Show/hide
Query:  MALAASYIFHCPKLRLSQQMFHSKSPVLQLHSSSIRSREITRERRMVICSAASAAGSSSPDSDFNPYE-VLGVNPIEGFDMVKAAYTKKRKEAERNGDEA
        MALA S IFHCPKLRLSQ+ FHSK  VLQL SSSIR REI+RERRM+ICSAASAAGSS+PDSDFNPYE VLGVNPIEGFDMVKAAYTKKR+EAER GDEA
Subjt:  MALAASYIFHCPKLRLSQQMFHSKSPVLQLHSSSIRSREITRERRMVICSAASAAGSSSPDSDFNPYE-VLGVNPIEGFDMVKAAYTKKRKEAERNGDEA

Query:  AAARLEKAYDKVMMAQFTNRKKGLTFGSVKVSKDIKYADNQPIVPWGPRFSKSSPRDMQINMAISAVFTAWALISGGAEYKPLQFLAFAFVYRIFEKLKA
         AARLEKAYDKVMMAQFTNRKKG+TFGSVKVSKDIKYADNQPIVPWGPR SKSSPRDMQINMAISAVFTAW LI   AEYKPLQFLAF FVYRIFEKLKA
Subjt:  AAARLEKAYDKVMMAQFTNRKKGLTFGSVKVSKDIKYADNQPIVPWGPRFSKSSPRDMQINMAISAVFTAWALISGGAEYKPLQFLAFAFVYRIFEKLKA

Query:  FEPAVSPSFTEDGEDSGRGIRMGKRILRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVVLYNSQELLITSSSAVMLYFMASYYR
        FEPAVSPSFTEDGEDSGRGIRMGKR+LRSLALVFGCIAV SLAYTG+LNFIEFMG YIPV LYN+QELL+TSSSA+MLY MASYYR
Subjt:  FEPAVSPSFTEDGEDSGRGIRMGKRILRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVVLYNSQELLITSSSAVMLYFMASYYR

A0A6J1D9X7 uncharacterized protein LOC1110189896.5e-13387.37Show/hide
Query:  MALAASYIFHCPKLRLSQQMFHSKSPVLQLHSSSIRSREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERNGDEAA
        MALA S IFHCPK+R+SQ+ FHSKS V  L SSSIR REITRERR VI +AASAAGSSSP+SDFNPYEV+ VNPIEGFDM+KAAYTKKRKEAER GDEA 
Subjt:  MALAASYIFHCPKLRLSQQMFHSKSPVLQLHSSSIRSREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERNGDEAA

Query:  AARLEKAYDKVMMAQFTNRKKGLTFGSVKVSKDIKYADNQPIVPWGPRFSKSSPRDMQINMAISAVFTAWALISGGAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKG+TFGSVKVSKDIKYADNQPIVPWGPR SKSSP+DMQINMAISAVFTAW LI   AEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGLTFGSVKVSKDIKYADNQPIVPWGPRFSKSSPRDMQINMAISAVFTAWALISGGAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRILRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVVLYNSQELLITSSSAVMLYFMASYYR
        EP VSPSFTEDGEDSGRGIRMGKR+LRSLALVFGCIAVSSLAYTGILNFIEF+GSYIP  LYN+QELLITS+SA+MLY MASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRILRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVVLYNSQELLITSSSAVMLYFMASYYR

A0A6J1GBN1 uncharacterized protein LOC1114526985.8e-15097.54Show/hide
Query:  MALAASYIFHCPKLRLSQQMFHSKSPVLQLHSSSIRSREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERNGDEAA
        MALAASYIFHCPKLRLSQ  FHSK PVLQLHSSSIR REIT+ERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERNGDEAA
Subjt:  MALAASYIFHCPKLRLSQQMFHSKSPVLQLHSSSIRSREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERNGDEAA

Query:  AARLEKAYDKVMMAQFTNRKKGLTFGSVKVSKDIKYADNQPIVPWGPRFSKSSPRDMQINMAISAVFTAWALISGGAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKGLTFGSVKVSKDIKYADNQPIVPWGPRFSKSSPRDMQINMAISAVFTAWAL SGGAEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGLTFGSVKVSKDIKYADNQPIVPWGPRFSKSSPRDMQINMAISAVFTAWALISGGAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRILRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVVLYNSQELLITSSSAVMLYFMASYYR
        EPAVSPSFTEDGEDSGRGIRMGKRILRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVVLYNSQELLITSSSA+MLYFMASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRILRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVVLYNSQELLITSSSAVMLYFMASYYR

A0A6J1KH23 uncharacterized protein LOC1114931471.5e-153100Show/hide
Query:  MALAASYIFHCPKLRLSQQMFHSKSPVLQLHSSSIRSREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERNGDEAA
        MALAASYIFHCPKLRLSQQMFHSKSPVLQLHSSSIRSREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERNGDEAA
Subjt:  MALAASYIFHCPKLRLSQQMFHSKSPVLQLHSSSIRSREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERNGDEAA

Query:  AARLEKAYDKVMMAQFTNRKKGLTFGSVKVSKDIKYADNQPIVPWGPRFSKSSPRDMQINMAISAVFTAWALISGGAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKGLTFGSVKVSKDIKYADNQPIVPWGPRFSKSSPRDMQINMAISAVFTAWALISGGAEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGLTFGSVKVSKDIKYADNQPIVPWGPRFSKSSPRDMQINMAISAVFTAWALISGGAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRILRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVVLYNSQELLITSSSAVMLYFMASYYR
        EPAVSPSFTEDGEDSGRGIRMGKRILRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVVLYNSQELLITSSSAVMLYFMASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRILRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVVLYNSQELLITSSSAVMLYFMASYYR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G08640.1 Chloroplast J-like domain 11.4e-9262.23Show/hide
Query:  FHCPKLRLSQQMFHSKSP-VLQLHSSSIRSREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERNGDEAAAARLEKA
        +H P++ L   +   KSP  ++L    +       + R+VI +A+SAAG+   D+DFNPYEVLGVNPIEGFD +K  Y +K K+A+R+GDEA AA LEKA
Subjt:  FHCPKLRLSQQMFHSKSP-VLQLHSSSIRSREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERNGDEAAAARLEKA

Query:  YDKVMMAQFTNRKKGLTFGSVKVSKDIKYADNQPIVPWGPRFSKSSPRDMQINMAISAVFTAWALISGGAEYKPLQFLAFAFVYRIFEKLKAFEPAVSPS
        YDK+M AQ  NRKKG+TFGS KVSKDIKYAD QPI+PWGPRFS+SS  DM IN+AIS VF+AW  I    EYKPLQF++F FVYRIFEKLK+FE   SP 
Subjt:  YDKVMMAQFTNRKKGLTFGSVKVSKDIKYADNQPIVPWGPRFSKSSPRDMQINMAISAVFTAWALISGGAEYKPLQFLAFAFVYRIFEKLKAFEPAVSPS

Query:  FTEDGEDSGRGIRMGKRILRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVVLYNSQELLITSSSAVMLYFMASYYR
        + E+GE+SGRG+RMGKR+LRSL+LVFG I ++SLAYTG LN IE+MG  IP+VLYN+QEL++T+SSA MLY +AS+YR
Subjt:  FTEDGEDSGRGIRMGKRILRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVVLYNSQELLITSSSAVMLYFMASYYR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTTAGCAGCCTCCTACATCTTCCACTGCCCTAAGCTTCGGCTTTCTCAGCAGATGTTTCATTCCAAATCCCCCGTTTTGCAGTTACATTCGTCGTCAATCAGATC
TAGAGAAATAACACGAGAGAGAAGGATGGTTATTTGCTCAGCAGCTTCTGCCGCAGGAAGTTCAAGTCCAGACAGTGACTTCAATCCGTATGAGGTTCTGGGCGTAAACC
CAATTGAGGGATTTGATATGGTCAAAGCAGCATATACTAAAAAACGCAAGGAAGCTGAGAGGAACGGTGATGAAGCAGCTGCAGCCAGACTGGAGAAAGCTTATGACAAA
GTGATGATGGCACAATTCACCAACCGGAAGAAGGGTCTTACTTTTGGCTCAGTGAAGGTTTCTAAGGACATCAAGTATGCTGACAACCAGCCAATTGTCCCATGGGGGCC
AAGGTTTTCAAAATCTAGCCCCAGAGATATGCAAATTAACATGGCAATATCTGCTGTATTTACTGCTTGGGCCCTTATCAGTGGCGGTGCTGAATATAAACCTCTGCAGT
TCTTGGCATTTGCTTTTGTTTATCGGATTTTTGAAAAGCTGAAAGCTTTTGAACCAGCTGTATCACCCTCATTTACTGAAGATGGTGAAGATTCAGGACGAGGTATACGG
ATGGGAAAGCGGATACTACGATCTCTTGCTCTAGTGTTTGGATGTATTGCTGTTTCCTCTTTGGCATATACTGGTATCTTGAATTTCATCGAGTTCATGGGCAGCTATAT
CCCAGTTGTTCTCTACAATAGTCAGGAACTATTGATCACCAGTTCATCAGCTGTCATGCTTTACTTCATGGCATCTTACTACAGATGA
mRNA sequenceShow/hide mRNA sequence
TACACTGTTTGGAACCTCTTCGTCTCTTTCGCTCCGCCTTTGACACAGCTGAGCGATGGCGTTAGCAGCCTCCTACATCTTCCACTGCCCTAAGCTTCGGCTTTCTCAGC
AGATGTTTCATTCCAAATCCCCCGTTTTGCAGTTACATTCGTCGTCAATCAGATCTAGAGAAATAACACGAGAGAGAAGGATGGTTATTTGCTCAGCAGCTTCTGCCGCA
GGAAGTTCAAGTCCAGACAGTGACTTCAATCCGTATGAGGTTCTGGGCGTAAACCCAATTGAGGGATTTGATATGGTCAAAGCAGCATATACTAAAAAACGCAAGGAAGC
TGAGAGGAACGGTGATGAAGCAGCTGCAGCCAGACTGGAGAAAGCTTATGACAAAGTGATGATGGCACAATTCACCAACCGGAAGAAGGGTCTTACTTTTGGCTCAGTGA
AGGTTTCTAAGGACATCAAGTATGCTGACAACCAGCCAATTGTCCCATGGGGGCCAAGGTTTTCAAAATCTAGCCCCAGAGATATGCAAATTAACATGGCAATATCTGCT
GTATTTACTGCTTGGGCCCTTATCAGTGGCGGTGCTGAATATAAACCTCTGCAGTTCTTGGCATTTGCTTTTGTTTATCGGATTTTTGAAAAGCTGAAAGCTTTTGAACC
AGCTGTATCACCCTCATTTACTGAAGATGGTGAAGATTCAGGACGAGGTATACGGATGGGAAAGCGGATACTACGATCTCTTGCTCTAGTGTTTGGATGTATTGCTGTTT
CCTCTTTGGCATATACTGGTATCTTGAATTTCATCGAGTTCATGGGCAGCTATATCCCAGTTGTTCTCTACAATAGTCAGGAACTATTGATCACCAGTTCATCAGCTGTC
ATGCTTTACTTCATGGCATCTTACTACAGATGA
Protein sequenceShow/hide protein sequence
MALAASYIFHCPKLRLSQQMFHSKSPVLQLHSSSIRSREITRERRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRKEAERNGDEAAAARLEKAYDK
VMMAQFTNRKKGLTFGSVKVSKDIKYADNQPIVPWGPRFSKSSPRDMQINMAISAVFTAWALISGGAEYKPLQFLAFAFVYRIFEKLKAFEPAVSPSFTEDGEDSGRGIR
MGKRILRSLALVFGCIAVSSLAYTGILNFIEFMGSYIPVVLYNSQELLITSSSAVMLYFMASYYR