; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi02G013020 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi02G013020
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionNifU domain-containing protein
Genome locationchr02:17396777..17404181
RNA-Seq ExpressionLsi02G013020
SyntenyLsi02G013020
Gene Ontology termsGO:0006631 - fatty acid metabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0031969 - chloroplast membrane (cellular component)
InterPro domainsIPR021788 - Protein CHAPERONE-LIKE PROTEIN OF POR1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031803.1 uncharacterized protein E6C27_scaffold848G00460 [Cucumis melo var. makuwa]1.1e-14293.71Show/hide
Query:  MALAVSNIFYCPKLRLSQRLLHSKFSVLQLQSSSIRLRAITREKRMVICSAASAAGSSSPDSDFNPYE-VLGVNPIEGFDMVKAAYTKKRREAERIGDEA
        MALAVSNIF+CPKLRLSQR  HSKFSVLQL+SSSIRLR I+RE+RM+ICSAASAAGSS+PDSDFNPYE VLGVNPIEGFDMVKAAYTKKRREAERIGDEA
Subjt:  MALAVSNIFYCPKLRLSQRLLHSKFSVLQLQSSSIRLRAITREKRMVICSAASAAGSSSPDSDFNPYE-VLGVNPIEGFDMVKAAYTKKRREAERIGDEA

Query:  TAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKA
        TAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSP+DMQINMAISAVFTAWVLIKR+AEYKPLQFLAF FVYRIFEKLKA
Subjt:  TAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKA

Query:  FEPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSALMLYIMASYYR
        FEPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAV SLAYTG+LNFIEF+G YIPVFLYNNQELL+TSSSALMLYIMASYYR
Subjt:  FEPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSALMLYIMASYYR

XP_004150624.1 uncharacterized protein LOC101213790 [Cucumis sativus]8.3e-14394.39Show/hide
Query:  MALAVSNIFYCPKLRLSQRLLHSKFSVLQLQSSSIRLRAITREKRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT
        MALAVSNIF+CPKLRLSQR  HSKFSVLQL SSSIRLR ITRE+RMVICSAASAAGSS+PDSD NPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT
Subjt:  MALAVSNIFYCPKLRLSQRLLHSKFSVLQLQSSSIRLRAITREKRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSP+DMQINMAISAVFTAWVLIK SAEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSALMLYIMASYYR
        EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAV SL YTGILNFIEF+G+YIP FLYNNQELL+TSSSALMLYIMASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSALMLYIMASYYR

XP_022150965.1 uncharacterized protein LOC111018989 [Momordica charantia]7.3e-13991.58Show/hide
Query:  MALAVSNIFYCPKLRLSQRLLHSKFSVLQLQSSSIRLRAITREKRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT
        MALAVSNIF+CPK+R+SQR  HSK SV  LQSSSIR R ITRE+R VI +AASAAGSSSP+SDFNPYEV+ VNPIEGFDM+KAAYTKKR+EAER+GDEAT
Subjt:  MALAVSNIFYCPKLRLSQRLLHSKFSVLQLQSSSIRLRAITREKRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSALMLYIMASYYR
        EP VSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEF+GSYIP FLYNNQELLITS+SALMLYIMASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSALMLYIMASYYR

XP_022998548.1 uncharacterized protein LOC111493147 [Cucurbita maxima]5.8e-13690.18Show/hide
Query:  MALAVSNIFYCPKLRLSQRLLHSKFSVLQLQSSSIRLRAITREKRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT
        MALA S IF+CPKLRLSQ++ HSK  VLQL SSSIR R ITRE+RMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKR+EAER GDEA 
Subjt:  MALAVSNIFYCPKLRLSQRLLHSKFSVLQLQSSSIRLRAITREKRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKG+TFGSVKVSKDIKYADNQPIVPWGPR SKSSP+DMQINMAISAVFTAW LI   AEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSALMLYIMASYYR
        EPAVSPSFTEDGEDSGRGIRMGKR+LRSLALVFGCIAVSSLAYTGILNFIEF+GSYIPV LYN+QELLITSSSA+MLY MASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSALMLYIMASYYR

XP_038893848.1 uncharacterized protein LOC120082659 [Benincasa hispida]5.2e-14596.14Show/hide
Query:  MALAVSNIFYCPKLRLSQRLLHSKFSVLQLQSSSIRLRAITREKRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT
        MALAVSNIFYCPKLRLS+R LH KFSVLQLQSSSIRLR ITRE+R VICSAASAAGSSS DSDFNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT
Subjt:  MALAVSNIFYCPKLRLSQRLLHSKFSVLQLQSSSIRLRAITREKRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSK SPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSALMLYIMASYYR
        EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEF+G YIPVFLYNNQELLITSSSA+MLYIMASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSALMLYIMASYYR

TrEMBL top hitse value%identityAlignment
A0A0A0M0Q8 Uncharacterized protein4.0e-14394.39Show/hide
Query:  MALAVSNIFYCPKLRLSQRLLHSKFSVLQLQSSSIRLRAITREKRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT
        MALAVSNIF+CPKLRLSQR  HSKFSVLQL SSSIRLR ITRE+RMVICSAASAAGSS+PDSD NPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT
Subjt:  MALAVSNIFYCPKLRLSQRLLHSKFSVLQLQSSSIRLRAITREKRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSP+DMQINMAISAVFTAWVLIK SAEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSALMLYIMASYYR
        EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAV SL YTGILNFIEF+G+YIP FLYNNQELL+TSSSALMLYIMASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSALMLYIMASYYR

A0A5A7SNI3 Uncharacterized protein5.3e-14393.71Show/hide
Query:  MALAVSNIFYCPKLRLSQRLLHSKFSVLQLQSSSIRLRAITREKRMVICSAASAAGSSSPDSDFNPYE-VLGVNPIEGFDMVKAAYTKKRREAERIGDEA
        MALAVSNIF+CPKLRLSQR  HSKFSVLQL+SSSIRLR I+RE+RM+ICSAASAAGSS+PDSDFNPYE VLGVNPIEGFDMVKAAYTKKRREAERIGDEA
Subjt:  MALAVSNIFYCPKLRLSQRLLHSKFSVLQLQSSSIRLRAITREKRMVICSAASAAGSSSPDSDFNPYE-VLGVNPIEGFDMVKAAYTKKRREAERIGDEA

Query:  TAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKA
        TAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSP+DMQINMAISAVFTAWVLIKR+AEYKPLQFLAF FVYRIFEKLKA
Subjt:  TAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKA

Query:  FEPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSALMLYIMASYYR
        FEPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAV SLAYTG+LNFIEF+G YIPVFLYNNQELL+TSSSALMLYIMASYYR
Subjt:  FEPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSALMLYIMASYYR

A0A6J1D9X7 uncharacterized protein LOC1110189893.5e-13991.58Show/hide
Query:  MALAVSNIFYCPKLRLSQRLLHSKFSVLQLQSSSIRLRAITREKRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT
        MALAVSNIF+CPK+R+SQR  HSK SV  LQSSSIR R ITRE+R VI +AASAAGSSSP+SDFNPYEV+ VNPIEGFDM+KAAYTKKR+EAER+GDEAT
Subjt:  MALAVSNIFYCPKLRLSQRLLHSKFSVLQLQSSSIRLRAITREKRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSALMLYIMASYYR
        EP VSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEF+GSYIP FLYNNQELLITS+SALMLYIMASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSALMLYIMASYYR

A0A6J1GBN1 uncharacterized protein LOC1114526986.9e-13589.47Show/hide
Query:  MALAVSNIFYCPKLRLSQRLLHSKFSVLQLQSSSIRLRAITREKRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT
        MALA S IF+CPKLRLSQ   HSK  VLQL SSSIR R IT+E+RMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKR+EAER GDEA 
Subjt:  MALAVSNIFYCPKLRLSQRLLHSKFSVLQLQSSSIRLRAITREKRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKG+TFGSVKVSKDIKYADNQPIVPWGPR SKSSP+DMQINMAISAVFTAW L    AEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSALMLYIMASYYR
        EPAVSPSFTEDGEDSGRGIRMGKR+LRSLALVFGCIAVSSLAYTGILNFIEF+GSYIPV LYN+QELLITSSSA+MLY MASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSALMLYIMASYYR

A0A6J1KH23 uncharacterized protein LOC1114931472.8e-13690.18Show/hide
Query:  MALAVSNIFYCPKLRLSQRLLHSKFSVLQLQSSSIRLRAITREKRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT
        MALA S IF+CPKLRLSQ++ HSK  VLQL SSSIR R ITRE+RMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKR+EAER GDEA 
Subjt:  MALAVSNIFYCPKLRLSQRLLHSKFSVLQLQSSSIRLRAITREKRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEAT

Query:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF
        AARLEKAYDKVMMAQFTNRKKG+TFGSVKVSKDIKYADNQPIVPWGPR SKSSP+DMQINMAISAVFTAW LI   AEYKPLQFLAFAFVYRIFEKLKAF
Subjt:  AARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAF

Query:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSALMLYIMASYYR
        EPAVSPSFTEDGEDSGRGIRMGKR+LRSLALVFGCIAVSSLAYTGILNFIEF+GSYIPV LYN+QELLITSSSA+MLY MASYYR
Subjt:  EPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSALMLYIMASYYR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G08640.1 Chloroplast J-like domain 11.0e-9369.14Show/hide
Query:  EKRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEATAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPI
        + R+VI +A+SAAG+   D+DFNPYEVLGVNPIEGFD +K  Y +K ++A+R GDEATAA LEKAYDK+M AQ  NRKKGVTFGS KVSKDIKYAD QPI
Subjt:  EKRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEATAARLEKAYDKVMMAQFTNRKKGVTFGSVKVSKDIKYADNQPI

Query:  VPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAFEPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLA
        +PWGPR S+SS  DM IN+AIS VF+AW+ IKR+ EYKPLQF++F FVYRIFEKLK+FE   SP + E+GE+SGRG+RMGKRLLRSL+LVFG I ++SLA
Subjt:  VPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAFEPAVSPSFTEDGEDSGRGIRMGKRLLRSLALVFGCIAVSSLA

Query:  YTGILNFIEFIGSYIPVFLYNNQELLITSSSALMLYIMASYYR
        YTG LN IE++G  IP+ LYNNQEL++T+SSA MLY++AS+YR
Subjt:  YTGILNFIEFIGSYIPVFLYNNQELLITSSSALMLYIMASYYR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTTAGCAGTCTCCAACATCTTCTACTGCCCTAAACTTCGGCTTTCTCAGAGGCTGTTACATTCGAAATTCTCCGTTTTGCAGCTACAGTCGTCGTCAATCAGATT
AAGAGCGATAACACGAGAGAAAAGGATGGTTATCTGCTCAGCGGCTTCTGCAGCAGGAAGTTCTAGTCCAGACAGTGACTTCAACCCGTATGAGGTTCTAGGTGTAAACC
CAATTGAGGGGTTTGACATGGTCAAAGCAGCATATACTAAAAAACGCAGGGAGGCTGAGAGGATAGGTGATGAAGCAACTGCAGCTAGACTGGAGAAAGCTTATGACAAA
GTCATGATGGCACAATTCACAAATCGAAAGAAGGGTGTCACTTTTGGCTCAGTGAAGGTTTCTAAGGACATCAAGTATGCTGACAACCAACCAATTGTGCCATGGGGGCC
AAGGTCTTCCAAGTCCAGCCCAAAAGATATGCAAATCAATATGGCAATATCTGCTGTATTTACTGCTTGGGTCCTTATCAAACGCAGTGCTGAATATAAACCTCTACAGT
TCTTGGCATTTGCATTTGTTTATCGGATTTTTGAAAAGCTGAAAGCTTTTGAACCAGCTGTATCACCCTCATTTACAGAAGATGGTGAAGATTCAGGACGAGGTATACGG
ATGGGAAAGCGGTTGCTTCGCTCTCTTGCTTTAGTGTTTGGATGTATTGCTGTTTCCTCTTTGGCATATACTGGTATTTTGAATTTCATCGAGTTCATCGGCAGCTATAT
TCCAGTATTTCTCTACAATAACCAGGAACTATTGATCACCAGTTCATCGGCTCTCATGCTTTACATCATGGCATCTTACTACAGATGA
mRNA sequenceShow/hide mRNA sequence
AAATCAGAACTATAATAAAACAATACCCAGTTTCGAACCCCTTCGTCTCTTTCACTCCGCCTTTGACACAGAGCTCAGCAATGGCGTTAGCAGTCTCCAACATCTTCTAC
TGCCCTAAACTTCGGCTTTCTCAGAGGCTGTTACATTCGAAATTCTCCGTTTTGCAGCTACAGTCGTCGTCAATCAGATTAAGAGCGATAACACGAGAGAAAAGGATGGT
TATCTGCTCAGCGGCTTCTGCAGCAGGAAGTTCTAGTCCAGACAGTGACTTCAACCCGTATGAGGTTCTAGGTGTAAACCCAATTGAGGGGTTTGACATGGTCAAAGCAG
CATATACTAAAAAACGCAGGGAGGCTGAGAGGATAGGTGATGAAGCAACTGCAGCTAGACTGGAGAAAGCTTATGACAAAGTCATGATGGCACAATTCACAAATCGAAAG
AAGGGTGTCACTTTTGGCTCAGTGAAGGTTTCTAAGGACATCAAGTATGCTGACAACCAACCAATTGTGCCATGGGGGCCAAGGTCTTCCAAGTCCAGCCCAAAAGATAT
GCAAATCAATATGGCAATATCTGCTGTATTTACTGCTTGGGTCCTTATCAAACGCAGTGCTGAATATAAACCTCTACAGTTCTTGGCATTTGCATTTGTTTATCGGATTT
TTGAAAAGCTGAAAGCTTTTGAACCAGCTGTATCACCCTCATTTACAGAAGATGGTGAAGATTCAGGACGAGGTATACGGATGGGAAAGCGGTTGCTTCGCTCTCTTGCT
TTAGTGTTTGGATGTATTGCTGTTTCCTCTTTGGCATATACTGGTATTTTGAATTTCATCGAGTTCATCGGCAGCTATATTCCAGTATTTCTCTACAATAACCAGGAACT
ATTGATCACCAGTTCATCGGCTCTCATGCTTTACATCATGGCATCTTACTACAGATGATGTTAATCAGCATGTACCATTCAATCTTGCAGCCAACTTTTGAAGGAAACGG
TTTTGATATATTCTTGATTTGGTAATCTAGTTTTGTAAAATGAGGAACAGTATTGATGTTGTTATCGAGGAAATATAATGCTAGCTTCAGAAATTTTATGTGCAACGCCG
ATATGTCAGCTATCATATATAAAAGAGGTCGCTATCTCAATTAACACAAGTGATATTCATCGAGGAGATGGAAAACAAAGAACAAAATTTATAAAGTTGTAGTTATCTCA
ATTGGTTAATTTTGGTTTAGTC
Protein sequenceShow/hide protein sequence
MALAVSNIFYCPKLRLSQRLLHSKFSVLQLQSSSIRLRAITREKRMVICSAASAAGSSSPDSDFNPYEVLGVNPIEGFDMVKAAYTKKRREAERIGDEATAARLEKAYDK
VMMAQFTNRKKGVTFGSVKVSKDIKYADNQPIVPWGPRSSKSSPKDMQINMAISAVFTAWVLIKRSAEYKPLQFLAFAFVYRIFEKLKAFEPAVSPSFTEDGEDSGRGIR
MGKRLLRSLALVFGCIAVSSLAYTGILNFIEFIGSYIPVFLYNNQELLITSSSALMLYIMASYYR