; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg022892 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg022892
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Description30S ribosomal protein S2, chloroplastic
Genome locationscaffold94:9639..12464
RNA-Seq ExpressionSpg022892
SyntenySpg022892
Gene Ontology termsGO:0006351 - transcription, DNA-templated (biological process)
GO:0006412 - translation (biological process)
GO:0015986 - ATP synthesis coupled proton transport (biological process)
GO:0005743 - mitochondrial inner membrane (cellular component)
GO:0009507 - chloroplast (cellular component)
GO:0015935 - small ribosomal subunit (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0045261 - proton-transporting ATP synthase complex, catalytic core F(1) (cellular component)
GO:0045263 - proton-transporting ATP synthase complex, coupling factor F(o) (cellular component)
GO:0046933 - proton-transporting ATP synthase activity, rotational mechanism (molecular function)
GO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0032549 - ribonucleoside binding (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0003899 - DNA-directed 5'-3' RNA polymerase activity (molecular function)
GO:0003735 - structural constituent of ribosome (molecular function)
InterPro domainsIPR020537 - ATP synthase, F0 complex, subunit C, DCCD-binding site
IPR018130 - Ribosomal protein S2, conserved site
IPR005953 - ATP synthase, F0 complex, subunit C, bacterial/chloroplast
IPR005706 - Ribosomal protein S2, bacteria/mitochondria/plastid
IPR002379 - V-ATPase proteolipid subunit C-like domain
IPR001865 - Ribosomal protein S2
IPR000568 - ATP synthase, F0 complex, subunit A
IPR000454 - ATP synthase, F0 complex, subunit C
IPR023011 - ATP synthase, F0 complex, subunit A, active site
IPR023591 - Ribosomal protein S2, flavodoxin-like domain superfamily
IPR035908 - ATP synthase, F0 complex, subunit A superfamily
IPR035921 - F/V-ATP synthase subunit C superfamily
IPR038662 - F1F0 ATP synthase subunit C superfamily
IPR045082 - ATP synthase, F0 complex, subunit A, bacterial/chloroplast


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAD4981695.1 hypothetical protein E3N88_18366 [Mikania micrantha]2.0e-21891.94Show/hide
Query:  KWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTETRLHKFRDLRTEQ
        KWNP+MAPYISAKRKGIHI NLTRTARFLSEACDLVFDAASRGKQFLIVGTKNK ADSVA AA RARCHYVNKKWLGGMLTNWSTTETRLHKFRDLRTEQ
Subjt:  KWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTETRLHKFRDLRTEQ

Query:  KTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIASIRLILNKLVFAI
        KTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQ EEY ALQECITLGIPTICLIDTNCDPDLADISIPANDDAI+SIRLILNKLVFAI
Subjt:  KTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIASIRLILNKLVFAI

Query:  CEGSAIIAVRNPQTVPTDGQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGTMFLFIFVSNWSGALLPWKLIQLPHGELAAPTNDINTTVALALLTSVAYF
        C  SA +AVRNPQT+PT GQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGTMFLFIFVSNWSGALLPWK+IQLPHGELAAPTNDINTTVALALLTSVAYF
Subjt:  CEGSAIIAVRNPQTVPTDGQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGTMFLFIFVSNWSGALLPWKLIQLPHGELAAPTNDINTTVALALLTSVAYF

Query:  YAGLSKKGLSYFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVPLVVPIP---------------ELIMNPLISAASVIAAGLA
        YAGL+KKGL YFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVP VVPIP               ELIMNPLISAASVIA GLA
Subjt:  YAGLSKKGLSYFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVPLVVPIP---------------ELIMNPLISAASVIAAGLA

Query:  VGLASIGPGVGQGTAAGQAVEGIARQPEAEGKIR
        VGL SIGPGVGQGTAAGQAVEGIARQPEAEGKIR
Subjt:  VGLASIGPGVGQGTAAGQAVEGIARQPEAEGKIR

KAF1862723.1 hypothetical protein Lal_00044951 [Lupinus albus]1.7e-21781.31Show/hide
Query:  MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTE
        MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHI NLTRTARFLSEACDLVFDAASRGKQ LIVGTKNKAADSV RAA RARCHYV+KKWLGGMLTNW TTE
Subjt:  MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTE

Query:  TRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIA
        TRLHK RDLRTEQKTG LN LPKRDAAMLKRQLSHL+TYLGGIKYMTGLPDIVIIVDQQ EY ALQECITLGIPTI LIDTNCDPDLADISIPANDDAIA
Subjt:  TRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIA

Query:  SIRLILNKLVFAICEG---------------------------SAIIAVRNPQTVPTDGQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGTMFLFIFVSN
        SIRLILNKLVFAIC+                            SAI+ VRNPQT+PT GQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGT+FLFIFVSN
Subjt:  SIRLILNKLVFAICEG---------------------------SAIIAVRNPQTVPTDGQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGTMFLFIFVSN

Query:  WSGALLPWKLIQLPHGELAAPTNDINTTVALALLTSVAYFYAGLSKKGLSYFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVP
        WSGALLPWK+IQLPHGELAAPTNDINTTVALALLTS AYFYAGLSKKGL+YFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVP
Subjt:  WSGALLPWKLIQLPHGELAAPTNDINTTVALALLTSVAYFYAGLSKKGLSYFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVP

Query:  LVVPIP--------------------------------------------ELIMNPLISAASVIAAGLAVGLASIGPGVGQGTAAGQAVEGIARQPEAEG
        LVVPIP                                            ELIMNPLISAASVIAAGLAVGLASIGPGVGQGTAAGQAVEGIARQPEAEG
Subjt:  LVVPIP--------------------------------------------ELIMNPLISAASVIAAGLAVGLASIGPGVGQGTAAGQAVEGIARQPEAEG

Query:  KIR
        KIR
Subjt:  KIR

KAF3449772.1 hypothetical protein FNV43_RR05850 [Rhamnella rubrinervis]3.5e-20792.62Show/hide
Query:  MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTE
        MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFL+EACDLVFDAASRGKQFLIVGTKNKAADSVARAA +ARCHYVNKKWLGGMLTNW TTE
Subjt:  MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTE

Query:  TRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIA
        TRLHKFRDLRTEQKTG LNRLPKRDAA+LKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEY AL+ECITLGIPTICLIDTNCDPDLADISIPANDDAIA
Subjt:  TRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIA

Query:  SIRLILNKLVFAICEGSAIIAVRNPQTVPTDGQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGTMFLFIFVSNWSGALLPWKLIQLPHGELAAPTNDINT
        SIRLILNKLVFAICEGSAIIAVRNPQT+PTD QNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGT+FLFIFVSNWSGALLPWK+IQLPHGELAAPTNDINT
Subjt:  SIRLILNKLVFAICEGSAIIAVRNPQTVPTDGQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGTMFLFIFVSNWSGALLPWKLIQLPHGELAAPTNDINT

Query:  TVALALLTSVAYFYAGLSKKGLSYFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVPLVVPIPELIMNPLISAASVI
        TVALALLTSVAYFYAGL+KKGL YF KYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVPLV+PIP + +    S    +
Subjt:  TVALALLTSVAYFYAGLSKKGLSYFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVPLVVPIPELIMNPLISAASVI

KAG8363203.1 hypothetical protein BUALT_BualtPtG0001100 [Buddleja alternifolia]1.6e-19990.59Show/hide
Query:  MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTE
        M+EAGVHFGHGTRKWNP+MAPYISAKRKGIHI NLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVA AA +ARCH VNKKWLGGMLTNWSTTE
Subjt:  MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTE

Query:  TRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIA
        TRLHKFRDLR EQKTG LNRLPKRDAAM+KRQL  LQTYLGGIKYMTGLPDIVIIVDQ EEY AL+ECITLGIPTICLIDTNCDPDLADISIPANDDAI+
Subjt:  TRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIA

Query:  SIRLILNKLVFAICEGSAIIAVRNPQTVPTDGQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGTMFLFIFVSNWSGALLPWKLIQLPHGELAAPTNDINT
        SIRLILNKLVFAIC  SA IAVRNPQT+PT GQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGTMFLFIFVSNWSGALLPWK+I+LPHGELAAPTNDINT
Subjt:  SIRLILNKLVFAICEGSAIIAVRNPQTVPTDGQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGTMFLFIFVSNWSGALLPWKLIQLPHGELAAPTNDINT

Query:  TVALALLTSVAYFYAGLSKKGLSYFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVPLVVPIPELIMNPLISAASVI
        TVALALLTSVAYFYAGL+KKGLSYFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVPLVVPIP + +    S    +
Subjt:  TVALALLTSVAYFYAGLSKKGLSYFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVPLVVPIPELIMNPLISAASVI

RYR38668.1 hypothetical protein Ahy_A09g043808 [Arachis hypogaea]2.7e-19989.57Show/hide
Query:  MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTE
        MMEAGVHFGHGTRKWNPRMAPYIS KRKGIHI+NLTRTARFLSEACDLVFDAAS+GKQFLIVGTKNKAAD +ARAATRARCHYVNKKWLGGML NW TTE
Subjt:  MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTE

Query:  TRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIA
        TRLHKFR LRTEQKTG ++ LPK+D A+LKRQLSHL+TYLGGIKYMTGLPDIVIIVDQQEEY AL+ECITLGIPTICLIDTNCDPDLADISIPANDDAIA
Subjt:  TRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIA

Query:  SIRLILNKLVFAICEGSAIIAVRNPQTVPTDGQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGTMFLFIFVSNWSGALLPWKLIQLPHGELAAPTNDINT
        SIRLILNKLVFAICE  AI+ VRNPQT+PT GQNFFEYVLEFIRDVSKTQIGEEYGPWVPF+GTMFLFIFVSNWS ALLPWK+IQLPHGELAAPTNDINT
Subjt:  SIRLILNKLVFAICEGSAIIAVRNPQTVPTDGQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGTMFLFIFVSNWSGALLPWKLIQLPHGELAAPTNDINT

Query:  TVALALLTSVAYFYAGLSKKGLSYFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVPLVVPIPELIMNPLISAASVI
        TVALALLTSVAYFYAGLSKKGL+YFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVPLVVPIP + +    S    +
Subjt:  TVALALLTSVAYFYAGLSKKGLSYFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVPLVVPIPELIMNPLISAASVI

TrEMBL top hitse value%identityAlignment
A0A445BJ06 Uncharacterized protein1.3e-19989.57Show/hide
Query:  MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTE
        MMEAGVHFGHGTRKWNPRMAPYIS KRKGIHI+NLTRTARFLSEACDLVFDAAS+GKQFLIVGTKNKAAD +ARAATRARCHYVNKKWLGGML NW TTE
Subjt:  MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTE

Query:  TRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIA
        TRLHKFR LRTEQKTG ++ LPK+D A+LKRQLSHL+TYLGGIKYMTGLPDIVIIVDQQEEY AL+ECITLGIPTICLIDTNCDPDLADISIPANDDAIA
Subjt:  TRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIA

Query:  SIRLILNKLVFAICEGSAIIAVRNPQTVPTDGQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGTMFLFIFVSNWSGALLPWKLIQLPHGELAAPTNDINT
        SIRLILNKLVFAICE  AI+ VRNPQT+PT GQNFFEYVLEFIRDVSKTQIGEEYGPWVPF+GTMFLFIFVSNWS ALLPWK+IQLPHGELAAPTNDINT
Subjt:  SIRLILNKLVFAICEGSAIIAVRNPQTVPTDGQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGTMFLFIFVSNWSGALLPWKLIQLPHGELAAPTNDINT

Query:  TVALALLTSVAYFYAGLSKKGLSYFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVPLVVPIPELIMNPLISAASVI
        TVALALLTSVAYFYAGLSKKGL+YFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVPLVVPIP + +    S    +
Subjt:  TVALALLTSVAYFYAGLSKKGLSYFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVPLVVPIPELIMNPLISAASVI

A0A445BJ28 ATP-synt_C domain-containing protein1.3e-19989.31Show/hide
Query:  MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTE
        MMEAGVHFGHGTRKWNPRMAPYIS KRKGIHI+NLTRTARFLSEAC+LVFDAAS+GKQFLIVGTKNKAAD +ARAATRARC+YVNKKWLGGMLTNW TTE
Subjt:  MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTE

Query:  TRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIA
        TRLHKFR LRTEQKTG ++ LPK+D A+LKRQLSHL+TYLGGIKYMTGLPDIVIIVDQQEEY AL+ECITLGIPTICLIDTNCDPDLADISIPANDDAIA
Subjt:  TRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIA

Query:  SIRLILNKLVFAICEGSAIIAVRNPQTVPTDGQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGTMFLFIFVSNWSGALLPWKLIQLPHGELAAPTNDINT
        SIRLILNKLVFAICE  AI+ VRNPQT+PT GQN FEYVLEFIRDVSKTQIGEEYGPWVPF+GTMFLFIFVSNWSGALLPWK+IQLPHGELAAPTNDINT
Subjt:  SIRLILNKLVFAICEGSAIIAVRNPQTVPTDGQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGTMFLFIFVSNWSGALLPWKLIQLPHGELAAPTNDINT

Query:  TVALALLTSVAYFYAGLSKKGLSYFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVPLVVPIPELIMNPLISAASVI
        TVALALLTSVAYFYAGLSKKGL+YFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVPLVVPIP++ +    S    +
Subjt:  TVALALLTSVAYFYAGLSKKGLSYFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVPLVVPIPELIMNPLISAASVI

A0A5N6NLV6 DNA-directed RNA polymerase9.6e-21991.94Show/hide
Query:  KWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTETRLHKFRDLRTEQ
        KWNP+MAPYISAKRKGIHI NLTRTARFLSEACDLVFDAASRGKQFLIVGTKNK ADSVA AA RARCHYVNKKWLGGMLTNWSTTETRLHKFRDLRTEQ
Subjt:  KWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTETRLHKFRDLRTEQ

Query:  KTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIASIRLILNKLVFAI
        KTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQ EEY ALQECITLGIPTICLIDTNCDPDLADISIPANDDAI+SIRLILNKLVFAI
Subjt:  KTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIASIRLILNKLVFAI

Query:  CEGSAIIAVRNPQTVPTDGQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGTMFLFIFVSNWSGALLPWKLIQLPHGELAAPTNDINTTVALALLTSVAYF
        C  SA +AVRNPQT+PT GQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGTMFLFIFVSNWSGALLPWK+IQLPHGELAAPTNDINTTVALALLTSVAYF
Subjt:  CEGSAIIAVRNPQTVPTDGQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGTMFLFIFVSNWSGALLPWKLIQLPHGELAAPTNDINTTVALALLTSVAYF

Query:  YAGLSKKGLSYFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVPLVVPIP---------------ELIMNPLISAASVIAAGLA
        YAGL+KKGL YFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVP VVPIP               ELIMNPLISAASVIA GLA
Subjt:  YAGLSKKGLSYFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVPLVVPIP---------------ELIMNPLISAASVIAAGLA

Query:  VGLASIGPGVGQGTAAGQAVEGIARQPEAEGKIR
        VGL SIGPGVGQGTAAGQAVEGIARQPEAEGKIR
Subjt:  VGLASIGPGVGQGTAAGQAVEGIARQPEAEGKIR

A0A6A5L7H7 Uncharacterized protein6.0e-19785Show/hide
Query:  MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTE
        MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHI NLTRTARFLSEACDLVFDAASRGKQ LIVGTKNKAADSV RAA RARCHYV+KKWLGGMLTNW TTE
Subjt:  MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTE

Query:  TRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIA
        TRLHK RDLRTEQKTG LN LPKRDAAMLKRQLSHL+TYLGGIKYMTGLPDIVIIVDQQ EY ALQECITLGIPTI LIDTNCDPDLADISIPANDDAIA
Subjt:  TRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIA

Query:  SIRLILNKLVFAICEG---------------------------SAIIAVRNPQTVPTDGQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGTMFLFIFVSN
        SIRLILNKLVFAIC+                            SAI+ VRNPQT+PT GQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGT+FLFIFVSN
Subjt:  SIRLILNKLVFAICEG---------------------------SAIIAVRNPQTVPTDGQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGTMFLFIFVSN

Query:  WSGALLPWKLIQLPHGELAAPTNDINTTVALALLTSVAYFYAGLSKKGLSYFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVP
        WSGALLPWK+IQLPHGELAAPTNDINTTVALALLTS AYFYAGLSKKGL+YFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVP
Subjt:  WSGALLPWKLIQLPHGELAAPTNDINTTVALALLTSVAYFYAGLSKKGLSYFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVP

Query:  LVVPIPELIMNPLISAASVI
        LVVPIP + +    S    +
Subjt:  LVVPIPELIMNPLISAASVI

A0A6A5LUY7 ATP-synt_C domain-containing protein8.1e-21881.31Show/hide
Query:  MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTE
        MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHI NLTRTARFLSEACDLVFDAASRGKQ LIVGTKNKAADSV RAA RARCHYV+KKWLGGMLTNW TTE
Subjt:  MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTE

Query:  TRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIA
        TRLHK RDLRTEQKTG LN LPKRDAAMLKRQLSHL+TYLGGIKYMTGLPDIVIIVDQQ EY ALQECITLGIPTI LIDTNCDPDLADISIPANDDAIA
Subjt:  TRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIA

Query:  SIRLILNKLVFAICEG---------------------------SAIIAVRNPQTVPTDGQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGTMFLFIFVSN
        SIRLILNKLVFAIC+                            SAI+ VRNPQT+PT GQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGT+FLFIFVSN
Subjt:  SIRLILNKLVFAICEG---------------------------SAIIAVRNPQTVPTDGQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGTMFLFIFVSN

Query:  WSGALLPWKLIQLPHGELAAPTNDINTTVALALLTSVAYFYAGLSKKGLSYFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVP
        WSGALLPWK+IQLPHGELAAPTNDINTTVALALLTS AYFYAGLSKKGL+YFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVP
Subjt:  WSGALLPWKLIQLPHGELAAPTNDINTTVALALLTSVAYFYAGLSKKGLSYFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVP

Query:  LVVPIP--------------------------------------------ELIMNPLISAASVIAAGLAVGLASIGPGVGQGTAAGQAVEGIARQPEAEG
        LVVPIP                                            ELIMNPLISAASVIAAGLAVGLASIGPGVGQGTAAGQAVEGIARQPEAEG
Subjt:  LVVPIP--------------------------------------------ELIMNPLISAASVIAAGLAVGLASIGPGVGQGTAAGQAVEGIARQPEAEG

Query:  KIR
        KIR
Subjt:  KIR

SwissProt top hitse value%identityAlignment
Q0ZJ31 30S ribosomal protein S2, chloroplastic2.4e-11894.22Show/hide
Query:  MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTE
        MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHI NLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAA +ARCHYVNKKWLGGM TNWSTTE
Subjt:  MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTE

Query:  TRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIA
        TRLHKFRDLRTEQKTG LNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEY AL+ECITLGIPTICLIDTNCDPDLADISIPANDDAIA
Subjt:  TRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIA

Query:  SIRLILNKLVFAICEGSAIIAVRNP
        SIRLILNKLVFAICEG +   +RNP
Subjt:  SIRLILNKLVFAICEGSAIIAVRNP

Q1KXW9 30S ribosomal protein S2, chloroplastic3.5e-11793.33Show/hide
Query:  MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTE
        MMEAGVHFGHGTRKWNP+MAPYISAKRKGIHI NLTRTARFLSEACDLVFDAASRGKQFLIVGTKNK ADSVA AA RARCHYVNKKWLGGMLTNWSTTE
Subjt:  MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTE

Query:  TRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIA
        TRLHKFRDLRTEQKTGGL+RLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQ EEY ALQECITLGIPTICLIDTNCDPDLADISIPANDDAI+
Subjt:  TRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIA

Query:  SIRLILNKLVFAICEGSAIIAVRNP
        SIRLILNKLVFAICEG +   +RNP
Subjt:  SIRLILNKLVFAICEGSAIIAVRNP

Q49L09 30S ribosomal protein S2, chloroplastic1.9e-11894.22Show/hide
Query:  MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTE
        MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHI NLT+TARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAA RARCHYVNKKWLGGMLTNWSTTE
Subjt:  MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTE

Query:  TRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIA
        TRLHKFRDLRTEQK G LNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEY AL+ECITLGIPTICLIDTNCDPDLADISIPANDDAIA
Subjt:  TRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIA

Query:  SIRLILNKLVFAICEGSAIIAVRNP
        SIRLILNKLVFAICEG +   +RNP
Subjt:  SIRLILNKLVFAICEGSAIIAVRNP

Q4VZP4 30S ribosomal protein S2, chloroplastic9.8e-12095.54Show/hide
Query:  MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTE
        MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAA+RGKQFLIVGTKNKAADSVARAATR RCHYVNKKWLGGMLTNWSTTE
Subjt:  MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTE

Query:  TRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIA
        TRLHKFRDLRTEQKTGGLNRLPKRDAAM KRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIA
Subjt:  TRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIA

Query:  SIRLILNKLVFAICEGSAIIAVRN
        SIRLILNKLVFAI EG +  ++RN
Subjt:  SIRLILNKLVFAICEGSAIIAVRN

Q56P10 30S ribosomal protein S2, chloroplastic3.5e-11793.33Show/hide
Query:  MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTE
        MMEAGVHFGHGTRKWNP+MAPYISAKRKGIHI NLTRTARFLSEACDLVFDAASRGKQFLIVGTKNK ADSVA AA RARCHYVNKKWLGGMLTNWSTTE
Subjt:  MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTE

Query:  TRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIA
        TRLHKFRDLRTEQKTGGL+RLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQ EEY ALQECITLGIPTICLIDTNCDPDLADISIPANDDAI+
Subjt:  TRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIA

Query:  SIRLILNKLVFAICEGSAIIAVRNP
        SIRLILNKLVFAICEG +   +RNP
Subjt:  SIRLILNKLVFAICEGSAIIAVRNP

Arabidopsis top hitse value%identityAlignment
ATCG00140.1 ATP synthase subunit C family protein2.5e-3398.77Show/hide
Query:  MNPLISAASVIAAGLAVGLASIGPGVGQGTAAGQAVEGIARQPEAEGKIRGTLLLSLAFMEALTIYGLVVALALLFANPFV
        MNPL+SAASVIAAGLAVGLASIGPGVGQGTAAGQAVEGIARQPEAEGKIRGTLLLSLAFMEALTIYGLVVALALLFANPFV
Subjt:  MNPLISAASVIAAGLAVGLASIGPGVGQGTAAGQAVEGIARQPEAEGKIRGTLLLSLAFMEALTIYGLVVALALLFANPFV

ATCG00150.1 ATPase, F0 complex, subunit A protein1.9e-8684.21Show/hide
Query:  LILNKLVFAICEGSAIIAVRNPQTVPTDGQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGTMFLFIFVSNWSGALLPWKLIQLPHGELAAPTNDINTTVA
        LI + +V AI  GSA++A+RNPQT+PTDGQNFFE+VLEFIRDVSKTQIGEEYGPWVPFIGT+FLFIFVSNWSGALLPWK+IQLP GELAAPTNDINTTVA
Subjt:  LILNKLVFAICEGSAIIAVRNPQTVPTDGQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGTMFLFIFVSNWSGALLPWKLIQLPHGELAAPTNDINTTVA

Query:  LALLTSVAYFYAGLSKKGLSYFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVPLVVPIPELIMNPLISAASVI
        LALLTSVAYFYAGLSKKGL YF KYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVPLVVPIP + +    S    +
Subjt:  LALLTSVAYFYAGLSKKGLSYFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVPLVVPIPELIMNPLISAASVI

ATCG00160.1 ribosomal protein S22.4e-11392.59Show/hide
Query:  MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTE
        MM AGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAAD V+RAA RARCHYVNKKWLGGMLTNWSTTE
Subjt:  MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTE

Query:  TRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIA
         RLHKFRDLRTEQKT G NRLPKRDAA+LKRQLS L+TYLGGIKYMTGLPDIVII+DQQEEY AL+ECITLGIPTI LIDTNC+PDLADISIPANDDAIA
Subjt:  TRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIA

Query:  SIRLILNKLVFAICEG
        SIR ILNKLVFAICEG
Subjt:  SIRLILNKLVFAICEG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGAAGCAGGAGTTCATTTTGGTCATGGTACTAGGAAATGGAATCCTAGAATGGCACCTTATATCTCTGCAAAACGTAAAGGTATTCATATTATAAATCTTACTAG
AACTGCTCGTTTTTTATCAGAAGCTTGTGATTTAGTTTTTGATGCAGCAAGTAGGGGAAAACAATTCTTAATTGTTGGTACCAAAAATAAAGCAGCGGATTCAGTAGCCC
GGGCTGCAACAAGGGCTCGGTGTCATTATGTTAATAAAAAATGGCTCGGGGGTATGTTAACAAATTGGTCTACTACAGAAACGAGACTTCATAAGTTCAGGGACTTAAGA
ACGGAACAAAAGACGGGGGGACTCAACCGTCTTCCGAAAAGAGATGCCGCTATGTTGAAGAGACAATTATCTCACTTGCAAACATATCTGGGCGGGATTAAATATATGAC
AGGGTTACCCGATATTGTAATAATCGTCGATCAGCAAGAAGAATATCGGGCTCTTCAAGAATGTATCACGTTGGGAATTCCAACTATTTGTTTAATTGATACAAATTGTG
ACCCGGATCTCGCAGATATTTCGATTCCAGCGAATGATGATGCTATAGCTTCAATCCGATTAATTCTTAACAAATTAGTATTTGCAATTTGTGAGGGTTCAGCCATTATA
GCTGTTCGTAATCCACAAACCGTTCCTACTGACGGTCAGAATTTCTTCGAATATGTCCTTGAATTCATTCGAGACGTGAGCAAAACTCAGATTGGCGAAGAATATGGTCC
ATGGGTTCCCTTTATTGGAACTATGTTTCTATTTATTTTTGTTTCGAATTGGTCAGGGGCTCTTTTACCTTGGAAACTCATACAGTTACCTCACGGAGAGTTAGCCGCAC
CCACAAATGATATAAATACTACTGTTGCTTTAGCTTTACTCACATCAGTAGCATATTTCTATGCGGGTCTTAGCAAAAAAGGATTAAGTTATTTCGGTAAATACATTCAA
CCAACTCCAATCCTTTTACCCATTAACATCTTAGAAGATTTCACAAAACCCTTATCACTTAGTTTTCGACTTTTCGGAAATATATTAGCTGATGAATTAGTAGTTGTTGT
TCTTGTTTCTTTAGTACCTTTAGTAGTTCCTATACCTGAACTTATCATGAATCCACTGATTTCTGCCGCTTCCGTTATTGCTGCTGGGTTGGCCGTTGGGCTTGCTTCTA
TTGGACCTGGGGTTGGTCAAGGTACTGCTGCGGGCCAAGCTGTAGAAGGGATCGCGAGACAACCCGAGGCGGAGGGAAAAATCCGAGGTACTTTATTGCTTAGTTTGGCT
TTTATGGAAGCTTTAACAATTTATGGACTGGTTGTAGCATTAGCACTTTTATTTGCGAATCCTTTTGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGATGGAAGCAGGAGTTCATTTTGGTCATGGTACTAGGAAATGGAATCCTAGAATGGCACCTTATATCTCTGCAAAACGTAAAGGTATTCATATTATAAATCTTACTAG
AACTGCTCGTTTTTTATCAGAAGCTTGTGATTTAGTTTTTGATGCAGCAAGTAGGGGAAAACAATTCTTAATTGTTGGTACCAAAAATAAAGCAGCGGATTCAGTAGCCC
GGGCTGCAACAAGGGCTCGGTGTCATTATGTTAATAAAAAATGGCTCGGGGGTATGTTAACAAATTGGTCTACTACAGAAACGAGACTTCATAAGTTCAGGGACTTAAGA
ACGGAACAAAAGACGGGGGGACTCAACCGTCTTCCGAAAAGAGATGCCGCTATGTTGAAGAGACAATTATCTCACTTGCAAACATATCTGGGCGGGATTAAATATATGAC
AGGGTTACCCGATATTGTAATAATCGTCGATCAGCAAGAAGAATATCGGGCTCTTCAAGAATGTATCACGTTGGGAATTCCAACTATTTGTTTAATTGATACAAATTGTG
ACCCGGATCTCGCAGATATTTCGATTCCAGCGAATGATGATGCTATAGCTTCAATCCGATTAATTCTTAACAAATTAGTATTTGCAATTTGTGAGGGTTCAGCCATTATA
GCTGTTCGTAATCCACAAACCGTTCCTACTGACGGTCAGAATTTCTTCGAATATGTCCTTGAATTCATTCGAGACGTGAGCAAAACTCAGATTGGCGAAGAATATGGTCC
ATGGGTTCCCTTTATTGGAACTATGTTTCTATTTATTTTTGTTTCGAATTGGTCAGGGGCTCTTTTACCTTGGAAACTCATACAGTTACCTCACGGAGAGTTAGCCGCAC
CCACAAATGATATAAATACTACTGTTGCTTTAGCTTTACTCACATCAGTAGCATATTTCTATGCGGGTCTTAGCAAAAAAGGATTAAGTTATTTCGGTAAATACATTCAA
CCAACTCCAATCCTTTTACCCATTAACATCTTAGAAGATTTCACAAAACCCTTATCACTTAGTTTTCGACTTTTCGGAAATATATTAGCTGATGAATTAGTAGTTGTTGT
TCTTGTTTCTTTAGTACCTTTAGTAGTTCCTATACCTGAACTTATCATGAATCCACTGATTTCTGCCGCTTCCGTTATTGCTGCTGGGTTGGCCGTTGGGCTTGCTTCTA
TTGGACCTGGGGTTGGTCAAGGTACTGCTGCGGGCCAAGCTGTAGAAGGGATCGCGAGACAACCCGAGGCGGAGGGAAAAATCCGAGGTACTTTATTGCTTAGTTTGGCT
TTTATGGAAGCTTTAACAATTTATGGACTGGTTGTAGCATTAGCACTTTTATTTGCGAATCCTTTTGTTTAA
Protein sequenceShow/hide protein sequence
MMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTETRLHKFRDLR
TEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIASIRLILNKLVFAICEGSAII
AVRNPQTVPTDGQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGTMFLFIFVSNWSGALLPWKLIQLPHGELAAPTNDINTTVALALLTSVAYFYAGLSKKGLSYFGKYIQ
PTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVPLVVPIPELIMNPLISAASVIAAGLAVGLASIGPGVGQGTAAGQAVEGIARQPEAEGKIRGTLLLSLA
FMEALTIYGLVVALALLFANPFV