; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh03G010840 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh03G010840
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionSWIM-type domain-containing protein
Genome locationCmo_Chr03:8103068..8109074
RNA-Seq ExpressionCmoCh03G010840
SyntenyCmoCh03G010840
Gene Ontology termsGO:0015979 - photosynthesis (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0009522 - photosystem I (cellular component)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR007527 - Zinc finger, SWIM-type
IPR008796 - Photosystem I reaction centre subunit N, chloroplastic
IPR044907 - Photosystem I reaction centre subunit N superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6604288.1 Photosystem I reaction center subunit N, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0094.61Show/hide
Query:  MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENY
        MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFI+GECTNIECPTRFHIEKGKKR MGSLKEFKDDEYLEYRQYWCSFGPENY
Subjt:  MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENY

Query:  GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA
        GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA
Subjt:  GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA

Query:  NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST
        NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST
Subjt:  NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST

Query:  FGIKRLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKC
        FGIKRLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKC
Subjt:  FGIKRLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKC

Query:  SSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD
        SSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD
Subjt:  SSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD

Query:  WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHVIK
        WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHVIK
Subjt:  WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHVIK

Query:  VNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSNDISSVVNKLPLKWASGKGRTSCRKPSSTVAFPLESNIV
        VNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSNDISSV                         F  +S+  
Subjt:  VNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSNDISSVVNKLPLKWASGKGRTSCRKPSSTVAFPLESNIV

Query:  KKAMPKKNRKRKRLSGGLWSKKHRLQLARGMVRICPHKSIARVLPLRSFRIYQESLQEYLLRPTQQAMATMNSSVLACNYAISGAGSADLSSKINAAPSV
                      SGGLWSKKHRLQLARGMVRICPHKSIARVLPLRSFRIYQESLQEY  RPTQQAMATMNSSVLACNYAISGAGSADLSSKINAAPSV
Subjt:  KKAMPKKNRKRKRLSGGLWSKKHRLQLARGMVRICPHKSIARVLPLRSFRIYQESLQEYLLRPTQQAMATMNSSVLACNYAISGAGSADLSSKINAAPSV

Query:  ASPGVVGYKLPAIRAQQARVPEAKNDGRRSALLYLGATLFAAAAAASNSSANAGVIEEYLEKSKANKELNDKKRLATSGANFARAYTVQFGTCKFPENFT
        ASPGVVGYKLPAIRAQQARVPEAKNDGRRSALLYLGATLFAAAAAASNSSANAGVIEEYLEKSKANKELNDKKRLATSGANFARAYTVQFGTCKFPENFT
Subjt:  ASPGVVGYKLPAIRAQQARVPEAKNDGRRSALLYLGATLFAAAAAASNSSANAGVIEEYLEKSKANKELNDKKRLATSGANFARAYTVQFGTCKFPENFT

Query:  GCQDLAKQK
        GCQDLAKQK
Subjt:  GCQDLAKQK

XP_008441058.1 PREDICTED: uncharacterized protein LOC103485285 [Cucumis melo]0.0e+0094.55Show/hide
Query:  MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENY
        MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPY RVDAFI+GECTNIECPTRFHIE+G+KRS GSLKEFKDDEYLEYRQYWCSFGPENY
Subjt:  MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENY

Query:  GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA
        GEGG ILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGP DREAIGP A +IPYICNEIQQQTMSM+YLGIPEA
Subjt:  GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA

Query:  NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST
        NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASI MWVERNKKSIFI+QDTSE+N FILGIQTEWQLQQMIRFGHRSLIAADST
Subjt:  NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST

Query:  FGIKRLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKC
        FGI+RLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEI+PI DIFCCPVLFSLWRIRRSWLKNVVRKC
Subjt:  FGIKRLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKC

Query:  SSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD
        SSIEVQRE+FKRLGKLV+SIWDG+DTSVVLE+F RDF+DQTAFMEYFKGCWVPKIEMWLSAMR FPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD
Subjt:  SSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD

Query:  WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHVIK
        WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSV+LDEENHLFAKVLSQKD+S+SH+VWNPGSEFSFCDCSWS+QGNLCKHVIK
Subjt:  WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHVIK

Query:  VNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSNDISSVVNKLPLKWASGKGRTSCRKPSSTVAFPLESNIV
        VNM+CEN PSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDE+QKLVELNSSNDISSVVNKLPLKWASGKGRTS RKPSSTV+FP ESN V
Subjt:  VNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSNDISSVVNKLPLKWASGKGRTSCRKPSSTVAFPLESNIV

Query:  KKAMPKKNRKRKRLS
        KKAM KKN+KRKRLS
Subjt:  KKAMPKKNRKRKRLS

XP_022949833.1 uncharacterized protein LOC111453111 [Cucurbita moschata]0.0e+00100Show/hide
Query:  MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENY
        MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENY
Subjt:  MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENY

Query:  GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA
        GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA
Subjt:  GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA

Query:  NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST
        NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST
Subjt:  NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST

Query:  FGIKRLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKC
        FGIKRLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKC
Subjt:  FGIKRLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKC

Query:  SSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD
        SSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD
Subjt:  SSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD

Query:  WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHVIK
        WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHVIK
Subjt:  WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHVIK

Query:  VNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSNDISSVVNKLPLKWASGKGRTSCRKPSSTVAFPLESNIV
        VNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSNDISSVVNKLPLKWASGKGRTSCRKPSSTVAFPLESNIV
Subjt:  VNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSNDISSVVNKLPLKWASGKGRTSCRKPSSTVAFPLESNIV

Query:  KKAMPKKNRKRKRLS
        KKAMPKKNRKRKRLS
Subjt:  KKAMPKKNRKRKRLS

XP_022977961.1 uncharacterized protein LOC111478094 [Cucurbita maxima]0.0e+0099.16Show/hide
Query:  MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENY
        MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENY
Subjt:  MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENY

Query:  GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA
        GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA
Subjt:  GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA

Query:  NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST
        NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST
Subjt:  NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST

Query:  FGIKRLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKC
        FGIKRLKYPLCT LVFDSRQHALPVAW+ITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKC
Subjt:  FGIKRLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKC

Query:  SSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD
        SSIEVQREMFKRLGKLVHSIWDGV+TSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLK KLFDDSHLGAFQRVD
Subjt:  SSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD

Query:  WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHVIK
        WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQI DSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHVIK
Subjt:  WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHVIK

Query:  VNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSNDISSVVNKLPLKWASGKGRTSCRKPSSTVAFPLESNIV
        VNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSNDISSVVNKLPLKWASGKGRTSCRKPSSTVAFPLESNIV
Subjt:  VNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSNDISSVVNKLPLKWASGKGRTSCRKPSSTVAFPLESNIV

Query:  KKAMPKKNRKRKRLS
        KKAMPKK+RKRKRLS
Subjt:  KKAMPKKNRKRKRLS

XP_023543629.1 uncharacterized protein LOC111803459 [Cucurbita pepo subsp. pepo]0.0e+0099.16Show/hide
Query:  MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENY
        MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENY
Subjt:  MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENY

Query:  GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA
        GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA
Subjt:  GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA

Query:  NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST
        NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST
Subjt:  NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST

Query:  FGIKRLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKC
        FGIKRLKYPLCTLLVFDSRQHALPVAW+ITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATE DPITDIFCCPVLFSLWRIRRSWLKNVVRKC
Subjt:  FGIKRLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKC

Query:  SSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD
        SSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDF RDF+DQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD
Subjt:  SSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD

Query:  WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHVIK
        WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHVIK
Subjt:  WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHVIK

Query:  VNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSNDISSVVNKLPLKWASGKGRTSCRKPSSTVAFPLESNIV
        VNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSNDISSVVNKLPLKWASGKGRTSCRKPSST+AFPLESNIV
Subjt:  VNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSNDISSVVNKLPLKWASGKGRTSCRKPSSTVAFPLESNIV

Query:  KKAMPKKNRKRKRLS
        KKAMPKK+RKRKRLS
Subjt:  KKAMPKKNRKRKRLS

TrEMBL top hitse value%identityAlignment
A0A0A0KIZ0 SWIM-type domain-containing protein0.0e+0094.27Show/hide
Query:  MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENY
        MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPY RVDAFI+GECTNIECPTRFHIE+G+KRS GSLKEFKDDEYLEYRQYWCSFGPENY
Subjt:  MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENY

Query:  GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA
        GEGG ILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGP DREAIGP A +IPYICNEIQQQTMSM+YLGIPEA
Subjt:  GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA

Query:  NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST
        NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDD+ASI MWVERNKKSIFI+QDTSE+N FILGIQTEWQLQQMIRFGHRSLIAADST
Subjt:  NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST

Query:  FGIKRLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKC
        FGI+RLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRA SVEPGWKVSGFLIDDAATEIDPI DIFCCPVLFSLWRIRRSWLKNVVRKC
Subjt:  FGIKRLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKC

Query:  SSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD
        SSIEVQRE+FKRLGKLV+SIWDGVD SVVLE+F RDF+DQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD
Subjt:  SSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD

Query:  WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHVIK
        WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSV+LD+ENHLFAKVLSQKD+S+SH+VWNPGSEFSFCDCSWS+QGNLCKHVIK
Subjt:  WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHVIK

Query:  VNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSNDISSVVNKLPLKWASGKGRTSCRKPSSTVAFPLESNIV
        VNM+CEN PSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDE+QKLVELNSSNDISSVVNKLPLKWASGKGRTS RKPSST+ FP ESN V
Subjt:  VNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSNDISSVVNKLPLKWASGKGRTSCRKPSSTVAFPLESNIV

Query:  KKAMPKKNRKRKRLS
        KKAM KKN+KRKRLS
Subjt:  KKAMPKKNRKRKRLS

A0A1S3B2L2 uncharacterized protein LOC1034852850.0e+0094.55Show/hide
Query:  MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENY
        MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPY RVDAFI+GECTNIECPTRFHIE+G+KRS GSLKEFKDDEYLEYRQYWCSFGPENY
Subjt:  MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENY

Query:  GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA
        GEGG ILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGP DREAIGP A +IPYICNEIQQQTMSM+YLGIPEA
Subjt:  GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA

Query:  NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST
        NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASI MWVERNKKSIFI+QDTSE+N FILGIQTEWQLQQMIRFGHRSLIAADST
Subjt:  NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST

Query:  FGIKRLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKC
        FGI+RLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEI+PI DIFCCPVLFSLWRIRRSWLKNVVRKC
Subjt:  FGIKRLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKC

Query:  SSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD
        SSIEVQRE+FKRLGKLV+SIWDG+DTSVVLE+F RDF+DQTAFMEYFKGCWVPKIEMWLSAMR FPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD
Subjt:  SSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD

Query:  WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHVIK
        WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSV+LDEENHLFAKVLSQKD+S+SH+VWNPGSEFSFCDCSWS+QGNLCKHVIK
Subjt:  WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHVIK

Query:  VNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSNDISSVVNKLPLKWASGKGRTSCRKPSSTVAFPLESNIV
        VNM+CEN PSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDE+QKLVELNSSNDISSVVNKLPLKWASGKGRTS RKPSSTV+FP ESN V
Subjt:  VNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSNDISSVVNKLPLKWASGKGRTSCRKPSSTVAFPLESNIV

Query:  KKAMPKKNRKRKRLS
        KKAM KKN+KRKRLS
Subjt:  KKAMPKKNRKRKRLS

A0A5A7SLI4 SWIM zinc finger family protein0.0e+0094.55Show/hide
Query:  MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENY
        MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPY RVDAFI+GECTNIECPTRFHIE+G+KRS GSLKEFKDDEYLEYRQYWCSFGPENY
Subjt:  MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENY

Query:  GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA
        GEGG ILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGP DREAIGP A +IPYICNEIQQQTMSM+YLGIPEA
Subjt:  GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA

Query:  NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST
        NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASI MWVERNKKSIFI+QDTSE+N FILGIQTEWQLQQMIRFGHRSLIAADST
Subjt:  NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST

Query:  FGIKRLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKC
        FGI+RLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEI+PI DIFCCPVLFSLWRIRRSWLKNVVRKC
Subjt:  FGIKRLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKC

Query:  SSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD
        SSIEVQRE+FKRLGKLV+SIWDG+DTSVVLE+F RDF+DQTAFMEYFKGCWVPKIEMWLSAMR FPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD
Subjt:  SSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD

Query:  WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHVIK
        WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSV+LDEENHLFAKVLSQKD+S+SH+VWNPGSEFSFCDCSWS+QGNLCKHVIK
Subjt:  WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHVIK

Query:  VNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSNDISSVVNKLPLKWASGKGRTSCRKPSSTVAFPLESNIV
        VNM+CEN PSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDE+QKLVELNSSNDISSVVNKLPLKWASGKGRTS RKPSSTV+FP ESN V
Subjt:  VNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSNDISSVVNKLPLKWASGKGRTSCRKPSSTVAFPLESNIV

Query:  KKAMPKKNRKRKRLS
        KKAM KKN+KRKRLS
Subjt:  KKAMPKKNRKRKRLS

A0A6J1GD77 uncharacterized protein LOC1114531110.0e+00100Show/hide
Query:  MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENY
        MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENY
Subjt:  MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENY

Query:  GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA
        GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA
Subjt:  GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA

Query:  NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST
        NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST
Subjt:  NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST

Query:  FGIKRLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKC
        FGIKRLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKC
Subjt:  FGIKRLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKC

Query:  SSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD
        SSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD
Subjt:  SSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD

Query:  WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHVIK
        WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHVIK
Subjt:  WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHVIK

Query:  VNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSNDISSVVNKLPLKWASGKGRTSCRKPSSTVAFPLESNIV
        VNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSNDISSVVNKLPLKWASGKGRTSCRKPSSTVAFPLESNIV
Subjt:  VNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSNDISSVVNKLPLKWASGKGRTSCRKPSSTVAFPLESNIV

Query:  KKAMPKKNRKRKRLS
        KKAMPKKNRKRKRLS
Subjt:  KKAMPKKNRKRKRLS

A0A6J1ILG2 uncharacterized protein LOC1114780940.0e+0099.16Show/hide
Query:  MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENY
        MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENY
Subjt:  MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENY

Query:  GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA
        GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA
Subjt:  GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA

Query:  NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST
        NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST
Subjt:  NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST

Query:  FGIKRLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKC
        FGIKRLKYPLCT LVFDSRQHALPVAW+ITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKC
Subjt:  FGIKRLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKC

Query:  SSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD
        SSIEVQREMFKRLGKLVHSIWDGV+TSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLK KLFDDSHLGAFQRVD
Subjt:  SSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD

Query:  WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHVIK
        WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQI DSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHVIK
Subjt:  WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHVIK

Query:  VNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSNDISSVVNKLPLKWASGKGRTSCRKPSSTVAFPLESNIV
        VNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSNDISSVVNKLPLKWASGKGRTSCRKPSSTVAFPLESNIV
Subjt:  VNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSNDISSVVNKLPLKWASGKGRTSCRKPSSTVAFPLESNIV

Query:  KKAMPKKNRKRKRLS
        KKAMPKK+RKRKRLS
Subjt:  KKAMPKKNRKRKRLS

SwissProt top hitse value%identityAlignment
O65107 Photosystem I reaction center subunit N, chloroplastic (Fragment)6.4e-2280.95Show/hide
Query:  SNSSANAGVIEEYLEKSKANKELNDKKRLATSGANFARAYTVQFGTCKFPENFTGCQDLAKQK
        +  SA A + +EYLEKSKANKELNDKKRLATSGANFARAYTV+FG+C+FP NFTGCQDLAKQK
Subjt:  SNSSANAGVIEEYLEKSKANKELNDKKRLATSGANFARAYTVQFGTCKFPENFTGCQDLAKQK

P31093 Photosystem I reaction center subunit N, chloroplastic2.5e-2667.57Show/hide
Query:  VVGYKLPAIRAQQA------RVPEAKNDGRRSALLYLGATLFAAAAAASNSSANAGVIEEYLEKSKANKELNDKKRLATSGANFARAYTVQFGTCKFPEN
        VVG K  A   Q A      RV  A    RRSALL L A    AA AAS  SA A V +EYLEKSK NKELNDKKR ATSGANFARAYTVQFG+CKFP N
Subjt:  VVGYKLPAIRAQQA------RVPEAKNDGRRSALLYLGATLFAAAAAASNSSANAGVIEEYLEKSKANKELNDKKRLATSGANFARAYTVQFGTCKFPEN

Query:  FTGCQDLAKQK
        FTGCQDLAKQK
Subjt:  FTGCQDLAKQK

P49107 Photosystem I reaction center subunit N, chloroplastic6.3e-3862.5Show/hide
Query:  MATMNSSVLACNYAISGAGSADLSSKINAAPSVASPGVVGYK-LPAIRAQQARVPEAK-NDGRRSALLYLGATLFAAAAAASNSSANAGVIEEYLEKSKA
        MA MNSSVL C+YAI+G+GS +L+ K+    S    G      +P I+AQ+    +   ++GRRSA+++L ATLF+ AA   ++SANAGVI+EYLE+SK 
Subjt:  MATMNSSVLACNYAISGAGSADLSSKINAAPSVASPGVVGYK-LPAIRAQQARVPEAK-NDGRRSALLYLGATLFAAAAAASNSSANAGVIEEYLEKSKA

Query:  NKELNDKKRLATSGANFARAYTVQFGTCKFPENFTGCQDLAKQK
        NKELNDKKRLATSGANFARA+TVQFG+CKFPENFTGCQDLAKQK
Subjt:  NKELNDKKRLATSGANFARAYTVQFGTCKFPENFTGCQDLAKQK

Q9SBN5 Photosystem I reaction center subunit N, chloroplastic2.0e-1552.29Show/hide
Query:  AIRAQQARVPEAK-----------NDGRRSALLYLGATLFAAAAAASNSSANAGVIEEYLEKSKANKELNDKKRLATSGANFARAYTVQFGTCKFPENFT
        A+RAQ A+V  A+           +   R  LL LG  L AAA A    +ANAGV+E+ L KS ANK LN+KKRLATS AN AR+ TV  GTC+FPENF 
Subjt:  AIRAQQARVPEAK-----------NDGRRSALLYLGATLFAAAAAASNSSANAGVIEEYLEKSKANKELNDKKRLATSGANFARAYTVQFGTCKFPENFT

Query:  GCQDLAKQK
        GC++LA  K
Subjt:  GCQDLAKQK

Arabidopsis top hitse value%identityAlignment
AT1G60560.1 SWIM zinc finger family protein0.0e+0074.33Show/hide
Query:  MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENY
        M IVES+ ++ VQ+P  E+F  ADLTWTKFGT EHHD+VAL+PY RVD FI+GEC+N ECPTRFHIE+G+KRS GSLKE+K DEYLEYR YWCSFGPENY
Subjt:  MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENY

Query:  GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA
        GEGG +LPSR+YRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLAL+IYNERRHVNK+GFVCHGPLDR+AIGP A +IPYICNEIQQQTMSMIYLGIPE 
Subjt:  GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA

Query:  NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST
        N++EKH+E +QRYCGS+A  +SLASQYVHKLGMIIKRSTHELDLDDQASI++W ERNKKSIF YQ++SE + F+LGIQTEWQLQQ++RFGH SL+AADST
Subjt:  NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST

Query:  FGIKRLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKC
        FGIKRLKYPLCTLLVFDSR HALPVAWII+RS+ KSDV KWMK LL RA SVEPG+K++GF+IDDAATE DPI D FCCP+LFSLWR+RRSWL+NVV+KC
Subjt:  FGIKRLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKC

Query:  SSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD
         SIEVQR++FK LG+LV+SIWDGVDT+  LE   +DF+DQTAFM+YF   W+PKI MWLS M++ PLASQEA GAIEAYH+KLK KLFDD+HLGA QRVD
Subjt:  SSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD

Query:  WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHVIK
        WLVHKLTTELHS+YWLDRYADESDSFQNVKEEYI+STSW+RA++IPDS+V+LDE N L AKV SQ+DS V+ +VWNPGSEF+FCDC+WSLQGNLCKH+IK
Subjt:  WLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHVIK

Query:  VNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSNDISSVVNKLPLKWASGKGRTSCRKPSSTVAFPLESNIV
        VN +CENR  Y  SMS +SF+E L N+   PMDDS+ALD+SMA T Q+ D+I++LV L+ +NDIS++VN LP+KW   KGRT+   P+S  AF       
Subjt:  VNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSNDISSVVNKLPLKWASGKGRTSCRKPSSTVAFPLESNIV

Query:  KKAMPKKNRKRKR
             K+++KRKR
Subjt:  KKAMPKKNRKRKR

AT1G60560.2 SWIM zinc finger family protein2.3e-24878.6Show/hide
Query:  MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENY
        M IVES+ ++ VQ+P  E+F  ADLTWTKFGT EHHD+VAL+PY RVD FI+GEC+N ECPTRFHIE+G+KRS GSLKE+K DEYLEYR YWCSFGPENY
Subjt:  MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENY

Query:  GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA
        GEGG +LPSR+YRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLAL+IYNERRHVNK+GFVCHGPLDR+AIGP A +IPYICNEIQQQTMSMIYLGIPE 
Subjt:  GEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEA

Query:  NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST
        N++EKH+E +QRYCGS+A  +SLASQYVHKLGMIIKRSTHELDLDDQASI++W ERNKKSIF YQ++SE + F+LGIQTEWQLQQ++RFGH SL+AADST
Subjt:  NIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADST

Query:  FGIKRLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKC
        FGIKRLKYPLCTLLVFDSR HALPVAWII+RS+ KSDV KWMK LL RA SVEPG+K++GF+IDDAATE DPI D FCCP+LFSLWR+RRSWL+NVV+KC
Subjt:  FGIKRLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKC

Query:  SSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD
         SIEVQR++FK LG+LV+SIWDGVDT+  LE   +DF+DQTAFM+YF   W+PKI MWLS M++ PLASQEA GAIEAYH+KLK KLFDD+HLGA QRVD
Subjt:  SSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVD

AT4G13970.1 zinc ion binding3.7e-17442.92Show/hide
Query:  MAIVESILDLQVQDPPEEEFYSADLTWTKF-GTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPEN
        MA  + I  L VQ+P   EF S DL W+K  G  ++ D +ALIPY RVD F+ GEC+N +CPT FH+E  ++++ G   + K D  LEY  YWCSFGP++
Subjt:  MAIVESILDLQVQDPPEEEFYSADLTWTKF-GTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPEN

Query:  YGEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPE
          +GG + PSR   +  +N A RP S RGC CHF+VKRL A P++AL+IYN  +HV++ GF CHGP D++A G  A   PYI  +++ +  S++Y+G+  
Subjt:  YGEGGCILPSRRYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPE

Query:  ANIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADS
          I+++H E +++  G + + + L  +YV +L   I+RST+ELD DD  SI MWVE ++  +F ++  S+ +PF LGIQTEWQLQQMIRFG+  L+A+DS
Subjt:  ANIVEKHLECLQRYCGSNAKANSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADS

Query:  TFGIKRLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRK
         FG   LKYP+ +L+VFDS   A+PVAWII   F+  D  +WM+AL +R H+ +P WKV+GF++DD   +I  I D+F CPVLFS WR+R +W KN++++
Subjt:  TFGIKRLKYPLCTLLVFDSRQHALPVAWIITRSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRK

Query:  CSSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRV
        C   + + E+ + LG+ V  I     T+ + + F+ DF+    F+EYF+  W P+I  W SA+++ PLASQE   A+E YH +LK +L ++    A+QR 
Subjt:  CSSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQTAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRV

Query:  DWLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISS-TSWHRALQIPDSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHV
        DWLV KL T++HS +WLD Y+ + +  +  KEE++S  TS+ +AL IPDS V +   + + AK+  + D +  H+VWNPGS+F  C CSW+ +G +CKH+
Subjt:  DWLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISS-TSWHRALQIPDSSVSLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHV

Query:  IKVNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSN
        IK+  +C    + + S S   + + L+++ + P  DS+  D +++    +  +I  L  L  S+
Subjt:  IKVNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNSSN

AT5G64040.1 photosystem I reaction center subunit PSI-N, chloroplast, putative / PSI-N, putative (PSAN)4.5e-3962.5Show/hide
Query:  MATMNSSVLACNYAISGAGSADLSSKINAAPSVASPGVVGYK-LPAIRAQQARVPEAK-NDGRRSALLYLGATLFAAAAAASNSSANAGVIEEYLEKSKA
        MA MNSSVL C+YAI+G+GS +L+ K+    S    G      +P I+AQ+    +   ++GRRSA+++L ATLF+ AA   ++SANAGVI+EYLE+SK 
Subjt:  MATMNSSVLACNYAISGAGSADLSSKINAAPSVASPGVVGYK-LPAIRAQQARVPEAK-NDGRRSALLYLGATLFAAAAAASNSSANAGVIEEYLEKSKA

Query:  NKELNDKKRLATSGANFARAYTVQFGTCKFPENFTGCQDLAKQK
        NKELNDKKRLATSGANFARA+TVQFG+CKFPENFTGCQDLAKQK
Subjt:  NKELNDKKRLATSGANFARAYTVQFGTCKFPENFTGCQDLAKQK

AT5G64040.2 photosystem I reaction center subunit PSI-N, chloroplast, putative / PSI-N, putative (PSAN)4.5e-3962.5Show/hide
Query:  MATMNSSVLACNYAISGAGSADLSSKINAAPSVASPGVVGYK-LPAIRAQQARVPEAK-NDGRRSALLYLGATLFAAAAAASNSSANAGVIEEYLEKSKA
        MA MNSSVL C+YAI+G+GS +L+ K+    S    G      +P I+AQ+    +   ++GRRSA+++L ATLF+ AA   ++SANAGVI+EYLE+SK 
Subjt:  MATMNSSVLACNYAISGAGSADLSSKINAAPSVASPGVVGYK-LPAIRAQQARVPEAK-NDGRRSALLYLGATLFAAAAAASNSSANAGVIEEYLEKSKA

Query:  NKELNDKKRLATSGANFARAYTVQFGTCKFPENFTGCQDLAKQK
        NKELNDKKRLATSGANFARA+TVQFG+CKFPENFTGCQDLAKQK
Subjt:  NKELNDKKRLATSGANFARAYTVQFGTCKFPENFTGCQDLAKQK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTATCGTTGAATCAATACTTGATCTTCAGGTACAAGATCCGCCTGAGGAGGAGTTCTATTCAGCTGATTTGACTTGGACCAAGTTTGGCACTGTCGAACATCATGA
TGAAGTAGCTTTGATTCCTTATGTCCGAGTAGATGCATTTATAGTTGGTGAATGTACTAATATAGAGTGCCCGACCCGGTTTCATATTGAGAAGGGAAAGAAACGATCGA
TGGGAAGCTTGAAAGAGTTCAAGGATGATGAATATTTGGAATATCGACAGTATTGGTGCTCATTTGGTCCTGAAAATTATGGGGAAGGTGGATGTATTTTACCCAGTAGA
AGATATAGGCTTAACACGCGAAATCGTGCTGCTAGACCTCAATCAATGCGAGGTTGCACCTGCCATTTTGTTGTAAAGCGATTGTACGCCCGCCCATCTCTTGCACTTAT
AATATATAATGAGAGGCGTCATGTGAACAAGTCTGGCTTTGTTTGCCATGGGCCACTCGACAGAGAGGCCATTGGCCCTAATGCCAATAGGATCCCATATATTTGTAATG
AGATCCAACAACAAACAATGTCAATGATTTATCTGGGAATTCCTGAGGCAAACATTGTTGAGAAACATCTTGAGTGTCTTCAGCGATACTGTGGTTCAAATGCAAAAGCA
AACAGTCTTGCTTCCCAGTATGTTCACAAACTTGGGATGATCATCAAACGGTCTACCCATGAGCTCGATCTGGATGATCAAGCTAGCATTCGCATGTGGGTTGAGCGCAA
TAAAAAATCCATATTTATTTATCAGGATACTTCAGAAGAAAATCCTTTCATTCTTGGGATTCAAACAGAATGGCAATTGCAACAGATGATTCGGTTTGGCCATCGTAGTC
TCATTGCTGCTGATTCAACATTTGGCATTAAGAGGCTTAAGTACCCCCTGTGTACACTTCTTGTGTTTGATTCTAGACAGCATGCACTCCCTGTTGCATGGATCATCACT
CGCAGCTTTGCAAAATCAGATGTATCCAAATGGATGAAGGCCTTACTTGATCGTGCTCATTCTGTAGAGCCCGGATGGAAAGTTAGTGGGTTTTTAATTGACGACGCAGC
CACAGAGATTGATCCTATCACGGACATATTCTGTTGTCCTGTGCTTTTTTCCCTTTGGCGCATTCGTAGATCATGGCTAAAAAATGTTGTTAGGAAATGCAGTAGCATTG
AAGTTCAGAGGGAAATGTTCAAACGGCTGGGAAAATTAGTGCACAGCATTTGGGACGGAGTCGATACTTCGGTTGTCTTGGAAGATTTTATCCGTGATTTCATTGACCAA
ACTGCTTTCATGGAATATTTCAAGGGTTGTTGGGTGCCAAAGATTGAGATGTGGCTTTCTGCTATGAGAGCTTTTCCACTTGCAAGCCAAGAGGCATCTGGTGCCATTGA
AGCCTATCACATGAAGCTGAAGGCAAAACTGTTTGATGACTCTCATCTTGGTGCTTTCCAGAGGGTGGACTGGTTGGTTCACAAGTTGACCACTGAATTGCATTCAACCT
ACTGGCTAGATCGATACGCCGATGAAAGTGATTCATTTCAAAATGTCAAGGAGGAGTATATTTCTTCTACTTCTTGGCACCGTGCGCTGCAAATTCCAGATTCTTCAGTT
TCCTTGGATGAGGAAAATCACCTTTTTGCCAAAGTTCTGAGCCAAAAGGACAGCAGTGTTTCACATATAGTTTGGAATCCAGGGTCAGAATTCTCATTTTGCGATTGTTC
TTGGTCATTGCAAGGAAATCTCTGCAAACATGTGATCAAGGTGAATATGATATGTGAAAATCGCCCGAGTTACAAACCTTCCATGTCTTTTCAATCATTTGAGGAAATAC
TGATGAATATGTGGAAGCTACCAATGGATGATTCTGTTGCCTTGGATGTGTCAATGGCTTGGACCCATCAGATCCTTGATGAAATTCAGAAACTAGTTGAATTGAATTCT
TCAAATGATATTAGCTCTGTGGTAAATAAGCTGCCCTTGAAATGGGCCTCTGGAAAGGGAAGAACCAGTTGTAGGAAACCATCATCTACCGTGGCTTTTCCATTGGAATC
CAATATTGTCAAAAAGGCCATGCCAAAGAAGAACCGGAAAAGAAAAAGATTATCTGGTGGCTTATGGAGTAAAAAGCACAGATTACAACTAGCACGTGGTATGGTGCGGA
TATGTCCACACAAAAGCATAGCTCGAGTCTTGCCGTTGAGGTCCTTTAGAATTTACCAGGAATCATTGCAGGAATATTTGCTGAGACCTACGCAGCAAGCAATGGCGACC
ATGAATTCCAGTGTTCTGGCTTGCAACTACGCCATTTCAGGCGCGGGATCGGCCGACCTGAGCTCTAAAATCAACGCTGCCCCTTCTGTTGCATCTCCTGGGGTTGTGGG
TTACAAGTTGCCTGCGATTAGGGCTCAACAGGCTAGGGTTCCTGAAGCCAAGAATGACGGAAGGAGGTCTGCGCTTCTTTACCTTGGTGCCACCCTCTTTGCTGCGGCTG
CGGCTGCCTCCAACTCCTCTGCTAATGCTGGAGTCATTGAAGAGTACCTTGAGAAGAGTAAAGCTAACAAGGAACTGAATGACAAGAAAAGATTGGCAACAAGTGGGGCA
AACTTTGCAAGGGCATACACCGTTCAATTTGGAACATGCAAGTTCCCAGAGAACTTCACAGGGTGCCAAGATCTTGCTAAACAAAAGGTTTGA
mRNA sequenceShow/hide mRNA sequence
CAAACATCTTCTTCGTTCGTTCTCCTCTGTTTCAACTCTTCATTAATGCCTGCGTCTTCAGGGATTCACAAAACCATCGCTGGTCAGACTCTTTTCTTGTTCCACCTCAC
CGACTGCTTGGCATTCATTTCCAAGAAGGAACAATCAAGCTCGAGTTTTGACCCATATGCGCAGAAGCATGTTCTTAAAACCCAGAACTGACTGACCAGCAACTAGACCT
GCCAGGAATTTGTTACTTCGGAGATTGGTTGTTGCGTTCAGAATTCGCGTTTCTAGCTATTCGAATTTAGTTTTCGCCTGATGCTCTATTGCTAGGAATTGGAGTTCGGC
AGCTAGATTTCGATGGCTATCGTTGAATCAATACTTGATCTTCAGGTACAAGATCCGCCTGAGGAGGAGTTCTATTCAGCTGATTTGACTTGGACCAAGTTTGGCACTGT
CGAACATCATGATGAAGTAGCTTTGATTCCTTATGTCCGAGTAGATGCATTTATAGTTGGTGAATGTACTAATATAGAGTGCCCGACCCGGTTTCATATTGAGAAGGGAA
AGAAACGATCGATGGGAAGCTTGAAAGAGTTCAAGGATGATGAATATTTGGAATATCGACAGTATTGGTGCTCATTTGGTCCTGAAAATTATGGGGAAGGTGGATGTATT
TTACCCAGTAGAAGATATAGGCTTAACACGCGAAATCGTGCTGCTAGACCTCAATCAATGCGAGGTTGCACCTGCCATTTTGTTGTAAAGCGATTGTACGCCCGCCCATC
TCTTGCACTTATAATATATAATGAGAGGCGTCATGTGAACAAGTCTGGCTTTGTTTGCCATGGGCCACTCGACAGAGAGGCCATTGGCCCTAATGCCAATAGGATCCCAT
ATATTTGTAATGAGATCCAACAACAAACAATGTCAATGATTTATCTGGGAATTCCTGAGGCAAACATTGTTGAGAAACATCTTGAGTGTCTTCAGCGATACTGTGGTTCA
AATGCAAAAGCAAACAGTCTTGCTTCCCAGTATGTTCACAAACTTGGGATGATCATCAAACGGTCTACCCATGAGCTCGATCTGGATGATCAAGCTAGCATTCGCATGTG
GGTTGAGCGCAATAAAAAATCCATATTTATTTATCAGGATACTTCAGAAGAAAATCCTTTCATTCTTGGGATTCAAACAGAATGGCAATTGCAACAGATGATTCGGTTTG
GCCATCGTAGTCTCATTGCTGCTGATTCAACATTTGGCATTAAGAGGCTTAAGTACCCCCTGTGTACACTTCTTGTGTTTGATTCTAGACAGCATGCACTCCCTGTTGCA
TGGATCATCACTCGCAGCTTTGCAAAATCAGATGTATCCAAATGGATGAAGGCCTTACTTGATCGTGCTCATTCTGTAGAGCCCGGATGGAAAGTTAGTGGGTTTTTAAT
TGACGACGCAGCCACAGAGATTGATCCTATCACGGACATATTCTGTTGTCCTGTGCTTTTTTCCCTTTGGCGCATTCGTAGATCATGGCTAAAAAATGTTGTTAGGAAAT
GCAGTAGCATTGAAGTTCAGAGGGAAATGTTCAAACGGCTGGGAAAATTAGTGCACAGCATTTGGGACGGAGTCGATACTTCGGTTGTCTTGGAAGATTTTATCCGTGAT
TTCATTGACCAAACTGCTTTCATGGAATATTTCAAGGGTTGTTGGGTGCCAAAGATTGAGATGTGGCTTTCTGCTATGAGAGCTTTTCCACTTGCAAGCCAAGAGGCATC
TGGTGCCATTGAAGCCTATCACATGAAGCTGAAGGCAAAACTGTTTGATGACTCTCATCTTGGTGCTTTCCAGAGGGTGGACTGGTTGGTTCACAAGTTGACCACTGAAT
TGCATTCAACCTACTGGCTAGATCGATACGCCGATGAAAGTGATTCATTTCAAAATGTCAAGGAGGAGTATATTTCTTCTACTTCTTGGCACCGTGCGCTGCAAATTCCA
GATTCTTCAGTTTCCTTGGATGAGGAAAATCACCTTTTTGCCAAAGTTCTGAGCCAAAAGGACAGCAGTGTTTCACATATAGTTTGGAATCCAGGGTCAGAATTCTCATT
TTGCGATTGTTCTTGGTCATTGCAAGGAAATCTCTGCAAACATGTGATCAAGGTGAATATGATATGTGAAAATCGCCCGAGTTACAAACCTTCCATGTCTTTTCAATCAT
TTGAGGAAATACTGATGAATATGTGGAAGCTACCAATGGATGATTCTGTTGCCTTGGATGTGTCAATGGCTTGGACCCATCAGATCCTTGATGAAATTCAGAAACTAGTT
GAATTGAATTCTTCAAATGATATTAGCTCTGTGGTAAATAAGCTGCCCTTGAAATGGGCCTCTGGAAAGGGAAGAACCAGTTGTAGGAAACCATCATCTACCGTGGCTTT
TCCATTGGAATCCAATATTGTCAAAAAGGCCATGCCAAAGAAGAACCGGAAAAGAAAAAGATTATCTGGTGGCTTATGGAGTAAAAAGCACAGATTACAACTAGCACGTG
GTATGGTGCGGATATGTCCACACAAAAGCATAGCTCGAGTCTTGCCGTTGAGGTCCTTTAGAATTTACCAGGAATCATTGCAGGAATATTTGCTGAGACCTACGCAGCAA
GCAATGGCGACCATGAATTCCAGTGTTCTGGCTTGCAACTACGCCATTTCAGGCGCGGGATCGGCCGACCTGAGCTCTAAAATCAACGCTGCCCCTTCTGTTGCATCTCC
TGGGGTTGTGGGTTACAAGTTGCCTGCGATTAGGGCTCAACAGGCTAGGGTTCCTGAAGCCAAGAATGACGGAAGGAGGTCTGCGCTTCTTTACCTTGGTGCCACCCTCT
TTGCTGCGGCTGCGGCTGCCTCCAACTCCTCTGCTAATGCTGGAGTCATTGAAGAGTACCTTGAGAAGAGTAAAGCTAACAAGGAACTGAATGACAAGAAAAGATTGGCA
ACAAGTGGGGCAAACTTTGCAAGGGCATACACCGTTCAATTTGGAACATGCAAGTTCCCAGAGAACTTCACAGGGTGCCAAGATCTTGCTAAACAAAAGGTTTGA
Protein sequenceShow/hide protein sequence
MAIVESILDLQVQDPPEEEFYSADLTWTKFGTVEHHDEVALIPYVRVDAFIVGECTNIECPTRFHIEKGKKRSMGSLKEFKDDEYLEYRQYWCSFGPENYGEGGCILPSR
RYRLNTRNRAARPQSMRGCTCHFVVKRLYARPSLALIIYNERRHVNKSGFVCHGPLDREAIGPNANRIPYICNEIQQQTMSMIYLGIPEANIVEKHLECLQRYCGSNAKA
NSLASQYVHKLGMIIKRSTHELDLDDQASIRMWVERNKKSIFIYQDTSEENPFILGIQTEWQLQQMIRFGHRSLIAADSTFGIKRLKYPLCTLLVFDSRQHALPVAWIIT
RSFAKSDVSKWMKALLDRAHSVEPGWKVSGFLIDDAATEIDPITDIFCCPVLFSLWRIRRSWLKNVVRKCSSIEVQREMFKRLGKLVHSIWDGVDTSVVLEDFIRDFIDQ
TAFMEYFKGCWVPKIEMWLSAMRAFPLASQEASGAIEAYHMKLKAKLFDDSHLGAFQRVDWLVHKLTTELHSTYWLDRYADESDSFQNVKEEYISSTSWHRALQIPDSSV
SLDEENHLFAKVLSQKDSSVSHIVWNPGSEFSFCDCSWSLQGNLCKHVIKVNMICENRPSYKPSMSFQSFEEILMNMWKLPMDDSVALDVSMAWTHQILDEIQKLVELNS
SNDISSVVNKLPLKWASGKGRTSCRKPSSTVAFPLESNIVKKAMPKKNRKRKRLSGGLWSKKHRLQLARGMVRICPHKSIARVLPLRSFRIYQESLQEYLLRPTQQAMAT
MNSSVLACNYAISGAGSADLSSKINAAPSVASPGVVGYKLPAIRAQQARVPEAKNDGRRSALLYLGATLFAAAAAASNSSANAGVIEEYLEKSKANKELNDKKRLATSGA
NFARAYTVQFGTCKFPENFTGCQDLAKQKV