; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh14G003650 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh14G003650
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionNuclear transcription factor Y subunit
Genome locationCmo_Chr14:1694864..1704593
RNA-Seq ExpressionCmoCh14G003650
SyntenyCmoCh14G003650
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:1901259 - chloroplast rRNA processing (biological process)
GO:0005634 - nucleus (cellular component)
GO:0009507 - chloroplast (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003729 - mRNA binding (molecular function)
InterPro domainsIPR000504 - RNA recognition motif domain
IPR001289 - Nuclear transcription factor Y subunit A
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR035979 - RNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580756.1 hypothetical protein SDJN03_20758, partial [Cucurbita argyrosperma subsp. sororia]1.4e-16999.38Show/hide
Query:  MSASSLTMAAAAASV-SSSSSSSSSSPCKKLFFAQHLNRIPSHFSPKQKPLKLLELRIHLPNIYPLAFSSASHLYCAPPAFEGLEVSDPITEDAETEESD
        MSASSLTMAAAAASV SSSSSSSSSSPCKKLFFAQHLNRIPSHFSPKQKPLKLLELRIHLPNIYPLAFSSASHLYCAPPAFEGLEVSDPITEDAETEESD
Subjt:  MSASSLTMAAAAASV-SSSSSSSSSSPCKKLFFAQHLNRIPSHFSPKQKPLKLLELRIHLPNIYPLAFSSASHLYCAPPAFEGLEVSDPITEDAETEESD

Query:  GSEESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTVRVNFPEVPR
        G EESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTVRVNFPEVPR
Subjt:  GSEESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTVRVNFPEVPR

Query:  GGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEGRPLRLNMAA
        GGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEGRPLRLNMAA
Subjt:  GGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEGRPLRLNMAA

Query:  DRAPTSPAAFTRTENTIDSNELLT
        DRAPTSPAAFTRTENTIDSNELLT
Subjt:  DRAPTSPAAFTRTENTIDSNELLT

KAG6580757.1 Nuclear transcription factor Y subunit A-10, partial [Cucurbita argyrosperma subsp. sororia]5.4e-20198.59Show/hide
Query:  ELGRWKMAPQTGYLKEHEGVVPNSLGQLSSSPARLWSGYGQGCQSFFGDFGQVKASSTIQQLGSNGKEFTGTEQALQVAAHGFEKLNTAPFSIYPGDCKI
        ELGRWKMAPQTGYLKEHEGVVPNSLGQLSSSPARLWSGYGQGCQSFFGDFGQVKASSTIQQLGSNGKEFTGTEQALQVAAHGFEKLNTAPFSIYPGDCKI
Subjt:  ELGRWKMAPQTGYLKEHEGVVPNSLGQLSSSPARLWSGYGQGCQSFFGDFGQVKASSTIQQLGSNGKEFTGTEQALQVAAHGFEKLNTAPFSIYPGDCKI

Query:  SVDPQKPSPIFSLQSPLSEYHNRFELGFGQPMVCANYPYMEQHYGILSAYAPQIPGRIMLPMSLTTDDGPIYVNAKQYHGIIRRRKIRAKAMMENRLAKT
        SVDPQKPSPIFSLQSPLSEYHNRFELGFGQPMVCANYPYMEQHYGILSAYAPQIPGRIMLPMSLTTDDGPIYVNAKQYHGIIRRRKIRAKAMMENRLAKT
Subjt:  SVDPQKPSPIFSLQSPLSEYHNRFELGFGQPMVCANYPYMEQHYGILSAYAPQIPGRIMLPMSLTTDDGPIYVNAKQYHGIIRRRKIRAKAMMENRLAKT

Query:  RKPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKASMEPKKIEDINLSDSTGSSQCSVVLQSESGALNSPNGGKGRGFSLSSSERSLMVEGMAWFCPNGL
        RKPYMHESRHLHAMRRPRGSGGRFLNTK LKNGKASMEPKKIED+NLSDSTGSSQCSVVLQSESGALNSPNGGKGRGFSLSSSERSLMVEGMAWFCPNGL
Subjt:  RKPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKASMEPKKIEDINLSDSTGSSQCSVVLQSESGALNSPNGGKGRGFSLSSSERSLMVEGMAWFCPNGL

Query:  QQLTTAVTFVFDREENGFLVRRIAPKPQFSATKSSLCNVVHDNCLKLRDEASPGV
        QQLTTAVTFVFDREENGFLVR IAPKPQFSATKSSLCNVVHDNCLKLRDE  PGV
Subjt:  QQLTTAVTFVFDREENGFLVRRIAPKPQFSATKSSLCNVVHDNCLKLRDEASPGV

KAG7017509.1 hypothetical protein SDJN02_19374 [Cucurbita argyrosperma subsp. argyrosperma]4.9e-17099.38Show/hide
Query:  MSASSLTMAAAAASVSSSSSSSSSSPCKKLFFAQHLNRIPSHFSPKQKPLKLLELRIHLPNIYPLAFSSASHLYCAPPAFEGLEVSDPITEDAETEESDG
        MSASSLTMAAAAASVSSSSSSSSSSPCKKLFFAQHLNRIPS FSPKQKPLKLLELRIHLPNIYPLAFSSASHLYCAPPAFEGLEVSDPITEDAETEESDG
Subjt:  MSASSLTMAAAAASVSSSSSSSSSSPCKKLFFAQHLNRIPSHFSPKQKPLKLLELRIHLPNIYPLAFSSASHLYCAPPAFEGLEVSDPITEDAETEESDG

Query:  SEESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTVRVNFPEVPRG
         EESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTVRVNFPEVPRG
Subjt:  SEESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTVRVNFPEVPRG

Query:  GEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEGRPLRLNMAAD
        GEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEGRPLRLNMAAD
Subjt:  GEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEGRPLRLNMAAD

Query:  RAPTSPAAFTRTENTIDSNELLT
        RAPTSPAAFTRTENTIDSNELLT
Subjt:  RAPTSPAAFTRTENTIDSNELLT

XP_022934919.1 33 kDa ribonucleoprotein, chloroplastic [Cucurbita moschata]2.0e-171100Show/hide
Query:  MSASSLTMAAAAASVSSSSSSSSSSPCKKLFFAQHLNRIPSHFSPKQKPLKLLELRIHLPNIYPLAFSSASHLYCAPPAFEGLEVSDPITEDAETEESDG
        MSASSLTMAAAAASVSSSSSSSSSSPCKKLFFAQHLNRIPSHFSPKQKPLKLLELRIHLPNIYPLAFSSASHLYCAPPAFEGLEVSDPITEDAETEESDG
Subjt:  MSASSLTMAAAAASVSSSSSSSSSSPCKKLFFAQHLNRIPSHFSPKQKPLKLLELRIHLPNIYPLAFSSASHLYCAPPAFEGLEVSDPITEDAETEESDG

Query:  SEESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTVRVNFPEVPRG
        SEESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTVRVNFPEVPRG
Subjt:  SEESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTVRVNFPEVPRG

Query:  GEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEGRPLRLNMAAD
        GEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEGRPLRLNMAAD
Subjt:  GEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEGRPLRLNMAAD

Query:  RAPTSPAAFTRTENTIDSNELLT
        RAPTSPAAFTRTENTIDSNELLT
Subjt:  RAPTSPAAFTRTENTIDSNELLT

XP_023527627.1 33 kDa ribonucleoprotein, chloroplastic [Cucurbita pepo subsp. pepo]1.1e-16998.78Show/hide
Query:  MSASSLTMAAAAASV----SSSSSSSSSSPCKKLFFAQHLNRIPSHFSPKQKPLKLLELRIHLPNIYPLAFSSASHLYCAPPAFEGLEVSDPITEDAETE
        MSASSLTMAAAAASV    SSSSSSSSSSPCKKLFFAQHLNRIPSHFSPKQKPLKLLELRIHLPNIYPLAFSSASHLYCAPPAFEGLEVSDPITEDAETE
Subjt:  MSASSLTMAAAAASV----SSSSSSSSSSPCKKLFFAQHLNRIPSHFSPKQKPLKLLELRIHLPNIYPLAFSSASHLYCAPPAFEGLEVSDPITEDAETE

Query:  ESDGSEESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTVRVNFPE
        ESDGSEESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTVRVNFPE
Subjt:  ESDGSEESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTVRVNFPE

Query:  VPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEGRPLRLN
        VPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEGRPLRLN
Subjt:  VPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEGRPLRLN

Query:  MAADRAPTSPAAFTRTENTIDSNELLT
        MAADRAPTSPAAFTRTENTIDSNELLT
Subjt:  MAADRAPTSPAAFTRTENTIDSNELLT

TrEMBL top hitse value%identityAlignment
A0A5D3DNY3 Nuclear transcription factor Y subunit5.3e-14676.92Show/hide
Query:  MAPQTGYLKEHEGVVPNSLGQLSSSPARLWSGYGQGCQSFFGDFGQVKASSTIQQLGSNGKEFTGTEQALQVAAHGFEKLNTAPFSIYPGDCKISVDPQK
        MAPQTG LKEHE ++PNSLGQLS SPARLWS +GQG QS FGDFGQVKASS IQQLGSNGKEF GT+   QV +HG +KLNTAPFSIYPGD K+S+D QK
Subjt:  MAPQTGYLKEHEGVVPNSLGQLSSSPARLWSGYGQGCQSFFGDFGQVKASSTIQQLGSNGKEFTGTEQALQVAAHGFEKLNTAPFSIYPGDCKISVDPQK

Query:  PSPIFSLQSPLSEYHNRFELGFGQPMVCANYPYMEQHYGILSAYAPQIPGRIMLPMSLTTDDGPIYVNAKQYHGIIRRRKIRAKAMMENRLAKTR-----
        PSP+FSLQSPL+EYHNRFELGFGQP++C NYPYMEQHYGILSAY PQIPGRIMLPMSLT+DDGPIYVNAKQYHGIIRRR+IRAKAMMEN+LA+T+     
Subjt:  PSPIFSLQSPLSEYHNRFELGFGQPMVCANYPYMEQHYGILSAYAPQIPGRIMLPMSLTTDDGPIYVNAKQYHGIIRRRKIRAKAMMENRLAKTR-----

Query:  -------------------------KPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKASMEPKKIEDINLSDSTGSSQCSVVLQSESGALNSPNGGKGR
                                 +PYMHESRHLHAMRRPRGSGGRFLNTKNLKNGK+SMEPKKI+++NLSDSTG SQCSVVLQSESG LNSPN  KGR
Subjt:  -------------------------KPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKASMEPKKIEDINLSDSTGSSQCSVVLQSESGALNSPNGGKGR

Query:  GFSLSSSERSLMVEGMAWFCPNGLQQLTTAVTFVFDREENGFLVRRIAPKP
        GFSLSSSERSLM E MAWFCPNGLQQLTTAVTFVF+ E  GFLV+ IAPKP
Subjt:  GFSLSSSERSLMVEGMAWFCPNGLQQLTTAVTFVFDREENGFLVRRIAPKP

A0A6J1F452 Nuclear transcription factor Y subunit5.1e-15785.93Show/hide
Query:  MAPQTGYLKEHEGVVPNSLGQLSSSPARLWSGYGQGCQSFFGDFGQVKASSTIQQLGSNGKEFTGTEQALQVAAHGFEKLNTAPFSIYPGDCKISVDPQK
        MAPQTGYLKEHEGVVPNSLGQLSSSPARLWSGYGQGCQSFFGDFGQVKASSTIQQLGSNGKEFTGTEQALQVAAHGFEKLNTAPFSIYPGDCKISVDPQK
Subjt:  MAPQTGYLKEHEGVVPNSLGQLSSSPARLWSGYGQGCQSFFGDFGQVKASSTIQQLGSNGKEFTGTEQALQVAAHGFEKLNTAPFSIYPGDCKISVDPQK

Query:  PSPIFSLQSPLSEYHNRFELGFGQPMVCANYPYMEQHYGILSAYAPQIPGRIMLPMSLTTDDGPIYVNAKQYHGIIRRRKIRAKAMMENRLAKTRKPYMH
        PSPIFSLQSPLSEYHNRFELGFGQPMVCANYPYMEQHYGILSAYAPQIPGRIMLPMSLTTDDGPIYVNAKQYHGIIRRRKIRAKAMMENRLAKTRKPYMH
Subjt:  PSPIFSLQSPLSEYHNRFELGFGQPMVCANYPYMEQHYGILSAYAPQIPGRIMLPMSLTTDDGPIYVNAKQYHGIIRRRKIRAKAMMENRLAKTRKPYMH

Query:  ESRHLHAMRRPRGSGGRFLNTKNLKNGKASMEPKKIEDINLSDSTGSSQCSVVLQSESGALNSPNGGKGRGFSLSSSE-RSLMVEGMAWFCPNGLQQLTT
        ESRHLHAMRRPRGSGGRFLNTKNLKNGKASMEPKKIEDINLSDSTGSSQCSVVLQSESGALNSPNGGKGRGFSLSSSE  S    G+  F  N       
Subjt:  ESRHLHAMRRPRGSGGRFLNTKNLKNGKASMEPKKIEDINLSDSTGSSQCSVVLQSESGALNSPNGGKGRGFSLSSSE-RSLMVEGMAWFCPNGLQQLTT

Query:  AVTFVFDREENGFLVRRIAPKPQFSATKSSLCNV
        ++  + D   +G ++      P++ A   + C++
Subjt:  AVTFVFDREENGFLVRRIAPKPQFSATKSSLCNV

A0A6J1F948 33 kDa ribonucleoprotein, chloroplastic9.7e-172100Show/hide
Query:  MSASSLTMAAAAASVSSSSSSSSSSPCKKLFFAQHLNRIPSHFSPKQKPLKLLELRIHLPNIYPLAFSSASHLYCAPPAFEGLEVSDPITEDAETEESDG
        MSASSLTMAAAAASVSSSSSSSSSSPCKKLFFAQHLNRIPSHFSPKQKPLKLLELRIHLPNIYPLAFSSASHLYCAPPAFEGLEVSDPITEDAETEESDG
Subjt:  MSASSLTMAAAAASVSSSSSSSSSSPCKKLFFAQHLNRIPSHFSPKQKPLKLLELRIHLPNIYPLAFSSASHLYCAPPAFEGLEVSDPITEDAETEESDG

Query:  SEESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTVRVNFPEVPRG
        SEESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTVRVNFPEVPRG
Subjt:  SEESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTVRVNFPEVPRG

Query:  GEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEGRPLRLNMAAD
        GEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEGRPLRLNMAAD
Subjt:  GEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEGRPLRLNMAAD

Query:  RAPTSPAAFTRTENTIDSNELLT
        RAPTSPAAFTRTENTIDSNELLT
Subjt:  RAPTSPAAFTRTENTIDSNELLT

A0A6J1IY51 Nuclear transcription factor Y subunit9.7e-15689.84Show/hide
Query:  MAPQTGYLKEHEGVVPNSLGQLSSSPARLWSGYGQGCQSFFGDFGQVKASSTIQQLGSNGKEFTGTEQALQVAAHGFEKLNTAPFSIYPGDCKISVDPQK
        MAPQTGYLKEHEGVVPNSLGQLSSSPARLWSGYGQGCQSFFGDFGQ+KASSTIQQLGSNGKEFTGTEQALQVAAHGFEKLNTAPFSIYPGDCKISVDPQK
Subjt:  MAPQTGYLKEHEGVVPNSLGQLSSSPARLWSGYGQGCQSFFGDFGQVKASSTIQQLGSNGKEFTGTEQALQVAAHGFEKLNTAPFSIYPGDCKISVDPQK

Query:  PSPIFSLQSPLSEYHNRFELGFGQPMVCANYPYMEQHYGILSAYAPQIPGRIMLPMSLTTDDGPIYVNAKQYHGIIRRRKIRAKAMMENRLAKTRKPYMH
        PSPIFSLQSPLSEYHNRFELGFGQPMVCANYPYMEQHYGILSAYAPQIPGRIMLPMSLTTDDGPIYVNAKQYHGIIRRRKIRAKAMMENRLAKTRKPYMH
Subjt:  PSPIFSLQSPLSEYHNRFELGFGQPMVCANYPYMEQHYGILSAYAPQIPGRIMLPMSLTTDDGPIYVNAKQYHGIIRRRKIRAKAMMENRLAKTRKPYMH

Query:  ESRHLHAMRRPRGSGGRFLNTKNLKNGKASMEPKKIEDINLSDSTGSSQCSVVLQSESGALNSPNGGKGRGFSLSSSE-RSLMVEGMAWFCPNGLQQLTT
        ESRHLHAMRRPRGSGGRFLNTK LKNGKASMEPKKIED+NLSDSTGSSQCSVVLQSESGALNSPN GKGRGFSLSSSE  S+   G+  F  N L     
Subjt:  ESRHLHAMRRPRGSGGRFLNTKNLKNGKASMEPKKIEDINLSDSTGSSQCSVVLQSESGALNSPNGGKGRGFSLSSSE-RSLMVEGMAWFCPNGLQQLTT

Query:  AVTFVFDREENGFLV
        + T + D  E+G ++
Subjt:  AVTFVFDREENGFLV

A0A6J1J651 33 kDa ribonucleoprotein, chloroplastic2.3e-16597.22Show/hide
Query:  MSASSLTMAAAAASVSSSSSSSSSSPCKKLFFAQHL-NRIPSHFSPKQKPLKLLELRIHLPNIYPLAFSSASHLYCAPPAFEGLEVSDPITEDAETEESD
        MSASSLTMAAAAASVSSSSSSSSSSPCKKLFF   L NRIPSHFSPK KPLKLLELRIHLPN YPLAFSSASHLYCAPPAFEGLEVSDPI+E+AETEESD
Subjt:  MSASSLTMAAAAASVSSSSSSSSSSPCKKLFFAQHL-NRIPSHFSPKQKPLKLLELRIHLPNIYPLAFSSASHLYCAPPAFEGLEVSDPITEDAETEESD

Query:  GSEESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTVRVNFPEVPR
        G EESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTVRVNFPEVPR
Subjt:  GSEESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTVRVNFPEVPR

Query:  GGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEGRPLRLNMAA
        GGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEGRPLRLNMAA
Subjt:  GGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEGRPLRLNMAA

Query:  DRAPTSPAAFTRTENTIDSNELLT
        DRAPTSPAAFTRTENTIDSNELLT
Subjt:  DRAPTSPAAFTRTENTIDSNELLT

SwissProt top hitse value%identityAlignment
P19684 33 kDa ribonucleoprotein, chloroplastic4.9e-8053.54Show/hide
Query:  MAAAAASVSSSSSSSSSS-----PCKKLFFAQH--LNRIPSHFSPKQKPLKLLELRIHLPNIYPLAFSSASHLYCAPPAFEGLEVSDPITEDAETEESDG
        M+    S ++++S+SS+S       K  F   H  L+   +HF+ K   +   +L+ H P I  L  SS     CA  + +G+EV     E+     ++ 
Subjt:  MAAAAASVSSSSSSSSSS-----PCKKLFFAQH--LNRIPSHFSPKQKPLKLLELRIHLPNIYPLAFSSASHLYCAPPAFEGLEVSDPITEDAETEESDG

Query:  SEESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTVRVNFPEVPRG
         EE  E++E+  S S + G+LY+GNLP++MTSSQL+E+FAEAG V +V+++YD+VTDRSRGFAFVTM ++EEAKEAIR+FDGS +GGRTV+VNFPEVPRG
Subjt:  SEESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTVRVNFPEVPRG

Query:  GEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEGRPLRLNMAAD
        GE+EVM  KIRS+Y  FVDSPHK+Y  NL W LTSQ LR+AF +QPG +SAKVIYDR SGRSRGFGF++F +AE   SALD+MN VE+EGRPLRLN+A  
Subjt:  GEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEGRPLRLNMAAD

Query:  RAPTS--PAAFTRTENTIDSNELLT
        +AP S  P   T  EN  D++ELL+
Subjt:  RAPTS--PAAFTRTENTIDSNELLT

P49314 31 kDa ribonucleoprotein, chloroplastic1.0e-4042.71Show/hide
Query:  KLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTVRVNFPEVP-----------RGGEKEVMGP
        KL++GNLP+++ S+ L  +F  AG+V  V+VIYDK++ RSRGF FVTM+T EE + A + F+G  I GR +RVN    P           RGG     G 
Subjt:  KLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTVRVNFPEVP-----------RGGEKEVMGP

Query:  KIRSSY------NKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEGRPLRLNMAADR
        +  +S        + VDS +++Y GNL WG+   +L+E F  Q  ++ AKV+YDR+SGRSRGFGFV++ +A++   A+DS+NG++++GR +R++ A +R
Subjt:  KIRSSY------NKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEGRPLRLNMAADR

Q08935 29 kDa ribonucleoprotein A, chloroplastic6.1e-3842.25Show/hide
Query:  EDAETEESDGSEESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTV
        ED E +  DG EE R       + S D  K+++GNLP++  S+ L E+F  AG+V  V+VIYDK+T RSRGF FVTM++ EE + A + F+G  + GR +
Subjt:  EDAETEESDGSEESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTV

Query:  RVNFPEVPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEG
        RVN    P   E        R   +   DS +++Y GNL WG+   +L   F  Q  ++ AKV+YDR+SGRSRGFGFV++ +AE+  +A++S++GV++ G
Subjt:  RVNFPEVPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEG

Query:  RPLRLNMAADRAP
        R +R++ A  R P
Subjt:  RPLRLNMAADRAP

Q08937 29 kDa ribonucleoprotein B, chloroplastic5.0e-4037.93Show/hide
Query:  LAFSSASHLYCAPPAFE-------GLEVSDPITEDAETEESDGSEESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTD
        L+ SS+S   C+   FE        L   D + +D E  E     E                KL++GNLP+++ S+ L  +F  AG+V  V+VIYDK+T 
Subjt:  LAFSSASHLYCAPPAFE-------GLEVSDPITEDAETEESDGSEESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTD

Query:  RSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTVRVNFPEVP-----------RGGEKEVMGPKIRSSY------NKFVDSPHKIYAGNLGWGLTSQSLRE
        RSRGF FVTM+T EE + A + F+G  I GR +RVN    P           RGG     G +  +S        + VDS +++Y GNL WG+   +L+E
Subjt:  RSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTVRVNFPEVP-----------RGGEKEVMGPKIRSSY------NKFVDSPHKIYAGNLGWGLTSQSLRE

Query:  AFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEGRPLRLNMAADR
         F  Q  ++ AKV+YDR+SGRSRGFGFV++ ++++   A+DS+NGV+++GR +R++ A +R
Subjt:  AFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEGRPLRLNMAADR

Q39061 RNA-binding protein CP33, chloroplastic7.8e-7849.85Show/hide
Query:  SLTMAAAAASVSSSSSSSSSSPCKKLFFAQHLNRIPSHFSPKQKPLKLLELRIHLPNIYPLAFSSASHLYCAPPAFEGLEVSDPI-----TEDAETEESD
        S    ++A +VS+++++SS++    L  +   +++   F+PK       +L  + PN   L  +   H +      E     D I      E+   EE D
Subjt:  SLTMAAAAASVSSSSSSSSSSPCKKLFFAQHLNRIPSHFSPKQKPLKLLELRIHLPNIYPLAFSSASHLYCAPPAFEGLEVSDPI-----TEDAETEESD

Query:  GSEESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTVRVNFPEVPR
          EE  EE++Q   AS + G+LY+GNLPY +TSS+L+++F EAG VV VQ++YDKVTDRSRGF FVTM ++EEAKEA++MF+ S IGGRTV+VNFPEVPR
Subjt:  GSEESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTVRVNFPEVPR

Query:  GGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEGRPLRLNMAA
        GGE EVM  KIR +   +VDSPHK+YAGNLGW LTSQ L++AF +QPG+L AKVIY+R +GRSRGFGF+SFE+AE+ +SAL +MNGVEVEGR LRLN+A+
Subjt:  GGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEGRPLRLNMAA

Query:  DR-----APTSPAAFTRTENTIDSNELLT
        +R     +P S       E +++SNE+L+
Subjt:  DR-----APTSPAAFTRTENTIDSNELLT

Arabidopsis top hitse value%identityAlignment
AT2G37220.1 RNA-binding (RRM/RBD/RNP motifs) family protein1.6e-3839.65Show/hide
Query:  ITEDAETEESDGSEESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGR
        IT + E EE DG  +     EQ  SA     KL++GNLP+ + S+QL ++F  AG+V  V+VIYDK+T RSRGF FVTM+++ E + A + F+G  + GR
Subjt:  ITEDAETEESDGSEESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGR

Query:  TVRVNF-PEVPRGGEKEVMGPKIRSSYNKF-----------VDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDA
         +RVN  P  P+  +    GP  RSS+                S +++Y GNL WG+   +L   F  Q  ++ A+VIYDR+SGRS+GFGFV+++++++ 
Subjt:  TVRVNF-PEVPRGGEKEVMGPKIRSSYNKF-----------VDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDA

Query:  ESALDSMNGVEVEGRPLRLNMAADRAP
        ++A+ S++G +++GR +R++ A  R P
Subjt:  ESALDSMNGVEVEGRPLRLNMAADRAP

AT3G52380.1 chloroplast RNA-binding protein 335.6e-7949.85Show/hide
Query:  SLTMAAAAASVSSSSSSSSSSPCKKLFFAQHLNRIPSHFSPKQKPLKLLELRIHLPNIYPLAFSSASHLYCAPPAFEGLEVSDPI-----TEDAETEESD
        S    ++A +VS+++++SS++    L  +   +++   F+PK       +L  + PN   L  +   H +      E     D I      E+   EE D
Subjt:  SLTMAAAAASVSSSSSSSSSSPCKKLFFAQHLNRIPSHFSPKQKPLKLLELRIHLPNIYPLAFSSASHLYCAPPAFEGLEVSDPI-----TEDAETEESD

Query:  GSEESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTVRVNFPEVPR
          EE  EE++Q   AS + G+LY+GNLPY +TSS+L+++F EAG VV VQ++YDKVTDRSRGF FVTM ++EEAKEA++MF+ S IGGRTV+VNFPEVPR
Subjt:  GSEESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTVRVNFPEVPR

Query:  GGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEGRPLRLNMAA
        GGE EVM  KIR +   +VDSPHK+YAGNLGW LTSQ L++AF +QPG+L AKVIY+R +GRSRGFGF+SFE+AE+ +SAL +MNGVEVEGR LRLN+A+
Subjt:  GGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSRGFGFVSFETAEDAESALDSMNGVEVEGRPLRLNMAA

Query:  DR-----APTSPAAFTRTENTIDSNELLT
        +R     +P S       E +++SNE+L+
Subjt:  DR-----APTSPAAFTRTENTIDSNELLT

AT5G06510.1 nuclear factor Y, subunit A108.1e-3845.5Show/hide
Query:  GSNGKEFTGTEQALQVAAHGF--EKLNTAPFSIYPGDCKISVDPQKPSPIFSLQSPLSEYHNRFELGFGQPMVCANYPYMEQHYGILSAYAPQ-IPGRIM
        G     FTG +     A  G   ++ +T  F+  PG  K S D  KP   F++QS        FE GF QPM+   +P++EQ+YG++SAY  Q   GR+M
Subjt:  GSNGKEFTGTEQALQVAAHGF--EKLNTAPFSIYPGDCKISVDPQKPSPIFSLQSPLSEYHNRFELGFGQPMVCANYPYMEQHYGILSAYAPQ-IPGRIM

Query:  LPMSL-TTDDGPIYVNAKQYHGIIRRRKIRAKAMMENRLAKTRKPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKASMEPKKIEDINLSDSTGSSQCSV
        +P+ + T +DG IYVN+KQYHGIIRRR+ RAKA    +L++ RKPYMH SRHLHAMRRPRGSGGRFLNTK     K S                +SQ S 
Subjt:  LPMSL-TTDDGPIYVNAKQYHGIIRRRKIRAKAMMENRLAKTRKPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKASMEPKKIEDINLSDSTGSSQCSV

Query:  VLQSESGALNS
        V   E+  +NS
Subjt:  VLQSESGALNS

AT5G06510.2 nuclear factor Y, subunit A108.1e-3845.5Show/hide
Query:  GSNGKEFTGTEQALQVAAHGF--EKLNTAPFSIYPGDCKISVDPQKPSPIFSLQSPLSEYHNRFELGFGQPMVCANYPYMEQHYGILSAYAPQ-IPGRIM
        G     FTG +     A  G   ++ +T  F+  PG  K S D  KP   F++QS        FE GF QPM+   +P++EQ+YG++SAY  Q   GR+M
Subjt:  GSNGKEFTGTEQALQVAAHGF--EKLNTAPFSIYPGDCKISVDPQKPSPIFSLQSPLSEYHNRFELGFGQPMVCANYPYMEQHYGILSAYAPQ-IPGRIM

Query:  LPMSL-TTDDGPIYVNAKQYHGIIRRRKIRAKAMMENRLAKTRKPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKASMEPKKIEDINLSDSTGSSQCSV
        +P+ + T +DG IYVN+KQYHGIIRRR+ RAKA    +L++ RKPYMH SRHLHAMRRPRGSGGRFLNTK     K S                +SQ S 
Subjt:  LPMSL-TTDDGPIYVNAKQYHGIIRRRKIRAKAMMENRLAKTRKPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKASMEPKKIEDINLSDSTGSSQCSV

Query:  VLQSESGALNS
        V   E+  +NS
Subjt:  VLQSESGALNS

AT5G06510.3 nuclear factor Y, subunit A108.4e-3549.43Show/hide
Query:  GDCKISVDPQKPSPIFSLQSPLSEYHNRFELGFGQPMVCANYPYMEQHYGILSAYAPQ-IPGRIMLPMSL-TTDDGPIYVNAKQYHGIIRRRKIRAKAMM
        G  K S D  KP   F++QS        FE GF QPM+   +P++EQ+YG++SAY  Q   GR+M+P+ + T +DG IYVN+KQYHGIIRRR+ RAKA  
Subjt:  GDCKISVDPQKPSPIFSLQSPLSEYHNRFELGFGQPMVCANYPYMEQHYGILSAYAPQ-IPGRIMLPMSL-TTDDGPIYVNAKQYHGIIRRRKIRAKAMM

Query:  ENRLAKTRKPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKASMEPKKIEDINLSDSTGSSQCSVVLQSESGALNS
          +L++ RKPYMH SRHLHAMRRPRGSGGRFLNTK     K S                +SQ S V   E+  +NS
Subjt:  ENRLAKTRKPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKASMEPKKIEDINLSDSTGSSQCSVVLQSESGALNS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAGGTAAAGCGCCTTCCTATCAAATGGGGTTCTTGTGCGTATATTAAGTTCGAGTTGCCGCCGCTGCTGCTGCTGCTGCTGCTGATCCTTCTTCTTCTTTTTGC
TGCTTCTGAATTTGAATTGGGGCGTTGGAAAATGGCACCACAAACTGGGTATTTGAAAGAACATGAAGGAGTTGTTCCCAACTCGCTTGGACAGTTATCATCTTCTCCTG
CTCGGCTGTGGAGTGGCTATGGGCAAGGTTGTCAATCATTCTTTGGGGATTTTGGTCAGGTGAAGGCTTCTTCCACCATTCAACAACTTGGAAGTAATGGGAAAGAGTTC
ACTGGCACCGAACAAGCTCTTCAGGTTGCTGCTCATGGCTTTGAGAAACTGAACACAGCTCCATTTTCCATCTATCCTGGTGACTGTAAGATTTCTGTGGATCCACAAAA
ACCTTCACCAATTTTTTCCCTGCAATCCCCCCTGTCAGAATATCACAATCGTTTTGAGCTTGGATTTGGCCAGCCCATGGTATGTGCAAACTATCCTTACATGGAACAGC
ATTATGGCATCCTCTCAGCATATGCTCCTCAAATACCAGGTCGGATTATGCTGCCAATGAGCTTAACAACAGATGATGGACCTATTTATGTGAATGCAAAGCAGTATCAT
GGAATCATTAGGCGCAGGAAGATCCGTGCTAAGGCAATGATGGAGAATAGACTTGCAAAAACTCGTAAGCCATATATGCACGAATCGCGTCATCTTCACGCAATGCGTCG
TCCACGGGGATCTGGTGGCCGTTTCTTGAACACAAAGAATCTGAAAAATGGGAAAGCCTCAATGGAACCAAAGAAAATTGAAGACATAAACCTTTCTGATTCAACTGGTT
CTTCCCAATGTTCTGTGGTTCTTCAATCAGAAAGTGGAGCTTTGAACTCCCCAAATGGAGGCAAGGGGAGGGGCTTTAGTCTCTCGAGTTCTGAGAGATCATTGATGGTG
GAGGGCATGGCATGGTTCTGCCCAAATGGGTTGCAGCAGCTGACAACTGCTGTGACCTTTGTGTTTGACAGGGAAGAAAATGGGTTCTTGGTGCGACGCATCGCACCAAA
ACCCCAGTTTTCTGCGACCAAATCCAGCTTGTGCAATGTCGTTCACGACAATTGTCTCAAGCTTCGCGACGAGGCTAGTCCTGGCGTTTGTTTTGCTTCTCAGCATGAGG
AGGTGACTGAGAGAAACCAAAGAATCATCCAATCAGAAAACCAGAACGAAGTAACTGCAGTAGACAAAAGGGTCTGTTTCAAGAATAAGAGCGTAAAGAACATTCTGAAG
CACTTTGTACCGAACAAAGATGCTTGGCTTGTGGGTTTGTTTGTTTGTTTAAGGACAACTTTCATTTTCCGATCAAAGCAACCTTATCCAGTGGGATTTTTTGTCTTTCA
ATTTCAAAGCTCCCCAATCCGATCCCCAAAATCCATCCCTGTAACTTGGGATTCCGCCATCTCCGACTTCTACTACTGTTTTCTTACTCAAATGTCAGCTTCTTCTCTCA
CAATGGCTGCTGCTGCAGCTTCAGTTTCGTCTTCTTCTTCTTCTTCTTCTTCTTCACCCTGCAAGAAACTCTTCTTCGCCCAACATCTCAACCGAATTCCCTCCCATTTT
TCTCCAAAACAGAAGCCATTGAAGCTTCTCGAGCTCAGAATCCATTTACCCAACATTTACCCTCTTGCTTTCTCCTCCGCTTCTCACCTCTACTGTGCTCCTCCTGCATT
CGAGGGACTCGAAGTCTCCGACCCTATAACAGAAGACGCAGAGACTGAAGAATCGGACGGAAGCGAAGAAAGCCGGGAGGAAGACGAACAAAAGGTATCGGCATCTCGCG
ACGCAGGGAAGCTTTATATTGGAAATTTACCATATGCTATGACCTCTTCCCAATTGACTGAGGTCTTCGCCGAAGCCGGTCATGTGGTTTCTGTACAGGTTATATATGAC
AAAGTTACGGATAGGAGTAGGGGATTTGCATTTGTGACAATGGCAACTTTGGAGGAAGCTAAAGAAGCAATTCGGATGTTTGACGGCTCTCTAATCGGAGGGCGAACTGT
TCGGGTGAACTTCCCTGAAGTTCCAAGGGGAGGAGAAAAGGAAGTGATGGGGCCAAAGATAAGAAGCAGCTATAACAAATTTGTTGATAGTCCTCACAAGATATATGCAG
GTAACCTTGGTTGGGGCCTTACTTCTCAGAGTCTTAGAGAGGCTTTTGAAAACCAACCAGGGATATTGAGTGCCAAGGTCATCTATGATAGGGAATCTGGAAGAAGTCGA
GGTTTTGGATTTGTATCTTTTGAAACTGCTGAGGATGCAGAGTCTGCTTTGGATTCCATGAACGGAGTGGAAGTTGAAGGGCGACCACTTCGTTTGAACATGGCTGCAGA
CCGGGCCCCAACTTCTCCAGCAGCATTCACAAGGACTGAGAATACCATCGACAGCAATGAATTGCTTACAAGGACTGTTGTCCAAATTGAATACCTTCAGGATTTCCCTC
GCAATTCTCTTTCATTCCTTGCTCAACGACCAAGGAAGCCTATGTTTTTTTTTTCTTCAATGAGTGATGTGAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAAGGTAAAGCGCCTTCCTATCAAATGGGGTTCTTGTGCGTATATTAAGTTCGAGTTGCCGCCGCTGCTGCTGCTGCTGCTGCTGATCCTTCTTCTTCTTTTTGC
TGCTTCTGAATTTGAATTGGGGCGTTGGAAAATGGCACCACAAACTGGGTATTTGAAAGAACATGAAGGAGTTGTTCCCAACTCGCTTGGACAGTTATCATCTTCTCCTG
CTCGGCTGTGGAGTGGCTATGGGCAAGGTTGTCAATCATTCTTTGGGGATTTTGGTCAGGTGAAGGCTTCTTCCACCATTCAACAACTTGGAAGTAATGGGAAAGAGTTC
ACTGGCACCGAACAAGCTCTTCAGGTTGCTGCTCATGGCTTTGAGAAACTGAACACAGCTCCATTTTCCATCTATCCTGGTGACTGTAAGATTTCTGTGGATCCACAAAA
ACCTTCACCAATTTTTTCCCTGCAATCCCCCCTGTCAGAATATCACAATCGTTTTGAGCTTGGATTTGGCCAGCCCATGGTATGTGCAAACTATCCTTACATGGAACAGC
ATTATGGCATCCTCTCAGCATATGCTCCTCAAATACCAGGTCGGATTATGCTGCCAATGAGCTTAACAACAGATGATGGACCTATTTATGTGAATGCAAAGCAGTATCAT
GGAATCATTAGGCGCAGGAAGATCCGTGCTAAGGCAATGATGGAGAATAGACTTGCAAAAACTCGTAAGCCATATATGCACGAATCGCGTCATCTTCACGCAATGCGTCG
TCCACGGGGATCTGGTGGCCGTTTCTTGAACACAAAGAATCTGAAAAATGGGAAAGCCTCAATGGAACCAAAGAAAATTGAAGACATAAACCTTTCTGATTCAACTGGTT
CTTCCCAATGTTCTGTGGTTCTTCAATCAGAAAGTGGAGCTTTGAACTCCCCAAATGGAGGCAAGGGGAGGGGCTTTAGTCTCTCGAGTTCTGAGAGATCATTGATGGTG
GAGGGCATGGCATGGTTCTGCCCAAATGGGTTGCAGCAGCTGACAACTGCTGTGACCTTTGTGTTTGACAGGGAAGAAAATGGGTTCTTGGTGCGACGCATCGCACCAAA
ACCCCAGTTTTCTGCGACCAAATCCAGCTTGTGCAATGTCGTTCACGACAATTGTCTCAAGCTTCGCGACGAGGCTAGTCCTGGCGTTTGTTTTGCTTCTCAGCATGAGG
AGGTGACTGAGAGAAACCAAAGAATCATCCAATCAGAAAACCAGAACGAAGTAACTGCAGTAGACAAAAGGGTCTGTTTCAAGAATAAGAGCGTAAAGAACATTCTGAAG
CACTTTGTACCGAACAAAGATGCTTGGCTTGTGGGTTTGTTTGTTTGTTTAAGGACAACTTTCATTTTCCGATCAAAGCAACCTTATCCAGTGGGATTTTTTGTCTTTCA
ATTTCAAAGCTCCCCAATCCGATCCCCAAAATCCATCCCTGTAACTTGGGATTCCGCCATCTCCGACTTCTACTACTGTTTTCTTACTCAAATGTCAGCTTCTTCTCTCA
CAATGGCTGCTGCTGCAGCTTCAGTTTCGTCTTCTTCTTCTTCTTCTTCTTCTTCACCCTGCAAGAAACTCTTCTTCGCCCAACATCTCAACCGAATTCCCTCCCATTTT
TCTCCAAAACAGAAGCCATTGAAGCTTCTCGAGCTCAGAATCCATTTACCCAACATTTACCCTCTTGCTTTCTCCTCCGCTTCTCACCTCTACTGTGCTCCTCCTGCATT
CGAGGGACTCGAAGTCTCCGACCCTATAACAGAAGACGCAGAGACTGAAGAATCGGACGGAAGCGAAGAAAGCCGGGAGGAAGACGAACAAAAGGTATCGGCATCTCGCG
ACGCAGGGAAGCTTTATATTGGAAATTTACCATATGCTATGACCTCTTCCCAATTGACTGAGGTCTTCGCCGAAGCCGGTCATGTGGTTTCTGTACAGGTTATATATGAC
AAAGTTACGGATAGGAGTAGGGGATTTGCATTTGTGACAATGGCAACTTTGGAGGAAGCTAAAGAAGCAATTCGGATGTTTGACGGCTCTCTAATCGGAGGGCGAACTGT
TCGGGTGAACTTCCCTGAAGTTCCAAGGGGAGGAGAAAAGGAAGTGATGGGGCCAAAGATAAGAAGCAGCTATAACAAATTTGTTGATAGTCCTCACAAGATATATGCAG
GTAACCTTGGTTGGGGCCTTACTTCTCAGAGTCTTAGAGAGGCTTTTGAAAACCAACCAGGGATATTGAGTGCCAAGGTCATCTATGATAGGGAATCTGGAAGAAGTCGA
GGTTTTGGATTTGTATCTTTTGAAACTGCTGAGGATGCAGAGTCTGCTTTGGATTCCATGAACGGAGTGGAAGTTGAAGGGCGACCACTTCGTTTGAACATGGCTGCAGA
CCGGGCCCCAACTTCTCCAGCAGCATTCACAAGGACTGAGAATACCATCGACAGCAATGAATTGCTTACAAGGACTGTTGTCCAAATTGAATACCTTCAGGATTTCCCTC
GCAATTCTCTTTCATTCCTTGCTCAACGACCAAGGAAGCCTATGTTTTTTTTTTCTTCAATGAGTGATGTGAGTTGA
Protein sequenceShow/hide protein sequence
MKKVKRLPIKWGSCAYIKFELPPLLLLLLLILLLLFAASEFELGRWKMAPQTGYLKEHEGVVPNSLGQLSSSPARLWSGYGQGCQSFFGDFGQVKASSTIQQLGSNGKEF
TGTEQALQVAAHGFEKLNTAPFSIYPGDCKISVDPQKPSPIFSLQSPLSEYHNRFELGFGQPMVCANYPYMEQHYGILSAYAPQIPGRIMLPMSLTTDDGPIYVNAKQYH
GIIRRRKIRAKAMMENRLAKTRKPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKASMEPKKIEDINLSDSTGSSQCSVVLQSESGALNSPNGGKGRGFSLSSSERSLMV
EGMAWFCPNGLQQLTTAVTFVFDREENGFLVRRIAPKPQFSATKSSLCNVVHDNCLKLRDEASPGVCFASQHEEVTERNQRIIQSENQNEVTAVDKRVCFKNKSVKNILK
HFVPNKDAWLVGLFVCLRTTFIFRSKQPYPVGFFVFQFQSSPIRSPKSIPVTWDSAISDFYYCFLTQMSASSLTMAAAAASVSSSSSSSSSSPCKKLFFAQHLNRIPSHF
SPKQKPLKLLELRIHLPNIYPLAFSSASHLYCAPPAFEGLEVSDPITEDAETEESDGSEESREEDEQKVSASRDAGKLYIGNLPYAMTSSQLTEVFAEAGHVVSVQVIYD
KVTDRSRGFAFVTMATLEEAKEAIRMFDGSLIGGRTVRVNFPEVPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRESGRSR
GFGFVSFETAEDAESALDSMNGVEVEGRPLRLNMAADRAPTSPAAFTRTENTIDSNELLTRTVVQIEYLQDFPRNSLSFLAQRPRKPMFFFSSMSDVS