; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC10G199360 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC10G199360
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionNuclear transcription factor Y subunit
Genome locationCmU531Chr10:31364127..31373754
RNA-Seq ExpressionCmUC10G199360
SyntenyCmUC10G199360
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:1901259 - chloroplast rRNA processing (biological process)
GO:0005634 - nucleus (cellular component)
GO:0009507 - chloroplast (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003729 - mRNA binding (molecular function)
InterPro domainsIPR000504 - RNA recognition motif domain
IPR001289 - Nuclear transcription factor Y subunit A
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR035979 - RNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043802.1 nuclear transcription factor Y subunit A-10 [Cucumis melo var. makuwa]7.5e-18982.73Show/hide
Query:  MAPQTGYLKEHEGIVPNSLGQLSSSPARLWSAFGQGSQSIFGDFGLVKASS-EQLGSNGKEFNGTKQ-AAHGLEKVNIAPFSIYPGDCKISMDAQKPSPV
        MAPQTG LKEHE I+PNSLGQLS SPARLWSAFGQG QSIFGDFG VKASS +QLGSNGKEFNGTKQ  +HGL+K+N APFSIYPGD K+SMDAQKPSPV
Subjt:  MAPQTGYLKEHEGIVPNSLGQLSSSPARLWSAFGQGSQSIFGDFGLVKASS-EQLGSNGKEFNGTKQ-AAHGLEKVNIAPFSIYPGDCKISMDAQKPSPV

Query:  FSLQSPLSEYQNRFELGFGQPLICANYPYMEQHYGILSAYGPQIPGRIMLPMSLTSDDGPIYVNAKQYHGIIRRRQIRAKAMMENKLARTR---------
        FSLQSPL+EY NRFELGFGQPLIC NYPYMEQHYGILSAYGPQIPGRIMLPMSLTSDDGPIYVNAKQYHGIIRRRQIRAKAMMENKLART+         
Subjt:  FSLQSPLSEYQNRFELGFGQPLICANYPYMEQHYGILSAYGPQIPGRIMLPMSLTSDDGPIYVNAKQYHGIIRRRQIRAKAMMENKLARTR---------

Query:  ---------------------KPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKLSMEPKKIDDVNLSDSTGSQCSVVLQSESGTLNSRNDAKGRGFSLS
                             +PYMHESRHLHAMRRPRGSGGRFLNTKNLKNGK SMEPKKID+VNLSDSTGSQCSVVLQSESGTLNS N+AKGRGFSLS
Subjt:  ---------------------KPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKLSMEPKKIDDVNLSDSTGSQCSVVLQSESGTLNSRNDAKGRGFSLS

Query:  SSERSSMEEGMAWFCPNGLQQLTTAATFVFDGEGNGFLVQVIAPKPCLCCDQIQLATMSSTAIVSGFAMRIVPAFGCNGKSFLALTNVGVGCLLSLCFAS
        SSERS MEE MAWFCPNGLQQLTTA TFVF+GEG GFLVQVIAPKPCLCCDQIQLA M ST IV+GFAMRIVPAFGCNGKSFLAL  VGVGC LSLCFAS
Subjt:  SSERSSMEEGMAWFCPNGLQQLTTAATFVFDGEGNGFLVQVIAPKPCLCCDQIQLATMSSTAIVSGFAMRIVPAFGCNGKSFLALTNVGVGCLLSLCFAS

Query:  -QYGKWLRETKESSNQI
         +Y +WLRE+KESSNQI
Subjt:  -QYGKWLRETKESSNQI

KAA0043803.1 33 kDa ribonucleoprotein [Cucumis melo var. makuwa]1.5e-15282.09Show/hide
Query:  RRGRDGEWEKIPSH-RITSPEQNPSLSLGEFHHFHFLPQMSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPKLYP
        R  +DG  + +PS  R+   E++  ++  EF H H+LPQMSA S++ AAA  +   +SSSP  KK FFTQHPNQIPSHFSPK N LKLLNL IHLP  YP
Subjt:  RRGRDGEWEKIPSH-RITSPEQNPSLSLGEFHHFHFLPQMSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPKLYP

Query:  LSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEETQ-EDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAF
        LSFSS SH+HCAPPAFD L++SD ETEY +VQES GEE+TQ EDEQK+SVSREAGKLY+GNLPYAMTSSQLSEVF EAG VVSVQVIYDKVTDRSRGFAF
Subjt:  LSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEETQ-EDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAF

Query:  VTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRG
        VTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLR+AFENQPGILSAKVIYDRASGKSRG
Subjt:  VTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRG

Query:  FGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQALTSPAAFTRTENEIDSKELLTSISA
        FGFVSFETAEDAESALESMNGVEVEGRPLRLNIAA +A TSPAAF RTEN IDSKELLTSISA
Subjt:  FGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQALTSPAAFTRTENEIDSKELLTSISA

KAG6580757.1 Nuclear transcription factor Y subunit A-10, partial [Cucurbita argyrosperma subsp. sororia]1.2e-15978.42Show/hide
Query:  MAPQTGYLKEHEGIVPNSLGQLSSSPARLWSAFGQGSQSIFGDFGLVKASS--EQLGSNGKEFNGTKQ----AAHGLEKVNIAPFSIYPGDCKISMDAQK
        MAPQTGYLKEHEG+VPNSLGQLSSSPARLWS +GQG QS FGDFG VKASS  +QLGSNGKEF GT+Q    AAHG EK+N APFSIYPGDCKIS+D QK
Subjt:  MAPQTGYLKEHEGIVPNSLGQLSSSPARLWSAFGQGSQSIFGDFGLVKASS--EQLGSNGKEFNGTKQ----AAHGLEKVNIAPFSIYPGDCKISMDAQK

Query:  PSPVFSLQSPLSEYQNRFELGFGQPLICANYPYMEQHYGILSAYGPQIPGRIMLPMSLTSDDGPIYVNAKQYHGIIRRRQIRAKAMMENKLARTRKPYMH
        PSP+FSLQSPLSEY NRFELGFGQP++CANYPYMEQHYGILSAY PQIPGRIMLPMSLT+DDGPIYVNAKQYHGIIRRR+IRAKAMMEN+LA+TRKPYMH
Subjt:  PSPVFSLQSPLSEYQNRFELGFGQPLICANYPYMEQHYGILSAYGPQIPGRIMLPMSLTSDDGPIYVNAKQYHGIIRRRQIRAKAMMENKLARTRKPYMH

Query:  ESRHLHAMRRPRGSGGRFLNTKNLKNGKLSMEPKKIDDVNLSDSTG-SQCSVVLQSESGTLNSRNDAKGRGFSLSSSERSSMEEGMAWFCPNGLQQLTTA
        ESRHLHAMRRPRGSGGRFLNTK LKNGK SMEPKKI+D+NLSDSTG SQCSVVLQSESG LNS N  KGRGFSLSSSERS M EGMAWFCPNGLQQLTTA
Subjt:  ESRHLHAMRRPRGSGGRFLNTKNLKNGKLSMEPKKIDDVNLSDSTG-SQCSVVLQSESGTLNSRNDAKGRGFSLSSSERSSMEEGMAWFCPNGLQQLTTA

Query:  ATFVFDGEGNGFLVQVIAPKPCLCCDQIQLATMSSTAIVSGFAMRI----VPAFGCNGKSFLALTNVGVGCLLS-LCFAS
         TFVFD E NGFLV+ IAPKP     Q      S   +V    +++     P    +GKSFLAL NVGVGCLLS  CF++
Subjt:  ATFVFDGEGNGFLVQVIAPKPCLCCDQIQLATMSSTAIVSGFAMRI----VPAFGCNGKSFLALTNVGVGCLLS-LCFAS

XP_004136521.3 33 kDa ribonucleoprotein, chloroplastic [Cucumis sativus]3.4e-14988.27Show/hide
Query:  MSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPKLYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEE
        MSA S+SMAAA  A SV SSSP   K FFTQHPNQIPSHFSPK N LKLLNL  H P  YPLSFSS SHLHCAPPAFD L++SD ETEY  +QES GEEE
Subjt:  MSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPKLYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEE

Query:  TQ-EDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEK
        TQ EDEQKVSVSREAGKLY+GNLPYAMTSSQLSEVF EAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEK
Subjt:  TQ-EDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEK

Query:  EVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQAL
        EVMGP+IRSSYNKFVDSPHKIYAGNLGWGLTSQSLR+AFENQPGILSAK+IYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQ+ 
Subjt:  EVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQAL

Query:  TSPAAFTRTENEIDSKELLTSISA
         SPAAF RTEN ID KELLTSISA
Subjt:  TSPAAFTRTENEIDSKELLTSISA

XP_038904039.1 33 kDa ribonucleoprotein, chloroplastic [Benincasa hispida]2.2e-15692.9Show/hide
Query:  MSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPKLYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEE
        MSASSVSMAAA  A SVSSSSP SKKLFFTQHPNQIPSHFSPKQNPLKLLNL IHLP  YPLSFSS SHLH  PPAFDGL+VSD ETEYAE+QES  EEE
Subjt:  MSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPKLYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEE

Query:  TQ-EDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEK
        TQ EDEQKVSVSREAGKLY+GNLPYAMTSSQLSEVF EAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEK
Subjt:  TQ-EDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEK

Query:  EVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQAL
        EVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAE ALESMNGVEVEGRPLRLNIAAG+A 
Subjt:  EVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQAL

Query:  TSPAAFTRTENEIDSKELLTSISA
        TSPAAF RTEN IDSKELLTSISA
Subjt:  TSPAAFTRTENEIDSKELLTSISA

TrEMBL top hitse value%identityAlignment
A0A0A0LBL6 Uncharacterized protein3.7e-14987.96Show/hide
Query:  MSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPKLYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEE
        MSA S+SMAAA  A +V SSSP   K FFTQHPNQIPSHFSPK N LKLLNL  H P  YPLSFSS SHLHCAPPAFD L++SD ETEY  +QES GEEE
Subjt:  MSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPKLYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEE

Query:  TQ-EDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEK
        TQ EDEQKVSVSREAGKLY+GNLPYAMTSSQLSEVF EAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEK
Subjt:  TQ-EDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEK

Query:  EVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQAL
        EVMGP+IRSSYNKFVDSPHKIYAGNLGWGLTSQSLR+AFENQPGILSAK+IYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQ+ 
Subjt:  EVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQAL

Query:  TSPAAFTRTENEIDSKELLTSISA
         SPAAF RTEN ID KELLTSISA
Subjt:  TSPAAFTRTENEIDSKELLTSISA

A0A1S3B7N1 33 kDa ribonucleoprotein, chloroplastic1.4e-14888.47Show/hide
Query:  SSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPKLYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEETQ-
        S+ S+AAA  A S +SSSP  KK FFTQHPNQIPSHFSPK N LKLLNL IHLP  YPLSFSS SH+HCAPPAFD L++SD ETEY +VQES GEE+TQ 
Subjt:  SSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPKLYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEETQ-

Query:  EDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEVM
        EDEQK+SVSREAGKLY+GNLPYAMTSSQLSEVF EAG VVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEVM
Subjt:  EDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEVM

Query:  GPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQALTSP
        GPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLR+AFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAA +A TSP
Subjt:  GPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQALTSP

Query:  AAFTRTENEIDSKELLTSISA
        AAF RTEN IDSKELLTSISA
Subjt:  AAFTRTENEIDSKELLTSISA

A0A5A7TK11 33 kDa ribonucleoprotein7.1e-15382.09Show/hide
Query:  RRGRDGEWEKIPSH-RITSPEQNPSLSLGEFHHFHFLPQMSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPKLYP
        R  +DG  + +PS  R+   E++  ++  EF H H+LPQMSA S++ AAA  +   +SSSP  KK FFTQHPNQIPSHFSPK N LKLLNL IHLP  YP
Subjt:  RRGRDGEWEKIPSH-RITSPEQNPSLSLGEFHHFHFLPQMSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPKLYP

Query:  LSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEETQ-EDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAF
        LSFSS SH+HCAPPAFD L++SD ETEY +VQES GEE+TQ EDEQK+SVSREAGKLY+GNLPYAMTSSQLSEVF EAG VVSVQVIYDKVTDRSRGFAF
Subjt:  LSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEETQ-EDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAF

Query:  VTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRG
        VTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLR+AFENQPGILSAKVIYDRASGKSRG
Subjt:  VTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRG

Query:  FGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQALTSPAAFTRTENEIDSKELLTSISA
        FGFVSFETAEDAESALESMNGVEVEGRPLRLNIAA +A TSPAAF RTEN IDSKELLTSISA
Subjt:  FGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQALTSPAAFTRTENEIDSKELLTSISA

A0A5D3DNY3 Nuclear transcription factor Y subunit3.6e-18982.73Show/hide
Query:  MAPQTGYLKEHEGIVPNSLGQLSSSPARLWSAFGQGSQSIFGDFGLVKASS-EQLGSNGKEFNGTKQ-AAHGLEKVNIAPFSIYPGDCKISMDAQKPSPV
        MAPQTG LKEHE I+PNSLGQLS SPARLWSAFGQG QSIFGDFG VKASS +QLGSNGKEFNGTKQ  +HGL+K+N APFSIYPGD K+SMDAQKPSPV
Subjt:  MAPQTGYLKEHEGIVPNSLGQLSSSPARLWSAFGQGSQSIFGDFGLVKASS-EQLGSNGKEFNGTKQ-AAHGLEKVNIAPFSIYPGDCKISMDAQKPSPV

Query:  FSLQSPLSEYQNRFELGFGQPLICANYPYMEQHYGILSAYGPQIPGRIMLPMSLTSDDGPIYVNAKQYHGIIRRRQIRAKAMMENKLARTR---------
        FSLQSPL+EY NRFELGFGQPLIC NYPYMEQHYGILSAYGPQIPGRIMLPMSLTSDDGPIYVNAKQYHGIIRRRQIRAKAMMENKLART+         
Subjt:  FSLQSPLSEYQNRFELGFGQPLICANYPYMEQHYGILSAYGPQIPGRIMLPMSLTSDDGPIYVNAKQYHGIIRRRQIRAKAMMENKLARTR---------

Query:  ---------------------KPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKLSMEPKKIDDVNLSDSTGSQCSVVLQSESGTLNSRNDAKGRGFSLS
                             +PYMHESRHLHAMRRPRGSGGRFLNTKNLKNGK SMEPKKID+VNLSDSTGSQCSVVLQSESGTLNS N+AKGRGFSLS
Subjt:  ---------------------KPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKLSMEPKKIDDVNLSDSTGSQCSVVLQSESGTLNSRNDAKGRGFSLS

Query:  SSERSSMEEGMAWFCPNGLQQLTTAATFVFDGEGNGFLVQVIAPKPCLCCDQIQLATMSSTAIVSGFAMRIVPAFGCNGKSFLALTNVGVGCLLSLCFAS
        SSERS MEE MAWFCPNGLQQLTTA TFVF+GEG GFLVQVIAPKPCLCCDQIQLA M ST IV+GFAMRIVPAFGCNGKSFLAL  VGVGC LSLCFAS
Subjt:  SSERSSMEEGMAWFCPNGLQQLTTAATFVFDGEGNGFLVQVIAPKPCLCCDQIQLATMSSTAIVSGFAMRIVPAFGCNGKSFLALTNVGVGCLLSLCFAS

Query:  -QYGKWLRETKESSNQI
         +Y +WLRE+KESSNQI
Subjt:  -QYGKWLRETKESSNQI

A0A5D3DPZ1 33 kDa ribonucleoprotein1.8e-14888.47Show/hide
Query:  SSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPKLYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEETQ-
        S+ S+AAA  A S +SSSP  KK FFTQHPNQIPSHFSPK N LKLLNL IHLP  YPLSFSS SH+HCAPPAFD L++SD ETEY +VQES GEE+TQ 
Subjt:  SSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPKLYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEETQ-

Query:  EDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEVM
        EDEQK+SVSREAGKLY+GNLPYAMTSSQLSEVF EAG VVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEVM
Subjt:  EDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEVM

Query:  GPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQALTSP
        GPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLR+AFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAA +A TSP
Subjt:  GPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQALTSP

Query:  AAFTRTENEIDSKELLTSISA
        AAF RTEN IDSKELLTSISA
Subjt:  AAFTRTENEIDSKELLTSISA

SwissProt top hitse value%identityAlignment
P19684 33 kDa ribonucleoprotein, chloroplastic9.8e-8353.8Show/hide
Query:  MSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIP---SHFSPKQNPLKLLNLRIHLPKLYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYG
        MS    S AA  + +S S    F++K  F+     +    +HF+ K N  K   L+ H P      + SS  L       DG++V   + E      +  
Subjt:  MSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIP---SHFSPKQNPLKLLNLRIHLPKLYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYG

Query:  EEETQEDEQKV-SVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRG
        EEE +E E++V S S E G+LYVGNLP++MTSSQLSE+F EAG V +V+++YD+VTDRSRGFAFVTM ++EEAKEAIR+FDGSQ+GGRTV+VNFPEVPRG
Subjt:  EEETQEDEQKV-SVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRG

Query:  GEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAG
        GE+EVM  KIRS+Y  FVDSPHK+Y  NL W LTSQ LR+AF +QPG +SAKVIYDR+SG+SRGFGF++F +AE   SAL++MN VE+EGRPLRLN+A  
Subjt:  GEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAG

Query:  QALTS--PAAFTRTENEIDSKELLTSISA
        +A  S  P   T  EN+ D+ ELL+S+S+
Subjt:  QALTS--PAAFTRTENEIDSKELLTSISA

Q08935 29 kDa ribonucleoprotein A, chloroplastic7.4e-3836.4Show/hide
Query:  LKLLNLRIHLPKLYPLSFSSSSHLHCAPPAFDGLQVSDSE-------------TEYAEVQESYGEEETQEDEQKVSVSREAGKLYVGNLPYAMTSSQLSE
        L L    + LPK  P S ++S      PP+   L +S S              +++ ++++    ++  E+E+  S      K++VGNLP++  S+ L+E
Subjt:  LKLLNLRIHLPKLYPLSFSSSSHLHCAPPAFDGLQVSDSE-------------TEYAEVQESYGEEETQEDEQKVSVSREAGKLYVGNLPYAMTSSQLSE

Query:  VFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQS
        +F  AG+V  V+VIYDK+T RSRGF FVTM++ EE + A + F+G ++ GR +RVN    P   E        R   +   DS +++Y GNL WG+   +
Subjt:  VFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQS

Query:  LREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIA
        L   F  Q  ++ AKV+YDR SG+SRGFGFV++ +AE+  +A+ES++GV++ GR +R++ A
Subjt:  LREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIA

Q39061 RNA-binding protein CP33, chloroplastic1.2e-8051.93Show/hide
Query:  MSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQ--------NPLKL-LNLRIHLPKLYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAE
        MS++  S A A +A + +SS+     L  +   +Q+   F+PK         NPL L  N+R H        F  ++    A  A D +Q S  E E  E
Subjt:  MSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQ--------NPLKL-LNLRIHLPKLYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAE

Query:  VQESYGEEETQEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFP
         +   GEEE +E++Q    S E G+LYVGNLPY +TSS+LS++F EAG VV VQ++YDKVTDRSRGF FVTM ++EEAKEA++MF+ SQIGGRTV+VNFP
Subjt:  VQESYGEEETQEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFP

Query:  EVPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRL
        EVPRGGE EVM  KIR +   +VDSPHK+YAGNLGW LTSQ L++AF +QPG+L AKVIY+R +G+SRGFGF+SFE+AE+ +SAL +MNGVEVEGR LRL
Subjt:  EVPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRL

Query:  NIAA--GQALTSPAAFTRTENE---IDSKELLTSISA
        N+A+   +   SP +    E E   ++S E+L+++SA
Subjt:  NIAA--GQALTSPAAFTRTENE---IDSKELLTSISA

Q8LFU0 Nuclear transcription factor Y subunit A-101.0e-3945.04Show/hide
Query:  FGLVKASSEQL-GSNGKEFNGTK----QAAHGL--EKVNIAPFSIYPGDCKISMDAQKPSPVFSLQSPLSEYQNRFELGFGQPLICANYPYMEQHYGILS
        FG    ++E L G     F G K    +A  G+  ++ +   F+  PG  K S D  KP   F++QS        FE GF QP++   +P++EQ+YG++S
Subjt:  FGLVKASSEQL-GSNGKEFNGTK----QAAHGL--EKVNIAPFSIYPGDCKISMDAQKPSPVFSLQSPLSEYQNRFELGFGQPLICANYPYMEQHYGILS

Query:  AYGPQ-IPGRIMLPMSL-TSDDGPIYVNAKQYHGIIRRRQIRAKAMMENKLARTRKPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKLSMEPKKIDDVN
        AYG Q   GR+M+P+ + T +DG IYVN+KQYHGIIRRRQ RAKA    KL+R RKPYMH SRHLHAMRRPRGSGGRFLNTK              D   
Subjt:  AYGPQ-IPGRIMLPMSL-TSDDGPIYVNAKQYHGIIRRRQIRAKAMMENKLARTRKPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKLSMEPKKIDDVN

Query:  LSDSTGSQCSVVLQSESGTLNSRNDAKGRGFSLSSSERSSME
         S  + SQ S V   E+ T+NS  +A     +LS S  +SM+
Subjt:  LSDSTGSQCSVVLQSESGTLNSRNDAKGRGFSLSSSERSSME

Q9M9X4 Nuclear transcription factor Y subunit A-21.3e-3744.04Show/hide
Query:  GQLSSSPARLWSAFGQ---GSQSIFGD---FGLVKASSEQLGSNGKEFNGTKQAAHGLEKVNIAPFSIYPGDCKISMDAQKP-SPVFSLQSPLSEYQNRF
        G  S+     W+AFG      +S+ GD   F  VK  S  +G  G+  +    +A  L       FS+  GD K      KP    FS+QSP        
Subjt:  GQLSSSPARLWSAFGQ---GSQSIFGD---FGLVKASSEQLGSNGKEFNGTKQAAHGLEKVNIAPFSIYPGDCKISMDAQKP-SPVFSLQSPLSEYQNRF

Query:  ELGFGQPLICANYPYME-QHYGILSAYGPQIPGRIMLPMSLTSDDGPIYVNAKQYHGIIRRRQIRAKA---MMENKL-ARTRKPYMHESRHLHAMRRPRG
        ELGF QP I   YPY E Q+YG++SAYG Q   R+MLP+++ ++D  IYVN+KQYHGIIRRRQ RAKA   + + KL +R RKPYMH SRHLHA+RRPRG
Subjt:  ELGFGQPLICANYPYME-QHYGILSAYGPQIPGRIMLPMSLTSDDGPIYVNAKQYHGIIRRRQIRAKA---MMENKL-ARTRKPYMHESRHLHAMRRPRG

Query:  SGGRFLNTK---------NLKNGKLSMEPKKIDDVNLSDSTGSQCSVVLQSESGTLNSRNDAKGRGFSLSSSERSSM
        SGGRFLNTK         N K G  SM+   I        + SQ S V+  E+GT+N  N     G ++S SE +SM
Subjt:  SGGRFLNTK---------NLKNGKLSMEPKKIDDVNLSDSTGSQCSVVLQSESGTLNSRNDAKGRGFSLSSSERSSM

Arabidopsis top hitse value%identityAlignment
AT3G05690.1 nuclear factor Y, subunit A28.9e-3944.04Show/hide
Query:  GQLSSSPARLWSAFGQ---GSQSIFGD---FGLVKASSEQLGSNGKEFNGTKQAAHGLEKVNIAPFSIYPGDCKISMDAQKP-SPVFSLQSPLSEYQNRF
        G  S+     W+AFG      +S+ GD   F  VK  S  +G  G+  +    +A  L       FS+  GD K      KP    FS+QSP        
Subjt:  GQLSSSPARLWSAFGQ---GSQSIFGD---FGLVKASSEQLGSNGKEFNGTKQAAHGLEKVNIAPFSIYPGDCKISMDAQKP-SPVFSLQSPLSEYQNRF

Query:  ELGFGQPLICANYPYME-QHYGILSAYGPQIPGRIMLPMSLTSDDGPIYVNAKQYHGIIRRRQIRAKA---MMENKL-ARTRKPYMHESRHLHAMRRPRG
        ELGF QP I   YPY E Q+YG++SAYG Q   R+MLP+++ ++D  IYVN+KQYHGIIRRRQ RAKA   + + KL +R RKPYMH SRHLHA+RRPRG
Subjt:  ELGFGQPLICANYPYME-QHYGILSAYGPQIPGRIMLPMSLTSDDGPIYVNAKQYHGIIRRRQIRAKA---MMENKL-ARTRKPYMHESRHLHAMRRPRG

Query:  SGGRFLNTK---------NLKNGKLSMEPKKIDDVNLSDSTGSQCSVVLQSESGTLNSRNDAKGRGFSLSSSERSSM
        SGGRFLNTK         N K G  SM+   I        + SQ S V+  E+GT+N  N     G ++S SE +SM
Subjt:  SGGRFLNTK---------NLKNGKLSMEPKKIDDVNLSDSTGSQCSVVLQSESGTLNSRNDAKGRGFSLSSSERSSM

AT3G52380.1 chloroplast RNA-binding protein 338.5e-8251.93Show/hide
Query:  MSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQ--------NPLKL-LNLRIHLPKLYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAE
        MS++  S A A +A + +SS+     L  +   +Q+   F+PK         NPL L  N+R H        F  ++    A  A D +Q S  E E  E
Subjt:  MSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQ--------NPLKL-LNLRIHLPKLYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAE

Query:  VQESYGEEETQEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFP
         +   GEEE +E++Q    S E G+LYVGNLPY +TSS+LS++F EAG VV VQ++YDKVTDRSRGF FVTM ++EEAKEA++MF+ SQIGGRTV+VNFP
Subjt:  VQESYGEEETQEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFP

Query:  EVPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRL
        EVPRGGE EVM  KIR +   +VDSPHK+YAGNLGW LTSQ L++AF +QPG+L AKVIY+R +G+SRGFGF+SFE+AE+ +SAL +MNGVEVEGR LRL
Subjt:  EVPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRL

Query:  NIAA--GQALTSPAAFTRTENE---IDSKELLTSISA
        N+A+   +   SP +    E E   ++S E+L+++SA
Subjt:  NIAA--GQALTSPAAFTRTENE---IDSKELLTSISA

AT5G06510.1 nuclear factor Y, subunit A107.3e-4145.04Show/hide
Query:  FGLVKASSEQL-GSNGKEFNGTK----QAAHGL--EKVNIAPFSIYPGDCKISMDAQKPSPVFSLQSPLSEYQNRFELGFGQPLICANYPYMEQHYGILS
        FG    ++E L G     F G K    +A  G+  ++ +   F+  PG  K S D  KP   F++QS        FE GF QP++   +P++EQ+YG++S
Subjt:  FGLVKASSEQL-GSNGKEFNGTK----QAAHGL--EKVNIAPFSIYPGDCKISMDAQKPSPVFSLQSPLSEYQNRFELGFGQPLICANYPYMEQHYGILS

Query:  AYGPQ-IPGRIMLPMSL-TSDDGPIYVNAKQYHGIIRRRQIRAKAMMENKLARTRKPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKLSMEPKKIDDVN
        AYG Q   GR+M+P+ + T +DG IYVN+KQYHGIIRRRQ RAKA    KL+R RKPYMH SRHLHAMRRPRGSGGRFLNTK              D   
Subjt:  AYGPQ-IPGRIMLPMSL-TSDDGPIYVNAKQYHGIIRRRQIRAKAMMENKLARTRKPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKLSMEPKKIDDVN

Query:  LSDSTGSQCSVVLQSESGTLNSRNDAKGRGFSLSSSERSSME
         S  + SQ S V   E+ T+NS  +A     +LS S  +SM+
Subjt:  LSDSTGSQCSVVLQSESGTLNSRNDAKGRGFSLSSSERSSME

AT5G06510.2 nuclear factor Y, subunit A107.3e-4145.04Show/hide
Query:  FGLVKASSEQL-GSNGKEFNGTK----QAAHGL--EKVNIAPFSIYPGDCKISMDAQKPSPVFSLQSPLSEYQNRFELGFGQPLICANYPYMEQHYGILS
        FG    ++E L G     F G K    +A  G+  ++ +   F+  PG  K S D  KP   F++QS        FE GF QP++   +P++EQ+YG++S
Subjt:  FGLVKASSEQL-GSNGKEFNGTK----QAAHGL--EKVNIAPFSIYPGDCKISMDAQKPSPVFSLQSPLSEYQNRFELGFGQPLICANYPYMEQHYGILS

Query:  AYGPQ-IPGRIMLPMSL-TSDDGPIYVNAKQYHGIIRRRQIRAKAMMENKLARTRKPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKLSMEPKKIDDVN
        AYG Q   GR+M+P+ + T +DG IYVN+KQYHGIIRRRQ RAKA    KL+R RKPYMH SRHLHAMRRPRGSGGRFLNTK              D   
Subjt:  AYGPQ-IPGRIMLPMSL-TSDDGPIYVNAKQYHGIIRRRQIRAKAMMENKLARTRKPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKLSMEPKKIDDVN

Query:  LSDSTGSQCSVVLQSESGTLNSRNDAKGRGFSLSSSERSSME
         S  + SQ S V   E+ T+NS  +A     +LS S  +SM+
Subjt:  LSDSTGSQCSVVLQSESGTLNSRNDAKGRGFSLSSSERSSME

AT5G06510.3 nuclear factor Y, subunit A108.9e-3949.74Show/hide
Query:  GDCKISMDAQKPSPVFSLQSPLSEYQNRFELGFGQPLICANYPYMEQHYGILSAYGPQ-IPGRIMLPMSL-TSDDGPIYVNAKQYHGIIRRRQIRAKAMM
        G  K S D  KP   F++QS        FE GF QP++   +P++EQ+YG++SAYG Q   GR+M+P+ + T +DG IYVN+KQYHGIIRRRQ RAKA  
Subjt:  GDCKISMDAQKPSPVFSLQSPLSEYQNRFELGFGQPLICANYPYMEQHYGILSAYGPQ-IPGRIMLPMSL-TSDDGPIYVNAKQYHGIIRRRQIRAKAMM

Query:  ENKLARTRKPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKLSMEPKKIDDVNLSDSTGSQCSVVLQSESGTLNSRNDAKGRGFSLSSSERSSME
          KL+R RKPYMH SRHLHAMRRPRGSGGRFLNTK              D    S  + SQ S V   E+ T+NS  +A     +LS S  +SM+
Subjt:  ENKLARTRKPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKLSMEPKKIDDVNLSDSTGSQCSVVLQSESGTLNSRNDAKGRGFSLSSSERSSME


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCACCACAAACTGGCTATTTGAAAGAACATGAAGGAATTGTTCCTAATTCCCTTGGCCAGTTATCATCTTCTCCTGCCCGTTTATGGAGTGCCTTTGGGCAAGGTTC
TCAATCAATCTTTGGGGATTTTGGTCTAGTCAAGGCTTCATCTGAACAACTTGGCAGTAATGGGAAGGAGTTCAATGGCACCAAACAAGCTGCTCATGGCTTGGAGAAAG
TGAATATAGCTCCATTTTCCATCTATCCTGGTGACTGTAAGATTTCGATGGATGCACAAAAACCTTCACCAGTTTTTTCCCTGCAATCACCCTTGTCAGAATATCAAAAT
CGTTTTGAGCTTGGATTTGGCCAACCTTTGATATGTGCAAATTATCCGTACATGGAACAGCATTATGGCATCCTGTCAGCTTATGGACCTCAAATACCAGGCCGGATTAT
GCTGCCAATGAGCTTAACATCAGATGATGGACCTATTTATGTGAATGCAAAGCAGTATCATGGAATCATTAGGCGCAGGCAGATCCGTGCTAAGGCAATGATGGAGAATA
AACTTGCAAGAACTCGTAAGCCATATATGCACGAATCGCGTCATCTTCATGCAATGCGCCGTCCACGGGGATCTGGTGGCCGTTTCTTGAACACAAAGAATCTGAAAAAT
GGGAAACTCTCAATGGAACCAAAGAAAATTGATGATGTGAACCTTTCTGATTCAACTGGCTCCCAATGTTCTGTGGTTCTGCAATCAGAAAGTGGAACTTTGAACTCCCG
AAATGATGCAAAGGGGAGGGGCTTTAGTCTCTCGAGTTCGGAGAGATCATCGATGGAGGAGGGCATGGCATGGTTCTGCCCAAATGGGTTGCAGCAGCTGACAACTGCTG
CGACCTTTGTGTTTGATGGCGAAGGCAATGGGTTCTTGGTGCAAGTCATTGCACCAAAACCCTGTCTTTGTTGCGACCAAATCCAGCTTGCTACAATGTCGTCAACGGCA
ATTGTCTCAGGCTTCGCGATGAGGATAGTCCCCGCGTTTGGTTGCAATGGCAAGTCATTTTTGGCTCTAACTAATGTAGGTGTAGGATGTCTCTTATCTCTTTGTTTTGC
TTCTCAGTATGGGAAGTGGCTAAGAGAAACAAAGGAATCATCCAATCAGATATCAATTGCAAGGGTCGCTGGAATGGGTTTCTGTCGCATGGGGGGTTTTGTGGAACTGG
TTTCATTGGATATGCGGTTTTTAGCGGATAGAGCAAGAGAGCAAGAAGAAGGGCCTGGTCCCACCGCGTTTTTCAGATCTGGAACCAACAAAGGCGGCGACTGCGGCGAA
GTTCGGCGGATAAATGCTAGAAGAGGAAGAGATGGGGAATGGGAAAAGATACCATCTCATCGTATCACCTCCCCGGAGCAGAATCCATCTCTGTCACTTGGGGAATTCCA
CCATTTCCATTTTCTTCCTCAAATGTCAGCTTCTTCTGTCTCAATGGCTGCTGCTACTACAGCAACTTCGGTTTCCTCTTCTTCTCCATTCTCCAAGAAACTCTTCTTCA
CTCAACATCCCAACCAAATTCCCTCCCATTTTTCTCCGAAACAGAACCCATTGAAGCTTCTCAACCTCCGGATTCATTTACCCAAACTTTACCCTCTTTCCTTTTCCTCT
TCTTCTCATCTCCACTGTGCTCCTCCTGCTTTCGATGGGCTCCAAGTCTCCGACTCTGAGACAGAATACGCAGAGGTACAAGAATCGTATGGAGAAGAAGAAACCCAAGA
AGACGAACAAAAGGTATCAGTGTCTCGCGAAGCAGGGAAGCTTTATGTTGGGAATTTACCATATGCTATGACTTCTTCCCAATTGTCTGAGGTCTTCACCGAAGCTGGTC
ATGTGGTTTCTGTACAGGTTATATATGACAAAGTTACGGATAGGAGTAGGGGATTTGCATTTGTGACAATGGCCACTTTGGAGGAAGCTAAAGAAGCAATTCGGATGTTT
GATGGCTCTCAAATCGGTGGTCGAACTGTTCGGGTGAACTTCCCTGAAGTGCCAAGGGGAGGAGAAAAGGAAGTCATGGGGCCAAAGATAAGAAGCAGCTATAACAAATT
TGTAGATAGTCCTCACAAGATATATGCAGGGAACCTTGGTTGGGGTCTCACTTCTCAGAGTCTTAGAGAGGCTTTTGAAAACCAACCAGGGATATTGAGTGCCAAGGTCA
TCTATGATAGGGCATCTGGAAAAAGTAGAGGTTTTGGATTTGTATCCTTCGAAACTGCTGAGGATGCAGAGTCTGCTTTGGAGTCCATGAATGGAGTGGAAGTTGAAGGG
CGGCCGCTTCGTTTGAACATTGCTGCAGGGCAGGCCCTGACTTCCCCAGCAGCATTCACGAGGACTGAAAATGAAATTGACAGCAAAGAATTGCTTACCAGTATCAGTGC
CTGA
mRNA sequenceShow/hide mRNA sequence
CCAAACCAGATCACAGACAACTATGAATTAAGGACTCAGATTAGCATAATTAGGACCACTAATCATTTCCTAAGCACACTTTGAGTATATAATTTATGTTTTTTTTTAAT
CAAAAACCCATTTGGTTCAGCAAATCAAAGAAACAGAATCGGCAAATGGGAAAAACCCATATACCCAAATTGAAATTCAAGAAAATCCCAGAAAAGAAAGTTTGATTTTT
TTTTTTTTTAAATTTTTTTTGGTTAAACAATTGGGAGAAGGGAAAATCAGAGATTGGTGAAATCGCCCAAAAGTTGGATTTTTTTTATTTTTTGAAAAAAAAAAATGGGA
AAAGTGATAGAAAGAATTTTTGTAGTTAATGAATGTGGTTCATATCTATAGATATATATATATATTAATGGAAGCTTTCCATATAGAACAAAATTCTTGCATGCAAGGTA
AAGATCATTGGTTCTTGGGTTTTAGTTATTTTACACTTGTGTGCATTAATTCTTTTTCTTTCTCTCTCTCTCTCTCTCTCTCTCTCCTCTTTCTCTCTCTGTTAGTGCTT
TGCTTGAAATGTACCAACAGCACTCATATCAAATGGGGTTCCTGTGCCTGTATTCAATACCTGCTGCTGCTGCTTCTCCTCCTCCTCCTTCTCCTTCTTCTTGTTTTTGC
TGATTTTAATTTTGAATTGGTGTGAAGGGATTAGTAAAACAACTGTTTCTTGGTGATTTTGAGGCTTGGAAAATGGCACCACAAACTGGCTATTTGAAAGAACATGAAGG
AATTGTTCCTAATTCCCTTGGCCAGTTATCATCTTCTCCTGCCCGTTTATGGAGTGCCTTTGGGCAAGGTTCTCAATCAATCTTTGGGGATTTTGGTCTAGTCAAGGCTT
CATCTGAACAACTTGGCAGTAATGGGAAGGAGTTCAATGGCACCAAACAAGCTGCTCATGGCTTGGAGAAAGTGAATATAGCTCCATTTTCCATCTATCCTGGTGACTGT
AAGATTTCGATGGATGCACAAAAACCTTCACCAGTTTTTTCCCTGCAATCACCCTTGTCAGAATATCAAAATCGTTTTGAGCTTGGATTTGGCCAACCTTTGATATGTGC
AAATTATCCGTACATGGAACAGCATTATGGCATCCTGTCAGCTTATGGACCTCAAATACCAGGCCGGATTATGCTGCCAATGAGCTTAACATCAGATGATGGACCTATTT
ATGTGAATGCAAAGCAGTATCATGGAATCATTAGGCGCAGGCAGATCCGTGCTAAGGCAATGATGGAGAATAAACTTGCAAGAACTCGTAAGCCATATATGCACGAATCG
CGTCATCTTCATGCAATGCGCCGTCCACGGGGATCTGGTGGCCGTTTCTTGAACACAAAGAATCTGAAAAATGGGAAACTCTCAATGGAACCAAAGAAAATTGATGATGT
GAACCTTTCTGATTCAACTGGCTCCCAATGTTCTGTGGTTCTGCAATCAGAAAGTGGAACTTTGAACTCCCGAAATGATGCAAAGGGGAGGGGCTTTAGTCTCTCGAGTT
CGGAGAGATCATCGATGGAGGAGGGCATGGCATGGTTCTGCCCAAATGGGTTGCAGCAGCTGACAACTGCTGCGACCTTTGTGTTTGATGGCGAAGGCAATGGGTTCTTG
GTGCAAGTCATTGCACCAAAACCCTGTCTTTGTTGCGACCAAATCCAGCTTGCTACAATGTCGTCAACGGCAATTGTCTCAGGCTTCGCGATGAGGATAGTCCCCGCGTT
TGGTTGCAATGGCAAGTCATTTTTGGCTCTAACTAATGTAGGTGTAGGATGTCTCTTATCTCTTTGTTTTGCTTCTCAGTATGGGAAGTGGCTAAGAGAAACAAAGGAAT
CATCCAATCAGATATCAATTGCAAGGGTCGCTGGAATGGGTTTCTGTCGCATGGGGGGTTTTGTGGAACTGGTTTCATTGGATATGCGGTTTTTAGCGGATAGAGCAAGA
GAGCAAGAAGAAGGGCCTGGTCCCACCGCGTTTTTCAGATCTGGAACCAACAAAGGCGGCGACTGCGGCGAAGTTCGGCGGATAAATGCTAGAAGAGGAAGAGATGGGGA
ATGGGAAAAGATACCATCTCATCGTATCACCTCCCCGGAGCAGAATCCATCTCTGTCACTTGGGGAATTCCACCATTTCCATTTTCTTCCTCAAATGTCAGCTTCTTCTG
TCTCAATGGCTGCTGCTACTACAGCAACTTCGGTTTCCTCTTCTTCTCCATTCTCCAAGAAACTCTTCTTCACTCAACATCCCAACCAAATTCCCTCCCATTTTTCTCCG
AAACAGAACCCATTGAAGCTTCTCAACCTCCGGATTCATTTACCCAAACTTTACCCTCTTTCCTTTTCCTCTTCTTCTCATCTCCACTGTGCTCCTCCTGCTTTCGATGG
GCTCCAAGTCTCCGACTCTGAGACAGAATACGCAGAGGTACAAGAATCGTATGGAGAAGAAGAAACCCAAGAAGACGAACAAAAGGTATCAGTGTCTCGCGAAGCAGGGA
AGCTTTATGTTGGGAATTTACCATATGCTATGACTTCTTCCCAATTGTCTGAGGTCTTCACCGAAGCTGGTCATGTGGTTTCTGTACAGGTTATATATGACAAAGTTACG
GATAGGAGTAGGGGATTTGCATTTGTGACAATGGCCACTTTGGAGGAAGCTAAAGAAGCAATTCGGATGTTTGATGGCTCTCAAATCGGTGGTCGAACTGTTCGGGTGAA
CTTCCCTGAAGTGCCAAGGGGAGGAGAAAAGGAAGTCATGGGGCCAAAGATAAGAAGCAGCTATAACAAATTTGTAGATAGTCCTCACAAGATATATGCAGGGAACCTTG
GTTGGGGTCTCACTTCTCAGAGTCTTAGAGAGGCTTTTGAAAACCAACCAGGGATATTGAGTGCCAAGGTCATCTATGATAGGGCATCTGGAAAAAGTAGAGGTTTTGGA
TTTGTATCCTTCGAAACTGCTGAGGATGCAGAGTCTGCTTTGGAGTCCATGAATGGAGTGGAAGTTGAAGGGCGGCCGCTTCGTTTGAACATTGCTGCAGGGCAGGCCCT
GACTTCCCCAGCAGCATTCACGAGGACTGAAAATGAAATTGACAGCAAAGAATTGCTTACCAGTATCAGTGCCTGA
Protein sequenceShow/hide protein sequence
MAPQTGYLKEHEGIVPNSLGQLSSSPARLWSAFGQGSQSIFGDFGLVKASSEQLGSNGKEFNGTKQAAHGLEKVNIAPFSIYPGDCKISMDAQKPSPVFSLQSPLSEYQN
RFELGFGQPLICANYPYMEQHYGILSAYGPQIPGRIMLPMSLTSDDGPIYVNAKQYHGIIRRRQIRAKAMMENKLARTRKPYMHESRHLHAMRRPRGSGGRFLNTKNLKN
GKLSMEPKKIDDVNLSDSTGSQCSVVLQSESGTLNSRNDAKGRGFSLSSSERSSMEEGMAWFCPNGLQQLTTAATFVFDGEGNGFLVQVIAPKPCLCCDQIQLATMSSTA
IVSGFAMRIVPAFGCNGKSFLALTNVGVGCLLSLCFASQYGKWLRETKESSNQISIARVAGMGFCRMGGFVELVSLDMRFLADRAREQEEGPGPTAFFRSGTNKGGDCGE
VRRINARRGRDGEWEKIPSHRITSPEQNPSLSLGEFHHFHFLPQMSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPKLYPLSFSS
SSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEETQEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMF
DGSQIGGRTVRVNFPEVPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEG
RPLRLNIAAGQALTSPAAFTRTENEIDSKELLTSISA