; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC10G195820 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC10G195820
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionNuclear transcription factor Y subunit
Genome locationCiama_Chr10:31233016..31241540
RNA-Seq ExpressionCaUC10G195820
SyntenyCaUC10G195820
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:1901259 - chloroplast rRNA processing (biological process)
GO:0005634 - nucleus (cellular component)
GO:0009507 - chloroplast (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003729 - mRNA binding (molecular function)
InterPro domainsIPR000504 - RNA recognition motif domain
IPR001289 - Nuclear transcription factor Y subunit A
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR035979 - RNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043802.1 nuclear transcription factor Y subunit A-10 [Cucumis melo var. makuwa]1.8e-17078.59Show/hide
Query:  MAPQTGYLKEHEGIVPNSLGQLSSSPARLWSAFGQGSQSIFGDFGLVKASS-EQLGSNGKEFNGTKQ-AAHGLEKVNIAPFSIYPGDCKISMDAQKPSPV
        MAPQTG LKEHE I+PNSLGQLS SPARLWSAFGQG QSIFGDFG VKASS +QLGSNGKEFNGTKQ  +HGL+K+N APFSIYPGD K+SMDAQKPSPV
Subjt:  MAPQTGYLKEHEGIVPNSLGQLSSSPARLWSAFGQGSQSIFGDFGLVKASS-EQLGSNGKEFNGTKQ-AAHGLEKVNIAPFSIYPGDCKISMDAQKPSPV

Query:  FSLQSPLSEYQNRFELGFGQPLICANYPYMEQHYGILSAYGPQIPGRIMLPMSLTSDDGPIYVNAKQYHGIIRRRQIRAKAMMENKLARTR-----KDVN
        FSLQSPL+EY NRFELGFGQPLIC NYPYMEQHYGILSAYGPQIPGRIMLPMSLTSDDGPIYVNAKQYHGIIRRRQIRAKAMMENKLART+     KDVN
Subjt:  FSLQSPLSEYQNRFELGFGQPLICANYPYMEQHYGILSAYGPQIPGRIMLPMSLTSDDGPIYVNAKQYHGIIRRRQIRAKAMMENKLARTR-----KDVN

Query:  -SFLVVYFLTGITRTRSQSDADACLKSNSHSVLLEFLFANLWAIRIFDINIWLEYDILPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKLSMEPKKIDD
         SF  +Y    I  T +  D  + L+                                PYMHESRHLHAMRRPRGSGGRFLNTKNLKNGK SMEPKKID+
Subjt:  -SFLVVYFLTGITRTRSQSDADACLKSNSHSVLLEFLFANLWAIRIFDINIWLEYDILPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKLSMEPKKIDD

Query:  VNLSDSTGSQCSVVLQSESGTLNSRNDAKGRGFSLSSSERSLMEEGMAWFCPNGLQQLTTAATFVFDGEGNGFLVQVIAPKPCLCCDQIQLATMSSTAIV
        VNLSDSTGSQCSVVLQSESGTLNS N+AKGRGFSLSSSERSLMEE MAWFCPNGLQQLTTA TFVF+GEG GFLVQVIAPKPCLCCDQIQLA M ST IV
Subjt:  VNLSDSTGSQCSVVLQSESGTLNSRNDAKGRGFSLSSSERSLMEEGMAWFCPNGLQQLTTAATFVFDGEGNGFLVQVIAPKPCLCCDQIQLATMSSTAIV

Query:  SGFAMRIVPAF
        +GFAMRIVPAF
Subjt:  SGFAMRIVPAF

KAA0043803.1 33 kDa ribonucleoprotein [Cucumis melo var. makuwa]1.4e-15683.75Show/hide
Query:  RRRRDGEWEKIISSYHLPGAEIHLSLG-EFHHFHFLPQMSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPNFYPL
        R  +DG  + + SS  LPGAE  + +  EF H H+LPQMSA S++ AAA  +   +SSSP  KK FFTQHPNQIPSHFSPK N LKLLNL IHLPNFYPL
Subjt:  RRRRDGEWEKIISSYHLPGAEIHLSLG-EFHHFHFLPQMSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPNFYPL

Query:  SFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEETQEEEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAF
        SFSS SH+HCAPPAFD L++SD ETEY +VQES GEE+TQ EEDEQK+SVSREAGKLY+GNLPYAMTSSQLSEVF EAG VVSVQVIYDKVTDRSRGFAF
Subjt:  SFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEETQEEEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAF

Query:  VTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRG
        VTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLR+AFENQPGILSAKVIYDRASGKSRG
Subjt:  VTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRG

Query:  FGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQALTSPAAFTRTENEIDSKELLTSISA
        FGFVSFETAEDAESALESMNGVEVEGRPLRLNIAA +A TSPAAF RTEN IDSKELLTSISA
Subjt:  FGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQALTSPAAFTRTENEIDSKELLTSISA

XP_004136521.3 33 kDa ribonucleoprotein, chloroplastic [Cucumis sativus]9.1e-15188.92Show/hide
Query:  MSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPNFYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEE
        MSA S+SMAAA  A SV SSSP   K FFTQHPNQIPSHFSPK N LKLLNL  H PNFYPLSFSS SHLHCAPPAFD L++SD ETEY  +QES GEEE
Subjt:  MSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPNFYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEE

Query:  TQEEEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGE
        TQ EEDEQKVSVSREAGKLY+GNLPYAMTSSQLSEVF EAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGE
Subjt:  TQEEEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGE

Query:  KEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQA
        KEVMGP+IRSSYNKFVDSPHKIYAGNLGWGLTSQSLR+AFENQPGILSAK+IYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQ+
Subjt:  KEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQA

Query:  LTSPAAFTRTENEIDSKELLTSISA
          SPAAF RTEN ID KELLTSISA
Subjt:  LTSPAAFTRTENEIDSKELLTSISA

XP_008442930.1 PREDICTED: 33 kDa ribonucleoprotein, chloroplastic [Cucumis melo]1.0e-14989.13Show/hide
Query:  SSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPNFYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEETQE
        S+ S+AAA  A S +SSSP  KK FFTQHPNQIPSHFSPK N LKLLNL IHLPNFYPLSFSS SH+HCAPPAFD L++SD ETEY +VQES GEE+TQ 
Subjt:  SSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPNFYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEETQE

Query:  EEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEV
        EEDEQK+SVSREAGKLY+GNLPYAMTSSQLSEVF EAG VVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEV
Subjt:  EEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEV

Query:  MGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQALTS
        MGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLR+AFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAA +A TS
Subjt:  MGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQALTS

Query:  PAAFTRTENEIDSKELLTSISA
        PAAF RTEN IDSKELLTSISA
Subjt:  PAAFTRTENEIDSKELLTSISA

XP_038904039.1 33 kDa ribonucleoprotein, chloroplastic [Benincasa hispida]7.7e-15893.54Show/hide
Query:  MSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPNFYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEE
        MSASSVSMAAA  A SVSSSSP SKKLFFTQHPNQIPSHFSPKQNPLKLLNL IHLPNFYPLSFSS SHLH  PPAFDGL+VSD ETEYAE+QES  EEE
Subjt:  MSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPNFYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEE

Query:  TQEEEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGE
        TQ EEDEQKVSVSREAGKLY+GNLPYAMTSSQLSEVF EAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGE
Subjt:  TQEEEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGE

Query:  KEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQA
        KEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAE ALESMNGVEVEGRPLRLNIAAG+A
Subjt:  KEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQA

Query:  LTSPAAFTRTENEIDSKELLTSISA
         TSPAAF RTEN IDSKELLTSISA
Subjt:  LTSPAAFTRTENEIDSKELLTSISA

TrEMBL top hitse value%identityAlignment
A0A0A0LBL6 Uncharacterized protein9.8e-15188.62Show/hide
Query:  MSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPNFYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEE
        MSA S+SMAAA  A +V SSSP   K FFTQHPNQIPSHFSPK N LKLLNL  H PNFYPLSFSS SHLHCAPPAFD L++SD ETEY  +QES GEEE
Subjt:  MSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPNFYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEE

Query:  TQEEEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGE
        TQ EEDEQKVSVSREAGKLY+GNLPYAMTSSQLSEVF EAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGE
Subjt:  TQEEEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGE

Query:  KEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQA
        KEVMGP+IRSSYNKFVDSPHKIYAGNLGWGLTSQSLR+AFENQPGILSAK+IYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQ+
Subjt:  KEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQA

Query:  LTSPAAFTRTENEIDSKELLTSISA
          SPAAF RTEN ID KELLTSISA
Subjt:  LTSPAAFTRTENEIDSKELLTSISA

A0A1S3B7N1 33 kDa ribonucleoprotein, chloroplastic4.9e-15089.13Show/hide
Query:  SSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPNFYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEETQE
        S+ S+AAA  A S +SSSP  KK FFTQHPNQIPSHFSPK N LKLLNL IHLPNFYPLSFSS SH+HCAPPAFD L++SD ETEY +VQES GEE+TQ 
Subjt:  SSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPNFYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEETQE

Query:  EEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEV
        EEDEQK+SVSREAGKLY+GNLPYAMTSSQLSEVF EAG VVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEV
Subjt:  EEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEV

Query:  MGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQALTS
        MGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLR+AFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAA +A TS
Subjt:  MGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQALTS

Query:  PAAFTRTENEIDSKELLTSISA
        PAAF RTEN IDSKELLTSISA
Subjt:  PAAFTRTENEIDSKELLTSISA

A0A5A7TK11 33 kDa ribonucleoprotein7.0e-15783.75Show/hide
Query:  RRRRDGEWEKIISSYHLPGAEIHLSLG-EFHHFHFLPQMSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPNFYPL
        R  +DG  + + SS  LPGAE  + +  EF H H+LPQMSA S++ AAA  +   +SSSP  KK FFTQHPNQIPSHFSPK N LKLLNL IHLPNFYPL
Subjt:  RRRRDGEWEKIISSYHLPGAEIHLSLG-EFHHFHFLPQMSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPNFYPL

Query:  SFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEETQEEEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAF
        SFSS SH+HCAPPAFD L++SD ETEY +VQES GEE+TQ EEDEQK+SVSREAGKLY+GNLPYAMTSSQLSEVF EAG VVSVQVIYDKVTDRSRGFAF
Subjt:  SFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEETQEEEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAF

Query:  VTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRG
        VTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLR+AFENQPGILSAKVIYDRASGKSRG
Subjt:  VTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRG

Query:  FGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQALTSPAAFTRTENEIDSKELLTSISA
        FGFVSFETAEDAESALESMNGVEVEGRPLRLNIAA +A TSPAAF RTEN IDSKELLTSISA
Subjt:  FGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQALTSPAAFTRTENEIDSKELLTSISA

A0A5D3DNY3 Nuclear transcription factor Y subunit8.5e-17178.59Show/hide
Query:  MAPQTGYLKEHEGIVPNSLGQLSSSPARLWSAFGQGSQSIFGDFGLVKASS-EQLGSNGKEFNGTKQ-AAHGLEKVNIAPFSIYPGDCKISMDAQKPSPV
        MAPQTG LKEHE I+PNSLGQLS SPARLWSAFGQG QSIFGDFG VKASS +QLGSNGKEFNGTKQ  +HGL+K+N APFSIYPGD K+SMDAQKPSPV
Subjt:  MAPQTGYLKEHEGIVPNSLGQLSSSPARLWSAFGQGSQSIFGDFGLVKASS-EQLGSNGKEFNGTKQ-AAHGLEKVNIAPFSIYPGDCKISMDAQKPSPV

Query:  FSLQSPLSEYQNRFELGFGQPLICANYPYMEQHYGILSAYGPQIPGRIMLPMSLTSDDGPIYVNAKQYHGIIRRRQIRAKAMMENKLARTR-----KDVN
        FSLQSPL+EY NRFELGFGQPLIC NYPYMEQHYGILSAYGPQIPGRIMLPMSLTSDDGPIYVNAKQYHGIIRRRQIRAKAMMENKLART+     KDVN
Subjt:  FSLQSPLSEYQNRFELGFGQPLICANYPYMEQHYGILSAYGPQIPGRIMLPMSLTSDDGPIYVNAKQYHGIIRRRQIRAKAMMENKLARTR-----KDVN

Query:  -SFLVVYFLTGITRTRSQSDADACLKSNSHSVLLEFLFANLWAIRIFDINIWLEYDILPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKLSMEPKKIDD
         SF  +Y    I  T +  D  + L+                                PYMHESRHLHAMRRPRGSGGRFLNTKNLKNGK SMEPKKID+
Subjt:  -SFLVVYFLTGITRTRSQSDADACLKSNSHSVLLEFLFANLWAIRIFDINIWLEYDILPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKLSMEPKKIDD

Query:  VNLSDSTGSQCSVVLQSESGTLNSRNDAKGRGFSLSSSERSLMEEGMAWFCPNGLQQLTTAATFVFDGEGNGFLVQVIAPKPCLCCDQIQLATMSSTAIV
        VNLSDSTGSQCSVVLQSESGTLNS N+AKGRGFSLSSSERSLMEE MAWFCPNGLQQLTTA TFVF+GEG GFLVQVIAPKPCLCCDQIQLA M ST IV
Subjt:  VNLSDSTGSQCSVVLQSESGTLNSRNDAKGRGFSLSSSERSLMEEGMAWFCPNGLQQLTTAATFVFDGEGNGFLVQVIAPKPCLCCDQIQLATMSSTAIV

Query:  SGFAMRIVPAF
        +GFAMRIVPAF
Subjt:  SGFAMRIVPAF

A0A5D3DPZ1 33 kDa ribonucleoprotein6.4e-15089.13Show/hide
Query:  SSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPNFYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEETQE
        S+ S+AAA  A S +SSSP  KK FFTQHPNQIPSHFSPK N LKLLNL IHLPNFYPLSFSS SH+HCAPPAFD L++SD ETEY +VQES GEE+TQ 
Subjt:  SSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPNFYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEETQE

Query:  EEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEV
        EEDEQK+SVSREAGKLY+GNLPYAMTSSQLSEVF EAG VVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEV
Subjt:  EEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEV

Query:  MGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQALTS
        MGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLR+AFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAA +A TS
Subjt:  MGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQALTS

Query:  PAAFTRTENEIDSKELLTSISA
        PAAF RTEN IDSKELLTSISA
Subjt:  PAAFTRTENEIDSKELLTSISA

SwissProt top hitse value%identityAlignment
P19684 33 kDa ribonucleoprotein, chloroplastic6.5e-8354.55Show/hide
Query:  MSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIP---SHFSPKQNPLKLLNLRIHLPNFYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYG
        MS    S AA  + +S S    F++K  F+     +    +HF+ K N  K   L+ H P      + SS  L       DG++V   + E  EV  S  
Subjt:  MSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIP---SHFSPKQNPLKLLNLRIHLPNFYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYG

Query:  EEETQEEEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPR
        EEE  EE++E+  S S E G+LYVGNLP++MTSSQLSE+F EAG V +V+++YD+VTDRSRGFAFVTM ++EEAKEAIR+FDGSQ+GGRTV+VNFPEVPR
Subjt:  EEETQEEEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPR

Query:  GGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAA
        GGE+EVM  KIRS+Y  FVDSPHK+Y  NL W LTSQ LR+AF +QPG +SAKVIYDR+SG+SRGFGF++F +AE   SAL++MN VE+EGRPLRLN+A 
Subjt:  GGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAA

Query:  GQALTS--PAAFTRTENEIDSKELLTSISA
         +A  S  P   T  EN+ D+ ELL+S+S+
Subjt:  GQALTS--PAAFTRTENEIDSKELLTSISA

P49314 31 kDa ribonucleoprotein, chloroplastic1.2e-3635.14Show/hide
Query:  MSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPNFYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEE
        M++SSVS   +     V+S +P S K      PN   S FS   + L L            LS SS  H     P         + +++ ++      E+
Subjt:  MSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPNFYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEE

Query:  TQEEEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVP----
          E  ++ + S   E  KL+VGNLP+++ S+ L+ +F  AG+V  V+VIYDK++ RSRGF FVTM+T EE + A + F+G +I GR +RVN    P    
Subjt:  TQEEEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVP----

Query:  -------RGGEKEVMGPKIRSSY------NKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNG
               RGG     G +  +S        + VDS +++Y GNL WG+   +L+E F  Q  ++ AKV+YDR SG+SRGFGFV++ +A++   A++S+NG
Subjt:  -------RGGEKEVMGPKIRSSY------NKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNG

Query:  VEVEGRPLRLNIA
        ++++GR +R++ A
Subjt:  VEVEGRPLRLNIA

Q08935 29 kDa ribonucleoprotein A, chloroplastic1.6e-3637.75Show/hide
Query:  LPNFYPLSFSSSSHLHCAPPAFDGLQVSDSETEYA-----EVQESYGEEETQEEEDEQKVSVSREAG---KLYVGNLPYAMTSSQLSEVFTEAGHVVSVQ
        LP   P S ++S      PP+   L +S S + ++     +V  S  ++    E+ +  V   R      K++VGNLP++  S+ L+E+F  AG+V  V+
Subjt:  LPNFYPLSFSSSSHLHCAPPAFDGLQVSDSETEYA-----EVQESYGEEETQEEEDEQKVSVSREAG---KLYVGNLPYAMTSSQLSEVFTEAGHVVSVQ

Query:  VIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGIL
        VIYDK+T RSRGF FVTM++ EE + A + F+G ++ GR +RVN    P   E        R   +   DS +++Y GNL WG+   +L   F  Q  ++
Subjt:  VIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGIL

Query:  SAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIA
         AKV+YDR SG+SRGFGFV++ +AE+  +A+ES++GV++ GR +R++ A
Subjt:  SAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIA

Q08937 29 kDa ribonucleoprotein B, chloroplastic3.1e-3736.03Show/hide
Query:  VSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPNFYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEETQEEEDEQKVSVSREA
        ++SSS  S +  F     Q PS   P  + L   +L     N   LS SSSS   C+   F        E+ ++      G ++ +++ +  +     E 
Subjt:  VSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPNFYPLSFSSSSHLHCAPPAFDGLQVSDSETEYAEVQESYGEEETQEEEDEQKVSVSREA

Query:  GKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVP-----------RGGEKEVMG
         KL+VGNLP+++ S+ L+ +F  AG+V  V+VIYDK+T RSRGF FVTM+T EE + A + F+G +I GR +RVN    P           RGG     G
Subjt:  GKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVP-----------RGGEKEVMG

Query:  PKIRSSY------NKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIA
         +  +S        + VDS +++Y GNL WG+   +L+E F  Q  ++ AKV+YDR SG+SRGFGFV++ ++++   A++S+NGV+++GR +R++ A
Subjt:  PKIRSSY------NKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIA

Q39061 RNA-binding protein CP33, chloroplastic1.9e-7952.07Show/hide
Query:  MSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPNFYPLSFSSSSHLH---C-----APPAFDGLQVSDSETEYAEV
        MS++  S A A +A + +SS+     L  +   +Q+   F+PK        L  + PN  PL   S+   H   C     A  A D +Q S  E E  E 
Subjt:  MSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPNFYPLSFSSSSHLH---C-----APPAFDGLQVSDSETEYAEV

Query:  QESYGEEETQEEEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNF
        +   GEEE   EE++Q    S E G+LYVGNLPY +TSS+LS++F EAG VV VQ++YDKVTDRSRGF FVTM ++EEAKEA++MF+ SQIGGRTV+VNF
Subjt:  QESYGEEETQEEEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNF

Query:  PEVPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLR
        PEVPRGGE EVM  KIR +   +VDSPHK+YAGNLGW LTSQ L++AF +QPG+L AKVIY+R +G+SRGFGF+SFE+AE+ +SAL +MNGVEVEGR LR
Subjt:  PEVPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLR

Query:  LNIAA--GQALTSPAAFTRTENE---IDSKELLTSISA
        LN+A+   +   SP +    E E   ++S E+L+++SA
Subjt:  LNIAA--GQALTSPAAFTRTENE---IDSKELLTSISA

Arabidopsis top hitse value%identityAlignment
AT1G60000.1 RNA-binding (RRM/RBD/RNP motifs) family protein2.8e-3335.96Show/hide
Query:  ETQEEEDEQKVSVSREAG---KLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVP
        E +E++D     +   A    KLY GNLPY + S+ L+++  +  +   V+V+Y++ T +SRGFAFVTM+ +E+    I   DG++  GR ++VNF + P
Subjt:  ETQEEEDEQKVSVSREAG---KLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNFPEVP

Query:  RGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIA
        +  ++ +           + ++ HK++ GNL W +TS+SL  AF     ++ A+V++D  +G+SRG+GFV + +  + E+ALES++G E+EGR +R+N+A
Subjt:  RGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIA

Query:  AGQ
         G+
Subjt:  AGQ

AT2G37220.1 RNA-binding (RRM/RBD/RNP motifs) family protein1.9e-3740.31Show/hide
Query:  KLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNF-PEVPRGGEKEVMGPKIRSSYNKF-
        KL+VGNLP+ + S+QL+++F  AG+V  V+VIYDK+T RSRGF FVTM+++ E + A + F+G ++ GR +RVN  P  P+  +    GP  RSS+    
Subjt:  KLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNF-PEVPRGGEKEVMGPKIRSSYNKF-

Query:  ----------VDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIA
                    S +++Y GNL WG+   +L   F  Q  ++ A+VIYDR SG+S+GFGFV+++++++ ++A++S++G +++GR +R++ A
Subjt:  ----------VDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIA

AT3G52380.1 chloroplast RNA-binding protein 331.4e-8052.07Show/hide
Query:  MSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPNFYPLSFSSSSHLH---C-----APPAFDGLQVSDSETEYAEV
        MS++  S A A +A + +SS+     L  +   +Q+   F+PK        L  + PN  PL   S+   H   C     A  A D +Q S  E E  E 
Subjt:  MSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPNFYPLSFSSSSHLH---C-----APPAFDGLQVSDSETEYAEV

Query:  QESYGEEETQEEEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNF
        +   GEEE   EE++Q    S E G+LYVGNLPY +TSS+LS++F EAG VV VQ++YDKVTDRSRGF FVTM ++EEAKEA++MF+ SQIGGRTV+VNF
Subjt:  QESYGEEETQEEEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVRVNF

Query:  PEVPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLR
        PEVPRGGE EVM  KIR +   +VDSPHK+YAGNLGW LTSQ L++AF +QPG+L AKVIY+R +G+SRGFGF+SFE+AE+ +SAL +MNGVEVEGR LR
Subjt:  PEVPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLR

Query:  LNIAA--GQALTSPAAFTRTENE---IDSKELLTSISA
        LN+A+   +   SP +    E E   ++S E+L+++SA
Subjt:  LNIAA--GQALTSPAAFTRTENE---IDSKELLTSISA

AT5G06510.1 nuclear factor Y, subunit A102.8e-3335.93Show/hide
Query:  FGLVKASSEQL-GSNGKEFNGTK----QAAHGL--EKVNIAPFSIYPGDCKISMDAQKPSPVFSLQSPLSEYQNRFELGFGQPLICANYPYMEQHYGILS
        FG    ++E L G     F G K    +A  G+  ++ +   F+  PG  K S D  KP   F++QS        FE GF QP++   +P++EQ+YG++S
Subjt:  FGLVKASSEQL-GSNGKEFNGTK----QAAHGL--EKVNIAPFSIYPGDCKISMDAQKPSPVFSLQSPLSEYQNRFELGFGQPLICANYPYMEQHYGILS

Query:  AYGPQ-IPGRIMLPMSL-TSDDGPIYVNAKQYHGIIRRRQIRAKAMMENKLARTRKDVNSFLVVYFLTGITRTRSQSDADACLKSNSHSVLLEFLFANLW
        AYG Q   GR+M+P+ + T +DG IYVN+KQYHGIIRRRQ RAKA    KL+R RK                                            
Subjt:  AYGPQ-IPGRIMLPMSL-TSDDGPIYVNAKQYHGIIRRRQIRAKAMMENKLARTRKDVNSFLVVYFLTGITRTRSQSDADACLKSNSHSVLLEFLFANLW

Query:  AIRIFDINIWLEYDILPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKLSMEPKKIDDVNLSDSTGSQCSVVLQSESGTLNSRNDAKGRGFSLSS
                        PYMH SRHLHAMRRPRGSGGRFLNTK              D    S  + SQ S V   E+ T+NS  +A     S S+
Subjt:  AIRIFDINIWLEYDILPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKLSMEPKKIDDVNLSDSTGSQCSVVLQSESGTLNSRNDAKGRGFSLSS

AT5G06510.2 nuclear factor Y, subunit A102.8e-3335.93Show/hide
Query:  FGLVKASSEQL-GSNGKEFNGTK----QAAHGL--EKVNIAPFSIYPGDCKISMDAQKPSPVFSLQSPLSEYQNRFELGFGQPLICANYPYMEQHYGILS
        FG    ++E L G     F G K    +A  G+  ++ +   F+  PG  K S D  KP   F++QS        FE GF QP++   +P++EQ+YG++S
Subjt:  FGLVKASSEQL-GSNGKEFNGTK----QAAHGL--EKVNIAPFSIYPGDCKISMDAQKPSPVFSLQSPLSEYQNRFELGFGQPLICANYPYMEQHYGILS

Query:  AYGPQ-IPGRIMLPMSL-TSDDGPIYVNAKQYHGIIRRRQIRAKAMMENKLARTRKDVNSFLVVYFLTGITRTRSQSDADACLKSNSHSVLLEFLFANLW
        AYG Q   GR+M+P+ + T +DG IYVN+KQYHGIIRRRQ RAKA    KL+R RK                                            
Subjt:  AYGPQ-IPGRIMLPMSL-TSDDGPIYVNAKQYHGIIRRRQIRAKAMMENKLARTRKDVNSFLVVYFLTGITRTRSQSDADACLKSNSHSVLLEFLFANLW

Query:  AIRIFDINIWLEYDILPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKLSMEPKKIDDVNLSDSTGSQCSVVLQSESGTLNSRNDAKGRGFSLSS
                        PYMH SRHLHAMRRPRGSGGRFLNTK              D    S  + SQ S V   E+ T+NS  +A     S S+
Subjt:  AIRIFDINIWLEYDILPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKLSMEPKKIDDVNLSDSTGSQCSVVLQSESGTLNSRNDAKGRGFSLSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCATAGTCTCTTTGTTCCAACTCTGAATTGTAGGAATTGGTTGAAGGGATTAGTAAAACAACTGTTTCTTGGTGATTTTGAGGCTTGGAAAATGGCACCACAAAC
TGGCTATTTGAAAGAACATGAAGGAATTGTTCCTAATTCCCTTGGCCAGTTATCATCTTCTCCTGCCCGTTTATGGAGTGCCTTTGGGCAAGGTTCTCAATCAATCTTTG
GGGATTTTGGTCTAGTCAAGGCTTCATCTGAACAACTTGGCAGTAATGGGAAGGAGTTCAATGGCACCAAACAAGCTGCTCATGGCTTGGAGAAAGTGAATATAGCTCCA
TTTTCCATCTATCCTGGTGACTGTAAGATTTCGATGGATGCACAAAAACCTTCACCAGTTTTTTCCCTGCAATCACCCTTGTCAGAATATCAAAATCGTTTTGAGCTTGG
ATTTGGCCAACCTTTGATATGTGCAAATTATCCGTACATGGAACAGCATTATGGCATCCTGTCAGCTTATGGACCTCAAATACCAGGCCGGATTATGCTGCCAATGAGCT
TAACATCAGATGATGGACCTATTTATGTGAATGCAAAGCAGTATCATGGAATCATTAGGCGCAGGCAGATCCGTGCTAAGGCAATGATGGAGAATAAACTTGCAAGAACT
CGTAAGGATGTGAATAGTTTCCTTGTGGTATATTTCCTCACCGGTATAACTAGAACTAGAAGTCAATCTGATGCGGATGCCTGTCTGAAGAGTAACTCCCATTCGGTTTT
ATTGGAATTTCTGTTTGCCAACCTTTGGGCTATAAGAATTTTCGATATCAACATTTGGCTTGAATATGATATACTACCATATATGCACGAATCGCGTCATCTTCATGCAA
TGCGCCGTCCACGGGGATCTGGTGGCCGTTTCTTGAACACAAAGAATCTGAAAAATGGGAAACTCTCAATGGAACCAAAGAAAATTGATGATGTGAACCTTTCTGATTCA
ACTGGTTCCCAATGTTCTGTGGTTCTGCAATCAGAAAGTGGAACTTTGAACTCCCGAAATGATGCAAAGGGGAGGGGCTTTAGTCTCTCGAGTTCGGAGAGATCATTGAT
GGAGGAGGGCATGGCATGGTTCTGCCCAAATGGGTTGCAGCAGCTGACAACTGCTGCGACCTTTGTGTTTGATGGCGAAGGCAATGGGTTCTTGGTGCAAGTCATTGCAC
CAAAACCCTGTCTTTGTTGCGACCAAATCCAGCTTGCTACAATGTCGTCGACGGCAATTGTCTCAGGCTTCGCGATGAGGATAGTCCCCGCGTTTGTATGGGAAGTGGCT
AAGAGAAACAAAGGAATCATCCAATCAGATATCAAACATTCTGAGGCAATTTGTAATGTACTGAACAAAGATGCTTGGCTTGTGGGTTTGTTTATGGACAACTGTCAATG
TTTGTCTTTCAGTTTGTGTTTGCAAGGGTCGCTGGAATGGGTTTCTGTCGCATGGGGGGTTTTGTGGAACTGGTTTCATTGGATATGCGGTTTTTTAGCGGATAGAGCAA
GAGAGCAAGAAGAAGGGCCTGGTCCCACCGCGTTTTTCAGATCTGGAACCAACAAAGGCGGCGACTGCGGCGAAGTTCGGCGGATGAATGCTAGAAGAAGGAGAGATGGG
GAATGGGAAAAGATCATCTCATCGTATCACCTCCCCGGAGCAGAAATCCATTTGTCACTTGGGGAATTCCACCATTTCCATTTTCTTCCTCAAATGTCAGCTTCTTCTGT
CTCAATGGCTGCTGCTACTACAGCAACTTCGGTTTCCTCTTCTTCTCCATTCTCCAAGAAACTCTTCTTCACTCAACATCCCAACCAAATTCCCTCCCATTTTTCTCCGA
AACAGAACCCATTGAAGCTTCTCAACCTCCGGATTCATTTACCCAACTTTTACCCTCTTTCCTTTTCCTCTTCTTCTCATCTCCACTGTGCTCCTCCTGCTTTCGATGGG
CTCCAAGTCTCCGACTCTGAGACAGAATACGCAGAGGTACAAGAATCGTATGGAGAAGAAGAAACCCAAGAGGAAGAAGACGAACAAAAGGTATCGGTGTCTCGCGAAGC
AGGGAAGCTTTATGTTGGGAATTTACCATATGCTATGACTTCTTCCCAATTGTCTGAGGTCTTCACCGAAGCTGGTCATGTGGTTTCGGTACAGGTTATATATGACAAAG
TTACGGATAGGAGTAGGGGATTTGCATTTGTGACAATGGCCACTTTGGAGGAAGCTAAAGAAGCAATTCGGATGTTTGATGGCTCTCAAATCGGTGGTCGAACTGTTCGG
GTGAACTTCCCTGAAGTGCCAAGGGGAGGAGAAAAGGAAGTCATGGGGCCAAAGATAAGAAGCAGCTATAACAAATTTGTAGATAGTCCTCACAAGATATATGCAGGGAA
CCTTGGTTGGGGTCTCACTTCTCAGAGTCTTAGAGAGGCTTTTGAAAACCAACCAGGGATATTGAGTGCCAAGGTCATCTATGATAGGGCATCTGGAAAAAGTAGAGGTT
TTGGATTTGTATCCTTCGAAACTGCTGAGGATGCAGAGTCTGCTTTGGAGTCCATGAATGGAGTGGAAGTTGAAGGGCGGCCGCTTCGTTTGAACATTGCTGCAGGGCAG
GCCCTGACTTCCCCAGCAGCATTCACGAGGACTGAAAATGAAATTGACAGCAAAGAATTGCTTACCAGTATCAGTGCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTCATAGTCTCTTTGTTCCAACTCTGAATTGTAGGAATTGGTTGAAGGGATTAGTAAAACAACTGTTTCTTGGTGATTTTGAGGCTTGGAAAATGGCACCACAAAC
TGGCTATTTGAAAGAACATGAAGGAATTGTTCCTAATTCCCTTGGCCAGTTATCATCTTCTCCTGCCCGTTTATGGAGTGCCTTTGGGCAAGGTTCTCAATCAATCTTTG
GGGATTTTGGTCTAGTCAAGGCTTCATCTGAACAACTTGGCAGTAATGGGAAGGAGTTCAATGGCACCAAACAAGCTGCTCATGGCTTGGAGAAAGTGAATATAGCTCCA
TTTTCCATCTATCCTGGTGACTGTAAGATTTCGATGGATGCACAAAAACCTTCACCAGTTTTTTCCCTGCAATCACCCTTGTCAGAATATCAAAATCGTTTTGAGCTTGG
ATTTGGCCAACCTTTGATATGTGCAAATTATCCGTACATGGAACAGCATTATGGCATCCTGTCAGCTTATGGACCTCAAATACCAGGCCGGATTATGCTGCCAATGAGCT
TAACATCAGATGATGGACCTATTTATGTGAATGCAAAGCAGTATCATGGAATCATTAGGCGCAGGCAGATCCGTGCTAAGGCAATGATGGAGAATAAACTTGCAAGAACT
CGTAAGGATGTGAATAGTTTCCTTGTGGTATATTTCCTCACCGGTATAACTAGAACTAGAAGTCAATCTGATGCGGATGCCTGTCTGAAGAGTAACTCCCATTCGGTTTT
ATTGGAATTTCTGTTTGCCAACCTTTGGGCTATAAGAATTTTCGATATCAACATTTGGCTTGAATATGATATACTACCATATATGCACGAATCGCGTCATCTTCATGCAA
TGCGCCGTCCACGGGGATCTGGTGGCCGTTTCTTGAACACAAAGAATCTGAAAAATGGGAAACTCTCAATGGAACCAAAGAAAATTGATGATGTGAACCTTTCTGATTCA
ACTGGTTCCCAATGTTCTGTGGTTCTGCAATCAGAAAGTGGAACTTTGAACTCCCGAAATGATGCAAAGGGGAGGGGCTTTAGTCTCTCGAGTTCGGAGAGATCATTGAT
GGAGGAGGGCATGGCATGGTTCTGCCCAAATGGGTTGCAGCAGCTGACAACTGCTGCGACCTTTGTGTTTGATGGCGAAGGCAATGGGTTCTTGGTGCAAGTCATTGCAC
CAAAACCCTGTCTTTGTTGCGACCAAATCCAGCTTGCTACAATGTCGTCGACGGCAATTGTCTCAGGCTTCGCGATGAGGATAGTCCCCGCGTTTGTATGGGAAGTGGCT
AAGAGAAACAAAGGAATCATCCAATCAGATATCAAACATTCTGAGGCAATTTGTAATGTACTGAACAAAGATGCTTGGCTTGTGGGTTTGTTTATGGACAACTGTCAATG
TTTGTCTTTCAGTTTGTGTTTGCAAGGGTCGCTGGAATGGGTTTCTGTCGCATGGGGGGTTTTGTGGAACTGGTTTCATTGGATATGCGGTTTTTTAGCGGATAGAGCAA
GAGAGCAAGAAGAAGGGCCTGGTCCCACCGCGTTTTTCAGATCTGGAACCAACAAAGGCGGCGACTGCGGCGAAGTTCGGCGGATGAATGCTAGAAGAAGGAGAGATGGG
GAATGGGAAAAGATCATCTCATCGTATCACCTCCCCGGAGCAGAAATCCATTTGTCACTTGGGGAATTCCACCATTTCCATTTTCTTCCTCAAATGTCAGCTTCTTCTGT
CTCAATGGCTGCTGCTACTACAGCAACTTCGGTTTCCTCTTCTTCTCCATTCTCCAAGAAACTCTTCTTCACTCAACATCCCAACCAAATTCCCTCCCATTTTTCTCCGA
AACAGAACCCATTGAAGCTTCTCAACCTCCGGATTCATTTACCCAACTTTTACCCTCTTTCCTTTTCCTCTTCTTCTCATCTCCACTGTGCTCCTCCTGCTTTCGATGGG
CTCCAAGTCTCCGACTCTGAGACAGAATACGCAGAGGTACAAGAATCGTATGGAGAAGAAGAAACCCAAGAGGAAGAAGACGAACAAAAGGTATCGGTGTCTCGCGAAGC
AGGGAAGCTTTATGTTGGGAATTTACCATATGCTATGACTTCTTCCCAATTGTCTGAGGTCTTCACCGAAGCTGGTCATGTGGTTTCGGTACAGGTTATATATGACAAAG
TTACGGATAGGAGTAGGGGATTTGCATTTGTGACAATGGCCACTTTGGAGGAAGCTAAAGAAGCAATTCGGATGTTTGATGGCTCTCAAATCGGTGGTCGAACTGTTCGG
GTGAACTTCCCTGAAGTGCCAAGGGGAGGAGAAAAGGAAGTCATGGGGCCAAAGATAAGAAGCAGCTATAACAAATTTGTAGATAGTCCTCACAAGATATATGCAGGGAA
CCTTGGTTGGGGTCTCACTTCTCAGAGTCTTAGAGAGGCTTTTGAAAACCAACCAGGGATATTGAGTGCCAAGGTCATCTATGATAGGGCATCTGGAAAAAGTAGAGGTT
TTGGATTTGTATCCTTCGAAACTGCTGAGGATGCAGAGTCTGCTTTGGAGTCCATGAATGGAGTGGAAGTTGAAGGGCGGCCGCTTCGTTTGAACATTGCTGCAGGGCAG
GCCCTGACTTCCCCAGCAGCATTCACGAGGACTGAAAATGAAATTGACAGCAAAGAATTGCTTACCAGTATCAGTGCCTGAAATCTGATGTTGATGGTTGAGTTGGCCCG
AAAATTAAACTGAAACTGAAGGCTAAGGAAATTGTTCTGAAAATTGCATACCTTCAAGTCCTCCCTCGCAATTTTCTTTCATTCCTTGCTCAACTACCAAGGAAGCCTAG
ATTTTTTTCATTGAGTGGTGTATGTGAGCTGAGGAAAGTGTGAAAGTGTTGCCATAGGTTAACGAGTTTTTTATGTATGACTTGAGAATTAGACTACATGTACTATGGAG
TCTCTTTGCTGAGTTAATACTACGATATATTCATCTGTCGGCCTTTTATTACCTTATCTTTGCTAATTAAGACTTCGAGTATAC
Protein sequenceShow/hide protein sequence
MSHSLFVPTLNCRNWLKGLVKQLFLGDFEAWKMAPQTGYLKEHEGIVPNSLGQLSSSPARLWSAFGQGSQSIFGDFGLVKASSEQLGSNGKEFNGTKQAAHGLEKVNIAP
FSIYPGDCKISMDAQKPSPVFSLQSPLSEYQNRFELGFGQPLICANYPYMEQHYGILSAYGPQIPGRIMLPMSLTSDDGPIYVNAKQYHGIIRRRQIRAKAMMENKLART
RKDVNSFLVVYFLTGITRTRSQSDADACLKSNSHSVLLEFLFANLWAIRIFDINIWLEYDILPYMHESRHLHAMRRPRGSGGRFLNTKNLKNGKLSMEPKKIDDVNLSDS
TGSQCSVVLQSESGTLNSRNDAKGRGFSLSSSERSLMEEGMAWFCPNGLQQLTTAATFVFDGEGNGFLVQVIAPKPCLCCDQIQLATMSSTAIVSGFAMRIVPAFVWEVA
KRNKGIIQSDIKHSEAICNVLNKDAWLVGLFMDNCQCLSFSLCLQGSLEWVSVAWGVLWNWFHWICGFLADRAREQEEGPGPTAFFRSGTNKGGDCGEVRRMNARRRRDG
EWEKIISSYHLPGAEIHLSLGEFHHFHFLPQMSASSVSMAAATTATSVSSSSPFSKKLFFTQHPNQIPSHFSPKQNPLKLLNLRIHLPNFYPLSFSSSSHLHCAPPAFDG
LQVSDSETEYAEVQESYGEEETQEEEDEQKVSVSREAGKLYVGNLPYAMTSSQLSEVFTEAGHVVSVQVIYDKVTDRSRGFAFVTMATLEEAKEAIRMFDGSQIGGRTVR
VNFPEVPRGGEKEVMGPKIRSSYNKFVDSPHKIYAGNLGWGLTSQSLREAFENQPGILSAKVIYDRASGKSRGFGFVSFETAEDAESALESMNGVEVEGRPLRLNIAAGQ
ALTSPAAFTRTENEIDSKELLTSISA