; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G25040 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G25040
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description30-kDa cleavage and polyadenylation specificity factor 30
Genome locationChr1:20470839..20476654
RNA-Seq ExpressionCSPI01G25040
SyntenyCSPI01G25040
Gene Ontology termsGO:0000381 - regulation of alternative mRNA splicing, via spliceosome (biological process)
GO:0005654 - nucleoplasm (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:1990247 - N6-methyladenosine-containing RNA binding (molecular function)
InterPro domainsIPR000571 - Zinc finger, CCCH-type
IPR007275 - YTH domain
IPR036855 - Zinc finger, CCCH-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004141524.1 30-kDa cleavage and polyadenylation specificity factor 30 [Cucumis sativus]3.4e-15868.56Show/hide
Query:  DSEGILSFDFEGGLDAVLTNPTVAPSSSLHLVPSDSSAPPA-------LSNFLPSSLTPEPLGAPTANLGTRRSFRQTVCRHWLRSLCMKGDACGFIHQY
        DSEG+LSFDFEGGLDA  TNP  A +SSL ++ SDSSAPPA       LS  L  +++ EP GAP  N+G RRSFRQTVCRHWLRSLCMKGDACGF+HQY
Subjt:  DSEGILSFDFEGGLDAVLTNPTVAPSSSLHLVPSDSSAPPA-------LSNFLPSSLTPEPLGAPTANLGTRRSFRQTVCRHWLRSLCMKGDACGFIHQY

Query:  DKSRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFCLNGSDCRYRHAKLPGPPPSVEEIIQKIQHL--------------ENVLSLRRNE----
        DKSRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFC NG DCRYRHAKLPGPPP +EEI+QKIQHL                V   ++NE    
Subjt:  DKSRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFCLNGSDCRYRHAKLPGPPPSVEEIIQKIQHL--------------ENVLSLRRNE----

Query:  -----TIRKGMNGV-----------------------------------------------ILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFD
              + +G+ G                                                I RYFIVKSCN ENLELSVQQGV ATQRSNEAKLNEAFD
Subjt:  -----TIRKGMNGV-----------------------------------------------ILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFD

Query:  SADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYL
        SADNVILIFSVNRTRHFQGCA MMSRIGGSVSGGNWKYAHGT HYGQ F LKWLKL ELSFQKTRHLRN YNE+LP+KISRDCQELEPS+GEQLASLLYL
Subjt:  SADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYL

Query:  EPDGELMAVSIAAESKRKDEKAKGVNPDIGNENPDIVLF
        EPDGELMAVS+AAESKR++EKAKGVNPDIG+ENPDIV F
Subjt:  EPDGELMAVSIAAESKRKDEKAKGVNPDIGNENPDIVLF

XP_008445183.1 PREDICTED: 30-kDa cleavage and polyadenylation specificity factor 30 [Cucumis melo]1.8e-16772.67Show/hide
Query:  DSEGILSFDFEGGLDAVLTNP---TVAPSSSLHLVPSDSSAPPALSNFLPSS----LTPEPLGAPTANLGTRRSFRQTVCRHWLRSLCMKGDACGFIHQY
        DSEG+LSFDFEGGLDA  TNP     A SSSL L+PSDSSAPP LSN LP S    L PEPLGAPTAN+GTRRSFRQTVCRHWLRSLCMKGDACGF+HQY
Subjt:  DSEGILSFDFEGGLDAVLTNP---TVAPSSSLHLVPSDSSAPPALSNFLPSS----LTPEPLGAPTANLGTRRSFRQTVCRHWLRSLCMKGDACGFIHQY

Query:  DKSRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFCLNGSDCRYRHAKLPGPPPSVEEIIQKIQHL--------------ENVLSLRRNE----
        DKSRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFC NG DCRYRHAKLPGPPPSVEEI+QKIQHL                V   ++NE    
Subjt:  DKSRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFCLNGSDCRYRHAKLPGPPPSVEEIIQKIQHL--------------ENVLSLRRNE----

Query:  --------------------------------------TIRKGMNGV--------------ILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFD
                                               I+   NG               I RYFIVKSCN ENLELSVQQGV ATQRSNEAKLNEAFD
Subjt:  --------------------------------------TIRKGMNGV--------------ILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFD

Query:  SADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYL
        SADNVILIFSVNRTRHFQGCA MMSRIGGSVSGGNWKYAHGTAHYGQ F LKWLKL ELSFQKTRHLRN YNE+LP+KISRDCQELEPSIGEQLASLLYL
Subjt:  SADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYL

Query:  EPDGELMAVSIAAESKRKDEKAKGVNPDIGNENPDIVLF
        EPDGELMAVSIAAESKR++EKAKGVNPDIGNENPDIV F
Subjt:  EPDGELMAVSIAAESKRKDEKAKGVNPDIGNENPDIVLF

XP_008459517.1 PREDICTED: 30-kDa cleavage and polyadenylation specificity factor 30-like [Cucumis melo]5.2e-15969.93Show/hide
Query:  DSEGILSFDFEGGLDAVLTNPTVAPSSSLHLVPSDSSAPP---ALSNFLPSSLTP----EPLGAPTANLGTRRSFRQTVCRHWLRSLCMKGDACGFIHQY
        DSEG+LSFDFEGGLDA  TNP  A +SSL L+ SDSSAPP   A+SN L  +L P    EP GAP  N+G RRSFRQTVCRHWLRSLCMKGDACGF+HQY
Subjt:  DSEGILSFDFEGGLDAVLTNPTVAPSSSLHLVPSDSSAPP---ALSNFLPSSLTP----EPLGAPTANLGTRRSFRQTVCRHWLRSLCMKGDACGFIHQY

Query:  DKSRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFCLNGSDCRYRHAKLPGPPPSVEEIIQKIQHL--------------ENVLSLRRNETIR-
        DKSRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFC NG DCRYRHAKLPGPPP VEEI+QKIQHL                V   ++NE  + 
Subjt:  DKSRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFCLNGSDCRYRHAKLPGPPPSVEEIIQKIQHL--------------ENVLSLRRNETIR-

Query:  --------KGMNGV-----------------------------------------------ILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFD
                +G+ G                                                I RYFIVKSCN ENLELSVQQGV ATQRSNEAKLNEAFD
Subjt:  --------KGMNGV-----------------------------------------------ILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFD

Query:  SADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYL
        +ADNVILIFSVNRTRHFQGCA MMSRIGGSVSGGNWKYAHGTAHYGQ F LKWLKL ELSFQKTRHLRN YNE+LP+KISRDCQELEPSIGEQLASLLYL
Subjt:  SADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYL

Query:  EPDGELMAVSIAAESKRKDEKAKGVNPDIGNENPDIVLF
        EPDGELMAVSIAAESKR++EKAKGVNPDIG+ENPDIV F
Subjt:  EPDGELMAVSIAAESKRKDEKAKGVNPDIGNENPDIVLF

XP_011648522.2 LOW QUALITY PROTEIN: 30-kDa cleavage and polyadenylation specificity factor 30, partial [Cucumis sativus]5.6e-17775.34Show/hide
Query:  STLFFVDSEGILSFDFEGGLDAVLTNPTVAPSSSLHLVPSDSSAPPALSNFLPSSLTPEPLGAPTANLGTRRSFRQTVCRHWLRSLCMKGDACGFIHQYD
        S+    DSEG+LSFDFEGGLDAVLTNPTVAPSSSL LVPSDSSAPPALSNFLPSSLTPEPLGAPTANLGTRRSFRQTVCRHWLRSLCMKGDACGF+HQYD
Subjt:  STLFFVDSEGILSFDFEGGLDAVLTNPTVAPSSSLHLVPSDSSAPPALSNFLPSSLTPEPLGAPTANLGTRRSFRQTVCRHWLRSLCMKGDACGFIHQYD

Query:  KSRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFCLNGSDCRYRHAKLPGPPPSVEEIIQKIQHL--------------ENVLSLRRNE-----
         SRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFC NGSDCRYRHAKLPGPPPSVEEI+QKIQHL                V   ++NE     
Subjt:  KSRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFCLNGSDCRYRHAKLPGPPPSVEEIIQKIQHL--------------ENVLSLRRNE-----

Query:  -------------------------------------TIRKGMNGV--------------ILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFDS
                                              IR   NG               I RYFIVKSCN ENLE    +GV ATQRSNEAKLNEAFDS
Subjt:  -------------------------------------TIRKGMNGV--------------ILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFDS

Query:  ADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYLE
        ADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYG+   LKWLKL ELSFQKTRHLRNTYNE+LP+KISRDCQELEPSIGEQLASLLYLE
Subjt:  ADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYLE

Query:  PDGELMAVSIAAESKRKDEKAKGVNPDIGNENPDIVLF
        PDGELMAVSIAAESKRKDEKAKGVNPDIGNENPDIVLF
Subjt:  PDGELMAVSIAAESKRKDEKAKGVNPDIGNENPDIVLF

XP_038894441.1 30-kDa cleavage and polyadenylation specificity factor 30-like [Benincasa hispida]1.2e-16371.4Show/hide
Query:  DSEGILSFDFEGGLDAVLTNP-TVAPSSSLHLVPSDSSAPPALSNFLPSSL----TPEPLGAPTANLGTRRSFRQTVCRHWLRSLCMKGDACGFIHQYDK
        DS+G+LSFDFEGGLDA  TNP   A +SSL L+ SDSSAPPALSN +P  L    TPE  GAPT N G+RRSFRQTVCRHWLRSLCMKGDACGF+HQYDK
Subjt:  DSEGILSFDFEGGLDAVLTNP-TVAPSSSLHLVPSDSSAPPALSNFLPSSL----TPEPLGAPTANLGTRRSFRQTVCRHWLRSLCMKGDACGFIHQYDK

Query:  SRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFCLNGSDCRYRHAKLPGPPPSVEEIIQKIQHL-------ENVLSLRR-------NE------
        SRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFC NG DCRYRHAKLPGPPPSVEEI+QKIQH+        N L L+R       NE      
Subjt:  SRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFCLNGSDCRYRHAKLPGPPPSVEEIIQKIQHL-------ENVLSLRR-------NE------

Query:  ---TIRKGMNGV-----------------------------------------------ILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFDSA
            + +G+ G                                                I RYFIVKSCN ENLELSVQQGV ATQRSNEAKLNEAFDSA
Subjt:  ---TIRKGMNGV-----------------------------------------------ILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFDSA

Query:  DNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYLEP
        DNVILIFSVNRTRHFQGCA MMSRIGGSVSGGNWKYAHGTAHYGQ F LKWLKL ELSFQKTRHLRN YNE+LP+KISRDCQELEPSIGEQLASLLYLEP
Subjt:  DNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYLEP

Query:  DGELMAVSIAAESKRKDEKAKGVNPDIGNENPDIVLF
        DGELMAVSIAAE+KR++EKAKGVNPDIGNENPDIV F
Subjt:  DGELMAVSIAAESKRKDEKAKGVNPDIGNENPDIVLF

TrEMBL top hitse value%identityAlignment
A0A0A0KSR6 Uncharacterized protein1.6e-15868.56Show/hide
Query:  DSEGILSFDFEGGLDAVLTNPTVAPSSSLHLVPSDSSAPPA-------LSNFLPSSLTPEPLGAPTANLGTRRSFRQTVCRHWLRSLCMKGDACGFIHQY
        DSEG+LSFDFEGGLDA  TNP  A +SSL ++ SDSSAPPA       LS  L  +++ EP GAP  N+G RRSFRQTVCRHWLRSLCMKGDACGF+HQY
Subjt:  DSEGILSFDFEGGLDAVLTNPTVAPSSSLHLVPSDSSAPPA-------LSNFLPSSLTPEPLGAPTANLGTRRSFRQTVCRHWLRSLCMKGDACGFIHQY

Query:  DKSRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFCLNGSDCRYRHAKLPGPPPSVEEIIQKIQHL--------------ENVLSLRRNE----
        DKSRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFC NG DCRYRHAKLPGPPP +EEI+QKIQHL                V   ++NE    
Subjt:  DKSRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFCLNGSDCRYRHAKLPGPPPSVEEIIQKIQHL--------------ENVLSLRRNE----

Query:  -----TIRKGMNGV-----------------------------------------------ILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFD
              + +G+ G                                                I RYFIVKSCN ENLELSVQQGV ATQRSNEAKLNEAFD
Subjt:  -----TIRKGMNGV-----------------------------------------------ILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFD

Query:  SADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYL
        SADNVILIFSVNRTRHFQGCA MMSRIGGSVSGGNWKYAHGT HYGQ F LKWLKL ELSFQKTRHLRN YNE+LP+KISRDCQELEPS+GEQLASLLYL
Subjt:  SADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYL

Query:  EPDGELMAVSIAAESKRKDEKAKGVNPDIGNENPDIVLF
        EPDGELMAVS+AAESKR++EKAKGVNPDIG+ENPDIV F
Subjt:  EPDGELMAVSIAAESKRKDEKAKGVNPDIGNENPDIVLF

A0A1S3BC28 30-kDa cleavage and polyadenylation specificity factor 308.7e-16872.67Show/hide
Query:  DSEGILSFDFEGGLDAVLTNP---TVAPSSSLHLVPSDSSAPPALSNFLPSS----LTPEPLGAPTANLGTRRSFRQTVCRHWLRSLCMKGDACGFIHQY
        DSEG+LSFDFEGGLDA  TNP     A SSSL L+PSDSSAPP LSN LP S    L PEPLGAPTAN+GTRRSFRQTVCRHWLRSLCMKGDACGF+HQY
Subjt:  DSEGILSFDFEGGLDAVLTNP---TVAPSSSLHLVPSDSSAPPALSNFLPSS----LTPEPLGAPTANLGTRRSFRQTVCRHWLRSLCMKGDACGFIHQY

Query:  DKSRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFCLNGSDCRYRHAKLPGPPPSVEEIIQKIQHL--------------ENVLSLRRNE----
        DKSRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFC NG DCRYRHAKLPGPPPSVEEI+QKIQHL                V   ++NE    
Subjt:  DKSRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFCLNGSDCRYRHAKLPGPPPSVEEIIQKIQHL--------------ENVLSLRRNE----

Query:  --------------------------------------TIRKGMNGV--------------ILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFD
                                               I+   NG               I RYFIVKSCN ENLELSVQQGV ATQRSNEAKLNEAFD
Subjt:  --------------------------------------TIRKGMNGV--------------ILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFD

Query:  SADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYL
        SADNVILIFSVNRTRHFQGCA MMSRIGGSVSGGNWKYAHGTAHYGQ F LKWLKL ELSFQKTRHLRN YNE+LP+KISRDCQELEPSIGEQLASLLYL
Subjt:  SADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYL

Query:  EPDGELMAVSIAAESKRKDEKAKGVNPDIGNENPDIVLF
        EPDGELMAVSIAAESKR++EKAKGVNPDIGNENPDIV F
Subjt:  EPDGELMAVSIAAESKRKDEKAKGVNPDIGNENPDIVLF

A0A1S3CAV8 30-kDa cleavage and polyadenylation specificity factor 30-like2.5e-15969.93Show/hide
Query:  DSEGILSFDFEGGLDAVLTNPTVAPSSSLHLVPSDSSAPP---ALSNFLPSSLTP----EPLGAPTANLGTRRSFRQTVCRHWLRSLCMKGDACGFIHQY
        DSEG+LSFDFEGGLDA  TNP  A +SSL L+ SDSSAPP   A+SN L  +L P    EP GAP  N+G RRSFRQTVCRHWLRSLCMKGDACGF+HQY
Subjt:  DSEGILSFDFEGGLDAVLTNPTVAPSSSLHLVPSDSSAPP---ALSNFLPSSLTP----EPLGAPTANLGTRRSFRQTVCRHWLRSLCMKGDACGFIHQY

Query:  DKSRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFCLNGSDCRYRHAKLPGPPPSVEEIIQKIQHL--------------ENVLSLRRNETIR-
        DKSRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFC NG DCRYRHAKLPGPPP VEEI+QKIQHL                V   ++NE  + 
Subjt:  DKSRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFCLNGSDCRYRHAKLPGPPPSVEEIIQKIQHL--------------ENVLSLRRNETIR-

Query:  --------KGMNGV-----------------------------------------------ILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFD
                +G+ G                                                I RYFIVKSCN ENLELSVQQGV ATQRSNEAKLNEAFD
Subjt:  --------KGMNGV-----------------------------------------------ILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFD

Query:  SADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYL
        +ADNVILIFSVNRTRHFQGCA MMSRIGGSVSGGNWKYAHGTAHYGQ F LKWLKL ELSFQKTRHLRN YNE+LP+KISRDCQELEPSIGEQLASLLYL
Subjt:  SADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYL

Query:  EPDGELMAVSIAAESKRKDEKAKGVNPDIGNENPDIVLF
        EPDGELMAVSIAAESKR++EKAKGVNPDIG+ENPDIV F
Subjt:  EPDGELMAVSIAAESKRKDEKAKGVNPDIGNENPDIVLF

A0A5D3C2W1 30-kDa cleavage and polyadenylation specificity factor 308.7e-16872.67Show/hide
Query:  DSEGILSFDFEGGLDAVLTNP---TVAPSSSLHLVPSDSSAPPALSNFLPSS----LTPEPLGAPTANLGTRRSFRQTVCRHWLRSLCMKGDACGFIHQY
        DSEG+LSFDFEGGLDA  TNP     A SSSL L+PSDSSAPP LSN LP S    L PEPLGAPTAN+GTRRSFRQTVCRHWLRSLCMKGDACGF+HQY
Subjt:  DSEGILSFDFEGGLDAVLTNP---TVAPSSSLHLVPSDSSAPPALSNFLPSS----LTPEPLGAPTANLGTRRSFRQTVCRHWLRSLCMKGDACGFIHQY

Query:  DKSRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFCLNGSDCRYRHAKLPGPPPSVEEIIQKIQHL--------------ENVLSLRRNE----
        DKSRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFC NG DCRYRHAKLPGPPPSVEEI+QKIQHL                V   ++NE    
Subjt:  DKSRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFCLNGSDCRYRHAKLPGPPPSVEEIIQKIQHL--------------ENVLSLRRNE----

Query:  --------------------------------------TIRKGMNGV--------------ILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFD
                                               I+   NG               I RYFIVKSCN ENLELSVQQGV ATQRSNEAKLNEAFD
Subjt:  --------------------------------------TIRKGMNGV--------------ILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFD

Query:  SADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYL
        SADNVILIFSVNRTRHFQGCA MMSRIGGSVSGGNWKYAHGTAHYGQ F LKWLKL ELSFQKTRHLRN YNE+LP+KISRDCQELEPSIGEQLASLLYL
Subjt:  SADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYL

Query:  EPDGELMAVSIAAESKRKDEKAKGVNPDIGNENPDIVLF
        EPDGELMAVSIAAESKR++EKAKGVNPDIGNENPDIV F
Subjt:  EPDGELMAVSIAAESKRKDEKAKGVNPDIGNENPDIVLF

A0A6J1CE78 30-kDa cleavage and polyadenylation specificity factor 302.6e-15668.11Show/hide
Query:  DSEGILSFDFEGGLDAVLTNPTVAPSSSLHLVPSDSSAPPALSNFLPS-------SLTPEPLGAPTANLGTRRSFRQTVCRHWLRSLCMKGDACGFIHQY
        D EG+LSFDFEGGLDA  TNP    ++SL L+ SD SAPPA S    S       ++T E  GA T N+G+RRSFRQTVCRHWLRSLCMKGDACGF+HQY
Subjt:  DSEGILSFDFEGGLDAVLTNPTVAPSSSLHLVPSDSSAPPALSNFLPS-------SLTPEPLGAPTANLGTRRSFRQTVCRHWLRSLCMKGDACGFIHQY

Query:  DKSRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFCLNGSDCRYRHAKLPGPPPSVEEIIQKIQHLENVLSLRRNE------------------
        DKSRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFC NG DCRYRHAKLPGPPP VEEI+QKIQHL +      N+                  
Subjt:  DKSRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFCLNGSDCRYRHAKLPGPPPSVEEIIQKIQHLENVLSLRRNE------------------

Query:  -----TIRKGMNGV-----------------------------------------------ILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFD
             T+ +G+ G                                                I RYFIVKSCN ENLELSVQQGV ATQRSNEAKLNEAFD
Subjt:  -----TIRKGMNGV-----------------------------------------------ILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFD

Query:  SADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYL
        SADNVILIFSVNRTRHFQGCA MMSRIGGSVSGGNWKYAHGTAHYGQ F LKWLKL ELSFQKTRHLRN YNE+LP+KISRDCQELEPSIGEQLASLLYL
Subjt:  SADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYL

Query:  EPDGELMAVSIAAESKRKDEKAKGVNPDIGNENPDIVLF
        EPDGELMAVS+AAESKR++EKAKGVNPDIG+ENPDIV F
Subjt:  EPDGELMAVSIAAESKRKDEKAKGVNPDIGNENPDIVLF

SwissProt top hitse value%identityAlignment
A9LNK9 30-kDa cleavage and polyadenylation specificity factor 301.0e-13360.95Show/hide
Query:  DSEGILSFDFEGGLDAVLTNPTVAPSSSLHLVPSDSSAPPALSNFLPSSLTPEPLGAPTANLGTRRSFRQTVCRHWLRSLCMKGDACGFIHQYDKSRMPI
        D++G LSFDFEGGLD    +  V  ++S+ + P ++S+  A+ N  P   T +   A  A  G  RSFRQTVCRHWLR LCMKGDACGF+HQ+DK+RMPI
Subjt:  DSEGILSFDFEGGLDAVLTNPTVAPSSSLHLVPSDSSAPPALSNFLPSSLTPEPLGAPTANLGTRRSFRQTVCRHWLRSLCMKGDACGFIHQYDKSRMPI

Query:  CRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFCLNGSDCRYRHAKLPGPPPSVEEIIQKIQHL-------------ENV-------------------
        CRFFRLYGECREQDCVYKHTNEDIKECNMYK GFC NG DCRYRHAKLPGPPP VEE++QKIQ L              NV                   
Subjt:  CRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFCLNGSDCRYRHAKLPGPPPSVEEIIQKIQHL-------------ENV-------------------

Query:  ----LSLRRNETIRKGMNGV----------------------ILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFDSADNVILIFSVNRTRHFQG
            L  ++ +  ++  + V                      + RYF+VKS N EN ELSVQQGV ATQRSNEAKLNEAFDS +NVILIFSVNRTRHFQG
Subjt:  ----LSLRRNETIRKGMNGV----------------------ILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFDSADNVILIFSVNRTRHFQG

Query:  CANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYLEPDGELMAVSIAAESKRKD
        CA M SRIGG + GGNWK+ HGTA YG+ F +KWLKL ELSF KTR+LRN YNE+LP+KISRDCQELEPS+GEQLASLLYLEPD ELMA+SIAAE+KR++
Subjt:  CANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYLEPDGELMAVSIAAESKRKD

Query:  EKAKGVNPDIGNENPDIVLF
        EKAKGVNP+   ENPDIV F
Subjt:  EKAKGVNPDIGNENPDIVLF

B2RR83 3'-5' RNA helicase YTHDC22.3e-2446.15Show/hide
Query:  LRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFDSADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQ
        +RYFI+KS N  NLE+S Q+G+ +T  SNE KLN AF  +  V L+FSV  + HFQG + M S IG   S  +W    G+A  G +F ++W++   L FQ
Subjt:  LRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFDSADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQ

Query:  KTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYLEPDGE
           HL N +N++  ++ISRD QELEP +GEQL  L    P GE
Subjt:  KTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYLEPDGE

Q0DA50 Zinc finger CCCH domain-containing protein 455.1e-11752.14Show/hide
Query:  EGILSFDFEGGLDAVLTNPTVAPSSSLHLVPSDSSAPPALSNFLPSSLTPEPLGAPTANLGTRRSFRQTVCRHWLRSLCMKGDACGFIHQYDKSRMPICR
        +G LSFDFEGGLD         P +     P+  S+ P           P   G      G R S+RQTVCRHWLR LCMKG+ACGF+HQ+DK+RMP+CR
Subjt:  EGILSFDFEGGLDAVLTNPTVAPSSSLHLVPSDSSAPPALSNFLPSSLTPEPLGAPTANLGTRRSFRQTVCRHWLRSLCMKGDACGFIHQYDKSRMPICR

Query:  FFRLYGECREQDCVYKHTNEDIKECNMYKFGFCLNGSDCRYRHAKLPGPPPSVEEIIQKI----------QHLENVLSLR--------------------
        FFR +GECRE DC YKH+ +D+KECNMYK GFC NG +CRY+H KLPGPPP VEE++QKI          QH  N  + +                    
Subjt:  FFRLYGECREQDCVYKHTNEDIKECNMYKFGFCLNGSDCRYRHAKLPGPPPSVEEIIQKI----------QHLENVLSLR--------------------

Query:  ---------------------------------------RNETIRKGMNGVI--------------LRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLN
                                                N+ ++   NG                 RYFIVKSCN ENLE+SVQQG+ ATQRSNEAKLN
Subjt:  ---------------------------------------RNETIRKGMNGVI--------------LRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLN

Query:  EAFDSADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLAS
        EAF+S +NVILIFS+NRTR+FQGCA M SRIGG + GGNWK AHGTAHYG+ F ++WLKL ELSFQKT HLRN YN++LP+KISRDCQELEP IGEQLAS
Subjt:  EAFDSADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLAS

Query:  LLYLEPDGELMAVSIAAESKRKDEKAKGVNPDIGNENPDIVLF
        LLYLEPD EL A+ IAAE+K+++EKAKGV+ D   +N DIVLF
Subjt:  LLYLEPDGELMAVSIAAESKRKDEKAKGVNPDIGNENPDIVLF

Q5R746 3'-5' RNA helicase YTHDC23.0e-2446.15Show/hide
Query:  LRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFDSADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQ
        +RYFI+KS N  NLE+S Q+G+ +T  SNE KLN AF  +  V L+FSV  + HFQG + M S IG   S  +W    G+A  G +F ++W++   L FQ
Subjt:  LRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFDSADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQ

Query:  KTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYLEPDGE
           HL N +N++  ++ISRD QELEP +GEQL  L    P GE
Subjt:  KTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYLEPDGE

Q9H6S0 3'-5' RNA helicase YTHDC26.7e-2446.15Show/hide
Query:  LRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFDSADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQ
        +RYFI+KS N  NLE+S Q+G+ +T  SNE KLN AF  +  V L+FSV  + HFQG + M S IG   S  +W    G+A  G +F ++W++   L FQ
Subjt:  LRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFDSADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQ

Query:  KTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYLEPDGE
           HL N +N++  ++ISRD QELEP +GEQL  L    P GE
Subjt:  KTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYLEPDGE

Arabidopsis top hitse value%identityAlignment
AT1G30460.1 cleavage and polyadenylation specificity factor 307.3e-13560.95Show/hide
Query:  DSEGILSFDFEGGLDAVLTNPTVAPSSSLHLVPSDSSAPPALSNFLPSSLTPEPLGAPTANLGTRRSFRQTVCRHWLRSLCMKGDACGFIHQYDKSRMPI
        D++G LSFDFEGGLD    +  V  ++S+ + P ++S+  A+ N  P   T +   A  A  G  RSFRQTVCRHWLR LCMKGDACGF+HQ+DK+RMPI
Subjt:  DSEGILSFDFEGGLDAVLTNPTVAPSSSLHLVPSDSSAPPALSNFLPSSLTPEPLGAPTANLGTRRSFRQTVCRHWLRSLCMKGDACGFIHQYDKSRMPI

Query:  CRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFCLNGSDCRYRHAKLPGPPPSVEEIIQKIQHL-------------ENV-------------------
        CRFFRLYGECREQDCVYKHTNEDIKECNMYK GFC NG DCRYRHAKLPGPPP VEE++QKIQ L              NV                   
Subjt:  CRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFCLNGSDCRYRHAKLPGPPPSVEEIIQKIQHL-------------ENV-------------------

Query:  ----LSLRRNETIRKGMNGV----------------------ILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFDSADNVILIFSVNRTRHFQG
            L  ++ +  ++  + V                      + RYF+VKS N EN ELSVQQGV ATQRSNEAKLNEAFDS +NVILIFSVNRTRHFQG
Subjt:  ----LSLRRNETIRKGMNGV----------------------ILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFDSADNVILIFSVNRTRHFQG

Query:  CANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYLEPDGELMAVSIAAESKRKD
        CA M SRIGG + GGNWK+ HGTA YG+ F +KWLKL ELSF KTR+LRN YNE+LP+KISRDCQELEPS+GEQLASLLYLEPD ELMA+SIAAE+KR++
Subjt:  CANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLLYLEPDGELMAVSIAAESKRKD

Query:  EKAKGVNPDIGNENPDIVLF
        EKAKGVNP+   ENPDIV F
Subjt:  EKAKGVNPDIGNENPDIVLF

AT1G30460.2 cleavage and polyadenylation specificity factor 303.5e-6067.88Show/hide
Query:  DSEGILSFDFEGGLDAVLTNPTVAPSSSLHLVPSDSSAPPALSNFLPSSLTPEPLGAPTANLGTRRSFRQTVCRHWLRSLCMKGDACGFIHQYDKSRMPI
        D++G LSFDFEGGLD    +  V  ++S+ + P ++S+  A+ N  P   T +   A  A  G  RSFRQTVCRHWLR LCMKGDACGF+HQ+DK+RMPI
Subjt:  DSEGILSFDFEGGLDAVLTNPTVAPSSSLHLVPSDSSAPPALSNFLPSSLTPEPLGAPTANLGTRRSFRQTVCRHWLRSLCMKGDACGFIHQYDKSRMPI

Query:  CRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFCLNGSDCRYRHAKLPGPPPSVEEIIQKIQHL
        CRFFRLYGECREQDCVYKHTNEDIKECNMYK GFC NG DCRYRHAKLPGPPP VEE++QKIQ L
Subjt:  CRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFCLNGSDCRYRHAKLPGPPPSVEEIIQKIQHL

AT4G11970.1 YTH family protein3.9e-2749.29Show/hide
Query:  GVILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFDSADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAH-YGQIFLLKWLKLRE
        G   RYFI+KS N +N+++SV++G+ ATQ  NE  L  AF  +  VILIFSVN +  FQG A M+S +G       W    G  + +G+ F +KWL+L E
Subjt:  GVILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFDSADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAH-YGQIFLLKWLKLRE

Query:  LSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLL
        L FQKT HL+N  N+  P+KISRDCQEL   IGE L  LL
Subjt:  LSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLL

AT4G11970.2 YTH family protein3.9e-2749.29Show/hide
Query:  GVILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFDSADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAH-YGQIFLLKWLKLRE
        G   RYFI+KS N +N+++SV++G+ ATQ  NE  L  AF  +  VILIFSVN +  FQG A M+S +G       W    G  + +G+ F +KWL+L E
Subjt:  GVILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFDSADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAH-YGQIFLLKWLKLRE

Query:  LSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLL
        L FQKT HL+N  N+  P+KISRDCQEL   IGE L  LL
Subjt:  LSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLL

AT4G11970.3 YTH family protein3.9e-2749.29Show/hide
Query:  GVILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFDSADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAH-YGQIFLLKWLKLRE
        G   RYFI+KS N +N+++SV++G+ ATQ  NE  L  AF  +  VILIFSVN +  FQG A M+S +G       W    G  + +G+ F +KWL+L E
Subjt:  GVILRYFIVKSCN-ENLELSVQQGVCATQRSNEAKLNEAFDSADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAH-YGQIFLLKWLKLRE

Query:  LSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLL
        L FQKT HL+N  N+  P+KISRDCQEL   IGE L  LL
Subjt:  LSFQKTRHLRNTYNEDLPIKISRDCQELEPSIGEQLASLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCACTTTGATAAATCATGCAAAACCCATCATAGGTTCAATTGCAATTGCAACTGCAACGGCAACGCCACTCCGTTCAAGTTCACTACTTCTTTCCACTCTCTTCTT
CGTCGATTCTGAGGGTATTCTCAGCTTCGATTTCGAGGGTGGTCTCGATGCCGTCCTCACAAACCCCACCGTCGCCCCCTCCTCCTCCTTACACCTCGTCCCCTCCGACT
CCTCTGCCCCTCCCGCCCTCTCCAATTTCCTTCCCAGCTCCCTCACTCCTGAACCCCTTGGTGCCCCTACTGCTAACCTCGGCACCCGCAGGAGCTTCCGCCAAACCGTT
TGCCGCCACTGGCTTCGCAGTCTTTGTATGAAGGGCGATGCTTGTGGATTCATTCACCAGTATGATAAGTCTCGGATGCCCATATGCCGCTTCTTTCGTCTTTATGGGGA
GTGTCGGGAGCAGGATTGCGTGTATAAACATACCAATGAAGATATAAAGGAGTGTAATATGTACAAGTTTGGTTTCTGTCTAAATGGTTCTGATTGTCGATATAGGCATG
CAAAGCTACCTGGTCCTCCACCTTCTGTGGAAGAAATCATTCAGAAAATACAGCACTTAGAAAACGTGCTTTCATTGAGAAGAAATGAAACAATACGGAAGGGCATGAAT
GGTGTCATCTTAAGGTACTTTATTGTTAAAAGTTGCAATGAGAATTTGGAGTTATCTGTACAACAAGGGGTATGTGCAACTCAAAGAAGCAATGAAGCTAAACTTAATGA
AGCTTTTGATTCTGCTGATAATGTTATTTTGATTTTCTCAGTCAACCGGACTCGACATTTTCAGGGTTGTGCAAATATGATGTCCAGGATTGGTGGTTCTGTCAGTGGGG
GCAATTGGAAATATGCACATGGAACTGCACATTATGGTCAAATTTTTTTACTCAAATGGCTTAAGTTACGTGAACTATCCTTCCAGAAAACACGCCATTTGAGGAATACA
TATAATGAAGACTTACCCATAAAGATCAGTAGAGATTGCCAAGAGCTAGAGCCCTCTATTGGTGAGCAGCTGGCTTCTTTGCTTTATCTCGAGCCAGATGGTGAACTCAT
GGCTGTCTCAATAGCAGCAGAATCGAAACGGAAAGACGAGAAAGCAAAGGGAGTTAATCCTGATATTGGAAATGAGAACCCAGATATCGTCCTGTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGACCACTTTGATAAATCATGCAAAACCCATCATAGGTTCAATTGCAATTGCAACTGCAACGGCAACGCCACTCCGTTCAAGTTCACTACTTCTTTCCACTCTCTTCTT
CGTCGATTCTGAGGGTATTCTCAGCTTCGATTTCGAGGGTGGTCTCGATGCCGTCCTCACAAACCCCACCGTCGCCCCCTCCTCCTCCTTACACCTCGTCCCCTCCGACT
CCTCTGCCCCTCCCGCCCTCTCCAATTTCCTTCCCAGCTCCCTCACTCCTGAACCCCTTGGTGCCCCTACTGCTAACCTCGGCACCCGCAGGAGCTTCCGCCAAACCGTT
TGCCGCCACTGGCTTCGCAGTCTTTGTATGAAGGGCGATGCTTGTGGATTCATTCACCAGTATGATAAGTCTCGGATGCCCATATGCCGCTTCTTTCGTCTTTATGGGGA
GTGTCGGGAGCAGGATTGCGTGTATAAACATACCAATGAAGATATAAAGGAGTGTAATATGTACAAGTTTGGTTTCTGTCTAAATGGTTCTGATTGTCGATATAGGCATG
CAAAGCTACCTGGTCCTCCACCTTCTGTGGAAGAAATCATTCAGAAAATACAGCACTTAGAAAACGTGCTTTCATTGAGAAGAAATGAAACAATACGGAAGGGCATGAAT
GGTGTCATCTTAAGGTACTTTATTGTTAAAAGTTGCAATGAGAATTTGGAGTTATCTGTACAACAAGGGGTATGTGCAACTCAAAGAAGCAATGAAGCTAAACTTAATGA
AGCTTTTGATTCTGCTGATAATGTTATTTTGATTTTCTCAGTCAACCGGACTCGACATTTTCAGGGTTGTGCAAATATGATGTCCAGGATTGGTGGTTCTGTCAGTGGGG
GCAATTGGAAATATGCACATGGAACTGCACATTATGGTCAAATTTTTTTACTCAAATGGCTTAAGTTACGTGAACTATCCTTCCAGAAAACACGCCATTTGAGGAATACA
TATAATGAAGACTTACCCATAAAGATCAGTAGAGATTGCCAAGAGCTAGAGCCCTCTATTGGTGAGCAGCTGGCTTCTTTGCTTTATCTCGAGCCAGATGGTGAACTCAT
GGCTGTCTCAATAGCAGCAGAATCGAAACGGAAAGACGAGAAAGCAAAGGGAGTTAATCCTGATATTGGAAATGAGAACCCAGATATCGTCCTGTTTTAG
Protein sequenceShow/hide protein sequence
MTTLINHAKPIIGSIAIATATATPLRSSSLLLSTLFFVDSEGILSFDFEGGLDAVLTNPTVAPSSSLHLVPSDSSAPPALSNFLPSSLTPEPLGAPTANLGTRRSFRQTV
CRHWLRSLCMKGDACGFIHQYDKSRMPICRFFRLYGECREQDCVYKHTNEDIKECNMYKFGFCLNGSDCRYRHAKLPGPPPSVEEIIQKIQHLENVLSLRRNETIRKGMN
GVILRYFIVKSCNENLELSVQQGVCATQRSNEAKLNEAFDSADNVILIFSVNRTRHFQGCANMMSRIGGSVSGGNWKYAHGTAHYGQIFLLKWLKLRELSFQKTRHLRNT
YNEDLPIKISRDCQELEPSIGEQLASLLYLEPDGELMAVSIAAESKRKDEKAKGVNPDIGNENPDIVLF