CuGenDBv2

CsGy1G033070 (gene) of Cucumber (Gy14) v2.1 genome

Gene ID: CsGy1G033070
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: snRNA-activating protein complex subunit-like
Genome location: Gy14Chr1:32815939..32825915
RNA-Seq Expression: CsGy1G033070
Synteny: CsGy1G033070
Gene Ontology terms:
GO:0042795 - snRNA transcription by RNA polymerase II (biological process)
GO:0042796 - snRNA transcription by RNA polymerase III (biological process)
GO:0005634 - nucleus (cellular component)
GO:0019185 - snRNA-activating protein complex (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0001006 - RNA polymerase III type 3 promoter sequence-specific DNA binding (molecular function)
GO:0001046 - core promoter sequence-specific DNA binding (molecular function)
GO:0003681 - bent DNA binding (molecular function)
InterPro domains: IPR022042 - snRNA-activating protein complex, subunit 3
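The genome location field above packs the chromosome and coordinates into a single string. A minimal sketch of splitting it apart is shown below; the function name `parse_location` is hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re

# Assumption: locations follow "<chromosome>:<start>..<end>" and the
# coordinates are 1-based and inclusive (not confirmed by the record).
LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a location string into chromosome, start, end and span length."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        # Tolerate reversed coordinates by normalising to start <= end.
        start, end = end, start
    return {
        "chrom": match["chrom"],
        "start": start,
        "end": end,
        "length": end - start + 1,  # inclusive span
    }


location = parse_location("Gy14Chr1:32815939..32825915")
print(location)
```

Under the inclusive-coordinate assumption, the `length` entry is the genomic span of the gene model, introns included, so it is expected to be longer than the mRNA sequence listed further down.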


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAA0043314.1 snRNA-activating protein complex subunit-like isoform X2 [Cucumis melo var. makuwa] | e value: 2.50e-271 | % identity: 85.53
Query:  MEASDRAGTSEEIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQELQ-SLEAELQLDSCQQCDEDISIDELKIFTEEQLLNMALEGSLQSHGNAN
        MEA DRAG S+EIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQE Q SLEAEL LDS Q CDEDISIDELK+FTEEQLLNMALE SLQS  NAN
Subjt:  MEASDRAGTSEEIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQELQ-SLEAELQLDSCQQCDEDISIDELKIFTEEQLLNMALEGSLQSHGNAN

Query:  NQSELQEENMNAGLLRECEEEVNGHNLEADSVSNANRSTNKIT-RKRKKEELSNIEEKSIAKVAEIVKIKQKQETDRAVVQLHAFKWKKDIASSSSESKE
        NQSEL EENMNAGLLRECEEEVNGHNLEAD  SNANRSTNK+T RKRKKEELS IEEKSIAKVAEI K+KQKQE DRAVVQLHAFKWKKDIASSSSESKE
Subjt:  NQSELQEENMNAGLLRECEEEVNGHNLEADSVSNANRSTNKIT-RKRKKEELSNIEEKSIAKVAEIVKIKQKQETDRAVVQLHAFKWKKDIASSSSESKE

Query:  RLKSLRSTNFSAKVPHVKSLSGGKHESLHHPTTVLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQDSSGYFLVEDVFCNDLRNP
        RLKSLRSTN SAKVPHVKSL+ GKHESLHHP TVLFVEVYHKSRKMVKSQE LALGRQTLAELKDKIYCSTDTLMQKA QQDSSGYFLVEDVFCNDLRNP
Subjt:  RLKSLRSTNFSAKVPHVKSLSGGKHESLHHPTTVLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQDSSGYFLVEDVFCNDLRNP

Query:  SATDYSKPILDWLRNSEDEARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ------------------------
        SA DYSKPILDWLRNSEDEARKKW CIITGE QQK+SVVGEVSDLHVPHFRSVSM+K RFCDLKFRLGAGYLYCHQ                        
Subjt:  SATDYSKPILDWLRNSEDEARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ------------------------

Query:  --------LRTRAQKCDVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYLKD
                LRTRAQKCDVCNIYRAKK T+DDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYL+D
Subjt:  --------LRTRAQKCDVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYLKD

XP_008459093.1 PREDICTED: snRNA-activating protein complex subunit-like isoform X1 [Cucumis melo] | e value: 1.00e-272 | % identity: 83.44
Query:  ESRVVTRAMEASDRAGTSEEIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQELQSLEAELQLDSCQQCDEDISIDELKIFTEEQLLNMALEGSL
        +S VVTRAMEA DRAG S+EIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQE QSLEAEL LDS Q CDEDISIDELK+FTEEQLLNMALE SL
Subjt:  ESRVVTRAMEASDRAGTSEEIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQELQSLEAELQLDSCQQCDEDISIDELKIFTEEQLLNMALEGSL

Query:  QSHGNANNQSELQEENMNAGLLR------------ECEEEVNGHNLEADSVSNANRSTNKIT-RKRKKEELSNIEEKSIAKVAEIVKIKQKQETDRAVVQ
        QS  NANNQSEL EENMNAGLLR            ECEEEVNGHNLEAD  SNANRSTNK+T RKRKKEELS IEEKSIAKVAEI K+KQKQE DRAVVQ
Subjt:  QSHGNANNQSELQEENMNAGLLR------------ECEEEVNGHNLEADSVSNANRSTNKIT-RKRKKEELSNIEEKSIAKVAEIVKIKQKQETDRAVVQ

Query:  LHAFKWKKDIASSSSESKERLKSLRSTNFSAKVPHVKSLSGGKHESLHHPTTVLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQ
        LHAFKWKKDIASSSSESKERLKSLRSTN SAKVPHVKSL+ GKHESLHHP TVLFVEVYHKSRKMVKSQE LALGRQTLAELKDKIYCSTDTLMQKA QQ
Subjt:  LHAFKWKKDIASSSSESKERLKSLRSTNFSAKVPHVKSLSGGKHESLHHPTTVLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQ

Query:  DSSGYFLVEDVFCNDLRNPSATDYSKPILDWLRNSEDEARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ-----
        DSSGYFLVEDVFCNDLRNPSA DYSKPILDWLRNSEDEARKKW CIITGE QQK+SVVGEVSDLHVPHFRSVSM+K RFCDLKFRLGAGYLYCHQ     
Subjt:  DSSGYFLVEDVFCNDLRNPSATDYSKPILDWLRNSEDEARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ-----

Query:  ---------------------------LRTRAQKCDVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYLKD
                                   LRTRAQKCDVCNIYRAKK T+DDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYL+D
Subjt:  ---------------------------LRTRAQKCDVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYLKD

XP_008459094.1 PREDICTED: snRNA-activating protein complex subunit-like isoform X2 [Cucumis melo] | e value: 2.04e-276 | % identity: 85.53
Query:  ESRVVTRAMEASDRAGTSEEIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQELQSLEAELQLDSCQQCDEDISIDELKIFTEEQLLNMALEGSL
        +S VVTRAMEA DRAG S+EIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQE QSLEAEL LDS Q CDEDISIDELK+FTEEQLLNMALE SL
Subjt:  ESRVVTRAMEASDRAGTSEEIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQELQSLEAELQLDSCQQCDEDISIDELKIFTEEQLLNMALEGSL

Query:  QSHGNANNQSELQEENMNAGLLRECEEEVNGHNLEADSVSNANRSTNKIT-RKRKKEELSNIEEKSIAKVAEIVKIKQKQETDRAVVQLHAFKWKKDIAS
        QS  NANNQSEL EENMNAGLLRECEEEVNGHNLEAD  SNANRSTNK+T RKRKKEELS IEEKSIAKVAEI K+KQKQE DRAVVQLHAFKWKKDIAS
Subjt:  QSHGNANNQSELQEENMNAGLLRECEEEVNGHNLEADSVSNANRSTNKIT-RKRKKEELSNIEEKSIAKVAEIVKIKQKQETDRAVVQLHAFKWKKDIAS

Query:  SSSESKERLKSLRSTNFSAKVPHVKSLSGGKHESLHHPTTVLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQDSSGYFLVEDVF
        SSSESKERLKSLRSTN SAKVPHVKSL+ GKHESLHHP TVLFVEVYHKSRKMVKSQE LALGRQTLAELKDKIYCSTDTLMQKA QQDSSGYFLVEDVF
Subjt:  SSSESKERLKSLRSTNFSAKVPHVKSLSGGKHESLHHPTTVLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQDSSGYFLVEDVF

Query:  CNDLRNPSATDYSKPILDWLRNSEDEARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ-----------------
        CNDLRNPSA DYSKPILDWLRNSEDEARKKW CIITGE QQK+SVVGEVSDLHVPHFRSVSM+K RFCDLKFRLGAGYLYCHQ                 
Subjt:  CNDLRNPSATDYSKPILDWLRNSEDEARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ-----------------

Query:  ---------------LRTRAQKCDVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYLKD
                       LRTRAQKCDVCNIYRAKK T+DDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYL+D
Subjt:  ---------------LRTRAQKCDVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYLKD

XP_011660317.1 snRNA-activating protein complex subunit [Cucumis sativus] | e value: 7.41e-303 | % identity: 93.16
Query:  MEASDRAGTSEEIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQELQSLEAELQLDSCQQCDEDISIDELKIFTEEQLLNMALEGSLQSHGNANN
        MEASDRAGTSEEIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQELQSLEAELQLDSCQQCDEDISIDELKIFTEEQLLNMALEGSLQSHGNANN
Subjt:  MEASDRAGTSEEIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQELQSLEAELQLDSCQQCDEDISIDELKIFTEEQLLNMALEGSLQSHGNANN

Query:  QSELQEENMNAGLLRECEEEVNGHNLEADSVSNANRSTNKITRKRKKEELSNIEEKSIAKVAEIVKIKQKQETDRAVVQLHAFKWKKDIASSSSESKERL
        QSELQEENMNAGLLRECEEEVNGHNLEADSVSNANRSTNKITRKRKKEELSNIEEKSIAKVAEIVKIKQKQETDRAVVQLHAFKWKKDIASSSSESKERL
Subjt:  QSELQEENMNAGLLRECEEEVNGHNLEADSVSNANRSTNKITRKRKKEELSNIEEKSIAKVAEIVKIKQKQETDRAVVQLHAFKWKKDIASSSSESKERL

Query:  KSLRSTNFSAKVPHVKSLSGGKHESLHHPTTVLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQDSSGYFLVEDVFCNDLRNPSA
        KSLRSTNFSAKVPHVKSLSGGKHESLHHPTTVLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQDSSGYFLVEDVFCNDLRNPSA
Subjt:  KSLRSTNFSAKVPHVKSLSGGKHESLHHPTTVLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQDSSGYFLVEDVFCNDLRNPSA

Query:  TDYSKPILDWLRNSEDEARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ--------------------------
        TDYSKPILDWLRNSEDEARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ                          
Subjt:  TDYSKPILDWLRNSEDEARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ--------------------------

Query:  ------LRTRAQKCDVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYLKD
              LRTRAQKCDVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYLKD
Subjt:  ------LRTRAQKCDVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYLKD

XP_016901523.1 PREDICTED: snRNA-activating protein complex subunit-like [Cucumis melo] | e value: 9.21e-280 | % identity: 86.58
Query:  ESRVVTRAMEASDRAGTSEEIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQELQSLEAELQLDSCQQCDEDISIDELKIFTEEQLLNMALEGSL
        +S VVTRAMEA DRAG SEEIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQE QSLEAEL LDS Q CDEDISIDELK+FTEEQLLNMALE SL
Subjt:  ESRVVTRAMEASDRAGTSEEIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQELQSLEAELQLDSCQQCDEDISIDELKIFTEEQLLNMALEGSL

Query:  QSHGNANNQSELQEENMNAGLLRECEEEVNGHNLEADSVSNANRSTNKIT-RKRKKEELSNIEEKSIAKVAEIVKIKQKQETDRAVVQLHAFKWKKDIAS
        QS GNANNQSEL EENMNAGLLRECEEEVNGHNLEAD  SNANRSTNK+T RKRKKEELS IEEKSIAKVAEIVK+KQKQE DRAVVQLHAFKWKKDIAS
Subjt:  QSHGNANNQSELQEENMNAGLLRECEEEVNGHNLEADSVSNANRSTNKIT-RKRKKEELSNIEEKSIAKVAEIVKIKQKQETDRAVVQLHAFKWKKDIAS

Query:  SSSESKERLKSLRSTNFSAKVPHVKSLSGGKHESLHHPTTVLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQDSSGYFLVEDVF
        SSSESKERLKSLRSTN SAKVPHVKSLSG KHESLHHPTTVLFVEVYHKSRKMVKSQE LALGRQTLAELKDKIYCSTD LMQKA  QDSSGYFLVEDVF
Subjt:  SSSESKERLKSLRSTNFSAKVPHVKSLSGGKHESLHHPTTVLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQDSSGYFLVEDVF

Query:  CNDLRNPSATDYSKPILDWLRNSEDEARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ-----------------
        CNDLRNPSA DYSKPILDWLRNSEDEARKKW CIITGESQQK+SVVGEVSDLHVPHFRSVSM+K RFCDLKFRLGAGYLYCHQ                 
Subjt:  CNDLRNPSATDYSKPILDWLRNSEDEARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ-----------------

Query:  ---------------LRTRAQKCDVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYLKD
                       LRTRAQKCDVCNIYRAKKVT+DDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYL+D
Subjt:  ---------------LRTRAQKCDVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYLKD

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0M3U8 Uncharacterized protein | e value: 3.59e-303 | % identity: 93.16
Query:  MEASDRAGTSEEIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQELQSLEAELQLDSCQQCDEDISIDELKIFTEEQLLNMALEGSLQSHGNANN
        MEASDRAGTSEEIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQELQSLEAELQLDSCQQCDEDISIDELKIFTEEQLLNMALEGSLQSHGNANN
Subjt:  MEASDRAGTSEEIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQELQSLEAELQLDSCQQCDEDISIDELKIFTEEQLLNMALEGSLQSHGNANN

Query:  QSELQEENMNAGLLRECEEEVNGHNLEADSVSNANRSTNKITRKRKKEELSNIEEKSIAKVAEIVKIKQKQETDRAVVQLHAFKWKKDIASSSSESKERL
        QSELQEENMNAGLLRECEEEVNGHNLEADSVSNANRSTNKITRKRKKEELSNIEEKSIAKVAEIVKIKQKQETDRAVVQLHAFKWKKDIASSSSESKERL
Subjt:  QSELQEENMNAGLLRECEEEVNGHNLEADSVSNANRSTNKITRKRKKEELSNIEEKSIAKVAEIVKIKQKQETDRAVVQLHAFKWKKDIASSSSESKERL

Query:  KSLRSTNFSAKVPHVKSLSGGKHESLHHPTTVLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQDSSGYFLVEDVFCNDLRNPSA
        KSLRSTNFSAKVPHVKSLSGGKHESLHHPTTVLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQDSSGYFLVEDVFCNDLRNPSA
Subjt:  KSLRSTNFSAKVPHVKSLSGGKHESLHHPTTVLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQDSSGYFLVEDVFCNDLRNPSA

Query:  TDYSKPILDWLRNSEDEARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ--------------------------
        TDYSKPILDWLRNSEDEARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ                          
Subjt:  TDYSKPILDWLRNSEDEARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ--------------------------

Query:  ------LRTRAQKCDVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYLKD
              LRTRAQKCDVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYLKD
Subjt:  ------LRTRAQKCDVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYLKD

A0A1S3C9D9 snRNA-activating protein complex subunit-like isoform X2 | e value: 9.89e-277 | % identity: 85.53
Query:  ESRVVTRAMEASDRAGTSEEIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQELQSLEAELQLDSCQQCDEDISIDELKIFTEEQLLNMALEGSL
        +S VVTRAMEA DRAG S+EIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQE QSLEAEL LDS Q CDEDISIDELK+FTEEQLLNMALE SL
Subjt:  ESRVVTRAMEASDRAGTSEEIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQELQSLEAELQLDSCQQCDEDISIDELKIFTEEQLLNMALEGSL

Query:  QSHGNANNQSELQEENMNAGLLRECEEEVNGHNLEADSVSNANRSTNKIT-RKRKKEELSNIEEKSIAKVAEIVKIKQKQETDRAVVQLHAFKWKKDIAS
        QS  NANNQSEL EENMNAGLLRECEEEVNGHNLEAD  SNANRSTNK+T RKRKKEELS IEEKSIAKVAEI K+KQKQE DRAVVQLHAFKWKKDIAS
Subjt:  QSHGNANNQSELQEENMNAGLLRECEEEVNGHNLEADSVSNANRSTNKIT-RKRKKEELSNIEEKSIAKVAEIVKIKQKQETDRAVVQLHAFKWKKDIAS

Query:  SSSESKERLKSLRSTNFSAKVPHVKSLSGGKHESLHHPTTVLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQDSSGYFLVEDVF
        SSSESKERLKSLRSTN SAKVPHVKSL+ GKHESLHHP TVLFVEVYHKSRKMVKSQE LALGRQTLAELKDKIYCSTDTLMQKA QQDSSGYFLVEDVF
Subjt:  SSSESKERLKSLRSTNFSAKVPHVKSLSGGKHESLHHPTTVLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQDSSGYFLVEDVF

Query:  CNDLRNPSATDYSKPILDWLRNSEDEARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ-----------------
        CNDLRNPSA DYSKPILDWLRNSEDEARKKW CIITGE QQK+SVVGEVSDLHVPHFRSVSM+K RFCDLKFRLGAGYLYCHQ                 
Subjt:  CNDLRNPSATDYSKPILDWLRNSEDEARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ-----------------

Query:  ---------------LRTRAQKCDVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYLKD
                       LRTRAQKCDVCNIYRAKK T+DDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYL+D
Subjt:  ---------------LRTRAQKCDVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYLKD

A0A1S3C9H5 snRNA-activating protein complex subunit-like isoform X1 | e value: 4.85e-273 | % identity: 83.44
Query:  ESRVVTRAMEASDRAGTSEEIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQELQSLEAELQLDSCQQCDEDISIDELKIFTEEQLLNMALEGSL
        +S VVTRAMEA DRAG S+EIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQE QSLEAEL LDS Q CDEDISIDELK+FTEEQLLNMALE SL
Subjt:  ESRVVTRAMEASDRAGTSEEIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQELQSLEAELQLDSCQQCDEDISIDELKIFTEEQLLNMALEGSL

Query:  QSHGNANNQSELQEENMNAGLLR------------ECEEEVNGHNLEADSVSNANRSTNKIT-RKRKKEELSNIEEKSIAKVAEIVKIKQKQETDRAVVQ
        QS  NANNQSEL EENMNAGLLR            ECEEEVNGHNLEAD  SNANRSTNK+T RKRKKEELS IEEKSIAKVAEI K+KQKQE DRAVVQ
Subjt:  QSHGNANNQSELQEENMNAGLLR------------ECEEEVNGHNLEADSVSNANRSTNKIT-RKRKKEELSNIEEKSIAKVAEIVKIKQKQETDRAVVQ

Query:  LHAFKWKKDIASSSSESKERLKSLRSTNFSAKVPHVKSLSGGKHESLHHPTTVLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQ
        LHAFKWKKDIASSSSESKERLKSLRSTN SAKVPHVKSL+ GKHESLHHP TVLFVEVYHKSRKMVKSQE LALGRQTLAELKDKIYCSTDTLMQKA QQ
Subjt:  LHAFKWKKDIASSSSESKERLKSLRSTNFSAKVPHVKSLSGGKHESLHHPTTVLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQ

Query:  DSSGYFLVEDVFCNDLRNPSATDYSKPILDWLRNSEDEARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ-----
        DSSGYFLVEDVFCNDLRNPSA DYSKPILDWLRNSEDEARKKW CIITGE QQK+SVVGEVSDLHVPHFRSVSM+K RFCDLKFRLGAGYLYCHQ     
Subjt:  DSSGYFLVEDVFCNDLRNPSATDYSKPILDWLRNSEDEARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ-----

Query:  ---------------------------LRTRAQKCDVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYLKD
                                   LRTRAQKCDVCNIYRAKK T+DDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYL+D
Subjt:  ---------------------------LRTRAQKCDVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYLKD

A0A1S4DZX0 snRNA-activating protein complex subunit-like | e value: 4.46e-280 | % identity: 86.58
Query:  ESRVVTRAMEASDRAGTSEEIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQELQSLEAELQLDSCQQCDEDISIDELKIFTEEQLLNMALEGSL
        +S VVTRAMEA DRAG SEEIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQE QSLEAEL LDS Q CDEDISIDELK+FTEEQLLNMALE SL
Subjt:  ESRVVTRAMEASDRAGTSEEIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQELQSLEAELQLDSCQQCDEDISIDELKIFTEEQLLNMALEGSL

Query:  QSHGNANNQSELQEENMNAGLLRECEEEVNGHNLEADSVSNANRSTNKIT-RKRKKEELSNIEEKSIAKVAEIVKIKQKQETDRAVVQLHAFKWKKDIAS
        QS GNANNQSEL EENMNAGLLRECEEEVNGHNLEAD  SNANRSTNK+T RKRKKEELS IEEKSIAKVAEIVK+KQKQE DRAVVQLHAFKWKKDIAS
Subjt:  QSHGNANNQSELQEENMNAGLLRECEEEVNGHNLEADSVSNANRSTNKIT-RKRKKEELSNIEEKSIAKVAEIVKIKQKQETDRAVVQLHAFKWKKDIAS

Query:  SSSESKERLKSLRSTNFSAKVPHVKSLSGGKHESLHHPTTVLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQDSSGYFLVEDVF
        SSSESKERLKSLRSTN SAKVPHVKSLSG KHESLHHPTTVLFVEVYHKSRKMVKSQE LALGRQTLAELKDKIYCSTD LMQKA  QDSSGYFLVEDVF
Subjt:  SSSESKERLKSLRSTNFSAKVPHVKSLSGGKHESLHHPTTVLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQDSSGYFLVEDVF

Query:  CNDLRNPSATDYSKPILDWLRNSEDEARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ-----------------
        CNDLRNPSA DYSKPILDWLRNSEDEARKKW CIITGESQQK+SVVGEVSDLHVPHFRSVSM+K RFCDLKFRLGAGYLYCHQ                 
Subjt:  CNDLRNPSATDYSKPILDWLRNSEDEARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ-----------------

Query:  ---------------LRTRAQKCDVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYLKD
                       LRTRAQKCDVCNIYRAKKVT+DDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYL+D
Subjt:  ---------------LRTRAQKCDVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYLKD

A0A5A7TJ23 snRNA-activating protein complex subunit-like isoform X2 | e value: 1.21e-271 | % identity: 85.53
Query:  MEASDRAGTSEEIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQELQ-SLEAELQLDSCQQCDEDISIDELKIFTEEQLLNMALEGSLQSHGNAN
        MEA DRAG S+EIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQE Q SLEAEL LDS Q CDEDISIDELK+FTEEQLLNMALE SLQS  NAN
Subjt:  MEASDRAGTSEEIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQELQ-SLEAELQLDSCQQCDEDISIDELKIFTEEQLLNMALEGSLQSHGNAN

Query:  NQSELQEENMNAGLLRECEEEVNGHNLEADSVSNANRSTNKIT-RKRKKEELSNIEEKSIAKVAEIVKIKQKQETDRAVVQLHAFKWKKDIASSSSESKE
        NQSEL EENMNAGLLRECEEEVNGHNLEAD  SNANRSTNK+T RKRKKEELS IEEKSIAKVAEI K+KQKQE DRAVVQLHAFKWKKDIASSSSESKE
Subjt:  NQSELQEENMNAGLLRECEEEVNGHNLEADSVSNANRSTNKIT-RKRKKEELSNIEEKSIAKVAEIVKIKQKQETDRAVVQLHAFKWKKDIASSSSESKE

Query:  RLKSLRSTNFSAKVPHVKSLSGGKHESLHHPTTVLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQDSSGYFLVEDVFCNDLRNP
        RLKSLRSTN SAKVPHVKSL+ GKHESLHHP TVLFVEVYHKSRKMVKSQE LALGRQTLAELKDKIYCSTDTLMQKA QQDSSGYFLVEDVFCNDLRNP
Subjt:  RLKSLRSTNFSAKVPHVKSLSGGKHESLHHPTTVLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQDSSGYFLVEDVFCNDLRNP

Query:  SATDYSKPILDWLRNSEDEARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ------------------------
        SA DYSKPILDWLRNSEDEARKKW CIITGE QQK+SVVGEVSDLHVPHFRSVSM+K RFCDLKFRLGAGYLYCHQ                        
Subjt:  SATDYSKPILDWLRNSEDEARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ------------------------

Query:  --------LRTRAQKCDVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYLKD
                LRTRAQKCDVCNIYRAKK T+DDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYL+D
Subjt:  --------LRTRAQKCDVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYLKD

SwissProt top hits (columns: hit, e value, % identity, alignment)
Q4R6Y6 snRNA-activating protein complex subunit 3 | e value: 5.1e-20 | % identity: 28.57
Query:  VLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQDS---------------SGYFLVEDVFCNDLRNPSATDYSKPILDWLRNSED
        +L+  ++HK ++    Q +L LG Q L EL+D I C +D  +Q  G+  +               S +F  E  F ND R P   D S+ I++W      
Subjt:  VLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQDS---------------SGYFLVEDVFCNDLRNPSATDYSKPILDWLRNSED

Query:  EARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ----------------------------------LRTRAQKC
                              E  D     F++  M    F DL  +LG  YLYCHQ                                  L TR  KC
Subjt:  EARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ----------------------------------LRTRAQKC

Query:  DVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYL
         VC +Y A+ VT +D +A E+PC+FC+ C+ +LHY  EGN L  +F+ + Y+
Subjt:  DVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYL

Q5BK68 snRNA-activating protein complex subunit 3 | e value: 5.1e-20 | % identity: 28.4
Query:  VLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQDS---------------SGYFLVEDVFCNDLRNPSATDYSKPILDWLRNSED
        +L+  +++K R+    Q +L LG Q L EL+D I C +D  +Q  G+  +               S +F  E  F ND R P   D S+ I++W      
Subjt:  VLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQDS---------------SGYFLVEDVFCNDLRNPSATDYSKPILDWLRNSED

Query:  EARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ---------------------------LRTR-----AQKCDV
                              E  D     F++  M    F DL  +LG  YLYCHQ                           L T+      +KC V
Subjt:  EARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ---------------------------LRTR-----AQKCDV

Query:  CNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYL
        C +Y A+ VT +D +A E+PC+FC+ C+ +LHY  EGN L  +F+ + Y+
Subjt:  CNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYL

Q5E9M5 snRNA-activating protein complex subunit 3 | e value: 3.9e-20 | % identity: 28.57
Query:  VLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQDS---------------SGYFLVEDVFCNDLRNPSATDYSKPILDWLRNSED
        +L+  ++HK ++    Q +L LG Q L EL+D I C +D  +Q  G+  +               S +F  E  F ND R P   D S+ I++W      
Subjt:  VLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQDS---------------SGYFLVEDVFCNDLRNPSATDYSKPILDWLRNSED

Query:  EARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ----------------------------------LRTRAQKC
                              E  D     F++  M    F DL  +LG  YLYCHQ                                  L TR  KC
Subjt:  EARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ----------------------------------LRTRAQKC

Query:  DVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYL
         VC +Y A+ VT +D +A E+PC+FC+ C+ +LHY  EGN L  +F+ + Y+
Subjt:  DVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYL

Q8L627 snRNA-activating protein complex subunit | e value: 4.0e-65 | % identity: 42.96
Query:  FTEEQLLNMALEGSLQ-SHGNANNQSELQEENMNAGLLRECEEEVNGHNLEADSVSNANRSTNKITRKRKKEELSNIEEKSIAKVAEIVKIKQKQETDRA
        F E+ L    LE SL  SH           EN  AG     + +    N E    +  N    K+T K+        EE  + KV ++ K+KQKQE D+A
Subjt:  FTEEQLLNMALEGSLQ-SHGNANNQSELQEENMNAGLLRECEEEVNGHNLEADSVSNANRSTNKITRKRKKEELSNIEEKSIAKVAEIVKIKQKQETDRA

Query:  VVQLHAF----KWKKDIASSSSESKERLKSLR--STNFSAKVPHVKSLSGGKHESLHHPTTVLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTD
         V LH F    +  KD+  +  E  E+++SLR    N++   P   S   G+ + L  P  +L VE+Y+ SRK VK+QE L LGRQ L ELKD I+C+TD
Subjt:  VVQLHAF----KWKKDIASSSSESKERLKSLR--STNFSAKVPHVKSLSGGKHESLHHPTTVLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTD

Query:  TLMQKAGQQDSSGYFLVEDVFCNDLRNPSATDYSKPILDWLRNSEDEARKKWGCIITGESQQKSS-VVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGY
         +MQKAG+ D SGYFL+EDVF NDLRNPSA DYS PILDWL NS+DEA KKW C++TGE Q+K   V+GE   + +P +R+  M    FCD++FR+GA Y
Subjt:  TLMQKAGQQDSSGYFLVEDVFCNDLRNPSATDYSKPILDWLRNSEDEARKKWGCIITGESQQKSS-VVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGY

Query:  LYCHQ-------------------------------LRTRAQKCDVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYLKD
        +YCHQ                                + R QKC VC I RA KV +DDKWA EN  YFC+ C+ LLH S+EG  L  DF V DY+ +
Subjt:  LYCHQ-------------------------------LRTRAQKCDVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYLKD

Q92966 snRNA-activating protein complex subunit 3 | e value: 1.1e-19 | % identity: 28.17
Query:  VLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQDS---------------SGYFLVEDVFCNDLRNPSATDYSKPILDWLRNSED
        +L+  ++HK ++    Q +L LG Q L +L+D I C +D  +Q  G+  +               S +F  E  F ND R P   D S+ I++W      
Subjt:  VLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTDTLMQKAGQQDS---------------SGYFLVEDVFCNDLRNPSATDYSKPILDWLRNSED

Query:  EARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ----------------------------------LRTRAQKC
                              E  D     F++  M    F DL  +LG  YLYCHQ                                  L TR  KC
Subjt:  EARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGYLYCHQ----------------------------------LRTRAQKC

Query:  DVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYL
         VC +Y A+ VT +D +A E+PC+FC+ C+ +LHY  EGN L  +F+ + Y+
Subjt:  DVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYL

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT1G28560.1 snRNA activating complex family protein | e value: 2.8e-66 | % identity: 42.96
Query:  FTEEQLLNMALEGSLQ-SHGNANNQSELQEENMNAGLLRECEEEVNGHNLEADSVSNANRSTNKITRKRKKEELSNIEEKSIAKVAEIVKIKQKQETDRA
        F E+ L    LE SL  SH           EN  AG     + +    N E    +  N    K+T K+        EE  + KV ++ K+KQKQE D+A
Subjt:  FTEEQLLNMALEGSLQ-SHGNANNQSELQEENMNAGLLRECEEEVNGHNLEADSVSNANRSTNKITRKRKKEELSNIEEKSIAKVAEIVKIKQKQETDRA

Query:  VVQLHAF----KWKKDIASSSSESKERLKSLR--STNFSAKVPHVKSLSGGKHESLHHPTTVLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTD
         V LH F    +  KD+  +  E  E+++SLR    N++   P   S   G+ + L  P  +L VE+Y+ SRK VK+QE L LGRQ L ELKD I+C+TD
Subjt:  VVQLHAF----KWKKDIASSSSESKERLKSLR--STNFSAKVPHVKSLSGGKHESLHHPTTVLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCSTD

Query:  TLMQKAGQQDSSGYFLVEDVFCNDLRNPSATDYSKPILDWLRNSEDEARKKWGCIITGESQQKSS-VVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGY
         +MQKAG+ D SGYFL+EDVF NDLRNPSA DYS PILDWL NS+DEA KKW C++TGE Q+K   V+GE   + +P +R+  M    FCD++FR+GA Y
Subjt:  TLMQKAGQQDSSGYFLVEDVFCNDLRNPSATDYSKPILDWLRNSEDEARKKWGCIITGESQQKSS-VVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAGY

Query:  LYCHQ-------------------------------LRTRAQKCDVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYLKD
        +YCHQ                                + R QKC VC I RA KV +DDKWA EN  YFC+ C+ LLH S+EG  L  DF V DY+ +
Subjt:  LYCHQ-------------------------------LRTRAQKCDVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYLKD

AT1G28560.2 snRNA activating complex family protein | e value: 5.7e-67 | % identity: 43.36
Query:  FTEEQLLNMALEGSLQ-SHGNANNQSELQE-ENMNAGLLRECEEEVNGHNLEADSVSNANRSTNKITRKRKKEELSNIEEKSIAKVAEIVKIKQKQETDR
        F E+ L    LE SL  SH  A++Q   +  EN  AG     + +    N E    +  N    K+T K+        EE  + KV ++ K+KQKQE D+
Subjt:  FTEEQLLNMALEGSLQ-SHGNANNQSELQE-ENMNAGLLRECEEEVNGHNLEADSVSNANRSTNKITRKRKKEELSNIEEKSIAKVAEIVKIKQKQETDR

Query:  AVVQLHAF----KWKKDIASSSSESKERLKSLR--STNFSAKVPHVKSLSGGKHESLHHPTTVLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCST
        A V LH F    +  KD+  +  E  E+++SLR    N++   P   S   G+ + L  P  +L VE+Y+ SRK VK+QE L LGRQ L ELKD I+C+T
Subjt:  AVVQLHAF----KWKKDIASSSSESKERLKSLR--STNFSAKVPHVKSLSGGKHESLHHPTTVLFVEVYHKSRKMVKSQELLALGRQTLAELKDKIYCST

Query:  DTLMQKAGQQDSSGYFLVEDVFCNDLRNPSATDYSKPILDWLRNSEDEARKKWGCIITGESQQKSS-VVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAG
        D +MQKAG+ D SGYFL+EDVF NDLRNPSA DYS PILDWL NS+DEA KKW C++TGE Q+K   V+GE   + +P +R+  M    FCD++FR+GA 
Subjt:  DTLMQKAGQQDSSGYFLVEDVFCNDLRNPSATDYSKPILDWLRNSEDEARKKWGCIITGESQQKSS-VVGEVSDLHVPHFRSVSMNKARFCDLKFRLGAG

Query:  YLYCHQ-------------------------------LRTRAQKCDVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYLKD
        Y+YCHQ                                + R QKC VC I RA KV +DDKWA EN  YFC+ C+ LLH S+EG  L  DF V DY+ +
Subjt:  YLYCHQ-------------------------------LRTRAQKCDVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYLKD


Sequences
CDS sequence
ATGAAATTGTGTTTCCCTCTTCCTTGCAAAGACGATGAAACCCACGTCATTGCTGACTTAAAACACCACCGCCACCTATTGCTCACTGCCGCCCACCATCCCTCC
CAGGCAGTCGCCGTTGAGACCCCTATCGAATTCTTTTCTCTTTCTCTCATTCTCCTCTCCTCTCAATCGCTCTTCCTCGCCACACTTCACCTTTCCGAGCATTGT
CTGCTGTTCGCGCTGCTAATCAACGCCGCCCCTTCTCGTGACCGACCCAGGATTCGGTGGATTACGGTGGCGAGGTTGGAGATGTGGTTTTTGCAAGCTAAACAT
AGTTTTACCTTGGTAGTTTTTGGGCTTAATTTGAGGACTACCGATGCATGGAATACTGCTGCTTGTCGACTTAGCGTTGATATGGTAAAGGCAAGAGGCCTGGCA
GGCCATTGCAATGCGGGCGTAGGGAACAGATTAATCGAAAGTCGGGTCGTTACAAGAGCCATGGAGGCGAGCGACAGGGCTGGAACTTCAGAGGAGATTGGTCTT
AATGGAGACGGCCTCTCAATTCCTTTGGGTGGGCCTATATATGCACCCAACTTGGTTGGTCCGCTCACCAGGGTCCCTCACTTTGAGTCTTCTGTTGTTCAAGAA
CTTCAGAGTCTGGAGGCAGAGTTACAGTTGGATTCATGTCAACAATGTGATGAAGATATTTCGATTGACGAACTTAAGATATTTACTGAAGAACAGTTATTGAAT
ATGGCCTTGGAGGGATCATTGCAGAGTCATGGGAATGCTAATAACCAGTCGGAGCTTCAAGAAGAAAATATGAATGCTGGGTTGCTCAGAGAATGTGAAGAGGAA
GTTAACGGGCACAATTTGGAGGCTGATTCGGTATCAAATGCAAATAGAAGCACAAATAAAATAACAAGGAAGCGAAAAAAGGAGGAACTTTCTAATATTGAGGAA
AAATCTATCGCAAAGGTGGCTGAAATTGTAAAAATTAAACAAAAACAGGAGACAGACAGAGCTGTAGTGCAATTACATGCTTTCAAGTGGAAGAAGGATATTGCC
AGTTCATCGTCAGAAAGTAAAGAGAGGTTGAAGTCCTTGAGGTCTACAAATTTTTCTGCAAAGGTTCCTCATGTGAAATCACTGAGCGGTGGAAAACACGAATCT
TTGCATCATCCAACGACTGTACTTTTTGTGGAGGTTTACCATAAATCTCGCAAGATGGTTAAGTCTCAAGAGCTCTTGGCCCTTGGACGACAAACGTTAGCAGAA
CTGAAGGACAAGATTTATTGCTCGACAGATACGTTAATGCAAAAGGCTGGGCAGCAAGATTCATCTGGATATTTTCTTGTAGAAGATGTATTCTGCAATGATTTG
AGGAATCCTTCCGCAACAGACTATAGCAAACCTATACTTGATTGGCTTAGAAACTCAGAGGATGAAGCTCGTAAGAAATGGGGATGCATCATAACCGGTGAATCC
CAACAGAAAAGTTCAGTTGTAGGTGAAGTGTCAGATTTACATGTGCCTCATTTTAGATCAGTCAGCATGAACAAGGCTCGATTTTGTGACCTGAAATTTAGGCTT
GGGGCTGGGTATCTGTACTGTCATCAGCTTAGGACCCGAGCTCAGAAATGTGATGTCTGCAACATTTACAGGGCTAAAAAGGTTACTATTGATGACAAGTGGGCG
CAGGAAAACCCATGCTACTTTTGTGAAGACTGCTATTTTCTTCTTCACTACTCGAAAGAAGGTAACCTTCTTTACAACGATTTCGTGGTGCACGATTACTTGAAA
GATTAG
mRNA sequence
ATGAAATTGTGTTTCCCTCTTCCTTGCAAAGACGATGAAACCCACGTCATTGCTGACTTAAAACACCACCGCCACCTATTGCTCACTGCCGCCCACCATCCCTCC
CAGGCAGTCGCCGTTGAGACCCCTATCGAATTCTTTTCTCTTTCTCTCATTCTCCTCTCCTCTCAATCGCTCTTCCTCGCCACACTTCACCTTTCCGAGCATTGT
CTGCTGTTCGCGCTGCTAATCAACGCCGCCCCTTCTCGTGACCGACCCAGGATTCGGTGGATTACGGTGGCGAGGTTGGAGATGTGGTTTTTGCAAGCTAAACAT
AGTTTTACCTTGGTAGTTTTTGGGCTTAATTTGAGGACTACCGATGCATGGAATACTGCTGCTTGTCGACTTAGCGTTGATATGGTAAAGGCAAGAGGCCTGGCA
GGCCATTGCAATGCGGGCGTAGGGAACAGATTAATCGAAAGTCGGGTCGTTACAAGAGCCATGGAGGCGAGCGACAGGGCTGGAACTTCAGAGGAGATTGGTCTT
AATGGAGACGGCCTCTCAATTCCTTTGGGTGGGCCTATATATGCACCCAACTTGGTTGGTCCGCTCACCAGGGTCCCTCACTTTGAGTCTTCTGTTGTTCAAGAA
CTTCAGAGTCTGGAGGCAGAGTTACAGTTGGATTCATGTCAACAATGTGATGAAGATATTTCGATTGACGAACTTAAGATATTTACTGAAGAACAGTTATTGAAT
ATGGCCTTGGAGGGATCATTGCAGAGTCATGGGAATGCTAATAACCAGTCGGAGCTTCAAGAAGAAAATATGAATGCTGGGTTGCTCAGAGAATGTGAAGAGGAA
GTTAACGGGCACAATTTGGAGGCTGATTCGGTATCAAATGCAAATAGAAGCACAAATAAAATAACAAGGAAGCGAAAAAAGGAGGAACTTTCTAATATTGAGGAA
AAATCTATCGCAAAGGTGGCTGAAATTGTAAAAATTAAACAAAAACAGGAGACAGACAGAGCTGTAGTGCAATTACATGCTTTCAAGTGGAAGAAGGATATTGCC
AGTTCATCGTCAGAAAGTAAAGAGAGGTTGAAGTCCTTGAGGTCTACAAATTTTTCTGCAAAGGTTCCTCATGTGAAATCACTGAGCGGTGGAAAACACGAATCT
TTGCATCATCCAACGACTGTACTTTTTGTGGAGGTTTACCATAAATCTCGCAAGATGGTTAAGTCTCAAGAGCTCTTGGCCCTTGGACGACAAACGTTAGCAGAA
CTGAAGGACAAGATTTATTGCTCGACAGATACGTTAATGCAAAAGGCTGGGCAGCAAGATTCATCTGGATATTTTCTTGTAGAAGATGTATTCTGCAATGATTTG
AGGAATCCTTCCGCAACAGACTATAGCAAACCTATACTTGATTGGCTTAGAAACTCAGAGGATGAAGCTCGTAAGAAATGGGGATGCATCATAACCGGTGAATCC
CAACAGAAAAGTTCAGTTGTAGGTGAAGTGTCAGATTTACATGTGCCTCATTTTAGATCAGTCAGCATGAACAAGGCTCGATTTTGTGACCTGAAATTTAGGCTT
GGGGCTGGGTATCTGTACTGTCATCAGCTTAGGACCCGAGCTCAGAAATGTGATGTCTGCAACATTTACAGGGCTAAAAAGGTTACTATTGATGACAAGTGGGCG
CAGGAAAACCCATGCTACTTTTGTGAAGACTGCTATTTTCTTCTTCACTACTCGAAAGAAGGTAACCTTCTTTACAACGATTTCGTGGTGCACGATTACTTGAAA
GATTAGGTGTGGCTGAGTTTGTATAAAAATGCTTCCCACTTGGTCATCGAGATTCATTCAGCAGTTTTGATAGTGTTGAGGATTCTGTAAATAGCTATAGGTAAA
TGGTTACAATTTTATTTTATTTGGAACCTTAATTATTCTTTTTGGGTTGGTTGTGTAATGGGGGATTGTTTTTATAAAAGGAAAGGTAAATTGTTTTCTACTTAC
ATTTTCTA
Protein sequence
MKLCFPLPCKDDETHVIADLKHHRHLLLTAAHHPSQAVAVETPIEFFSLSLILLSSQSLFLATLHLSEHCLLFALLINAAPSRDRPRIRWITVARLEMWFLQAKH
SFTLVVFGLNLRTTDAWNTAACRLSVDMVKARGLAGHCNAGVGNRLIESRVVTRAMEASDRAGTSEEIGLNGDGLSIPLGGPIYAPNLVGPLTRVPHFESSVVQE
LQSLEAELQLDSCQQCDEDISIDELKIFTEEQLLNMALEGSLQSHGNANNQSELQEENMNAGLLRECEEEVNGHNLEADSVSNANRSTNKITRKRKKEELSNIEE
KSIAKVAEIVKIKQKQETDRAVVQLHAFKWKKDIASSSSESKERLKSLRSTNFSAKVPHVKSLSGGKHESLHHPTTVLFVEVYHKSRKMVKSQELLALGRQTLAE
LKDKIYCSTDTLMQKAGQQDSSGYFLVEDVFCNDLRNPSATDYSKPILDWLRNSEDEARKKWGCIITGESQQKSSVVGEVSDLHVPHFRSVSMNKARFCDLKFRL
GAGYLYCHQLRTRAQKCDVCNIYRAKKVTIDDKWAQENPCYFCEDCYFLLHYSKEGNLLYNDFVVHDYLKD
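
The protein sequence above should correspond to a translation of the CDS sequence. As a rough consistency check, the sketch below translates a nucleotide string with the standard genetic code, which is assumed here because the record does not name a translation table. The function name `translate` and the use of `X` for unrecognised codons are illustrative choices, not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, listed in T, C, A, G order for each codon position.
# Assumption: the standard table applies to this nuclear gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    # Remove line breaks and other whitespace so wrapped sequences can be pasted in.
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        amino_acid = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks unknown codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 42 nucleotides of the CDS listed above.
cds_start = "ATGAAATTGTGTTTCCCTCTTCCTTGCAAAGACGATGAAACC"
print(translate(cds_start))
```

Passing the whole CDS block (line breaks included) to `translate` and comparing the result with the listed protein sequence is one way to confirm that the two records agree.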