; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr020456 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr020456
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionRNA polymerase II-associated protein 3
Genome locationtig00153533:353171..362266
RNA-Seq ExpressionSgr020456
SyntenySgr020456
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily
IPR019734 - Tetratricopeptide repeat


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7021040.1 RNA polymerase II-associated protein 3, partial [Cucurbita argyrosperma subsp. argyrosperma]8.9e-13477.59Show/hide
Query:  DFQGFLNDLQDWELSLKGKDKKLKPQAIVKERRDVIAIQGRRQAEKASSADYLKHYDAVNRLSRNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSR
        DFQGFLNDLQDWELSL G+DKKLKP AI KE+      +G RQ  KA++ADY+KHYDAVN LS    TEQSF D+ASEKEQGNE+FKQKKFKEAI CYSR
Subjt:  DFQGFLNDLQDWELSLKGKDKKLKPQAIVKERRDVIAIQGRRQAEKASSADYLKHYDAVNRLSRNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSR

Query:  SIALSPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPHNPEIKKQHAELRAFVGKAILEKASGA
        SIALSPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEP+N EIKKQHAELRAFVGKAILEKASGA
Subjt:  SIALSPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPHNPEIKKQHAELRAFVGKAILEKASGA

Query:  SRSSRQDKEM---------------------------------ENGGQKAVKASARLEETEDRSMGAEIRYKREATNGFHKDAIPSSNLGALER-HVTRK
        SRSS ++K+M                                 ENGG+KAVK SARLE TE RS GAEIRYKREATNG HKD  P+SNLGALER HV+RK
Subjt:  SRSSRQDKEM---------------------------------ENGGQKAVKASARLEETEDRSMGAEIRYKREATNGFHKDAIPSSNLGALER-HVTRK

Query:  QELKPSVQELASRAASRSMVEAAKNITAPTTAYQFEVSWRGFSGDRALQARLLKISS
        QELKPSVQELASRAASRSMVEAAKNI APTTAYQFEVSWRGFSGDRALQARLLK  S
Subjt:  QELKPSVQELASRAASRSMVEAAKNITAPTTAYQFEVSWRGFSGDRALQARLLKISS

XP_022156426.1 RNA polymerase II-associated protein 3 [Momordica charantia]5.2e-14280.67Show/hide
Query:  DFQGFLNDLQDWELSLKGKDKKLKPQAIVKERRDVIAIQGRRQAEKASSADYLKHYDAVNRLSRNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSR
        DFQGFLND+QDWELSLKGKDKKLKPQA  KE+      +GRR+AEKAS+ADYLKHYDAVN LSRNFQTEQSF D+ASEKE+GNE+FKQKKFKEAIDCYSR
Subjt:  DFQGFLNDLQDWELSLKGKDKKLKPQAIVKERRDVIAIQGRRQAEKASSADYLKHYDAVNRLSRNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSR

Query:  SIALSPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPHNPEIKKQHAELRAFVGKAILEKASGA
        SIALSPTAVAFANRAMAYLKI+RFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPHN EIKKQHAELRAFVGK+ILEKASGA
Subjt:  SIALSPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPHNPEIKKQHAELRAFVGKAILEKASGA

Query:  SRSSRQDKEM---------------------------------ENGGQKAVKASARLEETEDRSMGAEIRYKREATNGFHKDAIPSSNLGALER-HVTRK
        SRSS QDKE                                  EN  QKAVKAS RL++TEDRSM AEIRYKREATNGFHKDA PS NLG LER HVTRK
Subjt:  SRSSRQDKEM---------------------------------ENGGQKAVKASARLEETEDRSMGAEIRYKREATNGFHKDAIPSSNLGALER-HVTRK

Query:  QELKPSVQELASRAASRSMVEAAKNITAPTTAYQFEVSWRGFSGDRALQARLLKISS
        QELKPSVQELASRAASRSMVEAAKNITAPTTAYQFEVSWRGFSGD ALQARLLK  S
Subjt:  QELKPSVQELASRAASRSMVEAAKNITAPTTAYQFEVSWRGFSGDRALQARLLKISS

XP_038890653.1 RNA polymerase II-associated protein 3 isoform X1 [Benincasa hispida]1.5e-13376.1Show/hide
Query:  DFQGFLNDLQDWELSLKGKDKKLKPQAIVKERRDVIAIQGRRQAEKASSADYLKHYDAVNRLSRNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSR
        DFQGFLNDLQDWELS KGKDKKLKPQAI KE+      +GRRQ EKAS+ADYLKHYDAVNRLSRNFQT+QSF D+ASEKEQGNE+FKQKKFKEAIDCYSR
Subjt:  DFQGFLNDLQDWELSLKGKDKKLKPQAIVKERRDVIAIQGRRQAEKASSADYLKHYDAVNRLSRNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSR

Query:  SIALSPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPHNPEIKKQHAELRAFVGKAILEKASGA
        SIALSPTAVAFANRAMAYLKIRR+QEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDA+FAQRLEPHN EIKKQHAELRAFVGKAILEKASGA
Subjt:  SIALSPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPHNPEIKKQHAELRAFVGKAILEKASGA

Query:  SRSSRQDKE-------------------------------------------MENGGQKAVKASARLEETEDRSMGAEIRYKREATNGFHKDAIPSSNLG
        SRSS +DKE                                           MENGG+ A K SA LEE EDRS GAEI YK+ ATNGFHKD+  SS  G
Subjt:  SRSSRQDKE-------------------------------------------MENGGQKAVKASARLEETEDRSMGAEIRYKREATNGFHKDAIPSSNLG

Query:  ALERH-VTRKQELKPSVQELASRAASRSMVEAAKNITAPTTAYQFEVSWRGFSGDRALQARLLK
         LER  V RKQELKPSVQE AS AASRSMVEAAKNI APTTAYQFEVSWRGFSGDR LQARLLK
Subjt:  ALERH-VTRKQELKPSVQELASRAASRSMVEAAKNITAPTTAYQFEVSWRGFSGDRALQARLLK

XP_038890654.1 RNA polymerase II-associated protein 3 isoform X2 [Benincasa hispida]8.9e-13477.53Show/hide
Query:  DFQGFLNDLQDWELSLKGKDKKLKPQAIVKERRDVIAIQGRRQAEKASSADYLKHYDAVNRLSRNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSR
        DFQGFLNDLQDWELS KGKDKKLKPQAI KE+      +GRRQ EKAS+ADYLKHYDAVNRLSRNFQT+QSF D+ASEKEQGNE+FKQKKFKEAIDCYSR
Subjt:  DFQGFLNDLQDWELSLKGKDKKLKPQAIVKERRDVIAIQGRRQAEKASSADYLKHYDAVNRLSRNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSR

Query:  SIALSPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPHNPEIKKQHAELRAFVGKAILEKASGA
        SIALSPTAVAFANRAMAYLKIRR+QEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDA+FAQRLEPHN EIKKQHAELRAFVGKAILEKASGA
Subjt:  SIALSPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPHNPEIKKQHAELRAFVGKAILEKASGA

Query:  SRSSRQDKE-----------------------------------MENGGQKAVKASARLEETEDRSMGAEIRYKREATNGFHKDAIPSSNLGALERH-VT
        SRSS +DKE                                    ENGG+ A K SA LEE EDRS GAEI YK+ ATNGFHKD+  SS  G LER  V 
Subjt:  SRSSRQDKE-----------------------------------MENGGQKAVKASARLEETEDRSMGAEIRYKREATNGFHKDAIPSSNLGALERH-VT

Query:  RKQELKPSVQELASRAASRSMVEAAKNITAPTTAYQFEVSWRGFSGDRALQARLLK
        RKQELKPSVQE AS AASRSMVEAAKNI APTTAYQFEVSWRGFSGDR LQARLLK
Subjt:  RKQELKPSVQELASRAASRSMVEAAKNITAPTTAYQFEVSWRGFSGDRALQARLLK

XP_038890655.1 RNA polymerase II-associated protein 3 isoform X3 [Benincasa hispida]5.2e-13477.97Show/hide
Query:  DFQGFLNDLQDWELSLKGKDKKLKPQAIVKERRDVIAIQGRRQAEKASSADYLKHYDAVNRLSRNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSR
        DFQGFLNDLQDWELS KGKDKKLKPQAI KE+      +GRRQ EKAS+ADYLKHYDAVNRLSRNFQT+QSF D+ASEKEQGNE+FKQKKFKEAIDCYSR
Subjt:  DFQGFLNDLQDWELSLKGKDKKLKPQAIVKERRDVIAIQGRRQAEKASSADYLKHYDAVNRLSRNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSR

Query:  SIALSPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPHNPEIKKQHAELRAFVGKAILEKASGA
        SIALSPTAVAFANRAMAYLKIRR+QEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDA+FAQRLEPHN EIKKQHAELRAFVGKAILEKASGA
Subjt:  SIALSPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPHNPEIKKQHAELRAFVGKAILEKASGA

Query:  SRSSRQDKEM---------------------------------ENGGQKAVKASARLEETEDRSMGAEIRYKREATNGFHKDAIPSSNLGALERH-VTRK
        SRSS +DKE                                  ENGG+ A K SA LEE EDRS GAEI YK+ ATNGFHKD+  SS  G LER  V RK
Subjt:  SRSSRQDKEM---------------------------------ENGGQKAVKASARLEETEDRSMGAEIRYKREATNGFHKDAIPSSNLGALERH-VTRK

Query:  QELKPSVQELASRAASRSMVEAAKNITAPTTAYQFEVSWRGFSGDRALQARLLK
        QELKPSVQE AS AASRSMVEAAKNI APTTAYQFEVSWRGFSGDR LQARLLK
Subjt:  QELKPSVQELASRAASRSMVEAAKNITAPTTAYQFEVSWRGFSGDRALQARLLK

TrEMBL top hitse value%identityAlignment
A0A0A0LGI8 Uncharacterized protein3.8e-13076.19Show/hide
Query:  DFQGFLNDLQDWELSLKGKDKKLKPQAIVKERRDVIAIQGRRQAEKASSADYLKHYDAVNRLSRNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSR
        DFQGFLNDLQDWE+S KGKDKKLKPQAI KE+ D      RRQ EKAS+ADY+K YDAVNRLSRNFQTE SF D+ASEKEQGNE+FKQKKFKEAIDCYSR
Subjt:  DFQGFLNDLQDWELSLKGKDKKLKPQAIVKERRDVIAIQGRRQAEKASSADYLKHYDAVNRLSRNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSR

Query:  SIALSPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPHNPEIKKQHAELRAFVGKAILEKASGA
        SIALSPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEP+N EIKKQHA+LRAFVGKAILEKASGA
Subjt:  SIALSPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPHNPEIKKQHAELRAFVGKAILEKASGA

Query:  SRSSRQDKEM---------------------------------ENGGQKAVKASARLEETEDRSMGAEIRYKREATNGFHKDAIPSSNLGALER-HVTRK
        SRSS ++K+                                  ENGG  AVK SARLEE+ED S GAEI  K+ ATNGFHKD+  SS L ALER H+ RK
Subjt:  SRSSRQDKEM---------------------------------ENGGQKAVKASARLEETEDRSMGAEIRYKREATNGFHKDAIPSSNLGALER-HVTRK

Query:  QELKPSVQELASRAASRSMVEAAKNITAPTTAYQFEVSWRGFSGDRALQARLLKISS
        QELK SV ELAS+AASRSMVEAAKNI APTTAYQFEVSWRGFSGD+ALQARLLK  S
Subjt:  QELKPSVQELASRAASRSMVEAAKNITAPTTAYQFEVSWRGFSGDRALQARLLKISS

A0A6J1DS20 rab9 effector protein with kelch motifs-like6.5e-13091.18Show/hide
Query:  MGNNIELVRNGDHEPISETKHSIQFMSETLHPKRRRTTNPKVWEVESEQEEHSLSLSQHSSPSQSDQEQTPVRKVSDSVTSSQGLRLLKHVSQSSTSEPY
        MGNNIELVR GDHE   ETKHSIQ MSETLHPKRRRTTNPKVWEVESEQEEHSLSLSQHSSPSQSDQEQTPVRKVSDSVTSSQGLRLLKHVS SSTSEP+
Subjt:  MGNNIELVRNGDHEPISETKHSIQFMSETLHPKRRRTTNPKVWEVESEQEEHSLSLSQHSSPSQSDQEQTPVRKVSDSVTSSQGLRLLKHVSQSSTSEPY

Query:  SISRTQPEFRNAVQSASPQDHPYFGHQNQLKPEQQQLLHVVRPVKEHKSLETGLVQNLIGSEVRGRVDGAFDSGFLMTATVNGKIYRGVLFTPGPGVFSR
        SISRTQPEFRN VQSA  QD PY GHQNQLKPEQQ LLHVVRPVKE KSLE G++QNLIGSEVRGRVDGAFDSGFLMTATVNGKIYRGVLFTPGPGVFSR
Subjt:  SISRTQPEFRNAVQSASPQDHPYFGHQNQLKPEQQQLLHVVRPVKEHKSLETGLVQNLIGSEVRGRVDGAFDSGFLMTATVNGKIYRGVLFTPGPGVFSR

Query:  ATIVAENPLLPANTLPNSNHVEPSKTLQHRPLVPMPESAQSFRQAQLSPPVPIIKPTPSSLPVKLRDDLQGV
        ATI+AEN  LPANTLPNSNHVE SKTLQHRPLVPMPES Q+FRQAQ+SPPVPIIKPTPSSLPVKLRDDLQGV
Subjt:  ATIVAENPLLPANTLPNSNHVEPSKTLQHRPLVPMPESAQSFRQAQLSPPVPIIKPTPSSLPVKLRDDLQGV

A0A6J1DUX9 RNA polymerase II-associated protein 32.5e-14280.67Show/hide
Query:  DFQGFLNDLQDWELSLKGKDKKLKPQAIVKERRDVIAIQGRRQAEKASSADYLKHYDAVNRLSRNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSR
        DFQGFLND+QDWELSLKGKDKKLKPQA  KE+      +GRR+AEKAS+ADYLKHYDAVN LSRNFQTEQSF D+ASEKE+GNE+FKQKKFKEAIDCYSR
Subjt:  DFQGFLNDLQDWELSLKGKDKKLKPQAIVKERRDVIAIQGRRQAEKASSADYLKHYDAVNRLSRNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSR

Query:  SIALSPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPHNPEIKKQHAELRAFVGKAILEKASGA
        SIALSPTAVAFANRAMAYLKI+RFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPHN EIKKQHAELRAFVGK+ILEKASGA
Subjt:  SIALSPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPHNPEIKKQHAELRAFVGKAILEKASGA

Query:  SRSSRQDKEM---------------------------------ENGGQKAVKASARLEETEDRSMGAEIRYKREATNGFHKDAIPSSNLGALER-HVTRK
        SRSS QDKE                                  EN  QKAVKAS RL++TEDRSM AEIRYKREATNGFHKDA PS NLG LER HVTRK
Subjt:  SRSSRQDKEM---------------------------------ENGGQKAVKASARLEETEDRSMGAEIRYKREATNGFHKDAIPSSNLGALER-HVTRK

Query:  QELKPSVQELASRAASRSMVEAAKNITAPTTAYQFEVSWRGFSGDRALQARLLKISS
        QELKPSVQELASRAASRSMVEAAKNITAPTTAYQFEVSWRGFSGD ALQARLLK  S
Subjt:  QELKPSVQELASRAASRSMVEAAKNITAPTTAYQFEVSWRGFSGDRALQARLLKISS

A0A6J1FCA0 RNA polymerase II-associated protein 3 isoform X11.3e-13377.31Show/hide
Query:  DFQGFLNDLQDWELSLKGKDKKLKPQAIVKERRDVIAIQGRRQAEKASSADYLKHYDAVNRLSRNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSR
        DFQGFLNDLQDWELSL G+DKKLKP AI KE+      +G RQ  KA++ADY+KHYDAVN LS    TEQSF D+ASEKEQGNE+FKQKKFKEAI CYSR
Subjt:  DFQGFLNDLQDWELSLKGKDKKLKPQAIVKERRDVIAIQGRRQAEKASSADYLKHYDAVNRLSRNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSR

Query:  SIALSPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPHNPEIKKQHAELRAFVGKAILEKASGA
        SIALSPTAVAFANRAMAYLKIRRFQEAEDDCTEALN DDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEP+N EIKKQHAELRAFVGKAILEKASGA
Subjt:  SIALSPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPHNPEIKKQHAELRAFVGKAILEKASGA

Query:  SRSSRQDKEM---------------------------------ENGGQKAVKASARLEETEDRSMGAEIRYKREATNGFHKDAIPSSNLGALER-HVTRK
        SRSS ++K+M                                 ENGG+KAVK SARLE TE RS GAEIRYKREATNG HKD  P+SNLGALER HV+RK
Subjt:  SRSSRQDKEM---------------------------------ENGGQKAVKASARLEETEDRSMGAEIRYKREATNGFHKDAIPSSNLGALER-HVTRK

Query:  QELKPSVQELASRAASRSMVEAAKNITAPTTAYQFEVSWRGFSGDRALQARLLKISS
        QELKPSVQELASRAASRSMVEAAKNI APTTAYQFEVSWRGFSGDRALQARLLK  S
Subjt:  QELKPSVQELASRAASRSMVEAAKNITAPTTAYQFEVSWRGFSGDRALQARLLKISS

A0A6J1HPT1 RNA polymerase II-associated protein 3 isoform X19.6e-13477.31Show/hide
Query:  DFQGFLNDLQDWELSLKGKDKKLKPQAIVKERRDVIAIQGRRQAEKASSADYLKHYDAVNRLSRNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSR
        DFQGFLNDLQDWELSL G+DKKLKP AI KE+      +G RQ  KA++ADY+KHYDA+N LS     EQSF D+ASEKEQGNE+FKQKKFKEAI CYSR
Subjt:  DFQGFLNDLQDWELSLKGKDKKLKPQAIVKERRDVIAIQGRRQAEKASSADYLKHYDAVNRLSRNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSR

Query:  SIALSPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPHNPEIKKQHAELRAFVGKAILEKASGA
        SIALSPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEP+N EIKKQHAELRAFVGKAILEKASGA
Subjt:  SIALSPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPHNPEIKKQHAELRAFVGKAILEKASGA

Query:  SRSSRQDKEM---------------------------------ENGGQKAVKASARLEETEDRSMGAEIRYKREATNGFHKDAIPSSNLGALER-HVTRK
        SRSS +DK+M                                 ENGG+KAVK SARLE TE RS GAEIRYKREATNG HKD  P+SNLGALER HV+RK
Subjt:  SRSSRQDKEM---------------------------------ENGGQKAVKASARLEETEDRSMGAEIRYKREATNGFHKDAIPSSNLGALER-HVTRK

Query:  QELKPSVQELASRAASRSMVEAAKNITAPTTAYQFEVSWRGFSGDRALQARLLKISS
        QELKPSVQELASRAASRSMVEAAKNI APTTAYQFEVSWRGFSGDRALQARLLK  S
Subjt:  QELKPSVQELASRAASRSMVEAAKNITAPTTAYQFEVSWRGFSGDRALQARLLKISS

SwissProt top hitse value%identityAlignment
Q28IV3 RNA polymerase II-associated protein 39.0e-2045.74Show/hide
Query:  RNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSRSIALSPT-AVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEAL
        +  + +Q    +  +K+ GN +FK+ K++ AIDCYS+ +    T A+  ANRAMAYLKI++++EAE DCT A++LD  Y KA++RR TAR  LGK KEA 
Subjt:  RNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSRSIALSPT-AVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEAL

Query:  EDAEFAQRLEPHNP----EIKKQHAELRA
        ED E   +L+P N     E++K   ELR+
Subjt:  EDAEFAQRLEPHNP----EIKKQHAELRA

Q5ZKQ3 RNA polymerase II-associated protein 32.8e-2143.33Show/hide
Query:  QAEKASSADYLKHYDAV-NRLSRN----FQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSRSIALSPT-AVAFANRAMAYLKIRRFQEAEDDCTEALN
        QA  + S++  +  +AV + L+ N     + EQ    + +EK+ GN +FK+ K++ AI+CY+R IA   T A+  ANRAMAYLKI++++EAE+DCT+AL 
Subjt:  QAEKASSADYLKHYDAV-NRLSRN----FQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSRSIALSPT-AVAFANRAMAYLKIRRFQEAEDDCTEALN

Query:  LDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPHNPEIKKQHAELR
        LD  Y KA++RR  AR  LGK KEA++D E   +LEP N +   +  ++R
Subjt:  LDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPHNPEIKKQHAELR

Q6NU95 RNA polymerase II-associated protein 32.9e-1841.33Show/hide
Query:  RNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSRSIALSPT-AVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEAL
        +  + +Q    +  +K+ GN +FK+ K++ AI+CYS+ +    T A+  ANRAMAYLKI++++EAE DCT A++LD  Y KA++RR TA   LGK KEA 
Subjt:  RNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSRSIALSPT-AVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEAL

Query:  EDAEFAQRLEPHNP----EIKKQHAELRAFVGKAILEKASGASRSSRQDK
        ED E   +L+P N     E+ K   ELR+      +EK    ++ S Q K
Subjt:  EDAEFAQRLEPHNP----EIKKQHAELRAFVGKAILEKASGASRSSRQDK

Q9D706 RNA polymerase II-associated protein 34.8e-2148.84Show/hide
Query:  EQSFADSASEKEQGNEFFKQKKFKEAIDCYSRSIALSPT-AVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEF
        +Q    + +EK+ GN FFK+ K+++AI+CY+R IA   T A+  ANRAMAYLKI+R++EAE DCT+A+ LD  Y KA++RR TAR  LGK  EA +D E 
Subjt:  EQSFADSASEKEQGNEFFKQKKFKEAIDCYSRSIALSPT-AVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEF

Query:  AQRLEPHNPEIKKQHAELRAFVGKAILEK
           LEP N    KQ A   + + K ++EK
Subjt:  AQRLEPHNPEIKKQHAELRAFVGKAILEK

Q9H6T3 RNA polymerase II-associated protein 38.1e-2140.88Show/hide
Query:  QAEKASSADYLKHYDAVNRLS----RNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSRSIAL-SPTAVAFANRAMAYLKIRRFQEAEDDCTEALNL
        QA  +    Y K  D V + +    +  + +Q+   + SEK++GN FFK+ K++ AI+CY+R IA     A+  ANRAMAYLKI++++EAE DCT+A+ L
Subjt:  QAEKASSADYLKHYDAVNRLS----RNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSRSIAL-SPTAVAFANRAMAYLKIRRFQEAEDDCTEALNL

Query:  DDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPHNPEIKKQHAELRAFVGKAILEK
        D  Y KA++RR TAR  LGK  EA +D E    LEP N +   + ++++    K ++EK
Subjt:  DDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPHNPEIKKQHAELRAFVGKAILEK

Arabidopsis top hitse value%identityAlignment
AT1G56440.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.9e-7949.73Show/hide
Query:  DFQGFLNDLQDWELSLKGKDKKLKPQAIVKERRDVIAIQGRRQAEKASSADYLKHYDAVNRLSRNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSR
        DFQGF NDLQDWELSLK KDKK+K Q                +   +   D+ K Y ++  LS +   E S  DS+SEKEQGNEFFKQKKF EAIDCYSR
Subjt:  DFQGFLNDLQDWELSLKGKDKKLKPQAIVKERRDVIAIQGRRQAEKASSADYLKHYDAVNRLSRNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSR

Query:  SIALSPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPHNPEIKKQHAELRAFVGKAILEKASGA
        SIALSP AV +ANRAMAYLKI+R++EAE DCTEALNLDDRYIKAYSRRATARKELG  KEA EDAEFA RLEP + E+KKQ+A++++ + K I+EKA+GA
Subjt:  SIALSPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPHNPEIKKQHAELRAFVGKAILEKASGA

Query:  SRSSRQ--------DKEM------------------------------ENGGQKAVKASARLEETEDRSMG----AEIRYKREATNG---FHKDAIPSSN
         +S+ Q        DK++                              E+ G+K ++     E++++ SM      EI   ++ T G   + K+A PS  
Subjt:  SRSSRQ--------DKEM------------------------------ENGGQKAVKASARLEETEDRSMG----AEIRYKREATNG---FHKDAIPSSN

Query:  LGAL----ERHVTRKQELKPSVQELASRAASRSMVEAAKNITAPTTAYQFEVSWRGFSGDRALQARLLKISS
         G      E  V+++ ELKPSVQELA+ AAS +M EA+KNI  P +AY+FE SWR FSGD AL+++LLK+++
Subjt:  LGAL----ERHVTRKQELKPSVQELASRAASRSMVEAAKNITAPTTAYQFEVSWRGFSGDRALQARLLKISS

AT1G56440.2 Tetratricopeptide repeat (TPR)-like superfamily protein2.4e-7647.95Show/hide
Query:  DFQGFLNDLQDWELSLKGKDKKLKPQAIVKERRDVIAIQGRRQAEKASSADYLKHYDAVNRLSRNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSR
        DFQGF NDLQDWELSLK KDKK+K Q                +   +   D+ K Y ++  LS +   E S  DS+SEKEQGNEFFKQKKF EAIDCYSR
Subjt:  DFQGFLNDLQDWELSLKGKDKKLKPQAIVKERRDVIAIQGRRQAEKASSADYLKHYDAVNRLSRNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSR

Query:  SIALSPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPHNPEIKKQHAELRAFVGKAILEKASGA
        SIALSP AV +ANRAMAYLKI+R++EAE DCTEALNLDDRYIKAYSRRATARKELG  KEA EDAEFA RLEP + E+KKQ+A++++ + K I+EKA+GA
Subjt:  SIALSPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPHNPEIKKQHAELRAFVGKAILEKASGA

Query:  SRSSRQ--------DKEM------------------------------ENGGQKAVK-----------------ASARLEE-TEDRSMG----AEIRYKR
         +S+ Q        DK++                              E+ G+K ++                  S  L+E +++ SM      EI   +
Subjt:  SRSSRQ--------DKEM------------------------------ENGGQKAVK-----------------ASARLEE-TEDRSMG----AEIRYKR

Query:  EATNG---FHKDAIPSSNLGAL----ERHVTRKQELKPSVQELASRAASRSMVEAAKNITAPTTAYQFEVSWRGFSGDRALQARLLKISS
        + T G   + K+A PS   G      E  V+++ ELKPSVQELA+ AAS +M EA+KNI  P +AY+FE SWR FSGD AL+++LLK+++
Subjt:  EATNG---FHKDAIPSSNLGAL----ERHVTRKQELKPSVQELASRAASRSMVEAAKNITAPTTAYQFEVSWRGFSGDRALQARLLKISS

AT2G42810.1 protein phosphatase 5.24.0e-1538.03Show/hide
Query:  QTEQSFADSASE-KEQGNEFFKQKKFKEAIDCYSRSIAL-SPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALED
        + E S    A E K Q NE FK  K+  AID Y+++I L S  AV +ANRA A+ K+  +  A  D ++A+ +D RY K Y RR  A   +GK K+AL+D
Subjt:  QTEQSFADSASE-KEQGNEFFKQKKFKEAIDCYSRSIAL-SPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALED

Query:  AEFAQRLEPHNPEIKKQHAELRAFVGKAILEKASGASRSSRQ
         +  +RL P++P+  ++  E    V K   E+A     S R+
Subjt:  AEFAQRLEPHNPEIKKQHAELRAFVGKAILEKASGASRSSRQ

AT2G42810.2 protein phosphatase 5.24.0e-1538.03Show/hide
Query:  QTEQSFADSASE-KEQGNEFFKQKKFKEAIDCYSRSIAL-SPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALED
        + E S    A E K Q NE FK  K+  AID Y+++I L S  AV +ANRA A+ K+  +  A  D ++A+ +D RY K Y RR  A   +GK K+AL+D
Subjt:  QTEQSFADSASE-KEQGNEFFKQKKFKEAIDCYSRSIAL-SPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEALED

Query:  AEFAQRLEPHNPEIKKQHAELRAFVGKAILEKASGASRSSRQ
         +  +RL P++P+  ++  E    V K   E+A     S R+
Subjt:  AEFAQRLEPHNPEIKKQHAELRAFVGKAILEKASGASRSSRQ

AT3G17970.1 translocon at the outer membrane of chloroplasts 64-III2.3e-1844.09Show/hide
Query:  SRNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSRSIALSP-TAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEA
        S+   T++  A+ A  KE+GN+ FK+K +++AI  YS +I LS   A  ++NRA AYL++  F +AE+DCT+A+ LD + +KAY RR TAR+ LG  K A
Subjt:  SRNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSRSIALSP-TAVAFANRAMAYLKIRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKAKEA

Query:  LEDAEFAQRLEPHNPEIKKQHAELRAF
        +ED  +A  LEP+N         LR F
Subjt:  LEDAEFAQRLEPHNPEIKKQHAELRAF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAACAACATAGAGCTAGTTAGAAACGGAGATCATGAACCAATATCAGAGACAAAACACTCTATTCAGTTCATGTCAGAGACATTACATCCAAAACGGAGGAGAAC
CACAAATCCAAAAGTGTGGGAGGTCGAATCCGAGCAAGAGGAGCATTCTTTATCATTGTCCCAGCACTCATCACCATCACAATCAGATCAAGAACAGACCCCAGTCCGAA
AAGTCTCTGACTCAGTAACAAGTTCCCAAGGATTACGACTTCTTAAGCACGTGAGTCAGAGCTCAACTAGCGAACCTTACAGCATTTCTAGAACCCAACCCGAGTTCAGA
AATGCTGTCCAAAGCGCTTCGCCACAAGATCATCCGTATTTCGGACACCAGAACCAGCTAAAGCCTGAACAGCAGCAGCTTCTCCATGTTGTTCGCCCAGTGAAAGAACA
CAAGTCCTTGGAGACCGGCCTCGTCCAGAACCTGATTGGATCGGAGGTTCGAGGAAGAGTCGACGGGGCTTTCGACTCTGGCTTCCTAATGACTGCTACTGTCAATGGGA
AAATATACAGAGGTGTCCTGTTCACACCCGGACCTGGGGTCTTCTCAAGGGCCACCATTGTTGCAGAGAATCCACTTCTTCCAGCAAACACACTCCCAAACTCGAATCAC
GTAGAACCGTCCAAGACCTTGCAGCATCGACCGTTGGTTCCGATGCCCGAGTCTGCTCAGAGCTTCAGGCAAGCTCAACTCAGTCCTCCAGTTCCAATCATAAAACCCAC
TCCATCTTCATTACCAGTCAAGCTTAGAGATGATCTTCAGGGTGTCAAAAATAAAATTAAAAAAAATGAAAAAAAAAAAGCAAGAGAACGGGGTCGGTGTCAGCTGTCTG
ATCTGTCTGCCACTTGGGATTCACAGATTTTGTATCCATGGCGAAGACCCGACCGGAGGGGACTGCTTTCGGCCGACGCTCCGAGGCTTTGGACGCTTCGAGAAGCACTC
CGAGACCTCGTTAAGCTAGAATCTCCGTTCTCTTTCTTCGTTTTCATTTCGTTTGGTCTCCAATGGCCGAGCCCTCTGGCAAGCACGGGCGTGATCATTGAGCGTCCTGT
TGCTGCATATCGACGCATTGATAGGAAAAACAGTATGATTCGTTGTCTGTGGAATAGTGTATGGTGTGCATTTCTGTTCCCTTGCATTGCAGTGATAAACAGGCTGAATG
TATTGAAAGAGGGATTAGAATTCGATTTCCAGGGATTTTTGAATGATTTGCAGGATTGGGAACTCTCCCTTAAGGGAAAAGACAAGAAATTGAAGCCACAGGCCATTGTT
AAAGAAAGGAGAGATGTCATAGCTATCCAGGGTAGAAGGCAGGCAGAAAAAGCTTCATCAGCTGATTACTTGAAGCACTATGATGCAGTTAACCGTCTATCAAGAAATTT
TCAGACGGAACAGAGTTTTGCTGATTCTGCTTCAGAGAAAGAACAGGGTAATGAGTTTTTTAAGCAGAAGAAGTTTAAAGAAGCTATTGACTGCTATTCAAGAAGTATTG
CTTTGTCACCAACGGCTGTAGCCTTTGCAAATAGGGCCATGGCCTACCTAAAAATTAGAAGATTTCAGGAGGCTGAGGATGACTGTACAGAGGCCTTAAATTTAGATGAT
CGATATATTAAAGCATATTCACGCCGAGCAACAGCTAGAAAGGAACTCGGGAAAGCTAAAGAAGCCTTGGAGGATGCTGAATTTGCCCAGAGGTTGGAGCCTCACAACCC
AGAGATCAAGAAGCAACATGCTGAGCTCAGAGCTTTTGTTGGGAAAGCAATTCTTGAGAAGGCATCTGGTGCATCGAGAAGCTCCAGACAGGATAAGGAGATGGAAAATG
GTGGGCAAAAAGCTGTTAAGGCATCAGCTCGTTTAGAGGAAACCGAGGACAGAAGTATGGGAGCTGAAATCAGATACAAAAGAGAAGCAACAAATGGTTTTCACAAAGAT
GCCATCCCAAGTTCGAATTTGGGGGCATTGGAGAGACATGTTACGAGAAAGCAGGAACTGAAGCCGTCAGTGCAGGAACTTGCTTCTCGAGCAGCTTCTAGAAGTATGGT
TGAAGCTGCAAAAAACATCACAGCCCCAACTACTGCCTATCAATTCGAAGTTTCTTGGCGAGGATTCTCTGGTGATCGTGCACTGCAGGCTCGTCTTTTGAAGATCTCCT
CAAGATTTGGGATGAAGTATTTTGTGATGAGGCTGTTCCTATTGAGTATGCAGAAACGCTCGATAGCTTGCGTCAAAGGTAGTGCCTTAGATAGGGTTCAGTTGCCCCAT
GTTGTGTTGTGGTGCACTGCCACATGCAGGAGGACAATTGAATGGCCCATTTCCATTCATCTGGACCATCCTCAACTCAAAAGTCTTTATGTGAATATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGAACAACATAGAGCTAGTTAGAAACGGAGATCATGAACCAATATCAGAGACAAAACACTCTATTCAGTTCATGTCAGAGACATTACATCCAAAACGGAGGAGAAC
CACAAATCCAAAAGTGTGGGAGGTCGAATCCGAGCAAGAGGAGCATTCTTTATCATTGTCCCAGCACTCATCACCATCACAATCAGATCAAGAACAGACCCCAGTCCGAA
AAGTCTCTGACTCAGTAACAAGTTCCCAAGGATTACGACTTCTTAAGCACGTGAGTCAGAGCTCAACTAGCGAACCTTACAGCATTTCTAGAACCCAACCCGAGTTCAGA
AATGCTGTCCAAAGCGCTTCGCCACAAGATCATCCGTATTTCGGACACCAGAACCAGCTAAAGCCTGAACAGCAGCAGCTTCTCCATGTTGTTCGCCCAGTGAAAGAACA
CAAGTCCTTGGAGACCGGCCTCGTCCAGAACCTGATTGGATCGGAGGTTCGAGGAAGAGTCGACGGGGCTTTCGACTCTGGCTTCCTAATGACTGCTACTGTCAATGGGA
AAATATACAGAGGTGTCCTGTTCACACCCGGACCTGGGGTCTTCTCAAGGGCCACCATTGTTGCAGAGAATCCACTTCTTCCAGCAAACACACTCCCAAACTCGAATCAC
GTAGAACCGTCCAAGACCTTGCAGCATCGACCGTTGGTTCCGATGCCCGAGTCTGCTCAGAGCTTCAGGCAAGCTCAACTCAGTCCTCCAGTTCCAATCATAAAACCCAC
TCCATCTTCATTACCAGTCAAGCTTAGAGATGATCTTCAGGGTGTCAAAAATAAAATTAAAAAAAATGAAAAAAAAAAAGCAAGAGAACGGGGTCGGTGTCAGCTGTCTG
ATCTGTCTGCCACTTGGGATTCACAGATTTTGTATCCATGGCGAAGACCCGACCGGAGGGGACTGCTTTCGGCCGACGCTCCGAGGCTTTGGACGCTTCGAGAAGCACTC
CGAGACCTCGTTAAGCTAGAATCTCCGTTCTCTTTCTTCGTTTTCATTTCGTTTGGTCTCCAATGGCCGAGCCCTCTGGCAAGCACGGGCGTGATCATTGAGCGTCCTGT
TGCTGCATATCGACGCATTGATAGGAAAAACAGTATGATTCGTTGTCTGTGGAATAGTGTATGGTGTGCATTTCTGTTCCCTTGCATTGCAGTGATAAACAGGCTGAATG
TATTGAAAGAGGGATTAGAATTCGATTTCCAGGGATTTTTGAATGATTTGCAGGATTGGGAACTCTCCCTTAAGGGAAAAGACAAGAAATTGAAGCCACAGGCCATTGTT
AAAGAAAGGAGAGATGTCATAGCTATCCAGGGTAGAAGGCAGGCAGAAAAAGCTTCATCAGCTGATTACTTGAAGCACTATGATGCAGTTAACCGTCTATCAAGAAATTT
TCAGACGGAACAGAGTTTTGCTGATTCTGCTTCAGAGAAAGAACAGGGTAATGAGTTTTTTAAGCAGAAGAAGTTTAAAGAAGCTATTGACTGCTATTCAAGAAGTATTG
CTTTGTCACCAACGGCTGTAGCCTTTGCAAATAGGGCCATGGCCTACCTAAAAATTAGAAGATTTCAGGAGGCTGAGGATGACTGTACAGAGGCCTTAAATTTAGATGAT
CGATATATTAAAGCATATTCACGCCGAGCAACAGCTAGAAAGGAACTCGGGAAAGCTAAAGAAGCCTTGGAGGATGCTGAATTTGCCCAGAGGTTGGAGCCTCACAACCC
AGAGATCAAGAAGCAACATGCTGAGCTCAGAGCTTTTGTTGGGAAAGCAATTCTTGAGAAGGCATCTGGTGCATCGAGAAGCTCCAGACAGGATAAGGAGATGGAAAATG
GTGGGCAAAAAGCTGTTAAGGCATCAGCTCGTTTAGAGGAAACCGAGGACAGAAGTATGGGAGCTGAAATCAGATACAAAAGAGAAGCAACAAATGGTTTTCACAAAGAT
GCCATCCCAAGTTCGAATTTGGGGGCATTGGAGAGACATGTTACGAGAAAGCAGGAACTGAAGCCGTCAGTGCAGGAACTTGCTTCTCGAGCAGCTTCTAGAAGTATGGT
TGAAGCTGCAAAAAACATCACAGCCCCAACTACTGCCTATCAATTCGAAGTTTCTTGGCGAGGATTCTCTGGTGATCGTGCACTGCAGGCTCGTCTTTTGAAGATCTCCT
CAAGATTTGGGATGAAGTATTTTGTGATGAGGCTGTTCCTATTGAGTATGCAGAAACGCTCGATAGCTTGCGTCAAAGGTAGTGCCTTAGATAGGGTTCAGTTGCCCCAT
GTTGTGTTGTGGTGCACTGCCACATGCAGGAGGACAATTGAATGGCCCATTTCCATTCATCTGGACCATCCTCAACTCAAAAGTCTTTATGTGAATATGTAG
Protein sequenceShow/hide protein sequence
MGNNIELVRNGDHEPISETKHSIQFMSETLHPKRRRTTNPKVWEVESEQEEHSLSLSQHSSPSQSDQEQTPVRKVSDSVTSSQGLRLLKHVSQSSTSEPYSISRTQPEFR
NAVQSASPQDHPYFGHQNQLKPEQQQLLHVVRPVKEHKSLETGLVQNLIGSEVRGRVDGAFDSGFLMTATVNGKIYRGVLFTPGPGVFSRATIVAENPLLPANTLPNSNH
VEPSKTLQHRPLVPMPESAQSFRQAQLSPPVPIIKPTPSSLPVKLRDDLQGVKNKIKKNEKKKARERGRCQLSDLSATWDSQILYPWRRPDRRGLLSADAPRLWTLREAL
RDLVKLESPFSFFVFISFGLQWPSPLASTGVIIERPVAAYRRIDRKNSMIRCLWNSVWCAFLFPCIAVINRLNVLKEGLEFDFQGFLNDLQDWELSLKGKDKKLKPQAIV
KERRDVIAIQGRRQAEKASSADYLKHYDAVNRLSRNFQTEQSFADSASEKEQGNEFFKQKKFKEAIDCYSRSIALSPTAVAFANRAMAYLKIRRFQEAEDDCTEALNLDD
RYIKAYSRRATARKELGKAKEALEDAEFAQRLEPHNPEIKKQHAELRAFVGKAILEKASGASRSSRQDKEMENGGQKAVKASARLEETEDRSMGAEIRYKREATNGFHKD
AIPSSNLGALERHVTRKQELKPSVQELASRAASRSMVEAAKNITAPTTAYQFEVSWRGFSGDRALQARLLKISSRFGMKYFVMRLFLLSMQKRSIACVKGSALDRVQLPH
VVLWCTATCRRTIEWPISIHLDHPQLKSLYVNM