CuGenDBv2

Sgr026083 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr026083
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: U1 small nuclear ribonucleoprotein 70 kDa
Genome location: tig00153031:1585207..1596038
RNA-Seq Expression: Sgr026083
Synteny: Sgr026083
Gene Ontology terms:
GO:0070897 - transcription preinitiation complex assembly (biological process)
GO:0005634 - nucleus (cellular component)
GO:0009527 - plastid outer membrane (cellular component)
GO:0097550 - transcriptional preinitiation complex (cellular component)
GO:0000182 - rDNA binding (molecular function)
GO:0017025 - TBP-class protein binding (molecular function)
GO:0030619 - U1 snRNA binding (molecular function)
InterPro domains:
IPR000504 - RNA recognition motif domain
IPR000812 - Transcription factor TFIIB
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR013150 - Transcription factor TFIIB, cyclin-like domain
IPR013763 - Cyclin-like
IPR022023 - U1 small nuclear ribonucleoprotein of 70kDa N-terminal
IPR034143 - snRNP70, RNA recognition motif
IPR035979 - RNA-binding domain superfamily
IPR036915 - Cyclin-like superfamily


Homology
GenBank top hits
KAA0036237.1 plant-specific TFIIB-related protein 1 [Cucumis melo var. makuwa] | e-value: 6.3e-234 | % identity: 92.53
Query:  MKCPYCSAAQGRCATSSSGKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQHDQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF
        MKCPYCSAAQGRC TSS+GKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQ+DQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF
Subjt:  MKCPYCSAAQGRCATSSSGKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQHDQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF

Query:  SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA
        SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA
Subjt:  SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA

Query:  ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT
        ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT
Subjt:  ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT

Query:  EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRSSAPKVDAFEGASLEKDKQIETKPNIIPTEISDMGHPSRVKEDSESKFVSRGMHNTVI
        EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRS+APK+DAFEGASLEKDK IETKPNI  TEI++M HPSRVKEDSESKFVSRGM+N V 
Subjt:  EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRSSAPKVDAFEGASLEKDKQIETKPNIIPTEISDMGHPSRVKEDSESKFVSRGMHNTVI

Query:  NKSSTFCQPQPPKGTSVAN-AGKKSQ-SDTQGMDIVREHSNSQQWKVFTEPPLFS
        NKSSTF QP PPKG SVA+  GKKSQ +D QGMDIV+EHSNSQ  +   +P + S
Subjt:  NKSSTFCQPQPPKGTSVAN-AGKKSQ-SDTQGMDIVREHSNSQQWKVFTEPPLFS

XP_004143581.1 plant-specific TFIIB-related protein 1 [Cucumis sativus] | e-value: 8.2e-234 | % identity: 94.59
Query:  MKCPYCSAAQGRCATSSSGKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQHDQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF
        MKCPYCSAAQGRC TSS+GKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQ+DQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF
Subjt:  MKCPYCSAAQGRCATSSSGKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQHDQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF

Query:  SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA
        SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA
Subjt:  SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA

Query:  ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT
        ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT
Subjt:  ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT

Query:  EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRSSAPKVDAFEGASLEKDKQIETKPNIIPTEISDMGHPSRVKEDSESKFVSRGMHNTVI
        EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRS+APK+DAFEGASLEKDK IETKPN I TEIS+M HPSRVKEDSESKFVS GM+N V 
Subjt:  EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRSSAPKVDAFEGASLEKDKQIETKPNIIPTEISDMGHPSRVKEDSESKFVSRGMHNTVI

Query:  NKSSTFCQPQPPKGTSVAN-AGKKSQ-SDTQGMDIVREHSNSQQ
        NKSSTF QP PPKG SVA   GKKSQ +DTQGMDIV++HSNSQQ
Subjt:  NKSSTFCQPQPPKGTSVAN-AGKKSQ-SDTQGMDIVREHSNSQQ

XP_008440699.1 PREDICTED: uncharacterized protein LOC103485039 [Cucumis melo] | e-value: 4.8e-234 | % identity: 94.59
Query:  MKCPYCSAAQGRCATSSSGKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQHDQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF
        MKCPYCSAAQGRC TSS+GKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQ+DQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF
Subjt:  MKCPYCSAAQGRCATSSSGKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQHDQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF

Query:  SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA
        SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA
Subjt:  SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA

Query:  ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT
        ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT
Subjt:  ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT

Query:  EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRSSAPKVDAFEGASLEKDKQIETKPNIIPTEISDMGHPSRVKEDSESKFVSRGMHNTVI
        EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRS+APK+DAFEGASLEKDK IETKPNI  TEI++M HPSRVKEDSESKFVSRGM+N V 
Subjt:  EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRSSAPKVDAFEGASLEKDKQIETKPNIIPTEISDMGHPSRVKEDSESKFVSRGMHNTVI

Query:  NKSSTFCQPQPPKGTSVAN-AGKKSQ-SDTQGMDIVREHSNSQQ
        NKSSTF QP PPKG SVA+  GKKSQ +D QGMDIV+EHSNSQQ
Subjt:  NKSSTFCQPQPPKGTSVAN-AGKKSQ-SDTQGMDIVREHSNSQQ

XP_022132384.1 plant-specific TFIIB-related protein 1 [Momordica charantia] | e-value: 6.3e-234 | % identity: 94.78
Query:  MKCPYCSAAQGRCATSSSGKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQHDQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF
        MKCPYCSAAQGRCATSSSGKSITECISCGRVVEERQFQPHHLF LRAQDNPL LVTSDLPTPP+HHQHDQ LDPFEPTGF+TAFSTWSLEH+PLF RSCF
Subjt:  MKCPYCSAAQGRCATSSSGKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQHDQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF

Query:  SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA
        SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA
Subjt:  SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA

Query:  ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT
        ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT
Subjt:  ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT

Query:  EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRSSAPKVDAFEGASLEKDKQIETKPNIIPTEISDMGHPSRVKEDSESKFVSRGMHNTVI
        EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRSSAPKVDAFEG SL+KDK  ETKPNIIP EISDMGHPSRVKEDSESKFVSRGM+N V 
Subjt:  EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRSSAPKVDAFEGASLEKDKQIETKPNIIPTEISDMGHPSRVKEDSESKFVSRGMHNTVI

Query:  NKSSTFCQPQPPKGTSVAN-AGKKSQSDTQGMDIVREHSNS
        N+S+TFCQP P KGTSVA+ AGKKSQSD QGMDIV+EHSNS
Subjt:  NKSSTFCQPQPPKGTSVAN-AGKKSQSDTQGMDIVREHSNS

XP_038882162.1 plant-specific TFIIB-related protein 1 [Benincasa hispida] | e-value: 3.3e-235 | % identity: 94.78
Query:  MKCPYCSAAQGRCATSSSGKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQHDQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF
        MKCPYCSAAQGRCATSS+GKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQ+DQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF
Subjt:  MKCPYCSAAQGRCATSSSGKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQHDQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF

Query:  SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA
        SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA
Subjt:  SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA

Query:  ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT
        ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT
Subjt:  ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT

Query:  EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRSSAPKVDAFEGASLEKDKQIETKPNIIPTEISDMGHPSRVKEDSESKFVSRGMHNTVI
        EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRS+APKVD FEGAS+EKDK IETKPN I TEIS+MGHPSRVKEDSESKFVSRGM+N V 
Subjt:  EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRSSAPKVDAFEGASLEKDKQIETKPNIIPTEISDMGHPSRVKEDSESKFVSRGMHNTVI

Query:  NKSSTFCQPQPPKGTSV-ANAGKKSQSDTQGMDIVREHSNS
        +KSSTFCQP PPKG S+ +  GKK+QSD QGMDIV+EHSNS
Subjt:  NKSSTFCQPQPPKGTSV-ANAGKKSQSDTQGMDIVREHSNS

TrEMBL top hits
A0A0A0KM51 Uncharacterized protein | e-value: 4.0e-234 | % identity: 94.59
Query:  MKCPYCSAAQGRCATSSSGKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQHDQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF
        MKCPYCSAAQGRC TSS+GKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQ+DQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF
Subjt:  MKCPYCSAAQGRCATSSSGKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQHDQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF

Query:  SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA
        SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA
Subjt:  SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA

Query:  ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT
        ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT
Subjt:  ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT

Query:  EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRSSAPKVDAFEGASLEKDKQIETKPNIIPTEISDMGHPSRVKEDSESKFVSRGMHNTVI
        EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRS+APK+DAFEGASLEKDK IETKPN I TEIS+M HPSRVKEDSESKFVS GM+N V 
Subjt:  EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRSSAPKVDAFEGASLEKDKQIETKPNIIPTEISDMGHPSRVKEDSESKFVSRGMHNTVI

Query:  NKSSTFCQPQPPKGTSVAN-AGKKSQ-SDTQGMDIVREHSNSQQ
        NKSSTF QP PPKG SVA   GKKSQ +DTQGMDIV++HSNSQQ
Subjt:  NKSSTFCQPQPPKGTSVAN-AGKKSQ-SDTQGMDIVREHSNSQQ

A0A1S3B2G9 uncharacterized protein LOC103485039 | e-value: 2.3e-234 | % identity: 94.59
Query:  MKCPYCSAAQGRCATSSSGKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQHDQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF
        MKCPYCSAAQGRC TSS+GKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQ+DQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF
Subjt:  MKCPYCSAAQGRCATSSSGKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQHDQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF

Query:  SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA
        SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA
Subjt:  SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA

Query:  ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT
        ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT
Subjt:  ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT

Query:  EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRSSAPKVDAFEGASLEKDKQIETKPNIIPTEISDMGHPSRVKEDSESKFVSRGMHNTVI
        EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRS+APK+DAFEGASLEKDK IETKPNI  TEI++M HPSRVKEDSESKFVSRGM+N V 
Subjt:  EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRSSAPKVDAFEGASLEKDKQIETKPNIIPTEISDMGHPSRVKEDSESKFVSRGMHNTVI

Query:  NKSSTFCQPQPPKGTSVAN-AGKKSQ-SDTQGMDIVREHSNSQQ
        NKSSTF QP PPKG SVA+  GKKSQ +D QGMDIV+EHSNSQQ
Subjt:  NKSSTFCQPQPPKGTSVAN-AGKKSQ-SDTQGMDIVREHSNSQQ

A0A5D3CMR8 Plant-specific TFIIB-related protein 1 | e-value: 3.0e-234 | % identity: 92.53
Query:  MKCPYCSAAQGRCATSSSGKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQHDQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF
        MKCPYCSAAQGRC TSS+GKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQ+DQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF
Subjt:  MKCPYCSAAQGRCATSSSGKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQHDQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF

Query:  SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA
        SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA
Subjt:  SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA

Query:  ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT
        ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT
Subjt:  ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT

Query:  EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRSSAPKVDAFEGASLEKDKQIETKPNIIPTEISDMGHPSRVKEDSESKFVSRGMHNTVI
        EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRS+APK+DAFEGASLEKDK IETKPNI  TEI++M HPSRVKEDSESKFVSRGM+N V 
Subjt:  EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRSSAPKVDAFEGASLEKDKQIETKPNIIPTEISDMGHPSRVKEDSESKFVSRGMHNTVI

Query:  NKSSTFCQPQPPKGTSVAN-AGKKSQ-SDTQGMDIVREHSNSQQWKVFTEPPLFS
        NKSSTF QP PPKG SVA+  GKKSQ +D QGMDIV+EHSNSQ  +   +P + S
Subjt:  NKSSTFCQPQPPKGTSVAN-AGKKSQ-SDTQGMDIVREHSNSQQWKVFTEPPLFS

A0A6J1BSX3 plant-specific TFIIB-related protein 1 | e-value: 3.0e-234 | % identity: 94.78
Query:  MKCPYCSAAQGRCATSSSGKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQHDQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF
        MKCPYCSAAQGRCATSSSGKSITECISCGRVVEERQFQPHHLF LRAQDNPL LVTSDLPTPP+HHQHDQ LDPFEPTGF+TAFSTWSLEH+PLF RSCF
Subjt:  MKCPYCSAAQGRCATSSSGKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQHDQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF

Query:  SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA
        SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA
Subjt:  SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA

Query:  ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT
        ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT
Subjt:  ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT

Query:  EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRSSAPKVDAFEGASLEKDKQIETKPNIIPTEISDMGHPSRVKEDSESKFVSRGMHNTVI
        EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRSSAPKVDAFEG SL+KDK  ETKPNIIP EISDMGHPSRVKEDSESKFVSRGM+N V 
Subjt:  EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRSSAPKVDAFEGASLEKDKQIETKPNIIPTEISDMGHPSRVKEDSESKFVSRGMHNTVI

Query:  NKSSTFCQPQPPKGTSVAN-AGKKSQSDTQGMDIVREHSNS
        N+S+TFCQP P KGTSVA+ AGKKSQSD QGMDIV+EHSNS
Subjt:  NKSSTFCQPQPPKGTSVAN-AGKKSQSDTQGMDIVREHSNS

A0A6J1GE41 plant-specific TFIIB-related protein 1 | e-value: 6.3e-232 | % identity: 93.93
Query:  MKCPYCSAAQGRCATSSSGKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQHDQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF
        MKCPYCSAAQGRCATSS+GKSITEC+SCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLP+PPVHHQ+DQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF
Subjt:  MKCPYCSAAQGRCATSSSGKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQHDQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF

Query:  SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA
        SFSGHLAELERTLESTSS+NLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA
Subjt:  SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA

Query:  ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT
        ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT
Subjt:  ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT

Query:  EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRSSAPKVDAFEGASLEKDKQIETKPNIIPTEISDMGHPSRVKEDSESKFVSRGMHNTVI
        EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRS+APKVDAFE  S EKDKQI+TKPN I TEISDMGHPSRVKED+ESKFVSRGM+N VI
Subjt:  EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRSSAPKVDAFEGASLEKDKQIETKPNIIPTEISDMGHPSRVKEDSESKFVSRGMHNTVI

Query:  NKSSTFCQPQ-PPKGTSVANAGKKSQSDTQGMDIVRE--HSNSQQ
        NKSSTFCQP  PPKGT     GKKSQSD QGMDIV+E  HS SQQ
Subjt:  NKSSTFCQPQ-PPKGTSVANAGKKSQSDTQGMDIVRE--HSNSQQ

SwissProt top hits
O23215 Plant-specific TFIIB-related protein 1 | e-value: 4.3e-169 | % identity: 74.83
Query:  MKCPYCSAAQGRCATSSSGKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQHDQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF
        MKCPYCS+AQGRC T+SSG+SITEC SCGRV+EERQ Q HHLFHLRAQD PLCLVTSDL T       D+  DPFEPTGFITAFSTWSLE +P+F RS  
Subjt:  MKCPYCSAAQGRCATSSSGKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQHDQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF

Query:  SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA
        SFSGHLAELERTLE  SS++  +SSTVVVDNLRAYMQIIDVAS+LGLD  ISEHAF+LFRDCCSATCLRNRSVEALATA LVQAIREAQEPRTLQEISIA
Subjt:  SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA

Query:  ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT
        ANV QKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICK+TGLT
Subjt:  ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT

Query:  EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRSSAPK-VDAFEGASLEKDKQIETKPNIIPTEISDMGHPS-RVKEDSESKFVSRGMHNT
        EVTLRKVYKELLENWDDLLPSNYTPAVPPE+AFPTT I++ RS+ P+ VD  E + +EKD     KP+  P E  D  +   + KED + KF    +  T
Subjt:  EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRSSAPK-VDAFEGASLEKDKQIETKPNIIPTEISDMGHPS-RVKEDSESKFVSRGMHNT

Query:  --VINKSSTFCQPQPPKGTSVANAGKKSQSDTQ
          V+N +    +P  P         +K Q D Q
Subjt:  --VINKSSTFCQPQPPKGTSVANAGKKSQSDTQ

P08621 U1 small nuclear ribonucleoprotein 70 kDa | e-value: 5.1e-61 | % identity: 49.67
Query:  LTANLLKLFEPRPPLDYKP-----PPEKRKCPPLTGMAQFVSNFAEPGDPEYAPPVQKGETPAQRRARIHLLRIEKGAEKAAEELPKYLERDDPNVTGDP
        L  NLL LF PR P+ Y P     P EK    P  G+A ++  F +P D   APP  + ET  +R  R    +IE+  ++   EL  +   +DPN  GD 
Subjt:  LTANLLKLFEPRPPLDYKP-----PPEKRKCPPLTGMAQFVSNFAEPGDPEYAPPVQKGETPAQRRARIHLLRIEKGAEKAAEELPKYLERDDPNVTGDP

Query:  YKTLFVARLNYETSESRIKREFESYGPIKRVRLITDKATGKPKGYAFIEYMHKRDMKAAYKQADGRKIDGRRVLVDVERGRTVPNWRPRRLGGGLGTTRV
        +KTLFVAR+NY+T+ES+++REFE YGPIKR+ ++  K +GKP+GYAFIEY H+RDM +AYK ADG+KIDGRRVLVDVERGRTV  WRPRRLGGGLG TR 
Subjt:  YKTLFVARLNYETSESRIKREFESYGPIKRVRLITDKATGKPKGYAFIEYMHKRDMKAAYKQADGRKIDGRRVLVDVERGRTVPNWRPRRLGGGLGTTRV

Query:  GGEENNQRYPGREQQQPGAPSRSEEPRAREDRHGERDREKSYEKGREREREREKSRERSRD---RSKDRDLKEDRHHRDRDRTRDRD------RDRDRDR
        GG + N R+ GR+            P    DR  +R+RE+  E+ RER++ERE+ R RSRD   RS+ RD +E R  R+R + +DRD      R R+R R
Subjt:  GGEENNQRYPGREQQQPGAPSRSEEPRAREDRHGERDREKSYEKGREREREREKSRERSRD---RSKDRDLKEDRHHRDRDRTRDRD------RDRDRDR

Query:  EERE
         ERE
Subjt:  EERE

Q1RMR2 U1 small nuclear ribonucleoprotein 70 kDa | e-value: 5.1e-61 | % identity: 49.67
Query:  LTANLLKLFEPRPPLDYKP-----PPEKRKCPPLTGMAQFVSNFAEPGDPEYAPPVQKGETPAQRRARIHLLRIEKGAEKAAEELPKYLERDDPNVTGDP
        L  NLL LF PR P+ Y P     P EK    P  G+A ++  F +P D   APP  + ET  +R  R    +IE+  ++   EL  +   +DPN  GD 
Subjt:  LTANLLKLFEPRPPLDYKP-----PPEKRKCPPLTGMAQFVSNFAEPGDPEYAPPVQKGETPAQRRARIHLLRIEKGAEKAAEELPKYLERDDPNVTGDP

Query:  YKTLFVARLNYETSESRIKREFESYGPIKRVRLITDKATGKPKGYAFIEYMHKRDMKAAYKQADGRKIDGRRVLVDVERGRTVPNWRPRRLGGGLGTTRV
        +KTLFVAR+NY+T+ES+++REFE YGPIKR+ ++  K +GKP+GYAFIEY H+RDM +AYK ADG+KIDGRRVLVDVERGRTV  WRPRRLGGGLG TR 
Subjt:  YKTLFVARLNYETSESRIKREFESYGPIKRVRLITDKATGKPKGYAFIEYMHKRDMKAAYKQADGRKIDGRRVLVDVERGRTVPNWRPRRLGGGLGTTRV

Query:  GGEENNQRYPGREQQQPGAPSRSEEPRAREDRHGERDREKSYEKGREREREREKSRERSRD---RSKDRDLKEDRHHRDRDRTRDRD------RDRDRDR
        GG + N R+ GR+            P    DR  +R+RE+  E+ RER++ERE+ R RSRD   RS+ RD +E R  R+R + +DRD      R R+R R
Subjt:  GGEENNQRYPGREQQQPGAPSRSEEPRAREDRHGERDREKSYEKGREREREREKSRERSRD---RSKDRDLKEDRHHRDRDRTRDRD------RDRDRDR

Query:  EERE
         ERE
Subjt:  EERE

Q42404 U1 small nuclear ribonucleoprotein 70 kDa | e-value: 1.4e-114 | % identity: 53.3
Query:  MGDYNDAFMRNQNAAVQARTKAQNRANVLQLKLVGGMVSIIVIISVEAFDFVCLKSARIGQSHPTGLTANLLKLFEPRPPLDYKPPPEKRKCPPLTGMAQ
        MGD  D F+RN NAAVQAR K QNRANVLQLKL                         +GQSHPTGLT NLLKLFEPRPPL+YKPPPEKRKCPP TGMAQ
Subjt:  MGDYNDAFMRNQNAAVQARTKAQNRANVLQLKLVGGMVSIIVIISVEAFDFVCLKSARIGQSHPTGLTANLLKLFEPRPPLDYKPPPEKRKCPPLTGMAQ

Query:  FVSNFAEPGDPEYAPPVQKGETPAQRRARIHLLRIEKGAEKAAEELPKYLERDDPNVTGDPYKTLFVARLNYETSESRIKREFESYGPIKRVRLITDKAT
        FVSNFAEPGDPEYAPP  + E P+Q+R RIH LR+EKG EKAAE+L KY   +DPN TGDPYKTLFV+RLNYE+SES+IKREFESYGPIKRV L+TD+ T
Subjt:  FVSNFAEPGDPEYAPPVQKGETPAQRRARIHLLRIEKGAEKAAEELPKYLERDDPNVTGDPYKTLFVARLNYETSESRIKREFESYGPIKRVRLITDKAT

Query:  GKPKGYAFIEYMHKRDMKAAYKQADGRKIDGRRVLVDVERGRTVPNWRPRRLGGGLGTTRVGGEENNQRYPGREQQQPGAPSRSEEP-RAREDRHGERDR
         KPKGYAFIEYMH RDMKAAYKQADG+KIDGRRVLVDVERGRTVPNWRPRRLGGGLGT+RVGG E        EQQ  G  S+SEEP R RE      +R
Subjt:  GKPKGYAFIEYMHKRDMKAAYKQADGRKIDGRRVLVDVERGRTVPNWRPRRLGGGLGTTRVGGEENNQRYPGREQQQPGAPSRSEEP-RAREDRHGERDR

Query:  EKSYEKGREREREREKSRERSRDRSKDRDLKEDRHHRDRDR-TRDRDRDRDRDREERETVAMTVIKHVIVIEEGIEVVIMSVIEIAIVIGTEIGRGIGIM
        EKS EKG+ERER RE S E+ R+RS+DR  +ED+HHRDRD+  RDRDRD  RDR+                                             
Subjt:  EKSYEKGREREREREKSRERSRDRSKDRDLKEDRHHRDRDR-TRDRDRDRDRDREERETVAMTVIKHVIVIEEGIEVVIMSVIEIAIVIGTEIGRGIGIM

Query:  KGDIQTLIVAIPMIRNLIMIELNQNMKRIDMSEHGHRHPDPDHDPQYYDHFEHNRS--------------RGQYDHSEGHRDQDRYD-QYDRM-EDDYHY
        +GD           R+    +  ++    D      R  + D++   Y+H    RS              RG Y+  +G  D DRY  +YD+M EDD+ Y
Subjt:  KGDIQTLIVAIPMIRNLIMIELNQNMKRIDMSEHGHRHPDPDHDPQYYDHFEHNRS--------------RGQYDHSEGHRDQDRYD-QYDRM-EDDYHY

Query:  ERGTSESHDRERTRDMNHEHRRTERSHSREY
        ER                E++R++RS SREY
Subjt:  ERGTSESHDRERTRDMNHEHRRTERSHSREY

Q62376 U1 small nuclear ribonucleoprotein 70 kDa | e-value: 8.7e-61 | % identity: 49.67
Query:  LTANLLKLFEPRPPLDYKP-----PPEKRKCPPLTGMAQFVSNFAEPGDPEYAPPVQKGETPAQRRARIHLLRIEKGAEKAAEELPKYLERDDPNVTGDP
        L  NLL LF PR P+ Y P     P EK    P  G+A ++  F +P D   APP  + ET  +R  R    +IE+  ++   EL  +   +DPN  GD 
Subjt:  LTANLLKLFEPRPPLDYKP-----PPEKRKCPPLTGMAQFVSNFAEPGDPEYAPPVQKGETPAQRRARIHLLRIEKGAEKAAEELPKYLERDDPNVTGDP

Query:  YKTLFVARLNYETSESRIKREFESYGPIKRVRLITDKATGKPKGYAFIEYMHKRDMKAAYKQADGRKIDGRRVLVDVERGRTVPNWRPRRLGGGLGTTRV
        +KTLFVAR+NY+T+ES+++REFE YGPIKR+ ++  K +GKP+GYAFIEY H+RDM +AYK ADG+KIDGRRVLVDVERGRTV  WRPRRLGGGLG TR 
Subjt:  YKTLFVARLNYETSESRIKREFESYGPIKRVRLITDKATGKPKGYAFIEYMHKRDMKAAYKQADGRKIDGRRVLVDVERGRTVPNWRPRRLGGGLGTTRV

Query:  GGEENNQRYPGREQQQPGAPSRSEEPRAREDRHGERDREKSYEKGREREREREKSRERSRD---RSKDRDLKEDRHHRDRDRTRDRD------RDRDRDR
        GG + N R+ GR+            P    DR  +R+RE+  E+ RER++ERE+ R RSRD   RS+ RD  E R  R+R + +DRD      R R+R R
Subjt:  GGEENNQRYPGREQQQPGAPSRSEEPRAREDRHGERDREKSYEKGREREREREKSRERSRD---RSKDRDLKEDRHHRDRDRTRDRD------RDRDRDR

Query:  EERE
         ERE
Subjt:  EERE

Arabidopsis top hits
AT2G43370.1 RNA-binding (RRM/RBD/RNP motifs) family protein | e-value: 7.9e-17 | % identity: 36.57
Query:  DPNVTGDPYKTLFVARLNYETSESRIKREFESYGPIKRVRLITDKATGKPKGYAFIEYMHKRDMKAAYKQADGRKIDGRRVLVDVERGRTVPNWRPRRLG
        D    GDPY TLFV RL++ T+E  ++     YG IK +RL+    TG  +GY F+EY  +++M  AY+ A    IDGR ++VD  R + +P W PRRLG
Subjt:  DPNVTGDPYKTLFVARLNYETSESRIKREFESYGPIKRVRLITDKATGKPKGYAFIEYMHKRDMKAAYKQADGRKIDGRRVLVDVERGRTVPNWRPRRLG

Query:  GGLGTTRVGGEENNQRYPGREQ--QQPGAPSRSEEPR------AREDRHGER------DREKSYEKGREREREREKSRERSRDRSKDR-DLKEDRHHRDR
        GGLG  +  G+    R+ GR++  + P  P   E+ +        E R+  R       R K     RE E  REKS     +  K+R  L+    HR  
Subjt:  GGLGTTRVGGEENNQRYPGREQ--QQPGAPSRSEEPR------AREDRHGER------DREKSYEKGREREREREKSRERSRDRSKDR-DLKEDRHHRDR

Query:  DRTRDRDRDRDRDREE
          T    R R +DREE
Subjt:  DRTRDRDRDRDRDREE

AT3G50670.1 U1 small nuclear ribonucleoprotein-70K | e-value: 9.7e-116 | % identity: 53.3
Query:  MGDYNDAFMRNQNAAVQARTKAQNRANVLQLKLVGGMVSIIVIISVEAFDFVCLKSARIGQSHPTGLTANLLKLFEPRPPLDYKPPPEKRKCPPLTGMAQ
        MGD  D F+RN NAAVQAR K QNRANVLQLKL                         +GQSHPTGLT NLLKLFEPRPPL+YKPPPEKRKCPP TGMAQ
Subjt:  MGDYNDAFMRNQNAAVQARTKAQNRANVLQLKLVGGMVSIIVIISVEAFDFVCLKSARIGQSHPTGLTANLLKLFEPRPPLDYKPPPEKRKCPPLTGMAQ

Query:  FVSNFAEPGDPEYAPPVQKGETPAQRRARIHLLRIEKGAEKAAEELPKYLERDDPNVTGDPYKTLFVARLNYETSESRIKREFESYGPIKRVRLITDKAT
        FVSNFAEPGDPEYAPP  + E P+Q+R RIH LR+EKG EKAAE+L KY   +DPN TGDPYKTLFV+RLNYE+SES+IKREFESYGPIKRV L+TD+ T
Subjt:  FVSNFAEPGDPEYAPPVQKGETPAQRRARIHLLRIEKGAEKAAEELPKYLERDDPNVTGDPYKTLFVARLNYETSESRIKREFESYGPIKRVRLITDKAT

Query:  GKPKGYAFIEYMHKRDMKAAYKQADGRKIDGRRVLVDVERGRTVPNWRPRRLGGGLGTTRVGGEENNQRYPGREQQQPGAPSRSEEP-RAREDRHGERDR
         KPKGYAFIEYMH RDMKAAYKQADG+KIDGRRVLVDVERGRTVPNWRPRRLGGGLGT+RVGG E        EQQ  G  S+SEEP R RE      +R
Subjt:  GKPKGYAFIEYMHKRDMKAAYKQADGRKIDGRRVLVDVERGRTVPNWRPRRLGGGLGTTRVGGEENNQRYPGREQQQPGAPSRSEEP-RAREDRHGERDR

Query:  EKSYEKGREREREREKSRERSRDRSKDRDLKEDRHHRDRDR-TRDRDRDRDRDREERETVAMTVIKHVIVIEEGIEVVIMSVIEIAIVIGTEIGRGIGIM
        EKS EKG+ERER RE S E+ R+RS+DR  +ED+HHRDRD+  RDRDRD  RDR+                                             
Subjt:  EKSYEKGREREREREKSRERSRDRSKDRDLKEDRHHRDRDR-TRDRDRDRDRDREERETVAMTVIKHVIVIEEGIEVVIMSVIEIAIVIGTEIGRGIGIM

Query:  KGDIQTLIVAIPMIRNLIMIELNQNMKRIDMSEHGHRHPDPDHDPQYYDHFEHNRS--------------RGQYDHSEGHRDQDRYD-QYDRM-EDDYHY
        +GD           R+    +  ++    D      R  + D++   Y+H    RS              RG Y+  +G  D DRY  +YD+M EDD+ Y
Subjt:  KGDIQTLIVAIPMIRNLIMIELNQNMKRIDMSEHGHRHPDPDHDPQYYDHFEHNRS--------------RGQYDHSEGHRDQDRYD-QYDRM-EDDYHY

Query:  ERGTSESHDRERTRDMNHEHRRTERSHSREY
        ER                E++R++RS SREY
Subjt:  ERGTSESHDRERTRDMNHEHRRTERSHSREY

AT3G50670.2 U1 small nuclear ribonucleoprotein-70K | e-value: 2.8e-70 | % identity: 70.31
Query:  MGDYNDAFMRNQNAAVQARTKAQNRANVLQLKLVGGMVSIIVIISVEAFDFVCLKSARIGQSHPTGLTANLLKLFEPRPPLDYKPPPEKRKCPPLTGMAQ
        MGD  D F+RN NAAVQAR K QNRANVLQLKL                         +GQSHPTGLT NLLKLFEPRPPL+YKPPPEKRKCPP TGMAQ
Subjt:  MGDYNDAFMRNQNAAVQARTKAQNRANVLQLKLVGGMVSIIVIISVEAFDFVCLKSARIGQSHPTGLTANLLKLFEPRPPLDYKPPPEKRKCPPLTGMAQ

Query:  FVSNFAEPGDPEYAPPVQKGETPAQRRARIHLLRIEKGAEKAAEELPKYLERDDPNVTGDPYKTLFVARLNYETSESRIKREFESYGPIKRV
        FVSNFAEPGDPEYAPP  + E P+Q+R RIH LR+EKG EKAAE+L KY   +DPN TGDPYKTLFV+RLNYE+SES+IKREFESYGPIKRV
Subjt:  FVSNFAEPGDPEYAPPVQKGETPAQRRARIHLLRIEKGAEKAAEELPKYLERDDPNVTGDPYKTLFVARLNYETSESRIKREFESYGPIKRV

AT4G36650.1 plant-specific TFIIB-related protein | e-value: 3.1e-170 | % identity: 74.83
Query:  MKCPYCSAAQGRCATSSSGKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQHDQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF
        MKCPYCS+AQGRC T+SSG+SITEC SCGRV+EERQ Q HHLFHLRAQD PLCLVTSDL T       D+  DPFEPTGFITAFSTWSLE +P+F RS  
Subjt:  MKCPYCSAAQGRCATSSSGKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQHDQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF

Query:  SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA
        SFSGHLAELERTLE  SS++  +SSTVVVDNLRAYMQIIDVAS+LGLD  ISEHAF+LFRDCCSATCLRNRSVEALATA LVQAIREAQEPRTLQEISIA
Subjt:  SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA

Query:  ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT
        ANV QKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICK+TGLT
Subjt:  ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT

Query:  EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRSSAPK-VDAFEGASLEKDKQIETKPNIIPTEISDMGHPS-RVKEDSESKFVSRGMHNT
        EVTLRKVYKELLENWDDLLPSNYTPAVPPE+AFPTT I++ RS+ P+ VD  E + +EKD     KP+  P E  D  +   + KED + KF    +  T
Subjt:  EVTLRKVYKELLENWDDLLPSNYTPAVPPERAFPTTVIASGRSSAPK-VDAFEGASLEKDKQIETKPNIIPTEISDMGHPS-RVKEDSESKFVSRGMHNT

Query:  --VINKSSTFCQPQPPKGTSVANAGKKSQSDTQ
          V+N +    +P  P         +K Q D Q
Subjt:  --VINKSSTFCQPQPPKGTSVANAGKKSQSDTQ

AT4G36650.2 plant-specific TFIIB-related protein | e-value: 8.4e-144 | % identity: 75.07
Query:  MKCPYCSAAQGRCATSSSGKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQHDQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF
        MKCPYCS+AQGRC T+SSG+SITEC SCGRV+EERQ Q HHLFHLRAQD PLCLVTSDL T       D+  DPFEPTGFITAFSTWSLE +P+F RS  
Subjt:  MKCPYCSAAQGRCATSSSGKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQHDQVLDPFEPTGFITAFSTWSLEHNPLFFRSCF

Query:  SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA
        SFSGHLAELERTLE  SS++  +SSTVVVDNLRAYMQIIDVAS+LGLD  ISEHAF+LFRDCCSATCLRNRSVEALATA LVQAIREAQEPRTLQEISIA
Subjt:  SFSGHLAELERTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIA

Query:  ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT
        ANV QKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICK+TGLT
Subjt:  ANVPQKEIGKYIKILGEALQLSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLT

Query:  EVTLRKVY---------KELLENWDDLLPSNYTPAVPPERAFPTTVIASGRSSAPKVDAFEGASL
        E   RK +         ++LLE W  L   ++    P       ++I +    A K+ +    SL
Subjt:  EVTLRKVY---------KELLENWDDLLPSNYTPAVPPERAFPTTVIASGRSSAPKVDAFEGASL


Sequences
CDS sequence
ATGAAGTGCCCCTACTGTTCGGCTGCGCAGGGGCGGTGCGCCACCTCAAGCTCCGGGAAGTCCATCACAGAGTGCATTTCCTGCGGCCGTGTGGTCGAAGAGCGCCAGTT
CCAGCCCCACCACCTCTTCCACCTTCGCGCTCAAGACAACCCCCTTTGCCTCGTCACCTCCGACCTACCAACCCCGCCGGTCCACCACCAGCACGATCAAGTTCTGGATC
CTTTCGAGCCCACTGGCTTCATTACAGCCTTCTCCACATGGTCCTTAGAGCACAATCCACTCTTCTTCCGCTCTTGCTTCTCCTTCTCCGGCCACCTTGCGGAGCTTGAA
CGCACCCTTGAGTCTACATCGTCATCCAATTTGCCGTCTTCGTCGACGGTCGTGGTGGATAACCTTAGGGCTTATATGCAGATTATCGACGTCGCTTCTCTTTTGGGGTT
GGATTACTATATTTCCGAGCACGCATTTAAGTTGTTTAGGGATTGTTGCTCGGCTACTTGTTTAAGGAACCGGAGCGTTGAGGCGCTTGCGACTGCTGCCCTTGTGCAGG
CCATTAGGGAGGCGCAGGAGCCCCGAACCCTTCAGGAAATCTCCATTGCAGCCAATGTACCCCAGAAAGAGATTGGAAAGTACATAAAGATTTTGGGAGAAGCTTTGCAA
TTAAGTCAACCCATTAATAGCAATTCCATATCAGTCCATATGCCAAGGTTCTGCACGCTTCTCCAACTCAATAAATCTGCCCAGGAACTGGCAACTCATATTGGGGAAGT
TGTCATCAACAAATGCTTCTGCACTCGTAGGAATCCCATTAGCATCTCTGCTGCCGCTATATATTTGGCCTGCCAATTAGAAGACAAGCGGAAAACACAAGCAGAAATTT
GTAAGGTTACAGGTCTCACTGAAGTCACCCTCCGGAAAGTCTACAAAGAGCTACTAGAAAATTGGGATGATTTGCTTCCATCTAATTATACTCCTGCTGTTCCTCCAGAA
AGAGCATTTCCTACAACCGTAATTGCTTCAGGCCGTTCTTCAGCTCCTAAAGTTGATGCATTTGAAGGGGCTTCTTTAGAAAAGGACAAGCAGATAGAGACTAAACCTAA
TATTATACCCACCGAGATCTCAGACATGGGTCATCCATCCAGAGTCAAAGAAGATAGTGAGAGTAAATTTGTATCTCGCGGGATGCATAACACTGTAATCAACAAGTCAT
CAACTTTTTGTCAACCACAACCTCCTAAAGGGACTTCTGTAGCAAATGCAGGAAAGAAGAGTCAAAGTGATACTCAGGGAATGGATATTGTTAGGGAGCACTCCAACAGT
CAACAGTGGAAAGTCTTTACAGAACCACCCCTTTTTTCTCTCTCTCTCTCACACACACTGACACACATAGTGGAGGAGGTTGGGAAAGATGCTGCTGCAGCTGTTGTCAC
CCTCTTTAGTACGATCATTACCGCCAATGGTTTGAATTTCGCATGTAAACCGGCGAATTCGCGGACGGCAGTGAAGGTCCAGGCCATGGCCAAGGAAGGAAGCGAGAGCG
AAGGGGGCATTGCCGAGACGGCGGCTATAGCCGGCGGGCTAGTCGCGACCCCAGTGATTGGTTGGTCGCTGCGGGTCGCTGGGCGCATTGGAGGGCGTCAGCTACTTGGC
AGTGGTGGGCATCGTCGCATGGTCACTGTACACGAAGACCAAAACTGGGTCGGGTCTGCCCAACGGCCCTTTCGGGTTTTGGGCGCCGTTGAAGGTCTGTCCTACTTGAC
TCTGCTGGCCATCTTCGTGGTTTTCGGATTGCAATACTTTGAGCAGGGCTACATCCCCGGCCCTCTCCCGGCCGATCAGTGCTTCGAGCGCGCTACCATGGGAGATTACA
ATGATGCTTTCATGCGGAACCAAAACGCGGCGGTTCAGGCTCGTACCAAGGCCCAGAATCGTGCCAATGTCCTTCAACTCAAACTGGTTGGTGGAATGGTTTCGATTATT
GTTATCATTTCTGTAGAAGCCTTCGATTTTGTTTGTTTGAAGTCAGCTAGGATTGGGCAGAGTCATCCCACTGGTCTTACGGCCAATCTTCTGAAGCTCTTTGAGCCCCG
ACCGCCTTTGGACTATAAACCTCCCCCGGAGAAAAGAAAATGTCCGCCATTAACAGGAATGGCCCAATTTGTGAGTAATTTTGCAGAACCTGGTGATCCTGAATATGCTC
CACCTGTTCAAAAGGGTGAAACTCCTGCACAAAGGAGGGCTAGAATTCATTTGCTAAGAATTGAAAAGGGTGCAGAGAAAGCTGCTGAGGAGCTGCCAAAATATCTTGAA
CGTGATGATCCAAATGTTACTGGAGATCCATACAAAACACTCTTTGTGGCTAGACTGAATTATGAGACATCTGAGAGCAGAATCAAAAGAGAGTTTGAGTCTTATGGGCC
AATCAAGCGGGTCCGATTGATTACAGACAAAGCGACAGGTAAACCTAAAGGCTATGCTTTCATTGAGTACATGCACAAAAGAGATATGAAAGCTGCATATAAGCAAGCTG
ATGGTAGGAAGATTGATGGTAGAAGGGTACTTGTTGATGTCGAGCGAGGAAGAACAGTACCGAATTGGCGTCCTCGCCGGCTGGGTGGTGGTCTTGGAACTACCAGAGTG
GGAGGGGAAGAAAATAACCAGAGGTACCCTGGAAGGGAGCAACAGCAGCCTGGAGCACCATCTCGATCTGAGGAACCTAGGGCACGTGAAGATCGCCATGGAGAGCGAGA
TAGGGAAAAGTCTTATGAGAAGGGAAGAGAAAGAGAGAGAGAGAGAGAGAAGTCTCGTGAACGTTCTCGTGACAGATCTAAGGATCGTGACCTCAAAGAAGACAGGCATC
ACAGAGATCGTGATAGGACTAGGGATAGAGATAGAGATAGAGATAGAGATAGAGAAGAGAGAGAGACCGTGGCTATGACCGTGATAAAACACGTGATCGTGATCGAGGAA
GGGATCGAGGTCGTGATTATGAGCGTGATCGAGATCGCGATCGTCATCGGGACAGAGATAGGGAGAGGGATCGGGATCATGAAGGGGGATATCCAGACCCTGATCGTGGC
CATTCCCATGATAAGGAATCTGATTATGATCGAGTTGAATCAAAATATGAAAAGGATAGACATGTCTGAGCATGGGCACAGACACCCAGATCCTGACCACGATCCCCAAT
ACTACGATCACTTTGAACATAACCGAAGTCGAGGGCAATATGATCACTCAGAAGGCCATCGTGACCAAGACCGTTATGATCAGTATGATAGAATGGAGGATGATTATCAT
TATGAGCGTGGCACTTCTGAATCACATGATAGAGAGAGGACTCGTGATATGAATCATGAACATAGGCGCACTGAGAGATCTCATTCGAGAGAGTATTAG
mRNA sequence
ATGAAGTGCCCCTACTGTTCGGCTGCGCAGGGGCGGTGCGCCACCTCAAGCTCCGGGAAGTCCATCACAGAGTGCATTTCCTGCGGCCGTGTGGTCGAAGAGCGCCAGTT
CCAGCCCCACCACCTCTTCCACCTTCGCGCTCAAGACAACCCCCTTTGCCTCGTCACCTCCGACCTACCAACCCCGCCGGTCCACCACCAGCACGATCAAGTTCTGGATC
CTTTCGAGCCCACTGGCTTCATTACAGCCTTCTCCACATGGTCCTTAGAGCACAATCCACTCTTCTTCCGCTCTTGCTTCTCCTTCTCCGGCCACCTTGCGGAGCTTGAA
CGCACCCTTGAGTCTACATCGTCATCCAATTTGCCGTCTTCGTCGACGGTCGTGGTGGATAACCTTAGGGCTTATATGCAGATTATCGACGTCGCTTCTCTTTTGGGGTT
GGATTACTATATTTCCGAGCACGCATTTAAGTTGTTTAGGGATTGTTGCTCGGCTACTTGTTTAAGGAACCGGAGCGTTGAGGCGCTTGCGACTGCTGCCCTTGTGCAGG
CCATTAGGGAGGCGCAGGAGCCCCGAACCCTTCAGGAAATCTCCATTGCAGCCAATGTACCCCAGAAAGAGATTGGAAAGTACATAAAGATTTTGGGAGAAGCTTTGCAA
TTAAGTCAACCCATTAATAGCAATTCCATATCAGTCCATATGCCAAGGTTCTGCACGCTTCTCCAACTCAATAAATCTGCCCAGGAACTGGCAACTCATATTGGGGAAGT
TGTCATCAACAAATGCTTCTGCACTCGTAGGAATCCCATTAGCATCTCTGCTGCCGCTATATATTTGGCCTGCCAATTAGAAGACAAGCGGAAAACACAAGCAGAAATTT
GTAAGGTTACAGGTCTCACTGAAGTCACCCTCCGGAAAGTCTACAAAGAGCTACTAGAAAATTGGGATGATTTGCTTCCATCTAATTATACTCCTGCTGTTCCTCCAGAA
AGAGCATTTCCTACAACCGTAATTGCTTCAGGCCGTTCTTCAGCTCCTAAAGTTGATGCATTTGAAGGGGCTTCTTTAGAAAAGGACAAGCAGATAGAGACTAAACCTAA
TATTATACCCACCGAGATCTCAGACATGGGTCATCCATCCAGAGTCAAAGAAGATAGTGAGAGTAAATTTGTATCTCGCGGGATGCATAACACTGTAATCAACAAGTCAT
CAACTTTTTGTCAACCACAACCTCCTAAAGGGACTTCTGTAGCAAATGCAGGAAAGAAGAGTCAAAGTGATACTCAGGGAATGGATATTGTTAGGGAGCACTCCAACAGT
CAACAGTGGAAAGTCTTTACAGAACCACCCCTTTTTTCTCTCTCTCTCTCACACACACTGACACACATAGTGGAGGAGGTTGGGAAAGATGCTGCTGCAGCTGTTGTCAC
CCTCTTTAGTACGATCATTACCGCCAATGGTTTGAATTTCGCATGTAAACCGGCGAATTCGCGGACGGCAGTGAAGGTCCAGGCCATGGCCAAGGAAGGAAGCGAGAGCG
AAGGGGGCATTGCCGAGACGGCGGCTATAGCCGGCGGGCTAGTCGCGACCCCAGTGATTGGTTGGTCGCTGCGGGTCGCTGGGCGCATTGGAGGGCGTCAGCTACTTGGC
AGTGGTGGGCATCGTCGCATGGTCACTGTACACGAAGACCAAAACTGGGTCGGGTCTGCCCAACGGCCCTTTCGGGTTTTGGGCGCCGTTGAAGGTCTGTCCTACTTGAC
TCTGCTGGCCATCTTCGTGGTTTTCGGATTGCAATACTTTGAGCAGGGCTACATCCCCGGCCCTCTCCCGGCCGATCAGTGCTTCGAGCGCGCTACCATGGGAGATTACA
ATGATGCTTTCATGCGGAACCAAAACGCGGCGGTTCAGGCTCGTACCAAGGCCCAGAATCGTGCCAATGTCCTTCAACTCAAACTGGTTGGTGGAATGGTTTCGATTATT
GTTATCATTTCTGTAGAAGCCTTCGATTTTGTTTGTTTGAAGTCAGCTAGGATTGGGCAGAGTCATCCCACTGGTCTTACGGCCAATCTTCTGAAGCTCTTTGAGCCCCG
ACCGCCTTTGGACTATAAACCTCCCCCGGAGAAAAGAAAATGTCCGCCATTAACAGGAATGGCCCAATTTGTGAGTAATTTTGCAGAACCTGGTGATCCTGAATATGCTC
CACCTGTTCAAAAGGGTGAAACTCCTGCACAAAGGAGGGCTAGAATTCATTTGCTAAGAATTGAAAAGGGTGCAGAGAAAGCTGCTGAGGAGCTGCCAAAATATCTTGAA
CGTGATGATCCAAATGTTACTGGAGATCCATACAAAACACTCTTTGTGGCTAGACTGAATTATGAGACATCTGAGAGCAGAATCAAAAGAGAGTTTGAGTCTTATGGGCC
AATCAAGCGGGTCCGATTGATTACAGACAAAGCGACAGGTAAACCTAAAGGCTATGCTTTCATTGAGTACATGCACAAAAGAGATATGAAAGCTGCATATAAGCAAGCTG
ATGGTAGGAAGATTGATGGTAGAAGGGTACTTGTTGATGTCGAGCGAGGAAGAACAGTACCGAATTGGCGTCCTCGCCGGCTGGGTGGTGGTCTTGGAACTACCAGAGTG
GGAGGGGAAGAAAATAACCAGAGGTACCCTGGAAGGGAGCAACAGCAGCCTGGAGCACCATCTCGATCTGAGGAACCTAGGGCACGTGAAGATCGCCATGGAGAGCGAGA
TAGGGAAAAGTCTTATGAGAAGGGAAGAGAAAGAGAGAGAGAGAGAGAGAAGTCTCGTGAACGTTCTCGTGACAGATCTAAGGATCGTGACCTCAAAGAAGACAGGCATC
ACAGAGATCGTGATAGGACTAGGGATAGAGATAGAGATAGAGATAGAGATAGAGAAGAGAGAGAGACCGTGGCTATGACCGTGATAAAACACGTGATCGTGATCGAGGAA
GGGATCGAGGTCGTGATTATGAGCGTGATCGAGATCGCGATCGTCATCGGGACAGAGATAGGGAGAGGGATCGGGATCATGAAGGGGGATATCCAGACCCTGATCGTGGC
CATTCCCATGATAAGGAATCTGATTATGATCGAGTTGAATCAAAATATGAAAAGGATAGACATGTCTGAGCATGGGCACAGACACCCAGATCCTGACCACGATCCCCAAT
ACTACGATCACTTTGAACATAACCGAAGTCGAGGGCAATATGATCACTCAGAAGGCCATCGTGACCAAGACCGTTATGATCAGTATGATAGAATGGAGGATGATTATCAT
TATGAGCGTGGCACTTCTGAATCACATGATAGAGAGAGGACTCGTGATATGAATCATGAACATAGGCGCACTGAGAGATCTCATTCGAGAGAGTATTAG
Protein sequence
MKCPYCSAAQGRCATSSSGKSITECISCGRVVEERQFQPHHLFHLRAQDNPLCLVTSDLPTPPVHHQHDQVLDPFEPTGFITAFSTWSLEHNPLFFRSCFSFSGHLAELE
RTLESTSSSNLPSSSTVVVDNLRAYMQIIDVASLLGLDYYISEHAFKLFRDCCSATCLRNRSVEALATAALVQAIREAQEPRTLQEISIAANVPQKEIGKYIKILGEALQ
LSQPINSNSISVHMPRFCTLLQLNKSAQELATHIGEVVINKCFCTRRNPISISAAAIYLACQLEDKRKTQAEICKVTGLTEVTLRKVYKELLENWDDLLPSNYTPAVPPE
RAFPTTVIASGRSSAPKVDAFEGASLEKDKQIETKPNIIPTEISDMGHPSRVKEDSESKFVSRGMHNTVINKSSTFCQPQPPKGTSVANAGKKSQSDTQGMDIVREHSNS
QQWKVFTEPPLFSLSLSHTLTHIVEEVGKDAAAAVVTLFSTIITANGLNFACKPANSRTAVKVQAMAKEGSESEGGIAETAAIAGGLVATPVIGWSLRVAGRIGGRQLLG
SGGHRRMVTVHEDQNWVGSAQRPFRVLGAVEGLSYLTLLAIFVVFGLQYFEQGYIPGPLPADQCFERATMGDYNDAFMRNQNAAVQARTKAQNRANVLQLKLVGGMVSII
VIISVEAFDFVCLKSARIGQSHPTGLTANLLKLFEPRPPLDYKPPPEKRKCPPLTGMAQFVSNFAEPGDPEYAPPVQKGETPAQRRARIHLLRIEKGAEKAAEELPKYLE
RDDPNVTGDPYKTLFVARLNYETSESRIKREFESYGPIKRVRLITDKATGKPKGYAFIEYMHKRDMKAAYKQADGRKIDGRRVLVDVERGRTVPNWRPRRLGGGLGTTRV
GGEENNQRYPGREQQQPGAPSRSEEPRAREDRHGERDREKSYEKGREREREREKSRERSRDRSKDRDLKEDRHHRDRDRTRDRDRDRDRDREERETVAMTVIKHVIVIEE
GIEVVIMSVIEIAIVIGTEIGRGIGIMKGDIQTLIVAIPMIRNLIMIELNQNMKRIDMSEHGHRHPDPDHDPQYYDHFEHNRSRGQYDHSEGHRDQDRYDQYDRMEDDYH
YERGTSESHDRERTRDMNHEHRRTERSHSREY