; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0001667 (gene) of Chayote v1 genome

Gene IDSed0001667
OrganismSechium edule (Chayote v1)
DescriptionTHO complex subunit 4D-like
Genome locationLG06:38306125..38315380
RNA-Seq ExpressionSed0001667
SyntenySed0001667
Gene Ontology termsGO:0003723 - RNA binding (molecular function)
InterPro domainsIPR000504 - RNA recognition motif domain
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR025715 - Chromatin target of PRMT1 protein, C-terminal
IPR035979 - RNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008457020.1 PREDICTED: THO complex subunit 4D [Cucumis melo]4.0e-12484.98Show/hide
Query:  MATPLDMSLEDVIKKNNREKLR-RGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLFEDSLRASGISGIEVGTK
        M TPLDMSLEDVIKKNNREKLR RGRA   RGRGAGGSF+ GRGV IGSVRRGPL INAR SAYSI KPP RMKNVQ +HDLFEDSLRASGISGI++GTK
Subjt:  MATPLDMSLEDVIKKNNREKLR-RGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLFEDSLRASGISGIEVGTK

Query:  LYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVV
        LYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYD +GRPSG+AEV+YTRRSDAFAALKRYNNVLLDG PMKIEMLGDNAEMPVSARINVTG NGR+RRTVV
Subjt:  LYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVV

Query:  LTSESGR-TRFNAVNPFPGPSYRGGLRNSRGRGRGGWSHGLGSLSRGGRSQGRGRGGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT
        LT ESGR   FN VNPFPGPS+RGGLRN+RGRGRG W+ G+G    GG   GR   GRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT
Subjt:  LTSESGR-TRFNAVNPFPGPSYRGGLRNSRGRGRGGWSHGLGSLSRGGRSQGRGRGGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT

XP_022935328.1 THO complex subunit 4D-like [Cucurbita moschata]2.0e-12384.39Show/hide
Query:  MATPLDMSLEDVIKKNNREKLR-RGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLFEDSLRASGISGIEVGTK
        MATPLDMSLEDVIKKNNREKLR RGRA   RGRGAGGS++ GR V IGSVRRGPL INARPSA+SISKPPRRMKNVQ +HDLFEDSLRASGISGIE+GTK
Subjt:  MATPLDMSLEDVIKKNNREKLR-RGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLFEDSLRASGISGIEVGTK

Query:  LYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVV
        LYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYD +GRPSG+AEV+YTRRSDAFAALKRYNNVLLDG PMKIEMLGDNA+ PVSARINVTGVNGRSRRTVV
Subjt:  LYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVV

Query:  LTSESGRT-RFNAVNPFPGPSYRGGLRN--SRGRGRGGWSHGLGSLSRGGRSQGRGRGGRGR------GRGRGQGRKKPVEKSSDELDKELENYHAEAMQ
        LTSESGRT   +AVNPFPGPS+RGGLR+   RGRGRGGWS GLG    GG  +G G GGRGR      GRGRGQGRKKPVEKSS ELDKELENYHAEAMQ
Subjt:  LTSESGRT-RFNAVNPFPGPSYRGGLRN--SRGRGRGGWSHGLGSLSRGGRSQGRGRGGRGR------GRGRGQGRKKPVEKSSDELDKELENYHAEAMQ

Query:  T
        T
Subjt:  T

XP_022982649.1 THO complex subunit 4D-like [Cucurbita maxima]1.2e-12583.39Show/hide
Query:  MATPLDMSLEDVIKKNNREKLR-RGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLFEDSLRASGISGIEVGTK
        MATPLDMSLEDVIKKNNREKLR RGRA   RGRGAGGS++ GR V IGSVRRGPL INARPSA+SISKPPRRMKNVQ +HDLFEDSLRASGISGIE+GTK
Subjt:  MATPLDMSLEDVIKKNNREKLR-RGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLFEDSLRASGISGIEVGTK

Query:  LYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVV
        LYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYD +GRPSG+AEV+YTRRSDAFAALKRYNNVLLDG PMKIEMLGDNA+ PVSARINVTGVNGRSRRTVV
Subjt:  LYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVV

Query:  LTSESGRT-RFNAVNPFPGPSYRGGLRNSRGRGRGGWSHGL-----------GSLSRGGRSQGRGRG---GRGRGRGRGQGRKKPVEKSSDELDKELENY
        LTSESGRT   NAVNPFPGPS+RGGLR+ RGRGRGGWS GL           G    GGR +GRGRG   G G GRGRGQGRKKPVEKSS ELDKELENY
Subjt:  LTSESGRT-RFNAVNPFPGPSYRGGLRNSRGRGRGGWSHGL-----------GSLSRGGRSQGRGRG---GRGRGRGRGQGRKKPVEKSSDELDKELENY

Query:  HAEAMQT
        HAEAMQT
Subjt:  HAEAMQT

XP_023527122.1 THO complex subunit 4D [Cucurbita pepo subsp. pepo]5.2e-12484.72Show/hide
Query:  MATPLDMSLEDVIKKNNREKLR-RGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLFEDSLRASGISGIEVGTK
        MATPLDMSLEDVIKKNNREKLR RGRA   RGRGAGGS++ GR V IGSVRRGPL INARPSA+SISKPPRRMKNVQ +HDLFEDSLRASGISGIE+GTK
Subjt:  MATPLDMSLEDVIKKNNREKLR-RGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLFEDSLRASGISGIEVGTK

Query:  LYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVV
        LYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYD +GRPSG+AEV+YTRRSDAFAALKRYNNVLLDG PMKIEMLGDNA+ PVSARINVTGVNGRSRRTVV
Subjt:  LYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVV

Query:  LTSESGRT-RFNAVNPFPGPSYRGGLRN--SRGRGRGGWSHGLGSLSRGGRSQGRGRGGRGR------GRGRGQGRKKPVEKSSDELDKELENYHAEAMQ
        LTSESGRT   NAVNPFPGPS+RGGLR+   RGRGRGGWS GLG    GG  +G G GGRGR      GRGRGQGRKKPVEKSS ELDKELENYHAEAMQ
Subjt:  LTSESGRT-RFNAVNPFPGPSYRGGLRN--SRGRGRGGWSHGLGSLSRGGRSQGRGRGGRGR------GRGRGQGRKKPVEKSSDELDKELENYHAEAMQ

Query:  T
        T
Subjt:  T

XP_038896761.1 THO complex subunit 4D-like [Benincasa hispida]4.7e-12586.01Show/hide
Query:  MATPLDMSLEDVIKKNNREKLR-RGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLFEDSLRASGISGIEVGTK
        MATPLDMSLEDVIKK+NREKLR RGRA   RGRGAGGSF+ GRGV +GSVRRGPL INAR SAYSI KPPRRMKNVQ +HDLFEDSLRASGISGIE+GTK
Subjt:  MATPLDMSLEDVIKKNNREKLR-RGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLFEDSLRASGISGIEVGTK

Query:  LYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVV
        LYVSNLDYGV+KEDIRELFSEIGDLKRFAIHYD +GRPSG+AEV+YTRRSDAFAALKRYNNVLLDG PMKIEMLGDNAEMPVSARINVTGVNGRSRRTVV
Subjt:  LYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVV

Query:  LTSESGRT-RFNAVNPFPGPSYRGGLRNSRGRGRGGWSHGLGSLSRGGRSQGRGRGGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT
        LT ESGRT   N VNPFPGPS+RGGLRN RGRGRGGW+ G G    GG   GR   GRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT
Subjt:  LTSESGRT-RFNAVNPFPGPSYRGGLRNSRGRGRGGWSHGLGSLSRGGRSQGRGRGGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT

TrEMBL top hitse value%identityAlignment
A0A1S3C452 THO complex subunit 4D1.9e-12484.98Show/hide
Query:  MATPLDMSLEDVIKKNNREKLR-RGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLFEDSLRASGISGIEVGTK
        M TPLDMSLEDVIKKNNREKLR RGRA   RGRGAGGSF+ GRGV IGSVRRGPL INAR SAYSI KPP RMKNVQ +HDLFEDSLRASGISGI++GTK
Subjt:  MATPLDMSLEDVIKKNNREKLR-RGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLFEDSLRASGISGIEVGTK

Query:  LYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVV
        LYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYD +GRPSG+AEV+YTRRSDAFAALKRYNNVLLDG PMKIEMLGDNAEMPVSARINVTG NGR+RRTVV
Subjt:  LYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVV

Query:  LTSESGR-TRFNAVNPFPGPSYRGGLRNSRGRGRGGWSHGLGSLSRGGRSQGRGRGGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT
        LT ESGR   FN VNPFPGPS+RGGLRN+RGRGRG W+ G+G    GG   GR   GRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT
Subjt:  LTSESGR-TRFNAVNPFPGPSYRGGLRNSRGRGRGGWSHGLGSLSRGGRSQGRGRGGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT

A0A2C9WC96 RRM domain-containing protein1.8e-9064.04Show/hide
Query:  MATPLDMSLEDVIKKNNREKLRRGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLFEDSLRASGISGIEVGTKL
        MAT +DMSL+D+IKK NRE+  RGR    RGRG GG F+ GR V  G+VR+GPLS+N+RP+ Y+I+KPPRR++++  +HDL EDS+RA+GI+G+E+GTKL
Subjt:  MATPLDMSLEDVIKKNNREKLRRGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLFEDSLRASGISGIEVGTKL

Query:  YVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVVL
        YVSNLDYGV+ EDIRELFSEIGDLKR+A+HYD +GRPSG+AEV+YTRRSDAFAALK+YNNVLLDG PMKIE++G +AEMP SAR+NVTGV+GR +RTVV+
Subjt:  YVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVVL

Query:  TSESGRTRFNA-VNPFPGPSYRGGLRNSRGRGRGGWSHGLGSLSRGGRSQGRGRGGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT
        T   GR R  A  N   G + RGGLRN RGRG+G                     G+GRGRGRG+G+K+P  KS+D+LDKELENYHAEAMQT
Subjt:  TSESGRTRFNA-VNPFPGPSYRGGLRNSRGRGRGGWSHGLGSLSRGGRSQGRGRGGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT

A0A6J1CP51 THO complex subunit 4D-like9.2e-11983.62Show/hide
Query:  MATPLDMSLEDVIKKNNREKLR-RGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLFEDSLRASGISGIEVGTK
        MATPLDMSLED+IKKNNREKLR RGRA   RGRGAGGSF+ GR V IGS+RRGPLSIN RPSA+SISKPPRRMKNVQ +HDLFEDSLRASGISGIE+GTK
Subjt:  MATPLDMSLEDVIKKNNREKLR-RGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLFEDSLRASGISGIEVGTK

Query:  LYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVV
        LYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYD +GRPSG+AEV+YTRRSDAFAALKRYNNVLLDG PMKIE+LGDNAEMPVSARINVTG+NGRSRRTVV
Subjt:  LYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVV

Query:  LTSESGRT-RFNAVNPFPGPSYRGGLRNSRGRGRGGWSHGLGSLSRGGRSQGRGRGGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT
        LTSESGRT     VN FPGPS RG LR  RGRGRGGWS G G +  GGR       GRGRGRGRG GRKK VEKSSDELDK+LENYHAEAMQT
Subjt:  LTSESGRT-RFNAVNPFPGPSYRGGLRNSRGRGRGGWSHGLGSLSRGGRSQGRGRGGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT

A0A6J1FAB9 THO complex subunit 4D-like9.5e-12484.39Show/hide
Query:  MATPLDMSLEDVIKKNNREKLR-RGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLFEDSLRASGISGIEVGTK
        MATPLDMSLEDVIKKNNREKLR RGRA   RGRGAGGS++ GR V IGSVRRGPL INARPSA+SISKPPRRMKNVQ +HDLFEDSLRASGISGIE+GTK
Subjt:  MATPLDMSLEDVIKKNNREKLR-RGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLFEDSLRASGISGIEVGTK

Query:  LYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVV
        LYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYD +GRPSG+AEV+YTRRSDAFAALKRYNNVLLDG PMKIEMLGDNA+ PVSARINVTGVNGRSRRTVV
Subjt:  LYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVV

Query:  LTSESGRT-RFNAVNPFPGPSYRGGLRN--SRGRGRGGWSHGLGSLSRGGRSQGRGRGGRGR------GRGRGQGRKKPVEKSSDELDKELENYHAEAMQ
        LTSESGRT   +AVNPFPGPS+RGGLR+   RGRGRGGWS GLG    GG  +G G GGRGR      GRGRGQGRKKPVEKSS ELDKELENYHAEAMQ
Subjt:  LTSESGRT-RFNAVNPFPGPSYRGGLRN--SRGRGRGGWSHGLGSLSRGGRSQGRGRGGRGR------GRGRGQGRKKPVEKSSDELDKELENYHAEAMQ

Query:  T
        T
Subjt:  T

A0A6J1J564 THO complex subunit 4D-like6.0e-12683.39Show/hide
Query:  MATPLDMSLEDVIKKNNREKLR-RGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLFEDSLRASGISGIEVGTK
        MATPLDMSLEDVIKKNNREKLR RGRA   RGRGAGGS++ GR V IGSVRRGPL INARPSA+SISKPPRRMKNVQ +HDLFEDSLRASGISGIE+GTK
Subjt:  MATPLDMSLEDVIKKNNREKLR-RGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLFEDSLRASGISGIEVGTK

Query:  LYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVV
        LYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYD +GRPSG+AEV+YTRRSDAFAALKRYNNVLLDG PMKIEMLGDNA+ PVSARINVTGVNGRSRRTVV
Subjt:  LYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVV

Query:  LTSESGRT-RFNAVNPFPGPSYRGGLRNSRGRGRGGWSHGL-----------GSLSRGGRSQGRGRG---GRGRGRGRGQGRKKPVEKSSDELDKELENY
        LTSESGRT   NAVNPFPGPS+RGGLR+ RGRGRGGWS GL           G    GGR +GRGRG   G G GRGRGQGRKKPVEKSS ELDKELENY
Subjt:  LTSESGRT-RFNAVNPFPGPSYRGGLRNSRGRGRGGWSHGL-----------GSLSRGGRSQGRGRG---GRGRGRGRGQGRKKPVEKSSDELDKELENY

Query:  HAEAMQT
        HAEAMQT
Subjt:  HAEAMQT

SwissProt top hitse value%identityAlignment
B5FXN8 THO complex subunit 41.4e-3138.57Show/hide
Query:  MATPLDMSLEDVIKKNNREK--LRRGRANRGRGRGAGGSFSDGRGVGIGSVRRGPL---SINARPSAYSISKPPRRMKNV--QLRHDLFEDSLRASGISG
        MA  +DMSL+D+IK N  ++   R GR  RGRG  A G      GVG G    GP+    + AR    +   P  R K +  + +HDLF+    A   +G
Subjt:  MATPLDMSLEDVIKKNNREK--LRRGRANRGRGRGAGGSFSDGRGVGIGSVRRGPL---SINARPSAYSISKPPRRMKNV--QLRHDLFEDSLRASGISG

Query:  IEVGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAEMPVSARINVTGVNGR
        +E G KL VSNLD+GV+  DI+ELF+E G LK+ A+HYD SGR  GTA+V + R++DA  A+K+YN V LDG PM I++        V+++I        
Subjt:  IEVGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAEMPVSARINVTGVNGR

Query:  SRRTVVLTSESGRTRFNAVNPFPGPSYRGGLRNSRGRGRGGWSHGLGSLSRGGRSQGRGRGGRGRGRGRGQGRKKPVEKSSDELDKELENYHA
                 ++ R    +VN       RGG+  +RG   GG+         GG    RG  G  RGRGRG GR    + S++ELD +L+ Y+A
Subjt:  SRRTVVLTSESGRTRFNAVNPFPGPSYRGGLRNSRGRGRGGWSHGLGSLSRGGRSQGRGRGGRGRGRGRGQGRKKPVEKSSDELDKELENYHA

Q6NQ72 THO complex subunit 4D5.7e-7354.64Show/hide
Query:  MATPLDMSLEDVIKKNNREKLRRGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLFEDSLRASGISGIEVGTKL
        M+  L+M+L++++K+    +      +RGRGRG GG      G G G  RRGPL++NARPS+++I+KP RR++++  +  LFED LRA+G SG+EVGT+L
Subjt:  MATPLDMSLEDVIKKNNREKLRRGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLFEDSLRASGISGIEVGTKL

Query:  YVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDN--AEMPVSAR--INVTGVNGRSRR
        +V+NLD GVT EDIRELFSEIG+++R+AIHYD +GRPSGTAEV+Y RRSDAF ALK+YNNVLLDG PM++E+LG N  +E P+S R  +NVTG+NGR +R
Subjt:  YVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDN--AEMPVSAR--INVTGVNGRSRR

Query:  TVVLTSESG-----RTRFNAVNPFPGPSYRGGLRNSRGRG-RGGWSHGLGSLSRGG-RSQGRGRGGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAM
        TVV+    G     R       P P  S R  + N +G G RGG         RGG R++GRG GGRGRG GRG G KKPVEKS+ +LDK+LE+YHA+AM
Subjt:  TVVLTSESG-----RTRFNAVNPFPGPSYRGGLRNSRGRG-RGGWSHGLGSLSRGG-RSQGRGRGGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAM

Query:  QT
         T
Subjt:  QT

Q8L719 THO complex subunit 4B2.5e-4443.46Show/hide
Query:  MATPLDMSLEDVIKKNNREKLRRGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLF--EDSLRAS---------
        M+  LDMSL+D+IK N +    RGR   G G   GG    G G   G  RR    + AR + YS     ++  +   ++D+F  + S+ A+         
Subjt:  MATPLDMSLEDVIKKNNREKLRRGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLF--EDSLRAS---------

Query:  -GISGIEVGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAEMPVSARINVT
         G S IE GTKLY+SNLDYGV+ EDI+ELFSE+GDLKR+ IHYD SGR  GTAEV+++RR DA AA+KRYNNV LDG  MKIE++G N   P    +   
Subjt:  -GISGIEVGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAEMPVSARINVT

Query:  GVNGRSRRTVVLTSESGRTRFNAVNPFPGPSYRGGLRNSRGRGRGGWSHGLGSLSRGGRSQGRGRGGRG-RGR-GRGQ-GRKKPVEKSSDELDKELENYH
         +   +   +   +E+    FN        ++ G   N RGRGRGG+   +G    GG   G  RGGRG RGR GRG  GR +    S+++LD EL+ YH
Subjt:  GVNGRSRRTVVLTSESGRTRFNAVNPFPGPSYRGGLRNSRGRGRGGWSHGLGSLSRGGRSQGRGRGGRG-RGR-GRGQ-GRKKPVEKSSDELDKELENYH

Query:  AEAMQT
         EAM+T
Subjt:  AEAMQT

Q8L773 THO complex subunit 4A2.3e-4242.66Show/hide
Query:  MATPLDMSLEDVIKKNNREKLRRGRANRGRGRGAGGSFSDGRGVGIGSVRR-GPLSINARPSAYSISKPPRRMKNVQLRHDLFEDSLRASGISGIEVGTK
        M+T LDMSL+D+I KN        R +RG   GAG +   G G G G  RR  P   + R + Y  +K P       +  D  ED       +GIE GTK
Subjt:  MATPLDMSLEDVIKKNNREKLRRGRANRGRGRGAGGSFSDGRGVGIGSVRR-GPLSINARPSAYSISKPPRRMKNVQLRHDLFEDSLRASGISGIEVGTK

Query:  LYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVV
        LY+SNLDYGV  EDI+ELF+E+G+LKR+ +H+D SGR  GTAEV+Y+RR DA AA+K+YN+V LDG PMKIE++G N  +  +A  +    NG S     
Subjt:  LYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVV

Query:  LTSESGRTRFNAVNPFPGPSYRGGLRNSRGRGRGGWSHGLGSLSRGGRSQGRGRGGRGRGRGRGQGRKKPVEK-SSDELDKELENYHAEAMQT
                         G  +RG      G+GRGG         RGG   G GRGG GRGR  G+G   P EK S+++LD +L+ YH+  M+T
Subjt:  LTSESGRTRFNAVNPFPGPSYRGGLRNSRGRGRGGWSHGLGSLSRGGRSQGRGRGGRGRGRGRGQGRKKPVEK-SSDELDKELENYHAEAMQT

Q94EH8 THO complex subunit 4C3.3e-6551.94Show/hide
Query:  MATPLDMSLEDVIKKNNREKLRRGRA-----NRGRGRGAGG-SFSDGRGVGIGSVRRGPLSINARP-SAYSISKPPRRMKNV--QLRHDLFEDSLRASGI
        M+  L+M+L++++KK+  E+    R+     +R  GRG GG +   G G G G VRRGPL++N RP S++SI+K  RR +++  Q ++DL+E++LRA G+
Subjt:  MATPLDMSLEDVIKKNNREKLRRGRA-----NRGRGRGAGG-SFSDGRGVGIGSVRRGPLSINARP-SAYSISKPPRRMKNV--QLRHDLFEDSLRASGI

Query:  SGIEVGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAE-MPVSARINVTGV
        SG+EVGT +Y++NLD GVT EDIREL++EIG+LKR+AIHYD +GRPSG+AEV+Y RRSDA  A+++YNNVLLDG PMK+E+LG N E  PV+AR+NVTG+
Subjt:  SGIEVGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAE-MPVSARINVTGV

Query:  NGRSRRTVVLTSESGRTRFNAVNPFPGPSYRGGLRNSRGRGRG--GWSHGLGSLSRGGRSQGRGRGGRGRGRGRGQGR---------KKPVEKSSDELDK
        NGR +R+V                F G   RGG R  RGRG G  G    L    +GG + GRG G RGRGRG G GR         KKPVEKS+ +LDK
Subjt:  NGRSRRTVVLTSESGRTRFNAVNPFPGPSYRGGLRNSRGRGRG--GWSHGLGSLSRGGRSQGRGRGGRGRGRGRGQGR---------KKPVEKSSDELDK

Query:  ELENYHAEAM
        +LE+YHAEAM
Subjt:  ELENYHAEAM

Arabidopsis top hitse value%identityAlignment
AT1G66260.1 RNA-binding (RRM/RBD/RNP motifs) family protein2.4e-6651.94Show/hide
Query:  MATPLDMSLEDVIKKNNREKLRRGRA-----NRGRGRGAGG-SFSDGRGVGIGSVRRGPLSINARP-SAYSISKPPRRMKNV--QLRHDLFEDSLRASGI
        M+  L+M+L++++KK+  E+    R+     +R  GRG GG +   G G G G VRRGPL++N RP S++SI+K  RR +++  Q ++DL+E++LRA G+
Subjt:  MATPLDMSLEDVIKKNNREKLRRGRA-----NRGRGRGAGG-SFSDGRGVGIGSVRRGPLSINARP-SAYSISKPPRRMKNV--QLRHDLFEDSLRASGI

Query:  SGIEVGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAE-MPVSARINVTGV
        SG+EVGT +Y++NLD GVT EDIREL++EIG+LKR+AIHYD +GRPSG+AEV+Y RRSDA  A+++YNNVLLDG PMK+E+LG N E  PV+AR+NVTG+
Subjt:  SGIEVGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAE-MPVSARINVTGV

Query:  NGRSRRTVVLTSESGRTRFNAVNPFPGPSYRGGLRNSRGRGRG--GWSHGLGSLSRGGRSQGRGRGGRGRGRGRGQGR---------KKPVEKSSDELDK
        NGR +R+V                F G   RGG R  RGRG G  G    L    +GG + GRG G RGRGRG G GR         KKPVEKS+ +LDK
Subjt:  NGRSRRTVVLTSESGRTRFNAVNPFPGPSYRGGLRNSRGRGRG--GWSHGLGSLSRGGRSQGRGRGGRGRGRGRGQGR---------KKPVEKSSDELDK

Query:  ELENYHAEAM
        +LE+YHAEAM
Subjt:  ELENYHAEAM

AT1G66260.2 RNA-binding (RRM/RBD/RNP motifs) family protein2.4e-6651.94Show/hide
Query:  MATPLDMSLEDVIKKNNREKLRRGRA-----NRGRGRGAGG-SFSDGRGVGIGSVRRGPLSINARP-SAYSISKPPRRMKNV--QLRHDLFEDSLRASGI
        M+  L+M+L++++KK+  E+    R+     +R  GRG GG +   G G G G VRRGPL++N RP S++SI+K  RR +++  Q ++DL+E++LRA G+
Subjt:  MATPLDMSLEDVIKKNNREKLRRGRA-----NRGRGRGAGG-SFSDGRGVGIGSVRRGPLSINARP-SAYSISKPPRRMKNV--QLRHDLFEDSLRASGI

Query:  SGIEVGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAE-MPVSARINVTGV
        SG+EVGT +Y++NLD GVT EDIREL++EIG+LKR+AIHYD +GRPSG+AEV+Y RRSDA  A+++YNNVLLDG PMK+E+LG N E  PV+AR+NVTG+
Subjt:  SGIEVGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAE-MPVSARINVTGV

Query:  NGRSRRTVVLTSESGRTRFNAVNPFPGPSYRGGLRNSRGRGRG--GWSHGLGSLSRGGRSQGRGRGGRGRGRGRGQGR---------KKPVEKSSDELDK
        NGR +R+V                F G   RGG R  RGRG G  G    L    +GG + GRG G RGRGRG G GR         KKPVEKS+ +LDK
Subjt:  NGRSRRTVVLTSESGRTRFNAVNPFPGPSYRGGLRNSRGRGRG--GWSHGLGSLSRGGRSQGRGRGGRGRGRGRGQGR---------KKPVEKSSDELDK

Query:  ELENYHAEAM
        +LE+YHAEAM
Subjt:  ELENYHAEAM

AT5G02530.1 RNA-binding (RRM/RBD/RNP motifs) family protein1.8e-4543.46Show/hide
Query:  MATPLDMSLEDVIKKNNREKLRRGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLF--EDSLRAS---------
        M+  LDMSL+D+IK N +    RGR   G G   GG    G G   G  RR    + AR + YS     ++  +   ++D+F  + S+ A+         
Subjt:  MATPLDMSLEDVIKKNNREKLRRGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLF--EDSLRAS---------

Query:  -GISGIEVGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAEMPVSARINVT
         G S IE GTKLY+SNLDYGV+ EDI+ELFSE+GDLKR+ IHYD SGR  GTAEV+++RR DA AA+KRYNNV LDG  MKIE++G N   P    +   
Subjt:  -GISGIEVGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAEMPVSARINVT

Query:  GVNGRSRRTVVLTSESGRTRFNAVNPFPGPSYRGGLRNSRGRGRGGWSHGLGSLSRGGRSQGRGRGGRG-RGR-GRGQ-GRKKPVEKSSDELDKELENYH
         +   +   +   +E+    FN        ++ G   N RGRGRGG+   +G    GG   G  RGGRG RGR GRG  GR +    S+++LD EL+ YH
Subjt:  GVNGRSRRTVVLTSESGRTRFNAVNPFPGPSYRGGLRNSRGRGRGGWSHGLGSLSRGGRSQGRGRGGRG-RGR-GRGQ-GRKKPVEKSSDELDKELENYH

Query:  AEAMQT
         EAM+T
Subjt:  AEAMQT

AT5G37720.1 ALWAYS EARLY 44.0e-7454.64Show/hide
Query:  MATPLDMSLEDVIKKNNREKLRRGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLFEDSLRASGISGIEVGTKL
        M+  L+M+L++++K+    +      +RGRGRG GG      G G G  RRGPL++NARPS+++I+KP RR++++  +  LFED LRA+G SG+EVGT+L
Subjt:  MATPLDMSLEDVIKKNNREKLRRGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLFEDSLRASGISGIEVGTKL

Query:  YVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDN--AEMPVSAR--INVTGVNGRSRR
        +V+NLD GVT EDIRELFSEIG+++R+AIHYD +GRPSGTAEV+Y RRSDAF ALK+YNNVLLDG PM++E+LG N  +E P+S R  +NVTG+NGR +R
Subjt:  YVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDN--AEMPVSAR--INVTGVNGRSRR

Query:  TVVLTSESG-----RTRFNAVNPFPGPSYRGGLRNSRGRG-RGGWSHGLGSLSRGG-RSQGRGRGGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAM
        TVV+    G     R       P P  S R  + N +G G RGG         RGG R++GRG GGRGRG GRG G KKPVEKS+ +LDK+LE+YHA+AM
Subjt:  TVVLTSESG-----RTRFNAVNPFPGPSYRGGLRNSRGRG-RGGWSHGLGSLSRGG-RSQGRGRGGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAM

Query:  QT
         T
Subjt:  QT

AT5G37720.2 ALWAYS EARLY 43.6e-7555.56Show/hide
Query:  MATPLDMSLEDVIKKNNREKLRRGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLFEDSLRASGISGIEVGTKL
        M+  L+M+L++++K+    +      +RGRGRG GG      G G G  RRGPL++NARPS+++I+KP RR++++  +  LFED LRA+G SG+EVGT+L
Subjt:  MATPLDMSLEDVIKKNNREKLRRGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLFEDSLRASGISGIEVGTKL

Query:  YVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDN--AEMPVSAR--INVTGVNGRSRR
        +V+NLD GVT EDIRELFSEIG+++R+AIHYD +GRPSGTAEV+Y RRSDAF ALK+YNNVLLDG PM++E+LG N  +E P+S R  +NVTG+NGR +R
Subjt:  YVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDN--AEMPVSAR--INVTGVNGRSRR

Query:  TVVLTSESGRTRFNAVNPFPGPSYRGGLRNSRGRG-RGGWSHGLGSLSRGG-RSQGRGRGGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT
        TVV+    GR       P P  S R  + N +G G RGG         RGG R++GRG GGRGRG GRG G KKPVEKS+ +LDK+LE+YHA+AM T
Subjt:  TVVLTSESGRTRFNAVNPFPGPSYRGGLRNSRGRG-RGGWSHGLGSLSRGG-RSQGRGRGGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCACTCCTTTGGATATGTCATTAGAGGATGTGATAAAGAAGAATAATCGTGAGAAGCTAAGACGAGGTAGGGCCAATCGTGGTCGGGGGCGTGGAGCAGGTGGGTC
TTTTAGTGATGGAAGAGGAGTAGGGATAGGATCAGTCCGTAGAGGTCCTCTCAGCATAAATGCACGGCCATCTGCTTACTCAATTAGCAAGCCTCCACGCAGAATGAAGA
ATGTTCAATTGCGGCATGATTTATTTGAAGATAGTCTTCGAGCTTCTGGGATTTCTGGAATTGAAGTTGGCACAAAGTTGTACGTTTCCAACTTGGATTATGGAGTGACC
AAAGAAGATATAAGGGAGTTGTTCTCTGAGATTGGAGACCTGAAAAGATTTGCAATTCATTATGACAATAGTGGCCGTCCAAGTGGCACGGCAGAGGTGATATATACACG
TAGAAGTGATGCATTTGCTGCTCTAAAGCGCTATAACAATGTGCTACTGGATGGGAATCCAATGAAGATTGAAATGCTTGGAGATAATGCTGAAATGCCAGTTTCTGCAC
GTATAAATGTTACTGGAGTGAATGGAAGAAGCAGGAGGACTGTTGTTCTCACGTCTGAATCTGGACGTACTCGCTTCAATGCGGTCAACCCTTTTCCTGGTCCAAGCTAT
CGTGGAGGGCTGAGGAACAGCCGTGGCCGCGGGCGAGGAGGCTGGAGCCATGGTTTAGGAAGTCTAAGTCGTGGAGGCCGTAGCCAAGGGCGAGGACGAGGTGGCCGTGG
TCGTGGTCGTGGTCGTGGCCAGGGAAGAAAGAAGCCTGTGGAGAAGTCCTCGGATGAACTTGACAAGGAGCTTGAAAACTATCATGCAGAAGCTATGCAAACCTGA
mRNA sequenceShow/hide mRNA sequence
TTTTACAAAAACAAAACGCAATTGTAAAACCAAGGAGAAAGAAGAGATCAAAATTGTGGATGAAAAAAAACATCGTCGCTTAGTTCCGAAGAAAAGTCTCTCTGTCAGGC
CGCCGCCGCTGTAGCCGCCGCCTCTCGCTGGAAGAATCTCTCTGAAAATCCCACTCGCCCATATCGCTTTTCATCTTTATCGTCTTTGCTATTATCTCCTTCACCTTGTT
CAAATCTTCAGGGCTTATTTTCTATAATTCTTTGGAATCTGTTTTCTCTCATCAAACGATTGAAGCCAATGGCCACTCCTTTGGATATGTCATTAGAGGATGTGATAAAG
AAGAATAATCGTGAGAAGCTAAGACGAGGTAGGGCCAATCGTGGTCGGGGGCGTGGAGCAGGTGGGTCTTTTAGTGATGGAAGAGGAGTAGGGATAGGATCAGTCCGTAG
AGGTCCTCTCAGCATAAATGCACGGCCATCTGCTTACTCAATTAGCAAGCCTCCACGCAGAATGAAGAATGTTCAATTGCGGCATGATTTATTTGAAGATAGTCTTCGAG
CTTCTGGGATTTCTGGAATTGAAGTTGGCACAAAGTTGTACGTTTCCAACTTGGATTATGGAGTGACCAAAGAAGATATAAGGGAGTTGTTCTCTGAGATTGGAGACCTG
AAAAGATTTGCAATTCATTATGACAATAGTGGCCGTCCAAGTGGCACGGCAGAGGTGATATATACACGTAGAAGTGATGCATTTGCTGCTCTAAAGCGCTATAACAATGT
GCTACTGGATGGGAATCCAATGAAGATTGAAATGCTTGGAGATAATGCTGAAATGCCAGTTTCTGCACGTATAAATGTTACTGGAGTGAATGGAAGAAGCAGGAGGACTG
TTGTTCTCACGTCTGAATCTGGACGTACTCGCTTCAATGCGGTCAACCCTTTTCCTGGTCCAAGCTATCGTGGAGGGCTGAGGAACAGCCGTGGCCGCGGGCGAGGAGGC
TGGAGCCATGGTTTAGGAAGTCTAAGTCGTGGAGGCCGTAGCCAAGGGCGAGGACGAGGTGGCCGTGGTCGTGGTCGTGGTCGTGGCCAGGGAAGAAAGAAGCCTGTGGA
GAAGTCCTCGGATGAACTTGACAAGGAGCTTGAAAACTATCATGCAGAAGCTATGCAAACCTGATTAGGAACCCTGGAATATGCAAAAACCAATTGTATGGTTTTGATTA
AGCATTTCATTCTGGGGCTGATTATTTTGTTAACGTTAGTAGTTTCCATGTTCTTTTTCCTCTTCAGTAGATAAGATATAGATTGTATTTACCTTTCAATGGAGATTTTA
TGCATCTTTTTTTGGTCAAACCCACTCTAGGTGATCTATCAATCTCTCAAGGTAATTTTCTTTATGTATCAGGATTTCAAAGTTTAGTTGATTAATTATACAATTTGAGA
ATGGGAGTATTTTGTATTTATGGGACCTATTCTTCTAAACCTTGTGTCGC
Protein sequenceShow/hide protein sequence
MATPLDMSLEDVIKKNNREKLRRGRANRGRGRGAGGSFSDGRGVGIGSVRRGPLSINARPSAYSISKPPRRMKNVQLRHDLFEDSLRASGISGIEVGTKLYVSNLDYGVT
KEDIRELFSEIGDLKRFAIHYDNSGRPSGTAEVIYTRRSDAFAALKRYNNVLLDGNPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVVLTSESGRTRFNAVNPFPGPSY
RGGLRNSRGRGRGGWSHGLGSLSRGGRSQGRGRGGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT