CuGenDBv2

CSPI03G04360 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI03G04360
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: THO complex subunit 3
Genome location: Chr3:3558117..3561373
RNA-Seq Expression: CSPI03G04360
Synteny: CSPI03G04360
Gene Ontology terms:
    GO:0006406 - mRNA export from nucleus (biological process)
    GO:0010267 - production of ta-siRNAs involved in RNA interference (biological process)
    GO:0000445 - THO complex part of transcription export complex (cellular component)
    GO:0005509 - calcium ion binding (molecular function)
    GO:0005515 - protein binding (molecular function)
InterPro domains:
    IPR001680 - WD40 repeat
    IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
    IPR019775 - WD40 repeat, conserved site
    IPR020472 - G-protein beta WD-40 repeat
    IPR024977 - Anaphase-promoting complex subunit 4, WD40 domain
    IPR036322 - WD40-repeat-containing domain superfamily
    IPR040132 - TREX component Tex1/THOC3
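The genome location field uses a `chromosome:start..end` notation. A minimal sketch of reading it programmatically follows; `parse_location` is a hypothetical helper, and the span calculation assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re

def parse_location(loc: str) -> tuple:
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

chrom, start, end = parse_location("Chr3:3558117..3561373")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```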


Homology
GenBank top hits | e value | % identity
KAA0065764.1 THO complex subunit 3 [Cucumis melo var. makuwa] | 4.2e-180 | 100
Query:  MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
        MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
Subjt:  MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK

Query:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID
        CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID
Subjt:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID

Query:  PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ
        PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ
Subjt:  PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ

Query:  ADEG
        ADEG
Subjt:  ADEG

XP_004149572.1 THO complex subunit 3 [Cucumis sativus] | 1.4e-186 | 100
Query:  MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
        MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
Subjt:  MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK

Query:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID
        CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID
Subjt:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID

Query:  PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ
        PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ
Subjt:  PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ

Query:  ADEGIFRIFGFESP
        ADEGIFRIFGFESP
Subjt:  ADEGIFRIFGFESP

XP_022155009.1 THO complex subunit 3 isoform X2 [Momordica charantia] | 1.2e-179 | 96.17
Query:  MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
        MEE+A  FKNL SREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDAR+GK
Subjt:  MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK

Query:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID
        CSQQ ELSGENINITYKPDGTH+AVGNRDDELTILDVRKFKP+HKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRP+ETLMAHTAGCYCIAID
Subjt:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID

Query:  PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ
        PVG YFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFN+TG+YIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ
Subjt:  PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ

Query:  ADEGIFRIFGFES
        ADEGIFRIFGFES
Subjt:  ADEGIFRIFGFES

XP_022946738.1 THO complex subunit 3 [Cucurbita moschata] | 9.1e-183 | 98.4
Query:  MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
        MEES  AFKNL+SREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
Subjt:  MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK

Query:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID
        CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID
Subjt:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID

Query:  PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ
        PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDIS+VQ+GRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ
Subjt:  PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ

Query:  ADEGIFRIFGFES
        ADEGIFRIFGFES
Subjt:  ADEGIFRIFGFES

XP_038905145.1 THO complex subunit 3 [Benincasa hispida] | 6.3e-184 | 98.72
Query:  MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
        MEESA  FKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
Subjt:  MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK

Query:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID
        CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKP+HKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID
Subjt:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID

Query:  PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ
        PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ
Subjt:  PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ

Query:  ADEGIFRIFGFES
        ADEGIFRIFGFE+
Subjt:  ADEGIFRIFGFES
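In the alignment blocks above, the unlabeled middle line marks each column: a letter where the residues are identical, `+` for a conservative substitution, and a space otherwise. A rough sketch of counting identities in a single block is shown here, assuming that BLAST-style convention; `identity_stats` is a hypothetical helper, and the % identity values reported on this page are taken to cover the whole alignment rather than one block.

```python
def identity_stats(query: str, midline: str) -> tuple:
    """Return (identical_columns, aligned_columns) for one alignment block."""
    # A column is identical when the middle line repeats the query residue.
    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    return identical, len(query)

# Last block of the Benincasa hispida hit (XP_038905145.1).
identical, aligned = identity_stats("ADEGIFRIFGFES", "ADEGIFRIFGFE+")
print(f"{identical} of {aligned} columns identical")
```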

TrEMBL top hits | e value | % identity
A0A0A0L651 WD_REPEATS_REGION domain-containing protein | 6.6e-187 | 100
Query:  MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
        MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
Subjt:  MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK

Query:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID
        CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID
Subjt:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID

Query:  PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ
        PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ
Subjt:  PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ

Query:  ADEGIFRIFGFESP
        ADEGIFRIFGFESP
Subjt:  ADEGIFRIFGFESP

A0A1S3BM38 THO complex subunit 3 | 6.6e-187 | 100
Query:  MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
        MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
Subjt:  MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK

Query:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID
        CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID
Subjt:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID

Query:  PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ
        PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ
Subjt:  PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ

Query:  ADEGIFRIFGFESP
        ADEGIFRIFGFESP
Subjt:  ADEGIFRIFGFESP

A0A5D3BC11 THO complex subunit 3 | 2.0e-180 | 100
Query:  MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
        MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
Subjt:  MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK

Query:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID
        CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID
Subjt:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID

Query:  PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ
        PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ
Subjt:  PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ

Query:  ADEG
        ADEG
Subjt:  ADEG

A0A6J1G4N9 THO complex subunit 3 | 4.4e-183 | 98.4
Query:  MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
        MEES  AFKNL+SREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
Subjt:  MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK

Query:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID
        CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID
Subjt:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID

Query:  PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ
        PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDIS+VQ+GRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ
Subjt:  PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ

Query:  ADEGIFRIFGFES
        ADEGIFRIFGFES
Subjt:  ADEGIFRIFGFES

A0A6J1KZA7 THO complex subunit 3 | 4.4e-183 | 98.4
Query:  MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
        MEES  AFKNL+SREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
Subjt:  MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK

Query:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID
        CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID
Subjt:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID

Query:  PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ
        PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDIS+VQ+GRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ
Subjt:  PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQ

Query:  ADEGIFRIFGFES
        ADEGIFRIFGFES
Subjt:  ADEGIFRIFGFES

SwissProt top hits | e value | % identity
P14197 WD repeat-containing protein AAC3 | 8.9e-64 | 35.71
Query:  MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHG----------------HGKVKD-VELKGHTDSVDQLCWDPKHSDLIA
        ++ +++ F    ++++ G+KKK  SVAWN  G K+AS   D   RVW+ +P G                +  +K+ +ELKGH  S++++ W PK++DL+A
Subjt:  MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHG----------------HGKVKD-VELKGHTDSVDQLCWDPKHSDLIA

Query:  TASGDKTVRLWDARNGKCSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFN-YEVNEIAWNMTGEMFFLTTGNGTVEVLAY----
        +A  DK +++WD + GKC      + ENI++ + PDG  I    RDD L ++D+   K +   KFN  E+N++ W+  G++  +    G +E   +    
Subjt:  TASGDKTVRLWDARNGKCSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFN-YEVNEIAWNMTGEMFFLTTGNGTVEVLAY----

Query:  -PSLRPIETLMAHTAGCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAM
           ++ ++TL  HTA  YC+  DP G Y A GSADS+VSLWDI  M+CV+TF K  +P R++SF+  G++IA++S +  I+I ++++ + +H I C + +
Subjt:  -PSLRPIETLMAHTAGCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAM

Query:  NSVEWNPKHNLLAYAGDDKNKYQADEGIFRIFGFES
        +S+ W+P   LLAYA  + N+   D  I R+FG+ S
Subjt:  NSVEWNPKHNLLAYAGDDKNKYQADEGIFRIFGFES

Q29RH4 THO complex subunit 3 | 7.7e-92 | 51.16
Query:  SREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGKCSQQAELSGENI
        +RE+  H  KVHSVAW+C G +LASGS D+TA V+ +E      VK+   +GH DSVDQLCW P + DL  TASGDKT+R+WD R  KC       GENI
Subjt:  SREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGKCSQQAELSGENI

Query:  NITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAIDPVGGYFAVGSAD
        NI + PDG  IAVGN+DD +T +D +  +   + +F +EVNEI+WN    MFFLT GNG + +L+YP L+P++++ AH + C CI  DP+G YFA GSAD
Subjt:  NITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAIDPVGGYFAVGSAD

Query:  SLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKN-KYQA--DEGIFRIF
        +LVSLWD+ +++CVR F++L+WPVRT+SF+H G+ +ASASED FIDI+ V+TG  + ++ C +   +V W+PK  LLA+A DDK+ KY +  + G  ++F
Subjt:  SLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKN-KYQA--DEGIFRIF

Query:  G
        G
Subjt:  G

Q8VE80 THO complex subunit 3 | 7.7e-92 | 51.16
Query:  SREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGKCSQQAELSGENI
        +RE+  H  KVHSVAW+C G +LASGS D+TA V+ +E      VK+   +GH DSVDQLCW P + DL  TASGDKT+R+WD R  KC       GENI
Subjt:  SREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGKCSQQAELSGENI

Query:  NITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAIDPVGGYFAVGSAD
        NI + PDG  IAVGN+DD +T +D +  +   + +F +EVNEI+WN    MFFLT GNG + +L+YP L+P++++ AH + C CI  DP+G YFA GSAD
Subjt:  NITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAIDPVGGYFAVGSAD

Query:  SLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKN-KYQA--DEGIFRIF
        +LVSLWD+ +++CVR F++L+WPVRT+SF+H G+ +ASASED FIDI+ V+TG  + ++ C +   +V W+PK  LLA+A DDK+ KY +  + G  ++F
Subjt:  SLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKN-KYQA--DEGIFRIF

Query:  G
        G
Subjt:  G

Q96J01 THO complex subunit 3 | 1.0e-91 | 51.16
Query:  SREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGKCSQQAELSGENI
        +RE+  H  KVHSVAW+C G +LASGS D+TA V+ +E      VK+   +GH DSVDQLCW P + DL  TASGDKT+R+WD R  KC       GENI
Subjt:  SREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGKCSQQAELSGENI

Query:  NITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAIDPVGGYFAVGSAD
        NI + PDG  IAVGN+DD +T +D +  +   + +F +EVNEI+WN    MFFLT GNG + +L+YP L+P++++ AH + C CI  DP+G YFA GSAD
Subjt:  NITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAIDPVGGYFAVGSAD

Query:  SLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKN-KYQA--DEGIFRIF
        +LVSLWD+ +++CVR F++L+WPVRT+SF+H G+ +ASASED FIDI+ V+TG  + ++ C +   +V W+PK  LLA+A DDK+ KY +  + G  ++F
Subjt:  SLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKN-KYQA--DEGIFRIF

Query:  G
        G
Subjt:  G

Q9FKT5 THO complex subunit 3 | 4.7e-166 | 86.62
Query:  MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
        MEE+   FK+LHSREYQGHKKKVHSVAWN  G KLASGSVDQTAR+W+IEPHGH K KD+ELKGHTDSVDQLCWDPKHSDL+ATASGDK+VRLWDAR+GK
Subjt:  MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK

Query:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID
        C+QQ ELSGENINITYKPDGTH+AVGNRDDELTILDVRKFKP+H+RKFNYEVNEIAWNM G+ FFLTTG GTVEVL+YPSL+P++TL AHTAGCYCIAID
Subjt:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID

Query:  PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKN-KY
        P G YFAVGSADSLVSLWDIS MLC+RTFTKLEWPVRTISFN++GEYIASASEDLFIDI+NVQTGRTVHQIPCRAAMNSVEWNPK+NLLAYAGDDKN KY
Subjt:  PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKN-KY

Query:  QADEGIFRIFGFES
          DEG+FRIFGFES
Subjt:  QADEGIFRIFGFES

Arabidopsis top hits | e value | % identity
AT3G49660.1 Transducin/WD40 repeat-like superfamily protein | 1.9e-16 | 24
Query:  LHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDV-ELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGKCSQQAELSG
        +HS+    H + V SV ++  G  LAS S D+T R + I        + V E  GH + +  + +    +  I +AS DKT++LWD   G  S    L G
Subjt:  LHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDV-ELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGKCSQQAELSG

Query:  EN---INITYKPDGTHIAVGNRDDELTILDVR-----KFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLM-AHTAGCYCIAID
               + + P    I  G+ D+ + I DV      K  P H    +  V  + +N  G +   ++ +G   +    +   ++TL+         +   
Subjt:  EN---INITYKPDGTHIAVGNRDDELTILDVR-----KFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLM-AHTAGCYCIAID

Query:  PVGGYFAVGSADSLVSLWDISQMLCVRTFT---KLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRA-AMNSVEWNPKHNLLAYAGDDK
        P G +  VG+ D+ + LW+IS    ++T+T     ++ + +      G+ I S SED  + +  + + + + ++      + +V  +P  NL+A    DK
Subjt:  PVGGYFAVGSADSLVSLWDISQMLCVRTFT---KLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRA-AMNSVEWNPKHNLLAYAGDDK

AT5G08390.1 Transducin/WD40 repeat-like superfamily protein | 1.9e-13 | 23.12
Query:  VNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASA
        ++ + ++ +  +      +GT+++      + + TL  H + C  +   P G +FA GS D+ + +WDI +  C+ T+      V  + F   G +I S 
Subjt:  VNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASA

Query:  SEDLFIDISNVQTGRTVHQIPC-RAAMNSVEWNPKHNLLAYAGDDKNKYQADEGIFRIFG
         ED  + + ++  G+ +H+       + S++++P   LLA    DK     D   F + G
Subjt:  SEDLFIDISNVQTGRTVHQIPC-RAAMNSVEWNPKHNLLAYAGDDKNKYQADEGIFRIFG

AT5G25150.1 TBP-associated factor 5 | 2.7e-15 | 24.9
Query:  YQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVE-LKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGKCSQQAELSGEN---
        Y+GH   V    ++  G   AS S D+TAR+W ++     +++ +  + GH   VD + W P + + IAT S DKTVRLWD + G+C +     G     
Subjt:  YQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVE-LKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGKCSQQAELSGEN---

Query:  INITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAIDPVGGYFAVGSA
        +++   PDG ++A G+ D                                         GT+ +    + R I  LM H +  + ++    G   A GSA
Subjt:  INITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAIDPVGGYFAVGSA

Query:  DSLVSLWDI----------------SQMLCVRTFTKLEWPVRTISFNHTGEYIASAS
        D  V LWD+                +++  +RTF     PV  + F+      A+ +
Subjt:  DSLVSLWDI----------------SQMLCVRTFTKLEWPVRTISFNHTGEYIASAS

AT5G56130.1 Transducin/WD40 repeat-like superfamily protein | 3.4e-167 | 86.62
Query:  MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
        MEE+   FK+LHSREYQGHKKKVHSVAWN  G KLASGSVDQTAR+W+IEPHGH K KD+ELKGHTDSVDQLCWDPKHSDL+ATASGDK+VRLWDAR+GK
Subjt:  MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK

Query:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID
        C+QQ ELSGENINITYKPDGTH+AVGNRDDELTILDVRKFKP+H+RKFNYEVNEIAWNM G+ FFLTTG GTVEVL+YPSL+P++TL AHTAGCYCIAID
Subjt:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAID

Query:  PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKN-KY
        P G YFAVGSADSLVSLWDIS MLC+RTFTKLEWPVRTISFN++GEYIASASEDLFIDI+NVQTGRTVHQIPCRAAMNSVEWNPK+NLLAYAGDDKN KY
Subjt:  PVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKN-KY

Query:  QADEGIFRIFGFES
          DEG+FRIFGFES
Subjt:  QADEGIFRIFGFES

AT5G67320.1 WD-40 repeat family protein | 4.6e-23 | 27.99
Query:  HSREYQGHKKK-VHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGKCSQQAEL-SG
        H++     K K V ++ WN  G  LA+GS D  AR+W +    +G++    L  H   +  L W+ K  D + T S D+T  +WD +  +  QQ E  SG
Subjt:  HSREYQGHKKK-VHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGKCSQQAEL-SG

Query:  ENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHK-RKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAIDPVGG----
          +++ ++ +    A  + D  + +  + + +P         EVN + W+ TG +    + + T ++        +  L  HT   Y I   P G     
Subjt:  ENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHK-RKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAIDPVGG----

Query:  -----YFAVGSADSLVSLWD--ISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLA
               A  S DS V LWD  + +MLC  +F     PV +++F+  GEYIAS S D  I I +++ G+ V        +  V WN + N +A
Subjt:  -----YFAVGSADSLVSLWD--ISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLA


Sequences
CDS sequence
ATGGAGGAATCAGCCCAAGCTTTCAAGAATCTTCACAGCAGAGAGTATCAAGGTCACAAGAAGAAGGTACATTCTGTGGCATGGAATTGCACGGGCATGAAGCTTGCTTC
CGGTTCTGTTGATCAAACTGCTCGAGTTTGGCATATTGAGCCTCACGGACATGGTAAGGTTAAGGATGTTGAGTTGAAAGGGCACACTGATAGTGTAGATCAGCTTTGCT
GGGACCCTAAACATTCTGATCTTATAGCGACTGCATCTGGAGACAAGACTGTTCGACTATGGGATGCTCGCAATGGGAAATGCTCACAGCAAGCTGAGCTCAGTGGGGAA
AATATCAACATCACCTACAAACCTGATGGGACACACATCGCTGTTGGAAATAGGGATGATGAACTTACAATTCTGGATGTTAGGAAGTTTAAACCAGTTCACAAGCGCAA
GTTCAACTATGAGGTGAATGAAATTGCATGGAACATGACTGGGGAGATGTTTTTCCTGACAACTGGAAATGGTACTGTTGAAGTACTAGCATACCCATCGCTTCGACCAA
TCGAAACTCTTATGGCTCATACAGCGGGTTGTTACTGCATTGCGATTGACCCTGTTGGAGGGTATTTTGCAGTTGGAAGTGCTGATTCATTAGTTAGCCTATGGGATATA
TCGCAGATGCTTTGTGTGCGAACATTTACAAAACTTGAATGGCCTGTTAGAACAATAAGTTTCAACCACACAGGAGAATACATTGCTTCTGCCAGTGAGGATTTGTTCAT
TGATATATCAAATGTTCAAACGGGACGAACGGTTCATCAGATTCCTTGTCGGGCTGCAATGAACAGTGTGGAGTGGAATCCTAAACATAACTTACTTGCATATGCTGGGG
ATGACAAGAACAAGTATCAGGCTGATGAAGGTATTTTTAGGATCTTTGGGTTTGAAAGTCCATGA
mRNA sequence
GAAAAGGATTTGATGTATTGAACAATTTTTTCACTCAAAAATCCATTTTAGTATTCGCATGGAAAGTCCCTCTAATCACAGGATTTGAAATCGTTCCATAGCAGCAGGAC
ATGGAGGAATCAGCCCAAGCTTTCAAGAATCTTCACAGCAGAGAGTATCAAGGTCACAAGAAGAAGGTACATTCTGTGGCATGGAATTGCACGGGCATGAAGCTTGCTTC
CGGTTCTGTTGATCAAACTGCTCGAGTTTGGCATATTGAGCCTCACGGACATGGTAAGGTTAAGGATGTTGAGTTGAAAGGGCACACTGATAGTGTAGATCAGCTTTGCT
GGGACCCTAAACATTCTGATCTTATAGCGACTGCATCTGGAGACAAGACTGTTCGACTATGGGATGCTCGCAATGGGAAATGCTCACAGCAAGCTGAGCTCAGTGGGGAA
AATATCAACATCACCTACAAACCTGATGGGACACACATCGCTGTTGGAAATAGGGATGATGAACTTACAATTCTGGATGTTAGGAAGTTTAAACCAGTTCACAAGCGCAA
GTTCAACTATGAGGTGAATGAAATTGCATGGAACATGACTGGGGAGATGTTTTTCCTGACAACTGGAAATGGTACTGTTGAAGTACTAGCATACCCATCGCTTCGACCAA
TCGAAACTCTTATGGCTCATACAGCGGGTTGTTACTGCATTGCGATTGACCCTGTTGGAGGGTATTTTGCAGTTGGAAGTGCTGATTCATTAGTTAGCCTATGGGATATA
TCGCAGATGCTTTGTGTGCGAACATTTACAAAACTTGAATGGCCTGTTAGAACAATAAGTTTCAACCACACAGGAGAATACATTGCTTCTGCCAGTGAGGATTTGTTCAT
TGATATATCAAATGTTCAAACGGGACGAACGGTTCATCAGATTCCTTGTCGGGCTGCAATGAACAGTGTGGAGTGGAATCCTAAACATAACTTACTTGCATATGCTGGGG
ATGACAAGAACAAGTATCAGGCTGATGAAGGTATTTTTAGGATCTTTGGGTTTGAAAGTCCATGAGAAATGGATCAACCATGGAGAAATTTCCAATTCTGTGATACTTTA
TTATGTTCCTTCATTTCTGGCAAAATGTATTGAATTTCAGTTAGTGGCTGTTACCATAATAACTTAGTCAAATTTTCCCTTGAAACAACAATCTGTGGTTGTATTACCTC
TTGAAAAGTTCACCATGATAAAATCTTAACTTGAAACAACCATTCCTACACACCTGATGCTTAGAAGTTCCATCTTTTTAACTCGAGTTTTTTAGTATTTGGGGAG
Protein sequence
MEESAQAFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGKCSQQAELSGE
NINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAIDPVGGYFAVGSADSLVSLWDI
SQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQADEGIFRIFGFESP
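As a quick consistency check between the CDS and protein sequences listed above, the CDS can be translated and compared with the protein. The sketch below assumes the standard genetic code and uses only the first 30 and last 12 nucleotides of the CDS to keep it short; `translate` is a hypothetical helper written for illustration, not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGAGGAATCAGCCCAAGCTTTCAAGAAT"  # first 30 nt of the CDS
cds_end = "GAAAGTCCATGA"                      # last 12 nt, ending in the stop codon
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments should reproduce the first ten and last three residues of the protein sequence; running the same function on the full CDS would allow a complete comparison.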