CuGenDBv2

Clc10G04600 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc10G04600
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: THO complex subunit 3
Genome location: ClcChr10:5077896..5085986
RNA-Seq Expression: Clc10G04600
Synteny: Clc10G04600
Gene Ontology terms:
GO:0006406 - mRNA export from nucleus (biological process)
GO:0010267 - production of ta-siRNAs involved in RNA interference (biological process)
GO:0000445 - THO complex part of transcription export complex (cellular component)
GO:0005509 - calcium ion binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR001680 - WD40 repeat
IPR002048 - EF-hand domain
IPR011992 - EF-hand domain pair
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
IPR018247 - EF-Hand 1, calcium-binding site
IPR019775 - WD40 repeat, conserved site
IPR020472 - G-protein beta WD-40 repeat
IPR036322 - WD40-repeat-containing domain superfamily
IPR040132 - TREX component Tex1/THOC3
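
The Genome location field above packs a chromosome name and a coordinate range into a single string. As a minimal sketch (not part of CuGenDB itself), the value can be split into its parts and the span length computed. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end, length)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: 1-based inclusive coordinates, so both endpoints are counted.
    length = end - start + 1
    return chrom, start, end, length


print(parse_location("ClcChr10:5077896..5085986"))
# ('ClcChr10', 5077896, 5085986, 8091)
```

Under that assumption the gene spans 8091 bp on ClcChr10.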


Homology
GenBank top hits (e value; % identity; alignment)
KAA0065764.1 THO complex subunit 3 [Cucumis melo var. makuwa]; e value: 1.8e-277; % identity: 85.87
Query:  MGSVVGKLESP-ECVPETKLEAKMVEAMRQRATKGSIIRSFDSIILKFPKIDDSLRKCKTIFQKFDEDLNGIIDRQELKECFNGLEISLTEEEIDDLFDA
        MGSVVGKLESP ECVPETKLEAKMVE MR+RATKGSIIRSFD I+LKFPKIDDSLR CKTIFQ+FDEDLNGIIDR+ELK+CF+GLEI LTEEEIDDLF+A
Subjt:  MGSVVGKLESP-ECVPETKLEAKMVEAMRQRATKGSIIRSFDSIILKFPKIDDSLRKCKTIFQKFDEDLNGIIDRQELKECFNGLEISLTEEEIDDLFDA

Query:  CDISAAMGMKFNEFIVLLCLVYLLKDDPNAKCSKSHFGMPKLEETFESLVDAFVFLDKNKDGYVSKSEMISAINETTSGERSSGRIAMRRFEEMDWDKNG
        CDIS+AMG+KFNEFIVLLCLVYLLKDDP+A  SKS FGMPKLEETFESLVDAFVFLDKNKDGYVSKSEMISAINETTSGERSSGRIAMRRFEEMDWDKNG
Subjt:  CDISAAMGMKFNEFIVLLCLVYLLKDDPNAKCSKSHFGMPKLEETFESLVDAFVFLDKNKDGYVSKSEMISAINETTSGERSSGRIAMRRFEEMDWDKNG

Query:  MVNFKEFLFAFTRWVGIDENEDGQSPIQELVSAWNQNPSKSQDLKSIVPLQEDMEESTPIFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVW
        M                                                  +DMEES   FKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVW
Subjt:  MVNFKEFLFAFTRWVGIDENEDGQSPIQELVSAWNQNPSKSQDLKSIVPLQEDMEESTPIFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVW

Query:  HIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGKCSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRK
        HIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGKCSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRK
Subjt:  HIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGKCSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRK

Query:  FNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTI
        FNYE        VNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTI
Subjt:  FNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTI

Query:  SFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQADEGK
        SFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQADEGK
Subjt:  SFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQADEGK

XP_004149572.1 THO complex subunit 3 [Cucumis sativus]; e value: 1.1e-175; % identity: 96.47
Query:  MEESTPIFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
        MEES   FKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
Subjt:  MEESTPIFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK

Query:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTA
        CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYE        VNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTA
Subjt:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTA

Query:  GCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYA
        GCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYA
Subjt:  GCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYA

Query:  GDDKNKYQADEG
        GDDKNKYQADEG
Subjt:  GDDKNKYQADEG

XP_022155009.1 THO complex subunit 3 isoform X2 [Momordica charantia]; e value: 1.4e-170; % identity: 93.27
Query:  MEESTPIFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
        MEE+   FKNL SREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDAR+GK
Subjt:  MEESTPIFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK

Query:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTA
        CSQQ ELSGENINITYKPDGTH+AVGNRDDELTILDVRKFKP+HKRKFNYE        VNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRP+ETLMAHTA
Subjt:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTA

Query:  GCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYA
        GCYCIAIDPVG YFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFN+TG+YIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYA
Subjt:  GCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYA

Query:  GDDKNKYQADEG
        GDDKNKYQADEG
Subjt:  GDDKNKYQADEG

XP_022946738.1 THO complex subunit 3 [Cucurbita moschata]; e value: 1.9e-175; % identity: 96.15
Query:  MEESTPIFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
        MEESTP FKNL+SREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
Subjt:  MEESTPIFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK

Query:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTA
        CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYE        VNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTA
Subjt:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTA

Query:  GCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYA
        GCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDIS+VQ+GRTVHQIPCRAAMNSVEWNPKHNLLAYA
Subjt:  GCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYA

Query:  GDDKNKYQADEG
        GDDKNKYQADEG
Subjt:  GDDKNKYQADEG

XP_038905145.1 THO complex subunit 3 [Benincasa hispida]; e value: 4.6e-177; % identity: 96.79
Query:  MEESTPIFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
        MEES PIFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
Subjt:  MEESTPIFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK

Query:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTA
        CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKP+HKRKFNYE        VNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTA
Subjt:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTA

Query:  GCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYA
        GCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYA
Subjt:  GCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYA

Query:  GDDKNKYQADEG
        GDDKNKYQADEG
Subjt:  GDDKNKYQADEG
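
Each hit above lists a % identity alongside its alignment. As a rough illustration of how such a figure relates to the alignment text, the sketch below counts the columns where the middle line repeats the query residue. In BLAST-style output the middle line shows the letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. The function name `percent_identity` is hypothetical, and the input is only the first 20 columns of the XP_038905145.1 alignment, so the result is not the value reported for the full hit.

```python
def percent_identity(query, midline):
    """Percent of alignment columns where the midline repeats the query residue."""
    if not query or len(query) != len(midline):
        raise ValueError("query and midline must be non-empty and equally long")
    # A column counts as an identity only when the midline shows the same letter;
    # '+' (conservative substitution) and ' ' (mismatch or gap) do not count.
    matches = sum(1 for q, m in zip(query, midline) if q.isalpha() and q == m)
    return 100.0 * matches / len(query)


# First 20 alignment columns of the XP_038905145.1 hit (excerpt only).
query_excerpt = "MEESTPIFKNLHSREYQGHK"
midline_excerpt = "MEES PIFKNLHSREYQGHK"
print(percent_identity(query_excerpt, midline_excerpt))  # 95.0
```

Running the same count over every column of an alignment, rather than an excerpt, is presumably how the listed percentages are obtained.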

TrEMBL top hits (e value; % identity; alignment)
A0A0A0L651 WD_REPEATS_REGION domain-containing protein; e value: 5.5e-176; % identity: 96.47
Query:  MEESTPIFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
        MEES   FKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
Subjt:  MEESTPIFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK

Query:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTA
        CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYE        VNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTA
Subjt:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTA

Query:  GCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYA
        GCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYA
Subjt:  GCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYA

Query:  GDDKNKYQADEG
        GDDKNKYQADEG
Subjt:  GDDKNKYQADEG

A0A1S3BM38 THO complex subunit 3; e value: 5.5e-176; % identity: 96.47
Query:  MEESTPIFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
        MEES   FKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
Subjt:  MEESTPIFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK

Query:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTA
        CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYE        VNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTA
Subjt:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTA

Query:  GCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYA
        GCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYA
Subjt:  GCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYA

Query:  GDDKNKYQADEG
        GDDKNKYQADEG
Subjt:  GDDKNKYQADEG

A0A5D3BC11 THO complex subunit 3; e value: 8.6e-278; % identity: 85.87
Query:  MGSVVGKLESP-ECVPETKLEAKMVEAMRQRATKGSIIRSFDSIILKFPKIDDSLRKCKTIFQKFDEDLNGIIDRQELKECFNGLEISLTEEEIDDLFDA
        MGSVVGKLESP ECVPETKLEAKMVE MR+RATKGSIIRSFD I+LKFPKIDDSLR CKTIFQ+FDEDLNGIIDR+ELK+CF+GLEI LTEEEIDDLF+A
Subjt:  MGSVVGKLESP-ECVPETKLEAKMVEAMRQRATKGSIIRSFDSIILKFPKIDDSLRKCKTIFQKFDEDLNGIIDRQELKECFNGLEISLTEEEIDDLFDA

Query:  CDISAAMGMKFNEFIVLLCLVYLLKDDPNAKCSKSHFGMPKLEETFESLVDAFVFLDKNKDGYVSKSEMISAINETTSGERSSGRIAMRRFEEMDWDKNG
        CDIS+AMG+KFNEFIVLLCLVYLLKDDP+A  SKS FGMPKLEETFESLVDAFVFLDKNKDGYVSKSEMISAINETTSGERSSGRIAMRRFEEMDWDKNG
Subjt:  CDISAAMGMKFNEFIVLLCLVYLLKDDPNAKCSKSHFGMPKLEETFESLVDAFVFLDKNKDGYVSKSEMISAINETTSGERSSGRIAMRRFEEMDWDKNG

Query:  MVNFKEFLFAFTRWVGIDENEDGQSPIQELVSAWNQNPSKSQDLKSIVPLQEDMEESTPIFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVW
        M                                                  +DMEES   FKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVW
Subjt:  MVNFKEFLFAFTRWVGIDENEDGQSPIQELVSAWNQNPSKSQDLKSIVPLQEDMEESTPIFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVW

Query:  HIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGKCSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRK
        HIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGKCSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRK
Subjt:  HIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGKCSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRK

Query:  FNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTI
        FNYE        VNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTI
Subjt:  FNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTI

Query:  SFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQADEGK
        SFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQADEGK
Subjt:  SFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKNKYQADEGK

A0A6J1G4N9 THO complex subunit 3; e value: 9.4e-176; % identity: 96.15
Query:  MEESTPIFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
        MEESTP FKNL+SREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
Subjt:  MEESTPIFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK

Query:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTA
        CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYE        VNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTA
Subjt:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTA

Query:  GCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYA
        GCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDIS+VQ+GRTVHQIPCRAAMNSVEWNPKHNLLAYA
Subjt:  GCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYA

Query:  GDDKNKYQADEG
        GDDKNKYQADEG
Subjt:  GDDKNKYQADEG

A0A6J1KZA7 THO complex subunit 3; e value: 9.4e-176; % identity: 96.15
Query:  MEESTPIFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
        MEESTP FKNL+SREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
Subjt:  MEESTPIFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK

Query:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTA
        CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYE        VNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTA
Subjt:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTA

Query:  GCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYA
        GCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDIS+VQ+GRTVHQIPCRAAMNSVEWNPKHNLLAYA
Subjt:  GCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYA

Query:  GDDKNKYQADEG
        GDDKNKYQADEG
Subjt:  GDDKNKYQADEG

SwissProt top hits (e value; % identity; alignment)
Q29RH4 THO complex subunit 3; e value: 5.5e-88; % identity: 50.67
Query:  SREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGKCSQQAELSGENI
        +RE+  H  KVHSVAW+C G +LASGS D+TA V+ +E      VK+   +GH DSVDQLCW P + DL  TASGDKT+R+WD R  KC       GENI
Subjt:  SREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGKCSQQAELSGENI

Query:  NITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAIDPVGG
        NI + PDG  IAVGN+DD +T +D +  +   + +F         F+VNEI+WN    MFFLT GNG + +L+YP L+P++++ AH + C CI  DP+G 
Subjt:  NITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAIDPVGG

Query:  YFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKN-KYQA
        YFA GSAD+LVSLWD+ +++CVR F++L+WPVRT+SF+H G+ +ASASED FIDI+ V+TG  + ++ C +   +V W+PK  LLA+A DDK+ KY +
Subjt:  YFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKN-KYQA

Q52K82 Probable calcium-binding protein CML21; e value: 1.5e-82; % identity: 69.96
Query:  MGSVVGKLES--PECVPETKLEAKMVEAMRQRATKGSIIRSFDSIILKFPKIDDSLRKCKTIFQKFDEDLNGIIDRQELKECFNGLEISLTEEEIDDLFD
        MG  V K E+   E VPETKLEAK++EA+++RA++G+ ++SF+SI+LKFPKIDD LR CK IFQ+FDED NG ID  ELK C   LEIS  EEEI+DLF 
Subjt:  MGSVVGKLES--PECVPETKLEAKMVEAMRQRATKGSIIRSFDSIILKFPKIDDSLRKCKTIFQKFDEDLNGIIDRQELKECFNGLEISLTEEEIDDLFD

Query:  ACDISAAMGMKFNEFIVLLCLVYLLKDDPNAKCSKSHFGMPKLEETFESLVDAFVFLDKNKDGYVSKSEMISAINETTSGERSSGRIAMRRFEEMDWDKN
        ACDI+  MG+ F EFIVLLCLVYLLKDD +    K   GMPKLE TFE+LVD FVFLD+NKDGYVS+ EM+ AI+E  SGERSSGRIAM+RFEEMDWDKN
Subjt:  ACDISAAMGMKFNEFIVLLCLVYLLKDDPNAKCSKSHFGMPKLEETFESLVDAFVFLDKNKDGYVSKSEMISAINETTSGERSSGRIAMRRFEEMDWDKN

Query:  GMVNFKEFLFAFTRWVGIDENED
        GMVNFKEFLFAFT+WVGIDENE+
Subjt:  GMVNFKEFLFAFTRWVGIDENED

Q8VE80 THO complex subunit 3; e value: 5.5e-88; % identity: 50.67
Query:  SREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGKCSQQAELSGENI
        +RE+  H  KVHSVAW+C G +LASGS D+TA V+ +E      VK+   +GH DSVDQLCW P + DL  TASGDKT+R+WD R  KC       GENI
Subjt:  SREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGKCSQQAELSGENI

Query:  NITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAIDPVGG
        NI + PDG  IAVGN+DD +T +D +  +   + +F         F+VNEI+WN    MFFLT GNG + +L+YP L+P++++ AH + C CI  DP+G 
Subjt:  NITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAIDPVGG

Query:  YFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKN-KYQA
        YFA GSAD+LVSLWD+ +++CVR F++L+WPVRT+SF+H G+ +ASASED FIDI+ V+TG  + ++ C +   +V W+PK  LLA+A DDK+ KY +
Subjt:  YFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKN-KYQA

Q96J01 THO complex subunit 3; e value: 7.1e-88; % identity: 50.67
Query:  SREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGKCSQQAELSGENI
        +RE+  H  KVHSVAW+C G +LASGS D+TA V+ +E      VK+   +GH DSVDQLCW P + DL  TASGDKT+R+WD R  KC       GENI
Subjt:  SREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGKCSQQAELSGENI

Query:  NITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAIDPVGG
        NI + PDG  IAVGN+DD +T +D +  +   + +F         F+VNEI+WN    MFFLT GNG + +L+YP L+P++++ AH + C CI  DP+G 
Subjt:  NITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAIDPVGG

Query:  YFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKN-KYQA
        YFA GSAD+LVSLWD+ +++CVR F++L+WPVRT+SF+H G+ +ASASED FIDI+ V+TG  + ++ C +   +V W+PK  LLA+A DDK+ KY +
Subjt:  YFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYAGDDKN-KYQA

Q9FKT5 THO complex subunit 3; e value: 2.9e-158; % identity: 84.66
Query:  MEESTPIFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
        MEE+T  FK+LHSREYQGHKKKVHSVAWN  G KLASGSVDQTAR+W+IEPHGH K KD+ELKGHTDSVDQLCWDPKHSDL+ATASGDK+VRLWDAR+GK
Subjt:  MEESTPIFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK

Query:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTA
        C+QQ ELSGENINITYKPDGTH+AVGNRDDELTILDVRKFKP+H+RKFNYE        VNEIAWNM G+ FFLTTG GTVEVL+YPSL+P++TL AHTA
Subjt:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTA

Query:  GCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYA
        GCYCIAIDP G YFAVGSADSLVSLWDIS MLC+RTFTKLEWPVRTISFN++GEYIASASEDLFIDI+NVQTGRTVHQIPCRAAMNSVEWNPK+NLLAYA
Subjt:  GCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYA

Query:  GDDKN-KYQADEG
        GDDKN KY  DEG
Subjt:  GDDKN-KYQADEG

Arabidopsis top hits (e value; % identity; alignment)
AT3G24110.1 Calcium-binding EF-hand family protein; e value: 1.2e-45; % identity: 44.23
Query:  ETKLEAKMVEAMRQRATKGSIIRSFDSIILKFPKIDDSLRKCKTIFQKFDEDLNGIIDRQELKECFNGLEISLTEEEIDDLFDACDISAAMGMKFNEFIV
        + KL  KMVE+ R        ++S DSII+KFPK+ + LR  +++F+ +D D NG ID +ELK+C   L++SL++EE+  L+  CD+  + G++FNEFIV
Subjt:  ETKLEAKMVEAMRQRATKGSIIRSFDSIILKFPKIDDSLRKCKTIFQKFDEDLNGIIDRQELKECFNGLEISLTEEEIDDLFDACDISAAMGMKFNEFIV

Query:  LLCLVYLLKDDPNAKCSKSHFGMPKL-EETFESLVDAFVFLDKNKDGYVSKSEMISAI-NETTSGERSSGRIAMRRFEEMDWDKNGMVNFKEFLFAFTRW
        LLCL+YLL    +   ++S    PKL E  F+ +V+ F+FLDK+  G ++K+++I  + NE    ERS   +   RFEEMDW + G V F+EFLFAF  W
Subjt:  LLCLVYLLKDDPNAKCSKSHFGMPKL-EETFESLVDAFVFLDKNKDGYVSKSEMISAI-NETTSGERSSGRIAMRRFEEMDWDKNGMVNFKEFLFAFTRW

Query:  VGIDENED
        VG+D+ +D
Subjt:  VGIDENED

AT4G26470.1 Calcium-binding EF-hand family protein; e value: 1.1e-83; % identity: 69.96
Query:  MGSVVGKLES--PECVPETKLEAKMVEAMRQRATKGSIIRSFDSIILKFPKIDDSLRKCKTIFQKFDEDLNGIIDRQELKECFNGLEISLTEEEIDDLFD
        MG  V K E+   E VPETKLEAK++EA+++RA++G+ ++SF+SI+LKFPKIDD LR CK IFQ+FDED NG ID  ELK C   LEIS  EEEI+DLF 
Subjt:  MGSVVGKLES--PECVPETKLEAKMVEAMRQRATKGSIIRSFDSIILKFPKIDDSLRKCKTIFQKFDEDLNGIIDRQELKECFNGLEISLTEEEIDDLFD

Query:  ACDISAAMGMKFNEFIVLLCLVYLLKDDPNAKCSKSHFGMPKLEETFESLVDAFVFLDKNKDGYVSKSEMISAINETTSGERSSGRIAMRRFEEMDWDKN
        ACDI+  MG+ F EFIVLLCLVYLLKDD +    K   GMPKLE TFE+LVD FVFLD+NKDGYVS+ EM+ AI+E  SGERSSGRIAM+RFEEMDWDKN
Subjt:  ACDISAAMGMKFNEFIVLLCLVYLLKDDPNAKCSKSHFGMPKLEETFESLVDAFVFLDKNKDGYVSKSEMISAINETTSGERSSGRIAMRRFEEMDWDKN

Query:  GMVNFKEFLFAFTRWVGIDENED
        GMVNFKEFLFAFT+WVGIDENE+
Subjt:  GMVNFKEFLFAFTRWVGIDENED

AT4G26470.2 Calcium-binding EF-hand family protein; e value: 3.3e-64; % identity: 66.15
Query:  MGSVVGKLES--PECVPETKLEAKMVEAMRQRATKGSIIRSFDSIILKFPKIDDSLRKCKTIFQKFDEDLNGIIDRQELKECFNGLEISLTEEEIDDLFD
        MG  V K E+   E VPETKLEAK++EA+++RA++G+ ++SF+SI+LKFPKIDD LR CK IFQ+FDED NG ID  ELK C   LEIS  EEEI+DLF 
Subjt:  MGSVVGKLES--PECVPETKLEAKMVEAMRQRATKGSIIRSFDSIILKFPKIDDSLRKCKTIFQKFDEDLNGIIDRQELKECFNGLEISLTEEEIDDLFD

Query:  ACDISAAMGMKFNEFIVLLCLVYLLKDDPNAKCSKSHFGMPKLEETFESLVDAFVFLDKNKDGYVSKSEMISAINETTSGERSSGRIAMRRF
        ACDI+  MG+ F EFIVLLCLVYLLKDD +    K   GMPKLE TFE+LVD FVFLD+NKDGYVS+ EM+ AI+E  SGERSSGRIAM+RF
Subjt:  ACDISAAMGMKFNEFIVLLCLVYLLKDDPNAKCSKSHFGMPKLEETFESLVDAFVFLDKNKDGYVSKSEMISAINETTSGERSSGRIAMRRF

AT5G56130.1 Transducin/WD40 repeat-like superfamily protein; e value: 2.1e-159; % identity: 84.66
Query:  MEESTPIFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK
        MEE+T  FK+LHSREYQGHKKKVHSVAWN  G KLASGSVDQTAR+W+IEPHGH K KD+ELKGHTDSVDQLCWDPKHSDL+ATASGDK+VRLWDAR+GK
Subjt:  MEESTPIFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGK

Query:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTA
        C+QQ ELSGENINITYKPDGTH+AVGNRDDELTILDVRKFKP+H+RKFNYE        VNEIAWNM G+ FFLTTG GTVEVL+YPSL+P++TL AHTA
Subjt:  CSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTA

Query:  GCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYA
        GCYCIAIDP G YFAVGSADSLVSLWDIS MLC+RTFTKLEWPVRTISFN++GEYIASASEDLFIDI+NVQTGRTVHQIPCRAAMNSVEWNPK+NLLAYA
Subjt:  GCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLAYA

Query:  GDDKN-KYQADEG
        GDDKN KY  DEG
Subjt:  GDDKN-KYQADEG

AT5G67320.1 WD-40 repeat family protein; e value: 1.6e-21; % identity: 27.33
Query:  HSREYQGHKKK-VHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGKCSQQAEL-SG
        H++     K K V ++ WN  G  LA+GS D  AR+W +    +G++    L  H   +  L W+ K  D + T S D+T  +WD +  +  QQ E  SG
Subjt:  HSREYQGHKKK-VHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKHSDLIATASGDKTVRLWDARNGKCSQQAEL-SG

Query:  ENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAIDP
          +++ ++ +    A  + D  + +  + + +P        + F     +VN + W+ TG +    + + T ++        +  L  HT   Y I   P
Subjt:  ENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPSLRPIETLMAHTAGCYCIAIDP

Query:  VGG---------YFAVGSADSLVSLWD--ISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLA
         G            A  S DS V LWD  + +MLC  +F     PV +++F+  GEYIAS S D  I I +++ G+ V        +  V WN + N +A
Subjt:  VGG---------YFAVGSADSLVSLWD--ISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLA


Sequences
CDS sequence
ATGGGGAGTGTGGTGGGAAAGTTGGAATCTCCAGAGTGTGTACCAGAAACAAAACTTGAAGCCAAAATGGTTGAGGCAATGAGACAAAGGGCAACTAAAGGAAGCATCAT
TAGGTCATTTGATAGCATAATCTTGAAATTCCCCAAAATTGATGATAGCCTTAGAAAGTGCAAAACTATTTTCCAAAAGTTTGATGAGGATTTGAACGGGATAATAGATC
GTCAAGAGCTTAAAGAATGCTTTAACGGGCTGGAAATTTCGCTCACGGAGGAGGAAATCGATGATCTCTTTGATGCTTGTGATATTAGTGCAGCTATGGGAATGAAGTTC
AATGAATTCATTGTACTTCTCTGCCTTGTCTACCTTCTCAAGGATGATCCCAATGCTAAATGTTCTAAATCTCATTTCGGAATGCCAAAATTGGAGGAGACATTTGAATC
GTTGGTTGATGCATTCGTGTTCTTGGACAAGAATAAGGATGGATATGTAAGCAAGAGCGAGATGATATCTGCAATAAACGAGACCACGTCGGGAGAACGTTCTTCAGGGC
GGATAGCGATGAGGAGATTCGAGGAGATGGACTGGGACAAAAATGGAATGGTAAATTTCAAAGAATTTCTTTTCGCATTCACCCGTTGGGTTGGAATCGATGAGAATGAG
GACGGACAATCCCCCATACAAGAATTGGTCTCTGCATGGAACCAAAATCCCTCTAAATCGCAGGATTTGAAATCAATCGTTCCATTGCAGGAGGACATGGAGGAATCAAC
GCCAATTTTCAAGAATCTTCACAGCAGAGAGTATCAAGGTCACAAGAAGAAGGTACATTCTGTGGCATGGAATTGCACGGGCATGAAGCTTGCTTCCGGTTCCGTTGATC
AAACTGCTCGAGTTTGGCATATTGAGCCTCACGGACATGGTAAGGTTAAGGATGTTGAGTTGAAAGGGCACACTGATAGTGTAGATCAGCTATGCTGGGACCCTAAACAT
TCTGATCTTATAGCGACTGCATCTGGGGACAAGACCGTTCGACTATGGGATGCTCGTAATGGGAAATGCTCTCAGCAAGCTGAGCTCAGTGGGGAAAATATCAACATCAC
CTACAAACCTGATGGTACACACATAGCTGTTGGGAATAGGGATGATGAACTTACAATTCTGGATGTTAGGAAGTTTAAACCTGTTCACAAGCGCAAGTTCAATTATGAGG
AGTTTTTCGTTATCTTTTTCCAGGTGAATGAAATTGCTTGGAACATGACTGGGGAGATGTTTTTCCTGACAACTGGAAATGGTACTGTTGAAGTACTAGCATACCCGTCA
CTTCGACCAATTGAAACTCTTATGGCCCATACAGCTGGTTGTTACTGCATTGCAATTGACCCAGTTGGAGGGTACTTTGCTGTTGGAAGTGCTGATTCATTAGTTAGCCT
ATGGGATATCTCTCAGATGCTCTGCGTGCGAACATTTACAAAACTCGAATGGCCTGTCAGAACAATAAGTTTCAACCACACAGGAGAATACATTGCTTCTGCCAGCGAGG
ACTTGTTCATTGATATATCAAATGTTCAAACGGGACGAACGGTCCATCAGATTCCTTGTCGGGCTGCGATGAACAGTGTGGAGTGGAATCCAAAACACAATTTACTTGCA
TATGCTGGGGATGACAAGAACAAGTATCAGGCTGATGAAGGCAAGTTATCCGTGCATTGA
mRNA sequence
ACAGAAACGTGGTGGGTCGACCACAGAAATCGGTACAAGAATCCAATGAAGCAGCAAAGAATCATCATCCCCCATTTTCATCTTCATCCCACCAACAACTAATCTCTACC
AAACTCTTCCCTTTTCCTTCATTCATCAAAAACAAAAACCCCCCAAAAGAAAAAATCTCCCAAAACGCCCTTCAATTTCTCTTCCTTTTCCTTGTTGACCTTCTGAGATT
CCTGAAGAACAACAAGAAGAAGCGATTTTGGAACCAAGCTGTGCCCTTAATCTCAAACAAAGACACTGTTTAATTAGTTGGTTGAGATTCCAAAAACATCATCCCATTTT
GTGTTGTGAAAGTTAAGTTGGCAATGGGGAGTGTGGTGGGAAAGTTGGAATCTCCAGAGTGTGTACCAGAAACAAAACTTGAAGCCAAAATGGTTGAGGCAATGAGACAA
AGGGCAACTAAAGGAAGCATCATTAGGTCATTTGATAGCATAATCTTGAAATTCCCCAAAATTGATGATAGCCTTAGAAAGTGCAAAACTATTTTCCAAAAGTTTGATGA
GGATTTGAACGGGATAATAGATCGTCAAGAGCTTAAAGAATGCTTTAACGGGCTGGAAATTTCGCTCACGGAGGAGGAAATCGATGATCTCTTTGATGCTTGTGATATTA
GTGCAGCTATGGGAATGAAGTTCAATGAATTCATTGTACTTCTCTGCCTTGTCTACCTTCTCAAGGATGATCCCAATGCTAAATGTTCTAAATCTCATTTCGGAATGCCA
AAATTGGAGGAGACATTTGAATCGTTGGTTGATGCATTCGTGTTCTTGGACAAGAATAAGGATGGATATGTAAGCAAGAGCGAGATGATATCTGCAATAAACGAGACCAC
GTCGGGAGAACGTTCTTCAGGGCGGATAGCGATGAGGAGATTCGAGGAGATGGACTGGGACAAAAATGGAATGGTAAATTTCAAAGAATTTCTTTTCGCATTCACCCGTT
GGGTTGGAATCGATGAGAATGAGGACGGACAATCCCCCATACAAGAATTGGTCTCTGCATGGAACCAAAATCCCTCTAAATCGCAGGATTTGAAATCAATCGTTCCATTG
CAGGAGGACATGGAGGAATCAACGCCAATTTTCAAGAATCTTCACAGCAGAGAGTATCAAGGTCACAAGAAGAAGGTACATTCTGTGGCATGGAATTGCACGGGCATGAA
GCTTGCTTCCGGTTCCGTTGATCAAACTGCTCGAGTTTGGCATATTGAGCCTCACGGACATGGTAAGGTTAAGGATGTTGAGTTGAAAGGGCACACTGATAGTGTAGATC
AGCTATGCTGGGACCCTAAACATTCTGATCTTATAGCGACTGCATCTGGGGACAAGACCGTTCGACTATGGGATGCTCGTAATGGGAAATGCTCTCAGCAAGCTGAGCTC
AGTGGGGAAAATATCAACATCACCTACAAACCTGATGGTACACACATAGCTGTTGGGAATAGGGATGATGAACTTACAATTCTGGATGTTAGGAAGTTTAAACCTGTTCA
CAAGCGCAAGTTCAATTATGAGGAGTTTTTCGTTATCTTTTTCCAGGTGAATGAAATTGCTTGGAACATGACTGGGGAGATGTTTTTCCTGACAACTGGAAATGGTACTG
TTGAAGTACTAGCATACCCGTCACTTCGACCAATTGAAACTCTTATGGCCCATACAGCTGGTTGTTACTGCATTGCAATTGACCCAGTTGGAGGGTACTTTGCTGTTGGA
AGTGCTGATTCATTAGTTAGCCTATGGGATATCTCTCAGATGCTCTGCGTGCGAACATTTACAAAACTCGAATGGCCTGTCAGAACAATAAGTTTCAACCACACAGGAGA
ATACATTGCTTCTGCCAGCGAGGACTTGTTCATTGATATATCAAATGTTCAAACGGGACGAACGGTCCATCAGATTCCTTGTCGGGCTGCGATGAACAGTGTGGAGTGGA
ATCCAAAACACAATTTACTTGCATATGCTGGGGATGACAAGAACAAGTATCAGGCTGATGAAGGCAAGTTATCCGTGCATTGA
Protein sequence
MGSVVGKLESPECVPETKLEAKMVEAMRQRATKGSIIRSFDSIILKFPKIDDSLRKCKTIFQKFDEDLNGIIDRQELKECFNGLEISLTEEEIDDLFDACDISAAMGMKF
NEFIVLLCLVYLLKDDPNAKCSKSHFGMPKLEETFESLVDAFVFLDKNKDGYVSKSEMISAINETTSGERSSGRIAMRRFEEMDWDKNGMVNFKEFLFAFTRWVGIDENE
DGQSPIQELVSAWNQNPSKSQDLKSIVPLQEDMEESTPIFKNLHSREYQGHKKKVHSVAWNCTGMKLASGSVDQTARVWHIEPHGHGKVKDVELKGHTDSVDQLCWDPKH
SDLIATASGDKTVRLWDARNGKCSQQAELSGENINITYKPDGTHIAVGNRDDELTILDVRKFKPVHKRKFNYEEFFVIFFQVNEIAWNMTGEMFFLTTGNGTVEVLAYPS
LRPIETLMAHTAGCYCIAIDPVGGYFAVGSADSLVSLWDISQMLCVRTFTKLEWPVRTISFNHTGEYIASASEDLFIDISNVQTGRTVHQIPCRAAMNSVEWNPKHNLLA
YAGDDKNKYQADEGKLSVH
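
The CDS and protein sequences above are related by translation. The following sketch shows one way to check that relationship on short excerpts: it translates the first 30 and last 21 nucleotides of the CDS listed above. It assumes the standard genetic code (NCBI translation table 1), which is the usual choice for a nuclear gene but is not stated in the record; the `translate` helper is hypothetical and not a CuGenDB tool.

```python
# Standard genetic code (assumed), laid out in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 and last 21 nucleotides of the CDS listed above.
print(translate("ATGGGGAGTGTGGTGGGAAAGTTGGAATCT"))  # MGSVVGKLES
print(translate("GGCAAGTTATCCGTGCATTGA"))           # GKLSVH
```

Both results agree with the start (MGSVVGKLES) and end (GKLSVH) of the protein sequence above; the final TGA codon is the stop.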