; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G20980 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G20980
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionAnkyrin repeat domain-containing protein, chloroplastic
Genome locationClcChr01:32774849..32784620
RNA-Seq ExpressionClc01G20980
SyntenyClc01G20980
Gene Ontology termsGO:0044238 - primary metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004620 - phospholipase activity (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0008081 - phosphoric diester hydrolase activity (molecular function)
InterPro domainsIPR000504 - RNA recognition motif domain
IPR002110 - Ankyrin repeat
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR020683 - Ankyrin repeat-containing domain
IPR035979 - RNA-binding domain superfamily
IPR036770 - Ankyrin repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EOX90735.1 Ankyrin repeat protein [Theobroma cacao]3.4e-23065.27Show/hide
Query:  SRSPSPWRAPSRSRSRSRSRSRPRSRSRSWSRP----RSPSRGRSRSRSRGRDHAHNPGNTLYVTGLSTRVTERDLEEHFSKEGKVVSCFLVVEPRTRIS
        S+SPSPWR  SRSRSRSRSRSR RSRSRSWSRP    RS SRGRSRSRSRGR    NPGNTLYVTGLS RV+ERD+EEHFSKEGKVVSCFLVVEPRTRIS
Subjt:  SRSPSPWRAPSRSRSRSRSRSRPRSRSRSWSRP----RSPSRGRSRSRSRGRDHAHNPGNTLYVTGLSTRVTERDLEEHFSKEGKVVSCFLVVEPRTRIS

Query:  RGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRTPTPGHYLGLKSTRDSGYRGDRGSRYRGGGSFRDDYGYRRSPRRSPYRGAREYSPRHS
        RGFAFVTMDNV+DANRC+K+LNQS+LEGRYITVE+SRRKRPRTPTPGHYLGLK+TRD G RGDRG RYRGG   RDD GYRRSPRRSPYRG R+YSPR+S
Subjt:  RGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRTPTPGHYLGLKSTRDSGYRGDRGSRYRGGGSFRDDYGYRRSPRRSPYRGAREYSPRHS

Query:  PPYGGRSRRDHSRSPPYSPPPYGGSPDRRYPRGSSPNRNLTGLGLQPNRMKSLPFSPSVGHCNRKSIGATMSLAASLNIPVAKVFFLSNSPLSLPNSNKL
         P+ GRSRR+ S SP              Y RGS        L L+              H     +      A SL  P   +   + + LSLP     
Subjt:  PPYGGRSRRDHSRSPPYSPPPYGGSPDRRYPRGSSPNRNLTGLGLQPNRMKSLPFSPSVGHCNRKSIGATMSLAASLNIPVAKVFFLSNSPLSLPNSNKL

Query:  SFISLLRPRNRVSSVSSSSHSSYSIQPDELED--FVIGDCVIFEDGVFDDPYLSDDSSA---EHSSPSTGKSKPK--------SSVAEIKPENLVPDEWK
           +L  PR ++ S+S S  SSY  Q D+++D   VIGDC++FEDG F+DP+L  DS++    ++S   GK+KPK            +I+ ENLVP++W+
Subjt:  SFISLLRPRNRVSSVSSSSHSSYSIQPDELED--FVIGDCVIFEDGVFDDPYLSDDSSA---EHSSPSTGKSKPK--------SSVAEIKPENLVPDEWK

Query:  EVQAEINITKKERRKIAQELEFGTRVEKKKKGLVPLTSVNLEEYLSYKEAKLAQLKPLVLDNPSSFPSK--EVNNADGVKEVKGGEMSRLSERVAPKNPK
         VQ EIN+TKKERRK+AQ LEFG+RVEKK++GLVP+ +VN++EYL YKEAKLAQL P+VLDNPSSFP K  E  N +G KE +   +   SERVAPKNP+
Subjt:  EVQAEINITKKERRKIAQELEFGTRVEKKKKGLVPLTSVNLEEYLSYKEAKLAQLKPLVLDNPSSFPSK--EVNNADGVKEVKGGEMSRLSERVAPKNPK

Query:  WAVYGRGLEDVSEFFNSGQYQPADRKSEGPRKLFSKEEKVMLNKRVPDLASAHSDMWLPLHTLVGSGEFYLVDELLKNNIDINAV---GFTALHRAILAK
        WAVYGRG EDV+E FNS  YQP + KSEG RKLF+KEEK MLN+RVPDLA+A S  WLPLHTL  SGEFYLVD LLK+N+DINAV   G TA+H+AI+ K
Subjt:  WAVYGRGLEDVSEFFNSGQYQPADRKSEGPRKLFSKEEKVMLNKRVPDLASAHSDMWLPLHTLVGSGEFYLVDELLKNNIDINAV---GFTALHRAILAK

Query:  KQAITNYLLRESANPFVRDKDGATLMHYAVQTASSQAIKILLLYNVDINLQDKDGWTPLHLAVQARRTDVVRLLLIKGADKSLKNAGGLTPLDICLYSGQ
        KQAITNYLLRESANPFVRD+DGATLMHYAV+TAS+  IK+LLLYNVDINLQD DGW+PLHLAVQARRTD+V+LLLI+GADK LKN  GLTPLD+CLYSG+
Subjt:  KQAITNYLLRESANPFVRDKDGATLMHYAVQTASSQAIKILLLYNVDINLQDKDGWTPLHLAVQARRTDVVRLLLIKGADKSLKNAGGLTPLDICLYSGQ

Query:  DTKTYELIKLLKQLPRP
        DT+TYELIKLLKQLP+P
Subjt:  DTKTYELIKLLKQLPRP

KAG7034061.1 Ankyrin repeat domain-containing protein, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]3.5e-24367.64Show/hide
Query:  YSRSPSPWRAPSRSRSRSRSRSRPRSRSRSWSRPRSPSRGRSRSRSRGR-----------------------------------------------DHAH
        YSRSPSPWRAP  SRSRSRSRSRPRSRSRSWSRPRS SRGRSRSRSRGR                                               ++ +
Subjt:  YSRSPSPWRAPSRSRSRSRSRSRPRSRSRSWSRPRSPSRGRSRSRSRGR-----------------------------------------------DHAH

Query:  NPGNTLYVTGLSTRVTERDLEEHFSKEGKVVSCFLVVEPRTRISRGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRTPTPGHYLGL---K
        NPGNTLYVTGLSTRVTERDLEEHFSKEGKV SCFLVVEPRTRISRGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEK+              G+    
Subjt:  NPGNTLYVTGLSTRVTERDLEEHFSKEGKVVSCFLVVEPRTRISRGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRTPTPGHYLGL---K

Query:  STRDSGYRGDRGSRYRGGGSFRDDYGYRRSPRRSPYRGAREYSPRHSPPYGGRSRRDHSRSPPYSPPPYGGSPDRRYPRGSSPNRNLTGLG----LQPNR
        S R  G    + +R+  G            P  +P    R +        GG +          +    GG               L  LG      PN 
Subjt:  STRDSGYRGDRGSRYRGGGSFRDDYGYRRSPRRSPYRGAREYSPRHSPPYGGRSRRDHSRSPPYSPPPYGGSPDRRYPRGSSPNRNLTGLG----LQPNR

Query:  M-KSLPFSPSVGHCNRKSIGATMSLAASLNIPVAKVFFLSNSPLSLPNSNKLSFISLLRPRNRVSSVSSSSHSSYSIQPDELEDFVIGDCVIFEDGVFDD
        M KSLPFS   G          MSLAASLNIPVAKV FLSNSPL LPNS KLSF+SL R  NRV SV SSSH+SYSIQPDELEDFVIGDCVIFEDGVFDD
Subjt:  M-KSLPFSPSVGHCNRKSIGATMSLAASLNIPVAKVFFLSNSPLSLPNSNKLSFISLLRPRNRVSSVSSSSHSSYSIQPDELEDFVIGDCVIFEDGVFDD

Query:  PYLSDDSSAEHSSPSTGKSKPKSSVAEIKPENLVPDEWKEVQAEINITKKERRKIAQELEFGTRVEKKKKGLVPLTSVNLEEYLSYKEAKLAQLKPLVLD
        PY+ DDS+AEH SP T KSKPK+ V EIKPENLVPDEWKEVQAEINITKKERRKIAQELEFGTRVEKKK+GLVPL SVNLEEYLS+KEAKLAQLKPLVLD
Subjt:  PYLSDDSSAEHSSPSTGKSKPKSSVAEIKPENLVPDEWKEVQAEINITKKERRKIAQELEFGTRVEKKKKGLVPLTSVNLEEYLSYKEAKLAQLKPLVLD

Query:  NPSSFP-SKEVNNADGVKEVKGGEMSRLSERVAPKNPKWAVYGRGLEDVSEFFNSGQYQPADRKSEGPRKLFSKEEKVMLNKRVPDLASAHSDMWLPLHT
        NPSSFP ++EVN A+GVKEVKG E+S  SERVAPKNPKWAVYGRGLEDVSEFFNS QYQPAD+K+EGPRKLFSKEEK MLNKRVPDLASAHS        
Subjt:  NPSSFP-SKEVNNADGVKEVKGGEMSRLSERVAPKNPKWAVYGRGLEDVSEFFNSGQYQPADRKSEGPRKLFSKEEKVMLNKRVPDLASAHSDMWLPLHT

Query:  LVGSGEFYLVDELLKNNIDINAVGFTALHRAILAKKQAITNYLLRESANPFVRDKDGATLMHYAVQTASSQAIKILLLYNVDINLQDKDGWTPLHLAVQA
                              VGFTALHRAI+AKKQAITNYLLRESANPFVRDKDGATLMHYAVQTASS AIK+LLLYNVDINLQDKDGWTPLHLAVQA
Subjt:  LVGSGEFYLVDELLKNNIDINAVGFTALHRAILAKKQAITNYLLRESANPFVRDKDGATLMHYAVQTASSQAIKILLLYNVDINLQDKDGWTPLHLAVQA

Query:  RRTDVVRLLLIKGADKSLKNAGGLTPLDICLYSGQDTKTYELIKLLKQLPRPNKLPS
        RRTDVVRLLLIKGADK+LKN  GLTPLDICLYSGQDTKTYELIKLLKQLP+P+KLPS
Subjt:  RRTDVVRLLLIKGADKSLKNAGGLTPLDICLYSGQDTKTYELIKLLKQLPRPNKLPS

RXH74957.1 hypothetical protein DVH24_029678 [Malus domestica]1.6e-23565.55Show/hide
Query:  YSRSPSPWRAPSRSRSRSRSRSRPRSRSRSWSRP-----RSPSRGRSRSRSRGRDHAHNPGNTLYVTGLSTRVTERDLEEHFSKEGKVVSCFLVVEPRTR
        YSRSPSPWRA  +SRSRSRS SRPRSRSRS SRP     RS SRGRSRSRS GR  A NPGNTLYVTGLSTRV+E+D+E HFSKEGKV SCFLV+EPRTR
Subjt:  YSRSPSPWRAPSRSRSRSRSRSRPRSRSRSWSRP-----RSPSRGRSRSRSRGRDHAHNPGNTLYVTGLSTRVTERDLEEHFSKEGKVVSCFLVVEPRTR

Query:  ISRGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRTPTPGHYLGLKSTRDSGYRGDRG-----------SRYRGGGSFRDDYGYRRSPRRS
        ISRGFAF+TMD+V+DA RC+K+LNQS+LEGRYITVE+SRRKRPRTPTPGHYLGLK+TRD GYRGDR             RYR GGS RDDYGYRRSPRRS
Subjt:  ISRGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRTPTPGHYLGLKSTRDSGYRGDRG-----------SRYRGGGSFRDDYGYRRSPRRS

Query:  PYRGAREYSPRHSPPYGGRSRRDHSRSPPYSPPPYGGSPDRRYPRGSSPNRNLTGLGLQPNRMKSLPFSPSVGHCNRKSIGATMSLAASLNIPVAKVFFL
        PYRG R+YSPR   PYGGRSRR+ S S      PYG           SP RN+          K++P S      N +S                   FL
Subjt:  PYRGAREYSPRHSPPYGGRSRRDHSRSPPYSPPPYGGSPDRRYPRGSSPNRNLTGLGLQPNRMKSLPFSPSVGHCNRKSIGATMSLAASLNIPVAKVFFL

Query:  SNSPLSLPNS-NKLSFISLLRPRNRVSSVSSSSHSSYSIQPDEL-EDFVIGDCVIFEDGVFDDPYLSDDSSAEHSSPSTGKSKPKSSVAEIKPENLVPDE
           PLSLP +  KLS  + L+   ++  +  S  SSYS+Q D   ED VIGDC++FE+GVFDDPYL +D+  +   P   K+KP+   +EI  ENLVPD+
Subjt:  SNSPLSLPNS-NKLSFISLLRPRNRVSSVSSSSHSSYSIQPDEL-EDFVIGDCVIFEDGVFDDPYLSDDSSAEHSSPSTGKSKPKSSVAEIKPENLVPDE

Query:  WKEVQAEINITKKERRKIAQELEFGTRVEKKKKGLVPLTSVNLEEYLSYKEAKLAQLKPLVLDNPSSF-PSKEVNNADGVKEVKGGEMSRLSERVAPKNP
        W+EVQAEINITKKERRKIAQEL+FGTRVEK+KKGL P+  +NLEEYL+Y+EAKL QLKPLVLD+ S   P++E+   +       G+  +LSERVAPKNP
Subjt:  WKEVQAEINITKKERRKIAQELEFGTRVEKKKKGLVPLTSVNLEEYLSYKEAKLAQLKPLVLDNPSSF-PSKEVNNADGVKEVKGGEMSRLSERVAPKNP

Query:  KWAVYGRGLEDVSEFFNSGQYQPADRKSEGPRKLFSKEEKVMLNKRVPDLASAHSDMWLPLHTLVGSGEFYLVDELLKNNIDINAV---GFTALHRAILA
        +WAVYG+GLEDV+++FNSG Y+P  +KSEGPRKLFSKEEK +LNKR PD+A+A S  WLPLHTL  SGEFYL+D LLK+N DIN V   G++ALH+AIL 
Subjt:  KWAVYGRGLEDVSEFFNSGQYQPADRKSEGPRKLFSKEEKVMLNKRVPDLASAHSDMWLPLHTLVGSGEFYLVDELLKNNIDINAV---GFTALHRAILA

Query:  KKQAITNYLLRESANPFVRDKDGATLMHYAVQTASSQAIKILLLYNVDINLQDKDGWTPLHLAVQARRTDVVRLLLIKGADKSLKNAGGLTPLDICLYSG
        K QAITNYLLRESANPFVRDKDG TLMHYAV+TASSQAIKILLLYNVD+NLQD DGWTPLHLAVQ RRTDVVRLLLIKGA+K+LKN  GLTPLD+CLYSG
Subjt:  KKQAITNYLLRESANPFVRDKDGATLMHYAVQTASSQAIKILLLYNVDINLQDKDGWTPLHLAVQARRTDVVRLLLIKGADKSLKNAGGLTPLDICLYSG

Query:  QDTKTYELIKLLKQLPR
        QDT+TYELIKLLK LP+
Subjt:  QDTKTYELIKLLKQLPR

TYI41085.1 hypothetical protein ES332_A02G209700v1 [Gossypium tomentosum]6.9e-23164.36Show/hide
Query:  SRSPSPWRAPSRSRSRSRSRSRPRSRSRSWSRP----RSPSRGRSRSRSRGRDHAHNPGNTLYVTGLSTRVTERDLEEHFSKEGKVVSCFLVVEPRTRIS
        S+SPSPWR  SRSRSRSR RSR RSRSRSWSRP    RS SRGRSRSRSRGR  A NPGNTLYVTGLS RVTERDLEEHFSKEGKV SCFLVVEPRTRIS
Subjt:  SRSPSPWRAPSRSRSRSRSRSRPRSRSRSWSRP----RSPSRGRSRSRSRGRDHAHNPGNTLYVTGLSTRVTERDLEEHFSKEGKVVSCFLVVEPRTRIS

Query:  RGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRTPTPGHYLGLKSTRDSGYRGDRGSRYRGGGSFRDDYGYRRSPRRSPYRGAREYSPRHS
        RGFAFVTMD+V+DA+RC+K+LNQSILEGR+ITVE+SRRKRPRTPTPGHYLGLK+TRD G RG+RG RYRGGG  RDDYGYRRSP RSPYR  R+YSPR+S
Subjt:  RGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRTPTPGHYLGLKSTRDSGYRGDRGSRYRGGGSFRDDYGYRRSPRRSPYRGAREYSPRHS

Query:  PPYGGRSRRDHSRSPPYSPPPYGGSPDRRYPRGSSPN----------------RNLTGLGLQPNRMKSLPFSPSVGHCNRKSIGATMSLAASLNIPVAKV
         P+GGRSRR+ S SPPYS             RGSS N                  L+ LGL        P   S     ++    ++  ++ L  P  K+
Subjt:  PPYGGRSRRDHSRSPPYSPPPYGGSPDRRYPRGSSPN----------------RNLTGLGLQPNRMKSLPFSPSVGHCNRKSIGATMSLAASLNIPVAKV

Query:  FFLSNSPLSLPNSNKLSFISLLRPRNRVSSVSSSSHSSYSIQPD----ELEDFVIGDCVIFEDGVFDDPYLSDDSSAEHSSPST--GKSKPKSSVAEIKP
           S   LS P S      +L  PR ++ SVS S  SSY  Q +    + E+ VIGDC++FEDG F+DPYL  DS  ++ + +T   K+KPK +  EI+ 
Subjt:  FFLSNSPLSLPNSNKLSFISLLRPRNRVSSVSSSSHSSYSIQPD----ELEDFVIGDCVIFEDGVFDDPYLSDDSSAEHSSPST--GKSKPKSSVAEIKP

Query:  ENLVPDEWKEVQAEINITKKERRKIAQELEFGTRVEKKKKGLVPLTSVNLEEYLSYKEAKLAQLKPLVLDNPSSFPSKEVNNADGVKEVKGGEMSRLSER
        ENLVP++W  V  E+NITKKERRK+AQ+LEFG RVEKKK+GLVP+ ++N +EYL YKEAKLAQL P+VLDNPS+FP KE N  D +++      S  SER
Subjt:  ENLVPDEWKEVQAEINITKKERRKIAQELEFGTRVEKKKKGLVPLTSVNLEEYLSYKEAKLAQLKPLVLDNPSSFPSKEVNNADGVKEVKGGEMSRLSER

Query:  VAPKNPKWAVYGRGLEDVSEFFNSGQYQPADRKSEGPRKLFSKEEKVMLNKRVPDLASAHSDMWLPLHTLVGSGEFYLVDELLKNNIDINAV---GFTAL
        VAPKNP+WAVY +G +DV+EFFNSG YQ A++ ++G RKLF+KEEK+MLN+RVPDLA+A S  WLPLHTL  SGEFYLVD LLK+N+DINAV   GFTA+
Subjt:  VAPKNPKWAVYGRGLEDVSEFFNSGQYQPADRKSEGPRKLFSKEEKVMLNKRVPDLASAHSDMWLPLHTLVGSGEFYLVDELLKNNIDINAV---GFTAL

Query:  HRAILAKKQAITNYLLRESANPFVRDKDGATLMHYAVQTASSQAIKILLLYNVDINLQDKDGWTPLHLAVQARRTDVVRLLLIKGADKSLKNAGGLTPLD
        H+AI+ KKQAITNYLLRESANPFVRD+DGATLMHYAV  AS+  IK+LLLYNVDINLQD DGWTPLHLAVQARRTDVV+LLLI+GA+K LKN  GLTPLD
Subjt:  HRAILAKKQAITNYLLRESANPFVRDKDGATLMHYAVQTASSQAIKILLLYNVDINLQDKDGWTPLHLAVQARRTDVVRLLLIKGADKSLKNAGGLTPLD

Query:  ICLYSGQDTKTYELIKLLKQLPRP
        +CLY G+D +TYELIK+LKQLP+P
Subjt:  ICLYSGQDTKTYELIKLLKQLPRP

XP_012472254.1 PREDICTED: ankyrin repeat domain-containing protein, chloroplastic [Gossypium raimondii]6.4e-22965.25Show/hide
Query:  SRSPSPWRAPSRSRSRSRSRSRPRSRSRSWSRP----RSPSRGRSRSRSRGRDHAHNPGNTLYVTGLSTRVTERDLEEHFSKEGKVVSCFLVVEPRTRIS
        S SPSPWR  SRSRSRSR RSR RSRSRSWSRP    RS SRGRSRSRSRGR  A NPGNTLYVTGLS RVTERDLEEHFSKEGKV SCFLVVEPRTRIS
Subjt:  SRSPSPWRAPSRSRSRSRSRSRPRSRSRSWSRP----RSPSRGRSRSRSRGRDHAHNPGNTLYVTGLSTRVTERDLEEHFSKEGKVVSCFLVVEPRTRIS

Query:  RGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRTPTPGHYLGLKSTRDSGYRGDRGSRYRGGGSFRDDYGYRRSPRRSPYRGAREYSPRHS
        RGFAFVTMD+V+DA+RC+K+LNQSILEGRYITVE+SRRKRPRTPTPGHYLGLK+TRD G RG+RG RYRGGG  RDDYGY RSPRRSPYRG R+YSPR+S
Subjt:  RGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRTPTPGHYLGLKSTRDSGYRGDRGSRYRGGGSFRDDYGYRRSPRRSPYRGAREYSPRHS

Query:  PPYGGRSRRDHSRSPPYSPPPYGGSPDRRYPRGSSPNRNLTGLGLQPNRMKSLPFSPSVGHCNRKSIGATMSLAASLNIPVAKVFFLSNSPLSLPNSNKL
         P+GGRSRR+ S SPPYS             RGS           +  +  SLP   S+       I +T  L             LS  P  L   + L
Subjt:  PPYGGRSRRDHSRSPPYSPPPYGGSPDRRYPRGSSPNRNLTGLGLQPNRMKSLPFSPSVGHCNRKSIGATMSLAASLNIPVAKVFFLSNSPLSLPNSNKL

Query:  SFISLLRPRNRVSSVSSSSHSSYSIQPD----ELEDFVIGDCVIFEDGVFDDPYLSDDSSAEHSSPST--GKSKPKSSVAEIKPENLVPDEWKEVQAEIN
         F     PR ++ SVS S  SSY  Q +    + E+ VIGDC++FEDG F+DPYL  DS  ++ + +T   K+KPK +  EI+ ENLVP++W  V  E+N
Subjt:  SFISLLRPRNRVSSVSSSSHSSYSIQPD----ELEDFVIGDCVIFEDGVFDDPYLSDDSSAEHSSPST--GKSKPKSSVAEIKPENLVPDEWKEVQAEIN

Query:  ITKKERRKIAQELEFGTRVEKKKKGLVPLTSVNLEEYLSYKEAKLAQLKPLVLDNPSSFPSKEVNNADGVKEVKGGEMSRLSERVAPKNPKWAVYGRGLE
        ITKKERRK+AQ+LEFG RVEKKK+GLVP+ ++N +EYL YKEAKLAQL P+VLDNP +FP KE N  D +++      S  SERVAPKNP+WAVY +G +
Subjt:  ITKKERRKIAQELEFGTRVEKKKKGLVPLTSVNLEEYLSYKEAKLAQLKPLVLDNPSSFPSKEVNNADGVKEVKGGEMSRLSERVAPKNPKWAVYGRGLE

Query:  DVSEFFNSGQYQPADRKSEGPRKLFSKEEKVMLNKRVPDLASAHSDMWLPLHTLVGSGEFYLVDELLKNNIDINAV---GFTALHRAILAKKQAITNYLL
        DV+EFFNSG YQ A++ ++G RKLF+KEEK+MLN+RVPDLA+A S  WLPLHTL  SGEFYLVD LLK+N+DINAV   G TA+H+AI+ KKQAITNYLL
Subjt:  DVSEFFNSGQYQPADRKSEGPRKLFSKEEKVMLNKRVPDLASAHSDMWLPLHTLVGSGEFYLVDELLKNNIDINAV---GFTALHRAILAKKQAITNYLL

Query:  RESANPFVRDKDGATLMHYAVQTASSQAIKILLLYNVDINLQDKDGWTPLHLAVQARRTDVVRLLLIKGADKSLKNAGGLTPLDICLYSGQDTKTYELIK
        RESANPFVRD+DGATLMHYAV  AS+  IK+LLLYNVDINLQD DGWTPLHLAVQARRTDVV+LLLI+GA+K LKN  GLTPLD+CLY G+D +TYELIK
Subjt:  RESANPFVRDKDGATLMHYAVQTASSQAIKILLLYNVDINLQDKDGWTPLHLAVQARRTDVVRLLLIKGADKSLKNAGGLTPLDICLYSGQDTKTYELIK

Query:  LLKQLPRP
        +LKQLP+P
Subjt:  LLKQLPRP

TrEMBL top hitse value%identityAlignment
A0A061DF79 Ankyrin repeat protein1.7e-23065.27Show/hide
Query:  SRSPSPWRAPSRSRSRSRSRSRPRSRSRSWSRP----RSPSRGRSRSRSRGRDHAHNPGNTLYVTGLSTRVTERDLEEHFSKEGKVVSCFLVVEPRTRIS
        S+SPSPWR  SRSRSRSRSRSR RSRSRSWSRP    RS SRGRSRSRSRGR    NPGNTLYVTGLS RV+ERD+EEHFSKEGKVVSCFLVVEPRTRIS
Subjt:  SRSPSPWRAPSRSRSRSRSRSRPRSRSRSWSRP----RSPSRGRSRSRSRGRDHAHNPGNTLYVTGLSTRVTERDLEEHFSKEGKVVSCFLVVEPRTRIS

Query:  RGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRTPTPGHYLGLKSTRDSGYRGDRGSRYRGGGSFRDDYGYRRSPRRSPYRGAREYSPRHS
        RGFAFVTMDNV+DANRC+K+LNQS+LEGRYITVE+SRRKRPRTPTPGHYLGLK+TRD G RGDRG RYRGG   RDD GYRRSPRRSPYRG R+YSPR+S
Subjt:  RGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRTPTPGHYLGLKSTRDSGYRGDRGSRYRGGGSFRDDYGYRRSPRRSPYRGAREYSPRHS

Query:  PPYGGRSRRDHSRSPPYSPPPYGGSPDRRYPRGSSPNRNLTGLGLQPNRMKSLPFSPSVGHCNRKSIGATMSLAASLNIPVAKVFFLSNSPLSLPNSNKL
         P+ GRSRR+ S SP              Y RGS        L L+              H     +      A SL  P   +   + + LSLP     
Subjt:  PPYGGRSRRDHSRSPPYSPPPYGGSPDRRYPRGSSPNRNLTGLGLQPNRMKSLPFSPSVGHCNRKSIGATMSLAASLNIPVAKVFFLSNSPLSLPNSNKL

Query:  SFISLLRPRNRVSSVSSSSHSSYSIQPDELED--FVIGDCVIFEDGVFDDPYLSDDSSA---EHSSPSTGKSKPK--------SSVAEIKPENLVPDEWK
           +L  PR ++ S+S S  SSY  Q D+++D   VIGDC++FEDG F+DP+L  DS++    ++S   GK+KPK            +I+ ENLVP++W+
Subjt:  SFISLLRPRNRVSSVSSSSHSSYSIQPDELED--FVIGDCVIFEDGVFDDPYLSDDSSA---EHSSPSTGKSKPK--------SSVAEIKPENLVPDEWK

Query:  EVQAEINITKKERRKIAQELEFGTRVEKKKKGLVPLTSVNLEEYLSYKEAKLAQLKPLVLDNPSSFPSK--EVNNADGVKEVKGGEMSRLSERVAPKNPK
         VQ EIN+TKKERRK+AQ LEFG+RVEKK++GLVP+ +VN++EYL YKEAKLAQL P+VLDNPSSFP K  E  N +G KE +   +   SERVAPKNP+
Subjt:  EVQAEINITKKERRKIAQELEFGTRVEKKKKGLVPLTSVNLEEYLSYKEAKLAQLKPLVLDNPSSFPSK--EVNNADGVKEVKGGEMSRLSERVAPKNPK

Query:  WAVYGRGLEDVSEFFNSGQYQPADRKSEGPRKLFSKEEKVMLNKRVPDLASAHSDMWLPLHTLVGSGEFYLVDELLKNNIDINAV---GFTALHRAILAK
        WAVYGRG EDV+E FNS  YQP + KSEG RKLF+KEEK MLN+RVPDLA+A S  WLPLHTL  SGEFYLVD LLK+N+DINAV   G TA+H+AI+ K
Subjt:  WAVYGRGLEDVSEFFNSGQYQPADRKSEGPRKLFSKEEKVMLNKRVPDLASAHSDMWLPLHTLVGSGEFYLVDELLKNNIDINAV---GFTALHRAILAK

Query:  KQAITNYLLRESANPFVRDKDGATLMHYAVQTASSQAIKILLLYNVDINLQDKDGWTPLHLAVQARRTDVVRLLLIKGADKSLKNAGGLTPLDICLYSGQ
        KQAITNYLLRESANPFVRD+DGATLMHYAV+TAS+  IK+LLLYNVDINLQD DGW+PLHLAVQARRTD+V+LLLI+GADK LKN  GLTPLD+CLYSG+
Subjt:  KQAITNYLLRESANPFVRDKDGATLMHYAVQTASSQAIKILLLYNVDINLQDKDGWTPLHLAVQARRTDVVRLLLIKGADKSLKNAGGLTPLDICLYSGQ

Query:  DTKTYELIKLLKQLPRP
        DT+TYELIKLLKQLP+P
Subjt:  DTKTYELIKLLKQLPRP

A0A444WUQ6 Uncharacterized protein3.9e-22460.65Show/hide
Query:  SRSPSPWRAPSRSRSRSRSRSRPRSRSRSWSRPRSPSRGRSRSRSRGRDHAH-------NPGNTLYVTGLSTRVTERDLEEHFSKEGKVVSCFLVVEPRT
        SRSPSPWRA SRSRSRSRSRSRPRSRSRS S PR   R RSRSR R R  +H       NPGNTLYVTGLS+RVTERDLEEHFSKEGKV SCFLVVEPRT
Subjt:  SRSPSPWRAPSRSRSRSRSRSRPRSRSRSWSRPRSPSRGRSRSRSRGRDHAH-------NPGNTLYVTGLSTRVTERDLEEHFSKEGKVVSCFLVVEPRT

Query:  RISRGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRTPTPGHYLGLKSTRDSGYRGDRGS---RYRGGGSFRDDYGYRRSPRRSPYRGARE
        RISRGFAF+TMD V+DANRC+K+LNQS+LEGRYITVE+SRRKRPRTPTPGHYLGLKSTRD G+RGDRG+   RYRGG   RDDY YRRSPRRSPYRG R+
Subjt:  RISRGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRTPTPGHYLGLKSTRDSGYRGDRGS---RYRGGGSFRDDYGYRRSPRRSPYRGARE

Query:  YSPRHSP-PYGGRSRRDHSRSPPYSPPPYGGSPDRRYPRGSSPNRNLTGLGLQPNRMKSLPFSPSVGHCNRKSIGATMS-LAASLNIPVAKVFFLSNSPL
        YSPR SP   GGRSRR+ SRSPPYSP    GSPDRRY RGS     +         +   P+ P     +   I  T S + A  ++P +    L + P 
Subjt:  YSPRHSP-PYGGRSRRDHSRSPPYSPPPYGGSPDRRYPRGSSPNRNLTGLGLQPNRMKSLPFSPSVGHCNRKSIGATMS-LAASLNIPVAKVFFLSNSPL

Query:  SLPNSNKLSFISLLRP--------RNRVSSVSSSS---------------------------------HS---SYSIQPDELEDFVIGDCVIFEDGVFDD
             ++ SF S+ +         R  V+SV +SS                                 HS   S+    +E ++ VIGDC+++E+G+FDD
Subjt:  SLPNSNKLSFISLLRP--------RNRVSSVSSSS---------------------------------HS---SYSIQPDELEDFVIGDCVIFEDGVFDD

Query:  PYLSDDSSAEHS------SPSTGKSKPKSSVAEIKPENLVPDEWKEVQAEINITKKERRKIAQELEFGTRVEKKKKGLVPLTSVNLEEYLSYKEAKLAQL
        P +S + +  HS      +P+  K   K  V EI  ENLVPD+W+EVQAE+NITKKERRKIAQE+EF ++V KK++GLVPL  +NL+EY +YKEAKLAQL
Subjt:  PYLSDDSSAEHS------SPSTGKSKPKSSVAEIKPENLVPDEWKEVQAEINITKKERRKIAQELEFGTRVEKKKKGLVPLTSVNLEEYLSYKEAKLAQL

Query:  KPLVLDNPSSFPSKE---VNNA----DGVKEVKGGEMSRLSERVAPKNPKWAVYGRGLEDVSEFFNSGQYQPAD---RKSEGPRKLFSKEEKVMLNKRVP
        KPLVLDNPS FP KE   + N     DG     GG     S RV PKNPKWAVYG+GL++V+EFFNS  Y PA    + +EG R+LF+KEEK +LNKRVP
Subjt:  KPLVLDNPSSFPSKE---VNNA----DGVKEVKGGEMSRLSERVAPKNPKWAVYGRGLEDVSEFFNSGQYQPAD---RKSEGPRKLFSKEEKVMLNKRVP

Query:  DLASAHSDMWLPLHTLVGSGEFYLVDELLKNNIDINAV---GFTALHRAILAKKQAITNYLLRESANPFVRDKDGATLMHYAVQTASSQAIKILLLYNVD
        DL +A S+ WLPLHTL   GEFYL+D L+K+N+DINAV   G TALH+AI+ KKQAITNYLLR SANPFV D+DGATL+HYAVQTASSQ IKILLLYNVD
Subjt:  DLASAHSDMWLPLHTLVGSGEFYLVDELLKNNIDINAV---GFTALHRAILAKKQAITNYLLRESANPFVRDKDGATLMHYAVQTASSQAIKILLLYNVD

Query:  INLQDKDGWTPLHLAVQARRTDVVRLLLIKGADKSLKNAGGLTPLDICLYSGQDTKTYELIKLLK
        +NLQD DGWTPLHLA+QA+RTD+VRLLLIKGADK++KN  GLTPLD+CLYSGQ  +TYELIKLLK
Subjt:  INLQDKDGWTPLHLAVQARRTDVVRLLLIKGADKSLKNAGGLTPLDICLYSGQDTKTYELIKLLK

A0A498HZC2 Phospholipase D7.7e-23665.55Show/hide
Query:  YSRSPSPWRAPSRSRSRSRSRSRPRSRSRSWSRP-----RSPSRGRSRSRSRGRDHAHNPGNTLYVTGLSTRVTERDLEEHFSKEGKVVSCFLVVEPRTR
        YSRSPSPWRA  +SRSRSRS SRPRSRSRS SRP     RS SRGRSRSRS GR  A NPGNTLYVTGLSTRV+E+D+E HFSKEGKV SCFLV+EPRTR
Subjt:  YSRSPSPWRAPSRSRSRSRSRSRPRSRSRSWSRP-----RSPSRGRSRSRSRGRDHAHNPGNTLYVTGLSTRVTERDLEEHFSKEGKVVSCFLVVEPRTR

Query:  ISRGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRTPTPGHYLGLKSTRDSGYRGDRG-----------SRYRGGGSFRDDYGYRRSPRRS
        ISRGFAF+TMD+V+DA RC+K+LNQS+LEGRYITVE+SRRKRPRTPTPGHYLGLK+TRD GYRGDR             RYR GGS RDDYGYRRSPRRS
Subjt:  ISRGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRTPTPGHYLGLKSTRDSGYRGDRG-----------SRYRGGGSFRDDYGYRRSPRRS

Query:  PYRGAREYSPRHSPPYGGRSRRDHSRSPPYSPPPYGGSPDRRYPRGSSPNRNLTGLGLQPNRMKSLPFSPSVGHCNRKSIGATMSLAASLNIPVAKVFFL
        PYRG R+YSPR   PYGGRSRR+ S S      PYG           SP RN+          K++P S      N +S                   FL
Subjt:  PYRGAREYSPRHSPPYGGRSRRDHSRSPPYSPPPYGGSPDRRYPRGSSPNRNLTGLGLQPNRMKSLPFSPSVGHCNRKSIGATMSLAASLNIPVAKVFFL

Query:  SNSPLSLPNS-NKLSFISLLRPRNRVSSVSSSSHSSYSIQPDEL-EDFVIGDCVIFEDGVFDDPYLSDDSSAEHSSPSTGKSKPKSSVAEIKPENLVPDE
           PLSLP +  KLS  + L+   ++  +  S  SSYS+Q D   ED VIGDC++FE+GVFDDPYL +D+  +   P   K+KP+   +EI  ENLVPD+
Subjt:  SNSPLSLPNS-NKLSFISLLRPRNRVSSVSSSSHSSYSIQPDEL-EDFVIGDCVIFEDGVFDDPYLSDDSSAEHSSPSTGKSKPKSSVAEIKPENLVPDE

Query:  WKEVQAEINITKKERRKIAQELEFGTRVEKKKKGLVPLTSVNLEEYLSYKEAKLAQLKPLVLDNPSSF-PSKEVNNADGVKEVKGGEMSRLSERVAPKNP
        W+EVQAEINITKKERRKIAQEL+FGTRVEK+KKGL P+  +NLEEYL+Y+EAKL QLKPLVLD+ S   P++E+   +       G+  +LSERVAPKNP
Subjt:  WKEVQAEINITKKERRKIAQELEFGTRVEKKKKGLVPLTSVNLEEYLSYKEAKLAQLKPLVLDNPSSF-PSKEVNNADGVKEVKGGEMSRLSERVAPKNP

Query:  KWAVYGRGLEDVSEFFNSGQYQPADRKSEGPRKLFSKEEKVMLNKRVPDLASAHSDMWLPLHTLVGSGEFYLVDELLKNNIDINAV---GFTALHRAILA
        +WAVYG+GLEDV+++FNSG Y+P  +KSEGPRKLFSKEEK +LNKR PD+A+A S  WLPLHTL  SGEFYL+D LLK+N DIN V   G++ALH+AIL 
Subjt:  KWAVYGRGLEDVSEFFNSGQYQPADRKSEGPRKLFSKEEKVMLNKRVPDLASAHSDMWLPLHTLVGSGEFYLVDELLKNNIDINAV---GFTALHRAILA

Query:  KKQAITNYLLRESANPFVRDKDGATLMHYAVQTASSQAIKILLLYNVDINLQDKDGWTPLHLAVQARRTDVVRLLLIKGADKSLKNAGGLTPLDICLYSG
        K QAITNYLLRESANPFVRDKDG TLMHYAV+TASSQAIKILLLYNVD+NLQD DGWTPLHLAVQ RRTDVVRLLLIKGA+K+LKN  GLTPLD+CLYSG
Subjt:  KKQAITNYLLRESANPFVRDKDGATLMHYAVQTASSQAIKILLLYNVDINLQDKDGWTPLHLAVQARRTDVVRLLLIKGADKSLKNAGGLTPLDICLYSG

Query:  QDTKTYELIKLLKQLPR
        QDT+TYELIKLLK LP+
Subjt:  QDTKTYELIKLLKQLPR

A0A5D2RMM9 Uncharacterized protein3.3e-23164.36Show/hide
Query:  SRSPSPWRAPSRSRSRSRSRSRPRSRSRSWSRP----RSPSRGRSRSRSRGRDHAHNPGNTLYVTGLSTRVTERDLEEHFSKEGKVVSCFLVVEPRTRIS
        S+SPSPWR  SRSRSRSR RSR RSRSRSWSRP    RS SRGRSRSRSRGR  A NPGNTLYVTGLS RVTERDLEEHFSKEGKV SCFLVVEPRTRIS
Subjt:  SRSPSPWRAPSRSRSRSRSRSRPRSRSRSWSRP----RSPSRGRSRSRSRGRDHAHNPGNTLYVTGLSTRVTERDLEEHFSKEGKVVSCFLVVEPRTRIS

Query:  RGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRTPTPGHYLGLKSTRDSGYRGDRGSRYRGGGSFRDDYGYRRSPRRSPYRGAREYSPRHS
        RGFAFVTMD+V+DA+RC+K+LNQSILEGR+ITVE+SRRKRPRTPTPGHYLGLK+TRD G RG+RG RYRGGG  RDDYGYRRSP RSPYR  R+YSPR+S
Subjt:  RGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRTPTPGHYLGLKSTRDSGYRGDRGSRYRGGGSFRDDYGYRRSPRRSPYRGAREYSPRHS

Query:  PPYGGRSRRDHSRSPPYSPPPYGGSPDRRYPRGSSPN----------------RNLTGLGLQPNRMKSLPFSPSVGHCNRKSIGATMSLAASLNIPVAKV
         P+GGRSRR+ S SPPYS             RGSS N                  L+ LGL        P   S     ++    ++  ++ L  P  K+
Subjt:  PPYGGRSRRDHSRSPPYSPPPYGGSPDRRYPRGSSPN----------------RNLTGLGLQPNRMKSLPFSPSVGHCNRKSIGATMSLAASLNIPVAKV

Query:  FFLSNSPLSLPNSNKLSFISLLRPRNRVSSVSSSSHSSYSIQPD----ELEDFVIGDCVIFEDGVFDDPYLSDDSSAEHSSPST--GKSKPKSSVAEIKP
           S   LS P S      +L  PR ++ SVS S  SSY  Q +    + E+ VIGDC++FEDG F+DPYL  DS  ++ + +T   K+KPK +  EI+ 
Subjt:  FFLSNSPLSLPNSNKLSFISLLRPRNRVSSVSSSSHSSYSIQPD----ELEDFVIGDCVIFEDGVFDDPYLSDDSSAEHSSPST--GKSKPKSSVAEIKP

Query:  ENLVPDEWKEVQAEINITKKERRKIAQELEFGTRVEKKKKGLVPLTSVNLEEYLSYKEAKLAQLKPLVLDNPSSFPSKEVNNADGVKEVKGGEMSRLSER
        ENLVP++W  V  E+NITKKERRK+AQ+LEFG RVEKKK+GLVP+ ++N +EYL YKEAKLAQL P+VLDNPS+FP KE N  D +++      S  SER
Subjt:  ENLVPDEWKEVQAEINITKKERRKIAQELEFGTRVEKKKKGLVPLTSVNLEEYLSYKEAKLAQLKPLVLDNPSSFPSKEVNNADGVKEVKGGEMSRLSER

Query:  VAPKNPKWAVYGRGLEDVSEFFNSGQYQPADRKSEGPRKLFSKEEKVMLNKRVPDLASAHSDMWLPLHTLVGSGEFYLVDELLKNNIDINAV---GFTAL
        VAPKNP+WAVY +G +DV+EFFNSG YQ A++ ++G RKLF+KEEK+MLN+RVPDLA+A S  WLPLHTL  SGEFYLVD LLK+N+DINAV   GFTA+
Subjt:  VAPKNPKWAVYGRGLEDVSEFFNSGQYQPADRKSEGPRKLFSKEEKVMLNKRVPDLASAHSDMWLPLHTLVGSGEFYLVDELLKNNIDINAV---GFTAL

Query:  HRAILAKKQAITNYLLRESANPFVRDKDGATLMHYAVQTASSQAIKILLLYNVDINLQDKDGWTPLHLAVQARRTDVVRLLLIKGADKSLKNAGGLTPLD
        H+AI+ KKQAITNYLLRESANPFVRD+DGATLMHYAV  AS+  IK+LLLYNVDINLQD DGWTPLHLAVQARRTDVV+LLLI+GA+K LKN  GLTPLD
Subjt:  HRAILAKKQAITNYLLRESANPFVRDKDGATLMHYAVQTASSQAIKILLLYNVDINLQDKDGWTPLHLAVQARRTDVVRLLLIKGADKSLKNAGGLTPLD

Query:  ICLYSGQDTKTYELIKLLKQLPRP
        +CLY G+D +TYELIK+LKQLP+P
Subjt:  ICLYSGQDTKTYELIKLLKQLPRP

W9SCJ5 Ankyrin repeat domain-containing protein9.4e-21865.2Show/hide
Query:  RSRSRSRGRDHAHNPGNTLYVTGLSTRVTERDLEEHFSKEGKVVSCFLVVEPRTRISRGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRT
        RSRSRS GR  A N GNTLYVTGLSTRVTERDLE+HFSKEGKV SCFLVVEPRTRISRGFAFVTMD+V+ A+RCVK+LNQS+LEGRYITVE+SRRKRPRT
Subjt:  RSRSRSRGRDHAHNPGNTLYVTGLSTRVTERDLEEHFSKEGKVVSCFLVVEPRTRISRGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRT

Query:  PTPGHYLGLKSTRDSGYRGDRGSRYRGGGSFRDDYGYRRSPRRSPYRGAREYSPRHSPPYGGRSRRDHSRSPPYSPPPYGGSPDRRYPRGSSPNRNLTGL
        PTPGHYLGLKSTRD GYRGDR      GGS RDDYGYRRSPRRSPYRG R++SPR S PYGGRSRRD SRS      PYG              + L  L
Subjt:  PTPGHYLGLKSTRDSGYRGDRGSRYRGGGSFRDDYGYRRSPRRSPYRGAREYSPRHSPPYGGRSRRDHSRSPPYSPPPYGGSPDRRYPRGSSPNRNLTGL

Query:  GLQPNRMKSLPFSPSVGHCNRKSIGATMSLAASLN-IPVAKVF-FLSNSPLSLPNSNKLSFISLLRPRNRVSSVSSSSHSSYSIQPDELEDFVIGDCVIF
             R K+L   P V   N++     M L+  LN  P + +F  L  + LSL +S+  S + L   +    S+SSS            +D+VIGDCV+F
Subjt:  GLQPNRMKSLPFSPSVGHCNRKSIGATMSLAASLN-IPVAKVF-FLSNSPLSLPNSNKLSFISLLRPRNRVSSVSSSSHSSYSIQPDELEDFVIGDCVIF

Query:  EDGVFDDPYLSDDSSAEHSSPSTGKSKPKSSVAEIKP-ENLVPDEWKEVQAEINITKKERRKIAQELEFGTRVEKKKKGLVPLTSVNLEEYLSYKEAKLA
        EDG FDDPYL      +    +  K + K   A I P ENLVPD+W+EVQAEINITKKERRKIAQELEFGTRVEKK+ GL PL +VNLEEYL+++EAKL+
Subjt:  EDGVFDDPYLSDDSSAEHSSPSTGKSKPKSSVAEIKP-ENLVPDEWKEVQAEINITKKERRKIAQELEFGTRVEKKKKGLVPLTSVNLEEYLSYKEAKLA

Query:  QLKPLVLDNPSSFPSKEVNNADGVKEVKGGEMSRLSERVAPKNPKWAVYGRGLEDVSEFFNSGQYQPADRKSEGPRKLFSKEEKVMLNKRVPDLASAHSD
        QLKPLVLDNP        N  +G  E   GE    S RV+P+NP+WAVYG+G + V+EFFN G Y PA+RKS GPRKLF+KEEKV+LNKR+ DL +A SD
Subjt:  QLKPLVLDNPSSFPSKEVNNADGVKEVKGGEMSRLSERVAPKNPKWAVYGRGLEDVSEFFNSGQYQPADRKSEGPRKLFSKEEKVMLNKRVPDLASAHSD

Query:  MWLPLHTLVGSGEFYLVDELLKNNIDINAV---GFTALHRAILAKKQAITNYLLRESANPFVRDKDGATLMHYAVQTASSQAIKILLLYNVDINLQDKDG
         WLPLHTL  SG FYLVD LLK+N+DINAV   G TA+H+AI+ KKQAITNYLLRESANPFV DKDGATLMHYAV+TAS QAIKILLLYNVD+NL D DG
Subjt:  MWLPLHTLVGSGEFYLVDELLKNNIDINAV---GFTALHRAILAKKQAITNYLLRESANPFVRDKDGATLMHYAVQTASSQAIKILLLYNVDINLQDKDG

Query:  WTPLHLAVQARRTDVVRLLLIKGADKSLKNAGGLTPLDICLYSGQDTKTYELIKLLKQLPR
        WTPLHLAVQARR DV++LLLIKGADK+LKN  GLTPLD+CLYSG+DT+TYELIK+LK LP+
Subjt:  WTPLHLAVQARRTDVVRLLLIKGADKSLKNAGGLTPLDICLYSGQDTKTYELIKLLKQLPR

SwissProt top hitse value%identityAlignment
P62995 Transformer-2 protein homolog beta2.3e-1942.57Show/hide
Query:  SRSPSPWRAPSRSRSRSRSRSR-PRSRSRSWSRPRSPSRGRSRSRSRGRDHAHNPGNT----------------LYVTGLSTRVTERDLEEHFSKEGKVV
        SRS S  R+ SRSRSR  SR    RSRSRS S  RS SR  SR   R   H+H+P +T                L V GLS   TERDL E FSK G + 
Subjt:  SRSPSPWRAPSRSRSRSRSRSR-PRSRSRSWSRPRSPSRGRSRSRSRGRDHAHNPGNT----------------LYVTGLSTRVTERDLEEHFSKEGKVV

Query:  SCFLVVEPRTRISRGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRTPTPGHYLGLKS-----TRDSGYRG-DRG--------SRYRGGGS
           +V + ++R SRGFAFV  +NVDDA    +  N   L+GR I V+ S  KRP TPTPG Y+G  +      RD   RG DRG          YRGGG 
Subjt:  SCFLVVEPRTRISRGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRTPTPGHYLGLKS-----TRDSGYRG-DRG--------SRYRGGGS

Query:  FRDDYGYRRSPRRSPYRGAREYSPRHSPPYGGRSRRDHSRSPPYSPPPY
             G+R +  R      R  SP  SP Y     R  SRS  YSP  Y
Subjt:  FRDDYGYRRSPRRSPYRGAREYSPRHSPPYGGRSRRDHSRSPPYSPPPY

Q05753 Ankyrin repeat domain-containing protein, chloroplastic3.3e-13560.1Show/hide
Query:  ISLLRPRNRVSSVSSSSHS----------SYSIQ----PDELEDFVIGDCVIFEDGVFDDPYLSDD----SSAEHSSPSTGKSKPKSSVAEIKPENLVPD
        ISLL PR   S +S S HS          SYS Q    PD  +DF++GDC+++EDGVF+DPYL  +    +  E      G +K +   +EI+PENLVP+
Subjt:  ISLLRPRNRVSSVSSSSHS----------SYSIQ----PDELEDFVIGDCVIFEDGVFDDPYLSDD----SSAEHSSPSTGKSKPKSSVAEIKPENLVPD

Query:  EWKEVQAEINITKKERRKIAQELEFGTRVEKKKKGLVPLTSVNLEEYLSYKEAKLAQLKPLVLDNPSSFPSKEVNNADGVKEVKGGEMSRLSERVAPKNP
        EW+++QAE+N+TKK++RKIAQE+EFG RVEKK++GL+PL  V+L ++L+YKEAKLAQL+P++LD P +F      ++DG   V     S  SERVAPKNP
Subjt:  EWKEVQAEINITKKERRKIAQELEFGTRVEKKKKGLVPLTSVNLEEYLSYKEAKLAQLKPLVLDNPSSFPSKEVNNADGVKEVKGGEMSRLSERVAPKNP

Query:  KWAVYGRGLEDVSEFFNSGQYQPADRKSEGPRKLFSKEEKVMLNKRVPDLASAHSDMWLPLHTLVGSGEFYLVDELLKNNIDINAV---GFTALHRAILA
        +WAVYG+G + V++FFNS +Y P+D+KS+GPRKL SKEEK MLN R PDLA A S  WLPLHTL   GEFYLVD LLK+N+DINA    G T LHRAI+ 
Subjt:  KWAVYGRGLEDVSEFFNSGQYQPADRKSEGPRKLFSKEEKVMLNKRVPDLASAHSDMWLPLHTLVGSGEFYLVDELLKNNIDINAV---GFTALHRAILA

Query:  KKQAITNYLLRESANPFVRDKDGATLMHYAVQTASSQAIKILLLYNVDINLQDKDGWTPLHLAVQARRTDVVRLLLIKGADKSLKNAGGLTPLDICLYSG
        KKQAITNYLLRESANPFV D +GATLMHYAVQTAS+  IK+LLLYN DIN QD+DGWTPLH+AVQARR+D+V+LLLIKGAD  +KN  GLTPL +CLY G
Subjt:  KKQAITNYLLRESANPFVRDKDGATLMHYAVQTASSQAIKILLLYNVDINLQDKDGWTPLHLAVQARRTDVVRLLLIKGADKSLKNAGGLTPLDICLYSG

Query:  QDTKTYELIKLLKQLP
        ++ +TYE++KLLK+ P
Subjt:  QDTKTYELIKLLKQLP

Q3ZBT6 Transformer-2 protein homolog beta2.3e-1942.57Show/hide
Query:  SRSPSPWRAPSRSRSRSRSRSR-PRSRSRSWSRPRSPSRGRSRSRSRGRDHAHNPGNT----------------LYVTGLSTRVTERDLEEHFSKEGKVV
        SRS S  R+ SRSRSR  SR    RSRSRS S  RS SR  SR   R   H+H+P +T                L V GLS   TERDL E FSK G + 
Subjt:  SRSPSPWRAPSRSRSRSRSRSR-PRSRSRSWSRPRSPSRGRSRSRSRGRDHAHNPGNT----------------LYVTGLSTRVTERDLEEHFSKEGKVV

Query:  SCFLVVEPRTRISRGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRTPTPGHYLGLKS-----TRDSGYRG-DRG--------SRYRGGGS
           +V + ++R SRGFAFV  +NVDDA    +  N   L+GR I V+ S  KRP TPTPG Y+G  +      RD   RG DRG          YRGGG 
Subjt:  SCFLVVEPRTRISRGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRTPTPGHYLGLKS-----TRDSGYRG-DRG--------SRYRGGGS

Query:  FRDDYGYRRSPRRSPYRGAREYSPRHSPPYGGRSRRDHSRSPPYSPPPY
             G+R +  R      R  SP  SP Y     R  SRS  YSP  Y
Subjt:  FRDDYGYRRSPRRSPYRGAREYSPRHSPPYGGRSRRDHSRSPPYSPPPY

Q84TH4 Serine/arginine-rich splicing factor SR45a1.3e-3346.39Show/hide
Query:  PSRSRSRSRSRSRPRSRSRSWSRPRSPSRGRSRSRSRGR-------DHAHNPGNTLYVTGLSTRVTERDLEEHFSKEGKVVSCFLVVEPRTRISRGFAFV
        P R  +RS      RSR      P    RGRS SRS  R         A NPGN+LYVTGLS RVTERDLE+HF+KEGKV    LV++P TR SRGF F+
Subjt:  PSRSRSRSRSRSRPRSRSRSWSRPRSPSRGRSRSRSRGR-------DHAHNPGNTLYVTGLSTRVTERDLEEHFSKEGKVVSCFLVVEPRTRISRGFAFV

Query:  TMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRTPTPGHYLGLKSTRD-------SGYRGDRGSRYRG-------GGSFRDDYGYR-RSPRRSP-YR
        +M +V DANRC++ L+ S+L+GR ITVEK+RR+R RTPTPG YLGL++ R        S  R    SR R        G S+   YG R RS   SP YR
Subjt:  TMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRTPTPGHYLGLKSTRD-------SGYRGDRGSRYRG-------GGSFRDDYGYR-RSPRRSP-YR

Query:  GAREYSPRHSPPYGGR--SRRDHSRSPPYSPPPYGGSPDRRYPRGSSPNRNLTGLGLQPNRMK
          R YSP  SP    R   RRD S SP Y          RR  R  S +RN       P  M+
Subjt:  GAREYSPRHSPPYGGR--SRRDHSRSPPYSPPPYGGSPDRRYPRGSSPNRNLTGLGLQPNRMK

Q9SQK3 Ankyrin repeat domain-containing protein EMB506, chloroplastic7.3e-4245.41Show/hide
Query:  KLFSKEEKVMLNK-RVPDLASAHSDMWLPLHTLVGSGEFYLVDELLKNNIDINAV---GFTALHRAILAKKQAITNYLLRESANPFVRDKDGATLMHYAV
        +L   EE+V+L +   P+L    +  W PL TL  S +  L+D L++N +DI+ V     TALH+AI+ KK+A+ ++LLR+ ANP ++D+DGA  +HYAV
Subjt:  KLFSKEEKVMLNK-RVPDLASAHSDMWLPLHTLVGSGEFYLVDELLKNNIDINAV---GFTALHRAILAKKQAITNYLLRESANPFVRDKDGATLMHYAV

Query:  QTASSQAIKILLLYNVDINLQDKDGWTPLHLAVQARRTDVVRLLLIKGADKSLKNAGGLTPLDICLYSGQDTKTYELIKLLKQLP
        Q  + Q +K+L  YNVD+N+ D +GWTPLH+AVQ+R  D+ ++LL  GADK+ +   G   LD+ L  G+D K+Y+L+KLLK +P
Subjt:  QTASSQAIKILLLYNVDINLQDKDGWTPLHLAVQARRTDVVRLLLIKGADKSLKNAGGLTPLDICLYSGQDTKTYELIKLLKQLP

Arabidopsis top hitse value%identityAlignment
AT4G35785.1 RNA-binding (RRM/RBD/RNP motifs) family protein1.3e-6267.23Show/hide
Query:  SRSPSP--WRAPSRSRSRSRSRSRP--RSRSRSWSRPRSPSRGRSRSRSRGR-DHAHNPGNTLYVTGLSTRVTERDLEEHFSKEGKVVSCFLVVEPRTRI
        SRSPSP   RA SRSRSRSRSRSRP  RSRSRS  RP SPSR R RSRSR R     NPG TLYVTGLSTRVT++DLE HF+KEGKV SCFLV+EPRTR+
Subjt:  SRSPSP--WRAPSRSRSRSRSRSRP--RSRSRSWSRPRSPSRGRSRSRSRGR-DHAHNPGNTLYVTGLSTRVTERDLEEHFSKEGKVVSCFLVVEPRTRI

Query:  SRGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRTPTPGHYLGLKSTRDSGYRGDRGSRYRGGGSFRDDYGYRRSPRRSPYRGAREYSPRH
        SRGFAFVTM ++ DA RC+K+LNQS+LEGRYITVE+SRRKRPRTPTPGHYLGLKS+RDS   G R S  RG    RDDY  RRSPR       R+YSPR 
Subjt:  SRGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRTPTPGHYLGLKSTRDSGYRGDRGSRYRGGGSFRDDYGYRRSPRRSPYRGAREYSPRH

Query:  SPPYGGRSRRDHSRSPPYSPPPYGGSPDRRYPRGS
              RSRRD S S      P+G SP+RR  R S
Subjt:  SPPYGGRSRRDHSRSPPYSPPPYGGSPDRRYPRGS

AT4G35785.2 RNA-binding (RRM/RBD/RNP motifs) family protein1.8e-6467.8Show/hide
Query:  SRSPSP--WRAPSRSRSRSRSRSRPRSRSRSWSRPR----SPSRGRSRSRSRGRDHAHNPGNTLYVTGLSTRVTERDLEEHFSKEGKVVSCFLVVEPRTR
        SRSPSP   RA SRSRSRSRSRSRPR RSRS S PR    S SRGRSRSRSRGR    NPG TLYVTGLSTRVT++DLE HF+KEGKV SCFLV+EPRTR
Subjt:  SRSPSP--WRAPSRSRSRSRSRSRPRSRSRSWSRPR----SPSRGRSRSRSRGRDHAHNPGNTLYVTGLSTRVTERDLEEHFSKEGKVVSCFLVVEPRTR

Query:  ISRGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRTPTPGHYLGLKSTRDSGYRGDRGSRYRGGGSFRDDYGYRRSPRRSPYRGAREYSPR
        +SRGFAFVTM ++ DA RC+K+LNQS+LEGRYITVE+SRRKRPRTPTPGHYLGLKS+RDS   G R S  RG    RDDY  RRSPR       R+YSPR
Subjt:  ISRGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEKSRRKRPRTPTPGHYLGLKSTRDSGYRGDRGSRYRGGGSFRDDYGYRRSPRRSPYRGAREYSPR

Query:  HSPPYGGRSRRDHSRSPPYSPPPYGGSPDRRYPRGS
               RSRRD S S      P+G SP+RR  R S
Subjt:  HSPPYGGRSRRDHSRSPPYSPPPYGGSPDRRYPRGS

AT4G35785.3 RNA-binding (RRM/RBD/RNP motifs) family protein5.6e-4575.91Show/hide
Query:  SRSPSP--WRAPSRSRSRSRSRSRPRSRSRSWSRPR----SPSRGRSRSRSRGRDHAHNPGNTLYVTGLSTRVTERDLEEHFSKEGKVVSCFLVVEPRTR
        SRSPSP   RA SRSRSRSRSRSRPR RSRS S PR    S SRGRSRSRSRGR    NPG TLYVTGLSTRVT++DLE HF+KEGKV SCFLV+EPRTR
Subjt:  SRSPSP--WRAPSRSRSRSRSRSRPRSRSRSWSRPR----SPSRGRSRSRSRGRDHAHNPGNTLYVTGLSTRVTERDLEEHFSKEGKVVSCFLVVEPRTR

Query:  ISRGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEK
        +SRGFAFVTM ++ DA RC+K+LNQS+LEGRYITVE+
Subjt:  ISRGFAFVTMDNVDDANRCVKHLNQSILEGRYITVEK

AT5G66055.1 ankyrin repeat protein2.3e-13660.1Show/hide
Query:  ISLLRPRNRVSSVSSSSHS----------SYSIQ----PDELEDFVIGDCVIFEDGVFDDPYLSDD----SSAEHSSPSTGKSKPKSSVAEIKPENLVPD
        ISLL PR   S +S S HS          SYS Q    PD  +DF++GDC+++EDGVF+DPYL  +    +  E      G +K +   +EI+PENLVP+
Subjt:  ISLLRPRNRVSSVSSSSHS----------SYSIQ----PDELEDFVIGDCVIFEDGVFDDPYLSDD----SSAEHSSPSTGKSKPKSSVAEIKPENLVPD

Query:  EWKEVQAEINITKKERRKIAQELEFGTRVEKKKKGLVPLTSVNLEEYLSYKEAKLAQLKPLVLDNPSSFPSKEVNNADGVKEVKGGEMSRLSERVAPKNP
        EW+++QAE+N+TKK++RKIAQE+EFG RVEKK++GL+PL  V+L ++L+YKEAKLAQL+P++LD P +F      ++DG   V     S  SERVAPKNP
Subjt:  EWKEVQAEINITKKERRKIAQELEFGTRVEKKKKGLVPLTSVNLEEYLSYKEAKLAQLKPLVLDNPSSFPSKEVNNADGVKEVKGGEMSRLSERVAPKNP

Query:  KWAVYGRGLEDVSEFFNSGQYQPADRKSEGPRKLFSKEEKVMLNKRVPDLASAHSDMWLPLHTLVGSGEFYLVDELLKNNIDINAV---GFTALHRAILA
        +WAVYG+G + V++FFNS +Y P+D+KS+GPRKL SKEEK MLN R PDLA A S  WLPLHTL   GEFYLVD LLK+N+DINA    G T LHRAI+ 
Subjt:  KWAVYGRGLEDVSEFFNSGQYQPADRKSEGPRKLFSKEEKVMLNKRVPDLASAHSDMWLPLHTLVGSGEFYLVDELLKNNIDINAV---GFTALHRAILA

Query:  KKQAITNYLLRESANPFVRDKDGATLMHYAVQTASSQAIKILLLYNVDINLQDKDGWTPLHLAVQARRTDVVRLLLIKGADKSLKNAGGLTPLDICLYSG
        KKQAITNYLLRESANPFV D +GATLMHYAVQTAS+  IK+LLLYN DIN QD+DGWTPLH+AVQARR+D+V+LLLIKGAD  +KN  GLTPL +CLY G
Subjt:  KKQAITNYLLRESANPFVRDKDGATLMHYAVQTASSQAIKILLLYNVDINLQDKDGWTPLHLAVQARRTDVVRLLLIKGADKSLKNAGGLTPLDICLYSG

Query:  QDTKTYELIKLLKQLP
        ++ +TYE++KLLK+ P
Subjt:  QDTKTYELIKLLKQLP

AT5G66055.2 ankyrin repeat protein9.3e-10958.71Show/hide
Query:  ISLLRPRNRVSSVSSSSHS----------SYSIQ----PDELEDFVIGDCVIFEDGVFDDPYLSDD----SSAEHSSPSTGKSKPKSSVAEIKPENLVPD
        ISLL PR   S +S S HS          SYS Q    PD  +DF++GDC+++EDGVF+DPYL  +    +  E      G +K +   +EI+PENLVP+
Subjt:  ISLLRPRNRVSSVSSSSHS----------SYSIQ----PDELEDFVIGDCVIFEDGVFDDPYLSDD----SSAEHSSPSTGKSKPKSSVAEIKPENLVPD

Query:  EWKEVQAEINITKKERRKIAQELEFGTRVEKKKKGLVPLTSVNLEEYLSYKEAKLAQLKPLVLDNPSSFPSKEVNNADGVKEVKGGEMSRLSERVAPKNP
        EW+++QAE+N+TKK++RKIAQE+EFG RVEKK++GL+PL  V+L ++L+YKEAKLAQL+P++LD P +F      ++DG   V     S  SERVAPKNP
Subjt:  EWKEVQAEINITKKERRKIAQELEFGTRVEKKKKGLVPLTSVNLEEYLSYKEAKLAQLKPLVLDNPSSFPSKEVNNADGVKEVKGGEMSRLSERVAPKNP

Query:  KWAVYGRGLEDVSEFFNSGQYQPADRKSEGPRKLFSKEEKVMLNKRVPDLASAHSDMWLPLHTLVGSGEFYLVDELLKNNIDINAV---GFTALHRAILA
        +WAVYG+G + V++FFNS +Y P+D+KS+GPRKL SKEEK MLN R PDLA A S  WLPLHTL   GEFYLVD LLK+N+DINA    G T LHRAI+ 
Subjt:  KWAVYGRGLEDVSEFFNSGQYQPADRKSEGPRKLFSKEEKVMLNKRVPDLASAHSDMWLPLHTLVGSGEFYLVDELLKNNIDINAV---GFTALHRAILA

Query:  KKQAITNYLLRESANPFVRDKDGATLMHYAVQTASSQAIKILLLYNVDINLQDKDG
        KKQAITNYLLRESANPFV D +GATLMHYAVQTAS+  IK+LLLYN DIN QD+ G
Subjt:  KKQAITNYLLRESANPFVRDKDGATLMHYAVQTASSQAIKILLLYNVDINLQDKDG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGTACTCTCGATCTCCTTCCCCCTGGAGAGCTCCTTCAAGGTCAAGATCTAGATCTAGATCACGATCTCGGCCCAGGTCCAGGTCGAGATCATGGTCAAGGCCAAG
ATCCCCAAGTCGTGGAAGATCAAGGTCCAGAAGTCGTGGGAGGGACCATGCCCACAACCCTGGGAATACACTTTATGTCACTGGTCTATCTACAAGGGTTACTGAAAGAG
ACCTTGAAGAGCACTTTTCCAAGGAGGGCAAGGTAGTTTCTTGTTTCCTTGTGGTTGAGCCTCGAACACGGATTTCCCGTGGATTTGCATTTGTTACGATGGACAATGTT
GATGATGCTAATCGGTGTGTTAAGCATCTCAACCAGTCAATCCTAGAAGGTCGATATATAACTGTGGAGAAGTCACGGAGGAAACGGCCAAGAACTCCAACGCCAGGACA
CTATTTGGGGTTGAAAAGCACCAGAGACTCTGGCTACCGTGGCGATCGAGGTAGTAGGTATCGTGGTGGCGGCTCCTTCCGCGATGACTATGGGTACCGTCGGTCTCCAA
GACGCTCACCATATCGAGGAGCTCGAGAGTACTCCCCTAGACACTCTCCCCCTTATGGTGGGAGATCAAGGAGGGATCACTCCAGGTCGCCTCCTTACTCTCCTCCTCCT
TATGGCGGCAGCCCGGATAGGAGGTATCCTCGTGGCTCTAGCCCAAACCGCAATCTTACGGGCCTAGGCCTGCAGCCCAATCGCATGAAATCTCTTCCGTTTTCGCCATC
GGTGGGGCATTGCAATCGGAAGTCCATCGGAGCAACCATGTCACTCGCAGCTTCTCTGAATATTCCCGTCGCTAAAGTCTTCTTCCTCTCGAATTCGCCGCTTTCTCTTC
CCAATTCCAACAAGCTTTCCTTCATCTCTCTTCTCAGGCCTCGAAACAGAGTTTCTTCTGTTTCCTCTTCTTCCCACTCCTCTTACTCTATCCAACCAGATGAGTTAGAA
GACTTCGTCATCGGCGACTGCGTTATTTTCGAAGACGGCGTGTTCGACGACCCATATCTGTCGGATGATTCGAGTGCGGAGCATTCCAGTCCGAGCACAGGGAAGTCTAA
GCCTAAGAGCAGCGTTGCGGAAATTAAGCCCGAAAATTTGGTGCCAGATGAGTGGAAAGAAGTGCAAGCGGAGATTAACATCACCAAGAAAGAGAGGCGGAAGATTGCTC
AGGAATTGGAGTTCGGTACTCGAGTTGAGAAGAAGAAGAAAGGGCTTGTTCCTTTGACAAGTGTGAATTTAGAGGAGTATTTATCGTATAAGGAGGCCAAGTTGGCGCAA
TTGAAGCCACTTGTTCTTGATAATCCTTCTAGTTTTCCGTCAAAGGAAGTTAACAATGCGGATGGTGTTAAGGAAGTGAAAGGAGGCGAAATGAGCAGGTTAAGCGAGCG
TGTGGCGCCAAAAAATCCCAAGTGGGCGGTTTATGGTAGGGGGCTGGAGGATGTTTCAGAATTCTTTAATAGTGGACAATACCAGCCTGCTGATAGAAAATCGGAAGGAC
CTCGCAAGTTATTTTCAAAAGAGGAAAAGGTTATGCTAAATAAAAGGGTGCCTGATCTTGCTTCTGCTCATTCTGACATGTGGTTGCCTCTACATACTCTTGTTGGATCT
GGGGAATTTTACCTTGTGGACGAACTACTGAAAAACAATATTGATATCAATGCCGTTGGTTTCACAGCTCTTCACAGAGCTATACTTGCAAAAAAGCAAGCCATAACTAA
TTATCTTTTAAGAGAAAGTGCCAATCCTTTTGTTCGCGATAAGGATGGGGCAACATTGATGCATTATGCTGTTCAAACTGCTTCGAGTCAGGCAATTAAAATTCTTCTGT
TGTATAATGTTGATATAAACCTTCAGGATAAAGATGGGTGGACACCACTGCATCTTGCTGTTCAGGCTCGTCGAACAGACGTAGTCAGGCTTTTATTGATTAAAGGAGCT
GATAAGTCACTAAAAAATGCGGGAGGTTTAACCCCATTGGACATTTGCCTCTATTCTGGTCAAGACACAAAAACTTATGAACTAATAAAGCTGTTGAAACAGCTTCCAAG
GCCAAATAAACTGCCTTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGATGTACTCTCGATCTCCTTCCCCCTGGAGAGCTCCTTCAAGGTCAAGATCTAGATCTAGATCACGATCTCGGCCCAGGTCCAGGTCGAGATCATGGTCAAGGCCAAG
ATCCCCAAGTCGTGGAAGATCAAGGTCCAGAAGTCGTGGGAGGGACCATGCCCACAACCCTGGGAATACACTTTATGTCACTGGTCTATCTACAAGGGTTACTGAAAGAG
ACCTTGAAGAGCACTTTTCCAAGGAGGGCAAGGTAGTTTCTTGTTTCCTTGTGGTTGAGCCTCGAACACGGATTTCCCGTGGATTTGCATTTGTTACGATGGACAATGTT
GATGATGCTAATCGGTGTGTTAAGCATCTCAACCAGTCAATCCTAGAAGGTCGATATATAACTGTGGAGAAGTCACGGAGGAAACGGCCAAGAACTCCAACGCCAGGACA
CTATTTGGGGTTGAAAAGCACCAGAGACTCTGGCTACCGTGGCGATCGAGGTAGTAGGTATCGTGGTGGCGGCTCCTTCCGCGATGACTATGGGTACCGTCGGTCTCCAA
GACGCTCACCATATCGAGGAGCTCGAGAGTACTCCCCTAGACACTCTCCCCCTTATGGTGGGAGATCAAGGAGGGATCACTCCAGGTCGCCTCCTTACTCTCCTCCTCCT
TATGGCGGCAGCCCGGATAGGAGGTATCCTCGTGGCTCTAGCCCAAACCGCAATCTTACGGGCCTAGGCCTGCAGCCCAATCGCATGAAATCTCTTCCGTTTTCGCCATC
GGTGGGGCATTGCAATCGGAAGTCCATCGGAGCAACCATGTCACTCGCAGCTTCTCTGAATATTCCCGTCGCTAAAGTCTTCTTCCTCTCGAATTCGCCGCTTTCTCTTC
CCAATTCCAACAAGCTTTCCTTCATCTCTCTTCTCAGGCCTCGAAACAGAGTTTCTTCTGTTTCCTCTTCTTCCCACTCCTCTTACTCTATCCAACCAGATGAGTTAGAA
GACTTCGTCATCGGCGACTGCGTTATTTTCGAAGACGGCGTGTTCGACGACCCATATCTGTCGGATGATTCGAGTGCGGAGCATTCCAGTCCGAGCACAGGGAAGTCTAA
GCCTAAGAGCAGCGTTGCGGAAATTAAGCCCGAAAATTTGGTGCCAGATGAGTGGAAAGAAGTGCAAGCGGAGATTAACATCACCAAGAAAGAGAGGCGGAAGATTGCTC
AGGAATTGGAGTTCGGTACTCGAGTTGAGAAGAAGAAGAAAGGGCTTGTTCCTTTGACAAGTGTGAATTTAGAGGAGTATTTATCGTATAAGGAGGCCAAGTTGGCGCAA
TTGAAGCCACTTGTTCTTGATAATCCTTCTAGTTTTCCGTCAAAGGAAGTTAACAATGCGGATGGTGTTAAGGAAGTGAAAGGAGGCGAAATGAGCAGGTTAAGCGAGCG
TGTGGCGCCAAAAAATCCCAAGTGGGCGGTTTATGGTAGGGGGCTGGAGGATGTTTCAGAATTCTTTAATAGTGGACAATACCAGCCTGCTGATAGAAAATCGGAAGGAC
CTCGCAAGTTATTTTCAAAAGAGGAAAAGGTTATGCTAAATAAAAGGGTGCCTGATCTTGCTTCTGCTCATTCTGACATGTGGTTGCCTCTACATACTCTTGTTGGATCT
GGGGAATTTTACCTTGTGGACGAACTACTGAAAAACAATATTGATATCAATGCCGTTGGTTTCACAGCTCTTCACAGAGCTATACTTGCAAAAAAGCAAGCCATAACTAA
TTATCTTTTAAGAGAAAGTGCCAATCCTTTTGTTCGCGATAAGGATGGGGCAACATTGATGCATTATGCTGTTCAAACTGCTTCGAGTCAGGCAATTAAAATTCTTCTGT
TGTATAATGTTGATATAAACCTTCAGGATAAAGATGGGTGGACACCACTGCATCTTGCTGTTCAGGCTCGTCGAACAGACGTAGTCAGGCTTTTATTGATTAAAGGAGCT
GATAAGTCACTAAAAAATGCGGGAGGTTTAACCCCATTGGACATTTGCCTCTATTCTGGTCAAGACACAAAAACTTATGAACTAATAAAGCTGTTGAAACAGCTTCCAAG
GCCAAATAAACTGCCTTCATGACAGAGATGATTAGTTGAGCTAACTCTTGCTTAGGCTCTGGAAGATCACCCATCATCGTGTTTGGTATGCCATAATGGTCAAAGTTTTT
ATTTGTGAAGCTGTTCACCATGGAAACCTACATCGGGTATGCAACTGTATCAACATCGAGTTCTTTGCTAGATCATTCTATCAGAGCAAATAGTACTATGGAAGTGACAT
CATTGCAGACCAATCTAGAAACATGTCTCTGCATGAGTGTTCCAGTCGGAGCAAATGCATCTACATCAATGACTGATTAATGATTATGGCACGGGAAGCTTGCTGAGTAT
CATATTCATATTCCTTTTTATGTTGAAAGTTGTAATAGCACTTTCCCTTTGTTTTTTATCTGTTGTAATTAATCAAGATTTGGAAGTATACATGCATATGTTTTATTGAC
GCAAGTTATGAACTTTATGGAGTGAACTCAAAGAGCTGTAGCTTTCTTATTGATGATATCTCTTCACAAGCATGAGGAGGTACAAAATATATTGAAGTGGGGAATGTGCT
TGATAATTTCTCTGATTATAGCTAGGTGGAGAATTATTTGCATTTTGATATTATAACTGTCATCATACGAATAATGAATACAGTTATTCTTTTCCTGTTGGTGAGCTATA
TTTACAGAAATAGCGCCTTGAGACTCCTGAGCTGGACTGACTACAGCAGCAGCGAGAATTACTGCAAAATCTGGTTCTCCTAATCCCACCTGCATAATGAAGGAAACCTA
GGGGTGTGTGGAAGACGAAATGAAGATGGCATCTTCTCTCCGTCACCCCACAATGCGTAGGGTTCTTCTCCGTGAGCAGCATCATCACCAATCTCCAGGTCCTCTTCTTT
CATCCGTAAGTTAACAATATGGCTAATCAGGTAACATATAATGCTGGTAAAAATGGCATTCCAGAGGGTAATAAACAGTGCTCCTGCGAACTGGATGCCGATTTGTTTGA
GGCCATCTTTGAGCTTTCTATCGGAGATAGCGTACAGGAAGCCTGGACTGTATAGGGAATCTGGATACATCATTCTCAGAAGGCTGGGTTTGGCAAAGAAACCTGTGAGA
AGGCCTCCCATGGAACCAGCCACAGCGTGCGTGTGGAAGACGCCCAGAGTATCGTCGACACACTGAAGGAATGCCAACCTTCTATGCAGCACCATCATGGTGTACCATGG
AACCAAACCAGCGAGGACGCCCATCACCACAGCTGCAAGAGTGTCTACAAGCCCTATTGTTCATGTAAACCATATATACCATGCTTTGATTTGGATTCTTCAGACAAGGC
CAGTGATGATCCCTTGAATAGCCCCAATCACAGAGCTTTTCTTGTATACTATGATGTCCATTGAAACCCAGATCAAGAGACTTGTTGCAGTGCAAACATGTGTGTTCAAA
ATGGCTAGCGATGTAATACGACCCACTGCAAATGGAGCTCCTCCATTGAATCCAGTCCAACCCATCCAAAGAAACCCTGCACCGCCAACCATGTGCAGTATGTTGTTTGG
TGGGAAGTTTTGCCTGTCTCGGGAGTGTCGTGGCCCGACCTGTGGGCATGGTCGCCAGATCAGAAACCATCATCACTACCAAAGCATATTGAGAAGGAAGATCACAGCAA
ATGGAAAGTAAAAACAAACTTGTTTTGCAGGTCACTTACCCAGTAAGCAGCAGTGAAACCAGCCACCCCAGAAGAAAGATGAATGACGTAGCCGCCGGCGTAGTCAATGA
GGTACTTCTGAATAAATCCATTCCCCCATATACAGAATGCGCCCACAGTGTAACAAAGAGTTAGCCACATTGGAACAAAAAGCATCCATGCACCAAAATTCATCCTTCCC
AGCAAGGACCCAGCGAGTAACACGACAGTAATGGCAGCAAAAGCAAACTGGAAAAAGACGTAGTCAGCCATCGGAAAACAGCCTTGTGTAGACTGTGAAAGAAGGTAACT
TTCGGACAGCGCTATGCCGGGCCTACCAATGAGCCAGAACAATCGGTCGCCAAATGCCATATGATGTGCCCACAACACCCAACAGACGAGCACTGCAGCAAAGCCATAGA
GGGCCATGAAGGCAGAGTTGACAGCCCATTTTTTCTTCACCATGCTGCCGTAGAGGATGACCAGCCCAGGAACGGTTTGGAGGCCGACCATGGTGGCTGCCGTGAGCTGC
CATGCATTGTCGGCCTTGTTGTTCCAGTAGGGAGTTGATTGTTCGGGATCAACGAACCCCGGGAGCAAGCCGTTGTCTTGAGGTATACAGGAACTCATAAGGGAGTAAGA
ATTTGATTATAAGATGGTAATATGAAAGCTCTATTTTCCTTTGAAAGTGTGGTGGGGATATAACATAAATATGATTCAGAGTTTGGCAGTTGGCTTCAGGATGTTCCATT
GTTATAATCTCCCT
Protein sequenceShow/hide protein sequence
MMYSRSPSPWRAPSRSRSRSRSRSRPRSRSRSWSRPRSPSRGRSRSRSRGRDHAHNPGNTLYVTGLSTRVTERDLEEHFSKEGKVVSCFLVVEPRTRISRGFAFVTMDNV
DDANRCVKHLNQSILEGRYITVEKSRRKRPRTPTPGHYLGLKSTRDSGYRGDRGSRYRGGGSFRDDYGYRRSPRRSPYRGAREYSPRHSPPYGGRSRRDHSRSPPYSPPP
YGGSPDRRYPRGSSPNRNLTGLGLQPNRMKSLPFSPSVGHCNRKSIGATMSLAASLNIPVAKVFFLSNSPLSLPNSNKLSFISLLRPRNRVSSVSSSSHSSYSIQPDELE
DFVIGDCVIFEDGVFDDPYLSDDSSAEHSSPSTGKSKPKSSVAEIKPENLVPDEWKEVQAEINITKKERRKIAQELEFGTRVEKKKKGLVPLTSVNLEEYLSYKEAKLAQ
LKPLVLDNPSSFPSKEVNNADGVKEVKGGEMSRLSERVAPKNPKWAVYGRGLEDVSEFFNSGQYQPADRKSEGPRKLFSKEEKVMLNKRVPDLASAHSDMWLPLHTLVGS
GEFYLVDELLKNNIDINAVGFTALHRAILAKKQAITNYLLRESANPFVRDKDGATLMHYAVQTASSQAIKILLLYNVDINLQDKDGWTPLHLAVQARRTDVVRLLLIKGA
DKSLKNAGGLTPLDICLYSGQDTKTYELIKLLKQLPRPNKLPS