; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004596 (gene) of Snake gourd v1 genome

Gene IDTan0004596
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein WHAT'S THIS FACTOR 1 homolog
Genome locationLG06:32902788..32907033
RNA-Seq ExpressionTan0004596
SyntenyTan0004596
Gene Ontology termsGO:0000373 - Group II intron splicing (biological process)
GO:0003723 - RNA binding (molecular function)
InterPro domainsIPR021099 - Plant organelle RNA recognition domain
IPR040293 - Protein WHAT'S THIS FACTOR 1-like
IPR045040 - PORR family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0051010.1 protein ROOT PRIMORDIUM DEFECTIVE 1 [Cucumis melo var. makuwa]2.0e-24189.33Show/hide
Query:  MSVPFILPHKAYFSENPSFSVKSDFWGKNLDLRHRNDSCFGSNLSKSHVPFQPIRAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELG
        +SVPFIL HK+Y      +SVKS FWGKNLDLR+RND     NL K+H PFQPIRA+VKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELG
Subjt:  MSVPFILPHKAYFSENPSFSVKSDFWGKNLDLRHRNDSCFGSNLSKSHVPFQPIRAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELG

Query:  KFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTIC
        KFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTD GLPLEFRDTIC
Subjt:  KFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTIC

Query:  HRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQ
        HRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSK EMRKISQFRDIPYISPYSDFSG+K+GT Q
Subjt:  HRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQ

Query:  KEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGAP
        KEKHACGVVHEIL+LTLEKR LVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFR RGA 
Subjt:  KEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGAP

Query:  KGDTDGGETNQPDDMGGEEWSDVDDLLG----DDDEFDDDDNDDDG--AFEDDWSDEDDTPQSFDGDRDGESVNIGARNQKQVNNLQKVGQSLLSPVLPD
         GDT GG+TNQP DM GEEWSDVD+LL     DDDEFDDD+ DDD   AFEDDWSDEDDTP SF+GD+DGES+NIG+R QKQVN+LQKVGQS LSPVLPD
Subjt:  KGDTDGGETNQPDDMGGEEWSDVDDLLG----DDDEFDDDDNDDDG--AFEDDWSDEDDTPQSFDGDRDGESVNIGARNQKQVNNLQKVGQSLLSPVLPD

Query:  GRPRER
        GRPRER
Subjt:  GRPRER

XP_008461525.1 PREDICTED: protein ROOT PRIMORDIUM DEFECTIVE 1 [Cucumis melo]8.9e-24289.86Show/hide
Query:  MSVPFILPHKAYFSENPSFSVKSDFWGKNLDLRHRNDSCFGSNLSKSHVPFQPIRAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELG
        +SVPFIL HK+Y      +SVKS FWGKNLDLR+RND     NL K+H PFQPIRA+VKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELG
Subjt:  MSVPFILPHKAYFSENPSFSVKSDFWGKNLDLRHRNDSCFGSNLSKSHVPFQPIRAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELG

Query:  KFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTIC
        KFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTD GLPLEFRDTIC
Subjt:  KFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTIC

Query:  HRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQ
        HRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSK EMRKISQFRDIPYISPYSDFSG+K+GT Q
Subjt:  HRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQ

Query:  KEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGAP
        KEKHACGVVHEIL+LTLEKR LVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFR RGA 
Subjt:  KEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGAP

Query:  KGDTDGGETNQPDDMGGEEWSDVDDLL-GDDDEFDDDDNDDDG--AFEDDWSDEDDTPQSFDGDRDGESVNIGARNQKQVNNLQKVGQSLLSPVLPDGRP
         GDT GG+TNQP DM GEEWSDVD+LL  DDDEFDDD+ DDD   AFEDDWSDEDDTP SF+GD+DGES+NIG+R QKQVN+LQKVGQS LSPVLPDGRP
Subjt:  KGDTDGGETNQPDDMGGEEWSDVDDLL-GDDDEFDDDDNDDDG--AFEDDWSDEDDTPQSFDGDRDGESVNIGARNQKQVNNLQKVGQSLLSPVLPDGRP

Query:  RER
        RER
Subjt:  RER

XP_022156569.1 protein ROOT PRIMORDIUM DEFECTIVE 1 [Momordica charantia]1.2e-23888.02Show/hide
Query:  MSVPFILPHKAYFSENPSFSVKS-DFWGKNLDLRHRNDSCFGSNLSKSHVPFQPIRAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKEL
        +SVPFILP K YFSENPSFS+KS +FWG+NLDLRHR DS   SNL K HVP QPIRA+VKRRKEL FDNVIQRDKKLKLVMRIRKILVQ+PDR+MSLKEL
Subjt:  MSVPFILPHKAYFSENPSFSVKS-DFWGKNLDLRHRNDSCFGSNLSKSHVPFQPIRAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKEL

Query:  GKFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTI
        GKFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLT EAERLYLEELKIRNEMEGLLV+KLRKLLMMS DKRILLEKIAHLRTD GLPLEFRDTI
Subjt:  GKFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTI

Query:  CHRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTS
        CHRYP YFRVVAT RGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSKGEMRKISQFRDIPY+SPYSDFS LK+GT 
Subjt:  CHRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTS

Query:  QKEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGA
        QKEKHACGVVHEILSLTLEKR LVDHLTHFREEFRFSQQLRGMLIRHPDMFY+SLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGA
Subjt:  QKEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGA

Query:  PKGDTDGGETNQPDDMGGEEWSDVDDLLGDDDEFDDDDNDDDGAFEDDWSDEDDTPQSFDGDRDGESVNIGARNQKQVNNLQKVGQSLLSPVLPDGRPRE
         K DTDG ETNQPDD+ GE+WSDVD+LLGDD   DDD++D D  ++D WSDEDDTP SFDGD DGE++NI +  QK+V+NLQKVGQSLL+PVLPDGR RE
Subjt:  PKGDTDGGETNQPDDMGGEEWSDVDDLLGDDDEFDDDDNDDDGAFEDDWSDEDDTPQSFDGDRDGESVNIGARNQKQVNNLQKVGQSLLSPVLPDGRPRE

Query:  R
        R
Subjt:  R

XP_023514273.1 protein WHAT'S THIS FACTOR 1 homolog [Cucurbita pepo subsp. pepo]2.5e-23687.62Show/hide
Query:  MSVPFILPHKAYFSENPSFSVKSDFWGKNLDLRHRNDSCFGSNLSKSHVPFQPIRAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELG
        MS+PF+LPH+ YF ENPS SVKS FWGKNLDLRHRND  FGS+L KS VPFQPIRAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQ PDRVMSLKELG
Subjt:  MSVPFILPHKAYFSENPSFSVKSDFWGKNLDLRHRNDSCFGSNLSKSHVPFQPIRAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELG

Query:  KFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTIC
        KFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLK KLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMS DKRILLEKIAHLRTD GLPLEFR+TIC
Subjt:  KFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTIC

Query:  HRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQ
        H YPQYFRVVAT RGPALELTHWDPELAVSA+ELAEEENRA ELEEKNLIIDRPLKFNRV+LPKGLN+SK EMR+I QFRDIPYISPYSDFSGLK+GT +
Subjt:  HRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQ

Query:  KEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGAP
        KEKHACGVVHEILSLTLEKR LVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYR+S LIDKDRLLIIKEKLRALVAVPR RGRGA 
Subjt:  KEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGAP

Query:  KGDTDGGETNQPDDMGGEEWSDVDDLLGDDDEFDDDDNDDDGAFEDDWSD-EDDTPQSFDGDRDGESVNIGARNQKQVNNLQKVGQSLLSPVLPDGRPRE
        K DTDGG+ +QPD M G+E SD+D+LL DDDEFDD   ++DGAFED+WSD EDDTP SF+GDRDGES+NI AR Q+QV+N QK+ QSLLSPVLPDGRPRE
Subjt:  KGDTDGGETNQPDDMGGEEWSDVDDLLGDDDEFDDDDNDDDGAFEDDWSD-EDDTPQSFDGDRDGESVNIGARNQKQVNNLQKVGQSLLSPVLPDGRPRE

Query:  R
        R
Subjt:  R

XP_038900012.1 protein WHAT'S THIS FACTOR 1 homolog, chloroplastic [Benincasa hispida]1.1e-23989.4Show/hide
Query:  MSVPFILPHKAYFSENPSFSVKSDFWGKNLDLRHRNDSCFGSNLSKSHVPFQPIRAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELG
        +SVPFIL HK+Y       SVKS+FWGKNL+ R+RNDS     L K HVPFQPIRA+VKRRKEL FDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELG
Subjt:  MSVPFILPHKAYFSENPSFSVKSDFWGKNLDLRHRNDSCFGSNLSKSHVPFQPIRAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELG

Query:  KFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTIC
        KFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTD GLPLEFRDTIC
Subjt:  KFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTIC

Query:  HRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQ
        HRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSK EMRKISQFRDIPYISPYSDFSG+K+GT Q
Subjt:  HRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQ

Query:  KEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGAP
        KEKHACGVVHEILSLTLEKR LVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLR+LVAVPRF+GRGAP
Subjt:  KEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGAP

Query:  KGDTDGGETNQPDDMGGEEWSDVDDLLGDDDEFDDDDNDDDGAFEDDWSDEDDTPQSFDGDRDGESVNIGARNQKQVNNLQKVGQSLLSPVLPDGRPRER
        K D DG +T+QPDDM GEEWSDVD++  DDD  D+ D+ DDGAFEDDW DEDDTP SFDGDRDGES+N G+R QKQVNNLQKVGQSLLSPVLPDGRPRER
Subjt:  KGDTDGGETNQPDDMGGEEWSDVDDLLGDDDEFDDDDNDDDGAFEDDWSDEDDTPQSFDGDRDGESVNIGARNQKQVNNLQKVGQSLLSPVLPDGRPRER

TrEMBL top hitse value%identityAlignment
A0A1S3CG79 protein ROOT PRIMORDIUM DEFECTIVE 14.3e-24289.86Show/hide
Query:  MSVPFILPHKAYFSENPSFSVKSDFWGKNLDLRHRNDSCFGSNLSKSHVPFQPIRAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELG
        +SVPFIL HK+Y      +SVKS FWGKNLDLR+RND     NL K+H PFQPIRA+VKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELG
Subjt:  MSVPFILPHKAYFSENPSFSVKSDFWGKNLDLRHRNDSCFGSNLSKSHVPFQPIRAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELG

Query:  KFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTIC
        KFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTD GLPLEFRDTIC
Subjt:  KFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTIC

Query:  HRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQ
        HRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSK EMRKISQFRDIPYISPYSDFSG+K+GT Q
Subjt:  HRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQ

Query:  KEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGAP
        KEKHACGVVHEIL+LTLEKR LVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFR RGA 
Subjt:  KEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGAP

Query:  KGDTDGGETNQPDDMGGEEWSDVDDLL-GDDDEFDDDDNDDDG--AFEDDWSDEDDTPQSFDGDRDGESVNIGARNQKQVNNLQKVGQSLLSPVLPDGRP
         GDT GG+TNQP DM GEEWSDVD+LL  DDDEFDDD+ DDD   AFEDDWSDEDDTP SF+GD+DGES+NIG+R QKQVN+LQKVGQS LSPVLPDGRP
Subjt:  KGDTDGGETNQPDDMGGEEWSDVDDLL-GDDDEFDDDDNDDDG--AFEDDWSDEDDTPQSFDGDRDGESVNIGARNQKQVNNLQKVGQSLLSPVLPDGRP

Query:  RER
        RER
Subjt:  RER

A0A5D3BY23 Protein ROOT PRIMORDIUM DEFECTIVE 19.6e-24289.33Show/hide
Query:  MSVPFILPHKAYFSENPSFSVKSDFWGKNLDLRHRNDSCFGSNLSKSHVPFQPIRAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELG
        +SVPFIL HK+Y      +SVKS FWGKNLDLR+RND     NL K+H PFQPIRA+VKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELG
Subjt:  MSVPFILPHKAYFSENPSFSVKSDFWGKNLDLRHRNDSCFGSNLSKSHVPFQPIRAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELG

Query:  KFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTIC
        KFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTD GLPLEFRDTIC
Subjt:  KFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTIC

Query:  HRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQ
        HRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSK EMRKISQFRDIPYISPYSDFSG+K+GT Q
Subjt:  HRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQ

Query:  KEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGAP
        KEKHACGVVHEIL+LTLEKR LVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFR RGA 
Subjt:  KEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGAP

Query:  KGDTDGGETNQPDDMGGEEWSDVDDLLG----DDDEFDDDDNDDDG--AFEDDWSDEDDTPQSFDGDRDGESVNIGARNQKQVNNLQKVGQSLLSPVLPD
         GDT GG+TNQP DM GEEWSDVD+LL     DDDEFDDD+ DDD   AFEDDWSDEDDTP SF+GD+DGES+NIG+R QKQVN+LQKVGQS LSPVLPD
Subjt:  KGDTDGGETNQPDDMGGEEWSDVDDLLG----DDDEFDDDDNDDDG--AFEDDWSDEDDTPQSFDGDRDGESVNIGARNQKQVNNLQKVGQSLLSPVLPD

Query:  GRPRER
        GRPRER
Subjt:  GRPRER

A0A6J1DTV5 protein ROOT PRIMORDIUM DEFECTIVE 15.8e-23988.02Show/hide
Query:  MSVPFILPHKAYFSENPSFSVKS-DFWGKNLDLRHRNDSCFGSNLSKSHVPFQPIRAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKEL
        +SVPFILP K YFSENPSFS+KS +FWG+NLDLRHR DS   SNL K HVP QPIRA+VKRRKEL FDNVIQRDKKLKLVMRIRKILVQ+PDR+MSLKEL
Subjt:  MSVPFILPHKAYFSENPSFSVKS-DFWGKNLDLRHRNDSCFGSNLSKSHVPFQPIRAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKEL

Query:  GKFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTI
        GKFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLT EAERLYLEELKIRNEMEGLLV+KLRKLLMMS DKRILLEKIAHLRTD GLPLEFRDTI
Subjt:  GKFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTI

Query:  CHRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTS
        CHRYP YFRVVAT RGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSKGEMRKISQFRDIPY+SPYSDFS LK+GT 
Subjt:  CHRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTS

Query:  QKEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGA
        QKEKHACGVVHEILSLTLEKR LVDHLTHFREEFRFSQQLRGMLIRHPDMFY+SLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGA
Subjt:  QKEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGA

Query:  PKGDTDGGETNQPDDMGGEEWSDVDDLLGDDDEFDDDDNDDDGAFEDDWSDEDDTPQSFDGDRDGESVNIGARNQKQVNNLQKVGQSLLSPVLPDGRPRE
         K DTDG ETNQPDD+ GE+WSDVD+LLGDD   DDD++D D  ++D WSDEDDTP SFDGD DGE++NI +  QK+V+NLQKVGQSLL+PVLPDGR RE
Subjt:  PKGDTDGGETNQPDDMGGEEWSDVDDLLGDDDEFDDDDNDDDGAFEDDWSDEDDTPQSFDGDRDGESVNIGARNQKQVNNLQKVGQSLLSPVLPDGRPRE

Query:  R
        R
Subjt:  R

A0A6J1H780 protein WHAT'S THIS FACTOR 1 homolog2.1e-23687.43Show/hide
Query:  MSVPFILPHKAYFSENPSFSVKSDFWGKNLDLRHRNDSCFGSNLSKSHVPFQPIRAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELG
        MS+PF+LPH+ YF ENPS SVKS+FWGKNLDLRHRND   GS+L KS VPFQPIRAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQ PDRVMSLKELG
Subjt:  MSVPFILPHKAYFSENPSFSVKSDFWGKNLDLRHRNDSCFGSNLSKSHVPFQPIRAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELG

Query:  KFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTIC
        KFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLK KLTAEAERLYLEELKIRNEMEGLLV+KLRKLLMMS DKRILLEKIAHLRTD GLPLEFR+TIC
Subjt:  KFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTIC

Query:  HRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQ
        H YPQYFRVVAT RGPALELTHWDPELAVSA+ELAEEENRA ELEEKNLIIDRPLKFNRV+LPKGLN+SK EMR+I QFRDIPYISPYSDFSGLK+GT +
Subjt:  HRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQ

Query:  KEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGAP
        KEKHACGVVHEILSLTLEKR LVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYR+SQLIDKDRLLIIKEKLRALVAVPR RGRGA 
Subjt:  KEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGAP

Query:  KGDTDGGETNQPDDMGGEEWSDVDDLLGDDDEFDDDDNDDDGAFEDDWSD-EDDTPQSFDGDRDGESVNIGARNQKQVNNLQKVGQSLLSPVLPDGRPRE
        K DTDGG+ +QPD M G+E SD+D+LL DDDEFDD   ++DGAFED+WSD EDDTP SF+GDRDGES+NI AR Q+QV+N QK+ QSLLSPVLPDGRPRE
Subjt:  KGDTDGGETNQPDDMGGEEWSDVDDLLGDDDEFDDDDNDDDGAFEDDWSD-EDDTPQSFDGDRDGESVNIGARNQKQVNNLQKVGQSLLSPVLPDGRPRE

Query:  R
        R
Subjt:  R

A0A6J1KW65 protein WHAT'S THIS FACTOR 1 homolog1.6e-23387.03Show/hide
Query:  MSVPFILPHKAYFSENPSFSVKSDFWGKNLDLRHRNDSCFGSNLSKSHVPFQPIRAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELG
        MS+PF+LPH+AYF ENPS SVKS+FWGKNLDLRHRND   GS+L KS VPFQPI AIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQ PDRVMSLKELG
Subjt:  MSVPFILPHKAYFSENPSFSVKSDFWGKNLDLRHRNDSCFGSNLSKSHVPFQPIRAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELG

Query:  KFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTIC
        KFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLK KLTAEAE LYLEELKIRNEMEGLLV+KLRKLLMMS DKRILLEKIAHLRTD GLPLEFR+TIC
Subjt:  KFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTIC

Query:  HRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQ
        H YPQYFRVVAT RGPALELTHWDPELAVSA+ELAEEENRA ELEEKNLIIDRPLKFNRV+LPKGLN+SK EMR+I QFRDIPYISPYSDFSGLK+GT +
Subjt:  HRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQ

Query:  KEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGAP
        KEKHACGVVHEILSLTLEKR LVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYR+SQLIDKDRLLIIKEKLRALVAVPR RGRGA 
Subjt:  KEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGAP

Query:  KGDTDGGETNQPDDMGGEEWSDVDDLLGDDDEFDDDDNDDDGAFEDDWSD-EDDTPQSFDGDRDGESVNIGARNQKQVNNLQKVGQSLLSPVLPDGRPRE
        K DTDGG+ +QPD M GEE SD+D+LL D DEFDD   ++DGAFED+WSD EDDTP SF+ DRDGES+NI AR Q QV+N QK+ QSLLSPVLPDGRPRE
Subjt:  KGDTDGGETNQPDDMGGEEWSDVDDLLGDDDEFDDDDNDDDGAFEDDWSD-EDDTPQSFDGDRDGESVNIGARNQKQVNNLQKVGQSLLSPVLPDGRPRE

Query:  R
        R
Subjt:  R

SwissProt top hitse value%identityAlignment
A0MFS5 Protein WHAT'S THIS FACTOR 1 homolog, chloroplastic1.7e-17166.19Show/hide
Query:  SVKSDFWGKNLDLRHRNDSCFGSNLSKSHVPFQPIRAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELGKFRRDLGLEKKRRLIALLK
        + +S F G+ L L   N   F S   K+ V  +P+RA VKRRKEL FD+V+QRDKKLKLV+ IRKILV QPDR+MSL+ LGK+RRDLGL+K+RR IALL+
Subjt:  SVKSDFWGKNLDLRHRNDSCFGSNLSKSHVPFQPIRAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELGKFRRDLGLEKKRRLIALLK

Query:  KFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTICHRYPQYFRVVATERGPALE
        K+P VFE+VEEGA+SL+FK+T+EAERLYL+E++IRNE+E +LV+KLRKL+MMS DKRILLEKI+HL+TD+GLPLEFRDTIC RYPQYFRVV T RGPALE
Subjt:  KFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTICHRYPQYFRVVATERGPALE

Query:  LTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQKEKHACGVVHEILSLTLEK
        LTHWDPELAVSAAEL+E++NR RE EE+NLIIDRP KFNRVKLP+GLNLSK E RKISQFRD+ YISPY DFS L+SGT +KEKHACGV+HE+LSLT EK
Subjt:  LTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQKEKHACGVVHEILSLTLEK

Query:  RVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGAPKGDTDGGET----------
        R LVDHLTHFREEFRFSQQLRGMLIRHPD+FYVSLKG+RDSVFLREAYR+S+LIDKD L ++KEK+RALV+VPRF  RG P+ D +G E           
Subjt:  RVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGAPKGDTDGGET----------

Query:  NQPDDMGGEEWSDVDDLLGDDDEFDDDDNDDDGAFEDDWSDEDDTPQSFDGDRDGE--SVNIG-ARNQKQVNNLQKVGQSLLSPVLPDGRPRER
         + ++   EEWSDVD  L    E +D  NDDDG + DD  +ED  P   D D D E  SV IG + + ++ ++ +K  + +L+PV PDG PRE+
Subjt:  NQPDDMGGEEWSDVDDLLGDDDEFDDDDNDDDGAFEDDWSDEDDTPQSFDGDRDGE--SVNIG-ARNQKQVNNLQKVGQSLLSPVLPDGRPRER

B6TTV8 Protein WHAT'S THIS FACTOR 1, chloroplastic6.1e-15365.05Show/hide
Query:  RAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELGKFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIR
        +A VKRRKE PFD VIQRDKKLKLV+++R ILV QPDRVMSL+ELG+FRRDLGL +KRRLIALL++FP VF+VVEEG +SL+F+LT  AERLYL+EL++R
Subjt:  RAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELGKFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIR

Query:  NEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTICHRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRP
        NE EGL V KLRKLLMMS +KRIL+EK+AHL+ D+GLP EFRDT+C RYPQYFRVV  +RGPALELTHWDPELAVSAAELAEEE+RARE EE+NLIIDRP
Subjt:  NEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTICHRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRP

Query:  LKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQKEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSL
        LKFNRV+LPKGL L++GE R+I++F+++PYISPY+DFS L+SG+ +KEKHACGVVHEILSLT+EKR LVDHLTHFREEFRFSQ LRGM+IRHPDMFYVS 
Subjt:  LKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQKEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSL

Query:  KGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGA----PKGDTDGGETNQPDDMGGEEWSDVDDLLGDDD----EFDDDDNDDDGAFED
        KGDRDSVFLREAY+DSQL++K++L+++KEK+RALVAVPRF  R A     + +   G     D +  EE+ D D+ L D +    E     +D D  + D
Subjt:  KGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGA----PKGDTDGGETNQPDDMGGEEWSDVDDLLGDDD----EFDDDDNDDDGAFED

Query:  DW-SDEDDTPQSFDGDRDGESVNIGARNQKQVNNLQKVGQSLLSPVLPDGRPRER
         W  + DD+P  F  D    ++ I   +            S   PV PDGRPRER
Subjt:  DW-SDEDDTPQSFDGDRDGESVNIGARNQKQVNNLQKVGQSLLSPVLPDGRPRER

Q65XL5 Protein WHAT'S THIS FACTOR 1 homolog, chloroplastic1.8e-15766.74Show/hide
Query:  RAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELGKFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIR
        +A VKRRKE+PFDNVIQRDKKLKLV+++R ILV  PDRVMSL++LG+FRRDLGL +KRRLIALLK+FP VFEVVEEG +SL+F+LT  AERLYL+EL ++
Subjt:  RAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELGKFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIR

Query:  NEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTICHRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRP
        NE EGL V KLRKLLMMS DKRIL+EKIAHL+ D+GLP EFRDTIC RYPQYFRVV  +RGP LELTHWDPELAVSAAE+AEEENRARE +E+NLIIDRP
Subjt:  NEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTICHRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELEEKNLIIDRP

Query:  LKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQKEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSL
        LKFNRVKLP+GL LS+GE R+++QF+++PYISPYSDFS L+SG+++KEKHACGVVHEILSLTLEKR LVDHLTHFREEFRFSQ LRGMLIRHPDMFYVSL
Subjt:  LKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQKEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSL

Query:  KGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGAPKGDTDGGETN----------QPDDMGGEEWSDVDDLLGDDDEFDDDDNDDDGAF
        KGDRDSVFLREAY++SQL++K +L+++KEK+RALVAVPRF  RG P    +   TN            +D   E  SD++DL+    E     +D D  +
Subjt:  KGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGAPKGDTDGGETN----------QPDDMGGEEWSDVDDLLGDDDEFDDDDNDDDGAF

Query:  EDDW-SDEDDTPQSFDGDRDGESVNIGARNQKQVNNLQKVGQSLLSPVLPDGRPRER
         D W  + DD+P  F+ D DG S+       K+  N          PV PDGRPRER
Subjt:  EDDW-SDEDDTPQSFDGDRDGESVNIGARNQKQVNNLQKVGQSLLSPVLPDGRPRER

Q689D6 Protein ROOT PRIMORDIUM DEFECTIVE 12.9e-3832.32Show/hide
Query:  PFQPIRAIVKRR---KELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELGKFRRDLGLE-KKRRLIALLKKFPAVFEVVEEGAFSLKF-KLTAEAE
        PF     I K++   ++  +DN ++ +KK++ V++   +++ QP+  +++  L    R LGL  K+    A L KFP VFE+ E     + + +LT +A 
Subjt:  PFQPIRAIVKRR---KELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELGKFRRDLGLE-KKRRLIALLKKFPAVFEVVEEGAFSLKF-KLTAEAE

Query:  RLYLEELKIRNEMEGLL------VIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTICHRYPQYFRVVATE--RGPALELTHWDPELAVSAAELAE
           L++  IR+E E +L      V +LRKL+MMS   RI LE +   RT+ GLP +F  ++  ++PQ+FR++  E  R   +E+   DP L++ A E   
Subjt:  RLYLEELKIRNEMEGLL------VIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTICHRYPQYFRVVATE--RGPALELTHWDPELAVSAAELAE

Query:  EENRARELEEKNLIID-RPLKFN-RVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSG--LKSGTSQK--EKHACGVVHEILSLTLEKRVLVDHLTHFR
           R RE+E +   ID   ++F+  V  P G  + K     + +++ +PY SPY D SG  L+S  +Q   EK +   +HE+LSLT+EK++ ++ + HFR
Subjt:  EENRARELEEKNLIID-RPLKFN-RVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSG--LKSGTSQK--EKHACGVVHEILSLTLEKRVLVDHLTHFR

Query:  EEFRFSQQLRGMLIRHPDMFYVSLKGD---RDSVFLREAYRDSQLIDKDRLLIIKEKLRALV
              ++L+  L++H  +FY+S +G+     +VFLRE Y+  +L++ + + + + +L  LV
Subjt:  EEFRFSQQLRGMLIRHPDMFYVSLKGD---RDSVFLREAYRDSQLIDKDRLLIIKEKLRALV

Q9ZUZ6 Protein WHAT'S THIS FACTOR 9, mitochondrial4.1e-2428.44Show/hide
Query:  VKRRKELPFDNV--IQRDKKLKLVMRIRKILVQQPDRVMSLKELGKFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLK-FKLTAEAERLYLEELKIR
        +K +++  FDN+  I R  +LK V+ ++  +VQ+P+R + +  + K  R   +  K  +   L+KFP++FE      ++L  F+LT EA  L  +E  + 
Subjt:  VKRRKELPFDNV--IQRDKKLKLVMRIRKILVQQPDRVMSLKELGKFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLK-FKLTAEAERLYLEELKIR

Query:  NEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTICHRYPQYFRVVATE---RGPALELTHWDPELAVSAAELAEEENRARELEEKNLII
              L  +L+KL++MS D  + L  +  ++  +GLP ++           FR V  E   +G A++    D  L+V      ++      LEE    I
Subjt:  NEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTICHRYPQYFRVVATE---RGPALELTHWDPELAVSAAELAEEENRARELEEKNLII

Query:  DRPLKFNRVKLP-KGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQKEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMF
        + PL       P KG  L       + +F+ +PY+SPY D+S L   +   EK   G +HE+L L +E       L   ++ F   Q++     RHP +F
Subjt:  DRPLKFNRVKLP-KGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQKEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMF

Query:  YVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEK
        Y+S+K    +  LRE YRD   ++   +L +++K
Subjt:  YVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEK

Arabidopsis top hitse value%identityAlignment
AT3G63090.1 Ubiquitin carboxyl-terminal hydrolase family protein1.8e-4639.17Show/hide
Query:  RKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELGKFRRDLGLEKKRRLIALLKKFPAVFEV----VEEGAFSLKFKLTAEAERLYL-EELKIRN
        +K+   D  I++DK+ KL  R+ K ++ +P +V+ L+ L K R  L L  K +  + ++  P++FE+    ++  +  ++F       R +L EE +I +
Subjt:  RKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELGKFRRDLGLEKKRRLIALLKKFPAVFEV----VEEGAFSLKFKLTAEAERLYL-EELKIRN

Query:  EMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTICHRYPQYFRVVA-TERGPA-LELTHWDPELAVSAAELAEEENRARELEEKNLIIDR
        E E LLV KL +LLMM+ DK I  +K+ H++ D G P +F   +  +YP YFR+    E G + LEL  W+P+ A S  EL     RA +   K  +  R
Subjt:  EMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTICHRYPQYFRVVA-TERGPA-LELTHWDPELAVSAAELAEEENRARELEEKNLIIDR

Query:  PLKFNRVKLPKGLNLSKGEMRKISQ-FRDIPYISPYSDFSGLKSGTSQKEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYV
        P  +N VKLP G  L K EMR+ ++ + +  YISPY D S L   + + EK   GVVHE+LSL+L KRV V  L  F +EFRFS     +  RH  +FY+
Subjt:  PLKFNRVKLPKGLNLSKGEMRKISQ-FRDIPYISPYSDFSGLKSGTSQKEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYV

Query:  SLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALV
        SLKG   +  LREAY+D +L+D+D LL IK+K   L+
Subjt:  SLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALV

AT4G01037.1 Ubiquitin carboxyl-terminal hydrolase family protein1.2e-17266.19Show/hide
Query:  SVKSDFWGKNLDLRHRNDSCFGSNLSKSHVPFQPIRAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELGKFRRDLGLEKKRRLIALLK
        + +S F G+ L L   N   F S   K+ V  +P+RA VKRRKEL FD+V+QRDKKLKLV+ IRKILV QPDR+MSL+ LGK+RRDLGL+K+RR IALL+
Subjt:  SVKSDFWGKNLDLRHRNDSCFGSNLSKSHVPFQPIRAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELGKFRRDLGLEKKRRLIALLK

Query:  KFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTICHRYPQYFRVVATERGPALE
        K+P VFE+VEEGA+SL+FK+T+EAERLYL+E++IRNE+E +LV+KLRKL+MMS DKRILLEKI+HL+TD+GLPLEFRDTIC RYPQYFRVV T RGPALE
Subjt:  KFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTICHRYPQYFRVVATERGPALE

Query:  LTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQKEKHACGVVHEILSLTLEK
        LTHWDPELAVSAAEL+E++NR RE EE+NLIIDRP KFNRVKLP+GLNLSK E RKISQFRD+ YISPY DFS L+SGT +KEKHACGV+HE+LSLT EK
Subjt:  LTHWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQKEKHACGVVHEILSLTLEK

Query:  RVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGAPKGDTDGGET----------
        R LVDHLTHFREEFRFSQQLRGMLIRHPD+FYVSLKG+RDSVFLREAYR+S+LIDKD L ++KEK+RALV+VPRF  RG P+ D +G E           
Subjt:  RVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGAPKGDTDGGET----------

Query:  NQPDDMGGEEWSDVDDLLGDDDEFDDDDNDDDGAFEDDWSDEDDTPQSFDGDRDGE--SVNIG-ARNQKQVNNLQKVGQSLLSPVLPDGRPRER
         + ++   EEWSDVD  L    E +D  NDDDG + DD  +ED  P   D D D E  SV IG + + ++ ++ +K  + +L+PV PDG PRE+
Subjt:  NQPDDMGGEEWSDVDDLLGDDDEFDDDDNDDDGAFEDDWSDEDDTPQSFDGDRDGE--SVNIG-ARNQKQVNNLQKVGQSLLSPVLPDGRPRER

AT4G08940.1 Ubiquitin carboxyl-terminal hydrolase family protein9.1e-4335.12Show/hide
Query:  RAIVKRRKELP-FDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELGKFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKI
        ++I     +LP  ++++ RD   + ++R ++ + +QP+R++ L + GK  R+LG  + R++   + K P +F+        +    T   E L  EE  +
Subjt:  RAIVKRRKELP-FDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELGKFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKI

Query:  RNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTICHRYPQYFRVVATERG-PALELTHWDPELAVSAAE--LAEEENRARELEEKNLI
           ME   V  +RKLLMM+ DKRILL KI H R   G+P +FRD +  +YP YFRVV    G   LEL +WD  LAVS  E     +E++A+        
Subjt:  RNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTICHRYPQYFRVVATERG-PALELTHWDPELAVSAAE--LAEEENRARELEEKNLI

Query:  IDRPLKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQKEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMF
          R  KF  VK  K L L + + RK++     P +SPYSD   L   + + EK+  G+VHE L+LTLEKR  + H+  F++EF  ++Q   ML +    F
Subjt:  IDRPLKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQKEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMF

Query:  YVSLKGDRDSVFLREAY-RDSQLIDKDRLLIIKEKL
        Y++      +VFL++AY  +  L+ KD  ++  EKL
Subjt:  YVSLKGDRDSVFLREAY-RDSQLIDKDRLLIIKEKL

AT5G21970.1 Ubiquitin carboxyl-terminal hydrolase family protein1.2e-4233.33Show/hide
Query:  RAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELGKFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIR
        R+  KR +EL  +   ++ K    V+ + ++L  + D +M+++   ++RR + L K  ++   ++K P +FE+ ++    L   LT   E L  E  K+ 
Subjt:  RAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELGKFRRDLGLEKKRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIR

Query:  NEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTICHRYPQYFRVVATERGPA-LELTHWDPELAVSAAELAEEENRARELEEKNLIIDR
         E        + + LMMS DK++ L+KI H R D GLPL+FR    + +PQ+F+VV    G   LEL  W+P  A++            ELE+K L I  
Subjt:  NEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTICHRYPQYFRVVATERGPA-LELTHWDPELAVSAAELAEEENRARELEEKNLIIDR

Query:  PLKFN--------RVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQKEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIR
          +           +K P           KI  F+   Y+SPY+D  GL++G+ + +K A  V+HE+LS TLEKR++ DHLTHFR EF   Q+L  + ++
Subjt:  PLKFN--------RVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQKEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIR

Query:  HPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRG---APKGDT---------DGGETNQPDDMGGEEWSDVDDLLGDDDEF
        H  +FYVS +G R SVFL E Y   +LI+K  L++ KEKL        +RGR        DT         + G  ++   +G E+  D DD++ DDDE 
Subjt:  HPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRG---APKGDT---------DGGETNQPDDMGGEEWSDVDDLLGDDDEF

Query:  DDDDNDDDGAFEDD
        D  + +D  A+E++
Subjt:  DDDDNDDDGAFEDD

AT5G62990.1 Ubiquitin carboxyl-terminal hydrolase family protein1.3e-5736.47Show/hide
Query:  DNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELGKFRRDLGLEKKRRLIALLKKFPAVFEV-----------VEEGAFSLKFKLTAEAERLYLEELKIRN
        D  + +  +++ V ++  +L+ +P   + ++ L K R  L +E    ++++++++P +FE+             +    L  +LT+ A  L ++EL +++
Subjt:  DNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELGKFRRDLGLEKKRRLIALLKKFPAVFEV-----------VEEGAFSLKFKLTAEAERLYLEELKIRN

Query:  EMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTICHRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELE-EKNLIIDRP
        E+   L  KL+KLLM+S+ +R+LL K+ H+  D G P  FR  +C+ YP  F+ V T  G ALEL   DPELA          N+    E ++ LI+DRP
Subjt:  EMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTICHRYPQYFRVVATERGPALELTHWDPELAVSAAELAEEENRARELE-EKNLIIDRP

Query:  LKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSG-LKSGTSQKEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVS
        LKF R+ L +GLNL +     + +FR+ P + PY   S  L S + + EK AC VV E+L LT+EKR L+DHLTHFR+EF    +LR +++RHP++FYVS
Subjt:  LKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSG-LKSGTSQKEKHACGVVHEILSLTLEKRVLVDHLTHFREEFRFSQQLRGMLIRHPDMFYVS

Query:  LKGDRDSVFLREAYRDS-QLIDKDRLLIIKEKLRALVAVPR-----FRGRGAPKGDTDGGETNQPDDMGGEEWSDVDDLLGDDDEFDDDDNDDDGAFEDD
        +KG RDSVFL EAY D+  L+DKD  L+I+E+L  L+   +      R +GA  GD    E  + D+   +  SD+DD   D  E   D  D    +  D
Subjt:  LKGDRDSVFLREAYRDS-QLIDKDRLLIIKEKLRALVAVPR-----FRGRGAPKGDTDGGETNQPDDMGGEEWSDVDDLLGDDDEFDDDDNDDDGAFEDD

Query:  WSDEDDTPQSFDGDRDGESVNIGAR
          D+DD     +   +GESV   +R
Subjt:  WSDEDDTPQSFDGDRDGESVNIGAR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCGTACCTTTCATTCTTCCGCACAAAGCTTATTTCTCTGAGAATCCCTCTTTTTCTGTCAAATCGGATTTTTGGGGTAAAAACTTGGATTTGCGGCACAGGAATGA
TAGTTGTTTCGGTAGTAATTTAAGCAAATCCCATGTCCCTTTTCAACCGATTAGAGCTATTGTGAAACGGAGGAAAGAGCTTCCGTTTGATAATGTTATACAGAGGGATA
AGAAGCTGAAGTTGGTTATGAGGATTAGGAAGATTCTAGTTCAACAACCTGATAGAGTTATGTCACTTAAGGAATTGGGTAAATTCAGGAGAGATTTGGGTCTGGAGAAA
AAGAGGAGGCTAATTGCTTTGTTGAAGAAGTTCCCTGCAGTGTTTGAGGTTGTGGAAGAAGGGGCATTTTCGTTGAAATTCAAGTTAACGGCAGAGGCCGAAAGGTTGTA
TTTGGAGGAGTTGAAGATCAGAAATGAGATGGAAGGTTTGTTGGTTATTAAGCTGAGGAAGCTACTGATGATGTCGGCCGATAAACGAATATTGTTAGAGAAGATTGCAC
ATTTGAGGACTGATGTTGGTCTTCCTTTAGAGTTTCGCGACACGATTTGCCATCGTTATCCTCAGTACTTTAGAGTTGTGGCGACCGAGCGCGGTCCTGCACTCGAATTG
ACTCACTGGGATCCTGAACTTGCTGTTTCTGCAGCAGAGTTGGCAGAAGAGGAAAATCGAGCTAGAGAATTGGAAGAAAAGAATTTGATTATCGACCGACCACTGAAGTT
TAATCGAGTGAAGCTACCCAAGGGGCTTAACCTTTCAAAGGGTGAGATGAGGAAAATTAGTCAGTTTAGGGACATTCCTTACATTTCTCCCTATTCTGATTTCTCAGGGC
TTAAGTCAGGTACATCCCAGAAAGAGAAACATGCTTGTGGAGTTGTTCATGAGATTTTAAGCCTCACACTTGAGAAGAGAGTTTTAGTGGATCATCTTACTCATTTCCGG
GAGGAGTTTAGGTTCTCACAGCAGTTAAGGGGGATGCTAATAAGGCATCCTGATATGTTTTACGTCTCGTTGAAAGGGGATAGAGATTCGGTCTTCCTCAGAGAAGCATA
TCGTGATTCTCAGTTAATTGACAAAGATCGGTTACTAATCATAAAGGAGAAACTTCGAGCTCTTGTTGCTGTTCCTCGATTCCGAGGCAGAGGAGCTCCCAAAGGTGACA
CAGATGGGGGAGAAACGAATCAGCCAGATGATATGGGTGGTGAAGAATGGTCTGATGTCGATGACCTTTTAGGCGATGATGATGAATTTGATGACGATGACAATGATGAC
GATGGAGCTTTTGAGGATGATTGGAGTGATGAAGATGATACCCCACAAAGTTTCGATGGAGATCGAGATGGAGAATCCGTAAATATTGGAGCAAGGAACCAAAAACAGGT
CAATAACTTGCAAAAGGTTGGTCAGAGTCTTCTTTCTCCTGTATTACCTGATGGACGGCCCAGAGAAAGATGCATCGATCGGTCGGGGTCGGTTTTCGACTCCAACCGAT
CTCCGAACCGACCAAGGTCGGTCGGAGTTTTTTTTTTTTTTAAACCTTACCAAAACCGACCAACCGATCGGTCGGACTTATTAGATACATTTCAAAGTTCAGGGACTTCT
TAA
mRNA sequenceShow/hide mRNA sequence
ATGTCCGTACCTTTCATTCTTCCGCACAAAGCTTATTTCTCTGAGAATCCCTCTTTTTCTGTCAAATCGGATTTTTGGGGTAAAAACTTGGATTTGCGGCACAGGAATGA
TAGTTGTTTCGGTAGTAATTTAAGCAAATCCCATGTCCCTTTTCAACCGATTAGAGCTATTGTGAAACGGAGGAAAGAGCTTCCGTTTGATAATGTTATACAGAGGGATA
AGAAGCTGAAGTTGGTTATGAGGATTAGGAAGATTCTAGTTCAACAACCTGATAGAGTTATGTCACTTAAGGAATTGGGTAAATTCAGGAGAGATTTGGGTCTGGAGAAA
AAGAGGAGGCTAATTGCTTTGTTGAAGAAGTTCCCTGCAGTGTTTGAGGTTGTGGAAGAAGGGGCATTTTCGTTGAAATTCAAGTTAACGGCAGAGGCCGAAAGGTTGTA
TTTGGAGGAGTTGAAGATCAGAAATGAGATGGAAGGTTTGTTGGTTATTAAGCTGAGGAAGCTACTGATGATGTCGGCCGATAAACGAATATTGTTAGAGAAGATTGCAC
ATTTGAGGACTGATGTTGGTCTTCCTTTAGAGTTTCGCGACACGATTTGCCATCGTTATCCTCAGTACTTTAGAGTTGTGGCGACCGAGCGCGGTCCTGCACTCGAATTG
ACTCACTGGGATCCTGAACTTGCTGTTTCTGCAGCAGAGTTGGCAGAAGAGGAAAATCGAGCTAGAGAATTGGAAGAAAAGAATTTGATTATCGACCGACCACTGAAGTT
TAATCGAGTGAAGCTACCCAAGGGGCTTAACCTTTCAAAGGGTGAGATGAGGAAAATTAGTCAGTTTAGGGACATTCCTTACATTTCTCCCTATTCTGATTTCTCAGGGC
TTAAGTCAGGTACATCCCAGAAAGAGAAACATGCTTGTGGAGTTGTTCATGAGATTTTAAGCCTCACACTTGAGAAGAGAGTTTTAGTGGATCATCTTACTCATTTCCGG
GAGGAGTTTAGGTTCTCACAGCAGTTAAGGGGGATGCTAATAAGGCATCCTGATATGTTTTACGTCTCGTTGAAAGGGGATAGAGATTCGGTCTTCCTCAGAGAAGCATA
TCGTGATTCTCAGTTAATTGACAAAGATCGGTTACTAATCATAAAGGAGAAACTTCGAGCTCTTGTTGCTGTTCCTCGATTCCGAGGCAGAGGAGCTCCCAAAGGTGACA
CAGATGGGGGAGAAACGAATCAGCCAGATGATATGGGTGGTGAAGAATGGTCTGATGTCGATGACCTTTTAGGCGATGATGATGAATTTGATGACGATGACAATGATGAC
GATGGAGCTTTTGAGGATGATTGGAGTGATGAAGATGATACCCCACAAAGTTTCGATGGAGATCGAGATGGAGAATCCGTAAATATTGGAGCAAGGAACCAAAAACAGGT
CAATAACTTGCAAAAGGTTGGTCAGAGTCTTCTTTCTCCTGTATTACCTGATGGACGGCCCAGAGAAAGATGCATCGATCGGTCGGGGTCGGTTTTCGACTCCAACCGAT
CTCCGAACCGACCAAGGTCGGTCGGAGTTTTTTTTTTTTTTAAACCTTACCAAAACCGACCAACCGATCGGTCGGACTTATTAGATACATTTCAAAGTTCAGGGACTTCT
TAA
Protein sequenceShow/hide protein sequence
MSVPFILPHKAYFSENPSFSVKSDFWGKNLDLRHRNDSCFGSNLSKSHVPFQPIRAIVKRRKELPFDNVIQRDKKLKLVMRIRKILVQQPDRVMSLKELGKFRRDLGLEK
KRRLIALLKKFPAVFEVVEEGAFSLKFKLTAEAERLYLEELKIRNEMEGLLVIKLRKLLMMSADKRILLEKIAHLRTDVGLPLEFRDTICHRYPQYFRVVATERGPALEL
THWDPELAVSAAELAEEENRARELEEKNLIIDRPLKFNRVKLPKGLNLSKGEMRKISQFRDIPYISPYSDFSGLKSGTSQKEKHACGVVHEILSLTLEKRVLVDHLTHFR
EEFRFSQQLRGMLIRHPDMFYVSLKGDRDSVFLREAYRDSQLIDKDRLLIIKEKLRALVAVPRFRGRGAPKGDTDGGETNQPDDMGGEEWSDVDDLLGDDDEFDDDDNDD
DGAFEDDWSDEDDTPQSFDGDRDGESVNIGARNQKQVNNLQKVGQSLLSPVLPDGRPRERCIDRSGSVFDSNRSPNRPRSVGVFFFFKPYQNRPTDRSDLLDTFQSSGTS