; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0010583 (gene) of Snake gourd v1 genome

Gene IDTan0010583
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein WHAT'S THIS FACTOR 1
Genome locationLG10:14489929..14494607
RNA-Seq ExpressionTan0010583
SyntenyTan0010583
Gene Ontology termsGO:0003723 - RNA binding (molecular function)
GO:0008233 - peptidase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582056.1 Protein WHAT'S THIS FACTOR 1-like, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]1.4e-24390.08Show/hide
Query:  MALSSHFSTPRDRHVTFLSSNLLRGSPLWLHSNVGCKYQREENFKTRYTLIPCSSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKC
        MALSS F TPRD HVT LSSNLL G+PLWLHSNVGCKYQR+ENFKT YTL P +S+KIVRS  LDRHAVKHNKTRFVQKLIILLLSKPKHYIP+HILSKC
Subjt:  MALSSHFSTPRDRHVTFLSSNLLRGSPLWLHSNVGCKYQREENFKTRYTLIPCSSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKC

Query:  RGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSM
        RGYLSL KPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLT  AASLAKQDSNLKL ISNTLAEKLQKLLMLSSH+RILLSKLVHLAPDLSM
Subjt:  RGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSM

Query:  PPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELAKPLPSPQVPSRELIVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKES
        PPNFRSRLCNDYP+KFRTVDTSYGRALELV WDPELAKPLP  QV SRELIVDRPLKFNLL+LRKGLNLKR HQ+FLIKFRDLPDVCPYKTPASELAKES
Subjt:  PPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELAKPLPSPQVPSRELIVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKES

Query:  LESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRR
        LESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGF+DKG L+EKDETLAIKNQWM+LLMEGK+MRR
Subjt:  LESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRR

Query:  EKRKAQIYDSKYGN-------DHEMETDYDDDYDDGFESLFQYEDLDFEDERNDLPSNRSNGDFWTTNN-ADIINYTDGGHIEP
        EKRKAQIYDS+YGN       DHEMETDYDDDY+DGFESLFQYEDLDFEDE +DLPSNRSNGDFWTTNN ADIIN  +GGHIEP
Subjt:  EKRKAQIYDSKYGN-------DHEMETDYDDDYDDGFESLFQYEDLDFEDERNDLPSNRSNGDFWTTNN-ADIINYTDGGHIEP

XP_022955645.1 protein WHAT'S THIS FACTOR 1 [Cucurbita moschata]2.2e-24489.9Show/hide
Query:  MALSSHFSTPRDRHVTFLSSNLLRGSPLWLHSNVGCKYQREENFKTRYTLIPCSSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKC
        MALSS F TPRD HVT LSSNLL G+PLWLHSNVGCKYQR+ENFKT YTL P +S+KIVRS  LDRHAVKHNKTRFVQKLIILLLSKPKHYIP+HILSKC
Subjt:  MALSSHFSTPRDRHVTFLSSNLLRGSPLWLHSNVGCKYQREENFKTRYTLIPCSSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKC

Query:  RGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSM
        RGYLSL KPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLT  AASLAKQDSNLKL ISNTLAEKLQKLLMLSSH+RILLSKLVHLAPDLSM
Subjt:  RGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSM

Query:  PPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELAKPLPSPQVPSRELIVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKES
        PPNFRSRLCNDYP+KFRTVDTSYGRALELV WDPELAKPLP  QV SRELIVDRPLKFNLL+LRKGLNLKR HQ+FLIKFRDLPDVCPYKTPASELAKES
Subjt:  PPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELAKPLPSPQVPSRELIVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKES

Query:  LESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRR
        LESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGF+DKG L+EKDETLAIKNQWM+LLMEGK+MRR
Subjt:  LESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRR

Query:  EKRKAQIYDSKYGN-------DHEMETDYDDDYDDGFESLFQYEDLDFEDERNDLPSNRSNGDFWTTNN-ADIINYTDGGHIEPW
        EKRKAQIYDS+YGN       DHEMETDYDDDY+DGFESLFQYEDLDFEDE +DLPSNRSNGDFWTTNN ADIIN  +GGHI+PW
Subjt:  EKRKAQIYDSKYGN-------DHEMETDYDDDYDDGFESLFQYEDLDFEDERNDLPSNRSNGDFWTTNN-ADIINYTDGGHIEPW

XP_022979770.1 protein WHAT'S THIS FACTOR 1 [Cucurbita maxima]1.1e-24389.69Show/hide
Query:  MALSSHFSTPRDRHVTFLSSNLLRGSPLWLHSNVGCKYQREENFKTRYTLIPCSSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKC
        MALSS F TPRD HVT  SSNLLRG+PLWLHSNVGCKYQR+ENF+T YTL P +S+KIVRS  LDRHAVKHNKTRFVQKLIILLLSKPKHYIP+HILSKC
Subjt:  MALSSHFSTPRDRHVTFLSSNLLRGSPLWLHSNVGCKYQREENFKTRYTLIPCSSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKC

Query:  RGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSM
        RGYLSL KPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLT AAASLAKQDSNLKL ISNTLAEKLQKLLMLSSH+RILLSKLVHLAPDLSM
Subjt:  RGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSM

Query:  PPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELAKPLPSPQVPSRELIVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKES
        PPNFRSRLCNDYP+KFRTVDTSYGRALELV WDPELAK LP  QV SRELIVDRPLKFNLL+LRKGLNLKR HQ+FLIKFRDLPDVCPYKTPASELAKES
Subjt:  PPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELAKPLPSPQVPSRELIVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKES

Query:  LESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRR
        LESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGF+DKG L+EKDETLAIKNQWM+LLMEGK+MRR
Subjt:  LESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRR

Query:  EKRKAQIYDSKYGN-------DHEMETDYDDDYDDGFESLFQYEDLDFEDERNDLPSNRSNGDFWTTNN-ADIINYTDGGHIEPW
        EKRKAQIYDS+YGN       DHEMETDYDDDY+DGFESLFQYEDLDFEDE +DLPSNRSNGDFWTTNN ADI N  +GGHIEPW
Subjt:  EKRKAQIYDSKYGN-------DHEMETDYDDDYDDGFESLFQYEDLDFEDERNDLPSNRSNGDFWTTNN-ADIINYTDGGHIEPW

XP_023527715.1 protein WHAT'S THIS FACTOR 1 [Cucurbita pepo subsp. pepo]1.7e-24490.1Show/hide
Query:  MALSSHFSTPRDRHVTFLSSNLLRGSPLWLHSNVGCKYQREENFKTRYTLIPCSSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKC
        MALSS F TPRD +VT LSSNLLRG+ LWLHSNVGCKYQR+ENFKT YTL P +S+KIVRS  LDRHAVKHNKTRFVQKLIILLLSKPKHYIP+HILSKC
Subjt:  MALSSHFSTPRDRHVTFLSSNLLRGSPLWLHSNVGCKYQREENFKTRYTLIPCSSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKC

Query:  RGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSM
        RGYLSL KPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLT AAASLAKQDSNLKL ISNTLAEKLQKLLMLSSH+RILLSKLVHLAPDLSM
Subjt:  RGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSM

Query:  PPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELAKPLPSPQVPSRELIVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKES
        PPNFRSRLCNDYP+KFRTVDTSYGRALELV WDPELAKPLP  QV SRELIVDRPLKFNLL+LRKGLNLKR HQ+FLIKFRDLPDVCPYKTPASELAKES
Subjt:  PPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELAKPLPSPQVPSRELIVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKES

Query:  LESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRR
        LESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGF+DKG L+EKDETLAIKNQWM+LLMEGK+MRR
Subjt:  LESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRR

Query:  EKRKAQIYDSKYGN-------DHEMETDYDDDYDDGFESLFQYEDLDFEDERNDLPSNRSNGDFWTTNN-ADIINYTDGGHIEPW
        EKRKAQIYDS+YGN       DHEMETDYDDDY+DGFESLFQYEDLDFEDE +DLPSNRSNGDFWTTNN ADIIN  +GGHIEPW
Subjt:  EKRKAQIYDSKYGN-------DHEMETDYDDDYDDGFESLFQYEDLDFEDERNDLPSNRSNGDFWTTNN-ADIINYTDGGHIEPW

XP_038904265.1 protein WHAT'S THIS FACTOR 1, chloroplastic [Benincasa hispida]2.3e-24689.88Show/hide
Query:  MALSSHFSTPRDRHVTFLSSNLLRGSPLWLHSNVGCKYQREENFKTRYTLIPCSSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKC
        M LSSHF TP D HVT LSSNLL G+PLWLHSNV CKYQR+ENF+TRYTL PCSSLKIVRS SLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKC
Subjt:  MALSSHFSTPRDRHVTFLSSNLLRGSPLWLHSNVGCKYQREENFKTRYTLIPCSSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKC

Query:  RGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSM
        RGYLSL +PRSLLSMIHRYPSIFELFSIPYPPTPLNATK+YPQLCVRLT AAASLAKQDS+LKL ISNTLAEKLQKLLMLSSH+RILLSKLVHLAPDLSM
Subjt:  RGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSM

Query:  PPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELAKPLPSPQVPSRELIVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKES
        PPNFRSRLCNDYP+KFRTVDTSYGRALELVSWDPELAKPLP  QVPSRELIVDRPLKFNLLRLRKGLNLKR HQ+FLIKFRDLPDVCPYKTPASELAKES
Subjt:  PPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELAKPLPSPQVPSRELIVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKES

Query:  LESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRR
        LESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFD+KGVL++KDETLAIKNQWM+LLMEGK+MR+
Subjt:  LESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRR

Query:  EKRKAQIYDSKYGN-------DHEMETDYDDDYDDGFESLFQYEDLDFEDERNDLPSNRSNGDFWTTNNADIINYTDGGHIEPW
        EK+KA+ YDSKYGN       DHEMETD+DDDYDDGFESLFQYEDLDFEDER+DLPSN  NGDFW TNN DIIN  DGG IEPW
Subjt:  EKRKAQIYDSKYGN-------DHEMETDYDDDYDDGFESLFQYEDLDFEDERNDLPSNRSNGDFWTTNNADIINYTDGGHIEPW

TrEMBL top hitse value%identityAlignment
A0A0A0KU86 PORR domain-containing protein2.4e-24188.22Show/hide
Query:  MALSSHFSTPRDRHVTFLSSNLLRGSPLWLHSNVGCKYQREENFKTRYTLIPCSSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKC
        MAL SHF TP D HVT LSSNLL GSPLWLHS V  K QR+ENF+T YTL PCSS+KIVRS SLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKC
Subjt:  MALSSHFSTPRDRHVTFLSSNLLRGSPLWLHSNVGCKYQREENFKTRYTLIPCSSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKC

Query:  RGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSM
        RGYLSL +PRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLT AAAS+AKQDS+LK+ ISN LAEKLQKLLMLSSH+RILLSKLVHLAPDLS+
Subjt:  RGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSM

Query:  PPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELAKPLPSPQVPSRELIVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKES
        PPNFRSRLCNDYP+KFRTVDTSYGRALELVSWDPELAKPLP  QVPSRELIVDRPLKFNLLRLRKGLNLKRTHQ+FLIKFRDLPDVCPYK PASELAKES
Subjt:  PPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELAKPLPSPQVPSRELIVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKES

Query:  LESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRR
        LESEKRACAVVREVLGMM+EKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVL+EKDETLAIKNQWM LL E K++RR
Subjt:  LESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRR

Query:  EKRKAQIYDSKYGN-------DHEMETDYDDDYDDGFESLFQYEDLDFEDERNDLPSNRSNGDFWTTNNADIINYTDGGHIEPW
        EK+KAQIYDSKYGN       DHEME DYDDDYDDGFESLFQYEDLDFEDE + +PS  SNGDFWTTNN DI N  DGGHIEPW
Subjt:  EKRKAQIYDSKYGN-------DHEMETDYDDDYDDGFESLFQYEDLDFEDERNDLPSNRSNGDFWTTNNADIINYTDGGHIEPW

A0A1S3BYB2 protein ROOT PRIMORDIUM DEFECTIVE 12.0e-24088.84Show/hide
Query:  MALSSHFSTPRDRHVTFLSSNLLRGSPLWLHSNVGCKYQREENFKTRYTLIPCSSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKC
        MALSSHF TP D H + LSSNLL GSPLWLHS V  K QR+ENF T YTL PCSS+KIVRS SLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKC
Subjt:  MALSSHFSTPRDRHVTFLSSNLLRGSPLWLHSNVGCKYQREENFKTRYTLIPCSSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKC

Query:  RGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSM
        RGYLSL +PRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAAS+AKQDS+LKL ISNTLAEKLQKLLMLSSH+RILLSKLVHLAPDLSM
Subjt:  RGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSM

Query:  PPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELAKPLPSPQVPSRELIVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKES
        PPNFRSRLCNDYP+KFRTVDTSYGRALELVSWDPELAKPLP  QVPSRELIVDRPLKFNLLRLRKGLNLKRTHQ+FLIKFRDLPDVCPYKTPASELAKES
Subjt:  PPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELAKPLPSPQVPSRELIVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKES

Query:  LESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRR
        LESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKG L+EKDETLAIKN+WM LL EGK+MRR
Subjt:  LESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRR

Query:  EKRKAQIYDSKYGN-------DHEMETDYDDDYDDGFESLFQYEDLDFEDERNDLPSNRSNGDFWTTNNADIINYTDGGHIEPW
        EK+KAQIYDSKYGN       DHEME DYDD+YDDGFESLFQYEDLDFEDE + +PS  SNGDFWTTNN DI N TD  HIEPW
Subjt:  EKRKAQIYDSKYGN-------DHEMETDYDDDYDDGFESLFQYEDLDFEDERNDLPSNRSNGDFWTTNNADIINYTDGGHIEPW

A0A6J1CVW2 protein ROOT PRIMORDIUM DEFECTIVE 13.7e-24287.4Show/hide
Query:  MALSSHFSTPRDRHVTFLSSNLLRGSPLWLHSNVGCKYQREENFKTRYTLIPCSSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKC
        MALSSHFST RD H+T LSSN LRG+PLWLHSN+GCKYQR++NFK R+   PCSSLKIVRS SLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKC
Subjt:  MALSSHFSTPRDRHVTFLSSNLLRGSPLWLHSNVGCKYQREENFKTRYTLIPCSSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKC

Query:  RGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSM
        RGYL+LHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLT AAASLA+QDS+LKLAISNTLAEKLQKLLMLSSH+RILLSKLVHLAPDLSM
Subjt:  RGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSM

Query:  PPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELAKPLPSPQVPSRELIVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKES
        PPNFRSRLCND+PDKFRTVDTSYGRALELVSWD ELAKPLP PQVPSRELIVDRPLKFNLL+LRKGLNLKR HQ+FLIKFRDLPDVCPYKTPA+ELAKES
Subjt:  PPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELAKPLPSPQVPSRELIVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKES

Query:  LESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRR
        +ESEKRACAVVRE+LGMM+EKRTLIDHLTHFRKDF LPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVL++KDET+ IKNQWM+L+ EGK+MRR
Subjt:  LESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRR

Query:  EKRKAQIYDSKYGN-------DHEMETDYDDDYDDGFESLFQYEDLDFEDERNDLPSNRSNGDFWTTNNADIINYTDGGHIEPW
        EK KAQIYDSKYGN       D+E+ETDYDDDYDDGFESLF++ED DFE+E +DL S+RSNGDFWTT NADII+  DGGHIEPW
Subjt:  EKRKAQIYDSKYGN-------DHEMETDYDDDYDDGFESLFQYEDLDFEDERNDLPSNRSNGDFWTTNNADIINYTDGGHIEPW

A0A6J1GVP1 protein WHAT'S THIS FACTOR 11.0e-24489.9Show/hide
Query:  MALSSHFSTPRDRHVTFLSSNLLRGSPLWLHSNVGCKYQREENFKTRYTLIPCSSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKC
        MALSS F TPRD HVT LSSNLL G+PLWLHSNVGCKYQR+ENFKT YTL P +S+KIVRS  LDRHAVKHNKTRFVQKLIILLLSKPKHYIP+HILSKC
Subjt:  MALSSHFSTPRDRHVTFLSSNLLRGSPLWLHSNVGCKYQREENFKTRYTLIPCSSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKC

Query:  RGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSM
        RGYLSL KPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLT  AASLAKQDSNLKL ISNTLAEKLQKLLMLSSH+RILLSKLVHLAPDLSM
Subjt:  RGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSM

Query:  PPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELAKPLPSPQVPSRELIVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKES
        PPNFRSRLCNDYP+KFRTVDTSYGRALELV WDPELAKPLP  QV SRELIVDRPLKFNLL+LRKGLNLKR HQ+FLIKFRDLPDVCPYKTPASELAKES
Subjt:  PPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELAKPLPSPQVPSRELIVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKES

Query:  LESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRR
        LESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGF+DKG L+EKDETLAIKNQWM+LLMEGK+MRR
Subjt:  LESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRR

Query:  EKRKAQIYDSKYGN-------DHEMETDYDDDYDDGFESLFQYEDLDFEDERNDLPSNRSNGDFWTTNN-ADIINYTDGGHIEPW
        EKRKAQIYDS+YGN       DHEMETDYDDDY+DGFESLFQYEDLDFEDE +DLPSNRSNGDFWTTNN ADIIN  +GGHI+PW
Subjt:  EKRKAQIYDSKYGN-------DHEMETDYDDDYDDGFESLFQYEDLDFEDERNDLPSNRSNGDFWTTNN-ADIINYTDGGHIEPW

A0A6J1IUA5 protein WHAT'S THIS FACTOR 15.2e-24489.69Show/hide
Query:  MALSSHFSTPRDRHVTFLSSNLLRGSPLWLHSNVGCKYQREENFKTRYTLIPCSSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKC
        MALSS F TPRD HVT  SSNLLRG+PLWLHSNVGCKYQR+ENF+T YTL P +S+KIVRS  LDRHAVKHNKTRFVQKLIILLLSKPKHYIP+HILSKC
Subjt:  MALSSHFSTPRDRHVTFLSSNLLRGSPLWLHSNVGCKYQREENFKTRYTLIPCSSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKC

Query:  RGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSM
        RGYLSL KPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLT AAASLAKQDSNLKL ISNTLAEKLQKLLMLSSH+RILLSKLVHLAPDLSM
Subjt:  RGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSM

Query:  PPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELAKPLPSPQVPSRELIVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKES
        PPNFRSRLCNDYP+KFRTVDTSYGRALELV WDPELAK LP  QV SRELIVDRPLKFNLL+LRKGLNLKR HQ+FLIKFRDLPDVCPYKTPASELAKES
Subjt:  PPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELAKPLPSPQVPSRELIVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKES

Query:  LESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRR
        LESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGF+DKG L+EKDETLAIKNQWM+LLMEGK+MRR
Subjt:  LESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRR

Query:  EKRKAQIYDSKYGN-------DHEMETDYDDDYDDGFESLFQYEDLDFEDERNDLPSNRSNGDFWTTNN-ADIINYTDGGHIEPW
        EKRKAQIYDS+YGN       DHEMETDYDDDY+DGFESLFQYEDLDFEDE +DLPSNRSNGDFWTTNN ADI N  +GGHIEPW
Subjt:  EKRKAQIYDSKYGN-------DHEMETDYDDDYDDGFESLFQYEDLDFEDERNDLPSNRSNGDFWTTNN-ADIINYTDGGHIEPW

SwissProt top hitse value%identityAlignment
A0MFS5 Protein WHAT'S THIS FACTOR 1 homolog, chloroplastic2.7e-5635.86Show/hide
Query:  KTRYTLIPC-SSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKCRGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQ
        KTR  + P  +++K  +  + D    +  K + V  +  +L+S+P   + L  L K R  L L K R  ++++ +YP +FE+             +    
Subjt:  KTRYTLIPC-SSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKCRGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQ

Query:  LCVRLTAAAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSMPPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELAKPLP--
        L  ++T+ A  L   +  ++  + + L  KL+KL+M+S   RILL K+ HL  DL +P  FR  +C  YP  FR V T  G ALEL  WDPELA      
Subjt:  LCVRLTAAAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSMPPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELAKPLP--

Query:  ------SPQVPSRELIVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKESLESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDF
              + +   R LI+DRP KFN ++L +GLNL ++    + +FRD+  + PYK   S L   +LE EK AC V+ E+L +  EKRTL+DHLTHFR++F
Subjt:  ------SPQVPSRELIVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKESLESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDF

Query:  GLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRR-------EKRKAQIYDSKY-GNDHEMETDYDDDYD
            +LRGM++RHP+LFYVSLKG+RDSVFL E + +   L++KD    +K +   L+   +  RR       E R+ +I  S   G + E E+D ++  D
Subjt:  GLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRR-------EKRKAQIYDSKY-GNDHEMETDYDDDYD

Query:  -DGF---ESLFQYEDLDFEDE--RNDLPSNRSNGD
         DG+   E     +D D+ D+    D+P N  + D
Subjt:  -DGF---ESLFQYEDLDFEDE--RNDLPSNRSNGD

B6TTV8 Protein WHAT'S THIS FACTOR 1, chloroplastic1.0e-5836.67Show/hide
Query:  SSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKCRGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAA
        +++K  +    D    +  K + V KL  +L+++P   + L  L + R  L L + R L++++ R+P +F++              +Y  L  RLT AA 
Subjt:  SSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKCRGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAA

Query:  SLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSMPPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELA--------KPLPSPQV
         L   +  L+         KL+KLLM+S   RIL+ K+ HL  DL +PP FR  +C  YP  FR V    G ALEL  WDPELA        +   + + 
Subjt:  SLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSMPPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELA--------KPLPSPQV

Query:  PSRELIVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKESLESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMI
          R LI+DRPLKFN +RL KGL L R     + +F+++P + PY    S L   S E EK AC VV E+L + +EKRTL+DHLTHFR++F     LRGMI
Subjt:  PSRELIVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKESLESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMI

Query:  VRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRREK-RKAQIYDSKYGNDHEMETDYDDDYDDGFESLFQYEDLDFEDE
        +RHP++FYVS KG RDSVFL E + D   L+EK++ + +K +   L+   +  RR      +  +   G+    +   D++YDD  E L   EDL  E  
Subjt:  VRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRREK-RKAQIYDSKYGNDHEMETDYDDDYDDGFESLFQYEDLDFEDE

Query:  RNDLPSNRSNGDFWTTNNAD
             ++   GD W   N D
Subjt:  RNDLPSNRSNGDFWTTNNAD

Q65XL5 Protein WHAT'S THIS FACTOR 1 homolog, chloroplastic6.5e-5835.92Show/hide
Query:  RYTLIPCSSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKCRGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCV
        R  ++  +++K  +    D    +  K + V KL  +L+S P   + L  L + R  L L + R L++++ R+P +FE+              +Y  L  
Subjt:  RYTLIPCSSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKCRGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCV

Query:  RLTAAAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSMPPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELA--------K
        RLT AA  L   + +LK         KL+KLLM+S   RIL+ K+ HL  DL +PP FR  +C  YP  FR V    G  LEL  WDPELA        +
Subjt:  RLTAAAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSMPPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELA--------K

Query:  PLPSPQVPSRELIVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKESLESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLP
           + +   R LI+DRPLKFN ++L +GL L R     + +F+++P + PY +  S L   S E EK AC VV E+L + +EKRTL+DHLTHFR++F   
Subjt:  PLPSPQVPSRELIVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKESLESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLP

Query:  NKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRREKRKAQIYDSKYGNDHEMETDYDDDYDDGFESLFQYED
          LRGM++RHP++FYVSLKG RDSVFL E + +   L+EK + + +K +   L+   +  RR          +     +M ++  D  DD  E L   ED
Subjt:  NKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRREKRKAQIYDSKYGNDHEMETDYDDDYDDGFESLFQYED

Query:  LDFEDERNDLPSNRSNGDFWTTNNAD
        L  E       ++   GD W   N D
Subjt:  LDFEDERNDLPSNRSNGDFWTTNNAD

Q689D6 Protein ROOT PRIMORDIUM DEFECTIVE 15.0e-1825Show/hide
Query:  VRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKCRGYLSL----HKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAAS
        VR    D +     K R V K   L+LS+P H I + +L      L L    H+P + L    ++P +FE++  P          +   L  RLT  A  
Subjt:  VRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKCRGYLSL----HKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAAS

Query:  LAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSMPPNFRSRLCNDYPDKFRTVD--TSYGRALELVSWDPELAKPLPS--PQVPSREL
          + +    L        +L+KL+M+S+  RI L  +     +  +P +F   +   +P  FR +D   +  + +E+V  DP L+        ++  R  
Subjt:  LAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSMPPNFRSRLCNDYPDKFRTVD--TSYGRALELVSWDPELAKPLPS--PQVPSREL

Query:  IVD-RPLKFN-LLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKESLES----EKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGM
         +D   ++F+ ++    G  + +  +  + K++ LP   PY+   S     S+E+    EKR+ A + E+L + +EK+  ++ + HFR    LP KL+  
Subjt:  IVD-RPLKFN-LLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKESLES----EKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGM

Query:  IVRHPELFYVSLK---GQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRREKRKAQIYDSKYGNDHEMETD
        +++H  +FY+S +   G+  +VFL EG+  +G L+E ++    + +  +L++     R+ K  A++   + G D E + +
Subjt:  IVRHPELFYVSLK---GQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRREKRKAQIYDSKYGNDHEMETD

Q9ZUZ6 Protein WHAT'S THIS FACTOR 9, mitochondrial3.8e-1824.34Show/hide
Query:  LKIVRSPSLD--RHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKCRGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAA
        +K  R P  D   H ++ ++ + V  L   ++ +P   IP+  +SK      +     +   + ++PSIFE F  P    P            RLT  A 
Subjt:  LKIVRSPSLD--RHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKCRGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAA

Query:  SLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSMPPNFRSRLCNDYPDKFRTVDTS---YGRALELVSWDPELAKPLPSPQVPSREL
         L +Q+  +    ++ L ++L+KL+++S  N + LS +  +   L +P ++      +    FR VD      G A++    D  L+    +     R  
Subjt:  SLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSMPPNFRSRLCNDYPDKFRTVDTS---YGRALELVSWDPELAKPLPSPQVPSREL

Query:  IVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYK-----TPASELAKESLESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMI
        +    ++F L    KG  L+   +D+L++F+ LP V PY       P+S++A      EKR    + E+L + +E       L   +K FGLP K+    
Subjt:  IVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYK-----TPASELAKESLESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMI

Query:  VRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRREKRKAQIYDSKYGNDHEMETDYD
         RHP++FY+S+K +  +  L E + DK   +E    L ++ +++QL+   + + + +R +  +  +   D +++ D++
Subjt:  VRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRREKRKAQIYDSKYGNDHEMETDYD

Arabidopsis top hitse value%identityAlignment
AT3G63090.1 Ubiquitin carboxyl-terminal hydrolase family protein1.2e-2729.81Show/hide
Query:  SSLKIV--RSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKCRGYLSL-HKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTA
        SSLK+V  +   LD    +  + +   +++  +L++P   IPL  L K R  L L  K +S + M    PS+FE++     P      K  P   +R T 
Subjt:  SSLKIV--RSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKCRGYLSL-HKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTA

Query:  AAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSMPPNFRSRLCNDYPDKFR-TVDTSYGRA-LELVSWDPELAKPLPSPQVPSR
           +   ++  +       L  KL +LLM++    I   KLVH+  D   P +F  +L   YP+ FR T     G++ LELVSW+P+ AK     +    
Subjt:  AAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSMPPNFRSRLCNDYPDKFR-TVDTSYGRA-LELVSWDPELAKPLPSPQVPSR

Query:  ELIVDRPLKFNL-LRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKESLESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVR
         L     ++ N  ++L  G  L++  +++   + +   + PY+   S L + S E EKR   VV E+L + + KR  +  L  F  +F   N    +  R
Subjt:  ELIVDRPLKFNL-LRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKESLESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVR

Query:  HPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRREKRKAQ
        H  +FY+SLKG   +  L E + D   L+++D  LAIK+++++LL EG + R+++ K Q
Subjt:  HPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRREKRKAQ

AT4G01037.1 Ubiquitin carboxyl-terminal hydrolase family protein1.9e-5735.86Show/hide
Query:  KTRYTLIPC-SSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKCRGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQ
        KTR  + P  +++K  +  + D    +  K + V  +  +L+S+P   + L  L K R  L L K R  ++++ +YP +FE+             +    
Subjt:  KTRYTLIPC-SSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKCRGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQ

Query:  LCVRLTAAAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSMPPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELAKPLP--
        L  ++T+ A  L   +  ++  + + L  KL+KL+M+S   RILL K+ HL  DL +P  FR  +C  YP  FR V T  G ALEL  WDPELA      
Subjt:  LCVRLTAAAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSMPPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELAKPLP--

Query:  ------SPQVPSRELIVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKESLESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDF
              + +   R LI+DRP KFN ++L +GLNL ++    + +FRD+  + PYK   S L   +LE EK AC V+ E+L +  EKRTL+DHLTHFR++F
Subjt:  ------SPQVPSRELIVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKESLESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDF

Query:  GLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRR-------EKRKAQIYDSKY-GNDHEMETDYDDDYD
            +LRGM++RHP+LFYVSLKG+RDSVFL E + +   L++KD    +K +   L+   +  RR       E R+ +I  S   G + E E+D ++  D
Subjt:  GLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRR-------EKRKAQIYDSKY-GNDHEMETDYDDDYD

Query:  -DGF---ESLFQYEDLDFEDE--RNDLPSNRSNGD
         DG+   E     +D D+ D+    D+P N  + D
Subjt:  -DGF---ESLFQYEDLDFEDE--RNDLPSNRSNGD

AT5G21970.1 Ubiquitin carboxyl-terminal hydrolase family protein6.7e-2627.25Show/hide
Query:  EENFKTRYTLIPCSSLKI---VRSPSLDRHAVKHNKTRFVQKLIIL---LLSKPKHYIPLHILSKCRGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTP
        E +F+  +     SS ++    R   +    +   K +   K+I L   L  +    + +    + R  ++L KP  +   I + P +FEL+        
Subjt:  EENFKTRYTLIPCSSLKI---VRSPSLDRHAVKHNKTRFVQKLIIL---LLSKPKHYIPLHILSKCRGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTP

Query:  LNATKLYPQLCVRLTAAAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSMPPNFRSRLCNDYPDKFRTVDTSYGRA-LELVSWD
                 L   LT     L  +   L     +  AE + + LM+S   ++ L K+VH   D  +P +FR     ++P  F+ V    G   LELVSW+
Subjt:  LNATKLYPQLCVRLTAAAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSMPPNFRSRLCNDYPDKFRTVDTSYGRA-LELVSWD

Query:  PELA-KPLPSPQVPSRELIVDRPLKFNL---LRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKESLESEKRACAVVREVLGMMIEKRTLIDHLT
        P  A   L    +   E    +P   +L   ++          ++  +  F+    + PY   A  L   S E +KRA AV+ E+L   +EKR + DHLT
Subjt:  PELA-KPLPSPQVPSRELIVDRPLKFNL---LRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKESLESEKRACAVVREVLGMMIEKRTLIDHLT

Query:  HFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRREKRKAQIY-DSKYGNDHEM--ETDYDDDY
        HFR++F +P KL  + ++H  +FYVS +G+R SVFL EG++    L+EK   +     W + L++    R  KR  Q Y D+    + E+      D+D 
Subjt:  HFRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRREKRKAQIY-DSKYGNDHEM--ETDYDDDY

Query:  DDGFESLFQYEDLDFEDERNDL
          GFE    Y+D+  +D+  D+
Subjt:  DDGFESLFQYEDLDFEDERNDL

AT5G48040.1 Ubiquitin carboxyl-terminal hydrolase family protein2.5e-2527.15Show/hide
Query:  LKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKCRGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAASL
        LK V+   LD   V+    R V  L+ ++ + P   +P+  L   RG L L +   L + I RYP+IF    + +       T +    C  LT     L
Subjt:  LKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKCRGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAASL

Query:  AKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSMPPNFRSRLCNDYPDKFRTVDTSYG-RALELVSWDPELAKPLPSPQVPSRELIVDR
          ++ ++       +  +L KLLML+    + L  + HL  DL +P ++R  L   +PD F  V  S     L+L+ WD  LA      Q+  RE + + 
Subjt:  AKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSMPPNFRSRLCNDYPDKFRTVDTSYG-RALELVSWDPELAKPLPSPQVPSRELIVDR

Query:  PLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKESLESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYV
              ++  +G  LKR   ++L +++ LP   PY   AS L   +  SEKR   V  E+L + I K+T   ++++ RK F LP K   +  RHP +FY+
Subjt:  PLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKESLESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPELFYV

Query:  SLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEG-----KKMRREKRKAQIYDSKYGNDHEMETDYDDDYDDGFES
        S+K    +V L E +D +  L+EK   + ++ ++  ++ EG     + + ++  +A +  +     + + +D +++ D+   S
Subjt:  SLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEG-----KKMRREKRKAQIYDSKYGNDHEMETDYDDDYDDGFES

AT5G62990.1 Ubiquitin carboxyl-terminal hydrolase family protein6.5e-14661.94Show/hide
Query:  IPCSSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKCRGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTA
        I CS+ KIVRSPSLDRH VK N+ RFVQKL  LLLSKPKHYIP+ IL KCR YL +  P ++LSMI RYP+IFELF+ P P  P+NATK   QLCVRLT+
Subjt:  IPCSSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKCRGYLSLHKPRSLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTA

Query:  AAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSMPPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELAKPLPSPQVPSREL
        AA+SLA Q+ NLK  IS+ LA KLQKLLMLSSH R+LLSKLVH+APD   PPNFRSRLCNDYPDKF+TVDTSYGRALELVS DPELA  +PSP+V  R L
Subjt:  AAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSMPPNFRSRLCNDYPDKFRTVDTSYGRALELVSWDPELAKPLPSPQVPSREL

Query:  IVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKESLESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPE
        IVDRPLKF  L LR+GLNLKR HQ FLIKFR+ PDVCPYK  +  LA ES+E+EKRACAVVREVLG+ +EKRTLIDHLTHFRK+F LPNKLR +IVRHPE
Subjt:  IVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKESLESEKRACAVVREVLGMMIEKRTLIDHLTHFRKDFGLPNKLRGMIVRHPE

Query:  LFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRREKRKA------QIYDSKYGNDH---EMETDYDDDYDDGFESLFQYEDL--
        LFYVS+KG RDSVFLVE ++D G L++KDE L I+ + + L+ EGK++RRE+R+       +  + K  +D    + ++D DD+Y+DGFE+LF  EDL  
Subjt:  LFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRREKRKA------QIYDSKYGNDH---EMETDYDDDYDDGFESLFQYEDL--

Query:  ----DFEDERNDLPSNRSNGDFWTTNNADIINYTDGGH--IEPW
            D ED+ ++   N  + ++W+   +   + +D  +  +E W
Subjt:  ----DFEDERNDLPSNRSNGDFWTTNNADIINYTDGGH--IEPW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTATCTTCTCATTTCTCGACCCCTAGGGACCGGCATGTAACATTCCTTAGTTCAAACCTCCTCCGTGGCAGTCCATTGTGGCTGCATTCCAATGTTGGTTGCAA
GTACCAGAGAGAAGAAAATTTCAAGACTCGTTATACTTTGATTCCTTGTTCCTCTCTCAAGATTGTCCGTAGTCCTTCATTAGATCGGCATGCTGTAAAACATAACAAAA
CTCGATTTGTTCAAAAGCTGATAATACTACTGCTATCTAAACCAAAGCATTATATACCACTCCACATTCTTTCCAAGTGTCGTGGCTATCTTTCCCTTCACAAACCTCGC
TCCCTCCTTTCAATGATTCATCGTTATCCTTCCATTTTTGAACTTTTTTCAATCCCTTATCCACCCACACCACTCAATGCAACAAAGCTATATCCCCAACTTTGTGTTCG
TCTAACCGCAGCAGCAGCATCTCTTGCTAAACAGGACTCCAATCTCAAATTGGCGATCTCTAACACCTTGGCTGAAAAGCTCCAAAAGCTACTTATGCTTTCTTCACACA
ACAGGATTCTCTTATCAAAGCTGGTTCACCTTGCTCCCGATCTAAGCATGCCCCCTAATTTTAGATCCCGTCTCTGTAATGATTATCCAGATAAATTCAGGACTGTTGAT
ACCTCATATGGCCGTGCACTGGAGCTTGTCTCCTGGGACCCAGAGTTGGCAAAGCCTTTACCTTCCCCTCAAGTTCCTTCTCGTGAGCTAATAGTGGATAGGCCATTAAA
ATTCAACTTGCTGAGACTAAGAAAGGGGCTGAATTTGAAAAGAACCCACCAGGATTTTCTAATTAAGTTCAGGGATTTGCCAGATGTTTGCCCTTACAAAACTCCTGCGA
GTGAGTTGGCTAAGGAGTCTCTTGAGTCAGAGAAACGAGCTTGTGCTGTGGTGCGAGAGGTATTGGGGATGATGATTGAGAAGAGAACTTTAATAGACCACTTGACGCAT
TTCAGAAAAGATTTTGGGCTACCCAATAAACTGAGAGGGATGATTGTGAGGCATCCAGAGTTATTTTATGTGAGTTTGAAGGGTCAGAGAGACTCTGTATTCCTTGTGGA
GGGCTTTGATGATAAGGGTGTTCTAATGGAGAAGGATGAGACTTTGGCAATCAAAAATCAATGGATGCAGCTTTTAATGGAAGGGAAAAAGATGAGGCGGGAGAAGAGGA
AGGCTCAAATATATGACAGTAAATATGGAAATGATCATGAGATGGAAACTGACTATGATGATGATTACGATGATGGTTTTGAGAGTTTATTTCAGTATGAGGATTTAGAT
TTTGAGGATGAGAGGAATGACCTGCCAAGCAATAGGTCGAATGGAGACTTTTGGACTACAAATAATGCAGATATTATTAATTACACAGATGGAGGACATATAGAACCTTG
GTGA
mRNA sequenceShow/hide mRNA sequence
GCTATCGAGCTTTAAAATCTTTCCTTCGGCCTCAGCTGCTTGCTTACAAACAGCTATGGACTATGCGTCTCCAGTTCTTCGCAACTTTCAACAGGTAATAAGGGCTTTGA
GATCCTTCAATCAATGGAAGTTTCTTGACGCGTCGTGGAGGAATTGGTGGCGTGGACTTCAGTTTCTTGTTCCATGAGGAAGAGGACAGAGAAATCTCAAACAGAGGAAG
AAAATCTGAAGAAGAAGAGGAGAATCGCAACGGAAGGCCGTTTATTCAACTAATACCATATTAGAAGTGGAAAATGAGATAAAATGATCTCTGATATTGTATTAAAGTAA
GATCTCCACCCCTATATATAATACAAAATATAACTATTCAGTAACTAGAATTTACTGAAGTAAATAACAAACTTTTAACTACCATATATGCAGTTTATATCTATAGTTTT
TACGATTTTCTAATTTACCCTCTCCCATGGCACAAGACAATTTTCTTGCACAGATTAAGAGTTCTTTTATTCTCAGAATCAGACATTTGGCATATAGTTCGTTATAAACT
CTAGAAGATGAAAAATTTTGCCAATTTTGCGAATATTCTGCTTTTGTTTATTTACATTTTACTACCGAAATGATTTTCTGGATTCTGTAGCACATTGCTATTTTGTTCAC
AATTTTCACTTTGTACAAGTTGAGAAAAGGTGTTGTTTTCATGCCATGGAACCATAAGCTCTACTGAAATGTTTAACTGTATAAATGTAATTATGTACTGCAGGTGGTAA
GGTACCTGAAAGTAATTGTCTACAACTTAATCTATTGATGCAAGACAGTAGCATCCCCCATCTGTTGCTCTTGAATCAGAGTTGGATGTGTCATATACTTTAGATAAATT
CTATGAATCTGTTTATGTATTGTAGGCTAAAAGGCCTTCCATTGATACAAAACTGAAAAATGGCTTTATCTTCTCATTTCTCGACCCCTAGGGACCGGCATGTAACATTC
CTTAGTTCAAACCTCCTCCGTGGCAGTCCATTGTGGCTGCATTCCAATGTTGGTTGCAAGTACCAGAGAGAAGAAAATTTCAAGACTCGTTATACTTTGATTCCTTGTTC
CTCTCTCAAGATTGTCCGTAGTCCTTCATTAGATCGGCATGCTGTAAAACATAACAAAACTCGATTTGTTCAAAAGCTGATAATACTACTGCTATCTAAACCAAAGCATT
ATATACCACTCCACATTCTTTCCAAGTGTCGTGGCTATCTTTCCCTTCACAAACCTCGCTCCCTCCTTTCAATGATTCATCGTTATCCTTCCATTTTTGAACTTTTTTCA
ATCCCTTATCCACCCACACCACTCAATGCAACAAAGCTATATCCCCAACTTTGTGTTCGTCTAACCGCAGCAGCAGCATCTCTTGCTAAACAGGACTCCAATCTCAAATT
GGCGATCTCTAACACCTTGGCTGAAAAGCTCCAAAAGCTACTTATGCTTTCTTCACACAACAGGATTCTCTTATCAAAGCTGGTTCACCTTGCTCCCGATCTAAGCATGC
CCCCTAATTTTAGATCCCGTCTCTGTAATGATTATCCAGATAAATTCAGGACTGTTGATACCTCATATGGCCGTGCACTGGAGCTTGTCTCCTGGGACCCAGAGTTGGCA
AAGCCTTTACCTTCCCCTCAAGTTCCTTCTCGTGAGCTAATAGTGGATAGGCCATTAAAATTCAACTTGCTGAGACTAAGAAAGGGGCTGAATTTGAAAAGAACCCACCA
GGATTTTCTAATTAAGTTCAGGGATTTGCCAGATGTTTGCCCTTACAAAACTCCTGCGAGTGAGTTGGCTAAGGAGTCTCTTGAGTCAGAGAAACGAGCTTGTGCTGTGG
TGCGAGAGGTATTGGGGATGATGATTGAGAAGAGAACTTTAATAGACCACTTGACGCATTTCAGAAAAGATTTTGGGCTACCCAATAAACTGAGAGGGATGATTGTGAGG
CATCCAGAGTTATTTTATGTGAGTTTGAAGGGTCAGAGAGACTCTGTATTCCTTGTGGAGGGCTTTGATGATAAGGGTGTTCTAATGGAGAAGGATGAGACTTTGGCAAT
CAAAAATCAATGGATGCAGCTTTTAATGGAAGGGAAAAAGATGAGGCGGGAGAAGAGGAAGGCTCAAATATATGACAGTAAATATGGAAATGATCATGAGATGGAAACTG
ACTATGATGATGATTACGATGATGGTTTTGAGAGTTTATTTCAGTATGAGGATTTAGATTTTGAGGATGAGAGGAATGACCTGCCAAGCAATAGGTCGAATGGAGACTTT
TGGACTACAAATAATGCAGATATTATTAATTACACAGATGGAGGACATATAGAACCTTGGTGATGGTTAATTTAATTTGTTTGATGTATTGGGAAGATGCATAAAATTTT
TTTATGCTTAACGTAAACAAAGTCCTCTTCAGTTTGTTTATCTTATTAAATTCAGGAAAAAAAAAACACTTTGTGAAGATAAAGGTCGGCCTTGAAAGCTCAAAGGATGT
TTAAATGTAGCCTGATGATCTATACGAAGGGCTGAAGCTACTGCCTCTAGGCTCATCTATTTGTGTATCTATTTGTGCATAAGCAAGGGCCCCATGAATTTATCCTCTGC
TGATATTCCACTATACCTTCTCAAGCTCTGTATAAGTTGAACCTGGCCTGTCATGTTCTTTTATCTCCTCAATGTACAATGATTCGAAGAACTTTCTATGAAGTGTGAAG
GAAACTTCTCCCTAATCAAGGAGGGACGTATGAGAGATAAGTCCTAACTCCTATTTATGTATACCATTAGATAAATTAAGACAACAAATTCATGTTAATGAATGATGTAA
ATATTCAAATATAGTTTGGTACCTCATCTCTACTCCTTGGCCGCTGGCAGTGAATGTGTATTCAAAATTTTTTCTCATCTCTACTCCTTGGTTGGTGGCGATGAATGTAT
TCAGAGTTTTTTCTCCCCTTTAATAAGAGGTTCTTTAGTAACATTCCCGTTTCTGGTTTACTCGTGTACTGTTTCTTATTTTTGAGTTTTTAGCAAACAATTATCTCTGC
TTTTCATTTGTCATTTTTAAGAACCAAGAAACAATGACCAAATCATTACCAAGC
Protein sequenceShow/hide protein sequence
MALSSHFSTPRDRHVTFLSSNLLRGSPLWLHSNVGCKYQREENFKTRYTLIPCSSLKIVRSPSLDRHAVKHNKTRFVQKLIILLLSKPKHYIPLHILSKCRGYLSLHKPR
SLLSMIHRYPSIFELFSIPYPPTPLNATKLYPQLCVRLTAAAASLAKQDSNLKLAISNTLAEKLQKLLMLSSHNRILLSKLVHLAPDLSMPPNFRSRLCNDYPDKFRTVD
TSYGRALELVSWDPELAKPLPSPQVPSRELIVDRPLKFNLLRLRKGLNLKRTHQDFLIKFRDLPDVCPYKTPASELAKESLESEKRACAVVREVLGMMIEKRTLIDHLTH
FRKDFGLPNKLRGMIVRHPELFYVSLKGQRDSVFLVEGFDDKGVLMEKDETLAIKNQWMQLLMEGKKMRREKRKAQIYDSKYGNDHEMETDYDDDYDDGFESLFQYEDLD
FEDERNDLPSNRSNGDFWTTNNADIINYTDGGHIEPW