CuGenDBv2

Sgr001084 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr001084
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: PHD-type domain-containing protein
Genome location: tig00000695:38748..41995
RNA-Seq Expression: Sgr001084
Synteny: Sgr001084
Gene Ontology terms: NA
InterPro domains:
IPR001005 - SANT/Myb domain
IPR001965 - Zinc finger, PHD-type
IPR009057 - Homeobox-like domain superfamily
IPR011011 - Zinc finger, FYVE/PHD-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR017930 - Myb domain
IPR019786 - Zinc finger, PHD-type, conserved site
IPR019787 - Zinc finger, PHD-finger


Homology
GenBank top hits (e value, %identity, alignment)
XP_022154229.1 uncharacterized protein LOC111021537 isoform X1 [Momordica charantia]; e value 2.7e-308; %identity 54.16
Query:  MENDPESASSSTLSWRWAIEALASTKEVNTYLLHDVIDTAEELPEDTRQNAGEMVALKCLEGLLDPLNHTRGHGPPAQYSKVTFDSSESCEGVIKRIYEE
        MEN+PESAS   L+WR  IEALAS  +V   LLHDVI+ A EL +D R+NAGEMVAL+CLEGL   LN  R   PPA  SKVTFDSSESCE V+KRIY+E
Subjt:  MENDPESASSSTLSWRWAIEALASTKEVNTYLLHDVIDTAEELPEDTRQNAGEMVALKCLEGLLDPLNHTRGHGPPAQYSKVTFDSSESCEGVIKRIYEE

Query:  TPQSALGVAGPDLVKWDVIPFVAQKRASMHCTLQQMKDTLLDGTHPYVDFLKQKSGLASVNKRDSISLNNDDRIELSKRLDRSSSDPQSQKEKSKGSPLL
        TP+SAL VAGP+++KWDV PF+AQKRASM  TL Q+KDT+LDGTHPYVDFLK KSGL  VNKRD I+LNNDDR ELS+RLD SS D Q QKEK KGSPLL
Subjt:  TPQSALGVAGPDLVKWDVIPFVAQKRASMHCTLQQMKDTLLDGTHPYVDFLKQKSGLASVNKRDSISLNNDDRIELSKRLDRSSSDPQSQKEKSKGSPLL

Query:  HEDER-----------------------------------------------GL----------------------------------------------
         EDER                                               GL                                              
Subjt:  HEDER-----------------------------------------------GL----------------------------------------------

Query:  -------SVADPSSFSLLPSKRSRVDFTSEDEARQWPACDDGYTNVKMLK--------------------------------------------------
               SVA+PSS SLLPSKRSRVD  SEDEA Q+P CDDG+ NVK L+                                                  
Subjt:  -------SVADPSSFSLLPSKRSRVDFTSEDEARQWPACDDGYTNVKMLK--------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------QHSAHTLYSGQEGASSHGTELVEDSPERVVPQNEGDETDLLDEPRMTL-V
                                                          QHSA TLYSGQE ASSHGTEL+EDS ERVVPQNEGD+   LDE  MTL V
Subjt:  --------------------------------------------------QHSAHTLYSGQEGASSHGTELVEDSPERVVPQNEGDETDLLDEPRMTL-V

Query:  EDKLVEEEHFGSKRSGQCTATDESHQGESGIPCYTMPAHTQDGEMHEVIFVEKVKDRSELPFERTESTTSPAEGNPHNTSTDGSKCDFGHDYHVQAMNTM
        EDKL +EEHFG KRS  CTATDE HQ ESGIPC+TMPA  QD EMHE+I VEKV DRS LP E   S +S AEGN HN   D SKCD GHD HV A+NTM
Subjt:  EDKLVEEEHFGSKRSGQCTATDESHQGESGIPCYTMPAHTQDGEMHEVIFVEKVKDRSELPFERTESTTSPAEGNPHNTSTDGSKCDFGHDYHVQAMNTM

Query:  SHSGFLPKTVATNIDVGMNPDEEEKDMLSDSDGYHNERIDIAMKKNEFFSSQCMVDHDSLLLADRRELTLCVKCNEGGQLLSCNISDCPLVVHDQCLGSS
        SHS F P+TVAT+IDVGMNPDEEEKDMLSDSDG+ N+ IDIAMKKNEFFSSQC+VDHDS  LADR+ELT+CVKCNEGGQLLSCNISDC LVVHD+CLG S
Subjt:  SHSGFLPKTVATNIDVGMNPDEEEKDMLSDSDGYHNERIDIAMKKNEFFSSQCMVDHDSLLLADRRELTLCVKCNEGGQLLSCNISDCPLVVHDQCLGSS

Query:  ARMNDEGDFYCPFCLYSLAISEYLEAKKKAASAKKNVTTFIRISLEHRPIVIKEVLQRTDLGPSRKAGVQDVAKICEDVDLENKDNQETLNGENVNEVPD
        ARMNDEGDF CPFC YSLAISEYLEAK+ AA AKKNV TFIRI LEH+ I IKEVLQR DLGPSRKAGV+DVAKICEDV+LENKDNQ TL+GE+VNEV D
Subjt:  ARMNDEGDFYCPFCLYSLAISEYLEAKKKAASAKKNVTTFIRISLEHRPIVIKEVLQRTDLGPSRKAGVQDVAKICEDVDLENKDNQETLNGENVNEVPD

Query:  HQSPKVTDIERMIKLSKPLHIANSNHRENEASPLRVAPDVLAREKDGDELVDQEYQGNI-----------------------------------------
        HQSPK TDIER  KLSKPL I+NSNHRENEA+PLRVAPDVLA EKDGDELVDQE +GN                                          
Subjt:  HQSPKVTDIERMIKLSKPLHIANSNHRENEASPLRVAPDVLAREKDGDELVDQEYQGNI-----------------------------------------

Query:  ----------VAELEDGLKTTEQYDFCEFLHED--------KQEGLQYQTDDNEEEPVYALNIEGEKSSDDEDDESIISRYSIRCRRKYYHPCPETPQLR
                  VAELEDGLK TEQYD  EF+HED        KQEGL+YQTDDN+EEPVYA+NIEGEKSSDDE+DESIISRYSIR R++Y+  CPETPQ R
Subjt:  ----------VAELEDGLKTTEQYDFCEFLHED--------KQEGLQYQTDDNEEEPVYALNIEGEKSSDDEDDESIISRYSIRCRRKYYHPCPETPQLR

Query:  RKKLPWTAEEEETLREGVRKFSSSAERSPTIPWKKILDFGSTVFLKGRTSVDLKDKWRNLCRSPLLK
        RKKLPWTAEEEETLREGVRKFSSS +RSPTIPWKKIL+FGSTVFLKGRTS+DLKDKWRNLCRSP  K
Subjt:  RKKLPWTAEEEETLREGVRKFSSSAERSPTIPWKKILDFGSTVFLKGRTSVDLKDKWRNLCRSPLLK

XP_022154231.1 uncharacterized protein LOC111021537 isoform X3 [Momordica charantia]; e value 6.2e-310; %identity 54.77
Query:  MENDPESASSSTLSWRWAIEALASTKEVNTYLLHDVIDTAEELPEDTRQNAGEMVALKCLEGLLDPLNHTRGHGPPAQYSKVTFDSSESCEGVIKRIYEE
        MEN+PESAS   L+WR  IEALAS  +V   LLHDVI+ A EL +D R+NAGEMVAL+CLEGL   LN  R   PPA  SKVTFDSSESCE V+KRIY+E
Subjt:  MENDPESASSSTLSWRWAIEALASTKEVNTYLLHDVIDTAEELPEDTRQNAGEMVALKCLEGLLDPLNHTRGHGPPAQYSKVTFDSSESCEGVIKRIYEE

Query:  TPQSALGVAGPDLVKWDVIPFVAQKRASMHCTLQQMKDTLLDGTHPYVDFLKQKSGLASVNKRDSISLNNDDRIELSKRLDRSSSDPQSQKEKSKGSPLL
        TP+SAL VAGP+++KWDV PF+AQKRASM  TL Q+KDT+LDGTHPYVDFLK KSGL  VNKRD I+LNNDDR ELS+RLD SS D Q QKEK KGSPLL
Subjt:  TPQSALGVAGPDLVKWDVIPFVAQKRASMHCTLQQMKDTLLDGTHPYVDFLKQKSGLASVNKRDSISLNNDDRIELSKRLDRSSSDPQSQKEKSKGSPLL

Query:  HEDER-----------------------------------------------GL----------------------------------------------
         EDER                                               GL                                              
Subjt:  HEDER-----------------------------------------------GL----------------------------------------------

Query:  -------SVADPSSFSLLPSKRSRVDFTSEDEARQWPACDDGYTNVKMLK--------------------------------------------------
               SVA+PSS SLLPSKRSRVD  SEDEA Q+P CDDG+ NVK L+                                                  
Subjt:  -------SVADPSSFSLLPSKRSRVDFTSEDEARQWPACDDGYTNVKMLK--------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------QHSAHTLYSGQEGASSHGTELVEDSPERVVPQNEGDETDLLDEPRMTL-V
                                                          QHSA TLYSGQE ASSHGTEL+EDS ERVVPQNEGD+   LDE  MTL V
Subjt:  --------------------------------------------------QHSAHTLYSGQEGASSHGTELVEDSPERVVPQNEGDETDLLDEPRMTL-V

Query:  EDKLVEEEHFGSKRSGQCTATDESHQGESGIPCYTMPAHTQDGEMHEVIFVEKVKDRSELPFERTESTTSPAEGNPHNTSTDGSKCDFGHDYHVQAMNTM
        EDKL +EEHFG KRS  CTATDE HQ ESGIPC+TMPA  QD EMHE+I VEKV DRS LP E   S +S AEGN HN   D SKCD GHD HV A+NTM
Subjt:  EDKLVEEEHFGSKRSGQCTATDESHQGESGIPCYTMPAHTQDGEMHEVIFVEKVKDRSELPFERTESTTSPAEGNPHNTSTDGSKCDFGHDYHVQAMNTM

Query:  SHSGFLPKTVATNIDVGMNPDEEEKDMLSDSDGYHNERIDIAMKKNEFFSSQCMVDHDSLLLADRRELTLCVKCNEGGQLLSCNISDCPLVVHDQCLGSS
        SHS F P+TVAT+IDVGMNPDEEEKDMLSDSDG+ N+ IDIAMKKNEFFSSQC+VDHDS  LADR+ELT+CVKCNEGGQLLSCNISDC LVVHD+CLG S
Subjt:  SHSGFLPKTVATNIDVGMNPDEEEKDMLSDSDGYHNERIDIAMKKNEFFSSQCMVDHDSLLLADRRELTLCVKCNEGGQLLSCNISDCPLVVHDQCLGSS

Query:  ARMNDEGDFYCPFCLYSLAISEYLEAKKKAASAKKNVTTFIRISLEHRPIVIKEVLQRTDLGPSRKAGVQDVAKICEDVDLENKDNQETLNGENVNEVPD
        ARMNDEGDF CPFC YSLAISEYLEAK+ AA AKKNV TFIRI LEH+ I IKEVLQR DLGPSRKAGV+DVAKICEDV+LENKDNQ TL+GE+VNEV D
Subjt:  ARMNDEGDFYCPFCLYSLAISEYLEAKKKAASAKKNVTTFIRISLEHRPIVIKEVLQRTDLGPSRKAGVQDVAKICEDVDLENKDNQETLNGENVNEVPD

Query:  HQSPKVTDIERMIKLSKPLHIANSNHRENEASPLRVAPDVLAREKDGDELVDQEYQGNI--------------------------------------VAE
        HQSPK TDIER  KLSKPL I+NSNHRENEA+PLRVAPDVLA EKDGDELVDQE +GN                                       VAE
Subjt:  HQSPKVTDIERMIKLSKPLHIANSNHRENEASPLRVAPDVLAREKDGDELVDQEYQGNI--------------------------------------VAE

Query:  LEDGLKTTEQYDFCEFLHED--------KQEGLQYQTDDNEEEPVYALNIEGEKSSDDEDDESIISRYSIRCRRKYYHPCPETPQLRRKKLPWTAEEEET
        LEDGLK TEQYD  EF+HED        KQEGL+YQTDDN+EEPVYA+NIEGEKSSDDE+DESIISRYSIR R++Y+  CPETPQ RRKKLPWTAEEEET
Subjt:  LEDGLKTTEQYDFCEFLHED--------KQEGLQYQTDDNEEEPVYALNIEGEKSSDDEDDESIISRYSIRCRRKYYHPCPETPQLRRKKLPWTAEEEET

Query:  LREGVRKFSSSAERSPTIPWKKILDFGSTVFLKGRTSVDLKDKWRNLCRSPLLK
        LREGVRKFSSS +RSPTIPWKKIL+FGSTVFLKGRTS+DLKDKWRNLCRSP  K
Subjt:  LREGVRKFSSSAERSPTIPWKKILDFGSTVFLKGRTSVDLKDKWRNLCRSPLLK

XP_023527258.1 uncharacterized protein LOC111790548 isoform X1 [Cucurbita pepo subsp. pepo]; e value 2.9e-273; %identity 66.54
Query:  MENDPESASSSTLSWRWAIEALASTKEVNTYLLHDVIDTAEELPEDTRQNAGEMVALKCLEGLLDPLNHTRGHGPPAQYSKVTFDSSESCEGVIKRIYEE
        M+N  ESAS+S+L+WRW IEALAS +EV   LLHDVID   EL + TR+NAGEMVALKCLEGL   LN+   +  P Q SKV FDSSESCE V KRIY+E
Subjt:  MENDPESASSSTLSWRWAIEALASTKEVNTYLLHDVIDTAEELPEDTRQNAGEMVALKCLEGLLDPLNHTRGHGPPAQYSKVTFDSSESCEGVIKRIYEE

Query:  TPQSALGVAGPDLVKWDVIPFVAQKRASMHCTLQQMKDTLLDGTHPYVDFLKQKSGLASVNKRDSISLNNDDRIELSKRLDRSSSDPQSQKEKSKGSPLL
        TP+S+L VAGPDL+KWDV  F  QKRASM CTL ++KD +LDGTHPY DFL QKSGL  +NKRD+I LNN+D IELS RLD SSS P+ Q EK KGSPLL
Subjt:  TPQSALGVAGPDLVKWDVIPFVAQKRASMHCTLQQMKDTLLDGTHPYVDFLKQKSGLASVNKRDSISLNNDDRIELSKRLDRSSSDPQSQKEKSKGSPLL

Query:  HEDERGLSVADPSSFSLLPSKRSRVDFTSEDEARQWPACDDGYTNVKMLKQHSAHTLYSGQEGASSHGTELVEDSPERVVPQNEGDETDLLDEPRMTLVE
         ED+R +SV +P S SLLPSKRS VDFTSEDEARQ P  DDGY NVK LKQHSAHT +SGQE ASSH TE++EDSPER VPQNE D+TD LDE ++T V+
Subjt:  HEDERGLSVADPSSFSLLPSKRSRVDFTSEDEARQWPACDDGYTNVKMLKQHSAHTLYSGQEGASSHGTELVEDSPERVVPQNEGDETDLLDEPRMTLVE

Query:  DKLVEEEHFGSKRSGQCTATDESHQGESGIPCYTMPAHTQDGEMHEVIFVEKVKDRSELPFERTESTTSPAEGNPHNTSTDGSKCDFGHDYHVQAMNTMS
        D+LVE+ HFGSK+         SHQ +SGIPCYTMPA T+D EM EV+ VEKVKD SELPFE   S  SPAEGN HNTS D SKCD GHDY V   NTMS
Subjt:  DKLVEEEHFGSKRSGQCTATDESHQGESGIPCYTMPAHTQDGEMHEVIFVEKVKDRSELPFERTESTTSPAEGNPHNTSTDGSKCDFGHDYHVQAMNTMS

Query:  HSGFLPKTVATNIDVGMNPDEEEKDMLSDSDGYHNERIDIAMKKNEFFSSQCMVDHDSLLLADRRELTLCVKCNEGGQLLSCNISDCPLVVHDQCLGSSA
         SGF+ KTVATN++VG+ PD +EKD+LSDSDGYH E IDIA +K EF SSQCMVDHDS  LAD R L +CVKCNEGGQLL CNISDCPLVVH +CL SSA
Subjt:  HSGFLPKTVATNIDVGMNPDEEEKDMLSDSDGYHNERIDIAMKKNEFFSSQCMVDHDSLLLADRRELTLCVKCNEGGQLLSCNISDCPLVVHDQCLGSSA

Query:  RMNDEGDFYCPFCLYSLAISEYLEAKKKAASAKKNVTTFIRISLEHRPIVIKEVLQRTDLGPSRKAGVQDVAKICEDVDLENKDNQETLNGENVNEVPDH
         M DEGDF CPFCLYSLAISEYLEAKK  AS KKNV +F R +L H+  V++EVLQ+ D+  S++A V+DVAKICEDVDLE+KDNQ +L+GE VNEV D+
Subjt:  RMNDEGDFYCPFCLYSLAISEYLEAKKKAASAKKNVTTFIRISLEHRPIVIKEVLQRTDLGPSRKAGVQDVAKICEDVDLENKDNQETLNGENVNEVPDH

Query:  QSPKVTDIERMIKLSKPLHIANSNHRENEASPLRVAPDVLAREKDGDELVDQEYQGNI------------VAELEDGLKTTEQYDFCEFLHEDK------
        QS   TD E+M +LSKPLHIANSNHR+N+ASP RVA D L  +++G ELVDQE QGN             VAE EDG K TEQ+D  E LHE +      
Subjt:  QSPKVTDIERMIKLSKPLHIANSNHRENEASPLRVAPDVLAREKDGDELVDQEYQGNI------------VAELEDGLKTTEQYDFCEFLHEDK------

Query:  --QEGLQYQTDDNEEEPVYALNIEGEKSSDDEDDESIISRYSIRCRRKYYHPCPETPQLRRKKLPWTAEEEETL
          Q GLQYQTDD+E +   A+  EGEKSSDD +DESIISRYSIR R+K +H  PET  LRRKKL WTAEEEET+
Subjt:  --QEGLQYQTDDNEEEPVYALNIEGEKSSDDEDDESIISRYSIRCRRKYYHPCPETPQLRRKKLPWTAEEEETL

XP_023527259.1 uncharacterized protein LOC111790548 isoform X2 [Cucurbita pepo subsp. pepo]; e value 2.5e-269; %identity 65.89
Query:  MENDPESASSSTLSWRWAIEALASTKEVNTYLLHDVIDTAEELPEDTRQNAGEMVALKCLEGLLDPLNHTRGHGPPAQYSKVTFDSSESCEGVIKRIYEE
        M+N  ESAS+S+L+WRW IEALAS +EV   LLH+++D        TR+NAGEMVALKCLEGL   LN+   +  P Q SKV FDSSESCE V KRIY+E
Subjt:  MENDPESASSSTLSWRWAIEALASTKEVNTYLLHDVIDTAEELPEDTRQNAGEMVALKCLEGLLDPLNHTRGHGPPAQYSKVTFDSSESCEGVIKRIYEE

Query:  TPQSALGVAGPDLVKWDVIPFVAQKRASMHCTLQQMKDTLLDGTHPYVDFLKQKSGLASVNKRDSISLNNDDRIELSKRLDRSSSDPQSQKEKSKGSPLL
        TP+S+L VAGPDL+KWDV  F  QKRASM CTL ++KD +LDGTHPY DFL QKSGL  +NKRD+I LNN+D IELS RLD SSS P+ Q EK KGSPLL
Subjt:  TPQSALGVAGPDLVKWDVIPFVAQKRASMHCTLQQMKDTLLDGTHPYVDFLKQKSGLASVNKRDSISLNNDDRIELSKRLDRSSSDPQSQKEKSKGSPLL

Query:  HEDERGLSVADPSSFSLLPSKRSRVDFTSEDEARQWPACDDGYTNVKMLKQHSAHTLYSGQEGASSHGTELVEDSPERVVPQNEGDETDLLDEPRMTLVE
         ED+R +SV +P S SLLPSKRS VDFTSEDEARQ P  DDGY NVK LKQHSAHT +SGQE ASSH TE++EDSPER VPQNE D+TD LDE ++T V+
Subjt:  HEDERGLSVADPSSFSLLPSKRSRVDFTSEDEARQWPACDDGYTNVKMLKQHSAHTLYSGQEGASSHGTELVEDSPERVVPQNEGDETDLLDEPRMTLVE

Query:  DKLVEEEHFGSKRSGQCTATDESHQGESGIPCYTMPAHTQDGEMHEVIFVEKVKDRSELPFERTESTTSPAEGNPHNTSTDGSKCDFGHDYHVQAMNTMS
        D+LVE+ HFGSK+         SHQ +SGIPCYTMPA T+D EM EV+ VEKVKD SELPFE   S  SPAEGN HNTS D SKCD GHDY V   NTMS
Subjt:  DKLVEEEHFGSKRSGQCTATDESHQGESGIPCYTMPAHTQDGEMHEVIFVEKVKDRSELPFERTESTTSPAEGNPHNTSTDGSKCDFGHDYHVQAMNTMS

Query:  HSGFLPKTVATNIDVGMNPDEEEKDMLSDSDGYHNERIDIAMKKNEFFSSQCMVDHDSLLLADRRELTLCVKCNEGGQLLSCNISDCPLVVHDQCLGSSA
         SGF+ KTVATN++VG+ PD +EKD+LSDSDGYH E IDIA +K EF SSQCMVDHDS  LAD R L +CVKCNEGGQLL CNISDCPLVVH +CL SSA
Subjt:  HSGFLPKTVATNIDVGMNPDEEEKDMLSDSDGYHNERIDIAMKKNEFFSSQCMVDHDSLLLADRRELTLCVKCNEGGQLLSCNISDCPLVVHDQCLGSSA

Query:  RMNDEGDFYCPFCLYSLAISEYLEAKKKAASAKKNVTTFIRISLEHRPIVIKEVLQRTDLGPSRKAGVQDVAKICEDVDLENKDNQETLNGENVNEVPDH
         M DEGDF CPFCLYSLAISEYLEAKK  AS KKNV +F R +L H+  V++EVLQ+ D+  S++A V+DVAKICEDVDLE+KDNQ +L+GE VNEV D+
Subjt:  RMNDEGDFYCPFCLYSLAISEYLEAKKKAASAKKNVTTFIRISLEHRPIVIKEVLQRTDLGPSRKAGVQDVAKICEDVDLENKDNQETLNGENVNEVPDH

Query:  QSPKVTDIERMIKLSKPLHIANSNHRENEASPLRVAPDVLAREKDGDELVDQEYQGNI------------VAELEDGLKTTEQYDFCEFLHEDK------
        QS   TD E+M +LSKPLHIANSNHR+N+ASP RVA D L  +++G ELVDQE QGN             VAE EDG K TEQ+D  E LHE +      
Subjt:  QSPKVTDIERMIKLSKPLHIANSNHRENEASPLRVAPDVLAREKDGDELVDQEYQGNI------------VAELEDGLKTTEQYDFCEFLHEDK------

Query:  --QEGLQYQTDDNEEEPVYALNIEGEKSSDDEDDESIISRYSIRCRRKYYHPCPETPQLRRKKLPWTAEEEETL
          Q GLQYQTDD+E +   A+  EGEKSSDD +DESIISRYSIR R+K +H  PET  LRRKKL WTAEEEET+
Subjt:  --QEGLQYQTDDNEEEPVYALNIEGEKSSDDEDDESIISRYSIRCRRKYYHPCPETPQLRRKKLPWTAEEEETL

XP_038904579.1 uncharacterized protein LOC120090944 [Benincasa hispida]; e value 2.6e-290; %identity 64.77
Query:  MENDPESASSSTLSWRWAIEALASTKEVNTYLLHDVIDTAEELPEDTRQNAGEMVALKCLEGLLDPLNHTRGHGPPAQYSKVTFDSSESCEGVIKRIYEE
        MEN  ESASSS L+WRW IEALA  +EV   LLHDVID A EL + TR+NAGEMVALKCLEGL  PLN    +GPPAQ SKV FDSSESCE V+KRIY+E
Subjt:  MENDPESASSSTLSWRWAIEALASTKEVNTYLLHDVIDTAEELPEDTRQNAGEMVALKCLEGLLDPLNHTRGHGPPAQYSKVTFDSSESCEGVIKRIYEE

Query:  TPQSALGVAGPDLVKWDVIPFVAQKRASMHCTLQQMKDTLLDGTHPYVDFLKQKSGLASVNKRDSISLNNDDRIELSKRLDRSSSDPQSQKEKSKGSPLL
        TP+SAL VAGPD++KWDV PF+ QK ASM CTL Q+KD++LDGTHPY DFL QKSGL  +NKRD+ISLNN+D I+LS+RLD SSS PQ +KE+ KGSPLL
Subjt:  TPQSALGVAGPDLVKWDVIPFVAQKRASMHCTLQQMKDTLLDGTHPYVDFLKQKSGLASVNKRDSISLNNDDRIELSKRLDRSSSDPQSQKEKSKGSPLL

Query:  HEDERGLSVADPSSFSLLPSKRSRVDFTSEDEARQWPACDDGYTNVKMLKQHSAHTLYSGQEGASSHGTELVEDSPERVVPQNEGDETDLLDEPRMTLVE
         +DE  +S+ +PSS SLLPSKRS VDFTSEDEARQ P CDDG+ NVK LK HSA TLYSGQE ASSHGTELVEDS ER  PQ E D+T+ LD  ++TLV 
Subjt:  HEDERGLSVADPSSFSLLPSKRSRVDFTSEDEARQWPACDDGYTNVKMLKQHSAHTLYSGQEGASSHGTELVEDSPERVVPQNEGDETDLLDEPRMTLVE

Query:  DKLVEEEHFGSKRSGQCTATDESHQGESGIPCYTMPAHTQDGEMHEVIFVEKVKDRSELPFERTESTTSPAEGNPHNTSTDGSKCDFGHDYHVQAMNTMS
        DKLVEEEHFGSK+SGQCTATDE H  ES IP YT+ A TQDGEM EV+  EKV D  ELPFE   S  SPAEG P+N     SKCD GHDYHV  M T+S
Subjt:  DKLVEEEHFGSKRSGQCTATDESHQGESGIPCYTMPAHTQDGEMHEVIFVEKVKDRSELPFERTESTTSPAEGNPHNTSTDGSKCDFGHDYHVQAMNTMS

Query:  HSGFLPKTVATNIDVGMNPDEEEKDMLSDSDGYHNERIDIAMKKNEFFSSQCMVDHDSLLLADRRELTLCVKCNEGGQLLSCNISDCPLVVHDQCLGSSA
        HSGFL  TVATNIDVGMNPDE+EKD+LSDSDGYH E IDIAM+K EF SSQCMVD DS LLADRRE+T+CVKCNEGGQLLSCNISDCPLVVH +CLGSSA
Subjt:  HSGFLPKTVATNIDVGMNPDEEEKDMLSDSDGYHNERIDIAMKKNEFFSSQCMVDHDSLLLADRRELTLCVKCNEGGQLLSCNISDCPLVVHDQCLGSSA

Query:  RMNDEGDFYCPFCLYSLAISEYLEAKKKAASAKKNVTTFIR-ISLEHRPIVIKEVLQRTDLGPSRKAGVQDVAKICEDVDLENKDNQETLNGENVNEVPD
        RMNDEG+F CPFCLYSLAIS+YLEAKK AA AKKNV  F+   +LE + I I+EVLQ+ DL PSR+AGV+DVAKI EDVDLENK+N+ TL+GE+VNE  D
Subjt:  RMNDEGDFYCPFCLYSLAISEYLEAKKKAASAKKNVTTFIR-ISLEHRPIVIKEVLQRTDLGPSRKAGVQDVAKICEDVDLENKDNQETLNGENVNEVPD

Query:  HQSPKVTDIERMIKLSKPLHIANSNHRENEASPLRVAPDVLAREKDGDELVDQEYQGNI-----------------------------------------
         QS  +TD ER+I+LSKP+H ANSNHRENE+S LRVAPDVL+ EKD +ELVD+E  GN                                          
Subjt:  HQSPKVTDIERMIKLSKPLHIANSNHRENEASPLRVAPDVLAREKDGDELVDQEYQGNI-----------------------------------------

Query:  -----------------------------VAELEDGLKTTEQYDFCEFLHEDK--------QEGLQYQTDDNEEEPVYALNIEGEKSSDDEDDESIISRY
                                     VAEL+DG K TEQ++  + LH+D+        ++ LQYQTDDNE+E   A+  EGEKSSDD +D+SIISRY
Subjt:  -----------------------------VAELEDGLKTTEQYDFCEFLHEDK--------QEGLQYQTDDNEEEPVYALNIEGEKSSDDEDDESIISRY

Query:  SIRCRRKYYHPCPETPQLRRKKLPWTAEEEETLREGVRKFSSSAERSPTIPWKKILDFGSTVF
        SIR R+KY+H   ET   RRKKLPWTAEEEE + EGVRKFSSS +RSPTIPWKKIL+FGS+VF
Subjt:  SIRCRRKYYHPCPETPQLRRKKLPWTAEEEETLREGVRKFSSSAERSPTIPWKKILDFGSTVF

TrEMBL top hits (e value, %identity, alignment)
A0A1S3B7A1 uncharacterized protein LOC103486808; e value 4.3e-259; %identity 60.79
Query:  MENDPESASSSTLSWRWAIEALASTKEVNTYLLHDVIDTAEELPEDTRQNAGEMVALKCLEGLLDPLNHTRGHGPPAQYSKVTFDSSESCEGVIKRIYEE
        MEN+  SASSS L+WRW IEALAS  +V   LLHDVI+TA EL + TR NAGEMVAL+CLEGL  PL+    +G PAQ SKV FDSSESC  V+KRIY E
Subjt:  MENDPESASSSTLSWRWAIEALASTKEVNTYLLHDVIDTAEELPEDTRQNAGEMVALKCLEGLLDPLNHTRGHGPPAQYSKVTFDSSESCEGVIKRIYEE

Query:  TPQSALGVAGPDLVKWDVIPFVAQKRASMHCTLQQMKDTLLDGTHPYVDFLKQKSGLASVNKRDSISLNNDDRIELSKRLDRSSSDPQSQKEKSKGSPLL
        TP+SALGVAGPD+ KWDV PF+ QKRASM CTL Q+KD++LDGTHPY +FL  KSGL  +NKRD  SLNN+D +EL +RLD SSS PQ +KE  KGSPLL
Subjt:  TPQSALGVAGPDLVKWDVIPFVAQKRASMHCTLQQMKDTLLDGTHPYVDFLKQKSGLASVNKRDSISLNNDDRIELSKRLDRSSSDPQSQKEKSKGSPLL

Query:  HEDERGLSVADPSSFSLLPSKRSRVDFTSEDEARQWPACDDGYTNVKMLKQHSAHTLYSGQEGASSHGTELV----------------------------
         EDER +SV  PSS SLLP+KRS ++FTSEDEA Q P CDDG+ NVK LK HSAH LYSGQE ASSHGTE+V                            
Subjt:  HEDERGLSVADPSSFSLLPSKRSRVDFTSEDEARQWPACDDGYTNVKMLKQHSAHTLYSGQEGASSHGTELV----------------------------

Query:  ----------------------EDSPERVVPQNEGDETDLLDEPRMTLVEDKLVEEEHFGSKRSGQCTATDESHQGESGIPCYTMPAHTQDGEMHEVIFV
                              EDS ER  PQ E D+ D LD  ++ LVEDKLVEEEH GSK   QCTATDE H GESGIPCYT+   TQDGE  EV+  
Subjt:  ----------------------EDSPERVVPQNEGDETDLLDEPRMTLVEDKLVEEEHFGSKRSGQCTATDESHQGESGIPCYTMPAHTQDGEMHEVIFV

Query:  EKVKDRSELPFERTESTTSPAEGNPHNTSTDGSKCDFGHDYHVQAMNTMSHSGFLPKTVATNIDVGMNPDEEEKDMLSDSDGYHNERIDIAMKKNEFFSS
        EKV D SELPFE      SPAEGN  NT  + SK DFGHD+HV  MN +SHSGF+  TVAT+ DVGM PDEEEKDMLSD+D YH E +DIAM+K EF SS
Subjt:  EKVKDRSELPFERTESTTSPAEGNPHNTSTDGSKCDFGHDYHVQAMNTMSHSGFLPKTVATNIDVGMNPDEEEKDMLSDSDGYHNERIDIAMKKNEFFSS

Query:  QCMVDHDSLLLADRRELTLCVKCNEGGQLLSCNISDCPLVVHDQCLGSSARMNDEGDFYCPFCLYSLAISEYLEAKKKAASAKKNVTTFIRISLEHRPIV
        QCMVD DS L+ADR ELT+CVKCNEGGQLLSCN  DCPLVVH +CLGS A MNDE DF CPFCLYS AISEYLEAKK AA AKKNVT+F R +LEH  I 
Subjt:  QCMVDHDSLLLADRRELTLCVKCNEGGQLLSCNISDCPLVVHDQCLGSSARMNDEGDFYCPFCLYSLAISEYLEAKKKAASAKKNVTTFIRISLEHRPIV

Query:  IKEVLQRTDLGPSRKAGVQDVAKICEDVDLENKDNQETLNGENVNEVPDHQSPKVTDIERMIK-----------LSKPLHIANSNHRENEASPLRVAPDV
         K VLQ  DL PSR+AGV+DVAKICEDVD+ENKDNQ T++GE+VNEV DHQS  VTD ER I            LSK ++IAN+NHRENE+S LRVAPDV
Subjt:  IKEVLQRTDLGPSRKAGVQDVAKICEDVDLENKDNQETLNGENVNEVPDHQSPKVTDIERMIK-----------LSKPLHIANSNHRENEASPLRVAPDV

Query:  LAREKDGD--------------------------ELVDQEYQGNIVAELEDGLKTTEQYDFCEFLHEDK--------QEGLQYQTDDNEEEPVYALNIEG
        L+ EKD +                          ELVDQE QGN  A+LEDG  +T+Q+   E LHED+        +E LQYQT+DNE+E   A+  E 
Subjt:  LAREKDGD--------------------------ELVDQEYQGNIVAELEDGLKTTEQYDFCEFLHEDK--------QEGLQYQTDDNEEEPVYALNIEG

Query:  EKSSDDEDDESIISRYSIRCRRKYYHPCPETPQLRRKKL
        EKSSDD +DESIISRYSIR R+KY+H   ET  L RKKL
Subjt:  EKSSDDEDDESIISRYSIRCRRKYYHPCPETPQLRRKKL

A0A5A7TK87 PHD domain-containing protein; e value 1.0e-255; %identity 60.85
Query:  MENDPESASSSTLSWRWAIEALASTKEVNTYLLHDVIDTAEELPEDTRQNAGEMVALKCLEGLLDPLNHTRGHGPPAQYSKVTFDSSESCEGVIKRIYEE
        MEN+  SASSS L+WRW IEALAS  +V   LLHDVI+TA EL + TR NAGEMVAL+CLEGL  PL+    +G PAQ SKV FDSSESC  V+KRIY E
Subjt:  MENDPESASSSTLSWRWAIEALASTKEVNTYLLHDVIDTAEELPEDTRQNAGEMVALKCLEGLLDPLNHTRGHGPPAQYSKVTFDSSESCEGVIKRIYEE

Query:  TPQSALGVAGPDLVKWDVIPFVAQKRASMHCTLQQMKDTLLDGTHPYVDFLKQKSGLASVNKRDSISLNNDDRIELSKRLDRSSSDPQSQKEKSKGSPLL
        TP+SALGVAGPD+ KWDV PF+ QKRASM CTL Q+KD++LDGTHPY +FL  KSGL  +NKRD  SLNN+D +EL +RLD SSS PQ +KE  KGSPLL
Subjt:  TPQSALGVAGPDLVKWDVIPFVAQKRASMHCTLQQMKDTLLDGTHPYVDFLKQKSGLASVNKRDSISLNNDDRIELSKRLDRSSSDPQSQKEKSKGSPLL

Query:  HEDERGLSVADPSSFSLLPSKRSRVDFTSEDEARQWPACDDGYTNVKMLKQHSAHTLYSGQEGASSHGTELV----------------------------
         EDER +SV  PSS SLLP+KRS ++FTSEDEA Q P CDDG+ NVK LK HSAH LYSGQE ASSHGTE+V                            
Subjt:  HEDERGLSVADPSSFSLLPSKRSRVDFTSEDEARQWPACDDGYTNVKMLKQHSAHTLYSGQEGASSHGTELV----------------------------

Query:  ----------------------EDSPERVVPQNEGDETDLLDEPRMTLVEDKLVEEEHFGSKRSGQCTATDESHQGESGIPCYTMPAHTQDGEMHEVIFV
                              EDS ER  PQ E D+ D LD  ++ LVEDKLVEEEH GSK   QCTATDE H GESGIPCYT+   TQDGE  EV+  
Subjt:  ----------------------EDSPERVVPQNEGDETDLLDEPRMTLVEDKLVEEEHFGSKRSGQCTATDESHQGESGIPCYTMPAHTQDGEMHEVIFV

Query:  EKVKDRSELPFERTESTTSPAEGNPHNTSTDGSKCDFGHDYHVQAMNTMSHSGFLPKTVATNIDVGMNPDEEEKDMLSDSDGYHNERIDIAMKKNEFFSS
        EKV D SELPFE      SPAEGN  NT  + SK DFGHD+HV  MN +SHSGF+  TVAT+ DVGM PDEEEKDMLSD+D YH E +DIAM+K EF SS
Subjt:  EKVKDRSELPFERTESTTSPAEGNPHNTSTDGSKCDFGHDYHVQAMNTMSHSGFLPKTVATNIDVGMNPDEEEKDMLSDSDGYHNERIDIAMKKNEFFSS

Query:  QCMVDHDSLLLADRRELTLCVKCNEGGQLLSCNISDCPLVVHDQCLGSSARMNDEGDFYCPFCLYSLAISEYLEAKKKAASAKKNVTTFIRISLEHRPIV
        QCMVD DS L+ADR ELT+CVKCNEGGQLLSCN  DCPLVVH +CLGS A MNDE DF CPFCLYS AISEYLEAKK AA AKKNVT+F R +LEH  I 
Subjt:  QCMVDHDSLLLADRRELTLCVKCNEGGQLLSCNISDCPLVVHDQCLGSSARMNDEGDFYCPFCLYSLAISEYLEAKKKAASAKKNVTTFIRISLEHRPIV

Query:  IKEVLQRTDLGPSRKAGVQDVAKICEDVDLENKDNQETLNGENVNEVPDHQSPKVTDIERMIK-----------LSKPLHIANSNHRENEASPLRVAPDV
         K VLQ  DL PSR+AGV+DVAKICEDVD+ENKDNQ T++GE+VNEV DHQS  VTD ER I            LSK ++IAN+NHRENE+S LRVAPDV
Subjt:  IKEVLQRTDLGPSRKAGVQDVAKICEDVDLENKDNQETLNGENVNEVPDHQSPKVTDIERMIK-----------LSKPLHIANSNHRENEASPLRVAPDV

Query:  LAREKDGD--------------------------ELVDQEYQGNIVAELEDGLKTTEQYDFCEFLHEDK--------QEGLQYQTDDNEEEPVYALNIEG
        L+ EKD +                          ELVDQE QGN  A+LEDG  +T+Q+   E LHED+        +E LQYQT+DNE+E   A+  E 
Subjt:  LAREKDGD--------------------------ELVDQEYQGNIVAELEDGLKTTEQYDFCEFLHEDK--------QEGLQYQTDDNEEEPVYALNIEG

Query:  EKSSDDEDDESIISRYSIRCRRKYY
        EKSSDD +DESIISRYSIR R+KY+
Subjt:  EKSSDDEDDESIISRYSIRCRRKYY

A0A6J1DLF4 uncharacterized protein LOC111021537 isoform X1; e value 1.3e-308; %identity 54.16
Query:  MENDPESASSSTLSWRWAIEALASTKEVNTYLLHDVIDTAEELPEDTRQNAGEMVALKCLEGLLDPLNHTRGHGPPAQYSKVTFDSSESCEGVIKRIYEE
        MEN+PESAS   L+WR  IEALAS  +V   LLHDVI+ A EL +D R+NAGEMVAL+CLEGL   LN  R   PPA  SKVTFDSSESCE V+KRIY+E
Subjt:  MENDPESASSSTLSWRWAIEALASTKEVNTYLLHDVIDTAEELPEDTRQNAGEMVALKCLEGLLDPLNHTRGHGPPAQYSKVTFDSSESCEGVIKRIYEE

Query:  TPQSALGVAGPDLVKWDVIPFVAQKRASMHCTLQQMKDTLLDGTHPYVDFLKQKSGLASVNKRDSISLNNDDRIELSKRLDRSSSDPQSQKEKSKGSPLL
        TP+SAL VAGP+++KWDV PF+AQKRASM  TL Q+KDT+LDGTHPYVDFLK KSGL  VNKRD I+LNNDDR ELS+RLD SS D Q QKEK KGSPLL
Subjt:  TPQSALGVAGPDLVKWDVIPFVAQKRASMHCTLQQMKDTLLDGTHPYVDFLKQKSGLASVNKRDSISLNNDDRIELSKRLDRSSSDPQSQKEKSKGSPLL

Query:  HEDER-----------------------------------------------GL----------------------------------------------
         EDER                                               GL                                              
Subjt:  HEDER-----------------------------------------------GL----------------------------------------------

Query:  -------SVADPSSFSLLPSKRSRVDFTSEDEARQWPACDDGYTNVKMLK--------------------------------------------------
               SVA+PSS SLLPSKRSRVD  SEDEA Q+P CDDG+ NVK L+                                                  
Subjt:  -------SVADPSSFSLLPSKRSRVDFTSEDEARQWPACDDGYTNVKMLK--------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------QHSAHTLYSGQEGASSHGTELVEDSPERVVPQNEGDETDLLDEPRMTL-V
                                                          QHSA TLYSGQE ASSHGTEL+EDS ERVVPQNEGD+   LDE  MTL V
Subjt:  --------------------------------------------------QHSAHTLYSGQEGASSHGTELVEDSPERVVPQNEGDETDLLDEPRMTL-V

Query:  EDKLVEEEHFGSKRSGQCTATDESHQGESGIPCYTMPAHTQDGEMHEVIFVEKVKDRSELPFERTESTTSPAEGNPHNTSTDGSKCDFGHDYHVQAMNTM
        EDKL +EEHFG KRS  CTATDE HQ ESGIPC+TMPA  QD EMHE+I VEKV DRS LP E   S +S AEGN HN   D SKCD GHD HV A+NTM
Subjt:  EDKLVEEEHFGSKRSGQCTATDESHQGESGIPCYTMPAHTQDGEMHEVIFVEKVKDRSELPFERTESTTSPAEGNPHNTSTDGSKCDFGHDYHVQAMNTM

Query:  SHSGFLPKTVATNIDVGMNPDEEEKDMLSDSDGYHNERIDIAMKKNEFFSSQCMVDHDSLLLADRRELTLCVKCNEGGQLLSCNISDCPLVVHDQCLGSS
        SHS F P+TVAT+IDVGMNPDEEEKDMLSDSDG+ N+ IDIAMKKNEFFSSQC+VDHDS  LADR+ELT+CVKCNEGGQLLSCNISDC LVVHD+CLG S
Subjt:  SHSGFLPKTVATNIDVGMNPDEEEKDMLSDSDGYHNERIDIAMKKNEFFSSQCMVDHDSLLLADRRELTLCVKCNEGGQLLSCNISDCPLVVHDQCLGSS

Query:  ARMNDEGDFYCPFCLYSLAISEYLEAKKKAASAKKNVTTFIRISLEHRPIVIKEVLQRTDLGPSRKAGVQDVAKICEDVDLENKDNQETLNGENVNEVPD
        ARMNDEGDF CPFC YSLAISEYLEAK+ AA AKKNV TFIRI LEH+ I IKEVLQR DLGPSRKAGV+DVAKICEDV+LENKDNQ TL+GE+VNEV D
Subjt:  ARMNDEGDFYCPFCLYSLAISEYLEAKKKAASAKKNVTTFIRISLEHRPIVIKEVLQRTDLGPSRKAGVQDVAKICEDVDLENKDNQETLNGENVNEVPD

Query:  HQSPKVTDIERMIKLSKPLHIANSNHRENEASPLRVAPDVLAREKDGDELVDQEYQGNI-----------------------------------------
        HQSPK TDIER  KLSKPL I+NSNHRENEA+PLRVAPDVLA EKDGDELVDQE +GN                                          
Subjt:  HQSPKVTDIERMIKLSKPLHIANSNHRENEASPLRVAPDVLAREKDGDELVDQEYQGNI-----------------------------------------

Query:  ----------VAELEDGLKTTEQYDFCEFLHED--------KQEGLQYQTDDNEEEPVYALNIEGEKSSDDEDDESIISRYSIRCRRKYYHPCPETPQLR
                  VAELEDGLK TEQYD  EF+HED        KQEGL+YQTDDN+EEPVYA+NIEGEKSSDDE+DESIISRYSIR R++Y+  CPETPQ R
Subjt:  ----------VAELEDGLKTTEQYDFCEFLHED--------KQEGLQYQTDDNEEEPVYALNIEGEKSSDDEDDESIISRYSIRCRRKYYHPCPETPQLR

Query:  RKKLPWTAEEEETLREGVRKFSSSAERSPTIPWKKILDFGSTVFLKGRTSVDLKDKWRNLCRSPLLK
        RKKLPWTAEEEETLREGVRKFSSS +RSPTIPWKKIL+FGSTVFLKGRTS+DLKDKWRNLCRSP  K
Subjt:  RKKLPWTAEEEETLREGVRKFSSSAERSPTIPWKKILDFGSTVFLKGRTSVDLKDKWRNLCRSPLLK

A0A6J1DN48 uncharacterized protein LOC111021537 isoform X3; e value 3.0e-310; %identity 54.77
Query:  MENDPESASSSTLSWRWAIEALASTKEVNTYLLHDVIDTAEELPEDTRQNAGEMVALKCLEGLLDPLNHTRGHGPPAQYSKVTFDSSESCEGVIKRIYEE
        MEN+PESAS   L+WR  IEALAS  +V   LLHDVI+ A EL +D R+NAGEMVAL+CLEGL   LN  R   PPA  SKVTFDSSESCE V+KRIY+E
Subjt:  MENDPESASSSTLSWRWAIEALASTKEVNTYLLHDVIDTAEELPEDTRQNAGEMVALKCLEGLLDPLNHTRGHGPPAQYSKVTFDSSESCEGVIKRIYEE

Query:  TPQSALGVAGPDLVKWDVIPFVAQKRASMHCTLQQMKDTLLDGTHPYVDFLKQKSGLASVNKRDSISLNNDDRIELSKRLDRSSSDPQSQKEKSKGSPLL
        TP+SAL VAGP+++KWDV PF+AQKRASM  TL Q+KDT+LDGTHPYVDFLK KSGL  VNKRD I+LNNDDR ELS+RLD SS D Q QKEK KGSPLL
Subjt:  TPQSALGVAGPDLVKWDVIPFVAQKRASMHCTLQQMKDTLLDGTHPYVDFLKQKSGLASVNKRDSISLNNDDRIELSKRLDRSSSDPQSQKEKSKGSPLL

Query:  HEDER-----------------------------------------------GL----------------------------------------------
         EDER                                               GL                                              
Subjt:  HEDER-----------------------------------------------GL----------------------------------------------

Query:  -------SVADPSSFSLLPSKRSRVDFTSEDEARQWPACDDGYTNVKMLK--------------------------------------------------
               SVA+PSS SLLPSKRSRVD  SEDEA Q+P CDDG+ NVK L+                                                  
Subjt:  -------SVADPSSFSLLPSKRSRVDFTSEDEARQWPACDDGYTNVKMLK--------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------QHSAHTLYSGQEGASSHGTELVEDSPERVVPQNEGDETDLLDEPRMTL-V
                                                          QHSA TLYSGQE ASSHGTEL+EDS ERVVPQNEGD+   LDE  MTL V
Subjt:  --------------------------------------------------QHSAHTLYSGQEGASSHGTELVEDSPERVVPQNEGDETDLLDEPRMTL-V

Query:  EDKLVEEEHFGSKRSGQCTATDESHQGESGIPCYTMPAHTQDGEMHEVIFVEKVKDRSELPFERTESTTSPAEGNPHNTSTDGSKCDFGHDYHVQAMNTM
        EDKL +EEHFG KRS  CTATDE HQ ESGIPC+TMPA  QD EMHE+I VEKV DRS LP E   S +S AEGN HN   D SKCD GHD HV A+NTM
Subjt:  EDKLVEEEHFGSKRSGQCTATDESHQGESGIPCYTMPAHTQDGEMHEVIFVEKVKDRSELPFERTESTTSPAEGNPHNTSTDGSKCDFGHDYHVQAMNTM

Query:  SHSGFLPKTVATNIDVGMNPDEEEKDMLSDSDGYHNERIDIAMKKNEFFSSQCMVDHDSLLLADRRELTLCVKCNEGGQLLSCNISDCPLVVHDQCLGSS
        SHS F P+TVAT+IDVGMNPDEEEKDMLSDSDG+ N+ IDIAMKKNEFFSSQC+VDHDS  LADR+ELT+CVKCNEGGQLLSCNISDC LVVHD+CLG S
Subjt:  SHSGFLPKTVATNIDVGMNPDEEEKDMLSDSDGYHNERIDIAMKKNEFFSSQCMVDHDSLLLADRRELTLCVKCNEGGQLLSCNISDCPLVVHDQCLGSS

Query:  ARMNDEGDFYCPFCLYSLAISEYLEAKKKAASAKKNVTTFIRISLEHRPIVIKEVLQRTDLGPSRKAGVQDVAKICEDVDLENKDNQETLNGENVNEVPD
        ARMNDEGDF CPFC YSLAISEYLEAK+ AA AKKNV TFIRI LEH+ I IKEVLQR DLGPSRKAGV+DVAKICEDV+LENKDNQ TL+GE+VNEV D
Subjt:  ARMNDEGDFYCPFCLYSLAISEYLEAKKKAASAKKNVTTFIRISLEHRPIVIKEVLQRTDLGPSRKAGVQDVAKICEDVDLENKDNQETLNGENVNEVPD

Query:  HQSPKVTDIERMIKLSKPLHIANSNHRENEASPLRVAPDVLAREKDGDELVDQEYQGNI--------------------------------------VAE
        HQSPK TDIER  KLSKPL I+NSNHRENEA+PLRVAPDVLA EKDGDELVDQE +GN                                       VAE
Subjt:  HQSPKVTDIERMIKLSKPLHIANSNHRENEASPLRVAPDVLAREKDGDELVDQEYQGNI--------------------------------------VAE

Query:  LEDGLKTTEQYDFCEFLHED--------KQEGLQYQTDDNEEEPVYALNIEGEKSSDDEDDESIISRYSIRCRRKYYHPCPETPQLRRKKLPWTAEEEET
        LEDGLK TEQYD  EF+HED        KQEGL+YQTDDN+EEPVYA+NIEGEKSSDDE+DESIISRYSIR R++Y+  CPETPQ RRKKLPWTAEEEET
Subjt:  LEDGLKTTEQYDFCEFLHED--------KQEGLQYQTDDNEEEPVYALNIEGEKSSDDEDDESIISRYSIRCRRKYYHPCPETPQLRRKKLPWTAEEEET

Query:  LREGVRKFSSSAERSPTIPWKKILDFGSTVFLKGRTSVDLKDKWRNLCRSPLLK
        LREGVRKFSSS +RSPTIPWKKIL+FGSTVFLKGRTS+DLKDKWRNLCRSP  K
Subjt:  LREGVRKFSSSAERSPTIPWKKILDFGSTVFLKGRTSVDLKDKWRNLCRSPLLK

A0A6J1FB82 uncharacterized protein LOC111442439; e value 2.7e-269; %identity 65.76
Query:  MENDPESASSSTLSWRWAIEALASTKEVNTYLLHDVIDTAEELPEDTRQNAGEMVALKCLEGLLDPLNHTRGHGPPAQYSKVTFDSSESCEGVIKRIYEE
        M+N  ESAS+S+L+WRW IEALAS +EV   LLHDVID   EL + TR+NAGEMVALKCLEGL   L++   +  P Q SKV FDSSE CE V+KRIY+E
Subjt:  MENDPESASSSTLSWRWAIEALASTKEVNTYLLHDVIDTAEELPEDTRQNAGEMVALKCLEGLLDPLNHTRGHGPPAQYSKVTFDSSESCEGVIKRIYEE

Query:  TPQSALGVAGPDLVKWDVIPFVAQKRASMHCTLQQMKDTLLDGTHPYVDFLKQKSGLASVNKRDSISLNNDDRIELSKRLDRSSSDPQSQKEKSKGSPLL
        TP+S+L VAGPDL+KWDV  F  QKRASM CTL ++KD +LDGTHP  DFL QKSGL  +NKR  I LNN+D IELS RLD SSS P+ Q EK KGSPLL
Subjt:  TPQSALGVAGPDLVKWDVIPFVAQKRASMHCTLQQMKDTLLDGTHPYVDFLKQKSGLASVNKRDSISLNNDDRIELSKRLDRSSSDPQSQKEKSKGSPLL

Query:  HEDERGLSVADPSSFSLLPSKRSRVDFTSEDEARQWPACDDGYTNVKMLKQHSAHTLYSGQEGASSHGTELVEDSPERVVPQNEGDETDLLDEPRMTLVE
         ED+R +SV +P S SLLPSKRS VDFTSEDEARQ P C DGY NVK LKQHSAHT +SGQE ASSH TE++EDS ER VPQNE D+TD LDE ++T V+
Subjt:  HEDERGLSVADPSSFSLLPSKRSRVDFTSEDEARQWPACDDGYTNVKMLKQHSAHTLYSGQEGASSHGTELVEDSPERVVPQNEGDETDLLDEPRMTLVE

Query:  DKLVEEEHFGSKRSGQCTATDESHQGESGIPCYTMPAHTQDGEMHEVIFVEKVKDRSELPFERTESTTSPAEGNPHNTSTDGSKCDFGHDYHVQAMNTMS
        D+ VE+ HFGSK+          HQ +SGI CYTMPA TQD EM EV+ VEKVKD SELPFE   S  SPAE N HNTS D SKCD GHDYHV   NTMS
Subjt:  DKLVEEEHFGSKRSGQCTATDESHQGESGIPCYTMPAHTQDGEMHEVIFVEKVKDRSELPFERTESTTSPAEGNPHNTSTDGSKCDFGHDYHVQAMNTMS

Query:  HSGFLPKTVATNIDVGMNPDEEEKDMLSDSDGYHNERIDIAMKKNEFFSSQCMVDHDSLLLADRRELTLCVKCNEGGQLLSCNISDCPLVVHDQCLGSSA
         SGF+ KTVATN++VG+ PD +EKD+LSDSDGYH E IDIA +K EF SSQCMVDHDS  LAD R L +CVKCNEGGQLL CNISDCPLVVH +CL SSA
Subjt:  HSGFLPKTVATNIDVGMNPDEEEKDMLSDSDGYHNERIDIAMKKNEFFSSQCMVDHDSLLLADRRELTLCVKCNEGGQLLSCNISDCPLVVHDQCLGSSA

Query:  RMNDEGDFYCPFCLYSLAISEYLEAKKKAASAKKNVTTFIRISLEHRPIVIKEVLQRTDLGPSRKAGVQDVAKICEDVDLENKDNQETLNGENVNEVPDH
         M DEGDF CPFCLYSLAISEYLEAKK  AS KKNV +F R +L H+   ++EVLQ+ D+ PS++  V+DVAKICEDV+LE+KDNQ +L+GE VNEV DH
Subjt:  RMNDEGDFYCPFCLYSLAISEYLEAKKKAASAKKNVTTFIRISLEHRPIVIKEVLQRTDLGPSRKAGVQDVAKICEDVDLENKDNQETLNGENVNEVPDH

Query:  QSPKVTDIERMIKLSKPLHIANSNHRENEASPLRVAPDVLAREKDGDELVDQEYQGNI------------VAELEDGLKTTEQYDFCEFLHEDK------
        QS   TD E++ +LSKPLHIANSNHRE +ASP RVA D L  E++G ELVDQE QGN             VAE EDG K TEQ+D  E LHE +      
Subjt:  QSPKVTDIERMIKLSKPLHIANSNHRENEASPLRVAPDVLAREKDGDELVDQEYQGNI------------VAELEDGLKTTEQYDFCEFLHEDK------

Query:  --QEGLQYQTDDNEEEPVYALNIEGEKSSDDEDDESIISRYSIRCRRKYYHPCPETPQLRRKKLPWTAEEEETL
          Q GLQYQTDD+E +   A+  EGEKSSDD +DESIISRYSIR R+K +H  PET  LRRKKLPWTAEEEETL
Subjt:  --QEGLQYQTDDNEEEPVYALNIEGEKSSDDEDDESIISRYSIRCRRKYYHPCPETPQLRRKKLPWTAEEEETL

SwissProt top hits | e value | %identity | Alignment
F4I7L1 Telomere repeat-binding factor 4 | 2.2e-05 | 48.44
Query:  KKLPWTAEEEETLREGVRKFSSSAERSPTIPWKKIL-DFGSTVFLKGRTSVDLKDKWRNLCRSP
        +KL WTAEEEE L  GVRK            WK IL D      L  R+++DLKDKWRNL  +P
Subjt:  KKLPWTAEEEETLREGVRKFSSSAERSPTIPWKKIL-DFGSTVFLKGRTSVDLKDKWRNLCRSP

F4IEY4 Telomere repeat-binding factor 5 | 2.4e-04 | 46.88
Query:  KKLPWTAEEEETLREGVRKFSSSAERSPTIPWKKIL-DFGSTVFLKGRTSVDLKDKWRNLCRSP
        +KL WTAEEEE L  G+RK            WK IL D      L  R+++DLKDKWRNL   P
Subjt:  KKLPWTAEEEETLREGVRKFSSSAERSPTIPWKKIL-DFGSTVFLKGRTSVDLKDKWRNLCRSP

Q6WLH4 Single myb histone 3 | 9.1e-04 | 44.07
Query:  KLPWTAEEEETLREGVRKFSSSAERSPTIPWKKI-LDFGSTVFLKGRTSVDLKDKWRNL
        K  WT+EEE+ LR GVRK  +         W+ I  D   +  L  R+++DLKDKWRNL
Subjt:  KLPWTAEEEETLREGVRKFSSSAERSPTIPWKKI-LDFGSTVFLKGRTSVDLKDKWRNL

Q9FJW5 Telomere repeat-binding factor 2 | 9.1e-04 | 45.76
Query:  KLPWTAEEEETLREGVRKFSSSAERSPTIPWKKIL-DFGSTVFLKGRTSVDLKDKWRNL
        K  WT EEE  L+ GV K         T  W+ IL D   ++ LK R++VDLKDKWRN+
Subjt:  KLPWTAEEEETLREGVRKFSSSAERSPTIPWKKIL-DFGSTVFLKGRTSVDLKDKWRNL

Q9M2X3 Telomere repeat-binding factor 3 | 5.3e-04 | 47.46
Query:  KLPWTAEEEETLREGVRKFSSSAERSPTIPWKKIL-DFGSTVFLKGRTSVDLKDKWRNL
        KL WT EEE  L+ GV K         T  W+ IL D   +  LK R++VDLKDKWRN+
Subjt:  KLPWTAEEEETLREGVRKFSSSAERSPTIPWKKIL-DFGSTVFLKGRTSVDLKDKWRNL

Arabidopsis top hits | e value | %identity | Alignment
AT1G01150.1 Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain | 6.0e-11 | 52.54
Query:  KKLPWTAEEEETLREGVRKFSSSAERSPTIPWKKILDFGSTVFLKGRTSVDLKDKWRNL
        K++ WT  EE+ LREGV KFS +  ++  +PWKKIL+ G  +F   R S DLKDKWRN+
Subjt:  KKLPWTAEEEETLREGVRKFSSSAERSPTIPWKKILDFGSTVFLKGRTSVDLKDKWRNL

AT1G14770.1 RING/FYVE/PHD zinc finger superfamily protein | 9.6e-09 | 26.44
Query:  SASSSTLSWRWAIEALASTKEVNTYLLHDVIDTAEELPEDTRQNAGEMVALKCLEGLLDP-LNHTRGHG-PPAQYSKVTFDSSESCEGVIKRIYEETPQS
        S  +    W W IE +A   +  + LL D+++   +  +D  +   E+++L+ LE + DP ++   G G   A   KV FD S S   V++ I +E P +
Subjt:  SASSSTLSWRWAIEALASTKEVNTYLLHDVIDTAEELPEDTRQNAGEMVALKCLEGLLDP-LNHTRGHG-PPAQYSKVTFDSSESCEGVIKRIYEETPQS

Query:  ALGVAGPDLVKWDVIPFVAQKRASM-HCTLQQMKDTLLDGTHPYVDFLKQKSGLASVNKRDSISLNNDDRIELSKRLDRSSSDPQSQKEKSKGSPLLHED
         L V  P+L K++V+PF+A K   +  C L++++D  L           Q S    +   D +    DDR   S  +D    +P  +++   G+     D
Subjt:  ALGVAGPDLVKWDVIPFVAQKRASM-HCTLQQMKDTLLDGTHPYVDFLKQKSGLASVNKRDSISLNNDDRIELSKRLDRSSSDPQSQKEKSKGSPLLHED

Query:  ERGLSVAD
        E+ + + +
Subjt:  ERGLSVAD

AT1G14770.2 RING/FYVE/PHD zinc finger superfamily protein | 9.6e-09 | 26.44
Query:  SASSSTLSWRWAIEALASTKEVNTYLLHDVIDTAEELPEDTRQNAGEMVALKCLEGLLDP-LNHTRGHG-PPAQYSKVTFDSSESCEGVIKRIYEETPQS
        S  +    W W IE +A   +  + LL D+++   +  +D  +   E+++L+ LE + DP ++   G G   A   KV FD S S   V++ I +E P +
Subjt:  SASSSTLSWRWAIEALASTKEVNTYLLHDVIDTAEELPEDTRQNAGEMVALKCLEGLLDP-LNHTRGHG-PPAQYSKVTFDSSESCEGVIKRIYEETPQS

Query:  ALGVAGPDLVKWDVIPFVAQKRASM-HCTLQQMKDTLLDGTHPYVDFLKQKSGLASVNKRDSISLNNDDRIELSKRLDRSSSDPQSQKEKSKGSPLLHED
         L V  P+L K++V+PF+A K   +  C L++++D  L           Q S    +   D +    DDR   S  +D    +P  +++   G+     D
Subjt:  ALGVAGPDLVKWDVIPFVAQKRASM-HCTLQQMKDTLLDGTHPYVDFLKQKSGLASVNKRDSISLNNDDRIELSKRLDRSSSDPQSQKEKSKGSPLLHED

Query:  ERGLSVAD
        E+ + + +
Subjt:  ERGLSVAD

AT1G68030.1 RING/FYVE/PHD zinc finger superfamily protein | 1.7e-13 | 31.21
Query:  SWRWAIEALASTKEVNTYLLHDVIDTAEELPEDTRQNAGEMVALKCLEGLLDPLNHTRGHGPPAQYSKVTFDSSESCEGVIKRIYEETPQSALGVAGPDL
        +W W IE  A  K    ++L+DV + A +LP+   +   EMVA +CL  L D  +           S + FDSSESCE V++ I +E P S L    P L
Subjt:  SWRWAIEALASTKEVNTYLLHDVIDTAEELPEDTRQNAGEMVALKCLEGLLDPLNHTRGHGPPAQYSKVTFDSSESCEGVIKRIYEETPQSALGVAGPDL

Query:  VKWDVIPFVAQKRASM-HCTLQQMKDTLLDGTHPYVDFLKQKSGLASVNKRDSISLNNDDRIELSKRLDRSSS
         KW++ PF+  K  S+  C L+ M +         V    ++  L S  K +       D  +L+ R +   S
Subjt:  VKWDVIPFVAQKRASM-HCTLQQMKDTLLDGTHPYVDFLKQKSGLASVNKRDSISLNNDDRIELSKRLDRSSS

AT1G68030.1 RING/FYVE/PHD zinc finger superfamily protein | 4.2e-12 | 28.03
Query:  MNPDEEEKDM---LSDSDGYHNERIDIAMKKNEFFSSQCMVDHDSLLLADRRELTLCVKCNEGGQLLSCNISDCPLVVHDQCLGSSARMNDEGDFYCPFC
        + P  +E D+     + + +     ++  ++N  F  +   D  S L +   ++  CV C E G+LL C+   C ++VH +CL S    +D GDFYC  C
Subjt:  MNPDEEEKDM---LSDSDGYHNERIDIAMKKNEFFSSQCMVDHDSLLLADRRELTLCVKCNEGGQLLSCNISDCPLVVHDQCLGSSARMNDEGDFYCPFC

Query:  LYSLAISEYLEAKKKAASAKKNVTTFIRISLE
          +   +EY++ + + A AK+ + +F+R+  E
Subjt:  LYSLAISEYLEAKKKAASAKKNVTTFIRISLE

AT5G03780.1 TRF-like 10 | 3.9e-10 | 22.16
Query:  ELTLCVKCNEGGQLLS-CNISDCPLVVHDQCL---------GSSARMNDEGDFYCPFCLYSLAISEYLEAKKKAASAKKNVTTFIRISLEHRPIVIKEVL
        +L  C+ C    + +S C   DC L  H +CL          SS+   D  + +CP+C   +   +    ++K   A+K V  ++               
Subjt:  ELTLCVKCNEGGQLLS-CNISDCPLVVHDQCL---------GSSARMNDEGDFYCPFCLYSLAISEYLEAKKKAASAKKNVTTFIRISLEHRPIVIKEVL

Query:  QRTDLGPSRKAGVQDVAKICEDVDLENKDNQETLNGENVNEVPDHQSPKVTDIERMIKLSKPLHIANSNHRENEASPLRVAPDVLAREK-------DGDE
                             D +++++D   TL+G+ +    +  +  V+D E  ++  K    +  +  + +    +V  +V A EK       D ++
Subjt:  QRTDLGPSRKAGVQDVAKICEDVDLENKDNQETLNGENVNEVPDHQSPKVTDIERMIKLSKPLHIANSNHRENEASPLRVAPDVLAREK-------DGDE

Query:  LVDQEYQGNIVAELEDGLKTTEQYDFCEFLHEDKQEGLQYQTDDNEEE-----PVYALNIEGEKSSDDEDDESIISRYS--------------------I
            + QG  +     G K  E   F   + E      Q Q   NE+       +   +I  + SS++ + E +  + +                    +
Subjt:  LVDQEYQGNIVAELEDGLKTTEQYDFCEFLHEDKQEGLQYQTDDNEEE-----PVYALNIEGEKSSDDEDDESIISRYS--------------------I

Query:  RCRRKYYHPCPETPQLRRKKLPWTAEEEETLREGVRKFSSSAERSPTIPWKKILDFGSTVFLKGRTSVDLKDKWRNLCR
          + K           +R++L WT EEEE L+ GV KF  +AE +  +PW+KIL+ G  VF + RT  DLKDKWR++ +
Subjt:  RCRRKYYHPCPETPQLRRKKLPWTAEEEETLREGVRKFSSSAERSPTIPWKKILDFGSTVFLKGRTSVDLKDKWRNLCR


Sequences
CDS sequence
ATGGAGAATGACCCCGAATCTGCTTCAAGTTCAACCCTTTCTTGGCGTTGGGCCATCGAAGCCCTTGCAAGTACCAAGGAAGTGAATACATATCTTTTACATGATGTAAT
TGATACGGCTGAAGAATTACCTGAGGATACAAGACAGAATGCGGGGGAAATGGTTGCTTTGAAATGCTTGGAGGGTTTGCTTGATCCTTTAAATCATACTAGAGGACATG
GTCCGCCTGCCCAATATTCAAAAGTTACGTTTGATTCATCTGAGAGCTGTGAAGGTGTTATTAAACGCATATATGAGGAGACTCCACAATCTGCCTTAGGAGTGGCTGGA
CCAGATTTGGTAAAATGGGATGTTATTCCTTTTGTTGCACAAAAAAGAGCATCCATGCATTGTACATTACAGCAGATGAAAGATACACTTCTTGATGGTACACATCCATA
TGTTGATTTCTTAAAGCAGAAGAGTGGGTTGGCATCTGTAAATAAGAGGGATAGCATTTCTCTAAATAATGATGATCGTATTGAGCTCAGCAAGAGACTTGATAGAAGCT
CCTCTGATCCTCAAAGTCAAAAAGAAAAAAGCAAAGGAAGCCCTTTACTTCATGAGGATGAAAGAGGACTATCAGTGGCAGACCCATCTAGTTTTAGTTTGTTACCTTCT
AAAAGGAGTAGAGTTGACTTTACATCTGAAGATGAGGCAAGACAGTGGCCTGCTTGTGATGATGGCTACACGAATGTTAAAATGCTTAAGCAGCATTCTGCACATACTTT
GTATTCAGGACAGGAAGGGGCTTCTTCACATGGAACGGAGTTGGTAGAAGATTCACCTGAAAGAGTTGTGCCACAAAATGAGGGAGATGAAACCGATCTCTTGGACGAAC
CTCGGATGACTTTGGTGGAAGACAAACTTGTAGAGGAGGAGCATTTTGGGTCAAAGAGGTCTGGACAGTGTACTGCTACTGATGAATCGCACCAGGGTGAATCAGGTATT
CCTTGTTATACAATGCCGGCTCATACACAAGATGGTGAAATGCATGAAGTTATTTTTGTCGAGAAAGTGAAAGATAGAAGTGAACTGCCTTTTGAACGAACAGAATCTAC
TACTTCTCCTGCTGAAGGAAACCCGCATAACACCAGCACTGATGGTTCCAAGTGTGACTTTGGGCATGATTATCATGTACAAGCAATGAATACTATGTCTCATAGTGGAT
TTCTGCCAAAGACTGTTGCTACCAACATTGATGTTGGCATGAATCCTGATGAGGAAGAGAAAGACATGTTAAGTGATAGTGATGGATATCATAATGAAAGGATAGATATT
GCCATGAAAAAAAATGAATTCTTTAGTTCTCAATGTATGGTCGATCATGATTCCTTACTATTAGCTGACAGGAGGGAGCTAACTCTTTGTGTAAAATGTAATGAAGGTGG
TCAGTTGTTGTCTTGTAACATTAGTGATTGTCCTTTGGTGGTTCATGATCAGTGCTTGGGTTCCTCAGCTAGGATGAATGATGAAGGTGATTTTTATTGTCCTTTCTGCT
TATATTCACTTGCTATATCAGAATACCTTGAAGCTAAGAAGAAAGCTGCATCAGCAAAGAAAAATGTTACTACTTTTATTCGGATAAGTTTGGAACATCGGCCTATAGTT
ATTAAAGAGGTATTGCAACGAACAGATCTTGGCCCATCACGAAAAGCTGGGGTTCAGGATGTTGCTAAAATTTGTGAAGATGTAGACTTGGAAAATAAAGACAATCAAGA
AACTCTAAATGGAGAAAATGTAAATGAAGTTCCTGACCATCAATCCCCAAAAGTTACAGATATTGAGCGAATGATAAAGCTTTCTAAACCGTTGCATATTGCCAATTCCA
ATCATAGAGAAAATGAGGCAAGTCCTTTGAGAGTGGCACCTGATGTTTTAGCTAGAGAGAAAGATGGCGATGAATTGGTGGACCAAGAGTATCAAGGAAATATAGTAGCA
GAACTTGAAGATGGTCTAAAAACCACAGAGCAATATGACTTTTGTGAATTTCTCCACGAGGATAAGCAAGAAGGTTTACAGTACCAAACTGATGATAATGAAGAGGAACC
TGTTTATGCACTTAACATTGAAGGAGAAAAATCTTCTGATGACGAAGATGATGAGTCTATCATTTCTAGATACTCGATAAGATGTCGACGGAAATATTATCATCCATGTC
CAGAAACTCCTCAATTAAGACGGAAGAAACTACCCTGGACAGCTGAAGAGGAAGAGACACTAAGGGAGGGAGTTCGAAAATTCTCCAGTTCTGCTGAAAGAAGTCCTACC
ATACCTTGGAAAAAGATTTTAGACTTTGGTAGTACTGTGTTTCTGAAAGGTCGTACATCTGTAGATCTTAAAGATAAATGGAGGAACTTGTGCAGAAGTCCACTGCTTAA
ATGA
mRNA sequence
ATGGAGAATGACCCCGAATCTGCTTCAAGTTCAACCCTTTCTTGGCGTTGGGCCATCGAAGCCCTTGCAAGTACCAAGGAAGTGAATACATATCTTTTACATGATGTAAT
TGATACGGCTGAAGAATTACCTGAGGATACAAGACAGAATGCGGGGGAAATGGTTGCTTTGAAATGCTTGGAGGGTTTGCTTGATCCTTTAAATCATACTAGAGGACATG
GTCCGCCTGCCCAATATTCAAAAGTTACGTTTGATTCATCTGAGAGCTGTGAAGGTGTTATTAAACGCATATATGAGGAGACTCCACAATCTGCCTTAGGAGTGGCTGGA
CCAGATTTGGTAAAATGGGATGTTATTCCTTTTGTTGCACAAAAAAGAGCATCCATGCATTGTACATTACAGCAGATGAAAGATACACTTCTTGATGGTACACATCCATA
TGTTGATTTCTTAAAGCAGAAGAGTGGGTTGGCATCTGTAAATAAGAGGGATAGCATTTCTCTAAATAATGATGATCGTATTGAGCTCAGCAAGAGACTTGATAGAAGCT
CCTCTGATCCTCAAAGTCAAAAAGAAAAAAGCAAAGGAAGCCCTTTACTTCATGAGGATGAAAGAGGACTATCAGTGGCAGACCCATCTAGTTTTAGTTTGTTACCTTCT
AAAAGGAGTAGAGTTGACTTTACATCTGAAGATGAGGCAAGACAGTGGCCTGCTTGTGATGATGGCTACACGAATGTTAAAATGCTTAAGCAGCATTCTGCACATACTTT
GTATTCAGGACAGGAAGGGGCTTCTTCACATGGAACGGAGTTGGTAGAAGATTCACCTGAAAGAGTTGTGCCACAAAATGAGGGAGATGAAACCGATCTCTTGGACGAAC
CTCGGATGACTTTGGTGGAAGACAAACTTGTAGAGGAGGAGCATTTTGGGTCAAAGAGGTCTGGACAGTGTACTGCTACTGATGAATCGCACCAGGGTGAATCAGGTATT
CCTTGTTATACAATGCCGGCTCATACACAAGATGGTGAAATGCATGAAGTTATTTTTGTCGAGAAAGTGAAAGATAGAAGTGAACTGCCTTTTGAACGAACAGAATCTAC
TACTTCTCCTGCTGAAGGAAACCCGCATAACACCAGCACTGATGGTTCCAAGTGTGACTTTGGGCATGATTATCATGTACAAGCAATGAATACTATGTCTCATAGTGGAT
TTCTGCCAAAGACTGTTGCTACCAACATTGATGTTGGCATGAATCCTGATGAGGAAGAGAAAGACATGTTAAGTGATAGTGATGGATATCATAATGAAAGGATAGATATT
GCCATGAAAAAAAATGAATTCTTTAGTTCTCAATGTATGGTCGATCATGATTCCTTACTATTAGCTGACAGGAGGGAGCTAACTCTTTGTGTAAAATGTAATGAAGGTGG
TCAGTTGTTGTCTTGTAACATTAGTGATTGTCCTTTGGTGGTTCATGATCAGTGCTTGGGTTCCTCAGCTAGGATGAATGATGAAGGTGATTTTTATTGTCCTTTCTGCT
TATATTCACTTGCTATATCAGAATACCTTGAAGCTAAGAAGAAAGCTGCATCAGCAAAGAAAAATGTTACTACTTTTATTCGGATAAGTTTGGAACATCGGCCTATAGTT
ATTAAAGAGGTATTGCAACGAACAGATCTTGGCCCATCACGAAAAGCTGGGGTTCAGGATGTTGCTAAAATTTGTGAAGATGTAGACTTGGAAAATAAAGACAATCAAGA
AACTCTAAATGGAGAAAATGTAAATGAAGTTCCTGACCATCAATCCCCAAAAGTTACAGATATTGAGCGAATGATAAAGCTTTCTAAACCGTTGCATATTGCCAATTCCA
ATCATAGAGAAAATGAGGCAAGTCCTTTGAGAGTGGCACCTGATGTTTTAGCTAGAGAGAAAGATGGCGATGAATTGGTGGACCAAGAGTATCAAGGAAATATAGTAGCA
GAACTTGAAGATGGTCTAAAAACCACAGAGCAATATGACTTTTGTGAATTTCTCCACGAGGATAAGCAAGAAGGTTTACAGTACCAAACTGATGATAATGAAGAGGAACC
TGTTTATGCACTTAACATTGAAGGAGAAAAATCTTCTGATGACGAAGATGATGAGTCTATCATTTCTAGATACTCGATAAGATGTCGACGGAAATATTATCATCCATGTC
CAGAAACTCCTCAATTAAGACGGAAGAAACTACCCTGGACAGCTGAAGAGGAAGAGACACTAAGGGAGGGAGTTCGAAAATTCTCCAGTTCTGCTGAAAGAAGTCCTACC
ATACCTTGGAAAAAGATTTTAGACTTTGGTAGTACTGTGTTTCTGAAAGGTCGTACATCTGTAGATCTTAAAGATAAATGGAGGAACTTGTGCAGAAGTCCACTGCTTAA
ATGA
Protein sequence
MENDPESASSSTLSWRWAIEALASTKEVNTYLLHDVIDTAEELPEDTRQNAGEMVALKCLEGLLDPLNHTRGHGPPAQYSKVTFDSSESCEGVIKRIYEETPQSALGVAG
PDLVKWDVIPFVAQKRASMHCTLQQMKDTLLDGTHPYVDFLKQKSGLASVNKRDSISLNNDDRIELSKRLDRSSSDPQSQKEKSKGSPLLHEDERGLSVADPSSFSLLPS
KRSRVDFTSEDEARQWPACDDGYTNVKMLKQHSAHTLYSGQEGASSHGTELVEDSPERVVPQNEGDETDLLDEPRMTLVEDKLVEEEHFGSKRSGQCTATDESHQGESGI
PCYTMPAHTQDGEMHEVIFVEKVKDRSELPFERTESTTSPAEGNPHNTSTDGSKCDFGHDYHVQAMNTMSHSGFLPKTVATNIDVGMNPDEEEKDMLSDSDGYHNERIDI
AMKKNEFFSSQCMVDHDSLLLADRRELTLCVKCNEGGQLLSCNISDCPLVVHDQCLGSSARMNDEGDFYCPFCLYSLAISEYLEAKKKAASAKKNVTTFIRISLEHRPIV
IKEVLQRTDLGPSRKAGVQDVAKICEDVDLENKDNQETLNGENVNEVPDHQSPKVTDIERMIKLSKPLHIANSNHRENEASPLRVAPDVLAREKDGDELVDQEYQGNIVA
ELEDGLKTTEQYDFCEFLHEDKQEGLQYQTDDNEEEPVYALNIEGEKSSDDEDDESIISRYSIRCRRKYYHPCPETPQLRRKKLPWTAEEEETLREGVRKFSSSAERSPT
IPWKKILDFGSTVFLKGRTSVDLKDKWRNLCRSPLLK