; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr029053 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr029053
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionSET domain protein
Genome locationtig00153210:2840177..2848101
RNA-Seq ExpressionSgr029053
SyntenySgr029053
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601446.1 SET and MYND domain-containing protein, partial [Cucurbita argyrosperma subsp. sororia]3.1e-18071Show/hide
Query:  EARSKLRNFEEALKDCEEALKIESTHFKTLLCKGKILLNLNRYSAALGCFKTALLDPQVNGNSENLNGYLEKCKKLEHLSRTGAFDLSDWVLNGFRGKPP
        EARSKLRNFEEALKDC+EALKIE THFKTLLCKGKILLNLNRYS+AL CFKTAL+DPQV+G+SENLNGYLEKCKK EHLS+TGAFD+SDW+LNGF GKPP
Subjt:  EARSKLRNFEEALKDCEEALKIESTHFKTLLCKGKILLNLNRYSAALGCFKTALLDPQVNGNSENLNGYLEKCKKLEHLSRTGAFDLSDWVLNGFRGKPP

Query:  VMAEYIGAVQ---------------NVDAGTLLLVTKAIAIERGILPENCDENAQLVMWKNFIDKVTDSATKSAKTKDLIGFLSSGEAEEGLEVPEMSVF
         +AE+IG VQ               NVD+GTLLLVTKAIAIERGILPENCDENAQLVMWKNF+DKVTDSATKS KTK+LIG LS+GEAE+ L+VPEMSVF
Subjt:  VMAEYIGAVQ---------------NVDAGTLLLVTKAIAIERGILPENCDENAQLVMWKNFIDKVTDSATKSAKTKDLIGFLSSGEAEEGLEVPEMSVF

Query:  KPETVDHLSYTDDKSEMSRILSVLDINSLVEDAASAKVLGKNSEYYGVGLWVLASFINHSCRGR-------DHIRI----------------FRPAFTTG
        KPET D +  T    EMS+ILSVLDIN+LVEDAASAKVLGKNS+YYGVGLWVLASFINHSC          DHI +                F P     
Subjt:  KPETVDHLSYTDDKSEMSRILSVLDINSLVEDAASAKVLGKNSEYYGVGLWVLASFINHSCRGR-------DHIRI----------------FRPAFTTG

Query:  KAQEHVRNWGFDCNCKRCKFEEQICSKEELKEVEMG----IAGGMEIGAAVYRLEEGMKRWMVRGKEKGYLRASFWAAYSEVFCSEKAMKKWGRRIPAVE
          +     WGF+C CKRC+FEEQ+  KEE+KE+EMG       G+E GAA+Y+LEEGM+RWMVRGKEKGYLRASFW +Y E+F SEKAMKKWGRRI  +E
Subjt:  KAQEHVRNWGFDCNCKRCKFEEQICSKEELKEVEMG----IAGGMEIGAAVYRLEEGMKRWMVRGKEKGYLRASFWAAYSEVFCSEKAMKKWGRRIPAVE

Query:  TVVDSVAEAVGSDERVVKIMVERFKRN---GGVLEMERILKLGRGVYGKVMKKQAMRSLLDLGSHEYGY
         VV+SV +AVGSDERV+K MVERFKRN   GG LEMER+LKLGRGVYGKVMKKQA+RSLL+LGSHEY Y
Subjt:  TVVDSVAEAVGSDERVVKIMVERFKRN---GGVLEMERILKLGRGVYGKVMKKQAMRSLLDLGSHEYGY

KAG7032228.1 SET and MYND domain-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]5.2e-18071.22Show/hide
Query:  EARSKLRNFEEALKDCEEALKIESTHFKTLLCKGKILLNLNRYSAALGCFKTALLDPQVNGNSENLNGYLEKCKKLEHLSRTGAFDLSDWVLNGFRGKPP
        EARSKLRNFEEALKDC+EALKIE THFKTLLCKGKILLNLNRYS+AL CFKTAL+DPQV+G+SENLNGYLEKCKK EHLS+TGAFD+SDW+LNGF GKP 
Subjt:  EARSKLRNFEEALKDCEEALKIESTHFKTLLCKGKILLNLNRYSAALGCFKTALLDPQVNGNSENLNGYLEKCKKLEHLSRTGAFDLSDWVLNGFRGKPP

Query:  VMAEYIGAVQ---------------NVDAGTLLLVTKAIAIERGILPENCDENAQLVMWKNFIDKVTDSATKSAKTKDLIGFLSSGEAEEGLEVPEMSVF
         +AE+IG VQ               NVD+GTLLLVTKAIAIERGILPENCDENAQLVMWKNF+DKVTDSATKS KTK+LIG LS+GEAE+ L+VPEMSVF
Subjt:  VMAEYIGAVQ---------------NVDAGTLLLVTKAIAIERGILPENCDENAQLVMWKNFIDKVTDSATKSAKTKDLIGFLSSGEAEEGLEVPEMSVF

Query:  KPETVDHLSYTDDKSEMSRILSVLDINSLVEDAASAKVLGKNSEYYGVGLWVLASFINHSCRGR-------DHIRI----------------FRPAFTTG
        KPET D +  T    EMS+ILSVLDIN+LVEDAASAKVLGKNS+YYGVGLWVLASFINHSC          DHI +                F P     
Subjt:  KPETVDHLSYTDDKSEMSRILSVLDINSLVEDAASAKVLGKNSEYYGVGLWVLASFINHSCRGR-------DHIRI----------------FRPAFTTG

Query:  KAQEHVRNWGFDCNCKRCKFEEQICSKEELKEVEMG----IAGGMEIGAAVYRLEEGMKRWMVRGKEKGYLRASFWAAYSEVFCSEKAMKKWGRRIPAVE
          +     WGF+C CKRC+FEEQ+  KEE+KE+EMG       G+E GAA+Y+LEEGM+RWMVRGKEKGYLRASFW +Y EVF SEKAMKKWGRRI  +E
Subjt:  KAQEHVRNWGFDCNCKRCKFEEQICSKEELKEVEMG----IAGGMEIGAAVYRLEEGMKRWMVRGKEKGYLRASFWAAYSEVFCSEKAMKKWGRRIPAVE

Query:  TVVDSVAEAVGSDERVVKIMVERFKRN---GGVLEMERILKLGRGVYGKVMKKQAMRSLLDLGSHEYGY
         VV+SV +AVGSDERVVK MVERFKRN   GG LEMER+LKLGRGVYGKVMKKQA+RSLL+LGSHEY Y
Subjt:  TVVDSVAEAVGSDERVVKIMVERFKRN---GGVLEMERILKLGRGVYGKVMKKQAMRSLLDLGSHEYGY

XP_004135447.1 uncharacterized protein LOC101202892 [Cucumis sativus]6.2e-18172.16Show/hide
Query:  EARSKLRNFEEALKDCEEALKIESTHFKTLLCKGKILLNLNRYSAALGCFKTALLDPQVNGNSENLNGYLEKCKKLEHLSRTGAFDLSDWVLNGFRGKPP
        EARSKLR FEEAL+DCEEALKIESTHFKTLLCKGKILLNLNRYS+AL CFKTAL DPQV+GNSENLNGY+EKCKKLEHLS+TGAFDLSDWVLNGFRGK P
Subjt:  EARSKLRNFEEALKDCEEALKIESTHFKTLLCKGKILLNLNRYSAALGCFKTALLDPQVNGNSENLNGYLEKCKKLEHLSRTGAFDLSDWVLNGFRGKPP

Query:  VMAEYIGAVQ---------------NVDAGTLLLVTKAIAIERGILPENCDENAQLVMWKNFIDKVTDSATKSAKTKDLIGFLSSGEAEEGLEVPEMSVF
         +AE+IG +Q               NVD+GTLLLVTKAIAIERGILPENCDENAQLVMWKNFIDKVTDSATKS KTK LIG LSSGE EE LEVPEMSVF
Subjt:  VMAEYIGAVQ---------------NVDAGTLLLVTKAIAIERGILPENCDENAQLVMWKNFIDKVTDSATKSAKTKDLIGFLSSGEAEEGLEVPEMSVF

Query:  KPETVDHLSYTDDKSEMSRILSVLDINSLVEDAASAKVLGKNSEYYGVGLWVLASFINHSCRGR-------DHIRI----------------FRPAFTTG
        KPET D +S     SEMS ILSVLDINSLVEDA SAKVLGKN +YYGVGLWVL SFINHSC          DHI +                F P  +  
Subjt:  KPETVDHLSYTDDKSEMSRILSVLDINSLVEDAASAKVLGKNSEYYGVGLWVLASFINHSCRGR-------DHIRI----------------FRPAFTTG

Query:  KAQEHVRNWGFDCNCKRCKFEEQICSKEELKEVEMGIAGGMEIGAAVYRLEEGMKRWMVRGKEKGYLRASFWAAYSEVFCSEKAMKKWGRRIPAVETVVD
          +     WGF+CNCKRC+FEE+I +KEE+KE+EM + GG+E+GAA+Y+LEEGM+RW VRGKEKGYLRASFW AY E+F S+KAMKKWGRRI  +E VVD
Subjt:  KAQEHVRNWGFDCNCKRCKFEEQICSKEELKEVEMGIAGGMEIGAAVYRLEEGMKRWMVRGKEKGYLRASFWAAYSEVFCSEKAMKKWGRRIPAVETVVD

Query:  SVAEAVGSDERVVKIMVERFKR----NGGVLEMERILKLGRGVYGKVMKKQAMRSLLDLG-SHEYGY
        SV +AVGSDERVVK+MVERFKR    NGGV+EME++LKLGRGVYGKVMKKQA+R LL+LG SHEYG+
Subjt:  SVAEAVGSDERVVKIMVERFKR----NGGVLEMERILKLGRGVYGKVMKKQAMRSLLDLG-SHEYGY

XP_022151079.1 uncharacterized protein LOC111019101 [Momordica charantia]1.7e-18373.08Show/hide
Query:  EARSKLRNFEEALKDCEEALKIESTHFKTLLCKGKILLNLNRYSAALGCFKTALLDPQVNGNSENLNGYLEKCKKLEHLSRTGAFDLSDWVLNGFRGKPP
        EARSKLRNF++ALKDCEEAL I+STHFKTLLCKGKILLNLNRYSAAL CFK+AL DPQV G+SENLNGYLEKCKKLEHLSRTGAFDLSDWVLNGFRGKPP
Subjt:  EARSKLRNFEEALKDCEEALKIESTHFKTLLCKGKILLNLNRYSAALGCFKTALLDPQVNGNSENLNGYLEKCKKLEHLSRTGAFDLSDWVLNGFRGKPP

Query:  VMAEYIGAVQ---------------NVDAGTLLLVTKAIAIERGILPENCDENAQLVMWKNFIDKVTDSATKSAKTKDLIGFLSSGEAEEGLEVPEMSVF
         +AE+IGAVQ               NVDAGTL LVTKAIAIERGILPEN DENAQLVMWKNFIDKVTDSATKSAKTK+LIG LS GE EE LEVPEM  F
Subjt:  VMAEYIGAVQ---------------NVDAGTLLLVTKAIAIERGILPENCDENAQLVMWKNFIDKVTDSATKSAKTKDLIGFLSSGEAEEGLEVPEMSVF

Query:  KPETVDHLSYTDDKS-EMSRILSVLDINSLVEDAASAKVLGKNSEYYGVGLWVLASFINHSCRGR-------DHIRI----------------FRPAFTT
        KPET D  S   D+S E  RI++VLDINSLVEDAASAKVLGKN +YYGVGLWVLASFINHSC          DHI +                F P   +
Subjt:  KPETVDHLSYTDDKS-EMSRILSVLDINSLVEDAASAKVLGKNSEYYGVGLWVLASFINHSCRGR-------DHIRI----------------FRPAFTT

Query:  GKAQEHVRNWGFDCNCKRCKFEEQICSKEELKEVEMGIAGGMEIGAAVYRLEEGMKRWMVRGKEKGYLRASFWAAYSEVFCSEKAMKKWGRRIPAVETVV
           ++  + WGF+CNCKRCKFEEQI S+EE+KE+EMG+ GG E+GA VY+LEEGM+RWMVRG+EKGYLRA+FWAAY E F SEKAMKKWGRRIPA E V 
Subjt:  GKAQEHVRNWGFDCNCKRCKFEEQICSKEELKEVEMGIAGGMEIGAAVYRLEEGMKRWMVRGKEKGYLRASFWAAYSEVFCSEKAMKKWGRRIPAVETVV

Query:  DSVAEAVGSDERVVKIMVERFKR---NGGVLEMERILKLGRGVYGKVMKKQAMRSLLDL--GSHEYGY
        +SVAEAVGSDERVVK+MVERFKR   NGGV+EM+R+LKLGRG+YGKVMKKQA+RS+L+L  GSHEYGY
Subjt:  DSVAEAVGSDERVVKIMVERFKR---NGGVLEMERILKLGRGVYGKVMKKQAMRSLLDL--GSHEYGY

XP_022957539.1 uncharacterized protein LOC111458910 [Cucurbita moschata]1.4e-18071.22Show/hide
Query:  EARSKLRNFEEALKDCEEALKIESTHFKTLLCKGKILLNLNRYSAALGCFKTALLDPQVNGNSENLNGYLEKCKKLEHLSRTGAFDLSDWVLNGFRGKPP
        EARSKLRNFEEALKDC+EALKIE THFKTLLCKGKILLNLNRYS+AL CFKTAL+DPQV+G+SENLNGYLEKCKK EHLS+TGAFD+SDW+LNGF GKPP
Subjt:  EARSKLRNFEEALKDCEEALKIESTHFKTLLCKGKILLNLNRYSAALGCFKTALLDPQVNGNSENLNGYLEKCKKLEHLSRTGAFDLSDWVLNGFRGKPP

Query:  VMAEYIGAVQ---------------NVDAGTLLLVTKAIAIERGILPENCDENAQLVMWKNFIDKVTDSATKSAKTKDLIGFLSSGEAEEGLEVPEMSVF
         +AE+IG VQ               NVD+GTLLLVTKAIAIERGILPENCDENAQLVMWKNF+DKVTDSATKS KTK+LIG LS+GEAE+ L+VPEMSVF
Subjt:  VMAEYIGAVQ---------------NVDAGTLLLVTKAIAIERGILPENCDENAQLVMWKNFIDKVTDSATKSAKTKDLIGFLSSGEAEEGLEVPEMSVF

Query:  KPETVDHLSYTDDKSEMSRILSVLDINSLVEDAASAKVLGKNSEYYGVGLWVLASFINHSCRGR-------DHIRI----------------FRPAFTTG
        KPET D +  T    EMS+ILSVLDIN+LVEDAASAKVLGKNS+YYGVGLWVLASFINHSC          DHI +                F P     
Subjt:  KPETVDHLSYTDDKSEMSRILSVLDINSLVEDAASAKVLGKNSEYYGVGLWVLASFINHSCRGR-------DHIRI----------------FRPAFTTG

Query:  KAQEHVRNWGFDCNCKRCKFEEQICSKEELKEVEMG----IAGGMEIGAAVYRLEEGMKRWMVRGKEKGYLRASFWAAYSEVFCSEKAMKKWGRRIPAVE
          +     WGF+C CKRC+FEEQ+  KEE+KE+EMG       G+E GAA+Y+LEEGM+RWMVRGKEKGYLRASFW +Y EVF SEKAMKKWGRRI  +E
Subjt:  KAQEHVRNWGFDCNCKRCKFEEQICSKEELKEVEMG----IAGGMEIGAAVYRLEEGMKRWMVRGKEKGYLRASFWAAYSEVFCSEKAMKKWGRRIPAVE

Query:  TVVDSVAEAVGSDERVVKIMVERFKRN---GGVLEMERILKLGRGVYGKVMKKQAMRSLLDLGSHEYGY
         VV+SV +AVGSDERV+K MVERFKRN   GG LEMER+LKLGRGVYGKVMKKQA+RSLL+LGSHEY Y
Subjt:  TVVDSVAEAVGSDERVVKIMVERFKRN---GGVLEMERILKLGRGVYGKVMKKQAMRSLLDLGSHEYGY

TrEMBL top hitse value%identityAlignment
A0A0A0KQP3 SET domain protein3.0e-18172.16Show/hide
Query:  EARSKLRNFEEALKDCEEALKIESTHFKTLLCKGKILLNLNRYSAALGCFKTALLDPQVNGNSENLNGYLEKCKKLEHLSRTGAFDLSDWVLNGFRGKPP
        EARSKLR FEEAL+DCEEALKIESTHFKTLLCKGKILLNLNRYS+AL CFKTAL DPQV+GNSENLNGY+EKCKKLEHLS+TGAFDLSDWVLNGFRGK P
Subjt:  EARSKLRNFEEALKDCEEALKIESTHFKTLLCKGKILLNLNRYSAALGCFKTALLDPQVNGNSENLNGYLEKCKKLEHLSRTGAFDLSDWVLNGFRGKPP

Query:  VMAEYIGAVQ---------------NVDAGTLLLVTKAIAIERGILPENCDENAQLVMWKNFIDKVTDSATKSAKTKDLIGFLSSGEAEEGLEVPEMSVF
         +AE+IG +Q               NVD+GTLLLVTKAIAIERGILPENCDENAQLVMWKNFIDKVTDSATKS KTK LIG LSSGE EE LEVPEMSVF
Subjt:  VMAEYIGAVQ---------------NVDAGTLLLVTKAIAIERGILPENCDENAQLVMWKNFIDKVTDSATKSAKTKDLIGFLSSGEAEEGLEVPEMSVF

Query:  KPETVDHLSYTDDKSEMSRILSVLDINSLVEDAASAKVLGKNSEYYGVGLWVLASFINHSCRGR-------DHIRI----------------FRPAFTTG
        KPET D +S     SEMS ILSVLDINSLVEDA SAKVLGKN +YYGVGLWVL SFINHSC          DHI +                F P  +  
Subjt:  KPETVDHLSYTDDKSEMSRILSVLDINSLVEDAASAKVLGKNSEYYGVGLWVLASFINHSCRGR-------DHIRI----------------FRPAFTTG

Query:  KAQEHVRNWGFDCNCKRCKFEEQICSKEELKEVEMGIAGGMEIGAAVYRLEEGMKRWMVRGKEKGYLRASFWAAYSEVFCSEKAMKKWGRRIPAVETVVD
          +     WGF+CNCKRC+FEE+I +KEE+KE+EM + GG+E+GAA+Y+LEEGM+RW VRGKEKGYLRASFW AY E+F S+KAMKKWGRRI  +E VVD
Subjt:  KAQEHVRNWGFDCNCKRCKFEEQICSKEELKEVEMGIAGGMEIGAAVYRLEEGMKRWMVRGKEKGYLRASFWAAYSEVFCSEKAMKKWGRRIPAVETVVD

Query:  SVAEAVGSDERVVKIMVERFKR----NGGVLEMERILKLGRGVYGKVMKKQAMRSLLDLG-SHEYGY
        SV +AVGSDERVVK+MVERFKR    NGGV+EME++LKLGRGVYGKVMKKQA+R LL+LG SHEYG+
Subjt:  SVAEAVGSDERVVKIMVERFKR----NGGVLEMERILKLGRGVYGKVMKKQAMRSLLDLG-SHEYGY

A0A5A7SYT1 SET domain-containing family protein6.3e-17971.8Show/hide
Query:  EARSKLRNFEEALKDCEEALKIESTHFKTLLCKGKILLNLNRYSAALGCFKTALLDPQVNGNSENLNGYLEKCKKLEHLSRTGAFDLSDWVLNGFRGKPP
        EARSKLR FEEAL+DCEEALKIESTHFKTLLCKGKILLNLNRYS+AL CFKTAL DPQV+GNSENLNGY+EKCKKLEHLS+TGAFDLSDWVLNGFRGK P
Subjt:  EARSKLRNFEEALKDCEEALKIESTHFKTLLCKGKILLNLNRYSAALGCFKTALLDPQVNGNSENLNGYLEKCKKLEHLSRTGAFDLSDWVLNGFRGKPP

Query:  VMAEYIGAVQ---------------NVDAGTLLLVTKAIAIERGILPENCDENAQLVMWKNFIDKVTDSATKSAKTKDLIGFLSSGEAEEGLEVPEMSVF
         +AE+IG +Q               NVD+GTLLLVT+AIAIERGILPENCDENAQLVMWKNFIDKVTDS+TKS KTK+LIG LSSGEAEE LEVPEMS+F
Subjt:  VMAEYIGAVQ---------------NVDAGTLLLVTKAIAIERGILPENCDENAQLVMWKNFIDKVTDSATKSAKTKDLIGFLSSGEAEEGLEVPEMSVF

Query:  KPETVDHLSYTDDKSEMSRILSVLDINSLVEDAASAKVLGKNSEYYGVGLWVLASFINHSCRGR-------DHIRI----------------FRPAFTTG
        KP   DH+S     SEMS ILSVLDINSLVEDA SAKVLGKN +YYGVGLW+L SFINHSC          DHI +                F P  +  
Subjt:  KPETVDHLSYTDDKSEMSRILSVLDINSLVEDAASAKVLGKNSEYYGVGLWVLASFINHSCRGR-------DHIRI----------------FRPAFTTG

Query:  KAQEHVRNWGFDCNCKRCKFEEQICSKEELKEVEMGIAGGMEIGAAVYRLEEGMKRWMVRGKEKGYLRASFWAAYSEVFCSEKAMKKWGRRIPAVETVVD
          +     WGF+CNCKRC+FEE+I +KEE+KE+EMG+ GG+E+GAA+Y+LEEGM+RWMVRGKEKGYLRASFW AY E+F SEKAMKKWGRRI  +E VVD
Subjt:  KAQEHVRNWGFDCNCKRCKFEEQICSKEELKEVEMGIAGGMEIGAAVYRLEEGMKRWMVRGKEKGYLRASFWAAYSEVFCSEKAMKKWGRRIPAVETVVD

Query:  SVAEAVGSDERVVKIMVERFKR-----NGGVLEMERILKLGRGVYGKVMKKQAMRSLLDLG
        SV +AVGSDERVVK+MVERFKR     NGGV+EME++LKLGRGVYGKVMKKQA+R LL+LG
Subjt:  SVAEAVGSDERVVKIMVERFKR-----NGGVLEMERILKLGRGVYGKVMKKQAMRSLLDLG

A0A6J1DBY2 uncharacterized protein LOC1110191018.4e-18473.08Show/hide
Query:  EARSKLRNFEEALKDCEEALKIESTHFKTLLCKGKILLNLNRYSAALGCFKTALLDPQVNGNSENLNGYLEKCKKLEHLSRTGAFDLSDWVLNGFRGKPP
        EARSKLRNF++ALKDCEEAL I+STHFKTLLCKGKILLNLNRYSAAL CFK+AL DPQV G+SENLNGYLEKCKKLEHLSRTGAFDLSDWVLNGFRGKPP
Subjt:  EARSKLRNFEEALKDCEEALKIESTHFKTLLCKGKILLNLNRYSAALGCFKTALLDPQVNGNSENLNGYLEKCKKLEHLSRTGAFDLSDWVLNGFRGKPP

Query:  VMAEYIGAVQ---------------NVDAGTLLLVTKAIAIERGILPENCDENAQLVMWKNFIDKVTDSATKSAKTKDLIGFLSSGEAEEGLEVPEMSVF
         +AE+IGAVQ               NVDAGTL LVTKAIAIERGILPEN DENAQLVMWKNFIDKVTDSATKSAKTK+LIG LS GE EE LEVPEM  F
Subjt:  VMAEYIGAVQ---------------NVDAGTLLLVTKAIAIERGILPENCDENAQLVMWKNFIDKVTDSATKSAKTKDLIGFLSSGEAEEGLEVPEMSVF

Query:  KPETVDHLSYTDDKS-EMSRILSVLDINSLVEDAASAKVLGKNSEYYGVGLWVLASFINHSCRGR-------DHIRI----------------FRPAFTT
        KPET D  S   D+S E  RI++VLDINSLVEDAASAKVLGKN +YYGVGLWVLASFINHSC          DHI +                F P   +
Subjt:  KPETVDHLSYTDDKS-EMSRILSVLDINSLVEDAASAKVLGKNSEYYGVGLWVLASFINHSCRGR-------DHIRI----------------FRPAFTT

Query:  GKAQEHVRNWGFDCNCKRCKFEEQICSKEELKEVEMGIAGGMEIGAAVYRLEEGMKRWMVRGKEKGYLRASFWAAYSEVFCSEKAMKKWGRRIPAVETVV
           ++  + WGF+CNCKRCKFEEQI S+EE+KE+EMG+ GG E+GA VY+LEEGM+RWMVRG+EKGYLRA+FWAAY E F SEKAMKKWGRRIPA E V 
Subjt:  GKAQEHVRNWGFDCNCKRCKFEEQICSKEELKEVEMGIAGGMEIGAAVYRLEEGMKRWMVRGKEKGYLRASFWAAYSEVFCSEKAMKKWGRRIPAVETVV

Query:  DSVAEAVGSDERVVKIMVERFKR---NGGVLEMERILKLGRGVYGKVMKKQAMRSLLDL--GSHEYGY
        +SVAEAVGSDERVVK+MVERFKR   NGGV+EM+R+LKLGRG+YGKVMKKQA+RS+L+L  GSHEYGY
Subjt:  DSVAEAVGSDERVVKIMVERFKR---NGGVLEMERILKLGRGVYGKVMKKQAMRSLLDL--GSHEYGY

A0A6J1H283 uncharacterized protein LOC1114589106.7e-18171.22Show/hide
Query:  EARSKLRNFEEALKDCEEALKIESTHFKTLLCKGKILLNLNRYSAALGCFKTALLDPQVNGNSENLNGYLEKCKKLEHLSRTGAFDLSDWVLNGFRGKPP
        EARSKLRNFEEALKDC+EALKIE THFKTLLCKGKILLNLNRYS+AL CFKTAL+DPQV+G+SENLNGYLEKCKK EHLS+TGAFD+SDW+LNGF GKPP
Subjt:  EARSKLRNFEEALKDCEEALKIESTHFKTLLCKGKILLNLNRYSAALGCFKTALLDPQVNGNSENLNGYLEKCKKLEHLSRTGAFDLSDWVLNGFRGKPP

Query:  VMAEYIGAVQ---------------NVDAGTLLLVTKAIAIERGILPENCDENAQLVMWKNFIDKVTDSATKSAKTKDLIGFLSSGEAEEGLEVPEMSVF
         +AE+IG VQ               NVD+GTLLLVTKAIAIERGILPENCDENAQLVMWKNF+DKVTDSATKS KTK+LIG LS+GEAE+ L+VPEMSVF
Subjt:  VMAEYIGAVQ---------------NVDAGTLLLVTKAIAIERGILPENCDENAQLVMWKNFIDKVTDSATKSAKTKDLIGFLSSGEAEEGLEVPEMSVF

Query:  KPETVDHLSYTDDKSEMSRILSVLDINSLVEDAASAKVLGKNSEYYGVGLWVLASFINHSCRGR-------DHIRI----------------FRPAFTTG
        KPET D +  T    EMS+ILSVLDIN+LVEDAASAKVLGKNS+YYGVGLWVLASFINHSC          DHI +                F P     
Subjt:  KPETVDHLSYTDDKSEMSRILSVLDINSLVEDAASAKVLGKNSEYYGVGLWVLASFINHSCRGR-------DHIRI----------------FRPAFTTG

Query:  KAQEHVRNWGFDCNCKRCKFEEQICSKEELKEVEMG----IAGGMEIGAAVYRLEEGMKRWMVRGKEKGYLRASFWAAYSEVFCSEKAMKKWGRRIPAVE
          +     WGF+C CKRC+FEEQ+  KEE+KE+EMG       G+E GAA+Y+LEEGM+RWMVRGKEKGYLRASFW +Y EVF SEKAMKKWGRRI  +E
Subjt:  KAQEHVRNWGFDCNCKRCKFEEQICSKEELKEVEMG----IAGGMEIGAAVYRLEEGMKRWMVRGKEKGYLRASFWAAYSEVFCSEKAMKKWGRRIPAVE

Query:  TVVDSVAEAVGSDERVVKIMVERFKRN---GGVLEMERILKLGRGVYGKVMKKQAMRSLLDLGSHEYGY
         VV+SV +AVGSDERV+K MVERFKRN   GG LEMER+LKLGRGVYGKVMKKQA+RSLL+LGSHEY Y
Subjt:  TVVDSVAEAVGSDERVVKIMVERFKRN---GGVLEMERILKLGRGVYGKVMKKQAMRSLLDLGSHEYGY

A0A6J1K403 uncharacterized protein LOC1114914613.7e-17971.06Show/hide
Query:  EARSKLRNFEEALKDCEEALKIESTHFKTLLCKGKILLNLNRYSAALGCFKTALLDPQVNGNSENLNGYLEKCKKLEHLSRTGAFDLSDWVLNGFRGKPP
        EARSKLRNFEEALKDC+EALKIE THFKTLLCKGKILLNLNRYS+AL CFKTAL+DPQV+G+SENLNGYLEKCK  +HLS+TGAFDLSDW+LNGF GKPP
Subjt:  EARSKLRNFEEALKDCEEALKIESTHFKTLLCKGKILLNLNRYSAALGCFKTALLDPQVNGNSENLNGYLEKCKKLEHLSRTGAFDLSDWVLNGFRGKPP

Query:  VMAEYIGAVQ---------------NVDAGTLLLVTKAIAIERGILPENCDENAQLVMWKNFIDKVTDSATKSAKTKDLIGFLSSGEAEEGLEVPEMSVF
         +AE+IG VQ               NVD+GTLLLVTKAIAIERGILPENCDENAQLVMWKNF+DKVTDSATKS KTK+LIG LS+GEAE+ L+VPEMSVF
Subjt:  VMAEYIGAVQ---------------NVDAGTLLLVTKAIAIERGILPENCDENAQLVMWKNFIDKVTDSATKSAKTKDLIGFLSSGEAEEGLEVPEMSVF

Query:  KPETVDHLSYTDDKSEMSRILSVLDINSLVEDAASAKVLGKNSEYYGVGLWVLASFINHSCRGR-------DHIRI----------------FRPAFTTG
        KPET D +  T    EMS+ILSVLDIN+LVEDAASAKVLGKNS+YYGVGLWVLASFINHSC          DHI +                F P     
Subjt:  KPETVDHLSYTDDKSEMSRILSVLDINSLVEDAASAKVLGKNSEYYGVGLWVLASFINHSCRGR-------DHIRI----------------FRPAFTTG

Query:  KAQEHVRNWGFDCNCKRCKFEEQICSKEELKEVEMGIAG----GMEIGAAVYRLEEGMKRWMVRGKEKGYLRASFWAAYSEVFCSEKAMKKWGRRIPAVE
          +     WGF+C CKRC+FEEQ+  KEE+KE+EMG       G+E GAA+Y+LEEGM+RWMVRGKEKGYLRASFW +Y EVF SEKAMKKWGRRI  +E
Subjt:  KAQEHVRNWGFDCNCKRCKFEEQICSKEELKEVEMGIAG----GMEIGAAVYRLEEGMKRWMVRGKEKGYLRASFWAAYSEVFCSEKAMKKWGRRIPAVE

Query:  TVVDSVAEAVGSDERVVKIMVERFKRN---GGVLEMERILKLGRGVYGKVMKKQAMRSLLDL-GSHEYGY
         VV+SV +AVGSDERVVK MVERFKRN   GG LEMER+LKLGRGVYGKVMKKQA+RSLL+L GSHEY Y
Subjt:  TVVDSVAEAVGSDERVVKIMVERFKRN---GGVLEMERILKLGRGVYGKVMKKQAMRSLLDL-GSHEYGY

SwissProt top hitse value%identityAlignment
Q80W98 Small glutamine-rich tetratricopeptide repeat-containing protein beta4.0e-0531.91Show/hide
Query:  ARSKLRNFEEALKDCEEALKIESTHFKTLLCKGKILLNLNRYSAALGCFKTAL-LDPQVNGNSENLNGYLEKCKKLEHLSRTG-AFDLSDWVLN
        A+SKL ++ +A+KDCE+A+ I+S + K     G  L  +N++  A+  ++ AL LDP+ +    NL    +K +++   + TG  FD++  + N
Subjt:  ARSKLRNFEEALKDCEEALKIESTHFKTLLCKGKILLNLNRYSAALGCFKTAL-LDPQVNGNSENLNGYLEKCKKLEHLSRTG-AFDLSDWVLN

Q8VD33 Small glutamine-rich tetratricopeptide repeat-containing protein beta3.0e-0531.91Show/hide
Query:  ARSKLRNFEEALKDCEEALKIESTHFKTLLCKGKILLNLNRYSAALGCFKTAL-LDPQVNGNSENLNGYLEKCKKLEHLSRTG-AFDLSDWVLN
        A+SKL ++ +A+KDCE+A+ I+S + K     G  L  +N++  A+  ++ AL LDP+ +    NL    +K +++   + TG +FD++  + N
Subjt:  ARSKLRNFEEALKDCEEALKIESTHFKTLLCKGKILLNLNRYSAALGCFKTAL-LDPQVNGNSENLNGYLEKCKKLEHLSRTG-AFDLSDWVLN

Q96EQ0 Small glutamine-rich tetratricopeptide repeat-containing protein beta1.0e-0529.92Show/hide
Query:  ARSKLRNFEEALKDCEEALKIESTHFKTLLCKGKILLNLNRYSAALGCFKTAL-LDPQVNGNSENLNGYLEKCKKLEHLSRTG-AFDLSDWVLNGFRGKP
        A+SKL ++ +A+KDCE+A+ I+S + K     G  L  LN++  A+  ++ AL LDP+ +    NL    +K +++   + TG +FD++  + N     P
Subjt:  ARSKLRNFEEALKDCEEALKIESTHFKTLLCKGKILLNLNRYSAALGCFKTAL-LDPQVNGNSENLNGYLEKCKKLEHLSRTG-AFDLSDWVLNGFRGKP

Query:  PVMAEYIGAVQNVDAGTLL--LVTKAI
          ++     +QN     L+  ++T AI
Subjt:  PVMAEYIGAVQNVDAGTLL--LVTKAI

Arabidopsis top hitse value%identityAlignment
AT1G26750.1 unknown protein3.1e-4555.09Show/hide
Query:  MSFMKGDLLTRTRKLVKGLAKAEPVWLKAMEQAPPPTFPRADGTVKTISLPEDVYVKKFFQKHPDSKYEDAIKFCSFDPPPARIFGLRVLELKEQGVSEE
        MS+MKGDLL++TR+LV GLA  EPVWLKAME +PPP FPR++G ++ I LPED YV+KF  KHP +K +D  K  +F P  AR++G RVLELKE G+SE 
Subjt:  MSFMKGDLLTRTRKLVKGLAKAEPVWLKAMEQAPPPTFPRADGTVKTISLPEDVYVKKFFQKHPDSKYEDAIKFCSFDPPPARIFGLRVLELKEQGVSEE

Query:  EAMAVANMEYRAEKKAKKKAYSRLKQIARLQGKRPPPNPYPVPSRRYKLRKGNSSVIVSSTPRLKRL
        +AM+VANMEY +E+K  KKAY RLK++A +Q K PPP PYP   +    +   S+     TP ++RL
Subjt:  EAMAVANMEYRAEKKAKKKAYSRLKQIARLQGKRPPPNPYPVPSRRYKLRKGNSSVIVSSTPRLKRL

AT1G26760.1 SET domain protein 351.1e-13051.4Show/hide
Query:  EARSKLRNFEEALKDCEEALKIESTHFKTLLCKGKILLNLNRYSAALGCFKTALLDPQVNGNSENLNGYLEKCKKLEHLSRTGAFDLSDWVLNGFRGKPP
        EAR++LR+F EA++DC++AL+IE THFKTLLCKGK+LL L++YS AL CFKTALLDPQ + N E +  Y+EKCKKLE  ++TGAFDLSDW+L+ FRGK P
Subjt:  EARSKLRNFEEALKDCEEALKIESTHFKTLLCKGKILLNLNRYSAALGCFKTALLDPQVNGNSENLNGYLEKCKKLEHLSRTGAFDLSDWVLNGFRGKPP

Query:  VMAEYIGAVQ---------------NVDAGTLLLVTKAIAIERGILPE-NCDENAQLVMWKNFIDKVTDSATKSAKTKDLIGFLSSGEAEEGLEVPEMSV
         +AE+IG+++               N+ AGTL+LVTKA+AIERGIL    C E AQL+MWKNF+++VT+S  K  +T+ ++  LS+G+ E+ LE+PE+++
Subjt:  VMAEYIGAVQ---------------NVDAGTLLLVTKAIAIERGILPE-NCDENAQLVMWKNFIDKVTDSATKSAKTKDLIGFLSSGEAEEGLEVPEMSV

Query:  FKP-ETVDHLSYTDDKSEMSRILSVLDINSLVEDAASAKVLGKNSEYYGVGLWVLASFINHSC----------------RGRD-------HIRIFRPAFT
        F+P E  +         +  ++LS+LD+NSLVEDA S KV+GKN EYYGVGLW LASFINHSC                  RD           F     
Subjt:  FKP-ETVDHLSYTDDKSEMSRILSVLDINSLVEDAASAKVLGKNSEYYGVGLWVLASFINHSC----------------RGRD-------HIRIFRPAFT

Query:  TGKAQEHVRNWGFDCNCKRCKFEEQI-CSKEELKEVEMGIAGGMEIGAAVYRLEEGMKRWMVRGKEKGYLRASFWAAYSEVFCSEKAMKKWGRRIPAVET
          K +E   +WGF C C RCKFE  +  + +E++E EMG+  G++ G AVY +EEGMKRW V+GK+KG LRAS+W  Y E++ SE+ MK+WGR+IP +E 
Subjt:  TGKAQEHVRNWGFDCNCKRCKFEEQI-CSKEELKEVEMGIAGGMEIGAAVYRLEEGMKRWMVRGKEKGYLRASFWAAYSEVFCSEKAMKKWGRRIPAVET

Query:  VVDSVAEAVGSDERVVKIMVE-RFKRNGG---VLEMERILKLGRGVYGKVM-KKQAMRSLLDL
        VVDSV++ VGSDER++K+ VE   K++GG   ++EME+I+KLG+GVYGKV+ KK+AM++LL +
Subjt:  VVDSVAEAVGSDERVVKIMVE-RFKRNGG---VLEMERILKLGRGVYGKVM-KKQAMRSLLDL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGGGCGGCTCGGTTTTGCGTTTTTCCCTCTCCATCCAGTGGGTTTCCATTTTATTGTATCCAGCAGATCATTGGACGGAATCGAAGTCCCAAAAAGGAGGAGCAAT
GTCCTTCATGAAAGGAGATTTGCTAACTAGGACTAGAAAGCTCGTCAAGGGGTTGGCCAAAGCAGAACCTGTTTGGCTCAAAGCCATGGAGCAGGCACCACCTCCAACAT
TTCCTCGAGCAGATGGAACAGTTAAGACAATCAGTCTTCCTGAGGATGTATATGTAAAAAAGTTCTTCCAGAAACATCCAGATTCAAAATATGAAGATGCCATCAAGTTT
TGTAGTTTTGATCCTCCTCCAGCCCGAATATTTGGCTTGCGGGTGCTTGAACTGAAGGAACAGGGCGTTAGTGAGGAGGAAGCCATGGCAGTAGCCAATATGGAATACCG
GGCAGAGAAAAAGGCAAAAAAGAAGGCTTACTCTCGCTTGAAGCAAATTGCACGTCTTCAAGGGAAGAGACCTCCTCCTAACCCTTATCCAGTGCCATCAAGGAGATACA
AGCTGAGGAAAGGAAATTCGTCCGTGATCGTTTCTTCAACCCCAAGATTAAAGAGATTGCACAAAGCTGGACAACTCAGTGCAGTTGCAGACAGACCAGAAATTACAGAA
GCAAAACAGGCTAATTCTCCCATGTTAACCATAAAAGCTCTTTGGACATTTGGGACAACTCCTTTCCAAAGCCCAATTATCCCTTCTGCACGCACTATCTTGCCCAAGGC
ATCGAGAGGTCCTGAGTATCTGGGTTGGAGACCTTGGCTTGTGAGACGGCCATCAGCTTGCATCCTCACCTTAACAAGATCAGCAGGACTTGCCACTACCTGTGGTACTA
ATGGTAGACACAAAAATATTGATAATTTAGCAAAGGGAGTAGGGAACCTTAGTACAAAACATAATGTGGAAAATTCGAACCTGAGCGATAGAGCCGGATATTCCACCGAC
AAGAGCCTTGGCATGGAGGGAAAGAGAGGCGCTATCGGAGGCAAGGAAGAGGGATCCCACTCTGAGCGCGTTTGTGGAGCTAGAGGAGGAGGACGACTCGCCGTGGAGCT
GAAGCCTTGTCTTGGTCAGGTCTATCGGGAAGGTCGAGGTCTCCGCCACCATGGCGGACATCGAGATCCAGAGCCACAGAGCTCCTCCTCAGAGAAGAATGGGACGACGC
CGTTCATGCCTACTCTCAGTTCATCAGCCTCTGCCGGAACCAAACGGCGGCCATTCGTAAACTCTCAGATGCAGATCACCACCTCCCAAGCTTCAGAAATCTCTCTGTTT
AGCCCTCTCCAACCGGGCGAAGCGCGATCCAAACTCAGAAACTTCGAGGAAGCTTTAAAAGACTGTGAAGAAGCTTTGAAGATCGAGAGCACCCACTTCAAAACTCTGCT
CTGTAAAGGTAAGATTTTGCTGAATCTCAACAGGTACTCTGCGGCATTGGGGTGCTTCAAAACAGCTCTGCTTGACCCACAGGTTAATGGGAATTCTGAAAACCTTAATG
GGTATCTCGAGAAGTGCAAAAAACTCGAACATTTGTCAAGGACTGGAGCTTTTGATCTCTCTGACTGGGTTCTAAATGGGTTTCGTGGAAAACCTCCAGTAATGGCCGAA
TACATCGGCGCAGTGCAGAATGTAGATGCTGGGACTCTCTTACTAGTCACCAAAGCCATTGCTATCGAAAGAGGCATTTTGCCAGAAAATTGCGACGAAAACGCTCAGTT
GGTTATGTGGAAGAATTTCATCGATAAAGTCACAGATTCTGCAACGAAGAGTGCCAAAACAAAGGATCTGATCGGTTTTCTCTCGAGCGGTGAAGCCGAGGAGGGGCTCG
AGGTTCCTGAAATGAGTGTCTTCAAGCCAGAGACAGTGGATCATCTCAGTTACACAGATGATAAATCTGAAATGAGCAGAATCCTTAGCGTTTTGGACATCAATTCCCTC
GTTGAAGATGCAGCTTCTGCGAAAGTTCTAGGGAAGAACAGCGAGTACTATGGAGTTGGGCTATGGGTGCTGGCGTCGTTCATCAACCATTCTTGCAGGGGAAGAGATCA
CATTCGCATATTTCGACCCGCTTTCACAACTGGGAAAGCGCAAGAACATGTCAGAAACTGGGGCTTCGACTGCAACTGCAAACGATGCAAATTCGAAGAACAAATCTGCA
GCAAAGAAGAGCTAAAAGAGGTCGAGATGGGAATTGCAGGAGGCATGGAGATCGGCGCTGCAGTTTACAGGCTTGAAGAAGGAATGAAGAGATGGATGGTGAGAGGAAAA
GAGAAGGGTTATTTGAGGGCGTCGTTTTGGGCGGCATATTCTGAAGTGTTTTGTTCAGAGAAGGCGATGAAGAAGTGGGGAAGACGAATTCCGGCGGTAGAGACAGTGGT
GGATAGCGTTGCAGAAGCAGTGGGGAGCGATGAGAGAGTGGTGAAGATAATGGTGGAAAGGTTTAAGAGAAATGGTGGGGTTTTGGAAATGGAGAGGATTTTGAAGTTGG
GAAGAGGGGTTTATGGGAAGGTGATGAAGAAACAGGCAATGAGGTCTCTTCTTGACCTTGGCAGCCATGAATATGGCTACTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGGGCGGCTCGGTTTTGCGTTTTTCCCTCTCCATCCAGTGGGTTTCCATTTTATTGTATCCAGCAGATCATTGGACGGAATCGAAGTCCCAAAAAGGAGGAGCAAT
GTCCTTCATGAAAGGAGATTTGCTAACTAGGACTAGAAAGCTCGTCAAGGGGTTGGCCAAAGCAGAACCTGTTTGGCTCAAAGCCATGGAGCAGGCACCACCTCCAACAT
TTCCTCGAGCAGATGGAACAGTTAAGACAATCAGTCTTCCTGAGGATGTATATGTAAAAAAGTTCTTCCAGAAACATCCAGATTCAAAATATGAAGATGCCATCAAGTTT
TGTAGTTTTGATCCTCCTCCAGCCCGAATATTTGGCTTGCGGGTGCTTGAACTGAAGGAACAGGGCGTTAGTGAGGAGGAAGCCATGGCAGTAGCCAATATGGAATACCG
GGCAGAGAAAAAGGCAAAAAAGAAGGCTTACTCTCGCTTGAAGCAAATTGCACGTCTTCAAGGGAAGAGACCTCCTCCTAACCCTTATCCAGTGCCATCAAGGAGATACA
AGCTGAGGAAAGGAAATTCGTCCGTGATCGTTTCTTCAACCCCAAGATTAAAGAGATTGCACAAAGCTGGACAACTCAGTGCAGTTGCAGACAGACCAGAAATTACAGAA
GCAAAACAGGCTAATTCTCCCATGTTAACCATAAAAGCTCTTTGGACATTTGGGACAACTCCTTTCCAAAGCCCAATTATCCCTTCTGCACGCACTATCTTGCCCAAGGC
ATCGAGAGGTCCTGAGTATCTGGGTTGGAGACCTTGGCTTGTGAGACGGCCATCAGCTTGCATCCTCACCTTAACAAGATCAGCAGGACTTGCCACTACCTGTGGTACTA
ATGGTAGACACAAAAATATTGATAATTTAGCAAAGGGAGTAGGGAACCTTAGTACAAAACATAATGTGGAAAATTCGAACCTGAGCGATAGAGCCGGATATTCCACCGAC
AAGAGCCTTGGCATGGAGGGAAAGAGAGGCGCTATCGGAGGCAAGGAAGAGGGATCCCACTCTGAGCGCGTTTGTGGAGCTAGAGGAGGAGGACGACTCGCCGTGGAGCT
GAAGCCTTGTCTTGGTCAGGTCTATCGGGAAGGTCGAGGTCTCCGCCACCATGGCGGACATCGAGATCCAGAGCCACAGAGCTCCTCCTCAGAGAAGAATGGGACGACGC
CGTTCATGCCTACTCTCAGTTCATCAGCCTCTGCCGGAACCAAACGGCGGCCATTCGTAAACTCTCAGATGCAGATCACCACCTCCCAAGCTTCAGAAATCTCTCTGTTT
AGCCCTCTCCAACCGGGCGAAGCGCGATCCAAACTCAGAAACTTCGAGGAAGCTTTAAAAGACTGTGAAGAAGCTTTGAAGATCGAGAGCACCCACTTCAAAACTCTGCT
CTGTAAAGGTAAGATTTTGCTGAATCTCAACAGGTACTCTGCGGCATTGGGGTGCTTCAAAACAGCTCTGCTTGACCCACAGGTTAATGGGAATTCTGAAAACCTTAATG
GGTATCTCGAGAAGTGCAAAAAACTCGAACATTTGTCAAGGACTGGAGCTTTTGATCTCTCTGACTGGGTTCTAAATGGGTTTCGTGGAAAACCTCCAGTAATGGCCGAA
TACATCGGCGCAGTGCAGAATGTAGATGCTGGGACTCTCTTACTAGTCACCAAAGCCATTGCTATCGAAAGAGGCATTTTGCCAGAAAATTGCGACGAAAACGCTCAGTT
GGTTATGTGGAAGAATTTCATCGATAAAGTCACAGATTCTGCAACGAAGAGTGCCAAAACAAAGGATCTGATCGGTTTTCTCTCGAGCGGTGAAGCCGAGGAGGGGCTCG
AGGTTCCTGAAATGAGTGTCTTCAAGCCAGAGACAGTGGATCATCTCAGTTACACAGATGATAAATCTGAAATGAGCAGAATCCTTAGCGTTTTGGACATCAATTCCCTC
GTTGAAGATGCAGCTTCTGCGAAAGTTCTAGGGAAGAACAGCGAGTACTATGGAGTTGGGCTATGGGTGCTGGCGTCGTTCATCAACCATTCTTGCAGGGGAAGAGATCA
CATTCGCATATTTCGACCCGCTTTCACAACTGGGAAAGCGCAAGAACATGTCAGAAACTGGGGCTTCGACTGCAACTGCAAACGATGCAAATTCGAAGAACAAATCTGCA
GCAAAGAAGAGCTAAAAGAGGTCGAGATGGGAATTGCAGGAGGCATGGAGATCGGCGCTGCAGTTTACAGGCTTGAAGAAGGAATGAAGAGATGGATGGTGAGAGGAAAA
GAGAAGGGTTATTTGAGGGCGTCGTTTTGGGCGGCATATTCTGAAGTGTTTTGTTCAGAGAAGGCGATGAAGAAGTGGGGAAGACGAATTCCGGCGGTAGAGACAGTGGT
GGATAGCGTTGCAGAAGCAGTGGGGAGCGATGAGAGAGTGGTGAAGATAATGGTGGAAAGGTTTAAGAGAAATGGTGGGGTTTTGGAAATGGAGAGGATTTTGAAGTTGG
GAAGAGGGGTTTATGGGAAGGTGATGAAGAAACAGGCAATGAGGTCTCTTCTTGACCTTGGCAGCCATGAATATGGCTACTAG
Protein sequenceShow/hide protein sequence
MSGGSVLRFSLSIQWVSILLYPADHWTESKSQKGGAMSFMKGDLLTRTRKLVKGLAKAEPVWLKAMEQAPPPTFPRADGTVKTISLPEDVYVKKFFQKHPDSKYEDAIKF
CSFDPPPARIFGLRVLELKEQGVSEEEAMAVANMEYRAEKKAKKKAYSRLKQIARLQGKRPPPNPYPVPSRRYKLRKGNSSVIVSSTPRLKRLHKAGQLSAVADRPEITE
AKQANSPMLTIKALWTFGTTPFQSPIIPSARTILPKASRGPEYLGWRPWLVRRPSACILTLTRSAGLATTCGTNGRHKNIDNLAKGVGNLSTKHNVENSNLSDRAGYSTD
KSLGMEGKRGAIGGKEEGSHSERVCGARGGGRLAVELKPCLGQVYREGRGLRHHGGHRDPEPQSSSSEKNGTTPFMPTLSSSASAGTKRRPFVNSQMQITTSQASEISLF
SPLQPGEARSKLRNFEEALKDCEEALKIESTHFKTLLCKGKILLNLNRYSAALGCFKTALLDPQVNGNSENLNGYLEKCKKLEHLSRTGAFDLSDWVLNGFRGKPPVMAE
YIGAVQNVDAGTLLLVTKAIAIERGILPENCDENAQLVMWKNFIDKVTDSATKSAKTKDLIGFLSSGEAEEGLEVPEMSVFKPETVDHLSYTDDKSEMSRILSVLDINSL
VEDAASAKVLGKNSEYYGVGLWVLASFINHSCRGRDHIRIFRPAFTTGKAQEHVRNWGFDCNCKRCKFEEQICSKEELKEVEMGIAGGMEIGAAVYRLEEGMKRWMVRGK
EKGYLRASFWAAYSEVFCSEKAMKKWGRRIPAVETVVDSVAEAVGSDERVVKIMVERFKRNGGVLEMERILKLGRGVYGKVMKKQAMRSLLDLGSHEYGY