; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020220 (gene) of Snake gourd v1 genome

Gene IDTan0020220
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionMAP7 domain-containing protein 1-like
Genome locationLG01:23660946..23663152
RNA-Seq ExpressionTan0020220
SyntenyTan0020220
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR008480 - Protein of unknown function DUF761, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596871.1 hypothetical protein SDJN03_10051, partial [Cucurbita argyrosperma subsp. sororia]2.0e-24184.89Show/hide
Query:  MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQ
        MAESDV      L  GQ QPTPSKFH+HILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEI+VS+FDNVQ
Subjt:  MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQ

Query:  SYVSGLLHVSSVFDDEPETPSANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESKTVSGSKPRVSSRRV
        SYVS LLHVSSVFDDEP TPSANDES+SS DENKVQTW +RYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDE KT+SGSK R+SSRR 
Subjt:  SYVSGLLHVSSVFDDEPETPSANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESKTVSGSKPRVSSRRV

Query:  LSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI-TQKLSPSPSPSPRKLSPSPTVSPE
        LSMP  SSN E+NEKVVL SPVPWRSRSER EVQEEA+N P+YSPAAPMEESES+WIDSRSSRP TSRSSR S I TQKL  SPSPSPRK SPSPTVSPE
Subjt:  LSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI-TQKLSPSPSPSPRKLSPSPTVSPE

Query:  LQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRRSLTSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGR
        LQAKSAEDLVRKKNFY SPPPPPPPPPPPTVRRISSMKP  WL  +++DV HQ DLRRSLT+KPR SIRD G+  D ++ ANSS E  PRNYVDGQSMG+
Subjt:  LQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRRSLTSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGR

Query:  SVRTIRPGEVVNEPPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVVGQFIREDNGEPFNVKRRDNERS
        SVRTIRPGEV+NEPPRRGREFGG D LKG KMEQN H QEFEENPIEFPDE KE LVEKLAME  DDME+ EEED+VVGQFIREDNGEPFNVKRRD   +
Subjt:  SVRTIRPGEVVNEPPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVVGQFIREDNGEPFNVKRRDNERS

Query:  SSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQ
        SS EEA S+NMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQ
Subjt:  SSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQ

KAG7030146.1 hypothetical protein SDJN02_08493, partial [Cucurbita argyrosperma subsp. argyrosperma]2.0e-24184.89Show/hide
Query:  MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQ
        MAESDV      L  GQ QPTPSKFH+HILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEI+VS+FDNVQ
Subjt:  MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQ

Query:  SYVSGLLHVSSVFDDEPETPSANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESKTVSGSKPRVSSRRV
        SYVS LLHVSSVFDDEP TPSANDES+SS DENKVQTW +RYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDE KT+SGSK R+SSRR 
Subjt:  SYVSGLLHVSSVFDDEPETPSANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESKTVSGSKPRVSSRRV

Query:  LSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI-TQKLSPSPSPSPRKLSPSPTVSPE
        LSMP  SSN E+NEKVVL SPVPWRSRSER EVQEEA+N P+YSPAAPMEESES+WIDSRSSRP TSRSSR S I TQKL  SPSPSPRK SPSPTVSPE
Subjt:  LSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI-TQKLSPSPSPSPRKLSPSPTVSPE

Query:  LQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRRSLTSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGR
        LQAKSAEDLVRKKNFY SPPPPPPPPPPPTVRRISSMKP  WL  +++DV HQ DLRRSLT+KPR SIRD G+  D ++ ANSS E  PRNYVDGQSMG+
Subjt:  LQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRRSLTSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGR

Query:  SVRTIRPGEVVNEPPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVVGQFIREDNGEPFNVKRRDNERS
        SVRTIRPGEV+NEPPRRGREFGG D LKG KMEQN H QEFEENPIEFPDE KE LVEKLAME  DDME+ EEED+VVGQFIREDNGEPFNVKRRD   +
Subjt:  SVRTIRPGEVVNEPPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVVGQFIREDNGEPFNVKRRDNERS

Query:  SSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQ
        SS EEA S+NMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQ
Subjt:  SSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQ

XP_022949423.1 MAP7 domain-containing protein 1-like [Cucurbita moschata]1.2e-23884.71Show/hide
Query:  MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQ
        MAESDV      L   Q QPTPSKFH+HILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEI+VS+FDNVQ
Subjt:  MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQ

Query:  SYVSGLLHVSSVFDDEPETPSANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESKTVSGSKPRVSSRRV
        SYVS LLHVSSVFDDEP TPSANDES+SS DENKVQTW +RYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDE KT+SGSK RVSSRR 
Subjt:  SYVSGLLHVSSVFDDEPETPSANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESKTVSGSKPRVSSRRV

Query:  LSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI-TQKLSPSPSPSPRKLSPSPTVSPE
        LSMP  SSN E+NEKVVL SPVPWRSRSER EVQEEA+N P+YSPAAPMEESES+WIDSRSSRP TSRSSR S I TQKL  SPSPSPRK SPSPTVSPE
Subjt:  LSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI-TQKLSPSPSPSPRKLSPSPTVSPE

Query:  LQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRRSLTSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGR
        LQAKSAEDLVRKKNFY S  PPPPPPPPPTVRRISSMKP  WL  +D+DV HQ DLRRSLT+KPR SIRD G+  D ++ ANSS E  PRNYVDGQSMG+
Subjt:  LQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRRSLTSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGR

Query:  SVRTIRPGEVVNEPPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVVGQFIREDNGEPFNVKRRDNERS
        SVRTIRPGEV+NEPPRRGREFGG D LKG KMEQN H QEFEENPIEFPDE KE LVEKLAME  DDME+ EEED+VVGQFIREDNGEPFNVKRRD   +
Subjt:  SVRTIRPGEVVNEPPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVVGQFIREDNGEPFNVKRRDNERS

Query:  SSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQ
        SS EEA S+NMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQ
Subjt:  SSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQ

XP_023005363.1 uncharacterized protein DDB_G0284459-like [Cucurbita maxima]1.6e-24385.61Show/hide
Query:  MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQ
        MAESDV      L  GQTQPTPSKFH+HILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEI+VS+FDNVQ
Subjt:  MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQ

Query:  SYVSGLLHVSSVFDDEPETPSANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESKTVSGSKPRVSSRRV
        SYVS LLHVSSVFDDEP TPSANDES+SS DE+KVQTW +RYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVD+E KT+SGSK RVSSRR 
Subjt:  SYVSGLLHVSSVFDDEPETPSANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESKTVSGSKPRVSSRRV

Query:  LSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI-TQKLSPSPSPSPRKLSPSPTVSPE
        LSMP  SSN ELNEK+VL SPVPWRSRSE  EVQEEADN P+YSPAAPMEESES+WIDSRSSRP TSRSSR S I TQKL  SPSPSPRK SPSPTVSPE
Subjt:  LSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI-TQKLSPSPSPSPRKLSPSPTVSPE

Query:  LQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRRSLTSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGR
        LQAKSAEDLVRKKNFY SPPPPPPPPPPPTVRRISSMKP  WL  +++DV HQKDLRRSL SKPR SIRD G+  D ++ ANSS E LPRNYVDGQSMG+
Subjt:  LQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRRSLTSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGR

Query:  SVRTIRPGEVVNEPPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVVGQFIREDNGEPFNVKRRDNERS
        SVRTIRPGEVVNEPPRRGREFGG D LKG KMEQNAH QEFEENPIE+PDEDK +LVEKLAME  DDME+ EEED+VVGQFIREDNGEPFNVKRRD   S
Subjt:  SVRTIRPGEVVNEPPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVVGQFIREDNGEPFNVKRRDNERS

Query:  SSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQ
        SS EEA S+NMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQ
Subjt:  SSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQ

XP_023540912.1 uncharacterized protein DDB_G0284459-like [Cucurbita pepo subsp. pepo]8.8e-24285.13Show/hide
Query:  MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQ
        MAESDV      L  GQ QPTPSKFH+HILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEI+VS+FDNVQ
Subjt:  MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQ

Query:  SYVSGLLHVSSVFDDEPETPSANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESKTVSGSKPRVSSRRV
        SYVS LLHVSSVFDDEP TPSANDES+SS DENKVQTW +RYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDE KT+SGSK RVSSRR 
Subjt:  SYVSGLLHVSSVFDDEPETPSANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESKTVSGSKPRVSSRRV

Query:  LSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI-TQKLSPSPSPSPRKLSPSPTVSPE
        LSMP  SSN ELNEK+VL SPVPWRSRSER EVQEEADN P+YSPAAPMEESES+WIDSRSSRP TSRSSR S I TQKL  SPSPSPRK SPSPTVSPE
Subjt:  LSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI-TQKLSPSPSPSPRKLSPSPTVSPE

Query:  LQAKSAEDLVRKKNFYRS--PPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRRSLTSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSM
        LQAKSAEDLVRKKNFY S  PPPPPPPPPPPTVRRISSMKP  WL  +++DV HQ DLRRSLT+KPR  IRD G+  D ++ ANSS E LPRNYVDGQSM
Subjt:  LQAKSAEDLVRKKNFYRS--PPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRRSLTSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSM

Query:  GRSVRTIRPGEVVNEPPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVVGQFIREDNGEPFNVKRRDNE
        G+SVRTIRPGEVVNEPPRRGREFGG D LKG KMEQN H QEFEENPIEFPDEDKE LVEKLAME  DDME+ EEED+VVGQFIREDNGEPFNVKRRD  
Subjt:  GRSVRTIRPGEVVNEPPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVVGQFIREDNGEPFNVKRRDNE

Query:  RSSSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQ
         +S  EEA S+NMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQ
Subjt:  RSSSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQ

TrEMBL top hitse value%identityAlignment
A0A6J1DZG8 WW domain-binding protein 114.7e-22576.78Show/hide
Query:  MAESDV--------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDN
        MAE+++        L   Q +  PSKFH+H+LYKVLTAIFFLVILPLVPS+APEFINQTLLTRSWELLHLLFVGIAVSYGLFSRR +E E+E++ SKFDN
Subjt:  MAESDV--------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDN

Query:  VQSYVSGLLHVSSVFDDEPETPSANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVD----DESKTVSGSKPR
        VQSYVSGLLHVSSVFDDEPETPSANDESLSSSDE+KVQTW SRYFRNESVVVAEERP VNEQRVRSEKPLLLPVRSLKSRVV D    DES+ VSGSKPR
Subjt:  VQSYVSGLLHVSSVFDDEPETPSANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVD----DESKTVSGSKPR

Query:  VSSRRVLSMPKGSSNGE------------LNEKVVLRSPVPWRSRSERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVITQKLSPS
         SSRR+LS  K S+ GE            LNE VVLRSPVPWRSRS RME+QEEADNPP+YSP A MEESESNWIDSRSSRPQTSRS+R + I QKLSPS
Subjt:  VSSRRVLSMPKGSSNGE------------LNEKVVLRSPVPWRSRSERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVITQKLSPS

Query:  PSPS--PRKLSPSPTVSPELQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMK--PWLNDNDNDVPHQKDLRRSLTSKPRSSIRDAGNGTDTIIGA
        PSPS  P+K SP PTVSPELQ K AED VRKK+FYRSPPPPPPPPPPP VRRISSMK   WL  NDNDVPHQKDLRRS TSKPRSSIRD G+  D ++G 
Subjt:  PSPS--PRKLSPSPTVSPELQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMK--PWLNDNDNDVPHQKDLRRSLTSKPRSSIRDAGNGTDTIIGA

Query:  NSS-VEALPRNYVDGQSMGRSVRTIRPGEVVNEPPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVVGQ
        NSS V   PRNYVD QSMG+SVRTIRPGE+VNEPPRRGRE GGN+ LKG+   QN HVQ+FEENPIEFPDE+KEELVEKL METDDDME+ EEED +  +
Subjt:  NSS-VEALPRNYVDGQSMGRSVRTIRPGEVVNEPPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVVGQ

Query:  FIREDNGEPFNVKRRDNERSSSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQT
        FIR+ NG  +   R+DNERSSSNEEA S++MA DGGPDVDKKADEFIAKFREQIRLQRIE IK+SSGQI RN+SRQT
Subjt:  FIREDNGEPFNVKRRDNERSSSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQT

A0A6J1E6G0 uncharacterized protein DDB_G02844591.5e-22677.35Show/hide
Query:  MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQ
        MAESDV      LPP + + TPSKF++HILYK+L AIFFLVILPLVPSQAPEF+NQTLLTR+WELLHLLFVGIAVSYGLFSRR DE EDEI+VS FDNVQ
Subjt:  MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQ

Query:  SYVSGLLHVSSVFDDEPETPSANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESKTVSGSKPRVSSRRV
        SYVSGLLHVSSVFDDE ETPSANDES+S SD NKVQTW +RYFRNESV V+EE PVVNEQRVRSEKPLLLPVRSLKSRVVVDDES+TVSGS  RVSSRR+
Subjt:  SYVSGLLHVSSVFDDEPETPSANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESKTVSGSKPRVSSRRV

Query:  LSMPKGSSNGEL------------NEKVVLRSPVPWRSRSERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI---TQKLSPSPSP
        LS  K SSNGE+            NE V L SPVPWRSRS R EVQEEADNPP+YSPA PMEESESNWIDSRSSRPQTSRSS+ S I       SPSPSP
Subjt:  LSMPKGSSNGEL------------NEKVVLRSPVPWRSRSERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI---TQKLSPSPSP

Query:  SPRKLSPSPTVSPELQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRRSL-TSKPRSSIRDAGNGTDTIIGANSSV
        SPRK SPSP VSPEL+AKS+E  VRKK+F+ SPPPPPPPPPPP VRRI+SMKP  WL  NDNDVPHQKDL+RS+ TSKPRSSIR  G+  D ++G NSS 
Subjt:  SPRKLSPSPTVSPELQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRRSL-TSKPRSSIRDAGNGTDTIIGANSSV

Query:  EALPRNYVDGQSMGRSVRTIRPGEVVNEPPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMESEEED-NVVGQFIRED
        EALPRNY D  SMG+S R IRPGEV NEPPRRGREFGG D LKGK ++QNAHVQ FEENPIEFP+++K+ELVEKL+METDDDMES+EED N+VG+FIRED
Subjt:  EALPRNYVDGQSMGRSVRTIRPGEVVNEPPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMESEEED-NVVGQFIRED

Query:  NGEPFNVKRRDNERSSSN--EEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQT
        NGEPFNV RRDNERSSSN  E  SS+N++NDGGPDVDKKADEFIAKFREQIRLQRIE IK+S+GQI RNTS+QT
Subjt:  NGEPFNVKRRDNERSSSN--EEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQT

A0A6J1GC23 MAP7 domain-containing protein 1-like5.8e-23984.71Show/hide
Query:  MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQ
        MAESDV      L   Q QPTPSKFH+HILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEI+VS+FDNVQ
Subjt:  MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQ

Query:  SYVSGLLHVSSVFDDEPETPSANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESKTVSGSKPRVSSRRV
        SYVS LLHVSSVFDDEP TPSANDES+SS DENKVQTW +RYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDE KT+SGSK RVSSRR 
Subjt:  SYVSGLLHVSSVFDDEPETPSANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESKTVSGSKPRVSSRRV

Query:  LSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI-TQKLSPSPSPSPRKLSPSPTVSPE
        LSMP  SSN E+NEKVVL SPVPWRSRSER EVQEEA+N P+YSPAAPMEESES+WIDSRSSRP TSRSSR S I TQKL  SPSPSPRK SPSPTVSPE
Subjt:  LSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI-TQKLSPSPSPSPRKLSPSPTVSPE

Query:  LQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRRSLTSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGR
        LQAKSAEDLVRKKNFY S  PPPPPPPPPTVRRISSMKP  WL  +D+DV HQ DLRRSLT+KPR SIRD G+  D ++ ANSS E  PRNYVDGQSMG+
Subjt:  LQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRRSLTSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGR

Query:  SVRTIRPGEVVNEPPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVVGQFIREDNGEPFNVKRRDNERS
        SVRTIRPGEV+NEPPRRGREFGG D LKG KMEQN H QEFEENPIEFPDE KE LVEKLAME  DDME+ EEED+VVGQFIREDNGEPFNVKRRD   +
Subjt:  SVRTIRPGEVVNEPPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVVGQFIREDNGEPFNVKRRDNERS

Query:  SSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQ
        SS EEA S+NMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQ
Subjt:  SSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQ

A0A6J1L1K4 uncharacterized protein DDB_G0284459-like8.6e-22778.28Show/hide
Query:  MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQ
        MAESDV      LPPG+ Q TPSKF++HILYK+L AIFFLVILPLVPSQAPEF+NQTLLTR+WELLHLLFVGIAVSYGLFSRR DE ED I+VS FDNVQ
Subjt:  MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQ

Query:  SYVSGLLHVSSVFDDEPETPSANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESKTVSGSKPRVSSRRV
        SYVSGLLHVSSVFDDE ETPSANDES+SSSD NKVQTW +RYFRNES+VVAEE PVVNEQRVRSEKPLLLPVRSL S+VVVDDES+TVSGS  RVSS R+
Subjt:  SYVSGLLHVSSVFDDEPETPSANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESKTVSGSKPRVSSRRV

Query:  LSMPKGSSNGE------------LNEKVVLRSPVPWRSRSERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVITQKLS-PSPSPSP
        LS  K SSNGE            LNE VVL SPVPWRSRS R EVQEEADNPPVYSPA PMEESESNWIDSRSSRPQTSRS + S I  KLS PSPSP P
Subjt:  LSMPKGSSNGE------------LNEKVVLRSPVPWRSRSERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVITQKLS-PSPSPSP

Query:  RKLSPSPTVSPELQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKPWLNDNDNDVPHQKDLRRSL-TSKPRSSIRDAGNGTDTIIGANSSVEALP
        RK SPSP VSPEL+AKS+ED VRKK+F+ SPPPPPPPPPPP VRRI+SMKP    NDNDVPHQKDL+RS+ TSKPR SIRD G+  D ++G NSS EALP
Subjt:  RKLSPSPTVSPELQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKPWLNDNDNDVPHQKDLRRSL-TSKPRSSIRDAGNGTDTIIGANSSVEALP

Query:  RNYVDGQSMGRSVRTIRPGEVVNEPPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMET--DDDMESEEED-NVVGQFIREDNG
        RNY D  SMG+S+R IRPGEV NEP RRGREFGGND LKGK ++QN HVQ FEENPIEFPD+DK+E VEKL MET  DDDMESEEED N+VG+FIREDNG
Subjt:  RNYVDGQSMGRSVRTIRPGEVVNEPPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMET--DDDMESEEED-NVVGQFIREDNG

Query:  EPFNVKRRDNERSSSNEEA-SSNNMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQT
        EPFNV RRDNERSSSNEEA  S+N++NDGGPDVDKKADEFIAKFREQIRLQRIE IK+S+GQI RNTS+Q+
Subjt:  EPFNVKRRDNERSSSNEEA-SSNNMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQT

A0A6J1L1Z4 uncharacterized protein DDB_G0284459-like7.8e-24485.61Show/hide
Query:  MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQ
        MAESDV      L  GQTQPTPSKFH+HILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEI+VS+FDNVQ
Subjt:  MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQ

Query:  SYVSGLLHVSSVFDDEPETPSANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESKTVSGSKPRVSSRRV
        SYVS LLHVSSVFDDEP TPSANDES+SS DE+KVQTW +RYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVD+E KT+SGSK RVSSRR 
Subjt:  SYVSGLLHVSSVFDDEPETPSANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESKTVSGSKPRVSSRRV

Query:  LSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI-TQKLSPSPSPSPRKLSPSPTVSPE
        LSMP  SSN ELNEK+VL SPVPWRSRSE  EVQEEADN P+YSPAAPMEESES+WIDSRSSRP TSRSSR S I TQKL  SPSPSPRK SPSPTVSPE
Subjt:  LSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI-TQKLSPSPSPSPRKLSPSPTVSPE

Query:  LQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRRSLTSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGR
        LQAKSAEDLVRKKNFY SPPPPPPPPPPPTVRRISSMKP  WL  +++DV HQKDLRRSL SKPR SIRD G+  D ++ ANSS E LPRNYVDGQSMG+
Subjt:  LQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRRSLTSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGR

Query:  SVRTIRPGEVVNEPPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVVGQFIREDNGEPFNVKRRDNERS
        SVRTIRPGEVVNEPPRRGREFGG D LKG KMEQNAH QEFEENPIE+PDEDK +LVEKLAME  DDME+ EEED+VVGQFIREDNGEPFNVKRRD   S
Subjt:  SVRTIRPGEVVNEPPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVVGQFIREDNGEPFNVKRRDNERS

Query:  SSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQ
        SS EEA S+NMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQ
Subjt:  SSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G60380.1 FUNCTIONS IN: molecular_function unknown7.4e-2128.6Show/hide
Query:  FLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQ-SYVSGLLHVSSVFDDEPETPSA------NDESLSS--
        FL+ LPL PSQAP+F+ +T+LT+ WEL+HLLFVGIAV+YGLFSRR  E   ++ +++ D    SYVS +  VSSVFD+E +  S       +DES+S+  
Subjt:  FLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQ-SYVSGLLHVSSVFDDEPETPSA------NDESLSS--

Query:  ----------------------SDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKS--RVVVDDESKTVSGSKPRVSSRRVLSMPK
                               + N+V+ W S+YF+ +S VV   RP          +PL LP+R L+S  R     + K+ + S     +    S+  
Subjt:  ----------------------SDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKS--RVVVDDESKTVSGSKPRVSSRRVLSMPK

Query:  GSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVITQKLSPSPSPSPRKLSPSPTVSPELQAKSA
         +   E+       SPVPW++R E M +    DN P  S   P+   E+  + S SSR   S SS+TS  +Q        +  + SPS +VS E    + 
Subjt:  GSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVITQKLSPSPSPSPRKLSPSPTVSPELQAKSA

Query:  EDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKPWLNDNDNDVPHQKDLRRSLTSKPRSSIRDAGNGT----DTIIGANSSVEALPRNYVDGQSMGRSVRT
        E+LV++K+   S     P  PP      S   P L  ND       +L    T +  S  R   +G+    D   G  + +E         +   +  R 
Subjt:  EDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKPWLNDNDNDVPHQKDLRRSLTSKPRSSIRDAGNGT----DTIIGANSSVEALPRNYVDGQSMGRSVRT

Query:  IRPGEVVNEPPRRGRE-------------FGGND--PLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMESEEEDNVVGQFIREDNGEPF
         +   +  E  RRG +              GG D    + + ++Q ++    EEN  +  + D   L  K    + D +E   ED+   + + E      
Subjt:  IRPGEVVNEPPRRGRE-------------FGGND--PLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMESEEEDNVVGQFIREDNGEPF

Query:  NVKRRDNERSSSNEEASSNNMANDGGPDVDKKADEFIAKFRE
         V +  N ++S     SS      GG D   + D    K  +
Subjt:  NVKRRDNERSSSNEEASSNNMANDGGPDVDKKADEFIAKFRE

AT4G16790.1 hydroxyproline-rich glycoprotein family protein6.0e-3933.58Show/hide
Query:  PSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRR---------TDEIEDEITVSKFDNVQSYVSGLLHVSSV
        P KF++  ++K L       ++P+  SQ PE  NQ   TR  ELLHL+FVGIAVSYGLFSRR         T   +        +N  SYV  +L VSSV
Subjt:  PSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRR---------TDEIEDEITVSKFDNVQSYVSGLLHVSSV

Query:  FDDEPETPSANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLK-SRVVVDDESKTVSGSKPRVSSRRVLSMPKGSSNGE
        F+   E+ S   +  SS D+ K QTW ++Y  +  +   E R V        EKPLLLPVRSL  SR  V D S   SG   +V S+R L    G  N +
Subjt:  FDDEPETPSANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLK-SRVVVDDESKTVSGSKPRVSSRRVLSMPKGSSNGE

Query:  LNEKVVLRSPVPWRSRSERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVITQKLSPSPSPSPRKLSPSPTVSPELQAKSAEDLVRK
             VL SP+PWRSRS                  +    S S  ++S  S    +      +I     PS   SPRK +P P ++ E            
Subjt:  LNEKVVLRSPVPWRSRSERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVITQKLSPSPSPSPRKLSPSPTVSPELQAKSAEDLVRK

Query:  KNFYRSPPPPPPPPPP-PTVRRISSMKPWLNDNDNDVPHQKDLRRSLTSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGRSVRTIRPGEVVNE
          F+ SPPPPPPPPPP P     SS K      D+   ++ + R S   K +              G        P                 P E    
Subjt:  KNFYRSPPPPPPPPPP-PTVRRISSMKPWLNDNDNDVPHQKDLRRSLTSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGRSVRTIRPGEVVNE

Query:  PPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMESEEEDNVVGQFIRE-DNGEPFNVKRRDNERSSSNEEASSNNMAN
        PP + R          +KM++NA  + +  +PI    E KE+  EK          +++  N+  + + E +NGE    +R +NE    ++E     +  
Subjt:  PPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMESEEEDNVVGQFIRE-DNGEPFNVKRRDNERSSSNEEASSNNMAN

Query:  DG------GPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSR
        +G      G DVDKKADEFIAKFREQIRLQRIE IK+S+ +I  N+SR
Subjt:  DG------GPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAATCAGACGTTCTCCCCCCAGGGCAAACTCAGCCTACTCCAAGTAAGTTCCATACCCATATCTTGTACAAAGTTTTAACTGCAATTTTCTTTCTGGTGATTCT
CCCTCTAGTCCCCTCCCAAGCCCCTGAGTTCATCAATCAAACGCTACTCACCAGAAGCTGGGAGCTTCTCCACCTTCTTTTCGTCGGAATCGCTGTTTCTTACGGCCTCT
TTAGCCGGAGAACTGACGAGATAGAGGATGAAATTACTGTCTCTAAGTTTGATAATGTTCAATCTTATGTTTCTGGATTACTTCATGTTTCGTCTGTTTTTGATGATGAG
CCTGAAACTCCATCTGCTAATGATGAATCGCTGTCTTCGTCTGATGAAAATAAGGTCCAAACATGGGGTAGTCGGTATTTTAGGAATGAGTCTGTGGTTGTTGCTGAAGA
ACGTCCTGTTGTTAATGAGCAGAGAGTTAGGAGTGAGAAACCTCTGCTTCTTCCTGTTCGTAGCTTGAAGTCTCGTGTTGTTGTTGACGATGAGTCTAAAACTGTTTCTG
GTTCTAAGCCCAGAGTGAGTTCGAGAAGAGTGTTGAGCATGCCGAAGGGTAGTTCGAATGGGGAATTGAATGAGAAGGTCGTTCTTCGATCCCCGGTTCCATGGCGATCG
AGATCGGAGAGGATGGAAGTGCAAGAAGAAGCTGATAATCCTCCTGTGTATTCTCCTGCTGCTCCCATGGAGGAATCTGAATCGAATTGGATTGATTCTCGGTCGTCGAG
GCCTCAAACTTCAAGGTCTTCTCGAACTAGTGTCATTACTCAGAAGCTATCTCCTTCTCCTTCTCCATCTCCAAGGAAATTATCTCCTTCTCCTACTGTGTCGCCTGAAT
TACAGGCCAAGAGTGCTGAGGATTTGGTGAGGAAGAAGAACTTTTACCGGTCTCCTCCACCTCCACCGCCTCCTCCACCCCCGCCAACTGTTCGAAGAATTTCCTCAATG
AAACCTTGGTTGAATGACAATGACAATGATGTACCTCATCAAAAGGATTTGAGGCGAAGCCTTACTAGCAAACCTAGAAGCTCAATTCGTGATGCGGGAAATGGTACCGA
TACAATCATTGGTGCTAATTCAAGTGTTGAAGCTCTGCCTAGAAATTATGTTGATGGTCAATCAATGGGAAGATCTGTTAGAACAATCAGACCAGGGGAAGTTGTGAATG
AGCCACCAAGAAGAGGGAGAGAATTTGGTGGAAATGATCCATTGAAGGGGAAGAAGATGGAACAGAATGCCCATGTCCAAGAATTTGAAGAAAACCCCATTGAGTTTCCA
GATGAAGATAAAGAAGAACTGGTCGAAAAGCTAGCCATGGAAACCGATGACGACATGGAAAGCGAAGAAGAAGACAATGTTGTGGGACAGTTTATCCGGGAAGATAACGG
AGAACCTTTCAATGTGAAAAGGAGAGACAACGAAAGAAGTTCGAGTAATGAAGAAGCAAGCTCTAACAATATGGCTAATGATGGAGGACCTGATGTAGATAAGAAAGCTG
ATGAGTTCATTGCCAAATTCAGAGAGCAAATCAGGCTTCAAAGGATTGAATTAATCAAGAAATCAAGTGGACAAATTGGTAGGAACACTTCAAGGCAAACTTGA
mRNA sequenceShow/hide mRNA sequence
CAAAGCTTGAAATTGTTCTTTGTTCCTTCTCCAAACCCTTTCCCCTTCTCTGCAACTCTCTTTCTCCTACTTTTTCCTTCGCCATTAAGTCTTCATCTTCTTCCATTTCC
CTCCAATTCATCGAAGAACCAGAATAATCTCTTCCTCCCTCCTTCTTCAATGGCGGAATCAGACGTTCTCCCCCCAGGGCAAACTCAGCCTACTCCAAGTAAGTTCCATA
CCCATATCTTGTACAAAGTTTTAACTGCAATTTTCTTTCTGGTGATTCTCCCTCTAGTCCCCTCCCAAGCCCCTGAGTTCATCAATCAAACGCTACTCACCAGAAGCTGG
GAGCTTCTCCACCTTCTTTTCGTCGGAATCGCTGTTTCTTACGGCCTCTTTAGCCGGAGAACTGACGAGATAGAGGATGAAATTACTGTCTCTAAGTTTGATAATGTTCA
ATCTTATGTTTCTGGATTACTTCATGTTTCGTCTGTTTTTGATGATGAGCCTGAAACTCCATCTGCTAATGATGAATCGCTGTCTTCGTCTGATGAAAATAAGGTCCAAA
CATGGGGTAGTCGGTATTTTAGGAATGAGTCTGTGGTTGTTGCTGAAGAACGTCCTGTTGTTAATGAGCAGAGAGTTAGGAGTGAGAAACCTCTGCTTCTTCCTGTTCGT
AGCTTGAAGTCTCGTGTTGTTGTTGACGATGAGTCTAAAACTGTTTCTGGTTCTAAGCCCAGAGTGAGTTCGAGAAGAGTGTTGAGCATGCCGAAGGGTAGTTCGAATGG
GGAATTGAATGAGAAGGTCGTTCTTCGATCCCCGGTTCCATGGCGATCGAGATCGGAGAGGATGGAAGTGCAAGAAGAAGCTGATAATCCTCCTGTGTATTCTCCTGCTG
CTCCCATGGAGGAATCTGAATCGAATTGGATTGATTCTCGGTCGTCGAGGCCTCAAACTTCAAGGTCTTCTCGAACTAGTGTCATTACTCAGAAGCTATCTCCTTCTCCT
TCTCCATCTCCAAGGAAATTATCTCCTTCTCCTACTGTGTCGCCTGAATTACAGGCCAAGAGTGCTGAGGATTTGGTGAGGAAGAAGAACTTTTACCGGTCTCCTCCACC
TCCACCGCCTCCTCCACCCCCGCCAACTGTTCGAAGAATTTCCTCAATGAAACCTTGGTTGAATGACAATGACAATGATGTACCTCATCAAAAGGATTTGAGGCGAAGCC
TTACTAGCAAACCTAGAAGCTCAATTCGTGATGCGGGAAATGGTACCGATACAATCATTGGTGCTAATTCAAGTGTTGAAGCTCTGCCTAGAAATTATGTTGATGGTCAA
TCAATGGGAAGATCTGTTAGAACAATCAGACCAGGGGAAGTTGTGAATGAGCCACCAAGAAGAGGGAGAGAATTTGGTGGAAATGATCCATTGAAGGGGAAGAAGATGGA
ACAGAATGCCCATGTCCAAGAATTTGAAGAAAACCCCATTGAGTTTCCAGATGAAGATAAAGAAGAACTGGTCGAAAAGCTAGCCATGGAAACCGATGACGACATGGAAA
GCGAAGAAGAAGACAATGTTGTGGGACAGTTTATCCGGGAAGATAACGGAGAACCTTTCAATGTGAAAAGGAGAGACAACGAAAGAAGTTCGAGTAATGAAGAAGCAAGC
TCTAACAATATGGCTAATGATGGAGGACCTGATGTAGATAAGAAAGCTGATGAGTTCATTGCCAAATTCAGAGAGCAAATCAGGCTTCAAAGGATTGAATTAATCAAGAA
ATCAAGTGGACAAATTGGTAGGAACACTTCAAGGCAAACTTGAAGTTTGAAAGATAGATGTATCTTGCCTTTCATTCAATTCCTGGAAAGTGTTTCTTCTTCTCCAAATT
CACCCTCTGGATTTATGATAGATGTTGTAATTCAGTAGGCATCTGCCATTAGTGTTTGTTGATATACAGCTGAAACAACAAGGTTTATGAGACTTAGTTGTATGTCATGT
GTTTGAGAGTTAGCTTTTTAATTTGTCTATTAAATAGATGGACTAGGTTGTTTCGCTTTAATTAACTATTTCTTTCTATATATTTGGATATAGAACTTTTAACTACAAAA
A
Protein sequenceShow/hide protein sequence
MAESDVLPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQSYVSGLLHVSSVFDDE
PETPSANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESKTVSGSKPRVSSRRVLSMPKGSSNGELNEKVVLRSPVPWRS
RSERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVITQKLSPSPSPSPRKLSPSPTVSPELQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSM
KPWLNDNDNDVPHQKDLRRSLTSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGRSVRTIRPGEVVNEPPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFP
DEDKEELVEKLAMETDDDMESEEEDNVVGQFIREDNGEPFNVKRRDNERSSSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQT