; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002043 (gene) of Snake gourd v1 genome

Gene IDTan0002043
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPhotolyase/cryptochrome alpha/beta domain-containing protein
Genome locationLG01:96793338..96801740
RNA-Seq ExpressionTan0002043
SyntenyTan0002043
Gene Ontology termsNA
InterPro domainsIPR000073 - Alpha/beta hydrolase fold-1
IPR006050 - DNA photolyase, N-terminal
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold
IPR029058 - Alpha/Beta hydrolase fold
IPR036155 - Cryptochrome/photolyase, N-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579658.1 Pheophytinase, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0083.11Show/hide
Query:  MSIFNFSHSPSLPFTTSNSNPFPFFTCSSSSSLLSLSSATPKT--GGLALLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRILSRFSDEMLEIVLLA
        MSIF FS SPSL F T  + P      SS+    + SSA   T  GGLALLWFKHDLRIDDHPALHAAASQFPAL+PLYIFDSR+LSRFSDEMLEI LLA
Subjt:  MSIFNFSHSPSLPFTTSNSNPFPFFTCSSSSSLLSLSSATPKT--GGLALLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRILSRFSDEMLEIVLLA

Query:  LEGLRHSLRDRGLDLMLKFGDAESVLRELVIEVKATHVFAEEEVEQELCVLMDDVSQTLASLIESPNLTIWRTPFYDLRNLEVLPTSYDEFRKLQLPVHP
        LE L+ SLRDRGLDL++KFGDAESVLRELV+EVKATHVFAEEEVE ELCVLMD VSQTL+SLI+SPNLT+WRTPFYD+++LE LP SYDEFRKLQLPV  
Subjt:  LEGLRHSLRDRGLDLMLKFGDAESVLRELVIEVKATHVFAEEEVEQELCVLMDDVSQTLASLIESPNLTIWRTPFYDLRNLEVLPTSYDEFRKLQLPVHP

Query:  PLSSPTLPCLKMDLNWGIMPTFDGLKEFVNSNQVYKPREDWHSIKNTTAEAIVRAKSSKKGSNENNPSLKESRTEGMGKSIFSIRGKIFQMGGGTEGVLN
        PLSSPTLPCL+++LNWG MPTFD LKEF+NSN++YKP EDW SIKNTTAEA VRAK SK+GSNE NPS KESRTE MGKSIFS +  +  + GGTEG+LN
Subjt:  PLSSPTLPCLKMDLNWGIMPTFDGLKEFVNSNQVYKPREDWHSIKNTTAEAIVRAKSSKKGSNENNPSLKESRTEGMGKSIFSIRGKIFQMGGGTEGVLN

Query:  ALAAYIRYNEGTSRDDWQLLHEMVRNSESRDGASFIKLFGPAIHLGLISKRRVHYEAIKYEKERNAGFLSPFGYSAGTVAAAVDAVLSSEWYWLMSLKSK
        ALAAYIRYNEGTSRDDWQ LHEMVRNSESRDG SFIKLFGPAIHLG+ISKRRVHYEAIKYEKERNAGFLSPFGYSAGTVAAAVDAVLSSEWY LM LKSK
Subjt:  ALAAYIRYNEGTSRDDWQLLHEMVRNSESRDGASFIKLFGPAIHLGLISKRRVHYEAIKYEKERNAGFLSPFGYSAGTVAAAVDAVLSSEWYWLMSLKSK

Query:  GRRLGSHSYRNWRWNGFLVQYTVVGCEGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITMLGFGSLEKPNIVYSELMWAEFLRDFIVEVVGRPVHVV
        GR LGS SYR WRWNGFLVQYTV GC+GPA+LLVHGFGAFLEHFRDNI GI E GNQVWAITMLGFGS EKPNIVYSELMWAE L+DFIV+VVGRPVH+V
Subjt:  GRRLGSHSYRNWRWNGFLVQYTVVGCEGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITMLGFGSLEKPNIVYSELMWAEFLRDFIVEVVGRPVHVV

Query:  GNSIGGYIVAIVASLWPSLVKSIVLINSAGSVIPGYSFLPSKKERQVTGAAWLAARLLLFYLRLKTKDIVRNCYPTRTERADEWLINEMLRASKDPGGLV
        GNSIGGYIVAIVA LWP+LVKSIVLINSAGSVIPGYS LP  KERQV+ AAWL ARLLL YLRLKTKDIV+NCYPTRTERAD WLI+EMLRASKDPGGLV
Subjt:  GNSIGGYIVAIVASLWPSLVKSIVLINSAGSVIPGYSFLPSKKERQVTGAAWLAARLLLFYLRLKTKDIVRNCYPTRTERADEWLINEMLRASKDPGGLV

Query:  VLESIFSLDLSVPLNYLLEGLEERVLIIQGMKDPIYNSKSMLAMLKEHCAGVTIKELDAGHCPHDELPEEVNSILCEWIVRVESKPK
        VLESIFS DLSVPLNYLLEGLEERVLIIQGMKDPIYNSK++LA LKE C GVTIKELDAGHCPHDELP++VNSILCEWI+R+E+K K
Subjt:  VLESIFSLDLSVPLNYLLEGLEERVLIIQGMKDPIYNSKSMLAMLKEHCAGVTIKELDAGHCPHDELPEEVNSILCEWIVRVESKPK

XP_022928782.1 uncharacterized protein LOC111435603 isoform X1 [Cucurbita moschata]0.0e+0082.75Show/hide
Query:  MSIFNFSHSPSLPFTTSNSNPFPFFTCSSSSSLLSLSSATPKT--GGLALLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRILSRFSDEMLEIVLLA
        MSIF FS SPSL F T  + P      SS+    + SSA   T  GGLALLWFKHDLRIDDHPALHAAASQFPAL+PLYIFDSR+LSRFSDEMLEI LLA
Subjt:  MSIFNFSHSPSLPFTTSNSNPFPFFTCSSSSSLLSLSSATPKT--GGLALLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRILSRFSDEMLEIVLLA

Query:  LEGLRHSLRDRGLDLMLKFGDAESVLRELVIEVKATHVFAEEEVEQELCVLMDDVSQTLASLIESPNLTIWRTPFYDLRNLEVLPTSYDEFRKLQLPVHP
        LE L+ SLRDRGLDL++KFGDAESVLRELV+EVKATHVFAEEEVE ELCVLMD VSQTL+SLI+SPNLT+WRTPFYD+++LE LP SYDEFRKLQLPV  
Subjt:  LEGLRHSLRDRGLDLMLKFGDAESVLRELVIEVKATHVFAEEEVEQELCVLMDDVSQTLASLIESPNLTIWRTPFYDLRNLEVLPTSYDEFRKLQLPVHP

Query:  PLSSPTLPCLKMDLNWGIMPTFDGLKEFVNSNQVYKPREDWHSIKNTTAEAIVRAKSSKKGSNENNPSLKESRTEGMGKSIFSIRGKIFQMGGGTEGVLN
        PLSSPTLPCL+++LNWG MPTFD LKEF+NSN++YKP EDW SIKNTTAEA VRAK SK+GSNE NPS KESRTE MGKSIFS +  +  + GGTEG LN
Subjt:  PLSSPTLPCLKMDLNWGIMPTFDGLKEFVNSNQVYKPREDWHSIKNTTAEAIVRAKSSKKGSNENNPSLKESRTEGMGKSIFSIRGKIFQMGGGTEGVLN

Query:  ALAAYIRYNEGTSRDDWQLLHEMVRNSESRDGASFIKLFGPAIHLGLISKRRVHYEAIKYEKERNAGFLSPFGYSAGTVAAAVDAVLSSEWYWLMSLKSK
        ALAAYIRYNEGTSRDDWQ LHEMVRNSESRDGASFIKLFGPAIHLG+ISKRRVHYEAIKYEKERNAGFLSPFGYSAGTVAAAVDAVLSSEWY LM LKSK
Subjt:  ALAAYIRYNEGTSRDDWQLLHEMVRNSESRDGASFIKLFGPAIHLGLISKRRVHYEAIKYEKERNAGFLSPFGYSAGTVAAAVDAVLSSEWYWLMSLKSK

Query:  GRRLGSHSYRNWRWNGFLVQYTVVGCEGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITMLGFGSLEKPNIVYSELMWAEFLRDFIVEVVGRPVHVV
        GR LGS SYR WRWNGFLVQYTV GC+GPA+LLVHGFGAFLEHFRDNI GI E GNQVWAITMLGFGS EKPNIVYSELMWAE L+DFIV+VVGRPVH+V
Subjt:  GRRLGSHSYRNWRWNGFLVQYTVVGCEGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITMLGFGSLEKPNIVYSELMWAEFLRDFIVEVVGRPVHVV

Query:  GNSIGGYIVAIVASLWPSLVKSIVLINSAGSVIPGYSFLPSKKERQVTGAAWLAARLLLFYLRLKTKDIVRNCYPTRTERADEWLINEMLRASKDPGGLV
        GNSIGGYIVAIVA LWP+LVKSIVLINSAGSVIPGYS LP  KE+QV+ AAWL ARLLL YLR+KTKDIVRNCYPTRTERAD WL++EMLRASKDPGGLV
Subjt:  GNSIGGYIVAIVASLWPSLVKSIVLINSAGSVIPGYSFLPSKKERQVTGAAWLAARLLLFYLRLKTKDIVRNCYPTRTERADEWLINEMLRASKDPGGLV

Query:  VLESIFSLDLSVPLNYLLEGLEERVLIIQGMKDPIYNSKSMLAMLKEHCAGVTIKELDAGHCPHDELPEEVNSILCEWIVRVESKPKAGS
        VLESIFS DLSVPLN+LLEGLEERVLIIQGMKDPIYNSK++LA LKE C GVTIKELDAGHCPHDELP++VNSILCEWI+R+E+KPK  S
Subjt:  VLESIFSLDLSVPLNYLLEGLEERVLIIQGMKDPIYNSKSMLAMLKEHCAGVTIKELDAGHCPHDELPEEVNSILCEWIVRVESKPKAGS

XP_022970224.1 uncharacterized protein LOC111469174 isoform X1 [Cucurbita maxima]0.0e+0082.46Show/hide
Query:  MSIFNFSHSPSLPFTTSNSNPFPFFTCSSSSSLLSLSSATPKT--GGLALLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRILSRFSDEMLEIVLLA
        MSIF FS SPSL F T  + P      SS+    +LSSA   T  GGLALLWFKHDLRIDDHPALHAAASQFPAL+PLYIFDSR+LSRFSDEMLEI LLA
Subjt:  MSIFNFSHSPSLPFTTSNSNPFPFFTCSSSSSLLSLSSATPKT--GGLALLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRILSRFSDEMLEIVLLA

Query:  LEGLRHSLRDRGLDLMLKFGDAESVLRELVIEVKATHVFAEEEVEQELCVLMDDVSQTLASLIESPNLTIWRTPFYDLRNLEVLPTSYDEFRKLQLPVHP
        LE L+HSLRDRGLDL++KFGDAESVLRELV+EVKATHVFAEEEVE ELCVLMD VSQTL+SLI+SPNLT+WRTPFYD+++LE LP SYDEFRKLQLPV  
Subjt:  LEGLRHSLRDRGLDLMLKFGDAESVLRELVIEVKATHVFAEEEVEQELCVLMDDVSQTLASLIESPNLTIWRTPFYDLRNLEVLPTSYDEFRKLQLPVHP

Query:  PLSSPTLPCLKMDLNWGIMPTFDGLKEFVNSNQVYKPREDWHSIKNTTAEAIVRAKSSKKGSNENNPSLKESRTEGMGKSIFSIRGKIFQMGGGTEGVLN
        PLSSPTLP L+++LNWG MPTFD LKEF+NSN++YKP EDW SIKNTTAEA VRAK SK+GSNE NPS KESRTE MGKSIFS +  +  + GGTEG+LN
Subjt:  PLSSPTLPCLKMDLNWGIMPTFDGLKEFVNSNQVYKPREDWHSIKNTTAEAIVRAKSSKKGSNENNPSLKESRTEGMGKSIFSIRGKIFQMGGGTEGVLN

Query:  ALAAYIRYNEGTSRDDWQLLHEMVRNSESRDGASFIKLFGPAIHLGLISKRRVHYEAIKYEKERNAGFLSPFGYSAGTVAAAVDAVLSSEWYWLMSLKSK
        ALAAYIRYNEGTSRDDWQ LHEMVRNSESRDG SFIKLFGPAIHLG+ISKRRVHYEAIKYEKERNAGFLSPFGYSAGTVAAAVDAVLSSEWY LM LKSK
Subjt:  ALAAYIRYNEGTSRDDWQLLHEMVRNSESRDGASFIKLFGPAIHLGLISKRRVHYEAIKYEKERNAGFLSPFGYSAGTVAAAVDAVLSSEWYWLMSLKSK

Query:  GRRLGSHSYRNWRWNGFLVQYTVVGCEGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITMLGFGSLEKPNIVYSELMWAEFLRDFIVEVVGRPVHVV
        GR LGS SYR WRWNGFLVQYTV GC+GPA+LLVHGFGAFLEHFRDNI GI E GNQVWAITMLGFGS EKPNIVYSELMWAE L+DFIV+VVG+PVH+V
Subjt:  GRRLGSHSYRNWRWNGFLVQYTVVGCEGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITMLGFGSLEKPNIVYSELMWAEFLRDFIVEVVGRPVHVV

Query:  GNSIGGYIVAIVASLWPSLVKSIVLINSAGSVIPGYSFLPSKKERQVTGAAWLAARLLLFYLRLKTKDIVRNCYPTRTERADEWLINEMLRASKDPGGLV
        GNSIGGYIVAIVA LWP+LVKS VLINSAGSV+PGYS LP  KERQV+ AAWL ARLLL YLRLKTKDIV+NCYPTRTERAD WLI+EMLRASKDPGGLV
Subjt:  GNSIGGYIVAIVASLWPSLVKSIVLINSAGSVIPGYSFLPSKKERQVTGAAWLAARLLLFYLRLKTKDIVRNCYPTRTERADEWLINEMLRASKDPGGLV

Query:  VLESIFSLDLSVPLNYLLEGLEERVLIIQGMKDPIYNSKSMLAMLKEHCAGVTIKELDAGHCPHDELPEEVNSILCEWIVRVESKPKAGS
        VLESIFS DLSVPLN LLEGLEERVLIIQGMKDPIYNSK++LA LKE C GVTIKELDAGHCPHDELP++VNSILCEWI+R+E+K K  S
Subjt:  VLESIFSLDLSVPLNYLLEGLEERVLIIQGMKDPIYNSKSMLAMLKEHCAGVTIKELDAGHCPHDELPEEVNSILCEWIVRVESKPKAGS

XP_023550061.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111808364 [Cucurbita pepo subsp. pepo]0.0e+0082.32Show/hide
Query:  MSIFNFSHSPSLPFTTSNSNPFPFFTCSSSSSLLSLSSATPKT--GGLALLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRILSRFSDEMLEIVLLA
        MSIF FS SPSL F T    P P    SS+    +LSSA   T  GGLALLWFKHDLRIDDHPALHAAASQFPAL+PLYIFDSR+LSRFSDEMLEI LLA
Subjt:  MSIFNFSHSPSLPFTTSNSNPFPFFTCSSSSSLLSLSSATPKT--GGLALLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRILSRFSDEMLEIVLLA

Query:  LEGLRHSLRDRGLDLMLKFGDAESVLRELVIEVKATHVFAEEEVEQELCVLMDDVSQTLASLIESPNLTIWRTPFYDLRNLEVLPTSYDEFRKLQLPVHP
        LE L+HSLRD GLDL++KFGDAESVLRELV+EVKATHVFAEEE+E ELCVLMD VSQTL+SLI+SPNLT+WRTPFYD+++LE LP SYDEFRKLQLPV  
Subjt:  LEGLRHSLRDRGLDLMLKFGDAESVLRELVIEVKATHVFAEEEVEQELCVLMDDVSQTLASLIESPNLTIWRTPFYDLRNLEVLPTSYDEFRKLQLPVHP

Query:  PLSSPTLPCLKMDLNWGIMPTFDGLKEFVNSNQVYKPREDWHSIKNTTAEAIVRAKSSKKGSNENNPSLKESRTEGMGKSIFSIRGKIFQMGGGTEGVLN
        PLSSPTLPCL+++LNWG MPTFD LKEF+NSN++YKP EDW SIKNTTAEA V AK SK+GSN  NPS K+SRTE MGKSIFS +  +  + GGTEG+LN
Subjt:  PLSSPTLPCLKMDLNWGIMPTFDGLKEFVNSNQVYKPREDWHSIKNTTAEAIVRAKSSKKGSNENNPSLKESRTEGMGKSIFSIRGKIFQMGGGTEGVLN

Query:  ALAAYIRYNEGTSRDDWQLLHEMVRNSESRDGASFIKLFGPAIHLGLISKRRVHYEAIKYEKERNAGFLSPFGYSAGTVAAAVDAVLSSEWYWLMSLKSK
        ALAAYIRYNEGTSRDDWQ+LHEMVRNSESRDGASFIKLFGPAIHLG+ISKRRVHYEAIKYEKERNAGFLSPFGYSAGTVAAAVDAVLSSEWY LM LKSK
Subjt:  ALAAYIRYNEGTSRDDWQLLHEMVRNSESRDGASFIKLFGPAIHLGLISKRRVHYEAIKYEKERNAGFLSPFGYSAGTVAAAVDAVLSSEWYWLMSLKSK

Query:  GRRLGSHSYRNWRWNGFLVQYTVVGCEGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITMLGFGSLEKPNIVYSELMWAEFLRDFIVEVVGRPVHVV
        GR LGS SYR WRWNGFLVQYTV GC+GPA+LLVHGFGAFLEHFRDNI GI E GNQVWAITMLGFGS EKPNIVYSELMWAE L+DFIV+VVGRPVH+V
Subjt:  GRRLGSHSYRNWRWNGFLVQYTVVGCEGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITMLGFGSLEKPNIVYSELMWAEFLRDFIVEVVGRPVHVV

Query:  GNSIGGYIVAIVASLWPSLVKSIVLINSAGSVIPGYSFLPSKKERQVTGAAWLAARLLLFYLRLKTKDIVRNCYPTRTERADEWLINEMLRASKDPGGLV
        GNSIGGYIVAIVA LWP+LVKS VLINSAGSV+PGYS LP  KERQV+ AAWL ARLLL YLRLKTKDIV+NCYPTRTERAD WLI+EMLRASKDPGGLV
Subjt:  GNSIGGYIVAIVASLWPSLVKSIVLINSAGSVIPGYSFLPSKKERQVTGAAWLAARLLLFYLRLKTKDIVRNCYPTRTERADEWLINEMLRASKDPGGLV

Query:  VLESIFSLDLSVPLNYLLEGLEERVLIIQGMKDPIYNSKSMLAMLKEHCAGVTIKELDAGHCPHDELPEEVNSILCEWIVRVESKPKAGS
        VLESIFS DLSVPLN+LLEGLEERVLIIQGMKDPIYNSK++LA L+E C GVTIKELDAGHCPHDELP++VNSILCEWI+R+E+KPK  S
Subjt:  VLESIFSLDLSVPLNYLLEGLEERVLIIQGMKDPIYNSKSMLAMLKEHCAGVTIKELDAGHCPHDELPEEVNSILCEWIVRVESKPKAGS

XP_038875210.1 uncharacterized protein LOC120067727 isoform X1 [Benincasa hispida]0.0e+0083.07Show/hide
Query:  MSIFNFSHSPSLPFTTSNSNPFPFFTCSSSSSLLSLSSATPKTGGLALLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRILSRFSDEMLEIVLLALE
        MSIFNFS S S  F     +P P    SS+    + SSA P TG LA+LWFKHDLRI DHPALHAAAS FPAL+PLYIFDSRILSRFSDEMLEI+LLALE
Subjt:  MSIFNFSHSPSLPFTTSNSNPFPFFTCSSSSSLLSLSSATPKTGGLALLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRILSRFSDEMLEIVLLALE

Query:  GLRHSLRDRGLDLMLKFGDAESVLRELVIEVKATHVFAEEEVEQELCVLMDDVSQTLASLIESPNLTIWRTPFYDLRNLEVLPTSYDEFRKLQLPVHPPL
         LR+SLRDRGLDL++KFGDAESVLRELV++VKATHVFAEEEVE ELCVLMDDVSQTL++LI+SPNLTIWRTPFYD+++LE LP SYDEFRKLQLPV  P 
Subjt:  GLRHSLRDRGLDLMLKFGDAESVLRELVIEVKATHVFAEEEVEQELCVLMDDVSQTLASLIESPNLTIWRTPFYDLRNLEVLPTSYDEFRKLQLPVHPPL

Query:  SSPTLPCLKMDLNWGIMPTFDGLKEFVNSNQVYKPREDWHSIKNTTAEAIVRAKSSKKGSNENNPSLKESRTEGMGKSIFSI-RGKIFQMGGGTEGVLNA
        SSPTLPCL+++LNWG MPTFD LKEFVNS ++Y+P EDW SIKN+TAE I RAK SK+GSNENNPS KE RTEGMGKSIFS  RGK   M GGTE VLNA
Subjt:  SSPTLPCLKMDLNWGIMPTFDGLKEFVNSNQVYKPREDWHSIKNTTAEAIVRAKSSKKGSNENNPSLKESRTEGMGKSIFSI-RGKIFQMGGGTEGVLNA

Query:  LAAYIRYNEGTSRDDWQLLHEMVRNSESRDGASFIKLFGPAIHLGLISKRRVHYEAIKYEKERNAGFLSPFGYSAGTVAAAVDAVLSSEWYWLMSLKSKG
        LAAYIRYNEGTSRDDWQ LHEMVRNSESRDGASFIKLFGP+IHLG+ISKR+VHYEAIKYEKERNAGFLSPFGYSA TVAAA DAVLSSEWYWLMSLK KG
Subjt:  LAAYIRYNEGTSRDDWQLLHEMVRNSESRDGASFIKLFGPAIHLGLISKRRVHYEAIKYEKERNAGFLSPFGYSAGTVAAAVDAVLSSEWYWLMSLKSKG

Query:  RRLGSHSYRNWRWNGFLVQYTVVGCEGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITMLGFGSLEKPNIVYSELMWAEFLRDFIVEVVGRPVHVVG
        RRLGSHSYR WRWNGFLVQYTVVGC+GPAILLVHGFGAFLEHFRDNI G+ + GNQVWA+TMLGFGS EKPNIVYSE MWAEFLRDFIV VVGRPVH+VG
Subjt:  RRLGSHSYRNWRWNGFLVQYTVVGCEGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITMLGFGSLEKPNIVYSELMWAEFLRDFIVEVVGRPVHVVG

Query:  NSIGGYIVAIVASLWPSLVKSIVLINSAGSVIPGYSFLPSKKERQVTGAAWLAARLLLFYLRLKTKDIVRNCYPTRTERADEWLINEMLRASKDPGGLVV
        NSIGGYIVAIVA LWP+LVKSIVLINSAGSVIPGYSFLP +KERQ++ AAWL ARLLL YLRLKTKDI++NCYPTRTERAD+WLINEMLRASKDPGGLV+
Subjt:  NSIGGYIVAIVASLWPSLVKSIVLINSAGSVIPGYSFLPSKKERQVTGAAWLAARLLLFYLRLKTKDIVRNCYPTRTERADEWLINEMLRASKDPGGLVV

Query:  LESIFSLDLSVPLNYLLEGLEERVLIIQGMKDPIYNSKSMLAMLKEHCAGVTIKELDAGHCPHDELPEEVNSILCEWIVRVESKPKAGSIL
        LESIFS DLSVPLNYLLEGLEERVL+IQGMKDPIYNSKS+L M KEHC GV IKELDAGHCPHDELPEEVNSILCEW+VR+E+K K GSIL
Subjt:  LESIFSLDLSVPLNYLLEGLEERVLIIQGMKDPIYNSKSMLAMLKEHCAGVTIKELDAGHCPHDELPEEVNSILCEWIVRVESKPKAGSIL

TrEMBL top hitse value%identityAlignment
A0A0A0KNA5 Photolyase/cryptochrome alpha/beta domain-containing protein0.0e+0080.97Show/hide
Query:  MSIFNFSHSPSLPFTTSNSNPFPFFTCSSSSSLLSLSSATPKTGGLALLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRILSRFSDEMLEIVLLALE
        MSIFNFS S    F    S P P    SS   + + SS  P TG LA+LWFKHDLRI DHPALHAA+SQF +L+PLYIFDSRILSRFSD+MLEI+LLALE
Subjt:  MSIFNFSHSPSLPFTTSNSNPFPFFTCSSSSSLLSLSSATPKTGGLALLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRILSRFSDEMLEIVLLALE

Query:  GLRHSLRDRGLDLMLKFGDAESVLRELVIEVKATHVFAEEEVEQELCVLMDDVSQTLASLIESPNLTIWRTPFYDLRNLEVLPTSYDEFRKLQLPVHPPL
         LRHSLRDRGLDL++KFGDAES+LRELV++VKATHVFAEEEVE ELC+LMDDVSQTL++LI SP+LTIWRTPFYD++++E LP SYDEFRKLQLPV  PL
Subjt:  GLRHSLRDRGLDLMLKFGDAESVLRELVIEVKATHVFAEEEVEQELCVLMDDVSQTLASLIESPNLTIWRTPFYDLRNLEVLPTSYDEFRKLQLPVHPPL

Query:  SSPTLPCLKMDLNWGIMPTFDGLKEFVNSNQVYKPREDWHSIKNTTAEAIVRAKSSKKGSNENNPSLKESRTEGMGKSIFSI-RGKIFQMGGGTEGVLNA
        SSPTLPCL+M+L+WG MPTFD LKEF+NS ++ +P ++W+SIKNTTAE +VRAK SK+G+NENNPS +ESRTE MG SIFS  RGK F M GGTEGVLNA
Subjt:  SSPTLPCLKMDLNWGIMPTFDGLKEFVNSNQVYKPREDWHSIKNTTAEAIVRAKSSKKGSNENNPSLKESRTEGMGKSIFSI-RGKIFQMGGGTEGVLNA

Query:  LAAYIRYNEGTSRDDWQLLHEMVRNSESRDGASFIKLFGPAIHLGLISKRRVHYEAIKYEKERNAGFLSPFGYSAGTVAAAVDAVLSSEWYWLMSLKSKG
        LAAYIRYNEGTSRDDWQ LHEMVRNSESRDGASFIKLFGPAIHLG+ISKR+ HYEAIKYEKERNAGFLSPFGYSA +VAAAVDAVLSSEWYWLM LKSKG
Subjt:  LAAYIRYNEGTSRDDWQLLHEMVRNSESRDGASFIKLFGPAIHLGLISKRRVHYEAIKYEKERNAGFLSPFGYSAGTVAAAVDAVLSSEWYWLMSLKSKG

Query:  RRLGSHSYRNWRWNGFLVQYTVVGCEGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITMLGFGSLEKPNIVYSELMWAEFLRDFIVEVVGRPVHVVG
        RRLGS+SYRNWRWNGFLVQYTVVGC+GPA LLVHGFGAFLEH+RDNI GI E G QVWA+TMLGFG  EKPNIVYSE MWAEF+RDFIVEVVGRPVH+VG
Subjt:  RRLGSHSYRNWRWNGFLVQYTVVGCEGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITMLGFGSLEKPNIVYSELMWAEFLRDFIVEVVGRPVHVVG

Query:  NSIGGYIVAIVASLWPSLVKSIVLINSAGSVIPGYSFLPSKKERQVTGAAWLAARLLLFYLRLKTKDIVRNCYPTRTERADEWLINEMLRASKDPGGLVV
        NSIGGYIVAIVA LWP+LVKSIVLINSAGSVIPGY +LP KK+RQV+ AAWL ARLLL YLR+KTKDI++NCYPTRT+RAD+WLINEMLRASKDPGGLV+
Subjt:  NSIGGYIVAIVASLWPSLVKSIVLINSAGSVIPGYSFLPSKKERQVTGAAWLAARLLLFYLRLKTKDIVRNCYPTRTERADEWLINEMLRASKDPGGLVV

Query:  LESIFSLDLSVPLNYLLEGLEERVLIIQGMKDPIYNSKSMLAMLKEHCAGVTIKELDAGHCPHDELPEEVNSILCEWIVRVES
        LESIFS DL+VPLNYLLEGLE RVLIIQGMKDPIYNSKS+L MLK+HC  VTIKELDAGHCPHDELPEEVNSILCEWIV +ES
Subjt:  LESIFSLDLSVPLNYLLEGLEERVLIIQGMKDPIYNSKSMLAMLKEHCAGVTIKELDAGHCPHDELPEEVNSILCEWIVRVES

A0A6J1E0X4 uncharacterized protein LOC111026138 isoform X25.3e-30678.03Show/hide
Query:  MSIFNFSHSPSLPFTTSNSNPFPFFT----CSSS------SSLLSLSSATPKTGGLALLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRILSRFSDE
        M IFNF HS S+ FTT   NP PF T    CSSS      +     S+A  + GG+A+LWFKHDLRIDDHPALH AAS FPALLPLYIFDSRILSRFS+E
Subjt:  MSIFNFSHSPSLPFTTSNSNPFPFFT----CSSS------SSLLSLSSATPKTGGLALLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRILSRFSDE

Query:  MLEIVLLALEGLRHSLRDRGLDLMLKFGDAESVLRELVIEVKATHVFAEEEVEQELCVLMDDVSQTLASLIESPNLTIWRTPFYDLRNLEVLPTSYDEFR
        MLE +LLALE LR SLRDRGLDL+++FGDAESVLRELVIE                          LA+L ESPNLTI RTPFYD+++LE LP SYDEFR
Subjt:  MLEIVLLALEGLRHSLRDRGLDLMLKFGDAESVLRELVIEVKATHVFAEEEVEQELCVLMDDVSQTLASLIESPNLTIWRTPFYDLRNLEVLPTSYDEFR

Query:  KLQLPVHPPLSSPTLPCLKMDLNWGIMPTFDGLKEFVNSNQVYKPREDWHSIKNTTAEAIVRAKSSKKGSNENNPSLKESRTEGMGKSIFSI-RGKIFQM
        KLQLPV PPLSSPTLPC K++LNWG MP FD LKEF+NSNQ+YK REDW+ +KNT AEAI+RAK  K+GSN N+ S + +R+E M KSIFSI RG  FQ 
Subjt:  KLQLPVHPPLSSPTLPCLKMDLNWGIMPTFDGLKEFVNSNQVYKPREDWHSIKNTTAEAIVRAKSSKKGSNENNPSLKESRTEGMGKSIFSI-RGKIFQM

Query:  GGGTEGVLNALAAYIRYNEGTSRDDWQLLHEMVRNSESRDGASFIKLFGPAIHLGLISKRRVHYEAIKYEKERNAGFLSPFGYSAGTVAAAVDAVLSSEW
         GG EGVLNAL+AYIRYNEGT+RDDWQ+LHEMVRNSESRDG+SFIKLFGPAI LG+IS+RRVHYEAIKYEKERNAGFLSPFGYSAGTVAA VDAVLSSEW
Subjt:  GGGTEGVLNALAAYIRYNEGTSRDDWQLLHEMVRNSESRDGASFIKLFGPAIHLGLISKRRVHYEAIKYEKERNAGFLSPFGYSAGTVAAAVDAVLSSEW

Query:  YWLMSLKSKGRRLGSHSYRNWRWNGFLVQYTVVGCEGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITMLGFGSLEKPNIVYSELMWAEFLRDFIVE
        YWLM LKSKGR  GSHS RNWRWNGFLVQYTVVGCEGPAILLVHGFGAFLEHFR N+ GI E GNQVWAIT+LGFG  EKPNIVY+EL+WAEFLRDFI+E
Subjt:  YWLMSLKSKGRRLGSHSYRNWRWNGFLVQYTVVGCEGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITMLGFGSLEKPNIVYSELMWAEFLRDFIVE

Query:  VVGRPVHVVGNSIGGYIVAIVASLWPSLVKSIVLINSAGSVIPGYSFLPSKKERQVTGAAWLAARLLLFYLRLKTKDIVRNCYPTRTERADEWLINEMLR
        VVGRPVH+VGNSIGGYIVAIV+ LWPSLVKSIVLINSAGSVIPGYSFLP  KERQV+ AA L ARLLLFYLRLKTKDIVRNCYPTRTER+D WLI+EMLR
Subjt:  VVGRPVHVVGNSIGGYIVAIVASLWPSLVKSIVLINSAGSVIPGYSFLPSKKERQVTGAAWLAARLLLFYLRLKTKDIVRNCYPTRTERADEWLINEMLR

Query:  ASKDPGGLVVLESIFSLDLSVPLNYLLEGLEERVLIIQGMKDPIYNSKSMLAMLKEHCAGVTIKELDAGHCPHDELPEEVNSILCEWIVRVESKPKAGSI
        ASKDPGGLVVLESIFS DLSVPLNYLLEG EERVLIIQGMKDPIYNSKSML MLKEHC GV IKELDAGHCPHDELPEEVNSILCEW+VRVESK KAG+I
Subjt:  ASKDPGGLVVLESIFSLDLSVPLNYLLEGLEERVLIIQGMKDPIYNSKSMLAMLKEHCAGVTIKELDAGHCPHDELPEEVNSILCEWIVRVESKPKAGSI

Query:  L
        L
Subjt:  L

A0A6J1E553 uncharacterized protein LOC111026138 isoform X10.0e+0081.17Show/hide
Query:  MSIFNFSHSPSLPFTTSNSNPFPFFT----CSSS------SSLLSLSSATPKTGGLALLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRILSRFSDE
        M IFNF HS S+ FTT   NP PF T    CSSS      +     S+A  + GG+A+LWFKHDLRIDDHPALH AAS FPALLPLYIFDSRILSRFS+E
Subjt:  MSIFNFSHSPSLPFTTSNSNPFPFFT----CSSS------SSLLSLSSATPKTGGLALLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRILSRFSDE

Query:  MLEIVLLALEGLRHSLRDRGLDLMLKFGDAESVLRELVIEVKATHVFAEEEVEQELCVLMDDVSQTLASLIESPNLTIWRTPFYDLRNLEVLPTSYDEFR
        MLE +LLALE LR SLRDRGLDL+++FGDAESVLRELVIEVKATH+FAEEEVE ELCVLMD+VSQ LA+L ESPNLTI RTPFYD+++LE LP SYDEFR
Subjt:  MLEIVLLALEGLRHSLRDRGLDLMLKFGDAESVLRELVIEVKATHVFAEEEVEQELCVLMDDVSQTLASLIESPNLTIWRTPFYDLRNLEVLPTSYDEFR

Query:  KLQLPVHPPLSSPTLPCLKMDLNWGIMPTFDGLKEFVNSNQVYKPREDWHSIKNTTAEAIVRAKSSKKGSNENNPSLKESRTEGMGKSIFSI-RGKIFQM
        KLQLPV PPLSSPTLPC K++LNWG MP FD LKEF+NSNQ+YK REDW+ +KNT AEAI+RAK  K+GSN N+ S + +R+E M KSIFSI RG  FQ 
Subjt:  KLQLPVHPPLSSPTLPCLKMDLNWGIMPTFDGLKEFVNSNQVYKPREDWHSIKNTTAEAIVRAKSSKKGSNENNPSLKESRTEGMGKSIFSI-RGKIFQM

Query:  GGGTEGVLNALAAYIRYNEGTSRDDWQLLHEMVRNSESRDGASFIKLFGPAIHLGLISKRRVHYEAIKYEKERNAGFLSPFGYSAGTVAAAVDAVLSSEW
         GG EGVLNAL+AYIRYNEGT+RDDWQ+LHEMVRNSESRDG+SFIKLFGPAI LG+IS+RRVHYEAIKYEKERNAGFLSPFGYSAGTVAA VDAVLSSEW
Subjt:  GGGTEGVLNALAAYIRYNEGTSRDDWQLLHEMVRNSESRDGASFIKLFGPAIHLGLISKRRVHYEAIKYEKERNAGFLSPFGYSAGTVAAAVDAVLSSEW

Query:  YWLMSLKSKGRRLGSHSYRNWRWNGFLVQYTVVGCEGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITMLGFGSLEKPNIVYSELMWAEFLRDFIVE
        YWLM LKSKGR  GSHS RNWRWNGFLVQYTVVGCEGPAILLVHGFGAFLEHFR N+ GI E GNQVWAIT+LGFG  EKPNIVY+EL+WAEFLRDFI+E
Subjt:  YWLMSLKSKGRRLGSHSYRNWRWNGFLVQYTVVGCEGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITMLGFGSLEKPNIVYSELMWAEFLRDFIVE

Query:  VVGRPVHVVGNSIGGYIVAIVASLWPSLVKSIVLINSAGSVIPGYSFLPSKKERQVTGAAWLAARLLLFYLRLKTKDIVRNCYPTRTERADEWLINEMLR
        VVGRPVH+VGNSIGGYIVAIV+ LWPSLVKSIVLINSAGSVIPGYSFLP  KERQV+ AA L ARLLLFYLRLKTKDIVRNCYPTRTER+D WLI+EMLR
Subjt:  VVGRPVHVVGNSIGGYIVAIVASLWPSLVKSIVLINSAGSVIPGYSFLPSKKERQVTGAAWLAARLLLFYLRLKTKDIVRNCYPTRTERADEWLINEMLR

Query:  ASKDPGGLVVLESIFSLDLSVPLNYLLEGLEERVLIIQGMKDPIYNSKSMLAMLKEHCAGVTIKELDAGHCPHDELPEEVNSILCEWIVRVESKPKAGSI
        ASKDPGGLVVLESIFS DLSVPLNYLLEG EERVLIIQGMKDPIYNSKSML MLKEHC GV IKELDAGHCPHDELPEEVNSILCEW+VRVESK KAG+I
Subjt:  ASKDPGGLVVLESIFSLDLSVPLNYLLEGLEERVLIIQGMKDPIYNSKSMLAMLKEHCAGVTIKELDAGHCPHDELPEEVNSILCEWIVRVESKPKAGSI

Query:  L
        L
Subjt:  L

A0A6J1EKW5 uncharacterized protein LOC111435603 isoform X10.0e+0082.75Show/hide
Query:  MSIFNFSHSPSLPFTTSNSNPFPFFTCSSSSSLLSLSSATPKT--GGLALLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRILSRFSDEMLEIVLLA
        MSIF FS SPSL F T  + P      SS+    + SSA   T  GGLALLWFKHDLRIDDHPALHAAASQFPAL+PLYIFDSR+LSRFSDEMLEI LLA
Subjt:  MSIFNFSHSPSLPFTTSNSNPFPFFTCSSSSSLLSLSSATPKT--GGLALLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRILSRFSDEMLEIVLLA

Query:  LEGLRHSLRDRGLDLMLKFGDAESVLRELVIEVKATHVFAEEEVEQELCVLMDDVSQTLASLIESPNLTIWRTPFYDLRNLEVLPTSYDEFRKLQLPVHP
        LE L+ SLRDRGLDL++KFGDAESVLRELV+EVKATHVFAEEEVE ELCVLMD VSQTL+SLI+SPNLT+WRTPFYD+++LE LP SYDEFRKLQLPV  
Subjt:  LEGLRHSLRDRGLDLMLKFGDAESVLRELVIEVKATHVFAEEEVEQELCVLMDDVSQTLASLIESPNLTIWRTPFYDLRNLEVLPTSYDEFRKLQLPVHP

Query:  PLSSPTLPCLKMDLNWGIMPTFDGLKEFVNSNQVYKPREDWHSIKNTTAEAIVRAKSSKKGSNENNPSLKESRTEGMGKSIFSIRGKIFQMGGGTEGVLN
        PLSSPTLPCL+++LNWG MPTFD LKEF+NSN++YKP EDW SIKNTTAEA VRAK SK+GSNE NPS KESRTE MGKSIFS +  +  + GGTEG LN
Subjt:  PLSSPTLPCLKMDLNWGIMPTFDGLKEFVNSNQVYKPREDWHSIKNTTAEAIVRAKSSKKGSNENNPSLKESRTEGMGKSIFSIRGKIFQMGGGTEGVLN

Query:  ALAAYIRYNEGTSRDDWQLLHEMVRNSESRDGASFIKLFGPAIHLGLISKRRVHYEAIKYEKERNAGFLSPFGYSAGTVAAAVDAVLSSEWYWLMSLKSK
        ALAAYIRYNEGTSRDDWQ LHEMVRNSESRDGASFIKLFGPAIHLG+ISKRRVHYEAIKYEKERNAGFLSPFGYSAGTVAAAVDAVLSSEWY LM LKSK
Subjt:  ALAAYIRYNEGTSRDDWQLLHEMVRNSESRDGASFIKLFGPAIHLGLISKRRVHYEAIKYEKERNAGFLSPFGYSAGTVAAAVDAVLSSEWYWLMSLKSK

Query:  GRRLGSHSYRNWRWNGFLVQYTVVGCEGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITMLGFGSLEKPNIVYSELMWAEFLRDFIVEVVGRPVHVV
        GR LGS SYR WRWNGFLVQYTV GC+GPA+LLVHGFGAFLEHFRDNI GI E GNQVWAITMLGFGS EKPNIVYSELMWAE L+DFIV+VVGRPVH+V
Subjt:  GRRLGSHSYRNWRWNGFLVQYTVVGCEGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITMLGFGSLEKPNIVYSELMWAEFLRDFIVEVVGRPVHVV

Query:  GNSIGGYIVAIVASLWPSLVKSIVLINSAGSVIPGYSFLPSKKERQVTGAAWLAARLLLFYLRLKTKDIVRNCYPTRTERADEWLINEMLRASKDPGGLV
        GNSIGGYIVAIVA LWP+LVKSIVLINSAGSVIPGYS LP  KE+QV+ AAWL ARLLL YLR+KTKDIVRNCYPTRTERAD WL++EMLRASKDPGGLV
Subjt:  GNSIGGYIVAIVASLWPSLVKSIVLINSAGSVIPGYSFLPSKKERQVTGAAWLAARLLLFYLRLKTKDIVRNCYPTRTERADEWLINEMLRASKDPGGLV

Query:  VLESIFSLDLSVPLNYLLEGLEERVLIIQGMKDPIYNSKSMLAMLKEHCAGVTIKELDAGHCPHDELPEEVNSILCEWIVRVESKPKAGS
        VLESIFS DLSVPLN+LLEGLEERVLIIQGMKDPIYNSK++LA LKE C GVTIKELDAGHCPHDELP++VNSILCEWI+R+E+KPK  S
Subjt:  VLESIFSLDLSVPLNYLLEGLEERVLIIQGMKDPIYNSKSMLAMLKEHCAGVTIKELDAGHCPHDELPEEVNSILCEWIVRVESKPKAGS

A0A6J1I4X4 uncharacterized protein LOC111469174 isoform X10.0e+0082.46Show/hide
Query:  MSIFNFSHSPSLPFTTSNSNPFPFFTCSSSSSLLSLSSATPKT--GGLALLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRILSRFSDEMLEIVLLA
        MSIF FS SPSL F T  + P      SS+    +LSSA   T  GGLALLWFKHDLRIDDHPALHAAASQFPAL+PLYIFDSR+LSRFSDEMLEI LLA
Subjt:  MSIFNFSHSPSLPFTTSNSNPFPFFTCSSSSSLLSLSSATPKT--GGLALLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRILSRFSDEMLEIVLLA

Query:  LEGLRHSLRDRGLDLMLKFGDAESVLRELVIEVKATHVFAEEEVEQELCVLMDDVSQTLASLIESPNLTIWRTPFYDLRNLEVLPTSYDEFRKLQLPVHP
        LE L+HSLRDRGLDL++KFGDAESVLRELV+EVKATHVFAEEEVE ELCVLMD VSQTL+SLI+SPNLT+WRTPFYD+++LE LP SYDEFRKLQLPV  
Subjt:  LEGLRHSLRDRGLDLMLKFGDAESVLRELVIEVKATHVFAEEEVEQELCVLMDDVSQTLASLIESPNLTIWRTPFYDLRNLEVLPTSYDEFRKLQLPVHP

Query:  PLSSPTLPCLKMDLNWGIMPTFDGLKEFVNSNQVYKPREDWHSIKNTTAEAIVRAKSSKKGSNENNPSLKESRTEGMGKSIFSIRGKIFQMGGGTEGVLN
        PLSSPTLP L+++LNWG MPTFD LKEF+NSN++YKP EDW SIKNTTAEA VRAK SK+GSNE NPS KESRTE MGKSIFS +  +  + GGTEG+LN
Subjt:  PLSSPTLPCLKMDLNWGIMPTFDGLKEFVNSNQVYKPREDWHSIKNTTAEAIVRAKSSKKGSNENNPSLKESRTEGMGKSIFSIRGKIFQMGGGTEGVLN

Query:  ALAAYIRYNEGTSRDDWQLLHEMVRNSESRDGASFIKLFGPAIHLGLISKRRVHYEAIKYEKERNAGFLSPFGYSAGTVAAAVDAVLSSEWYWLMSLKSK
        ALAAYIRYNEGTSRDDWQ LHEMVRNSESRDG SFIKLFGPAIHLG+ISKRRVHYEAIKYEKERNAGFLSPFGYSAGTVAAAVDAVLSSEWY LM LKSK
Subjt:  ALAAYIRYNEGTSRDDWQLLHEMVRNSESRDGASFIKLFGPAIHLGLISKRRVHYEAIKYEKERNAGFLSPFGYSAGTVAAAVDAVLSSEWYWLMSLKSK

Query:  GRRLGSHSYRNWRWNGFLVQYTVVGCEGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITMLGFGSLEKPNIVYSELMWAEFLRDFIVEVVGRPVHVV
        GR LGS SYR WRWNGFLVQYTV GC+GPA+LLVHGFGAFLEHFRDNI GI E GNQVWAITMLGFGS EKPNIVYSELMWAE L+DFIV+VVG+PVH+V
Subjt:  GRRLGSHSYRNWRWNGFLVQYTVVGCEGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITMLGFGSLEKPNIVYSELMWAEFLRDFIVEVVGRPVHVV

Query:  GNSIGGYIVAIVASLWPSLVKSIVLINSAGSVIPGYSFLPSKKERQVTGAAWLAARLLLFYLRLKTKDIVRNCYPTRTERADEWLINEMLRASKDPGGLV
        GNSIGGYIVAIVA LWP+LVKS VLINSAGSV+PGYS LP  KERQV+ AAWL ARLLL YLRLKTKDIV+NCYPTRTERAD WLI+EMLRASKDPGGLV
Subjt:  GNSIGGYIVAIVASLWPSLVKSIVLINSAGSVIPGYSFLPSKKERQVTGAAWLAARLLLFYLRLKTKDIVRNCYPTRTERADEWLINEMLRASKDPGGLV

Query:  VLESIFSLDLSVPLNYLLEGLEERVLIIQGMKDPIYNSKSMLAMLKEHCAGVTIKELDAGHCPHDELPEEVNSILCEWIVRVESKPKAGS
        VLESIFS DLSVPLN LLEGLEERVLIIQGMKDPIYNSK++LA LKE C GVTIKELDAGHCPHDELP++VNSILCEWI+R+E+K K  S
Subjt:  VLESIFSLDLSVPLNYLLEGLEERVLIIQGMKDPIYNSKSMLAMLKEHCAGVTIKELDAGHCPHDELPEEVNSILCEWIVRVESKPKAGS

SwissProt top hitse value%identityAlignment
Q38JU2 Cryptochrome DASH, chloroplastic/mitochondrial1.2e-0927.45Show/hide
Query:  KTGGLALLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRI--------LSRFSDEMLEIVLLALEGLRHSLRDRGLDLMLKFGDAESVLRELVIEVKA
        K  G+A++WF++DLR+ D+ AL  A     A+LP+Y  D R+        + +      + ++  L  L+ +L  RGLDL+++ G  E ++  L    KA
Subjt:  KTGGLALLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRI--------LSRFSDEMLEIVLLALEGLRHSLRDRGLDLMLKFGDAESVLRELVIEVKA

Query:  THVFAEEEVEQELCVLMDDVSQTLASLIE-----------SPNLT----IWRTPFYDLRNL----EVLPTSYDEFRKLQLPVHPPLSSPTLPCLKMDLNW
          V+A +E   E   +   V++ L  L+            S N T    +W +  Y + +L    E LP  Y +FRK          S    C K+  ++
Subjt:  THVFAEEEVEQELCVLMDDVSQTLASLIE-----------SPNLT----IWRTPFYDLRNL----EVLPTSYDEFRKLQLPVHPPLSSPTLPCLKMDLNW

Query:  GIMP
        G  P
Subjt:  GIMP

Q651U1 Cryptochrome DASH, chloroplastic/mitochondrial4.2e-1027.65Show/hide
Query:  GGLALLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRI---------LSRFSDEMLEIVLLALEGLRHSLRDRGLDLMLKFGDAESVLRELVIEVKAT
        GG+A++WF++DLR+ D+ A+  A +   A+LP+Y  D RI           +      + ++  LE L+ +L  +GLDL+++ G  E +L  +   V A 
Subjt:  GGLALLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRI---------LSRFSDEMLEIVLLALEGLRHSLRDRGLDLMLKFGDAESVLRELVIEVKAT

Query:  HVFAEEEVEQELCVLMDDVSQTLASLI------------ESPNL-TIWRTPFYDLRNLEV----LPTSYDEFRKL-----------QLPVHPPLSSPTLP
         V+A +E   E  ++   V + L  ++             +P L  IW    Y + +L      LP  Y +FRK            +LP  P L  P   
Subjt:  HVFAEEEVEQELCVLMDDVSQTLASLI------------ESPNL-TIWRTPFYDLRNLEV----LPTSYDEFRKL-----------QLPVHPPLSSPTLP

Query:  CLKMDLNWGIMPTFDGL
         L     WG +PT + L
Subjt:  CLKMDLNWGIMPTFDGL

Q7NMD1 Cryptochrome DASH9.5e-1031.18Show/hide
Query:  LLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRILSR--FSDE-----MLEIVLLALEGLRHSLRDRGLDLMLKFGDAESVLRELVIEVKATHVFAEE
        L+W+++DLR+ DH  L +A  +   ++ LY FD R   +  F  E         +L ++  LR SLR  G DL+++ G  E V+  LV E++   V    
Subjt:  LLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRILSR--FSDE-----MLEIVLLALEGLRHSLRDRGLDLMLKFGDAESVLRELVIEVKATHVFAEE

Query:  EVEQELCVLMDDVSQTLASLIESPNLTIWRTPFYDLRNL----EVLPTSYDEFRK---LQLPVHPPLSSP
        EV  E  V+  D+   LA L   P  + W T      +L    E +P  + +FRK       ++PPL +P
Subjt:  EVEQELCVLMDDVSQTLASLIESPNLTIWRTPFYDLRNL----EVLPTSYDEFRK---LQLPVHPPLSSP

Q84KJ5 Cryptochrome DASH, chloroplastic/mitochondrial3.8e-1130.19Show/hide
Query:  SLLSLSSATPKTGGLALLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRI--------LSRFSDEMLEIVLLALEGLRHSLRDRGLDLMLKFGDAESV
        +L S SS   K  G+ +LWF++DLR+ D+ AL+ A S    +LP+Y  D R+          +        ++  L  LR +L  RGL+L+++ G  E +
Subjt:  SLLSLSSATPKTGGLALLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRI--------LSRFSDEMLEIVLLALEGLRHSLRDRGLDLMLKFGDAESV

Query:  LRELVIEVKATHVFAEEEVEQELCVLMDDVSQTLASLIESPNL-TIWRTPFYDLRNLEV----LPTSYDEFRK-------LQLPVHPPLSSPTLPCLKMD
        L  L  +  A  VFA +E   E   +   V+Q L  +  S  L  IW +  Y   +L      LP  Y +FRK       ++     PLS    P +   
Subjt:  LRELVIEVKATHVFAEEEVEQELCVLMDDVSQTLASLIESPNL-TIWRTPFYDLRNLEV----LPTSYDEFRK-------LQLPVHPPLSSPTLPCLKMD

Query:  LNWGIMPTFDGL
         +WG +PT + L
Subjt:  LNWGIMPTFDGL

Q9FFZ1 Pheophytinase, chloroplastic1.4e-2125.83Show/hide
Query:  SHSYRNWRWNGFLVQYTVVGC---EGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITML----------------------------GFGSLEKP--
        SHS+  W+     V Y   GC   + PA+L + GFG    H+   +  +G D  +VWAI  L                            GFG   +P  
Subjt:  SHSYRNWRWNGFLVQYTVVGC---EGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITML----------------------------GFGSLEKP--

Query:  -NIVYSELMWAEFLRDFIVEVVGRPVHVVGNSIGGYIVAIVASLWPSLVKSIVLINSAGSVIPGYSFLPS--KKERQVTGAAWLAARLLLFYLRLKTK--
          +V+S  +W + ++ F+ EV+G PV++ GNS+GGY+    A+  P LVK + L+N+     P + F P+  +  +      W  A  L   ++  T+  
Subjt:  -NIVYSELMWAEFLRDFIVEVVGRPVHVVGNSIGGYIVAIVASLWPSLVKSIVLINSAGSVIPGYSFLPS--KKERQVTGAAWLAARLLLFYLRLKTK--

Query:  -----------DIVRNCYPTRTERADEWLINEMLRASKDPGGLVVLESIF-----SLDLSVPLNYLLEGLEERVLIIQGMKDPIYNSKSMLAMLKEHCAG
                   +I++  Y   +   D+ + + ++  ++ P       SI       L  S  L+   E    ++ ++ G +DP         + KE    
Subjt:  -----------DIVRNCYPTRTERADEWLINEMLRASKDPGGLVVLESIF-----SLDLSVPLNYLLEGLEERVLIIQGMKDPIYNSKSMLAMLKEHCAG

Query:  VTIKELDAGHCPHDELPEEVNSILCEWIVRVES
           +   AGHCPHDE+PE VN ++  WI  +ES
Subjt:  VTIKELDAGHCPHDELPEEVNSILCEWIVRVES

Arabidopsis top hitse value%identityAlignment
AT4G25290.1 DNA photolyases;DNA photolyases5.0e-20854.17Show/hide
Query:  LSSATPKTGGLALLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRILSRFSDEMLEIVLLALEGLRHSLRDRGLDLMLKFGDAESVLRELVIEVKATH
        LSSAT   G  A++WFKHDLR+DDHP L  AAS+  A++PLY+ D RILSR++ + LE+ ++ALE LR +L+ +G +LML++G+AE+V+ +LV EV+A  
Subjt:  LSSATPKTGGLALLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRILSRFSDEMLEIVLLALEGLRHSLRDRGLDLMLKFGDAESVLRELVIEVKATH

Query:  VFAEEEVEQELCVLMDDVSQTLASLI---ESPNLTIWRTPFYDLRNLEVLPTSYDEFRKLQLPVHPPLSSPTLPCLKMDLNWGIMPTFDGLKEFVNSNQV
        VF EEEVE  LC ++D V   L  +    ESP +  WRTPFY+ +NL  LP S++EF+KL+LP+  P+ +        +L WG +PT D LK+++  + +
Subjt:  VFAEEEVEQELCVLMDDVSQTLASLI---ESPNLTIWRTPFYDLRNLEVLPTSYDEFRKLQLPVHPPLSSPTLPCLKMDLNWGIMPTFDGLKEFVNSNQV

Query:  YKPREDWHSIKNTTAEAIVRAKSSKKGSNENNPSLKESRTEGMGKSIFSIRGKIFQMGGGTEGVLNALAAYIRYNEGTSRDDWQLLHEMVRNSESRDGAS
        ++    W  +   +AE ++  +      +   P +  S  + +  S+F +  K   +GGG E VLNALA Y+RY EGTSRDDWQ +H  +R++E+R GAS
Subjt:  YKPREDWHSIKNTTAEAIVRAKSSKKGSNENNPSLKESRTEGMGKSIFSIRGKIFQMGGGTEGVLNALAAYIRYNEGTSRDDWQLLHEMVRNSESRDGAS

Query:  FIKLFGPAIHLGLISKRRVHYEAIKYEKERNAGFLSPFGYSAGTVAAAVDAVLSSEWYWLMSLKSKGRRLGSHSYRNWRWNGFLVQYTVVGCEGPAILLV
        F KLFGP + LG++S+R VHYEAI+YEKERNAGF+SPFGYSA TV+AA DAV S EWY+L++L  +      H+ R WRW G+L+QYTVVG EGPA+LLV
Subjt:  FIKLFGPAIHLGLISKRRVHYEAIKYEKERNAGFLSPFGYSAGTVAAAVDAVLSSEWYWLMSLKSKGRRLGSHSYRNWRWNGFLVQYTVVGCEGPAILLV

Query:  HGFGAFLEHFRDNIQGIGEDGNQVWAITMLGFGSLEKPNIVYSELMWAEFLRDFIVEVVGRPVHVVGNSIGGYIVAIVASLWPSLVKSIVLINSAGSVIP
        HGFGAFLEH+RDN+  I    N+VW IT+LGFG  EKPNI+Y+EL+WAE LRDF+ EVVG P H VGNSIGGY VA++A LWP+LVKS+VL+NSAG+V+P
Subjt:  HGFGAFLEHFRDNIQGIGEDGNQVWAITMLGFGSLEKPNIVYSELMWAEFLRDFIVEVVGRPVHVVGNSIGGYIVAIVASLWPSLVKSIVLINSAGSVIP

Query:  GYSFLPSKKERQVTGAAWLAARLLLFYLRLKTKDIVRNCYPTRTERADEWLINEMLRASKDPGGLVVLESIFSLDLSVPLNYLLEGLEERVLIIQGMKDP
        GYS LP  +ER+V   A   +RLLLF+L+L  K ++++CYP + ERAD++L+ EMLRAS+DPG ++VLESIF  DLS+PLNYLL+G EE+ L+IQGM+DP
Subjt:  GYSFLPSKKERQVTGAAWLAARLLLFYLRLKTKDIVRNCYPTRTERADEWLINEMLRASKDPGGLVVLESIFSLDLSVPLNYLLEGLEERVLIIQGMKDP

Query:  IYNSKSMLAMLKEHCAGVTIKELDAGHCPHDELPEEVNSILCEWIVRV
        I + +  +A+LKE C  + IK++ AGHCPHDE+ EEVN I+CEWIV+V
Subjt:  IYNSKSMLAMLKEHCAGVTIKELDAGHCPHDELPEEVNSILCEWIVRV

AT4G36530.1 alpha/beta-Hydrolases superfamily protein4.3e-3430.49Show/hide
Query:  YRNWRWNGFLVQYTVVGCEGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITMLGFGSLEKPNIVYSELMWAEFLRDFIVEVVGRPVHVVGNSIGGYI
        Y  W W G  + Y V G EG  ++L+HGFGA + H+R NI  + +   +V+A+ +LGFG  +K  I Y  ++W + + DF+ EVV  P  VVGNS+GG+ 
Subjt:  YRNWRWNGFLVQYTVVGCEGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITMLGFGSLEKPNIVYSELMWAEFLRDFIVEVVGRPVHVVGNSIGGYI

Query:  VAIVASLWPSLVKSIVLINSAG--------------SVIPGYSFLPSKK--ERQVTGAAWLAARLLLFYLRLKTKDIVRNCYPTRTERADEWLINEMLRA
           VA   P  V  + L+NSAG              +VI  +   P K+  +R V G  +  A+        + + ++++ Y   T   D++L+  + + 
Subjt:  VAIVASLWPSLVKSIVLINSAG--------------SVIPGYSFLPSKK--ERQVTGAAWLAARLLLFYLRLKTKDIVRNCYPTRTERADEWLINEMLRA

Query:  SKDPGGLVVLESIFSLDLSVPLNYLLEGLEER----VLIIQGMKDPIYNSKSMLAMLKEHCAGVTIKELDAGHCPHDELPEEVNSILCEWI-VRVESKPK
        + DP    V   + +  L+    Y L+ +  +    +L++ G  DP +   +    +K   +  ++  L AGHCPHDE+PE VN  L +W+ + + SKP 
Subjt:  SKDPGGLVVLESIFSLDLSVPLNYLLEGLEER----VLIIQGMKDPIYNSKSMLAMLKEHCAGVTIKELDAGHCPHDELPEEVNSILCEWI-VRVESKPK

Query:  AGSIL
        +  +L
Subjt:  AGSIL

AT4G36530.2 alpha/beta-Hydrolases superfamily protein1.9e-3429.24Show/hide
Query:  FGYSAGTVAAAVDAVLSSEWYWLMSLKSKGRRLGSHSYRNWRWNGFLVQYTVVGCEGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITMLGFGSLEK
        F    G VA+ V  + +S         S+        Y  W W G  + Y V G EG  ++L+HGFGA + H+R NI  + +   +V+A+ +LGFG  +K
Subjt:  FGYSAGTVAAAVDAVLSSEWYWLMSLKSKGRRLGSHSYRNWRWNGFLVQYTVVGCEGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITMLGFGSLEK

Query:  PNIVYSELMWAEFLRDFIVEVVGRPVHVVGNSIGGYIVAIVASLWPSLVKSIVLINSAG--------------SVIPGYSFLPSKK--ERQVTGAAWLAA
          I Y  ++W + + DF+ EVV  P  VVGNS+GG+    VA   P  V  + L+NSAG              +VI  +   P K+  +R V G  +  A
Subjt:  PNIVYSELMWAEFLRDFIVEVVGRPVHVVGNSIGGYIVAIVASLWPSLVKSIVLINSAG--------------SVIPGYSFLPSKK--ERQVTGAAWLAA

Query:  RLLLFYLRLKTKDIVRNCYPTRTERADEWLINEMLRASKDPGGLVVLESIFSLDLSVPLNYLLEGLEER----VLIIQGMKDPIYNSKSMLAMLKEHCAG
        +        + + ++++ Y   T   D++L+  + + + DP    V   + +  L+    Y L+ +  +    +L++ G  DP +   +    +K   + 
Subjt:  RLLLFYLRLKTKDIVRNCYPTRTERADEWLINEMLRASKDPGGLVVLESIFSLDLSVPLNYLLEGLEER----VLIIQGMKDPIYNSKSMLAMLKEHCAG

Query:  VTIKELDAGHCPHDELPEEVNSILCEWI-VRVESKPKAGSIL
         ++  L AGHCPHDE+PE VN  L +W+ + + SKP +  +L
Subjt:  VTIKELDAGHCPHDELPEEVNSILCEWI-VRVESKPKAGSIL

AT5G13800.2 pheophytinase1.0e-2225.83Show/hide
Query:  SHSYRNWRWNGFLVQYTVVGC---EGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITML----------------------------GFGSLEKP--
        SHS+  W+     V Y   GC   + PA+L + GFG    H+   +  +G D  +VWAI  L                            GFG   +P  
Subjt:  SHSYRNWRWNGFLVQYTVVGC---EGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITML----------------------------GFGSLEKP--

Query:  -NIVYSELMWAEFLRDFIVEVVGRPVHVVGNSIGGYIVAIVASLWPSLVKSIVLINSAGSVIPGYSFLPS--KKERQVTGAAWLAARLLLFYLRLKTK--
          +V+S  +W + ++ F+ EV+G PV++ GNS+GGY+    A+  P LVK + L+N+     P + F P+  +  +      W  A  L   ++  T+  
Subjt:  -NIVYSELMWAEFLRDFIVEVVGRPVHVVGNSIGGYIVAIVASLWPSLVKSIVLINSAGSVIPGYSFLPS--KKERQVTGAAWLAARLLLFYLRLKTK--

Query:  -----------DIVRNCYPTRTERADEWLINEMLRASKDPGGLVVLESIF-----SLDLSVPLNYLLEGLEERVLIIQGMKDPIYNSKSMLAMLKEHCAG
                   +I++  Y   +   D+ + + ++  ++ P       SI       L  S  L+   E    ++ ++ G +DP         + KE    
Subjt:  -----------DIVRNCYPTRTERADEWLINEMLRASKDPGGLVVLESIF-----SLDLSVPLNYLLEGLEERVLIIQGMKDPIYNSKSMLAMLKEHCAG

Query:  VTIKELDAGHCPHDELPEEVNSILCEWIVRVES
           +   AGHCPHDE+PE VN ++  WI  +ES
Subjt:  VTIKELDAGHCPHDELPEEVNSILCEWIVRVES

AT5G19850.1 alpha/beta-Hydrolases superfamily protein5.7e-2630.58Show/hide
Query:  WRWNGFLVQYTVVGCEGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITMLGFGSLEKPN-------IVYSELMWAEFLRDFIVEVVGRPVHVVGNSI
        W+W G+ ++Y   G  GPA++LVHGFGA  +H+R N   +G+  ++V++I ++G+G  +KPN         Y+   W E L DF ++VV      + NSI
Subjt:  WRWNGFLVQYTVVGCEGPAILLVHGFGAFLEHFRDNIQGIGEDGNQVWAITMLGFGSLEKPN-------IVYSELMWAEFLRDFIVEVVGRPVHVVGNSI

Query:  GGYIVAIVASLWPSLVKSIVLINSAGSVIPGYSFLPSKKERQVTGAAWLAA--RLL-------LFYLRL----KTKDIVRNCYPTRTERADEWLINEMLR
        GG +    A   P + + ++LIN +  ++         K++   G  ++ +   LL       LF+  +      K I+  CY   ++  DE L+  +LR
Subjt:  GGYIVAIVASLWPSLVKSIVLINSAGSVIPGYSFLPSKKERQVTGAAWLAA--RLL-------LFYLRL----KTKDIVRNCYPTRTERADEWLINEMLR

Query:  ASKDPGGL-VVLESIFSLDLSVPLNYLLEGLEERVLIIQGMKDPIYNSKSMLAMLKEHCAGVTIKELDAGHCPHDELPEEVNSILCEWIVR
           +PG + V LE I      +P + LL  ++  VLI  G KDP    +   A          +   DAGHCP DE PE VN ++  ++ R
Subjt:  ASKDPGGL-VVLESIFSLDLSVPLNYLLEGLEERVLIIQGMKDPIYNSKSMLAMLKEHCAGVTIKELDAGHCPHDELPEEVNSILCEWIVR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCATCTTCAACTTCTCCCATTCTCCTTCTCTTCCCTTTACTACCTCAAACTCAAACCCCTTCCCCTTCTTCACTTGTTCTTCATCCTCATCACTCCTATCTCTATC
TTCTGCAACTCCCAAAACCGGAGGACTCGCACTTCTTTGGTTCAAACATGATCTCCGAATCGACGATCATCCAGCTCTCCATGCCGCTGCCTCCCAATTTCCCGCTCTCC
TCCCTCTGTACATCTTCGATTCCCGCATACTTTCACGGTTTTCCGATGAAATGCTTGAAATTGTTCTTCTTGCCTTGGAAGGCCTCAGACATTCGCTGAGGGATCGGGGT
CTGGATTTGATGCTCAAGTTTGGGGATGCGGAATCTGTACTTCGGGAGCTTGTAATTGAGGTTAAAGCTACCCATGTCTTTGCTGAGGAAGAGGTTGAGCAGGAGTTATG
TGTGTTGATGGATGATGTATCTCAGACATTGGCCAGTCTGATTGAGAGTCCAAATCTAACAATTTGGAGGACGCCATTTTATGATTTGAGGAACTTAGAGGTCTTACCTA
CGTCATACGATGAGTTTAGGAAGCTTCAACTTCCTGTACATCCTCCACTCTCTTCCCCAACATTACCCTGCTTGAAAATGGATTTGAACTGGGGTATCATGCCTACATTT
GATGGTTTGAAGGAGTTTGTGAATAGTAATCAAGTATACAAACCAAGAGAAGATTGGCATTCGATCAAGAATACAACAGCTGAAGCTATTGTGCGGGCTAAATCTTCTAA
AAAAGGAAGCAATGAAAACAATCCAAGTTTGAAAGAGTCCAGGACAGAGGGAATGGGCAAATCAATTTTTTCCATACGAGGAAAGATTTTCCAGATGGGGGGTGGTACGG
AAGGTGTGTTGAATGCTTTAGCTGCATACATAAGATACAATGAGGGAACATCTCGAGATGACTGGCAACTATTACATGAAATGGTACGCAACTCTGAAAGCCGAGATGGA
GCTTCATTTATCAAACTTTTTGGTCCTGCCATTCATCTGGGATTGATTTCTAAAAGGAGAGTGCACTATGAAGCCATCAAATATGAAAAGGAAAGAAATGCTGGGTTTCT
GTCACCATTTGGATACTCTGCAGGCACTGTTGCAGCAGCAGTTGATGCTGTTCTCTCCTCAGAGTGGTATTGGCTTATGAGTCTGAAGAGCAAAGGAAGGCGTTTGGGAA
GTCATTCTTATCGGAATTGGAGATGGAATGGCTTTCTTGTTCAGTACACAGTTGTTGGATGTGAAGGCCCTGCCATTCTTCTTGTGCATGGTTTTGGTGCCTTTTTGGAG
CATTTCCGTGACAATATACAAGGCATTGGTGAAGATGGAAATCAAGTTTGGGCTATAACAATGCTGGGGTTTGGGAGTTTAGAAAAGCCAAACATAGTGTATAGTGAATT
AATGTGGGCTGAATTTCTTAGAGATTTTATCGTTGAAGTTGTGGGCAGACCAGTACATGTTGTTGGAAATTCAATTGGCGGTTATATTGTCGCTATTGTCGCTTCTCTTT
GGCCTTCTTTGGTTAAATCCATTGTCCTCATCAATAGTGCTGGCAGTGTAATTCCAGGATATTCATTTTTGCCCTCCAAGAAGGAGAGACAGGTTACTGGTGCTGCATGG
TTGGCCGCTCGACTCCTTTTATTTTACTTGAGACTGAAAACAAAGGACATAGTGAGGAATTGCTATCCTACTAGAACAGAGCGAGCTGATGAATGGCTTATCAATGAGAT
GCTAAGAGCTTCCAAAGATCCTGGTGGCCTGGTGGTTCTAGAAAGTATTTTTAGCCTTGATCTTTCAGTTCCCCTCAATTATCTCTTGGAAGGATTGGAGGAAAGGGTTC
TCATCATCCAGGGGATGAAAGATCCTATCTACAACTCCAAGTCTATGTTGGCTATGCTTAAAGAGCACTGTGCTGGGGTAACAATCAAGGAATTGGATGCTGGACATTGC
CCACATGATGAGCTACCAGAAGAAGTGAATTCTATCCTCTGTGAATGGATAGTGAGAGTTGAGAGTAAACCAAAGGCTGGTAGTATTTTGTAA
mRNA sequenceShow/hide mRNA sequence
CCGGCTTCCCACCACCACAAACACACACACAAAAACCCACATTCTTCCTTCATCTCCATTCTCCAATGCCGATTTCCCCTTCTATCTTCCATTTCCATTTCCATGTCCAT
CTTCAACTTCTCCCATTCTCCTTCTCTTCCCTTTACTACCTCAAACTCAAACCCCTTCCCCTTCTTCACTTGTTCTTCATCCTCATCACTCCTATCTCTATCTTCTGCAA
CTCCCAAAACCGGAGGACTCGCACTTCTTTGGTTCAAACATGATCTCCGAATCGACGATCATCCAGCTCTCCATGCCGCTGCCTCCCAATTTCCCGCTCTCCTCCCTCTG
TACATCTTCGATTCCCGCATACTTTCACGGTTTTCCGATGAAATGCTTGAAATTGTTCTTCTTGCCTTGGAAGGCCTCAGACATTCGCTGAGGGATCGGGGTCTGGATTT
GATGCTCAAGTTTGGGGATGCGGAATCTGTACTTCGGGAGCTTGTAATTGAGGTTAAAGCTACCCATGTCTTTGCTGAGGAAGAGGTTGAGCAGGAGTTATGTGTGTTGA
TGGATGATGTATCTCAGACATTGGCCAGTCTGATTGAGAGTCCAAATCTAACAATTTGGAGGACGCCATTTTATGATTTGAGGAACTTAGAGGTCTTACCTACGTCATAC
GATGAGTTTAGGAAGCTTCAACTTCCTGTACATCCTCCACTCTCTTCCCCAACATTACCCTGCTTGAAAATGGATTTGAACTGGGGTATCATGCCTACATTTGATGGTTT
GAAGGAGTTTGTGAATAGTAATCAAGTATACAAACCAAGAGAAGATTGGCATTCGATCAAGAATACAACAGCTGAAGCTATTGTGCGGGCTAAATCTTCTAAAAAAGGAA
GCAATGAAAACAATCCAAGTTTGAAAGAGTCCAGGACAGAGGGAATGGGCAAATCAATTTTTTCCATACGAGGAAAGATTTTCCAGATGGGGGGTGGTACGGAAGGTGTG
TTGAATGCTTTAGCTGCATACATAAGATACAATGAGGGAACATCTCGAGATGACTGGCAACTATTACATGAAATGGTACGCAACTCTGAAAGCCGAGATGGAGCTTCATT
TATCAAACTTTTTGGTCCTGCCATTCATCTGGGATTGATTTCTAAAAGGAGAGTGCACTATGAAGCCATCAAATATGAAAAGGAAAGAAATGCTGGGTTTCTGTCACCAT
TTGGATACTCTGCAGGCACTGTTGCAGCAGCAGTTGATGCTGTTCTCTCCTCAGAGTGGTATTGGCTTATGAGTCTGAAGAGCAAAGGAAGGCGTTTGGGAAGTCATTCT
TATCGGAATTGGAGATGGAATGGCTTTCTTGTTCAGTACACAGTTGTTGGATGTGAAGGCCCTGCCATTCTTCTTGTGCATGGTTTTGGTGCCTTTTTGGAGCATTTCCG
TGACAATATACAAGGCATTGGTGAAGATGGAAATCAAGTTTGGGCTATAACAATGCTGGGGTTTGGGAGTTTAGAAAAGCCAAACATAGTGTATAGTGAATTAATGTGGG
CTGAATTTCTTAGAGATTTTATCGTTGAAGTTGTGGGCAGACCAGTACATGTTGTTGGAAATTCAATTGGCGGTTATATTGTCGCTATTGTCGCTTCTCTTTGGCCTTCT
TTGGTTAAATCCATTGTCCTCATCAATAGTGCTGGCAGTGTAATTCCAGGATATTCATTTTTGCCCTCCAAGAAGGAGAGACAGGTTACTGGTGCTGCATGGTTGGCCGC
TCGACTCCTTTTATTTTACTTGAGACTGAAAACAAAGGACATAGTGAGGAATTGCTATCCTACTAGAACAGAGCGAGCTGATGAATGGCTTATCAATGAGATGCTAAGAG
CTTCCAAAGATCCTGGTGGCCTGGTGGTTCTAGAAAGTATTTTTAGCCTTGATCTTTCAGTTCCCCTCAATTATCTCTTGGAAGGATTGGAGGAAAGGGTTCTCATCATC
CAGGGGATGAAAGATCCTATCTACAACTCCAAGTCTATGTTGGCTATGCTTAAAGAGCACTGTGCTGGGGTAACAATCAAGGAATTGGATGCTGGACATTGCCCACATGA
TGAGCTACCAGAAGAAGTGAATTCTATCCTCTGTGAATGGATAGTGAGAGTTGAGAGTAAACCAAAGGCTGGTAGTATTTTGTAATTTACTCCGACTCACCATAATCTAC
ATTTTAATTGTCAATGACGACCTTCTTAATTTTATGGAAATGTCGATGTCTTATTTTCATATGAATTTTGATGGAAGTATCGATAAAATGTTGGTGTGGATGGATATTTC
TAAAAAAATTGTAAACATCAAAAATATTTGAATCAAC
Protein sequenceShow/hide protein sequence
MSIFNFSHSPSLPFTTSNSNPFPFFTCSSSSSLLSLSSATPKTGGLALLWFKHDLRIDDHPALHAAASQFPALLPLYIFDSRILSRFSDEMLEIVLLALEGLRHSLRDRG
LDLMLKFGDAESVLRELVIEVKATHVFAEEEVEQELCVLMDDVSQTLASLIESPNLTIWRTPFYDLRNLEVLPTSYDEFRKLQLPVHPPLSSPTLPCLKMDLNWGIMPTF
DGLKEFVNSNQVYKPREDWHSIKNTTAEAIVRAKSSKKGSNENNPSLKESRTEGMGKSIFSIRGKIFQMGGGTEGVLNALAAYIRYNEGTSRDDWQLLHEMVRNSESRDG
ASFIKLFGPAIHLGLISKRRVHYEAIKYEKERNAGFLSPFGYSAGTVAAAVDAVLSSEWYWLMSLKSKGRRLGSHSYRNWRWNGFLVQYTVVGCEGPAILLVHGFGAFLE
HFRDNIQGIGEDGNQVWAITMLGFGSLEKPNIVYSELMWAEFLRDFIVEVVGRPVHVVGNSIGGYIVAIVASLWPSLVKSIVLINSAGSVIPGYSFLPSKKERQVTGAAW
LAARLLLFYLRLKTKDIVRNCYPTRTERADEWLINEMLRASKDPGGLVVLESIFSLDLSVPLNYLLEGLEERVLIIQGMKDPIYNSKSMLAMLKEHCAGVTIKELDAGHC
PHDELPEEVNSILCEWIVRVESKPKAGSIL