; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0013561 (gene) of Chayote v1 genome

Gene IDSed0013561
OrganismSechium edule (Chayote v1)
DescriptionRING/U-box protein, putative isoform 2
Genome locationLG04:47755813..47759334
RNA-Seq ExpressionSed0013561
SyntenySed0013561
Gene Ontology termsGO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001841 - Zinc finger, RING-type
IPR001965 - Zinc finger, PHD-type
IPR011011 - Zinc finger, FYVE/PHD-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR017907 - Zinc finger, RING-type, conserved site
IPR019787 - Zinc finger, PHD-finger


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011659696.1 uncharacterized protein LOC101205950 isoform X1 [Cucumis sativus]0.0e+0071.73Show/hide
Query:  MCEEQMVRGGKVDSRKNFKKKFRSKDKAASDDSDEDYVVSSD--GVSEGSDENYGSYLDGNASGEDNFVV----EQQQRNKPRNGVPPKARTAFGSCKSR
        MCEEQMVRGG+V SRKNFKKKFR KDK  SDDSDEDYVVSSD  GVSE SDE+Y S LD NASGEDN+VV    E+QQ+ K R  V PKAR    S ++R
Subjt:  MCEEQMVRGGKVDSRKNFKKKFRSKDKAASDDSDEDYVVSSD--GVSEGSDENYGSYLDGNASGEDNFVV----EQQQRNKPRNGVPPKARTAFGSCKSR

Query:  KKNGRESLIFTYQEEEDG----DEDYKVDDDNDY--------EEEEDVHADEEIEDEDEDFMLEEEDFLDEEEPMVRKRATNMKRGQEGVSKSTVGKVCK
        KKNG +   F+ QEEEDG    DEDY VD+DNDY        EEEEDV  D E+EDEDEDF+LEEEDF DEEEP+VRKR TNMKRG+ G+ K+ VGKVCK
Subjt:  KKNGRESLIFTYQEEEDG----DEDYKVDDDNDY--------EEEEDVHADEEIEDEDEDFMLEEEDFLDEEEPMVRKRATNMKRGQEGVSKSTVGKVCK

Query:  KRKPNAAKKPSRNKRRKKSGSRTIRNSDDDDGDFSDNY--CVVTRRKKAVSKRKNHAAQLDLDTFLSGSSDYEYTISEEEREQMREAEKLCYHLRNRGRT
        KRKP AAKKPSRNKRRKKSG +T+RNSDDDD DFSDNY    +TRRK+ VSKRK +  Q DLD  LSGSSDYEYTISEEEREQ+REAE+LC  LRNR RT
Subjt:  KRKPNAAKKPSRNKRRKKSGSRTIRNSDDDDGDFSDNY--CVVTRRKKAVSKRKNHAAQLDLDTFLSGSSDYEYTISEEEREQMREAEKLCYHLRNRGRT

Query:  VPSPRRIGEYDLRQQRNPRASVRKGKEKVEEVKAEVIKQVCGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTISKPAKSTVGID
        VPSP RI + DL QQR  R  VRKGKEKVE +KAEVIKQVCGICL+EEDKRR+RGTLN CSH+FCFSCIMEWAKVESRCPLCKQRF+TISKP +ST GID
Subjt:  VPSPRRIGEYDLRQQRNPRASVRKGKEKVEEVKAEVIKQVCGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTISKPAKSTVGID

Query:  LREVVINVPERDQVYQPSEEELRSYLDPYENVICIECQQGGDDNLMLLCDVCDSPAHTYCVGLAREVPEGNWYCIDCTTYAHGSSSPQPPSRLSDRRTAN
        LRE V+ VPERDQVYQPSEEELRSYLDPYENVICIEC +GGDDNLMLLCD+CDSPAHTYCVGL REVPEGNWYC DC T A GSSSPQP +RLS+RRT N
Subjt:  LREVVINVPERDQVYQPSEEELRSYLDPYENVICIECQQGGDDNLMLLCDVCDSPAHTYCVGLAREVPEGNWYCIDCTTYAHGSSSPQPPSRLSDRRTAN

Query:  SLFNRTFPAANREGLDLNSLSSHRTPNIQGFGNILSPRLPVEVQSTSPMSQAVAPTLERRRILRLHINNLR-SNQMGLVNTSGDGVSAGNPCVGGTLNSQ
        +LFNRTFP ANR+GLDLNS+SS RTP +QGF NI SPRLPVEVQSTSPMSQAVAPTL RRRILRLHINN+R S+QMGLV    DGVSA NP  GGTL+ Q
Subjt:  SLFNRTFPAANREGLDLNSLSSHRTPNIQGFGNILSPRLPVEVQSTSPMSQAVAPTLERRRILRLHINNLR-SNQMGLVNTSGDGVSAGNPCVGGTLNSQ

Query:  RGQGRETAAENTTTQDTAILSPTLFGENLIHGSPSPLMQHIVFLESERSHVSRQVLQGSHLSMPID-----GTILNSLRGLEVENLTVVARNLNG---PE
         GQ RE+  E+  TQ+ AI S TLFGE L+H S S +MQH  FL+ E SH+ RQ LQ  H S   D     GTI+N LRGL VEN   V RNLNG    E
Subjt:  RGQGRETAAENTTTQDTAILSPTLFGENLIHGSPSPLMQHIVFLESERSHVSRQVLQGSHLSMPID-----GTILNSLRGLEVENLTVVARNLNG---PE

Query:  LARVSSQSNCEQVQIDRYNDVLNTGSVNSSLP-LGGDEKDYCAEREHLQPMIESHLKNYLSRD-DLGEIAANDIATEAASTILAACGFKHSINEACRASP
        LA V+S  NCE  QI  Y++ +NT S N SLP L  DEK+YCA RE LQP+I+ HLKN LSRD DLG+  A+DIAT+A STIL ACG++HSIN A R+SP
Subjt:  LARVSSQSNCEQVQIDRYNDVLNTGSVNSSLP-LGGDEKDYCAEREHLQPMIESHLKNYLSRD-DLGEIAANDIATEAASTILAACGFKHSINEACRASP

Query:  PSQCSHIELSTGEEERSLIKGLCSSCFDLYVRDVVKKITDD-SWLNLRL
        PSQCSHIEL+ GE +RSLIKGLC  CFD YVRDVVKKITDD SWLNLRL
Subjt:  PSQCSHIELSTGEEERSLIKGLCSSCFDLYVRDVVKKITDD-SWLNLRL

XP_022931593.1 uncharacterized protein LOC111437762 isoform X2 [Cucurbita moschata]0.0e+0070.88Show/hide
Query:  MCEEQMVRGGKVDSRKNFKKKFRSKDKAASDDSDEDYVVSSD--GVSEGSDENYGSYLDGNASGEDNFV------VEQQQRNKPRNGVPPKARTAFGSCK
        MCEEQMVRGGKV SRKNFKKKFR KDK  SDDSDEDYVVS+D  GVSE SDE+  SYLD NASGEDNFV      VE+QQ NK R GV PK     GS K
Subjt:  MCEEQMVRGGKVDSRKNFKKKFRSKDKAASDDSDEDYVVSSD--GVSEGSDENYGSYLDGNASGEDNFV------VEQQQRNKPRNGVPPKARTAFGSCK

Query:  SRKKNGRESLIFTYQEEE-----DGDEDYKVDDDNDYEEEEDVHADEE----------IEDEDEDFMLEEEDFLDEEEPMVRKRATNMKRGQEGVSKSTV
         RKKNGR+   F+YQEEE     + DEDY +D+DNDYEEEE+   +EE          IEDEDE+FM+EEEDF DEEEP+VRK+ATN KRG++GV K+TV
Subjt:  SRKKNGRESLIFTYQEEE-----DGDEDYKVDDDNDYEEEEDVHADEE----------IEDEDEDFMLEEEDFLDEEEPMVRKRATNMKRGQEGVSKSTV

Query:  GKVCKKRKPNAAKKPSRNKRRKKSGSRTIRNSDDDDGDFSDNYCVV--TRRKKAVSKRKNHAAQLDLDTFLSGSSDYEYTISEEEREQMREAEKLCYHLR
        GKVCKKRKP AAKKP+RNKRRKK  S T+R+SDDDD DFS+N  VV  TR+K++VSKRKNH  Q DLDTF+SGSSDYEYTISEEEREQ+REAEKLC HLR
Subjt:  GKVCKKRKPNAAKKPSRNKRRKKSGSRTIRNSDDDDGDFSDNYCVV--TRRKKAVSKRKNHAAQLDLDTFLSGSSDYEYTISEEEREQMREAEKLCYHLR

Query:  NRGRTVPSPRRIGEYDLRQQRNPRASVRKGKEKVEEVKAEVIKQVCGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTISKPAKS
        NR RTVPSP RI E + RQ R PR  VRKGKEK+E+VKAEVIKQVCGICL+EEDKRR+RGTLN CSH+FCFSCIMEWAKVESRCPLCKQRF+TISKPA+S
Subjt:  NRGRTVPSPRRIGEYDLRQQRNPRASVRKGKEKVEEVKAEVIKQVCGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTISKPAKS

Query:  TVGIDLREVVINVPERDQVYQPSEEELRSYLDPYENVICIECQQGGDDNLMLLCDVCDSPAHTYCVGLAREVPEGNWYCIDCTTYAHGSSSPQ-PPSRLS
          GIDLREVVI VPERDQVYQPSEEELRSYLDPYENVIC EC +GGDDNLMLLCD+CDSPAHTYCVGL REVPEGNWYC+DCT  A G SSPQ PP+RLS
Subjt:  TVGIDLREVVINVPERDQVYQPSEEELRSYLDPYENVICIECQQGGDDNLMLLCDVCDSPAHTYCVGLAREVPEGNWYCIDCTTYAHGSSSPQ-PPSRLS

Query:  DRRTANSLFNRTFPAANREGLDLNSLSSHRTPNIQGFGNILSPRLPVEVQSTSPMSQAVAPTLERRRILRLHINNLR-SNQMGLVNTSGDGVS----AGN
        DRRT N+LFNR+FP ANR+GLDLNSLSS RTP  QGFG+I SPR+PVEVQSTSPMSQAVAPTL RRRILRLHIN++R SNQMGLVN   +GVS    A +
Subjt:  DRRTANSLFNRTFPAANREGLDLNSLSSHRTPNIQGFGNILSPRLPVEVQSTSPMSQAVAPTLERRRILRLHINNLR-SNQMGLVNTSGDGVS----AGN

Query:  PCVGGTLNSQRGQGRETAAENTTTQDTAILSPTLFGENLIHGSPSPLMQHIVFLESERSHVSRQVLQGSHLSM-------PIDGTILNSLR-GLEVENLT
         C G T+                 Q+T + S +LFGE+L+H +PSPLMQH VFL SE + +S QV+Q  HLSM         +GTI N+LR G + EN T
Subjt:  PCVGGTLNSQRGQGRETAAENTTTQDTAILSPTLFGENLIHGSPSPLMQHIVFLESERSHVSRQVLQGSHLSM-------PIDGTILNSLR-GLEVENLT

Query:  VVARNLNGP---ELARVSSQSNCEQVQIDRYNDVLNTGSVNSSLPLG-GDEKDYCAEREHLQPMIESHLKNYLSRD-DLGEIAANDIATEAASTILAACG
         V RNLNG    ELA V+  SNCE   I  Y+++LNTGS +SS P   GDEKDYCA REHLQPMIESHLKNYLS D DLG   AN+IAT+AA TILAACG
Subjt:  VVARNLNGP---ELARVSSQSNCEQVQIDRYNDVLNTGSVNSSLPLG-GDEKDYCAEREHLQPMIESHLKNYLSRD-DLGEIAANDIATEAASTILAACG

Query:  FKHSINEACRASPPSQCSHIELSTGEEERSLIKGLCSSCFDLYVRDVVKKITDD-SWLNLRL
        FK+SINEACRA+PPSQCSHIELS GEE+RSLIKGLCS+CFD YVRDVVKK TDD SWL+L L
Subjt:  FKHSINEACRASPPSQCSHIELSTGEEERSLIKGLCSSCFDLYVRDVVKKITDD-SWLNLRL

XP_023530078.1 uncharacterized protein LOC111792737 isoform X2 [Cucurbita pepo subsp. pepo]0.0e+0070.64Show/hide
Query:  MCEEQMVRGGKVDSRKNFKKKFRSKDKAASDDSDEDYVVSSD--GVSEGSDENYGSYLDGNASGEDNFV--------VEQQQRNKPRNGVPPKARTAFGS
        MCEEQMVRGGKV SRKNFKKKFR KDK  SDDSDEDYVVS+D  GVSE SDE+  SYLD NASGEDNFV        VE+QQ NK R GV PK   A GS
Subjt:  MCEEQMVRGGKVDSRKNFKKKFRSKDKAASDDSDEDYVVSSD--GVSEGSDENYGSYLDGNASGEDNFV--------VEQQQRNKPRNGVPPKARTAFGS

Query:  CKSRKKNGRESLIFTYQEEE-----DGDEDYKVDDDNDYEEEEDVHADEE------------IEDEDEDFMLEEEDFLDEEEPMVRKRATNMKRGQEGVS
         K RKKNGR+   F+YQEEE     + DEDY +D+DNDYEEEE+   +EE            IEDEDE+FM+EEEDF DEEEP+VRK+ATN KRG++GV 
Subjt:  CKSRKKNGRESLIFTYQEEE-----DGDEDYKVDDDNDYEEEEDVHADEE------------IEDEDEDFMLEEEDFLDEEEPMVRKRATNMKRGQEGVS

Query:  KSTVGKVCKKRKPNAAKKPSRNKRRKKSGSRTIRNSDDDDGDFSDNYCVV--TRRKKAVSKRKNHAAQLDLDTFLSGSSDYEYTISEEEREQMREAEKLC
        K+TVGKVCKKRKP AAKKP+RNKRRKK  S T+R+SDDDD DFS+N  VV  TR+K++VSKRKNH  Q DLDTF+SGSSDYEYTISEEEREQ+REAEKLC
Subjt:  KSTVGKVCKKRKPNAAKKPSRNKRRKKSGSRTIRNSDDDDGDFSDNYCVV--TRRKKAVSKRKNHAAQLDLDTFLSGSSDYEYTISEEEREQMREAEKLC

Query:  YHLRNRGRTVPSPRRIGEYDLRQQRNPRASVRKGKEKVEEVKAEVIKQVCGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTISK
         +LRNR RTVPSP RI E + RQ R PR  VRKGKEK+EEVKAEVIKQVCGICL+EEDKRR+RGTLN CSH+FCFSCIMEWAKVESRCPLCKQRF+TISK
Subjt:  YHLRNRGRTVPSPRRIGEYDLRQQRNPRASVRKGKEKVEEVKAEVIKQVCGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTISK

Query:  PAKSTVGIDLREVVINVPERDQVYQPSEEELRSYLDPYENVICIECQQGGDDNLMLLCDVCDSPAHTYCVGLAREVPEGNWYCIDCTTYAHGSSSPQPP-
        PA+S  GIDLREVVI VPERDQVYQPSEEELRSYLDPYENVIC EC +GGDDNLMLLCD+CDSPAHTYCVGL REVPEGNWYC+DCT  A G SSPQPP 
Subjt:  PAKSTVGIDLREVVINVPERDQVYQPSEEELRSYLDPYENVICIECQQGGDDNLMLLCDVCDSPAHTYCVGLAREVPEGNWYCIDCTTYAHGSSSPQPP-

Query:  SRLSDRRTANSLFNRTFPAANREGLDLNSLSSHRTPNIQGFGNILSPRLPVEVQSTSPMSQAVAPTLERRRILRLHINNLR-SNQMGLVNTSGDGVS---
        +RLSDRRT N+LFNR+FP ANR+GLDLNSLSS RTP  QGFG+I SPR+PVEVQSTSPMSQAVAPTL RRRILRLHIN++R SNQMGLVN   +GVS   
Subjt:  SRLSDRRTANSLFNRTFPAANREGLDLNSLSSHRTPNIQGFGNILSPRLPVEVQSTSPMSQAVAPTLERRRILRLHINNLR-SNQMGLVNTSGDGVS---

Query:  AGNPCVGGTLNSQRGQGRETAAENTTTQDTAILSPTLFGENLIHGSPSPLMQHIVFLESERSHVSRQVLQGSHLSM-------PIDGTILNSLR-GLEVE
        A + C G T+                 Q+T + S +LFGE+L+H +PSPLMQH VFL SE + +S QV+Q  HLSM         +GTI N+LR G + E
Subjt:  AGNPCVGGTLNSQRGQGRETAAENTTTQDTAILSPTLFGENLIHGSPSPLMQHIVFLESERSHVSRQVLQGSHLSM-------PIDGTILNSLR-GLEVE

Query:  NLTVVARNLNGP---ELARVSSQSNCEQVQIDRYNDVLNTGSVNSSLPLG-GDEKDYCAEREHLQPMIESHLKNYLSRD-DLGEIAANDIATEAASTILA
        N T V RNLNG    E A V+  SNCE   I  Y+++LNTGS +SS P   GDEKDYCA REHLQPMIESHLKNYLS D DLG+  AN+IAT+AA TILA
Subjt:  NLTVVARNLNGP---ELARVSSQSNCEQVQIDRYNDVLNTGSVNSSLPLG-GDEKDYCAEREHLQPMIESHLKNYLSRD-DLGEIAANDIATEAASTILA

Query:  ACGFKHSINEACRASPPSQCSHIELSTGEEERSLIKGLCSSCFDLYVRDVVKKITDD-SWLNLRL
        ACGFK+SINEACRA+PPSQCSHIELS GEE+RSLIKGLCS+CFD YVRDVVKK TDD SWL+L L
Subjt:  ACGFKHSINEACRASPPSQCSHIELSTGEEERSLIKGLCSSCFDLYVRDVVKKITDD-SWLNLRL

XP_038880559.1 uncharacterized protein LOC120072219 isoform X1 [Benincasa hispida]0.0e+0072.5Show/hide
Query:  MCEEQMVRGGKVDSRKNFKKKFRSKDKAASDDSDEDYVVSSD--GVSEGSDENYGSYLDGNASGEDNFV----VEQQQRNKPRNGVPPKARTAFGSCKSR
        MCEEQMVRGGKV SRKNFKKKFR KDK  SDDSDEDYVVSSD  GVSE SDE+Y S LD NASGEDNFV    VE+QQ    R  V PKAR AFGS K R
Subjt:  MCEEQMVRGGKVDSRKNFKKKFRSKDKAASDDSDEDYVVSSD--GVSEGSDENYGSYLDGNASGEDNFV----VEQQQRNKPRNGVPPKARTAFGSCKSR

Query:  KKNGRESLIFTYQEEEDG----DEDYKVDDDNDY----------EEEEDVHADEEIEDEDEDFMLEEEDFLDEEEPMVRKRATNMKRGQEGVSKSTVGKV
        K NG++   F+YQ+EEDG    DEDY VD+DNDY          EEEEDV  DEE+EDEDEDF+LEEEDF DEEEP+VRKR TNMKRG+ GV K+TVGKV
Subjt:  KKNGRESLIFTYQEEEDG----DEDYKVDDDNDY----------EEEEDVHADEEIEDEDEDFMLEEEDFLDEEEPMVRKRATNMKRGQEGVSKSTVGKV

Query:  CKKRKPNAAKKPSRNKRRKKSGSRTIRNSDDDDGDFSDNYCVV--TRRKKAVSKRKNHAAQLDLDTFLSGSSDYEYTISEEEREQMREAEKLCYHLRNRG
        CKKR+P AAKKPSRNKRRKKSG+++++NSDD   DFSD+Y  V  TRRK+ VSKRK +  Q DLDTFLSGSSDYEYTISEEEREQ+REAE+LC  L NR 
Subjt:  CKKRKPNAAKKPSRNKRRKKSGSRTIRNSDDDDGDFSDNYCVV--TRRKKAVSKRKNHAAQLDLDTFLSGSSDYEYTISEEEREQMREAEKLCYHLRNRG

Query:  RTVPSPRRIGEYDLRQQRNPRASVRKGKEKVEEVKAEVIKQVCGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTISKPAKSTVG
        RT PSP RI + D+ QQR  R  VRKGKEKVEEVKAEVIKQVCGICL+EEDKRR+RGTLN CSH+FCFSCIMEWAKVESRCPLCKQRF+TISKPA+ST G
Subjt:  RTVPSPRRIGEYDLRQQRNPRASVRKGKEKVEEVKAEVIKQVCGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTISKPAKSTVG

Query:  IDLREVVINVPERDQVYQPSEEELRSYLDPYENVICIECQQGGDDNLMLLCDVCDSPAHTYCVGLAREVPEGNWYCIDCTTYAHGSSSPQPPSRLSDRRT
        IDLREVVI VPERDQVYQPSEEELRSYLDPYENVICIEC +GGDDNLMLLCD+CDSPAHTYCVGL REVPEGNWYC+DC   A GSSSPQP SRLS+RRT
Subjt:  IDLREVVINVPERDQVYQPSEEELRSYLDPYENVICIECQQGGDDNLMLLCDVCDSPAHTYCVGLAREVPEGNWYCIDCTTYAHGSSSPQPPSRLSDRRT

Query:  ANSLFNRTFPAANREGLDLNSLSSHRTPNIQGFGNILSPRLPVEVQSTSPMSQAVAPTLERRRILRLHINNLR-SNQMGLVNTSGDGVSAGNPCVGGTLN
         N+LFNRTFP ANR+GLDLNSLSS R+  +QGF NI SPRLPVEVQSTSPMSQA+APTL RRRILRLHIN++R S+QMGLV    +GVSA NPC GGTL+
Subjt:  ANSLFNRTFPAANREGLDLNSLSSHRTPNIQGFGNILSPRLPVEVQSTSPMSQAVAPTLERRRILRLHINNLR-SNQMGLVNTSGDGVSAGNPCVGGTLN

Query:  SQRGQGRETAAENTTTQDTAILSPTLFGENLIHGSPSPLMQHIVFLESERSHVSRQVLQGSHLSMPID-----GTILNSLRGLEVENLTVVARNLNG---
         Q GQ RE+ AE+T TQ+  I + TL GE L+H SPSPLMQH  FL+ E SH+SRQVLQ  H SMPID     GTI+N LRGL VEN   V RNLNG   
Subjt:  SQRGQGRETAAENTTTQDTAILSPTLFGENLIHGSPSPLMQHIVFLESERSHVSRQVLQGSHLSMPID-----GTILNSLRGLEVENLTVVARNLNG---

Query:  PELARVSSQSNCEQVQIDRYNDVLNTGSVNSSLP-LGGDEKDYCAEREHLQPMIESHLKNYLSRDDLGEIAANDIATEAASTILAACGFKHSINEACRAS
        PE+A ++S SNCE  QI  Y++V+NT S N SLP L GDEKDYCA RE LQP+I+ HLKN     DL + +A DIAT+AASTIL ACGF H IN+A RAS
Subjt:  PELARVSSQSNCEQVQIDRYNDVLNTGSVNSSLP-LGGDEKDYCAEREHLQPMIESHLKNYLSRDDLGEIAANDIATEAASTILAACGFKHSINEACRAS

Query:  PPSQCSHIELSTGEEERSLIKGLCSSCFDLYVRDVVKKITDD--SWLNLRL
        PPS+CSHIELS GE+ RSLIKGLC SCFD YV+D+VKKITDD  SWLNL L
Subjt:  PPSQCSHIELSTGEEERSLIKGLCSSCFDLYVRDVVKKITDD--SWLNLRL

XP_038880561.1 uncharacterized protein LOC120072219 isoform X2 [Benincasa hispida]0.0e+0072.34Show/hide
Query:  MVRGGKVDSRKNFKKKFRSKDKAASDDSDEDYVVSSD--GVSEGSDENYGSYLDGNASGEDNFV----VEQQQRNKPRNGVPPKARTAFGSCKSRKKNGR
        MVRGGKV SRKNFKKKFR KDK  SDDSDEDYVVSSD  GVSE SDE+Y S LD NASGEDNFV    VE+QQ    R  V PKAR AFGS K RK NG+
Subjt:  MVRGGKVDSRKNFKKKFRSKDKAASDDSDEDYVVSSD--GVSEGSDENYGSYLDGNASGEDNFV----VEQQQRNKPRNGVPPKARTAFGSCKSRKKNGR

Query:  ESLIFTYQEEEDG----DEDYKVDDDNDY----------EEEEDVHADEEIEDEDEDFMLEEEDFLDEEEPMVRKRATNMKRGQEGVSKSTVGKVCKKRK
        +   F+YQ+EEDG    DEDY VD+DNDY          EEEEDV  DEE+EDEDEDF+LEEEDF DEEEP+VRKR TNMKRG+ GV K+TVGKVCKKR+
Subjt:  ESLIFTYQEEEDG----DEDYKVDDDNDY----------EEEEDVHADEEIEDEDEDFMLEEEDFLDEEEPMVRKRATNMKRGQEGVSKSTVGKVCKKRK

Query:  PNAAKKPSRNKRRKKSGSRTIRNSDDDDGDFSDNYCVV--TRRKKAVSKRKNHAAQLDLDTFLSGSSDYEYTISEEEREQMREAEKLCYHLRNRGRTVPS
        P AAKKPSRNKRRKKSG+++++NSDD   DFSD+Y  V  TRRK+ VSKRK +  Q DLDTFLSGSSDYEYTISEEEREQ+REAE+LC  L NR RT PS
Subjt:  PNAAKKPSRNKRRKKSGSRTIRNSDDDDGDFSDNYCVV--TRRKKAVSKRKNHAAQLDLDTFLSGSSDYEYTISEEEREQMREAEKLCYHLRNRGRTVPS

Query:  PRRIGEYDLRQQRNPRASVRKGKEKVEEVKAEVIKQVCGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTISKPAKSTVGIDLRE
        P RI + D+ QQR  R  VRKGKEKVEEVKAEVIKQVCGICL+EEDKRR+RGTLN CSH+FCFSCIMEWAKVESRCPLCKQRF+TISKPA+ST GIDLRE
Subjt:  PRRIGEYDLRQQRNPRASVRKGKEKVEEVKAEVIKQVCGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTISKPAKSTVGIDLRE

Query:  VVINVPERDQVYQPSEEELRSYLDPYENVICIECQQGGDDNLMLLCDVCDSPAHTYCVGLAREVPEGNWYCIDCTTYAHGSSSPQPPSRLSDRRTANSLF
        VVI VPERDQVYQPSEEELRSYLDPYENVICIEC +GGDDNLMLLCD+CDSPAHTYCVGL REVPEGNWYC+DC   A GSSSPQP SRLS+RRT N+LF
Subjt:  VVINVPERDQVYQPSEEELRSYLDPYENVICIECQQGGDDNLMLLCDVCDSPAHTYCVGLAREVPEGNWYCIDCTTYAHGSSSPQPPSRLSDRRTANSLF

Query:  NRTFPAANREGLDLNSLSSHRTPNIQGFGNILSPRLPVEVQSTSPMSQAVAPTLERRRILRLHINNLR-SNQMGLVNTSGDGVSAGNPCVGGTLNSQRGQ
        NRTFP ANR+GLDLNSLSS R+  +QGF NI SPRLPVEVQSTSPMSQA+APTL RRRILRLHIN++R S+QMGLV    +GVSA NPC GGTL+ Q GQ
Subjt:  NRTFPAANREGLDLNSLSSHRTPNIQGFGNILSPRLPVEVQSTSPMSQAVAPTLERRRILRLHINNLR-SNQMGLVNTSGDGVSAGNPCVGGTLNSQRGQ

Query:  GRETAAENTTTQDTAILSPTLFGENLIHGSPSPLMQHIVFLESERSHVSRQVLQGSHLSMPID-----GTILNSLRGLEVENLTVVARNLNG---PELAR
         RE+ AE+T TQ+  I + TL GE L+H SPSPLMQH  FL+ E SH+SRQVLQ  H SMPID     GTI+N LRGL VEN   V RNLNG   PE+A 
Subjt:  GRETAAENTTTQDTAILSPTLFGENLIHGSPSPLMQHIVFLESERSHVSRQVLQGSHLSMPID-----GTILNSLRGLEVENLTVVARNLNG---PELAR

Query:  VSSQSNCEQVQIDRYNDVLNTGSVNSSLP-LGGDEKDYCAEREHLQPMIESHLKNYLSRDDLGEIAANDIATEAASTILAACGFKHSINEACRASPPSQC
        ++S SNCE  QI  Y++V+NT S N SLP L GDEKDYCA RE LQP+I+ HLKN     DL + +A DIAT+AASTIL ACGF H IN+A RASPPS+C
Subjt:  VSSQSNCEQVQIDRYNDVLNTGSVNSSLP-LGGDEKDYCAEREHLQPMIESHLKNYLSRDDLGEIAANDIATEAASTILAACGFKHSINEACRASPPSQC

Query:  SHIELSTGEEERSLIKGLCSSCFDLYVRDVVKKITDD--SWLNLRL
        SHIELS GE+ RSLIKGLC SCFD YV+D+VKKITDD  SWLNL L
Subjt:  SHIELSTGEEERSLIKGLCSSCFDLYVRDVVKKITDD--SWLNLRL

TrEMBL top hitse value%identityAlignment
A0A0A0LWB6 Uncharacterized protein0.0e+0071.56Show/hide
Query:  MVRGGKVDSRKNFKKKFRSKDKAASDDSDEDYVVSSD--GVSEGSDENYGSYLDGNASGEDNFVV----EQQQRNKPRNGVPPKARTAFGSCKSRKKNGR
        MVRGG+V SRKNFKKKFR KDK  SDDSDEDYVVSSD  GVSE SDE+Y S LD NASGEDN+VV    E+QQ+ K R  V PKAR    S ++RKKNG 
Subjt:  MVRGGKVDSRKNFKKKFRSKDKAASDDSDEDYVVSSD--GVSEGSDENYGSYLDGNASGEDNFVV----EQQQRNKPRNGVPPKARTAFGSCKSRKKNGR

Query:  ESLIFTYQEEEDG----DEDYKVDDDNDY--------EEEEDVHADEEIEDEDEDFMLEEEDFLDEEEPMVRKRATNMKRGQEGVSKSTVGKVCKKRKPN
        +   F+ QEEEDG    DEDY VD+DNDY        EEEEDV  D E+EDEDEDF+LEEEDF DEEEP+VRKR TNMKRG+ G+ K+ VGKVCKKRKP 
Subjt:  ESLIFTYQEEEDG----DEDYKVDDDNDY--------EEEEDVHADEEIEDEDEDFMLEEEDFLDEEEPMVRKRATNMKRGQEGVSKSTVGKVCKKRKPN

Query:  AAKKPSRNKRRKKSGSRTIRNSDDDDGDFSDNY--CVVTRRKKAVSKRKNHAAQLDLDTFLSGSSDYEYTISEEEREQMREAEKLCYHLRNRGRTVPSPR
        AAKKPSRNKRRKKSG +T+RNSDDDD DFSDNY    +TRRK+ VSKRK +  Q DLD  LSGSSDYEYTISEEEREQ+REAE+LC  LRNR RTVPSP 
Subjt:  AAKKPSRNKRRKKSGSRTIRNSDDDDGDFSDNY--CVVTRRKKAVSKRKNHAAQLDLDTFLSGSSDYEYTISEEEREQMREAEKLCYHLRNRGRTVPSPR

Query:  RIGEYDLRQQRNPRASVRKGKEKVEEVKAEVIKQVCGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTISKPAKSTVGIDLREVV
        RI + DL QQR  R  VRKGKEKVE +KAEVIKQVCGICL+EEDKRR+RGTLN CSH+FCFSCIMEWAKVESRCPLCKQRF+TISKP +ST GIDLRE V
Subjt:  RIGEYDLRQQRNPRASVRKGKEKVEEVKAEVIKQVCGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTISKPAKSTVGIDLREVV

Query:  INVPERDQVYQPSEEELRSYLDPYENVICIECQQGGDDNLMLLCDVCDSPAHTYCVGLAREVPEGNWYCIDCTTYAHGSSSPQPPSRLSDRRTANSLFNR
        + VPERDQVYQPSEEELRSYLDPYENVICIEC +GGDDNLMLLCD+CDSPAHTYCVGL REVPEGNWYC DC T A GSSSPQP +RLS+RRT N+LFNR
Subjt:  INVPERDQVYQPSEEELRSYLDPYENVICIECQQGGDDNLMLLCDVCDSPAHTYCVGLAREVPEGNWYCIDCTTYAHGSSSPQPPSRLSDRRTANSLFNR

Query:  TFPAANREGLDLNSLSSHRTPNIQGFGNILSPRLPVEVQSTSPMSQAVAPTLERRRILRLHINNLR-SNQMGLVNTSGDGVSAGNPCVGGTLNSQRGQGR
        TFP ANR+GLDLNS+SS RTP +QGF NI SPRLPVEVQSTSPMSQAVAPTL RRRILRLHINN+R S+QMGLV    DGVSA NP  GGTL+ Q GQ R
Subjt:  TFPAANREGLDLNSLSSHRTPNIQGFGNILSPRLPVEVQSTSPMSQAVAPTLERRRILRLHINNLR-SNQMGLVNTSGDGVSAGNPCVGGTLNSQRGQGR

Query:  ETAAENTTTQDTAILSPTLFGENLIHGSPSPLMQHIVFLESERSHVSRQVLQGSHLSMPID-----GTILNSLRGLEVENLTVVARNLNG---PELARVS
        E+  E+  TQ+ AI S TLFGE L+H S S +MQH  FL+ E SH+ RQ LQ  H S   D     GTI+N LRGL VEN   V RNLNG    ELA V+
Subjt:  ETAAENTTTQDTAILSPTLFGENLIHGSPSPLMQHIVFLESERSHVSRQVLQGSHLSMPID-----GTILNSLRGLEVENLTVVARNLNG---PELARVS

Query:  SQSNCEQVQIDRYNDVLNTGSVNSSLP-LGGDEKDYCAEREHLQPMIESHLKNYLSRD-DLGEIAANDIATEAASTILAACGFKHSINEACRASPPSQCS
        S  NCE  QI  Y++ +NT S N SLP L  DEK+YCA RE LQP+I+ HLKN LSRD DLG+  A+DIAT+A STIL ACG++HSIN A R+SPPSQCS
Subjt:  SQSNCEQVQIDRYNDVLNTGSVNSSLP-LGGDEKDYCAEREHLQPMIESHLKNYLSRD-DLGEIAANDIATEAASTILAACGFKHSINEACRASPPSQCS

Query:  HIELSTGEEERSLIKGLCSSCFDLYVRDVVKKITDD-SWLNLRL
        HIEL+ GE +RSLIKGLC  CFD YVRDVVKKITDD SWLNLRL
Subjt:  HIELSTGEEERSLIKGLCSSCFDLYVRDVVKKITDD-SWLNLRL

A0A6J1C3M4 uncharacterized protein LOC111007954 isoform X10.0e+0071.03Show/hide
Query:  MCEEQMVRGGKVDSRKNFKKKFRSKDKAASDDSDEDYVVSSD--GVSEGSDENYGSYLDGNASGEDNFVVEQQQRNKPRNGVPPKARTAFGSCKSRKKNG
        MCEEQMVRGGKV S+KNFKKKFR KDK  SDDSDEDYVVS+D  GVSE SDE+Y SYLD NASGEDNF VE+QQ  K R     KAR AFGS K+RKKNG
Subjt:  MCEEQMVRGGKVDSRKNFKKKFRSKDKAASDDSDEDYVVSSD--GVSEGSDENYGSYLDGNASGEDNFVVEQQQRNKPRNGVPPKARTAFGSCKSRKKNG

Query:  RESLIFTYQEEEDG-----DEDYKVDDDNDYEEEEDVHADEEIED-------------EDEDFMLEEEDFLDEEEPMVRKRATNMKRGQEGVSKSTVGKV
        R+   F+Y+EE+DG     DEDY VDDDNDYEEEE+   +EE E+             EDEDFM+EEEDF DEEEP+VRKR TNMKRG+ GV K+T+GKV
Subjt:  RESLIFTYQEEEDG-----DEDYKVDDDNDYEEEEDVHADEEIED-------------EDEDFMLEEEDFLDEEEPMVRKRATNMKRGQEGVSKSTVGKV

Query:  CKKRKPNAAKKPSRNKRRKKSGSRTIRNSDDDDGDFSDNYCVV---TRRKKAVSKRKNHAAQLDLDTFLSGSSDYEYTISEEEREQMREAEKLCYHLRNR
         KKRKP AAKKPSRNKRR+KSG RTIRNS  DDGDFSDNYC     TRRKK VSKRK +  Q DLDTFLSGSSDYEYTISEEEREQ+REAE+LC HLRNR
Subjt:  CKKRKPNAAKKPSRNKRRKKSGSRTIRNSDDDDGDFSDNYCVV---TRRKKAVSKRKNHAAQLDLDTFLSGSSDYEYTISEEEREQMREAEKLCYHLRNR

Query:  GRTVPSPRRIGEYDLRQQRNPRASVRKGKEKVEEVKAEVIKQVCGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTISKPAKSTV
         R++PSP +I + D RQ R P   VRKGKEK+EEVKAEVIKQVCGICL+EEDKRR+RGTLN CSH+FCFSCIMEWAKVESRCPLCKQRF++ISKPA+ST 
Subjt:  GRTVPSPRRIGEYDLRQQRNPRASVRKGKEKVEEVKAEVIKQVCGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTISKPAKSTV

Query:  GIDLREVVINVPERDQVYQPSEEELRSYLDPYENVICIECQQGGDDNLMLLCDVCDSPAHTYCVGLAREVPEGNWYCIDCTTYAHGSSSPQPPSRLSDRR
        GIDLREVVI VPERDQVYQPSEEELRSYLDPYENVICIEC +GGDDNLMLLCD+CDSPAHTYCVGL REVPEGNWYC DC T A GSSSPQPPSRLSDRR
Subjt:  GIDLREVVINVPERDQVYQPSEEELRSYLDPYENVICIECQQGGDDNLMLLCDVCDSPAHTYCVGLAREVPEGNWYCIDCTTYAHGSSSPQPPSRLSDRR

Query:  TANSLFNRTFPAANREGLDLNSLSSHRTPNIQGFGNILSPRLPVEVQSTSPMSQAVAPTLERRRILRLHINNLR-SNQMGLVNTSGDGVSAGNPCVGGTL
        T N+LFNR+FP A+R+GLDLN LSS RTP IQGFGN +SPRLP EVQSTSPMSQAVAPTL  RR +RL IN++R SNQMGLVN   DGVSA +PC GGT+
Subjt:  TANSLFNRTFPAANREGLDLNSLSSHRTPNIQGFGNILSPRLPVEVQSTSPMSQAVAPTLERRRILRLHINNLR-SNQMGLVNTSGDGVSAGNPCVGGTL

Query:  NSQRGQGRETAAENTTTQDTAILSPTLFGENLIHGSPSPLMQHIVFLESERSHVSRQVLQGSHLSMPID-----GTILNSLRGLEVENLTVV-ARNLNG-
        NSQ  QGRE  AE+T  Q+T I S TLFGE+L H +PSPLMQH  FL+SE SH+  Q ++  HLS+P +     GT LN L  L VEN T    + +NG 
Subjt:  NSQRGQGRETAAENTTTQDTAILSPTLFGENLIHGSPSPLMQHIVFLESERSHVSRQVLQGSHLSMPID-----GTILNSLRGLEVENLTVV-ARNLNG-

Query:  --PELARVSSQSNCEQVQIDRYNDVLNTGSVNSSL-PLGGDEKDYCAEREHLQPMIESHLKNYLSRD-DLGEIAANDIATEAASTILAACGFKHSINEAC
          P L  V+  SNCE  QI  ++ VLNT S N SL PL  DEKDY A RE LQP+I+SHLK+ LSRD DLG+  A DIA +A++TILAACGF+H INEA 
Subjt:  --PELARVSSQSNCEQVQIDRYNDVLNTGSVNSSL-PLGGDEKDYCAEREHLQPMIESHLKNYLSRD-DLGEIAANDIATEAASTILAACGFKHSINEAC

Query:  RASPPSQCSHIELSTGEEERSLIKGLCSSCFDLYVRDVVKKITDD----SWLNLRL
        RASPPSQCSHIELS GE  RSLI+GLCS+CFD YVRD+VKKITDD    SWL+L L
Subjt:  RASPPSQCSHIELSTGEEERSLIKGLCSSCFDLYVRDVVKKITDD----SWLNLRL

A0A6J1EU37 uncharacterized protein LOC111437762 isoform X10.0e+0070.8Show/hide
Query:  MCEEQMVRGGKVDSRKNFKKKFRSKDKAASDDSDEDYVVSSD--GVSEGSDENYGSYLDGNASGEDNFV------VEQQQRNKPRNGVPPKARTAFGSCK
        MCEEQMVRGGKV SRKNFKKKFR KDK  SDDSDEDYVVS+D  GVSE SDE+  SYLD NASGEDNFV      VE+QQ NK R GV PK     GS K
Subjt:  MCEEQMVRGGKVDSRKNFKKKFRSKDKAASDDSDEDYVVSSD--GVSEGSDENYGSYLDGNASGEDNFV------VEQQQRNKPRNGVPPKARTAFGSCK

Query:  SRKKNGRESLIFTYQEEE-----DGDEDYKVDDDNDYEEEEDVHADEE----------IEDEDEDFMLEEEDFLDEEEPMVRKRATNMKRGQEGVSKSTV
         RKKNGR+   F+YQEEE     + DEDY +D+DNDYEEEE+   +EE          IEDEDE+FM+EEEDF DEEEP+VRK+ATN KRG++GV K+TV
Subjt:  SRKKNGRESLIFTYQEEE-----DGDEDYKVDDDNDYEEEEDVHADEE----------IEDEDEDFMLEEEDFLDEEEPMVRKRATNMKRGQEGVSKSTV

Query:  GKVCKKRKPNAAKKPSRNKRRKKSGSRTIRNSDDDDGDFSDNYCVV--TRRKKAVSKRKNHAAQLDLDTFLSGSSDYEYTISEEEREQMREAEKLCYHLR
        GKVCKKRKP AAKKP+RNKRRKK  S T+R+SDDDD DFS+N  VV  TR+K++VSKRKNH  Q DLDTF+SGSSDYEYTISEEEREQ+REAEKLC HLR
Subjt:  GKVCKKRKPNAAKKPSRNKRRKKSGSRTIRNSDDDDGDFSDNYCVV--TRRKKAVSKRKNHAAQLDLDTFLSGSSDYEYTISEEEREQMREAEKLCYHLR

Query:  NRGRTVPSPRRIGEYDLRQQRNPRASVRKGKEKVEEVKAEVIKQVCGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTISKPAKS
        NR RTVPSP RI E + RQ R PR  VRKGKEK+E+VKAEVIKQVCGICL+EEDKRR+RGTLN CSH+FCFSCIMEWAKVESRCPLCKQRF+TISKPA+S
Subjt:  NRGRTVPSPRRIGEYDLRQQRNPRASVRKGKEKVEEVKAEVIKQVCGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTISKPAKS

Query:  TVGIDLREVVINVPERDQVYQPSEEELRSYLDPYENVICIECQQGGDDNLMLLCDVCDSPAHTYCVGLAREVPEGNWYCIDCTTYAHGSSSPQ-PPSRLS
          GIDLREVVI VPERDQVYQPSEEELRSYLDPYENVIC EC +GGDDNLMLLCD+CDSPAHTYCVGL REVPEGNWYC+DCT  A G SSPQ PP+RLS
Subjt:  TVGIDLREVVINVPERDQVYQPSEEELRSYLDPYENVICIECQQGGDDNLMLLCDVCDSPAHTYCVGLAREVPEGNWYCIDCTTYAHGSSSPQ-PPSRLS

Query:  DRRTANSLFNRTFPAANREGLDLNSLSSHRTPNIQGFGNILSPRLPVEVQSTSPMSQAVAPTLERRRILRLHINNLR-SNQMGLVNTSGDGVS----AGN
        DRRT N+LFNR+FP ANR+GLDLNSLSS RTP  QGFG+I SPR+PVEVQSTSPMSQAVAPTL RRRILRLHIN++R SNQMGLVN   +GVS    A +
Subjt:  DRRTANSLFNRTFPAANREGLDLNSLSSHRTPNIQGFGNILSPRLPVEVQSTSPMSQAVAPTLERRRILRLHINNLR-SNQMGLVNTSGDGVS----AGN

Query:  PCVGGTLNSQRGQGRETAAENTTTQDTAILSPTLFGENLIHGSPSPLMQHIVFLESERSHVSRQVLQGSHLSM-------PIDGTILNSLR-GLEVENLT
         C G T+                 Q+T + S +LFGE+L+H +PSPLMQH VFL SE + +S QV+Q  HLSM         +GTI N+LR G + EN T
Subjt:  PCVGGTLNSQRGQGRETAAENTTTQDTAILSPTLFGENLIHGSPSPLMQHIVFLESERSHVSRQVLQGSHLSM-------PIDGTILNSLR-GLEVENLT

Query:  VVARNLNGP---ELARVSSQSNCEQVQIDRYNDVLNTGSVNSSLPLG-GDEKDYCAEREHLQPMIESHLKNYLSRD-DL-GEIAANDIATEAASTILAAC
         V RNLNG    ELA V+  SNCE   I  Y+++LNTGS +SS P   GDEKDYCA REHLQPMIESHLKNYLS D DL G   AN+IAT+AA TILAAC
Subjt:  VVARNLNGP---ELARVSSQSNCEQVQIDRYNDVLNTGSVNSSLPLG-GDEKDYCAEREHLQPMIESHLKNYLSRD-DL-GEIAANDIATEAASTILAAC

Query:  GFKHSINEACRASPPSQCSHIELSTGEEERSLIKGLCSSCFDLYVRDVVKKITDD-SWLNLRL
        GFK+SINEACRA+PPSQCSHIELS GEE+RSLIKGLCS+CFD YVRDVVKK TDD SWL+L L
Subjt:  GFKHSINEACRASPPSQCSHIELSTGEEERSLIKGLCSSCFDLYVRDVVKKITDD-SWLNLRL

A0A6J1EZ47 uncharacterized protein LOC111437762 isoform X20.0e+0070.88Show/hide
Query:  MCEEQMVRGGKVDSRKNFKKKFRSKDKAASDDSDEDYVVSSD--GVSEGSDENYGSYLDGNASGEDNFV------VEQQQRNKPRNGVPPKARTAFGSCK
        MCEEQMVRGGKV SRKNFKKKFR KDK  SDDSDEDYVVS+D  GVSE SDE+  SYLD NASGEDNFV      VE+QQ NK R GV PK     GS K
Subjt:  MCEEQMVRGGKVDSRKNFKKKFRSKDKAASDDSDEDYVVSSD--GVSEGSDENYGSYLDGNASGEDNFV------VEQQQRNKPRNGVPPKARTAFGSCK

Query:  SRKKNGRESLIFTYQEEE-----DGDEDYKVDDDNDYEEEEDVHADEE----------IEDEDEDFMLEEEDFLDEEEPMVRKRATNMKRGQEGVSKSTV
         RKKNGR+   F+YQEEE     + DEDY +D+DNDYEEEE+   +EE          IEDEDE+FM+EEEDF DEEEP+VRK+ATN KRG++GV K+TV
Subjt:  SRKKNGRESLIFTYQEEE-----DGDEDYKVDDDNDYEEEEDVHADEE----------IEDEDEDFMLEEEDFLDEEEPMVRKRATNMKRGQEGVSKSTV

Query:  GKVCKKRKPNAAKKPSRNKRRKKSGSRTIRNSDDDDGDFSDNYCVV--TRRKKAVSKRKNHAAQLDLDTFLSGSSDYEYTISEEEREQMREAEKLCYHLR
        GKVCKKRKP AAKKP+RNKRRKK  S T+R+SDDDD DFS+N  VV  TR+K++VSKRKNH  Q DLDTF+SGSSDYEYTISEEEREQ+REAEKLC HLR
Subjt:  GKVCKKRKPNAAKKPSRNKRRKKSGSRTIRNSDDDDGDFSDNYCVV--TRRKKAVSKRKNHAAQLDLDTFLSGSSDYEYTISEEEREQMREAEKLCYHLR

Query:  NRGRTVPSPRRIGEYDLRQQRNPRASVRKGKEKVEEVKAEVIKQVCGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTISKPAKS
        NR RTVPSP RI E + RQ R PR  VRKGKEK+E+VKAEVIKQVCGICL+EEDKRR+RGTLN CSH+FCFSCIMEWAKVESRCPLCKQRF+TISKPA+S
Subjt:  NRGRTVPSPRRIGEYDLRQQRNPRASVRKGKEKVEEVKAEVIKQVCGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTISKPAKS

Query:  TVGIDLREVVINVPERDQVYQPSEEELRSYLDPYENVICIECQQGGDDNLMLLCDVCDSPAHTYCVGLAREVPEGNWYCIDCTTYAHGSSSPQ-PPSRLS
          GIDLREVVI VPERDQVYQPSEEELRSYLDPYENVIC EC +GGDDNLMLLCD+CDSPAHTYCVGL REVPEGNWYC+DCT  A G SSPQ PP+RLS
Subjt:  TVGIDLREVVINVPERDQVYQPSEEELRSYLDPYENVICIECQQGGDDNLMLLCDVCDSPAHTYCVGLAREVPEGNWYCIDCTTYAHGSSSPQ-PPSRLS

Query:  DRRTANSLFNRTFPAANREGLDLNSLSSHRTPNIQGFGNILSPRLPVEVQSTSPMSQAVAPTLERRRILRLHINNLR-SNQMGLVNTSGDGVS----AGN
        DRRT N+LFNR+FP ANR+GLDLNSLSS RTP  QGFG+I SPR+PVEVQSTSPMSQAVAPTL RRRILRLHIN++R SNQMGLVN   +GVS    A +
Subjt:  DRRTANSLFNRTFPAANREGLDLNSLSSHRTPNIQGFGNILSPRLPVEVQSTSPMSQAVAPTLERRRILRLHINNLR-SNQMGLVNTSGDGVS----AGN

Query:  PCVGGTLNSQRGQGRETAAENTTTQDTAILSPTLFGENLIHGSPSPLMQHIVFLESERSHVSRQVLQGSHLSM-------PIDGTILNSLR-GLEVENLT
         C G T+                 Q+T + S +LFGE+L+H +PSPLMQH VFL SE + +S QV+Q  HLSM         +GTI N+LR G + EN T
Subjt:  PCVGGTLNSQRGQGRETAAENTTTQDTAILSPTLFGENLIHGSPSPLMQHIVFLESERSHVSRQVLQGSHLSM-------PIDGTILNSLR-GLEVENLT

Query:  VVARNLNGP---ELARVSSQSNCEQVQIDRYNDVLNTGSVNSSLPLG-GDEKDYCAEREHLQPMIESHLKNYLSRD-DLGEIAANDIATEAASTILAACG
         V RNLNG    ELA V+  SNCE   I  Y+++LNTGS +SS P   GDEKDYCA REHLQPMIESHLKNYLS D DLG   AN+IAT+AA TILAACG
Subjt:  VVARNLNGP---ELARVSSQSNCEQVQIDRYNDVLNTGSVNSSLPLG-GDEKDYCAEREHLQPMIESHLKNYLSRD-DLGEIAANDIATEAASTILAACG

Query:  FKHSINEACRASPPSQCSHIELSTGEEERSLIKGLCSSCFDLYVRDVVKKITDD-SWLNLRL
        FK+SINEACRA+PPSQCSHIELS GEE+RSLIKGLCS+CFD YVRDVVKK TDD SWL+L L
Subjt:  FKHSINEACRASPPSQCSHIELSTGEEERSLIKGLCSSCFDLYVRDVVKKITDD-SWLNLRL

A0A6J1EZX5 uncharacterized protein LOC111437762 isoform X30.0e+0070.63Show/hide
Query:  MVRGGKVDSRKNFKKKFRSKDKAASDDSDEDYVVSSD--GVSEGSDENYGSYLDGNASGEDNFV------VEQQQRNKPRNGVPPKARTAFGSCKSRKKN
        MVRGGKV SRKNFKKKFR KDK  SDDSDEDYVVS+D  GVSE SDE+  SYLD NASGEDNFV      VE+QQ NK R GV PK     GS K RKKN
Subjt:  MVRGGKVDSRKNFKKKFRSKDKAASDDSDEDYVVSSD--GVSEGSDENYGSYLDGNASGEDNFV------VEQQQRNKPRNGVPPKARTAFGSCKSRKKN

Query:  GRESLIFTYQEEE-----DGDEDYKVDDDNDYEEEEDVHADEE----------IEDEDEDFMLEEEDFLDEEEPMVRKRATNMKRGQEGVSKSTVGKVCK
        GR+   F+YQEEE     + DEDY +D+DNDYEEEE+   +EE          IEDEDE+FM+EEEDF DEEEP+VRK+ATN KRG++GV K+TVGKVCK
Subjt:  GRESLIFTYQEEE-----DGDEDYKVDDDNDYEEEEDVHADEE----------IEDEDEDFMLEEEDFLDEEEPMVRKRATNMKRGQEGVSKSTVGKVCK

Query:  KRKPNAAKKPSRNKRRKKSGSRTIRNSDDDDGDFSDNYCVV--TRRKKAVSKRKNHAAQLDLDTFLSGSSDYEYTISEEEREQMREAEKLCYHLRNRGRT
        KRKP AAKKP+RNKRRKK  S T+R+SDDDD DFS+N  VV  TR+K++VSKRKNH  Q DLDTF+SGSSDYEYTISEEEREQ+REAEKLC HLRNR RT
Subjt:  KRKPNAAKKPSRNKRRKKSGSRTIRNSDDDDGDFSDNYCVV--TRRKKAVSKRKNHAAQLDLDTFLSGSSDYEYTISEEEREQMREAEKLCYHLRNRGRT

Query:  VPSPRRIGEYDLRQQRNPRASVRKGKEKVEEVKAEVIKQVCGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTISKPAKSTVGID
        VPSP RI E + RQ R PR  VRKGKEK+E+VKAEVIKQVCGICL+EEDKRR+RGTLN CSH+FCFSCIMEWAKVESRCPLCKQRF+TISKPA+S  GID
Subjt:  VPSPRRIGEYDLRQQRNPRASVRKGKEKVEEVKAEVIKQVCGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTISKPAKSTVGID

Query:  LREVVINVPERDQVYQPSEEELRSYLDPYENVICIECQQGGDDNLMLLCDVCDSPAHTYCVGLAREVPEGNWYCIDCTTYAHGSSSPQ-PPSRLSDRRTA
        LREVVI VPERDQVYQPSEEELRSYLDPYENVIC EC +GGDDNLMLLCD+CDSPAHTYCVGL REVPEGNWYC+DCT  A G SSPQ PP+RLSDRRT 
Subjt:  LREVVINVPERDQVYQPSEEELRSYLDPYENVICIECQQGGDDNLMLLCDVCDSPAHTYCVGLAREVPEGNWYCIDCTTYAHGSSSPQ-PPSRLSDRRTA

Query:  NSLFNRTFPAANREGLDLNSLSSHRTPNIQGFGNILSPRLPVEVQSTSPMSQAVAPTLERRRILRLHINNLR-SNQMGLVNTSGDGVS----AGNPCVGG
        N+LFNR+FP ANR+GLDLNSLSS RTP  QGFG+I SPR+PVEVQSTSPMSQAVAPTL RRRILRLHIN++R SNQMGLVN   +GVS    A + C G 
Subjt:  NSLFNRTFPAANREGLDLNSLSSHRTPNIQGFGNILSPRLPVEVQSTSPMSQAVAPTLERRRILRLHINNLR-SNQMGLVNTSGDGVS----AGNPCVGG

Query:  TLNSQRGQGRETAAENTTTQDTAILSPTLFGENLIHGSPSPLMQHIVFLESERSHVSRQVLQGSHLSM-------PIDGTILNSLR-GLEVENLTVVARN
        T+                 Q+T + S +LFGE+L+H +PSPLMQH VFL SE + +S QV+Q  HLSM         +GTI N+LR G + EN T V RN
Subjt:  TLNSQRGQGRETAAENTTTQDTAILSPTLFGENLIHGSPSPLMQHIVFLESERSHVSRQVLQGSHLSM-------PIDGTILNSLR-GLEVENLTVVARN

Query:  LNGP---ELARVSSQSNCEQVQIDRYNDVLNTGSVNSSLPLG-GDEKDYCAEREHLQPMIESHLKNYLSRD-DL-GEIAANDIATEAASTILAACGFKHS
        LNG    ELA V+  SNCE   I  Y+++LNTGS +SS P   GDEKDYCA REHLQPMIESHLKNYLS D DL G   AN+IAT+AA TILAACGFK+S
Subjt:  LNGP---ELARVSSQSNCEQVQIDRYNDVLNTGSVNSSLPLG-GDEKDYCAEREHLQPMIESHLKNYLSRD-DL-GEIAANDIATEAASTILAACGFKHS

Query:  INEACRASPPSQCSHIELSTGEEERSLIKGLCSSCFDLYVRDVVKKITDD-SWLNLRL
        INEACRA+PPSQCSHIELS GEE+RSLIKGLCS+CFD YVRDVVKK TDD SWL+L L
Subjt:  INEACRASPPSQCSHIELSTGEEERSLIKGLCSSCFDLYVRDVVKKITDD-SWLNLRL

SwissProt top hitse value%identityAlignment
A6H619 PHD and RING finger domain-containing protein 11.1e-1231.65Show/hide
Query:  CGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTISKPAKSTVGIDLREVVINVPERDQVYQPSEEELRSYLDPYENVICIECQQG
        C ICL     + + GT  +C+HYFC  CI+EW++  + CP+ +  FK I   A+        +++  +P  +     +EEE     DP     C  C + 
Subjt:  CGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTISKPAKSTVGIDLREVVINVPERDQVYQPSEEELRSYLDPYENVICIECQQG

Query:  GDDNLMLLCDVCDSPAHTYCVG-LAREVPEGNWYCIDCT
          ++ +LLCD CD+  H  C+    +EVP   W+C +CT
Subjt:  GDDNLMLLCDVCDSPAHTYCVG-LAREVPEGNWYCIDCT

O94400 PHD and RING finger domain-containing protein C126.07c2.5e-0928.46Show/hide
Query:  CSHYFCFSCIMEWAKVESRCPLCKQRFKTISKPAKSTVGIDLREVVINVPERDQVYQPSEEELRSYLDPYEN-----VICIECQQGGDDNLMLLCDVCDS
        C HYF   C+  W +V + CPLC+  F          + +D+ E V     R    +   + + +  +P+E+       C+ C +     ++LLCD CD 
Subjt:  CSHYFCFSCIMEWAKVESRCPLCKQRFKTISKPAKSTVGIDLREVVINVPERDQVYQPSEEELRSYLDPYEN-----VICIECQQGGDDNLMLLCDVCDS

Query:  PAHTYCVGLAREVPEGNWYCIDC
          HTYC+ +   VP   +YC +C
Subjt:  PAHTYCVGLAREVPEGNWYCIDC

Q61T02 Lysine-specific demethylase rbr-26.6e-1034.91Show/hide
Query:  VESRCPLCKQRFKTISKPAKSTVGIDLREVVINVPERDQVYQPSEEELRSYLDPYENVICIECQQGGDDNLMLLCDV--CDSPAHTYCVG-LAREVPEGN
        VE  CP+  Q  +  SK  K                R +    +EE+     D  E V C+ C +G D++L+LLCD+  C+S  HTYC   +  EVPEG 
Subjt:  VESRCPLCKQRFKTISKPAKSTVGIDLREVVINVPERDQVYQPSEEELRSYLDPYENVICIECQQGGDDNLMLLCDV--CDSPAHTYCVG-LAREVPEGN

Query:  WYCIDC
        W C  C
Subjt:  WYCIDC

Q63625 PHD and RING finger domain-containing protein 15.4e-1231.16Show/hide
Query:  CGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTISKPAKSTVGIDLREVVINVPERDQVYQPSEEELRSYLDPYENVICIECQQG
        C ICL     + + GT  +C+HYFC  CI+EW++  + CP+ +  FK I   A+   G  L+++ +      +  +  EE      DP     C  C + 
Subjt:  CGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTISKPAKSTVGIDLREVVINVPERDQVYQPSEEELRSYLDPYENVICIECQQG

Query:  GDDNLMLLCDVCDSPAHTYCVG-LAREVPEGNWYCIDC
          ++ +LLCD CD+  H  C+    +EVP   W+C +C
Subjt:  GDDNLMLLCDVCDSPAHTYCVG-LAREVPEGNWYCIDC

Q9P1Y6 PHD and RING finger domain-containing protein 11.8e-1234.06Show/hide
Query:  CGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTISKPAKSTVGIDLREVVINVPERDQVYQPSEEELRSYLDPYENVICIECQQG
        C ICL     + + GT  +C+HYFC  CI+EW+K  + CP+ +  FK I   A+   G  LR++ +      +  + SEEE     DP     C  C + 
Subjt:  CGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTISKPAKSTVGIDLREVVINVPERDQVYQPSEEELRSYLDPYENVICIECQQG

Query:  GDDNLMLLCDVCDSPAHTYCVG-LAREVPEGNWYCIDC
          ++ +LLCD CD+  H  C+    +EVP   W+C +C
Subjt:  GDDNLMLLCDVCDSPAHTYCVG-LAREVPEGNWYCIDC

Arabidopsis top hitse value%identityAlignment
AT3G01460.1 methyl-CPG-binding domain 91.3e-0826.9Show/hide
Query:  DKRRIRGTLNSCSHYFCFSCIMEWAKVE---SRCPLCKQRFKTISKPAKSTVGIDLREVVINVPERDQVYQPSEEELRSYLD-----------PYENVIC
        D R   G  +  +  F    +  W+ +    +  P C     T+S+  KS    ++  +V  + +  ++   S E  +   D           P++  +C
Subjt:  DKRRIRGTLNSCSHYFCFSCIMEWAKVE---SRCPLCKQRFKTISKPAKSTVGIDLREVVINVPERDQVYQPSEEELRSYLD-----------PYENVIC

Query:  IECQQGGDDNLMLLCDVCDSPAHTYCVG--LAREVPEGNWYCIDC
          C    DD+ +LLCD CD+  HTYC+   L R +P+GNWYC  C
Subjt:  IECQQGGDDNLMLLCDVCDSPAHTYCVG--LAREVPEGNWYCIDC

AT3G05670.1 RING/U-box protein2.0e-11838.02Show/hide
Query:  KNFKKKFRSKDKAASDDSDEDYVVSSDGVSEGS---DENYGSYLDGNAS-----GEDNFVVEQQQRNK-------------PRNGVPPKARTAFGSCKSR
        +N  +K RSKDK  SD+SDEDYV+S +   E      E Y S +DG  S     G D   VE ++ ++             P+    P+       CKSR
Subjt:  KNFKKKFRSKDKAASDDSDEDYVVSSDGVSEGS---DENYGSYLDGNAS-----GEDNFVVEQQQRNK-------------PRNGVPPKARTAFGSCKSR

Query:  KKN---------------------------------------------GRESLIFTYQEEEDGDEDYKVDDDNDYEEEEDVH------------------
        K N                                             G+   +F   E+EDGD     ++D D EEE DV                   
Subjt:  KKN---------------------------------------------GRESLIFTYQEEEDGDEDYKVDDDNDYEEEEDVH------------------

Query:  --------------ADEEIEDE----DEDFMLEEEDFLDEEEP----MVRKRATNMKRGQEGVSKSTVGKVCKKRKPNAAKKPSRNKRRKKSGSRTIRNS
                       D E EDE    DEDF  +E+  LDEEE        K A  + +G +   +S  G+  K+RK  +  K    + RK+    T +  
Subjt:  --------------ADEEIEDE----DEDFMLEEEDFLDEEEP----MVRKRATNMKRGQEGVSKSTVGKVCKKRKPNAAKKPSRNKRRKKSGSRTIRNS

Query:  DDDDGDFSDNYCVVTRRKKAVSK------RKNHAAQLDLDTFLSGSSDYEYTISEEEREQMREAEKLCYHLRNRGRTVPSPRRIGEYDLRQQRNPRASVR
        D+DD DF D+   +  RKKA +K      R+      D D   SG SDYEYTISEEEREQ+REA  L     N   ++   +     DL Q R  ++ V+
Subjt:  DDDDGDFSDNYCVVTRRKKAVSK------RKNHAAQLDLDTFLSGSSDYEYTISEEEREQMREAEKLCYHLRNRGRTVPSPRRIGEYDLRQQRNPRASVR

Query:  KGKEKVEEVKAEVIKQVCGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTISKPAKSTVGIDLREVVINVPERDQVYQPSEEELR
        KG++KVE VK +VIK VCGICL+EED RR++GTL+ CSHYFCF+CIMEW+KVESRCPLCKQRF+TISKPA+ST G+DLREVVI VPERDQVYQP+EEELR
Subjt:  KGKEKVEEVKAEVIKQVCGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTISKPAKSTVGIDLREVVINVPERDQVYQPSEEELR

Query:  SYLDPYENVICIECQQGGDDNLMLLCDVCDSPAHTYCVGLAREVPEGNWYCIDCTTYAHGSSSPQPPSRLSDRRTANSLFNRTFP-AANREGLDLNSLSS
        SYLDPYEN+IC EC QG DD LMLLCD+CDS AHTYCVGL REVPEGNWYC  C   A GS+S Q    +S+++  +  ++R  P   + +  D++ + S
Subjt:  SYLDPYENVICIECQQGGDDNLMLLCDVCDSPAHTYCVGLAREVPEGNWYCIDCTTYAHGSSSPQPPSRLSDRRTANSLFNRTFP-AANREGLDLNSLSS

Query:  HRTPNIQGFGNILSPRLP-VEVQSTSPMSQAVAPTLERRRILRLHINNLRSNQMGLVNTSGDGVSAGNPCVGGTLNSQRGQGR------ETAAENTTTQD
         RTP   G  N+ SPR+P  + Q +SP S   A TL RRR L  HI N+  N   L+N    G   G     G + +Q G GR        A++ T    
Subjt:  HRTPNIQGFGNILSPRLP-VEVQSTSPMSQAVAPTLERRRILRLHINNLRSNQMGLVNTSGDGVSAGNPCVGGTLNSQRGQGR------ETAAENTTTQD

Query:  TAILSPTLFGENLIHGSPSPLMQHIVFLESERSHVSRQVLQGSHL-SMPIDGTILNSLRGLEVENLTVVARNLNGPELARVSSQSNCEQVQIDRYNDVLN
         AI    L   N +  +  P +      E       R++   + L   PID   L    GL             G  L  VS+Q              L+
Subjt:  TAILSPTLFGENLIHGSPSPLMQHIVFLESERSHVSRQVLQGSHL-SMPIDGTILNSLRGLEVENLTVVARNLNGPELARVSSQSNCEQVQIDRYNDVLN

Query:  TGSVNSSLPLGGDEKDYCAEREHLQPMIESHLKNYLSRDDLGEIAANDIATEAASTILAACGFKHSINEACRASPPSQCSHIELSTGEEE-----RSLIK
        +   N+   + G         E LQ  +++HLKN  S++DLG+   ++I+T +  TILAACG +H  +E     PP  C+H  ++ G         SL+K
Subjt:  TGSVNSSLPLGGDEKDYCAEREHLQPMIESHLKNYLSRDDLGEIAANDIATEAASTILAACGFKHSINEACRASPPSQCSHIELSTGEEE-----RSLIK

Query:  GLCSSCFDLYVRDVVKKITD---DSWLNLRL
        G C SCFD +V DVVK I D     WL+L L
Subjt:  GLCSSCFDLYVRDVVKKITD---DSWLNLRL

AT4G10940.1 RING/U-box protein1.6e-1131.48Show/hide
Query:  EVIKQVCGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTIS------KPAKSTVGIDL---REVVINVPERDQVYQPSEEELRSY
        E+  + CGIC+   D    RG L+ C H+FCF CI  W+ + + CPLC++ F+ I+          S V  DL    E      E D V  PS     ++
Subjt:  EVIKQVCGICLTEEDKRRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTIS------KPAKSTVGIDL---REVVINVPERDQVYQPSEEELRSY

Query:  LDPYENVICIE---CQ-------QGGDDNL--MLLCDVCDSPAHTYCVGL-AREVPEGNWYC
              V+C++   C+         GD NL   + CD CD   H  CVG       E  W C
Subjt:  LDPYENVICIE---CQ-------QGGDDNL--MLLCDVCDSPAHTYCVGL-AREVPEGNWYC

AT5G09790.1 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 51.8e-0736.9Show/hide
Query:  EEELRSYLDPYENVICIECQQGGDDNLMLLCDVCDSPAHTYCV-GLAREVPEGNWYCIDCTTYAHGSSSPQPPSRLSDRRTANS
        E+E     D Y NV C +C  G  D+ +LLCD CD   H  C+  +   VP G W C+DC        S Q P R   R+   S
Subjt:  EEELRSYLDPYENVICIECQQGGDDNLMLLCDVCDSPAHTYCV-GLAREVPEGNWYCIDCTTYAHGSSSPQPPSRLSDRRTANS

AT5G09790.2 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 53.7e-0837.5Show/hide
Query:  EEELRSYLDPYENVICIECQQGGDDNLMLLCDVCDSPAHTYCV-GLAREVPEGNWYCIDCTTYAHGSSSPQPPSRLSDRR
        E+E     D Y NV C +C  G  D+ +LLCD CD   H  C+  +   VP G W C+DC       S  +P  RLS ++
Subjt:  EEELRSYLDPYENVICIECQQGGDDNLMLLCDVCDSPAHTYCV-GLAREVPEGNWYCIDCTTYAHGSSSPQPPSRLSDRR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGCGAGGAACAGATGGTAAGGGGAGGGAAGGTTGATAGTAGGAAAAATTTCAAGAAGAAATTTCGATCCAAAGACAAGGCTGCTTCCGACGATTCCGATGAGGATTA
TGTTGTCTCCAGTGATGGGGTTTCTGAGGGTTCTGATGAGAATTATGGCTCTTATTTGGATGGAAATGCATCGGGAGAGGATAATTTTGTTGTCGAGCAGCAGCAGCGGA
ACAAGCCTAGGAATGGAGTCCCACCCAAGGCCAGAACTGCTTTCGGTTCGTGTAAATCAAGGAAGAAAAATGGGAGAGAGAGTCTAATTTTTACTTATCAAGAAGAAGAG
GATGGGGATGAGGATTATAAAGTAGATGATGATAACGATTACGAGGAGGAGGAAGATGTTCATGCTGATGAGGAGATTGAGGACGAGGACGAGGACTTTATGTTGGAAGA
AGAAGATTTTTTGGATGAGGAAGAACCGATGGTTAGAAAGAGAGCAACCAATATGAAACGAGGTCAAGAAGGGGTGAGCAAAAGTACTGTTGGCAAAGTTTGCAAAAAGA
GGAAACCAAACGCTGCAAAGAAGCCTTCGAGGAATAAACGGAGAAAGAAGAGTGGCTCACGGACTATAAGAAACTCAGATGACGACGACGGCGATTTTTCGGACAATTAC
TGTGTAGTAACAAGAAGGAAAAAGGCTGTTTCAAAGCGGAAAAATCATGCTGCACAGTTAGATTTAGATACATTCCTTTCTGGGTCATCAGATTATGAATATACCATCTC
TGAAGAAGAAAGAGAGCAGATGAGAGAAGCTGAGAAGTTGTGTTATCATTTGAGAAATCGAGGGAGGACTGTACCTTCACCCCGAAGAATAGGTGAGTATGATCTACGCC
AGCAGAGAAATCCTCGAGCTTCTGTCAGGAAAGGCAAAGAGAAAGTAGAGGAAGTCAAGGCTGAGGTGATAAAACAAGTATGCGGGATATGCCTAACGGAAGAAGATAAG
AGGAGAATTCGGGGAACCCTCAACAGTTGTTCTCACTATTTTTGTTTCTCTTGTATTATGGAGTGGGCTAAGGTTGAATCACGATGCCCTTTGTGCAAGCAGAGGTTTAA
GACAATTAGTAAGCCTGCAAAATCTACCGTTGGAATTGACTTAAGAGAAGTTGTCATAAATGTTCCTGAGCGTGATCAGGTGTATCAACCATCAGAGGAGGAGTTGAGGA
GTTATCTCGATCCCTATGAAAATGTAATCTGTATCGAGTGTCAGCAAGGCGGGGATGATAATCTTATGTTACTGTGTGATGTATGTGACTCACCAGCACACACCTATTGC
GTCGGTCTTGCTAGAGAAGTACCAGAAGGGAATTGGTACTGTATTGATTGTACGACCTATGCTCATGGATCATCAAGTCCACAGCCTCCAAGCCGACTGTCTGATCGAAG
AACAGCCAACTCTTTGTTTAACAGAACTTTCCCTGCTGCAAATCGGGAGGGTTTAGACTTGAATTCTCTATCTTCACATCGTACTCCAAATATTCAAGGATTTGGAAACA
TCCTGTCTCCTCGATTACCGGTGGAAGTCCAATCAACTTCTCCCATGTCTCAAGCAGTCGCACCAACTCTTGAAAGAAGACGAATTCTTCGGCTGCACATCAATAACTTG
CGCTCGAATCAAATGGGATTAGTAAACACCAGTGGTGATGGTGTTTCTGCTGGTAATCCTTGTGTTGGGGGTACCTTAAATTCGCAGAGGGGTCAAGGTAGGGAAACAGC
AGCTGAAAATACGACAACTCAAGATACAGCTATATTGAGCCCAACTTTGTTTGGGGAGAACTTAATACATGGCAGTCCCTCTCCATTGATGCAACATATAGTTTTTCTTG
AGTCTGAAAGAAGCCACGTGTCAAGGCAAGTATTACAAGGTTCTCACCTTAGCATGCCAATAGATGGAACAATTTTGAACTCTTTAAGAGGATTAGAAGTTGAAAATTTG
ACCGTTGTTGCTCGAAATTTGAATGGGCCTGAGCTTGCAAGAGTGAGTTCACAATCGAACTGTGAGCAAGTTCAGATTGATCGATACAACGATGTTTTAAATACAGGCTC
TGTTAACAGTTCGCTCCCTTTGGGTGGAGATGAGAAGGATTATTGTGCAGAGAGAGAGCATCTACAGCCCATGATCGAAAGCCATTTGAAGAATTATTTATCCCGAGATG
ATCTAGGTGAAATTGCAGCAAATGACATCGCGACGGAAGCTGCTAGCACGATTTTAGCAGCATGTGGGTTCAAACACTCAATAAACGAGGCTTGCAGGGCATCACCACCA
TCCCAATGTTCCCACATTGAATTGTCAACAGGGGAAGAAGAAAGAAGTCTAATAAAGGGCCTGTGCTCGTCTTGTTTCGACTTGTATGTTCGGGATGTAGTGAAGAAGAT
TACGGACGACTCGTGGTTGAATCTCAGGCTATAG
mRNA sequenceShow/hide mRNA sequence
AAAGAAATTTAATAATATTGTTTTTGAAAGAAAGAAAGAAAGAAAAAAAAGGAAAATATTTTTGCAATTGTCCGTGGTGTGTTTCCTGCTGCTTTATTCCTCCTCTCTCA
GGAATGAAGCAGATCGAGTATGGGGAAATCCTTTGTAGTTAGTGGGGGGTTTAAATGTACACATCCAAAATTGGGTGTTTCGACATTTATGCGGGACTTATCCAGGAATT
TTCAATTTTTGTCTCTCTGTTCGGTTTCTTTAGGTTGGATGCTGAGGGATTGGTTGGTTGAGGAATTCTGATTCTGCTTTTTGTGGACTGAATTTTTGTGATTCTGTTGT
TTTTGGCGAATTTGTTTTATGTGCGAGGAACAGATGGTAAGGGGAGGGAAGGTTGATAGTAGGAAAAATTTCAAGAAGAAATTTCGATCCAAAGACAAGGCTGCTTCCGA
CGATTCCGATGAGGATTATGTTGTCTCCAGTGATGGGGTTTCTGAGGGTTCTGATGAGAATTATGGCTCTTATTTGGATGGAAATGCATCGGGAGAGGATAATTTTGTTG
TCGAGCAGCAGCAGCGGAACAAGCCTAGGAATGGAGTCCCACCCAAGGCCAGAACTGCTTTCGGTTCGTGTAAATCAAGGAAGAAAAATGGGAGAGAGAGTCTAATTTTT
ACTTATCAAGAAGAAGAGGATGGGGATGAGGATTATAAAGTAGATGATGATAACGATTACGAGGAGGAGGAAGATGTTCATGCTGATGAGGAGATTGAGGACGAGGACGA
GGACTTTATGTTGGAAGAAGAAGATTTTTTGGATGAGGAAGAACCGATGGTTAGAAAGAGAGCAACCAATATGAAACGAGGTCAAGAAGGGGTGAGCAAAAGTACTGTTG
GCAAAGTTTGCAAAAAGAGGAAACCAAACGCTGCAAAGAAGCCTTCGAGGAATAAACGGAGAAAGAAGAGTGGCTCACGGACTATAAGAAACTCAGATGACGACGACGGC
GATTTTTCGGACAATTACTGTGTAGTAACAAGAAGGAAAAAGGCTGTTTCAAAGCGGAAAAATCATGCTGCACAGTTAGATTTAGATACATTCCTTTCTGGGTCATCAGA
TTATGAATATACCATCTCTGAAGAAGAAAGAGAGCAGATGAGAGAAGCTGAGAAGTTGTGTTATCATTTGAGAAATCGAGGGAGGACTGTACCTTCACCCCGAAGAATAG
GTGAGTATGATCTACGCCAGCAGAGAAATCCTCGAGCTTCTGTCAGGAAAGGCAAAGAGAAAGTAGAGGAAGTCAAGGCTGAGGTGATAAAACAAGTATGCGGGATATGC
CTAACGGAAGAAGATAAGAGGAGAATTCGGGGAACCCTCAACAGTTGTTCTCACTATTTTTGTTTCTCTTGTATTATGGAGTGGGCTAAGGTTGAATCACGATGCCCTTT
GTGCAAGCAGAGGTTTAAGACAATTAGTAAGCCTGCAAAATCTACCGTTGGAATTGACTTAAGAGAAGTTGTCATAAATGTTCCTGAGCGTGATCAGGTGTATCAACCAT
CAGAGGAGGAGTTGAGGAGTTATCTCGATCCCTATGAAAATGTAATCTGTATCGAGTGTCAGCAAGGCGGGGATGATAATCTTATGTTACTGTGTGATGTATGTGACTCA
CCAGCACACACCTATTGCGTCGGTCTTGCTAGAGAAGTACCAGAAGGGAATTGGTACTGTATTGATTGTACGACCTATGCTCATGGATCATCAAGTCCACAGCCTCCAAG
CCGACTGTCTGATCGAAGAACAGCCAACTCTTTGTTTAACAGAACTTTCCCTGCTGCAAATCGGGAGGGTTTAGACTTGAATTCTCTATCTTCACATCGTACTCCAAATA
TTCAAGGATTTGGAAACATCCTGTCTCCTCGATTACCGGTGGAAGTCCAATCAACTTCTCCCATGTCTCAAGCAGTCGCACCAACTCTTGAAAGAAGACGAATTCTTCGG
CTGCACATCAATAACTTGCGCTCGAATCAAATGGGATTAGTAAACACCAGTGGTGATGGTGTTTCTGCTGGTAATCCTTGTGTTGGGGGTACCTTAAATTCGCAGAGGGG
TCAAGGTAGGGAAACAGCAGCTGAAAATACGACAACTCAAGATACAGCTATATTGAGCCCAACTTTGTTTGGGGAGAACTTAATACATGGCAGTCCCTCTCCATTGATGC
AACATATAGTTTTTCTTGAGTCTGAAAGAAGCCACGTGTCAAGGCAAGTATTACAAGGTTCTCACCTTAGCATGCCAATAGATGGAACAATTTTGAACTCTTTAAGAGGA
TTAGAAGTTGAAAATTTGACCGTTGTTGCTCGAAATTTGAATGGGCCTGAGCTTGCAAGAGTGAGTTCACAATCGAACTGTGAGCAAGTTCAGATTGATCGATACAACGA
TGTTTTAAATACAGGCTCTGTTAACAGTTCGCTCCCTTTGGGTGGAGATGAGAAGGATTATTGTGCAGAGAGAGAGCATCTACAGCCCATGATCGAAAGCCATTTGAAGA
ATTATTTATCCCGAGATGATCTAGGTGAAATTGCAGCAAATGACATCGCGACGGAAGCTGCTAGCACGATTTTAGCAGCATGTGGGTTCAAACACTCAATAAACGAGGCT
TGCAGGGCATCACCACCATCCCAATGTTCCCACATTGAATTGTCAACAGGGGAAGAAGAAAGAAGTCTAATAAAGGGCCTGTGCTCGTCTTGTTTCGACTTGTATGTTCG
GGATGTAGTGAAGAAGATTACGGACGACTCGTGGTTGAATCTCAGGCTATAGGGAGGCTACCATTTGGCATCCCTTTAGTTCAGGGGTTAATTGTTGTTAACTTGTCCAT
ATAGTTGATGCGCTGACATTTAAGTCATAGTTTTATTGTTCAAGTTGCATGTCATTTCCCATTGTTGAAATGAAGCTACCTGCATGTTGACCCTA
Protein sequenceShow/hide protein sequence
MCEEQMVRGGKVDSRKNFKKKFRSKDKAASDDSDEDYVVSSDGVSEGSDENYGSYLDGNASGEDNFVVEQQQRNKPRNGVPPKARTAFGSCKSRKKNGRESLIFTYQEEE
DGDEDYKVDDDNDYEEEEDVHADEEIEDEDEDFMLEEEDFLDEEEPMVRKRATNMKRGQEGVSKSTVGKVCKKRKPNAAKKPSRNKRRKKSGSRTIRNSDDDDGDFSDNY
CVVTRRKKAVSKRKNHAAQLDLDTFLSGSSDYEYTISEEEREQMREAEKLCYHLRNRGRTVPSPRRIGEYDLRQQRNPRASVRKGKEKVEEVKAEVIKQVCGICLTEEDK
RRIRGTLNSCSHYFCFSCIMEWAKVESRCPLCKQRFKTISKPAKSTVGIDLREVVINVPERDQVYQPSEEELRSYLDPYENVICIECQQGGDDNLMLLCDVCDSPAHTYC
VGLAREVPEGNWYCIDCTTYAHGSSSPQPPSRLSDRRTANSLFNRTFPAANREGLDLNSLSSHRTPNIQGFGNILSPRLPVEVQSTSPMSQAVAPTLERRRILRLHINNL
RSNQMGLVNTSGDGVSAGNPCVGGTLNSQRGQGRETAAENTTTQDTAILSPTLFGENLIHGSPSPLMQHIVFLESERSHVSRQVLQGSHLSMPIDGTILNSLRGLEVENL
TVVARNLNGPELARVSSQSNCEQVQIDRYNDVLNTGSVNSSLPLGGDEKDYCAEREHLQPMIESHLKNYLSRDDLGEIAANDIATEAASTILAACGFKHSINEACRASPP
SQCSHIELSTGEEERSLIKGLCSSCFDLYVRDVVKKITDDSWLNLRL