; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021691 (gene) of Snake gourd v1 genome

Gene IDTan0021691
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRING/U-box protein, putative isoform 2
Genome locationLG09:5386725..5389469
RNA-Seq ExpressionTan0021691
SyntenyTan0021691
Gene Ontology termsGO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001841 - Zinc finger, RING-type
IPR001965 - Zinc finger, PHD-type
IPR011011 - Zinc finger, FYVE/PHD-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR017907 - Zinc finger, RING-type, conserved site
IPR019787 - Zinc finger, PHD-finger


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011659696.1 uncharacterized protein LOC101205950 isoform X1 [Cucumis sativus]0.0e+0080Show/hide
Query:  MCEEQMVRGGKVGSRKNFKKKFRPKDKGSDDSDEDYVVSNDENGISECSDEGYCYYLDENASGEDNFIVEEEVEEQQPKKVRKGVGPKARNAFSSHKARK
        MCEEQMVRGG+V SRKNFKKKFR KDKGSDDSDEDYVVS+D+NG+SE SDE YC  LDENASGEDN++VEE++EEQQ KK RK VGPKARN   SH+ARK
Subjt:  MCEEQMVRGGKVGSRKNFKKKFRPKDKGSDDSDEDYVVSNDENGISECSDEGYCYYLDENASGEDNFIVEEEVEEQQPKKVRKGVGPKARNAFSSHKARK

Query:  KNGRKRQRFSYQEEDD-GDEDDEDYSVDDDNDY---EEEEEEEEEDVDVDEEVEDEDEDFMLEEEDFSDDEEPLVRKRATNMKRGRQGVRKNTIGKVSKK
        KNG KR+RFS QEE+D GDEDDEDYSVD+DNDY   EEEEEEEEEDVDVD EVEDEDEDF+LEEEDFSD+EEP+VRKR TNMKRGR G+RKN +GKV KK
Subjt:  KNGRKRQRFSYQEEDD-GDEDDEDYSVDDDNDY---EEEEEEEEEDVDVDEEVEDEDEDFMLEEEDFSDDEEPLVRKRATNMKRGRQGVRKNTIGKVSKK

Query:  RKPKAAKKPSRNKQRKKTGTRTLRNS--DDDDFLDNYRVVKNTRRKKHVSKRKNYVAQSDLDTFLSGSSDYEYTISEEEREQVREAEKLCGHLRNRIRTV
        RKPKAAKKPSRNK+RKK+G +T+RNS  DDDDF DNY  +K TRRK+ VSKRK YV QSDLD  LSGSSDYEYTISEEEREQVREAE+LCG LRNR RTV
Subjt:  RKPKAAKKPSRNKQRKKTGTRTLRNS--DDDDFLDNYRVVKNTRRKKHVSKRKNYVAQSDLDTFLSGSSDYEYTISEEEREQVREAEKLCGHLRNRIRTV

Query:  PSPPRIEESDLRQQRKPRPPIRKGKEKVEEVKVEVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPARSTAGIDL
        PSPPRIE++DL QQRK RPP+RKGKEKVE +K EVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKP RSTAGIDL
Subjt:  PSPPRIEESDLRQQRKPRPPIRKGKEKVEEVKVEVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPARSTAGIDL

Query:  REVVIQVPERDQVYQPSEEELRSYLDPYENVICTECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYCVDCTTIAQGSSSPQPPSRLSDRRTTNN
        RE V+QVPERDQVYQPSEEELRSYLDPYENVIC ECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYC DC TIA GSSSPQP +RLS+RRTTNN
Subjt:  REVVIQVPERDQVYQPSEEELRSYLDPYENVICTECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYCVDCTTIAQGSSSPQPPSRLSDRRTTNN

Query:  LFNRTFPVANRDGLDLNSLSSPRTPNIQGFGNIPSPRLPVEVQSTSPMSQAVAPTLVRRRILRLHINHLRSSNQMGLVNNRADGVSAANPCGGGTLSPQM
        LFNRTFPVANRDGLDLNS+SSPRTP +QGF NIPSPRLPVEVQSTSPMSQAVAPTL RRRILRLHIN++RSS+QMGLV NR DGVSA NP GGGTLS Q 
Subjt:  LFNRTFPVANRDGLDLNSLSSPRTPNIQGFGNIPSPRLPVEVQSTSPMSQAVAPTLVRRRILRLHINHLRSSNQMGLVNNRADGVSAANPCGGGTLSPQM

Query:  GQGRETTAEHTITQETAIPSQTMFGESLLHDNPSPLMQHVVFLGSETSHLPRQVMQDPHLTMPMERTSGNGTILNTLRGLEVEDSTTVDRNLNSALWHEH
        GQ RE+T EH  TQE AIPSQT+FGE+LLHD+ S +MQH  FL  ETSHLPRQ +QDPH +   +R S NGTI+N LRGL VE++ TVDRNLN  L  E 
Subjt:  GQGRETTAEHTITQETAIPSQTMFGESLLHDNPSPLMQHVVFLGSETSHLPRQVMQDPHLTMPMERTSGNGTILNTLRGLEVEDSTTVDRNLNSALWHEH

Query:  AVVNSLSNCEQIHQYNNVLNTGSDNSLLPP-VGEEKDYCAAREHLQPMIEHHLKNLSRDVDLGQSTANDIATKAAGTILAACGFKHSITEAC-TLPPSRC
        A VNSL NCEQIH Y+N +NT SDN  LP  V +EK+YCAARE LQP+I+ HLKNLSRD+DLGQSTA+DIATKA  TIL ACG++HSI  A  + PPS+C
Subjt:  AVVNSLSNCEQIHQYNNVLNTGSDNSLLPP-VGEEKDYCAAREHLQPMIEHHLKNLSRDVDLGQSTANDIATKAAGTILAACGFKHSITEAC-TLPPSRC

Query:  SHIELSMGEGQRSLIKGLCSTCFDSYVRDVVKKITDD-AWLNLTL
        SHIEL++GEGQRSLIKGLC  CFDSYVRDVVKKITDD +WLNL L
Subjt:  SHIELSMGEGQRSLIKGLCSTCFDSYVRDVVKKITDD-AWLNLTL

XP_022136198.1 uncharacterized protein LOC111007954 isoform X1 [Momordica charantia]0.0e+0080.37Show/hide
Query:  MCEEQMVRGGKVGSRKNFKKKFRPKDKGSDDSDEDYVVSNDENGISECSDEGYCYYLDENASGEDNFIVEEEVEEQQPKKVRKGVGPKARNAFSSHKARK
        MCEEQMVRGGKVGS+KNFKKKFRPKDKGSDDSDEDYVVSNDENG+SECSDE YC YLDENASGEDNF     VEEQQP KVRK  G KARNAF SHKARK
Subjt:  MCEEQMVRGGKVGSRKNFKKKFRPKDKGSDDSDEDYVVSNDENGISECSDEGYCYYLDENASGEDNFIVEEEVEEQQPKKVRKGVGPKARNAFSSHKARK

Query:  KNGRKRQRFSYQEED--DGDEDDEDYSVDDDNDYEEEEEEEEE----------DVDVDEEVEDEDEDFMLEEEDFSDDEEPLVRKRATNMKRGRQGVRKN
        KNGRKRQRFSY+EED  DGDEDDEDYSVDDDNDYEEEEEEEEE          DVDVDEEV  EDEDFM+EEEDFSD+EEPLVRKR TNMKRGR GVRKN
Subjt:  KNGRKRQRFSYQEED--DGDEDDEDYSVDDDNDYEEEEEEEEE----------DVDVDEEVEDEDEDFMLEEEDFSDDEEPLVRKRATNMKRGRQGVRKN

Query:  TIGKVSKKRKPKAAKKPSRNKQRKKTGTRTLRNSDDDDFLDNY-RVVKNTRRKKHVSKRKNYVAQSDLDTFLSGSSDYEYTISEEEREQVREAEKLCGHL
        TIGKV KKRKPKAAKKPSRNK+R+K+G RT+RNSDD DF DNY   VKNTRRKK VSKRK  V Q+DLDTFLSGSSDYEYTISEEEREQVREAE+LCGHL
Subjt:  TIGKVSKKRKPKAAKKPSRNKQRKKTGTRTLRNSDDDDFLDNY-RVVKNTRRKKHVSKRKNYVAQSDLDTFLSGSSDYEYTISEEEREQVREAEKLCGHL

Query:  RNRIRTVPSPPRIEESDLRQQRKPRPPIRKGKEKVEEVKVEVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPAR
        RNR R++PSPP+IE+SD RQ RK  PP+RKGKEK+EEVK EVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQ+ISKPAR
Subjt:  RNRIRTVPSPPRIEESDLRQQRKPRPPIRKGKEKVEEVKVEVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPAR

Query:  STAGIDLREVVIQVPERDQVYQPSEEELRSYLDPYENVICTECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYCVDCTTIAQGSSSPQPPSRLS
        STAGIDLREVVIQVPERDQVYQPSEEELRSYLDPYENVIC ECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYC DC TIA GSSSPQPPSRLS
Subjt:  STAGIDLREVVIQVPERDQVYQPSEEELRSYLDPYENVICTECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYCVDCTTIAQGSSSPQPPSRLS

Query:  DRRTTNNLFNRTFPVANRDGLDLNSLSSPRTPNIQGFGNIPSPRLPVEVQSTSPMSQAVAPTLVRRRILRLHINHLRSSNQMGLVNNRADGVSAANPCGG
        DRRTTNNLFNR+FPVA+RDGLDLN LSSPRTP IQGFGN  SPRLP EVQSTSPMSQAVAPTL+ RR +RL INH+RSSNQMGLVN+R DGVSAA+PCGG
Subjt:  DRRTTNNLFNRTFPVANRDGLDLNSLSSPRTPNIQGFGNIPSPRLPVEVQSTSPMSQAVAPTLVRRRILRLHINHLRSSNQMGLVNNRADGVSAANPCGG

Query:  GTLSPQMGQGRETTAEHTITQETAIPSQTMFGESLLHDNPSPLMQHVVFLGSETSHLPRQVMQDPHLTMPMERTSGNGTILNTLRGLEVEDSTTV-DRNL
        GT++ Q  QGRE TAEHT  QET IPSQT+FGES LH NPSPLMQH  FL SETSHLP Q ++DPHL++P ER    GT LN L  L VE++TT  D+ +
Subjt:  GTLSPQMGQGRETTAEHTITQETAIPSQTMFGESLLHDNPSPLMQHVVFLGSETSHLPRQVMQDPHLTMPMERTSGNGTILNTLRGLEVEDSTTV-DRNL

Query:  NSALWHEHAVVNSLSNCEQIHQYNNVLNTGSDNSLLPP-VGEEKDYCAAREHLQPMIEHHLKNLSRDVDLGQSTANDIATKAAGTILAACGFKHSITEAC
        N A W     VN LSNCEQIH +++VLNT SDN  LPP + +EKDY AARE LQP+I+ HLK+LSRDVDLGQSTA DIA KA+ TILAACGF+H I EA 
Subjt:  NSALWHEHAVVNSLSNCEQIHQYNNVLNTGSDNSLLPP-VGEEKDYCAAREHLQPMIEHHLKNLSRDVDLGQSTANDIATKAAGTILAACGFKHSITEAC

Query:  -TLPPSRCSHIELSMGEGQRSLIKGLCSTCFDSYVRDVVKKITDD----AWLNLTL
           PPS+CSHIELSMGEG RSLI+GLCS CFDSYVRD+VKKITDD    +WL+LTL
Subjt:  -TLPPSRCSHIELSMGEGQRSLIKGLCSTCFDSYVRDVVKKITDD----AWLNLTL

XP_023530078.1 uncharacterized protein LOC111792737 isoform X2 [Cucurbita pepo subsp. pepo]0.0e+0080.35Show/hide
Query:  MCEEQMVRGGKVGSRKNFKKKFRPKDKGSDDSDEDYVVSNDENGISECSDEGYCYYLDENASGEDNFIVEE----EVEEQQPKKVRKGVGPKARNAFSSH
        MCEEQMVRGGKVGSRKNFKKKFRPKDKGSDDSDEDYVVSNDENG+SE SDE  C YLD NASGEDNF+VEE    EVEEQQ  K RKGV PK  NA  S 
Subjt:  MCEEQMVRGGKVGSRKNFKKKFRPKDKGSDDSDEDYVVSNDENGISECSDEGYCYYLDENASGEDNFIVEE----EVEEQQPKKVRKGVGPKARNAFSSH

Query:  KARKKNGRKRQRFSYQ--EEDDGDEDDEDYSVDDDNDYEEEEEEEEEDVDVDEE-------VEDEDEDFMLEEEDFSDDEEPLVRKRATNMKRGRQGVRK
        K RKKNGRKR+RFSYQ  EEDDGDEDDEDYS+D+DNDYEEEEEEEEE+ + +EE       +EDEDE+FM+EEEDFSD+EEPLVRK+ATN KRGRQGVRK
Subjt:  KARKKNGRKRQRFSYQ--EEDDGDEDDEDYSVDDDNDYEEEEEEEEEDVDVDEE-------VEDEDEDFMLEEEDFSDDEEPLVRKRATNMKRGRQGVRK

Query:  NTIGKVSKKRKPKAAKKPSRNKQRKKTGTRTLRNSDDDDFLDNYRVVKNTRRKKHVSKRKNYVAQSDLDTFLSGSSDYEYTISEEEREQVREAEKLCGHL
        NT+GKV KKRKPKAAKKP+RNK+RKK+ T    + DDDDF +N  VVKNTR+K+ VSKRKN+V QSDLDTF+SGSSDYEYTISEEEREQVREAEKLCG+L
Subjt:  NTIGKVSKKRKPKAAKKPSRNKQRKKTGTRTLRNSDDDDFLDNYRVVKNTRRKKHVSKRKNYVAQSDLDTFLSGSSDYEYTISEEEREQVREAEKLCGHL

Query:  RNRIRTVPSPPRIEESDLRQQRKPRPPIRKGKEKVEEVKVEVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPAR
        RNR RTVPSP RIEES+ RQ RKPRPP+RKGKEK+EEVK EVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPAR
Subjt:  RNRIRTVPSPPRIEESDLRQQRKPRPPIRKGKEKVEEVKVEVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPAR

Query:  STAGIDLREVVIQVPERDQVYQPSEEELRSYLDPYENVICTECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYCVDCTTIAQGSSSPQPP-SRL
        S AGIDLREVVIQVPERDQVYQPSEEELRSYLDPYENVICTECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYCVDCT +AQG SSPQPP +RL
Subjt:  STAGIDLREVVIQVPERDQVYQPSEEELRSYLDPYENVICTECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYCVDCTTIAQGSSSPQPP-SRL

Query:  SDRRTTNNLFNRTFPVANRDGLDLNSLSSPRTPNIQGFGNIPSPRLPVEVQSTSPMSQAVAPTLVRRRILRLHINHLRSSNQMGLVNNRADGVS---AAN
        SDRRTTNNLFNR+FPVANRDGLDLNSLSSPRTP  QGFG+IPSPR+PVEVQSTSPMSQAVAPTLVRRRILRLHINH+RSSNQMGLVNNR +GVS   AA+
Subjt:  SDRRTTNNLFNRTFPVANRDGLDLNSLSSPRTPNIQGFGNIPSPRLPVEVQSTSPMSQAVAPTLVRRRILRLHINHLRSSNQMGLVNNRADGVS---AAN

Query:  PCGGGTLSPQMGQGRETTAEHTITQETAIPSQTMFGESLLHDNPSPLMQHVVFLGSETSHLPRQVMQDPHLTMPM--ERTSGNGTILNTLR-GLEVEDST
         CGG T+   MG            QET + SQ++FGESLLHDNPSPLMQH VFL SET+ +  QV+QDPHL+M      TS NGTI NTLR G + E+ST
Subjt:  PCGGGTLSPQMGQGRETTAEHTITQETAIPSQTMFGESLLHDNPSPLMQHVVFLGSETSHLPRQVMQDPHLTMPM--ERTSGNGTILNTLR-GLEVEDST

Query:  TVDRNLNSALWHEHAVVNSLSNCEQIHQYNNVLNTGSDNSLLPP-VGEEKDYCAAREHLQPMIEHHLKN-LSRDVDLGQSTANDIATKAAGTILAACGFK
        TVDRNLN  LWHE A VN LSNCE IH+Y+N+LNTGSD+S  P  VG+EKDYCAAREHLQPMIE HLKN LS DVDLGQSTAN+IATKAAGTILAACGFK
Subjt:  TVDRNLNSALWHEHAVVNSLSNCEQIHQYNNVLNTGSDNSLLPP-VGEEKDYCAAREHLQPMIEHHLKN-LSRDVDLGQSTANDIATKAAGTILAACGFK

Query:  HSITEAC-TLPPSRCSHIELSMGEGQRSLIKGLCSTCFDSYVRDVVKKITDDA-WLNLTL
        +SI EAC   PPS+CSHIELS+GE QRSLIKGLCS CFDSYVRDVVKK TDDA WL+LTL
Subjt:  HSITEAC-TLPPSRCSHIELSMGEGQRSLIKGLCSTCFDSYVRDVVKKITDDA-WLNLTL

XP_038880559.1 uncharacterized protein LOC120072219 isoform X1 [Benincasa hispida]0.0e+0082.27Show/hide
Query:  MCEEQMVRGGKVGSRKNFKKKFRPKDKGSDDSDEDYVVSNDENGISECSDEGYCYYLDENASGEDNFIVEEEVEEQQPKKVRKGVGPKARNAFSSHKARK
        MCEEQMVRGGKVGSRKNFKKKFR KDKGSDDSDEDYVVS+DENG+SE SDE YC  LDENASGEDNF+VEE+VEEQQPK VRK VGPKARNAF S K RK
Subjt:  MCEEQMVRGGKVGSRKNFKKKFRPKDKGSDDSDEDYVVSNDENGISECSDEGYCYYLDENASGEDNFIVEEEVEEQQPKKVRKGVGPKARNAFSSHKARK

Query:  KNGRKRQRFSYQ-EEDDGDEDDEDYSVDDDNDY-----EEEEEEEEEDVDVDEEVEDEDEDFMLEEEDFSDDEEPLVRKRATNMKRGRQGVRKNTIGKVS
         NG+KR+RFSYQ EED GDEDDEDYSVD+DNDY     EEEEEEEEEDVDVDEEVEDEDEDF+LEEEDFSD+EEP+VRKR TNMKRGR GVRKNT+GKV 
Subjt:  KNGRKRQRFSYQ-EEDDGDEDDEDYSVDDDNDY-----EEEEEEEEEDVDVDEEVEDEDEDFMLEEEDFSDDEEPLVRKRATNMKRGRQGVRKNTIGKVS

Query:  KKRKPKAAKKPSRNKQRKKTGTRTLRNSDDDDFLDNYRVVKNTRRKKHVSKRKNYVAQSDLDTFLSGSSDYEYTISEEEREQVREAEKLCGHLRNRIRTV
        KKR+PKAAKKPSRNK+RKK+GT++++NS DDDF D+Y  VK TRRK+ VSKRK YV QSDLDTFLSGSSDYEYTISEEEREQVREAE+LCG L NRIRT 
Subjt:  KKRKPKAAKKPSRNKQRKKTGTRTLRNSDDDDFLDNYRVVKNTRRKKHVSKRKNYVAQSDLDTFLSGSSDYEYTISEEEREQVREAEKLCGHLRNRIRTV

Query:  PSPPRIEESDLRQQRKPRPPIRKGKEKVEEVKVEVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPARSTAGIDL
        PSP RIE+SD+ QQRK RPP+RKGKEKVEEVK EVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPARSTAGIDL
Subjt:  PSPPRIEESDLRQQRKPRPPIRKGKEKVEEVKVEVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPARSTAGIDL

Query:  REVVIQVPERDQVYQPSEEELRSYLDPYENVICTECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYCVDCTTIAQGSSSPQPPSRLSDRRTTNN
        REVVIQVPERDQVYQPSEEELRSYLDPYENVIC ECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYCVDC  IA GSSSPQP SRLS+RRTTNN
Subjt:  REVVIQVPERDQVYQPSEEELRSYLDPYENVICTECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYCVDCTTIAQGSSSPQPPSRLSDRRTTNN

Query:  LFNRTFPVANRDGLDLNSLSSPRTPNIQGFGNIPSPRLPVEVQSTSPMSQAVAPTLVRRRILRLHINHLRSSNQMGLVNNRADGVSAANPCGGGTLSPQM
        LFNRTFPVANRDGLDLNSLSSPR+  +QGF NIPSPRLPVEVQSTSPMSQA+APTL RRRILRLHINH+RSS+QMGLV NRA+GVSA NPCGGGTLS QM
Subjt:  LFNRTFPVANRDGLDLNSLSSPRTPNIQGFGNIPSPRLPVEVQSTSPMSQAVAPTLVRRRILRLHINHLRSSNQMGLVNNRADGVSAANPCGGGTLSPQM

Query:  GQGRETTAEHTITQETAIPSQTMFGESLLHDNPSPLMQHVVFLGSETSHLPRQVMQDPHLTMPMERTSGNGTILNTLRGLEVEDSTTVDRNLNSALWHEH
        GQ RE+TAEHTITQE  IP+QT+ GE+LLHD+PSPLMQH  FL  ETSHL RQV+QDPH +MP++RTS NGTI+N LRGL VE++ TVDRNLN  L  E 
Subjt:  GQGRETTAEHTITQETAIPSQTMFGESLLHDNPSPLMQHVVFLGSETSHLPRQVMQDPHLTMPMERTSGNGTILNTLRGLEVEDSTTVDRNLNSALWHEH

Query:  AVVNSLSNCEQIHQYNNVLNTGSDNSLLPP-VGEEKDYCAAREHLQPMIEHHLKNLSRDVDLGQSTANDIATKAAGTILAACGFKHSITEAC-TLPPSRC
        A +NSLSNCEQIH Y+NV+NT SDN  LP  VG+EKDYCAARE LQP+I+ HLKNL+RD+DL QS+A DIATKAA TIL ACGF H I +A    PPSRC
Subjt:  AVVNSLSNCEQIHQYNNVLNTGSDNSLLPP-VGEEKDYCAAREHLQPMIEHHLKNLSRDVDLGQSTANDIATKAAGTILAACGFKHSITEAC-TLPPSRC

Query:  SHIELSMGEGQRSLIKGLCSTCFDSYVRDVVKKITDD--AWLNLTL
        SHIELSMGE +RSLIKGLC +CFDSYV+D+VKKITDD  +WLNL+L
Subjt:  SHIELSMGEGQRSLIKGLCSTCFDSYVRDVVKKITDD--AWLNLTL

XP_038880561.1 uncharacterized protein LOC120072219 isoform X2 [Benincasa hispida]0.0e+0082.16Show/hide
Query:  MVRGGKVGSRKNFKKKFRPKDKGSDDSDEDYVVSNDENGISECSDEGYCYYLDENASGEDNFIVEEEVEEQQPKKVRKGVGPKARNAFSSHKARKKNGRK
        MVRGGKVGSRKNFKKKFR KDKGSDDSDEDYVVS+DENG+SE SDE YC  LDENASGEDNF+VEE+VEEQQPK VRK VGPKARNAF S K RK NG+K
Subjt:  MVRGGKVGSRKNFKKKFRPKDKGSDDSDEDYVVSNDENGISECSDEGYCYYLDENASGEDNFIVEEEVEEQQPKKVRKGVGPKARNAFSSHKARKKNGRK

Query:  RQRFSYQ-EEDDGDEDDEDYSVDDDNDY-----EEEEEEEEEDVDVDEEVEDEDEDFMLEEEDFSDDEEPLVRKRATNMKRGRQGVRKNTIGKVSKKRKP
        R+RFSYQ EED GDEDDEDYSVD+DNDY     EEEEEEEEEDVDVDEEVEDEDEDF+LEEEDFSD+EEP+VRKR TNMKRGR GVRKNT+GKV KKR+P
Subjt:  RQRFSYQ-EEDDGDEDDEDYSVDDDNDY-----EEEEEEEEEDVDVDEEVEDEDEDFMLEEEDFSDDEEPLVRKRATNMKRGRQGVRKNTIGKVSKKRKP

Query:  KAAKKPSRNKQRKKTGTRTLRNSDDDDFLDNYRVVKNTRRKKHVSKRKNYVAQSDLDTFLSGSSDYEYTISEEEREQVREAEKLCGHLRNRIRTVPSPPR
        KAAKKPSRNK+RKK+GT++++NS DDDF D+Y  VK TRRK+ VSKRK YV QSDLDTFLSGSSDYEYTISEEEREQVREAE+LCG L NRIRT PSP R
Subjt:  KAAKKPSRNKQRKKTGTRTLRNSDDDDFLDNYRVVKNTRRKKHVSKRKNYVAQSDLDTFLSGSSDYEYTISEEEREQVREAEKLCGHLRNRIRTVPSPPR

Query:  IEESDLRQQRKPRPPIRKGKEKVEEVKVEVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPARSTAGIDLREVVI
        IE+SD+ QQRK RPP+RKGKEKVEEVK EVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPARSTAGIDLREVVI
Subjt:  IEESDLRQQRKPRPPIRKGKEKVEEVKVEVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPARSTAGIDLREVVI

Query:  QVPERDQVYQPSEEELRSYLDPYENVICTECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYCVDCTTIAQGSSSPQPPSRLSDRRTTNNLFNRT
        QVPERDQVYQPSEEELRSYLDPYENVIC ECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYCVDC  IA GSSSPQP SRLS+RRTTNNLFNRT
Subjt:  QVPERDQVYQPSEEELRSYLDPYENVICTECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYCVDCTTIAQGSSSPQPPSRLSDRRTTNNLFNRT

Query:  FPVANRDGLDLNSLSSPRTPNIQGFGNIPSPRLPVEVQSTSPMSQAVAPTLVRRRILRLHINHLRSSNQMGLVNNRADGVSAANPCGGGTLSPQMGQGRE
        FPVANRDGLDLNSLSSPR+  +QGF NIPSPRLPVEVQSTSPMSQA+APTL RRRILRLHINH+RSS+QMGLV NRA+GVSA NPCGGGTLS QMGQ RE
Subjt:  FPVANRDGLDLNSLSSPRTPNIQGFGNIPSPRLPVEVQSTSPMSQAVAPTLVRRRILRLHINHLRSSNQMGLVNNRADGVSAANPCGGGTLSPQMGQGRE

Query:  TTAEHTITQETAIPSQTMFGESLLHDNPSPLMQHVVFLGSETSHLPRQVMQDPHLTMPMERTSGNGTILNTLRGLEVEDSTTVDRNLNSALWHEHAVVNS
        +TAEHTITQE  IP+QT+ GE+LLHD+PSPLMQH  FL  ETSHL RQV+QDPH +MP++RTS NGTI+N LRGL VE++ TVDRNLN  L  E A +NS
Subjt:  TTAEHTITQETAIPSQTMFGESLLHDNPSPLMQHVVFLGSETSHLPRQVMQDPHLTMPMERTSGNGTILNTLRGLEVEDSTTVDRNLNSALWHEHAVVNS

Query:  LSNCEQIHQYNNVLNTGSDNSLLPP-VGEEKDYCAAREHLQPMIEHHLKNLSRDVDLGQSTANDIATKAAGTILAACGFKHSITEAC-TLPPSRCSHIEL
        LSNCEQIH Y+NV+NT SDN  LP  VG+EKDYCAARE LQP+I+ HLKNL+RD+DL QS+A DIATKAA TIL ACGF H I +A    PPSRCSHIEL
Subjt:  LSNCEQIHQYNNVLNTGSDNSLLPP-VGEEKDYCAAREHLQPMIEHHLKNLSRDVDLGQSTANDIATKAAGTILAACGFKHSITEAC-TLPPSRCSHIEL

Query:  SMGEGQRSLIKGLCSTCFDSYVRDVVKKITDD--AWLNLTL
        SMGE +RSLIKGLC +CFDSYV+D+VKKITDD  +WLNL+L
Subjt:  SMGEGQRSLIKGLCSTCFDSYVRDVVKKITDD--AWLNLTL

TrEMBL top hitse value%identityAlignment
A0A0A0LWB6 Uncharacterized protein0.0e+0079.88Show/hide
Query:  MVRGGKVGSRKNFKKKFRPKDKGSDDSDEDYVVSNDENGISECSDEGYCYYLDENASGEDNFIVEEEVEEQQPKKVRKGVGPKARNAFSSHKARKKNGRK
        MVRGG+V SRKNFKKKFR KDKGSDDSDEDYVVS+D+NG+SE SDE YC  LDENASGEDN++VEE++EEQQ KK RK VGPKARN   SH+ARKKNG K
Subjt:  MVRGGKVGSRKNFKKKFRPKDKGSDDSDEDYVVSNDENGISECSDEGYCYYLDENASGEDNFIVEEEVEEQQPKKVRKGVGPKARNAFSSHKARKKNGRK

Query:  RQRFSYQEEDD-GDEDDEDYSVDDDNDY---EEEEEEEEEDVDVDEEVEDEDEDFMLEEEDFSDDEEPLVRKRATNMKRGRQGVRKNTIGKVSKKRKPKA
        R+RFS QEE+D GDEDDEDYSVD+DNDY   EEEEEEEEEDVDVD EVEDEDEDF+LEEEDFSD+EEP+VRKR TNMKRGR G+RKN +GKV KKRKPKA
Subjt:  RQRFSYQEEDD-GDEDDEDYSVDDDNDY---EEEEEEEEEDVDVDEEVEDEDEDFMLEEEDFSDDEEPLVRKRATNMKRGRQGVRKNTIGKVSKKRKPKA

Query:  AKKPSRNKQRKKTGTRTLRNS--DDDDFLDNYRVVKNTRRKKHVSKRKNYVAQSDLDTFLSGSSDYEYTISEEEREQVREAEKLCGHLRNRIRTVPSPPR
        AKKPSRNK+RKK+G +T+RNS  DDDDF DNY  +K TRRK+ VSKRK YV QSDLD  LSGSSDYEYTISEEEREQVREAE+LCG LRNR RTVPSPPR
Subjt:  AKKPSRNKQRKKTGTRTLRNS--DDDDFLDNYRVVKNTRRKKHVSKRKNYVAQSDLDTFLSGSSDYEYTISEEEREQVREAEKLCGHLRNRIRTVPSPPR

Query:  IEESDLRQQRKPRPPIRKGKEKVEEVKVEVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPARSTAGIDLREVVI
        IE++DL QQRK RPP+RKGKEKVE +K EVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKP RSTAGIDLRE V+
Subjt:  IEESDLRQQRKPRPPIRKGKEKVEEVKVEVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPARSTAGIDLREVVI

Query:  QVPERDQVYQPSEEELRSYLDPYENVICTECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYCVDCTTIAQGSSSPQPPSRLSDRRTTNNLFNRT
        QVPERDQVYQPSEEELRSYLDPYENVIC ECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYC DC TIA GSSSPQP +RLS+RRTTNNLFNRT
Subjt:  QVPERDQVYQPSEEELRSYLDPYENVICTECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYCVDCTTIAQGSSSPQPPSRLSDRRTTNNLFNRT

Query:  FPVANRDGLDLNSLSSPRTPNIQGFGNIPSPRLPVEVQSTSPMSQAVAPTLVRRRILRLHINHLRSSNQMGLVNNRADGVSAANPCGGGTLSPQMGQGRE
        FPVANRDGLDLNS+SSPRTP +QGF NIPSPRLPVEVQSTSPMSQAVAPTL RRRILRLHIN++RSS+QMGLV NR DGVSA NP GGGTLS Q GQ RE
Subjt:  FPVANRDGLDLNSLSSPRTPNIQGFGNIPSPRLPVEVQSTSPMSQAVAPTLVRRRILRLHINHLRSSNQMGLVNNRADGVSAANPCGGGTLSPQMGQGRE

Query:  TTAEHTITQETAIPSQTMFGESLLHDNPSPLMQHVVFLGSETSHLPRQVMQDPHLTMPMERTSGNGTILNTLRGLEVEDSTTVDRNLNSALWHEHAVVNS
        +T EH  TQE AIPSQT+FGE+LLHD+ S +MQH  FL  ETSHLPRQ +QDPH +   +R S NGTI+N LRGL VE++ TVDRNLN  L  E A VNS
Subjt:  TTAEHTITQETAIPSQTMFGESLLHDNPSPLMQHVVFLGSETSHLPRQVMQDPHLTMPMERTSGNGTILNTLRGLEVEDSTTVDRNLNSALWHEHAVVNS

Query:  LSNCEQIHQYNNVLNTGSDNSLLPP-VGEEKDYCAAREHLQPMIEHHLKNLSRDVDLGQSTANDIATKAAGTILAACGFKHSITEAC-TLPPSRCSHIEL
        L NCEQIH Y+N +NT SDN  LP  V +EK+YCAARE LQP+I+ HLKNLSRD+DLGQSTA+DIATKA  TIL ACG++HSI  A  + PPS+CSHIEL
Subjt:  LSNCEQIHQYNNVLNTGSDNSLLPP-VGEEKDYCAAREHLQPMIEHHLKNLSRDVDLGQSTANDIATKAAGTILAACGFKHSITEAC-TLPPSRCSHIEL

Query:  SMGEGQRSLIKGLCSTCFDSYVRDVVKKITDD-AWLNLTL
        ++GEGQRSLIKGLC  CFDSYVRDVVKKITDD +WLNL L
Subjt:  SMGEGQRSLIKGLCSTCFDSYVRDVVKKITDD-AWLNLTL

A0A6J1C2V6 uncharacterized protein LOC111007954 isoform X20.0e+0080.26Show/hide
Query:  MVRGGKVGSRKNFKKKFRPKDKGSDDSDEDYVVSNDENGISECSDEGYCYYLDENASGEDNFIVEEEVEEQQPKKVRKGVGPKARNAFSSHKARKKNGRK
        MVRGGKVGS+KNFKKKFRPKDKGSDDSDEDYVVSNDENG+SECSDE YC YLDENASGEDNF     VEEQQP KVRK  G KARNAF SHKARKKNGRK
Subjt:  MVRGGKVGSRKNFKKKFRPKDKGSDDSDEDYVVSNDENGISECSDEGYCYYLDENASGEDNFIVEEEVEEQQPKKVRKGVGPKARNAFSSHKARKKNGRK

Query:  RQRFSYQEED--DGDEDDEDYSVDDDNDYEEEEEEEEE----------DVDVDEEVEDEDEDFMLEEEDFSDDEEPLVRKRATNMKRGRQGVRKNTIGKV
        RQRFSY+EED  DGDEDDEDYSVDDDNDYEEEEEEEEE          DVDVDEEV  EDEDFM+EEEDFSD+EEPLVRKR TNMKRGR GVRKNTIGKV
Subjt:  RQRFSYQEED--DGDEDDEDYSVDDDNDYEEEEEEEEE----------DVDVDEEVEDEDEDFMLEEEDFSDDEEPLVRKRATNMKRGRQGVRKNTIGKV

Query:  SKKRKPKAAKKPSRNKQRKKTGTRTLRNSDDDDFLDNY-RVVKNTRRKKHVSKRKNYVAQSDLDTFLSGSSDYEYTISEEEREQVREAEKLCGHLRNRIR
         KKRKPKAAKKPSRNK+R+K+G RT+RNSDD DF DNY   VKNTRRKK VSKRK  V Q+DLDTFLSGSSDYEYTISEEEREQVREAE+LCGHLRNR R
Subjt:  SKKRKPKAAKKPSRNKQRKKTGTRTLRNSDDDDFLDNY-RVVKNTRRKKHVSKRKNYVAQSDLDTFLSGSSDYEYTISEEEREQVREAEKLCGHLRNRIR

Query:  TVPSPPRIEESDLRQQRKPRPPIRKGKEKVEEVKVEVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPARSTAGI
        ++PSPP+IE+SD RQ RK  PP+RKGKEK+EEVK EVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQ+ISKPARSTAGI
Subjt:  TVPSPPRIEESDLRQQRKPRPPIRKGKEKVEEVKVEVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPARSTAGI

Query:  DLREVVIQVPERDQVYQPSEEELRSYLDPYENVICTECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYCVDCTTIAQGSSSPQPPSRLSDRRTT
        DLREVVIQVPERDQVYQPSEEELRSYLDPYENVIC ECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYC DC TIA GSSSPQPPSRLSDRRTT
Subjt:  DLREVVIQVPERDQVYQPSEEELRSYLDPYENVICTECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYCVDCTTIAQGSSSPQPPSRLSDRRTT

Query:  NNLFNRTFPVANRDGLDLNSLSSPRTPNIQGFGNIPSPRLPVEVQSTSPMSQAVAPTLVRRRILRLHINHLRSSNQMGLVNNRADGVSAANPCGGGTLSP
        NNLFNR+FPVA+RDGLDLN LSSPRTP IQGFGN  SPRLP EVQSTSPMSQAVAPTL+ RR +RL INH+RSSNQMGLVN+R DGVSAA+PCGGGT++ 
Subjt:  NNLFNRTFPVANRDGLDLNSLSSPRTPNIQGFGNIPSPRLPVEVQSTSPMSQAVAPTLVRRRILRLHINHLRSSNQMGLVNNRADGVSAANPCGGGTLSP

Query:  QMGQGRETTAEHTITQETAIPSQTMFGESLLHDNPSPLMQHVVFLGSETSHLPRQVMQDPHLTMPMERTSGNGTILNTLRGLEVEDSTTV-DRNLNSALW
        Q  QGRE TAEHT  QET IPSQT+FGES LH NPSPLMQH  FL SETSHLP Q ++DPHL++P ER    GT LN L  L VE++TT  D+ +N A W
Subjt:  QMGQGRETTAEHTITQETAIPSQTMFGESLLHDNPSPLMQHVVFLGSETSHLPRQVMQDPHLTMPMERTSGNGTILNTLRGLEVEDSTTV-DRNLNSALW

Query:  HEHAVVNSLSNCEQIHQYNNVLNTGSDNSLLPP-VGEEKDYCAAREHLQPMIEHHLKNLSRDVDLGQSTANDIATKAAGTILAACGFKHSITEAC-TLPP
             VN LSNCEQIH +++VLNT SDN  LPP + +EKDY AARE LQP+I+ HLK+LSRDVDLGQSTA DIA KA+ TILAACGF+H I EA    PP
Subjt:  HEHAVVNSLSNCEQIHQYNNVLNTGSDNSLLPP-VGEEKDYCAAREHLQPMIEHHLKNLSRDVDLGQSTANDIATKAAGTILAACGFKHSITEAC-TLPP

Query:  SRCSHIELSMGEGQRSLIKGLCSTCFDSYVRDVVKKITDD----AWLNLTL
        S+CSHIELSMGEG RSLI+GLCS CFDSYVRD+VKKITDD    +WL+LTL
Subjt:  SRCSHIELSMGEGQRSLIKGLCSTCFDSYVRDVVKKITDD----AWLNLTL

A0A6J1C3M4 uncharacterized protein LOC111007954 isoform X10.0e+0080.37Show/hide
Query:  MCEEQMVRGGKVGSRKNFKKKFRPKDKGSDDSDEDYVVSNDENGISECSDEGYCYYLDENASGEDNFIVEEEVEEQQPKKVRKGVGPKARNAFSSHKARK
        MCEEQMVRGGKVGS+KNFKKKFRPKDKGSDDSDEDYVVSNDENG+SECSDE YC YLDENASGEDNF     VEEQQP KVRK  G KARNAF SHKARK
Subjt:  MCEEQMVRGGKVGSRKNFKKKFRPKDKGSDDSDEDYVVSNDENGISECSDEGYCYYLDENASGEDNFIVEEEVEEQQPKKVRKGVGPKARNAFSSHKARK

Query:  KNGRKRQRFSYQEED--DGDEDDEDYSVDDDNDYEEEEEEEEE----------DVDVDEEVEDEDEDFMLEEEDFSDDEEPLVRKRATNMKRGRQGVRKN
        KNGRKRQRFSY+EED  DGDEDDEDYSVDDDNDYEEEEEEEEE          DVDVDEEV  EDEDFM+EEEDFSD+EEPLVRKR TNMKRGR GVRKN
Subjt:  KNGRKRQRFSYQEED--DGDEDDEDYSVDDDNDYEEEEEEEEE----------DVDVDEEVEDEDEDFMLEEEDFSDDEEPLVRKRATNMKRGRQGVRKN

Query:  TIGKVSKKRKPKAAKKPSRNKQRKKTGTRTLRNSDDDDFLDNY-RVVKNTRRKKHVSKRKNYVAQSDLDTFLSGSSDYEYTISEEEREQVREAEKLCGHL
        TIGKV KKRKPKAAKKPSRNK+R+K+G RT+RNSDD DF DNY   VKNTRRKK VSKRK  V Q+DLDTFLSGSSDYEYTISEEEREQVREAE+LCGHL
Subjt:  TIGKVSKKRKPKAAKKPSRNKQRKKTGTRTLRNSDDDDFLDNY-RVVKNTRRKKHVSKRKNYVAQSDLDTFLSGSSDYEYTISEEEREQVREAEKLCGHL

Query:  RNRIRTVPSPPRIEESDLRQQRKPRPPIRKGKEKVEEVKVEVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPAR
        RNR R++PSPP+IE+SD RQ RK  PP+RKGKEK+EEVK EVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQ+ISKPAR
Subjt:  RNRIRTVPSPPRIEESDLRQQRKPRPPIRKGKEKVEEVKVEVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPAR

Query:  STAGIDLREVVIQVPERDQVYQPSEEELRSYLDPYENVICTECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYCVDCTTIAQGSSSPQPPSRLS
        STAGIDLREVVIQVPERDQVYQPSEEELRSYLDPYENVIC ECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYC DC TIA GSSSPQPPSRLS
Subjt:  STAGIDLREVVIQVPERDQVYQPSEEELRSYLDPYENVICTECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYCVDCTTIAQGSSSPQPPSRLS

Query:  DRRTTNNLFNRTFPVANRDGLDLNSLSSPRTPNIQGFGNIPSPRLPVEVQSTSPMSQAVAPTLVRRRILRLHINHLRSSNQMGLVNNRADGVSAANPCGG
        DRRTTNNLFNR+FPVA+RDGLDLN LSSPRTP IQGFGN  SPRLP EVQSTSPMSQAVAPTL+ RR +RL INH+RSSNQMGLVN+R DGVSAA+PCGG
Subjt:  DRRTTNNLFNRTFPVANRDGLDLNSLSSPRTPNIQGFGNIPSPRLPVEVQSTSPMSQAVAPTLVRRRILRLHINHLRSSNQMGLVNNRADGVSAANPCGG

Query:  GTLSPQMGQGRETTAEHTITQETAIPSQTMFGESLLHDNPSPLMQHVVFLGSETSHLPRQVMQDPHLTMPMERTSGNGTILNTLRGLEVEDSTTV-DRNL
        GT++ Q  QGRE TAEHT  QET IPSQT+FGES LH NPSPLMQH  FL SETSHLP Q ++DPHL++P ER    GT LN L  L VE++TT  D+ +
Subjt:  GTLSPQMGQGRETTAEHTITQETAIPSQTMFGESLLHDNPSPLMQHVVFLGSETSHLPRQVMQDPHLTMPMERTSGNGTILNTLRGLEVEDSTTV-DRNL

Query:  NSALWHEHAVVNSLSNCEQIHQYNNVLNTGSDNSLLPP-VGEEKDYCAAREHLQPMIEHHLKNLSRDVDLGQSTANDIATKAAGTILAACGFKHSITEAC
        N A W     VN LSNCEQIH +++VLNT SDN  LPP + +EKDY AARE LQP+I+ HLK+LSRDVDLGQSTA DIA KA+ TILAACGF+H I EA 
Subjt:  NSALWHEHAVVNSLSNCEQIHQYNNVLNTGSDNSLLPP-VGEEKDYCAAREHLQPMIEHHLKNLSRDVDLGQSTANDIATKAAGTILAACGFKHSITEAC

Query:  -TLPPSRCSHIELSMGEGQRSLIKGLCSTCFDSYVRDVVKKITDD----AWLNLTL
           PPS+CSHIELSMGEG RSLI+GLCS CFDSYVRD+VKKITDD    +WL+LTL
Subjt:  -TLPPSRCSHIELSMGEGQRSLIKGLCSTCFDSYVRDVVKKITDD----AWLNLTL

A0A6J1EU37 uncharacterized protein LOC111437762 isoform X10.0e+0080.3Show/hide
Query:  MCEEQMVRGGKVGSRKNFKKKFRPKDKGSDDSDEDYVVSNDENGISECSDEGYCYYLDENASGEDNFIV--EEEVEEQQPKKVRKGVGPKARNAFSSHKA
        MCEEQMVRGGKVGSRKNFKKKFRPKDKGSDDSDEDYVVSNDENG+SE SDE  C YLD NASGEDNF+V  EEEVEEQQ  K RKGV PK  N   S K 
Subjt:  MCEEQMVRGGKVGSRKNFKKKFRPKDKGSDDSDEDYVVSNDENGISECSDEGYCYYLDENASGEDNFIV--EEEVEEQQPKKVRKGVGPKARNAFSSHKA

Query:  RKKNGRKRQRFSYQ--EEDDGDEDDEDYSVDDDNDYEEEEEEEEEDVDVDEE-----VEDEDEDFMLEEEDFSDDEEPLVRKRATNMKRGRQGVRKNTIG
        RKKNGRKR+RFSYQ  EEDDGDEDDEDYS+D+DNDYEEEEEEEEE+ + +EE     +EDEDE+FM+EEEDFSD+EEPLVRK+ATN KRGRQGVRKNT+G
Subjt:  RKKNGRKRQRFSYQ--EEDDGDEDDEDYSVDDDNDYEEEEEEEEEDVDVDEE-----VEDEDEDFMLEEEDFSDDEEPLVRKRATNMKRGRQGVRKNTIG

Query:  KVSKKRKPKAAKKPSRNKQRKKTGTRTLRNSDDDDFLDNYRVVKNTRRKKHVSKRKNYVAQSDLDTFLSGSSDYEYTISEEEREQVREAEKLCGHLRNRI
        KV KKRKPKAAKKP+RNK+RKK+ T    + DDDDF +N  VVKNTR+K+ VSKRKN+V QSDLDTF+SGSSDYEYTISEEEREQVREAEKLCGHLRNR 
Subjt:  KVSKKRKPKAAKKPSRNKQRKKTGTRTLRNSDDDDFLDNYRVVKNTRRKKHVSKRKNYVAQSDLDTFLSGSSDYEYTISEEEREQVREAEKLCGHLRNRI

Query:  RTVPSPPRIEESDLRQQRKPRPPIRKGKEKVEEVKVEVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPARSTAG
        RTVPSP RIEES+ RQ RKPRPP+RKGKEK+E+VK EVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPARS AG
Subjt:  RTVPSPPRIEESDLRQQRKPRPPIRKGKEKVEEVKVEVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPARSTAG

Query:  IDLREVVIQVPERDQVYQPSEEELRSYLDPYENVICTECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYCVDCTTIAQGSSSPQ-PPSRLSDRR
        IDLREVVIQVPERDQVYQPSEEELRSYLDPYENVICTECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYCVDCT +AQG SSPQ PP+RLSDRR
Subjt:  IDLREVVIQVPERDQVYQPSEEELRSYLDPYENVICTECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYCVDCTTIAQGSSSPQ-PPSRLSDRR

Query:  TTNNLFNRTFPVANRDGLDLNSLSSPRTPNIQGFGNIPSPRLPVEVQSTSPMSQAVAPTLVRRRILRLHINHLRSSNQMGLVNNRADGVS----AANPCG
        TTNNLFNR+FPVANRDGLDLNSLSSPRTP  QGFG+IPSPR+PVEVQSTSPMSQAVAPTLVRRRILRLHINH+RSSNQMGLVNNR +GVS    AA+ CG
Subjt:  TTNNLFNRTFPVANRDGLDLNSLSSPRTPNIQGFGNIPSPRLPVEVQSTSPMSQAVAPTLVRRRILRLHINHLRSSNQMGLVNNRADGVS----AANPCG

Query:  GGTLSPQMGQGRETTAEHTITQETAIPSQTMFGESLLHDNPSPLMQHVVFLGSETSHLPRQVMQDPHLTMPM--ERTSGNGTILNTLR-GLEVEDSTTVD
        G T+   MG            QET + SQ++FGESLLHDNPSPLMQH VFL SET+ +  QV+QDPHL+M      TS NGTI NTLR G + E+STTVD
Subjt:  GGTLSPQMGQGRETTAEHTITQETAIPSQTMFGESLLHDNPSPLMQHVVFLGSETSHLPRQVMQDPHLTMPM--ERTSGNGTILNTLR-GLEVEDSTTVD

Query:  RNLNSALWHEHAVVNSLSNCEQIHQYNNVLNTGSDNSLLPP-VGEEKDYCAAREHLQPMIEHHLKN-LSRDVDL-GQSTANDIATKAAGTILAACGFKHS
        RNLN  LWHE A VN LSNCE IH+Y+N+LNTGSD+S  P  VG+EKDYCAAREHLQPMIE HLKN LS DVDL G+STAN+IATKAAGTILAACGFK+S
Subjt:  RNLNSALWHEHAVVNSLSNCEQIHQYNNVLNTGSDNSLLPP-VGEEKDYCAAREHLQPMIEHHLKN-LSRDVDL-GQSTANDIATKAAGTILAACGFKHS

Query:  ITEAC-TLPPSRCSHIELSMGEGQRSLIKGLCSTCFDSYVRDVVKKITDDA-WLNLTL
        I EAC   PPS+CSHIELS+GE QRSLIKGLCS CFDSYVRDVVKK TDDA WL+LTL
Subjt:  ITEAC-TLPPSRCSHIELSMGEGQRSLIKGLCSTCFDSYVRDVVKKITDDA-WLNLTL

A0A6J1EZ47 uncharacterized protein LOC111437762 isoform X20.0e+0080.4Show/hide
Query:  MCEEQMVRGGKVGSRKNFKKKFRPKDKGSDDSDEDYVVSNDENGISECSDEGYCYYLDENASGEDNFIV--EEEVEEQQPKKVRKGVGPKARNAFSSHKA
        MCEEQMVRGGKVGSRKNFKKKFRPKDKGSDDSDEDYVVSNDENG+SE SDE  C YLD NASGEDNF+V  EEEVEEQQ  K RKGV PK  N   S K 
Subjt:  MCEEQMVRGGKVGSRKNFKKKFRPKDKGSDDSDEDYVVSNDENGISECSDEGYCYYLDENASGEDNFIV--EEEVEEQQPKKVRKGVGPKARNAFSSHKA

Query:  RKKNGRKRQRFSYQ--EEDDGDEDDEDYSVDDDNDYEEEEEEEEEDVDVDEE-----VEDEDEDFMLEEEDFSDDEEPLVRKRATNMKRGRQGVRKNTIG
        RKKNGRKR+RFSYQ  EEDDGDEDDEDYS+D+DNDYEEEEEEEEE+ + +EE     +EDEDE+FM+EEEDFSD+EEPLVRK+ATN KRGRQGVRKNT+G
Subjt:  RKKNGRKRQRFSYQ--EEDDGDEDDEDYSVDDDNDYEEEEEEEEEDVDVDEE-----VEDEDEDFMLEEEDFSDDEEPLVRKRATNMKRGRQGVRKNTIG

Query:  KVSKKRKPKAAKKPSRNKQRKKTGTRTLRNSDDDDFLDNYRVVKNTRRKKHVSKRKNYVAQSDLDTFLSGSSDYEYTISEEEREQVREAEKLCGHLRNRI
        KV KKRKPKAAKKP+RNK+RKK+ T    + DDDDF +N  VVKNTR+K+ VSKRKN+V QSDLDTF+SGSSDYEYTISEEEREQVREAEKLCGHLRNR 
Subjt:  KVSKKRKPKAAKKPSRNKQRKKTGTRTLRNSDDDDFLDNYRVVKNTRRKKHVSKRKNYVAQSDLDTFLSGSSDYEYTISEEEREQVREAEKLCGHLRNRI

Query:  RTVPSPPRIEESDLRQQRKPRPPIRKGKEKVEEVKVEVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPARSTAG
        RTVPSP RIEES+ RQ RKPRPP+RKGKEK+E+VK EVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPARS AG
Subjt:  RTVPSPPRIEESDLRQQRKPRPPIRKGKEKVEEVKVEVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPARSTAG

Query:  IDLREVVIQVPERDQVYQPSEEELRSYLDPYENVICTECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYCVDCTTIAQGSSSPQ-PPSRLSDRR
        IDLREVVIQVPERDQVYQPSEEELRSYLDPYENVICTECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYCVDCT +AQG SSPQ PP+RLSDRR
Subjt:  IDLREVVIQVPERDQVYQPSEEELRSYLDPYENVICTECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYCVDCTTIAQGSSSPQ-PPSRLSDRR

Query:  TTNNLFNRTFPVANRDGLDLNSLSSPRTPNIQGFGNIPSPRLPVEVQSTSPMSQAVAPTLVRRRILRLHINHLRSSNQMGLVNNRADGVS----AANPCG
        TTNNLFNR+FPVANRDGLDLNSLSSPRTP  QGFG+IPSPR+PVEVQSTSPMSQAVAPTLVRRRILRLHINH+RSSNQMGLVNNR +GVS    AA+ CG
Subjt:  TTNNLFNRTFPVANRDGLDLNSLSSPRTPNIQGFGNIPSPRLPVEVQSTSPMSQAVAPTLVRRRILRLHINHLRSSNQMGLVNNRADGVS----AANPCG

Query:  GGTLSPQMGQGRETTAEHTITQETAIPSQTMFGESLLHDNPSPLMQHVVFLGSETSHLPRQVMQDPHLTMPM--ERTSGNGTILNTLR-GLEVEDSTTVD
        G T+   MG            QET + SQ++FGESLLHDNPSPLMQH VFL SET+ +  QV+QDPHL+M      TS NGTI NTLR G + E+STTVD
Subjt:  GGTLSPQMGQGRETTAEHTITQETAIPSQTMFGESLLHDNPSPLMQHVVFLGSETSHLPRQVMQDPHLTMPM--ERTSGNGTILNTLR-GLEVEDSTTVD

Query:  RNLNSALWHEHAVVNSLSNCEQIHQYNNVLNTGSDNSLLPP-VGEEKDYCAAREHLQPMIEHHLKN-LSRDVDLGQSTANDIATKAAGTILAACGFKHSI
        RNLN  LWHE A VN LSNCE IH+Y+N+LNTGSD+S  P  VG+EKDYCAAREHLQPMIE HLKN LS DVDLG+STAN+IATKAAGTILAACGFK+SI
Subjt:  RNLNSALWHEHAVVNSLSNCEQIHQYNNVLNTGSDNSLLPP-VGEEKDYCAAREHLQPMIEHHLKN-LSRDVDLGQSTANDIATKAAGTILAACGFKHSI

Query:  TEAC-TLPPSRCSHIELSMGEGQRSLIKGLCSTCFDSYVRDVVKKITDDA-WLNLTL
         EAC   PPS+CSHIELS+GE QRSLIKGLCS CFDSYVRDVVKK TDDA WL+LTL
Subjt:  TEAC-TLPPSRCSHIELSMGEGQRSLIKGLCSTCFDSYVRDVVKKITDDA-WLNLTL

SwissProt top hitse value%identityAlignment
A6H619 PHD and RING finger domain-containing protein 17.2e-1230.94Show/hide
Query:  CGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPARSTAGIDLREVVIQVPERDQVYQPSEEELRSYLDPYENVICTECHEG
        C ICL+    + V GT   C+H+FC  CI+EW++  + CP+ +  F+ I   A+        +++ ++P  +     +EEE     DP     C  C   
Subjt:  CGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPARSTAGIDLREVVIQVPERDQVYQPSEEELRSYLDPYENVICTECHEG

Query:  GDDNLMLLCDLCDSPAHTYCVGLG-REVPEGNWYCVDCT
          ++ +LLCD CD+  H  C+    +EVP   W+C +CT
Subjt:  GDDNLMLLCDLCDSPAHTYCVGLG-REVPEGNWYCVDCT

Arabidopsis top hitse value%identityAlignment
AT3G01460.1 methyl-CPG-binding domain 92.9e-0835.21Show/hide
Query:  PYENVICTECHEGGDDNLMLLCDLCDSPAHTYCVGLGR-EVPEGNWYCVDCTTIAQGSSSPQPPSRLSDRR
        P++  +C  C    DD+ +LLCD CD+  HTYC+      +P+GNWYC  C    + +       +L  RR
Subjt:  PYENVICTECHEGGDDNLMLLCDLCDSPAHTYCVGLGR-EVPEGNWYCVDCTTIAQGSSSPQPPSRLSDRR

AT3G05670.1 RING/U-box protein1.1e-12738.18Show/hide
Query:  KNFKKKFRPKDKGSDDSDEDYVVSNDENGISECS-DEGYCYYLDENASGE----DNFIVEEEVEEQQPKKV--------RKGVGPKARNAFSSHKARKKN
        +N  +K R KDKGSD+SDEDYV+S+++   SE    E Y   +D   S +     + + +EE++E + + V        +   GP+     +  K+RK N
Subjt:  KNFKKKFRPKDKGSDDSDEDYVVSNDENGISECS-DEGYCYYLDENASGE----DNFIVEEEVEEQQPKKV--------RKGVGPKARNAFSSHKARKKN

Query:  ---------------------------------------------GRKRQRFSYQEEDDGDEDDEDYSVDDDNDYE------------------------
                                                     G++R+ F   E++DGD  +ED   +++ D E                        
Subjt:  ---------------------------------------------GRKRQRFSYQEEDDGDEDDEDYSVDDDNDYE------------------------

Query:  -EEEEEEEEDVDVDEEVEDEDEDFMLEEEDFSDDEEP----LVRKRATNMKRGRQGVRKNTIGKVSKKRKPKAAKKPSRNKQRKKTGTRTLRNSDDDDFL
         E E+E++ D + ++E +D DEDF  +E+   D+EE        K A  + +G +  R++  G+  +K+   A  + +R ++R+   T+   + DDDDF+
Subjt:  -EEEEEEEEDVDVDEEVEDEDEDFMLEEEDFSDDEEP----LVRKRATNMKRGRQGVRKNTIGKVSKKRKPKAAKKPSRNKQRKKTGTRTLRNSDDDDFL

Query:  DNYRVVKNTRRKK--HVSKRKNYVAQSDLDTFLSGSSDYEYTISEEEREQVREAEKLCGHLRNRIRTVPSPPRIEESDLRQQRKPRPPIRKGKEKVEEVK
        D+    +   + K      R+     SD D   SG SDYEYTISEEEREQ+REA  L     N   ++         DL Q RK   P++KG++KVE VK
Subjt:  DNYRVVKNTRRKK--HVSKRKNYVAQSDLDTFLSGSSDYEYTISEEEREQVREAEKLCGHLRNRIRTVPSPPRIEESDLRQQRKPRPPIRKGKEKVEEVK

Query:  VEVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPARSTAGIDLREVVIQVPERDQVYQPSEEELRSYLDPYENVI
         +VIK VCGICLSEED RR++GTL+CCSH+FCF+CIMEW+KVESRCPLCKQRF+TISKPARST G+DLREVVI VPERDQVYQP+EEELRSYLDPYEN+I
Subjt:  VEVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPARSTAGIDLREVVIQVPERDQVYQPSEEELRSYLDPYENVI

Query:  CTECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYCVDCTTIAQGSSSPQPPSRLSDRRTTNNLFNRTFP-VANRDGLDLNSLSSPRTPNIQGFG
        CTECH+G DD LMLLCDLCDS AHTYCVGLGREVPEGNWYC  C  +A GS+S Q    +S+++  +  ++R  P V +    D++ + SPRTP   G  
Subjt:  CTECHEGGDDNLMLLCDLCDSPAHTYCVGLGREVPEGNWYCVDCTTIAQGSSSPQPPSRLSDRRTTNNLFNRTFP-VANRDGLDLNSLSSPRTPNIQGFG

Query:  NIPSPRLP-VEVQSTSPMSQAVAPTLVRRRILRLHINHLRSSNQMGLVNNRADGVSAANPCGGGTLSPQMGQGRETTAEHTI-TQETAIPSQTMFGESLL
        N+ SPR+P  + Q +SP S   A TL RRR L  HI ++ + +++  +  R  G S+      G ++ Q+G GR       + +QET I S     E  L
Subjt:  NIPSPRLP-VEVQSTSPMSQAVAPTLVRRRILRLHINHLRSSNQMGLVNNRADGVSAANPCGGGTLSPQMGQGRETTAEHTI-TQETAIPSQTMFGESLL

Query:  HDNPSPLMQHVVFLGSETSHLPRQVMQDPHLTMPMERTSGNGTILNTLRGLEVEDSTTVDRNLNSALWHEHAVVNSLSNCEQIHQYNNVLNTGSDNSLLP
         +N S +  H                 DP L  P     G+      L          +D   +  L     +V   SN +++H  +++ NT S      
Subjt:  HDNPSPLMQHVVFLGSETSHLPRQVMQDPHLTMPMERTSGNGTILNTLRGLEVEDSTTVDRNLNSALWHEHAVVNSLSNCEQIHQYNNVLNTGSDNSLLP

Query:  PVGEEKDYCAAREHLQPMIEHHLKNLSRDVDLGQSTANDIATKAAGTILAACGFKHSITEACTLPPS-RCSHIELSMGEGQ-----RSLIKGLCSTCFDS
                  A E LQ  ++ HLKNLS   DLGQ+T ++I+T +  TILAACG +H  +E   +PP   C+H  ++ G         SL+KG C +CFDS
Subjt:  PVGEEKDYCAAREHLQPMIEHHLKNLSRDVDLGQSTANDIATKAAGTILAACGFKHSITEACTLPPS-RCSHIELSMGEGQ-----RSLIKGLCSTCFDS

Query:  YVRDVVKKITDDA---WLNLTL
        +V DVVK I D     WL+L L
Subjt:  YVRDVVKKITDDA---WLNLTL

AT4G10940.1 RING/U-box protein5.1e-1331.18Show/hide
Query:  EVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPARSTAG---------IDLREVVIQVPERDQVYQPSEEELRSY
        E+  + CGIC+   D    RG L+CC H+FCF CI  W+ + + CPLC++ FQ I+      +G         +   E      E D V  PS     ++
Subjt:  EVIKQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPARSTAG---------IDLREVVIQVPERDQVYQPSEEELRSY

Query:  LDPYENVIC----------TECHEGGDDNL--MLLCDLCDSPAHTYCVGLGRE-VPEGNWYCVDCTTIAQ
              V+C          T  +  GD NL   + CD CD   H  CVG   E   E  W C    T+ Q
Subjt:  LDPYENVIC----------TECHEGGDDNL--MLLCDLCDSPAHTYCVGLGRE-VPEGNWYCVDCTTIAQ

AT5G09790.1 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 51.1e-0742.62Show/hide
Query:  EEELRSYLDPYENVICTECHEGGDDNLMLLCDLCDSPAHTYCV-GLGREVPEGNWYCVDCT
        E+E     D Y NV C +C  G  D+ +LLCD CD   H  C+  +   VP G W CVDC+
Subjt:  EEELRSYLDPYENVICTECHEGGDDNLMLLCDLCDSPAHTYCV-GLGREVPEGNWYCVDCT

AT5G09790.2 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 53.8e-0838.75Show/hide
Query:  EEELRSYLDPYENVICTECHEGGDDNLMLLCDLCDSPAHTYCV-GLGREVPEGNWYCVDCTTIAQGSSSPQPPSRLSDRR
        E+E     D Y NV C +C  G  D+ +LLCD CD   H  C+  +   VP G W CVDC       S  +P  RLS ++
Subjt:  EEELRSYLDPYENVICTECHEGGDDNLMLLCDLCDSPAHTYCV-GLGREVPEGNWYCVDCTTIAQGSSSPQPPSRLSDRR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGCGAGGAACAGATGGTAAGGGGAGGGAAGGTTGGTAGTAGGAAAAATTTCAAGAAGAAATTTCGACCAAAAGATAAGGGTTCGGATGATTCCGATGAGGATTATGT
TGTTTCCAATGATGAAAATGGAATTTCTGAGTGTTCTGATGAGGGTTATTGCTACTATTTAGATGAAAATGCATCAGGCGAGGATAATTTTATCGTGGAGGAGGAGGTGG
AGGAGCAGCAACCGAAGAAGGTGAGGAAAGGAGTCGGACCGAAGGCAAGAAATGCTTTTAGTTCGCATAAAGCGAGGAAGAAAAATGGACGAAAGAGGCAAAGATTTAGT
TATCAAGAAGAAGATGACGGGGACGAGGATGATGAGGATTATAGTGTAGATGATGATAATGATTATGAGGAGGAGGAAGAAGAGGAAGAGGAAGATGTTGATGTTGATGA
GGAGGTTGAGGATGAGGATGAGGACTTTATGCTGGAAGAAGAAGATTTTTCGGATGACGAAGAACCGCTTGTTAGAAAGAGAGCAACCAACATGAAACGGGGTCGACAAG
GGGTGAGAAAGAATACTATTGGCAAAGTTTCCAAAAAGAGGAAACCAAAGGCTGCAAAGAAGCCTTCAAGGAATAAACAGAGAAAGAAAACTGGCACACGGACTCTAAGA
AACTCAGATGATGATGATTTTTTGGACAATTACCGTGTAGTAAAGAATACGAGAAGGAAAAAGCATGTTTCAAAGCGGAAAAATTATGTTGCACAATCAGATTTAGATAC
CTTCCTTTCTGGGTCATCAGATTATGAATATACCATCTCTGAAGAAGAGAGAGAGCAGGTGAGAGAAGCTGAGAAGTTGTGTGGTCATTTGAGAAATCGAATAAGGACTG
TACCTTCACCCCCAAGGATTGAGGAGTCCGATCTACGCCAGCAGAGAAAACCTCGACCTCCTATTAGGAAAGGCAAAGAGAAAGTGGAGGAAGTGAAGGTTGAGGTGATA
AAGCAAGTATGCGGGATATGCCTATCGGAGGAAGATAAGAGGAGAGTTCGGGGAACGCTCAACTGTTGTTCTCACTTTTTTTGTTTCTCTTGTATTATGGAGTGGGCAAA
GGTTGAATCACGATGCCCTTTGTGCAAGCAGAGGTTTCAGACAATTAGTAAGCCTGCAAGATCTACTGCTGGAATTGACTTGAGAGAAGTCGTCATTCAGGTTCCCGAGC
GTGATCAGGTCTATCAACCATCCGAAGAAGAATTGAGGAGTTATCTCGATCCCTATGAAAATGTAATCTGTACTGAGTGTCACGAAGGTGGGGATGATAATCTAATGTTA
CTGTGTGATCTATGTGACTCACCGGCACACACCTATTGTGTCGGTCTTGGTAGAGAAGTACCAGAAGGGAATTGGTACTGTGTTGATTGTACAACCATTGCTCAGGGATC
ATCAAGTCCACAGCCTCCAAGCCGACTGTCTGATCGAAGAACAACCAACAATTTGTTTAACAGAACTTTCCCTGTTGCAAATCGGGATGGTTTAGACTTGAATTCTCTAT
CTTCACCTCGTACGCCTAATATTCAAGGATTTGGAAACATCCCGTCTCCTCGATTACCTGTGGAAGTCCAATCAACTTCTCCCATGTCTCAAGCAGTTGCGCCAACTCTT
GTAAGAAGACGAATTCTTCGGCTGCACATCAACCACCTGCGCTCTTCAAATCAAATGGGATTAGTGAACAACAGGGCTGATGGTGTTTCTGCTGCTAATCCTTGTGGTGG
AGGTACCTTAAGTCCGCAGATGGGTCAAGGTAGGGAAACAACAGCAGAACATACGATAACTCAAGAAACAGCAATACCGAGCCAAACTATGTTTGGGGAGAGCTTATTAC
ATGACAATCCCTCTCCATTGATGCAACATGTAGTTTTTCTCGGCTCCGAAACTAGTCATTTGCCTAGGCAAGTAATGCAAGATCCTCACCTCACCATGCCAATGGAACGA
ACCAGCGGCAATGGAACAATTTTGAACACATTGAGAGGATTAGAAGTCGAAGATTCGACAACTGTTGATCGAAATTTGAATAGTGCATTGTGGCATGAGCATGCCGTGGT
GAATTCACTATCAAACTGTGAGCAGATTCATCAATACAACAATGTATTAAACACAGGCTCTGATAACAGTTTACTCCCTCCGGTCGGAGAAGAGAAAGATTATTGTGCAG
CGAGGGAGCACCTACAACCCATGATCGAACACCATTTGAAGAATTTATCCAGAGATGTTGATCTAGGTCAAAGCACAGCAAATGACATTGCAACAAAAGCTGCTGGCACG
ATTTTAGCCGCATGTGGGTTCAAACACTCAATAACCGAGGCTTGTACATTGCCACCATCTCGATGTTCCCACATTGAATTGTCAATGGGGGAAGGGCAAAGAAGCCTGAT
AAAAGGCCTCTGCTCGACTTGTTTTGACTCGTACGTTCGGGACGTAGTGAAGAAGATTACAGACGATGCATGGTTGAATCTCACCCTATAG
mRNA sequenceShow/hide mRNA sequence
ATGTGCGAGGAACAGATGGTAAGGGGAGGGAAGGTTGGTAGTAGGAAAAATTTCAAGAAGAAATTTCGACCAAAAGATAAGGGTTCGGATGATTCCGATGAGGATTATGT
TGTTTCCAATGATGAAAATGGAATTTCTGAGTGTTCTGATGAGGGTTATTGCTACTATTTAGATGAAAATGCATCAGGCGAGGATAATTTTATCGTGGAGGAGGAGGTGG
AGGAGCAGCAACCGAAGAAGGTGAGGAAAGGAGTCGGACCGAAGGCAAGAAATGCTTTTAGTTCGCATAAAGCGAGGAAGAAAAATGGACGAAAGAGGCAAAGATTTAGT
TATCAAGAAGAAGATGACGGGGACGAGGATGATGAGGATTATAGTGTAGATGATGATAATGATTATGAGGAGGAGGAAGAAGAGGAAGAGGAAGATGTTGATGTTGATGA
GGAGGTTGAGGATGAGGATGAGGACTTTATGCTGGAAGAAGAAGATTTTTCGGATGACGAAGAACCGCTTGTTAGAAAGAGAGCAACCAACATGAAACGGGGTCGACAAG
GGGTGAGAAAGAATACTATTGGCAAAGTTTCCAAAAAGAGGAAACCAAAGGCTGCAAAGAAGCCTTCAAGGAATAAACAGAGAAAGAAAACTGGCACACGGACTCTAAGA
AACTCAGATGATGATGATTTTTTGGACAATTACCGTGTAGTAAAGAATACGAGAAGGAAAAAGCATGTTTCAAAGCGGAAAAATTATGTTGCACAATCAGATTTAGATAC
CTTCCTTTCTGGGTCATCAGATTATGAATATACCATCTCTGAAGAAGAGAGAGAGCAGGTGAGAGAAGCTGAGAAGTTGTGTGGTCATTTGAGAAATCGAATAAGGACTG
TACCTTCACCCCCAAGGATTGAGGAGTCCGATCTACGCCAGCAGAGAAAACCTCGACCTCCTATTAGGAAAGGCAAAGAGAAAGTGGAGGAAGTGAAGGTTGAGGTGATA
AAGCAAGTATGCGGGATATGCCTATCGGAGGAAGATAAGAGGAGAGTTCGGGGAACGCTCAACTGTTGTTCTCACTTTTTTTGTTTCTCTTGTATTATGGAGTGGGCAAA
GGTTGAATCACGATGCCCTTTGTGCAAGCAGAGGTTTCAGACAATTAGTAAGCCTGCAAGATCTACTGCTGGAATTGACTTGAGAGAAGTCGTCATTCAGGTTCCCGAGC
GTGATCAGGTCTATCAACCATCCGAAGAAGAATTGAGGAGTTATCTCGATCCCTATGAAAATGTAATCTGTACTGAGTGTCACGAAGGTGGGGATGATAATCTAATGTTA
CTGTGTGATCTATGTGACTCACCGGCACACACCTATTGTGTCGGTCTTGGTAGAGAAGTACCAGAAGGGAATTGGTACTGTGTTGATTGTACAACCATTGCTCAGGGATC
ATCAAGTCCACAGCCTCCAAGCCGACTGTCTGATCGAAGAACAACCAACAATTTGTTTAACAGAACTTTCCCTGTTGCAAATCGGGATGGTTTAGACTTGAATTCTCTAT
CTTCACCTCGTACGCCTAATATTCAAGGATTTGGAAACATCCCGTCTCCTCGATTACCTGTGGAAGTCCAATCAACTTCTCCCATGTCTCAAGCAGTTGCGCCAACTCTT
GTAAGAAGACGAATTCTTCGGCTGCACATCAACCACCTGCGCTCTTCAAATCAAATGGGATTAGTGAACAACAGGGCTGATGGTGTTTCTGCTGCTAATCCTTGTGGTGG
AGGTACCTTAAGTCCGCAGATGGGTCAAGGTAGGGAAACAACAGCAGAACATACGATAACTCAAGAAACAGCAATACCGAGCCAAACTATGTTTGGGGAGAGCTTATTAC
ATGACAATCCCTCTCCATTGATGCAACATGTAGTTTTTCTCGGCTCCGAAACTAGTCATTTGCCTAGGCAAGTAATGCAAGATCCTCACCTCACCATGCCAATGGAACGA
ACCAGCGGCAATGGAACAATTTTGAACACATTGAGAGGATTAGAAGTCGAAGATTCGACAACTGTTGATCGAAATTTGAATAGTGCATTGTGGCATGAGCATGCCGTGGT
GAATTCACTATCAAACTGTGAGCAGATTCATCAATACAACAATGTATTAAACACAGGCTCTGATAACAGTTTACTCCCTCCGGTCGGAGAAGAGAAAGATTATTGTGCAG
CGAGGGAGCACCTACAACCCATGATCGAACACCATTTGAAGAATTTATCCAGAGATGTTGATCTAGGTCAAAGCACAGCAAATGACATTGCAACAAAAGCTGCTGGCACG
ATTTTAGCCGCATGTGGGTTCAAACACTCAATAACCGAGGCTTGTACATTGCCACCATCTCGATGTTCCCACATTGAATTGTCAATGGGGGAAGGGCAAAGAAGCCTGAT
AAAAGGCCTCTGCTCGACTTGTTTTGACTCGTACGTTCGGGACGTAGTGAAGAAGATTACAGACGATGCATGGTTGAATCTCACCCTATAG
Protein sequenceShow/hide protein sequence
MCEEQMVRGGKVGSRKNFKKKFRPKDKGSDDSDEDYVVSNDENGISECSDEGYCYYLDENASGEDNFIVEEEVEEQQPKKVRKGVGPKARNAFSSHKARKKNGRKRQRFS
YQEEDDGDEDDEDYSVDDDNDYEEEEEEEEEDVDVDEEVEDEDEDFMLEEEDFSDDEEPLVRKRATNMKRGRQGVRKNTIGKVSKKRKPKAAKKPSRNKQRKKTGTRTLR
NSDDDDFLDNYRVVKNTRRKKHVSKRKNYVAQSDLDTFLSGSSDYEYTISEEEREQVREAEKLCGHLRNRIRTVPSPPRIEESDLRQQRKPRPPIRKGKEKVEEVKVEVI
KQVCGICLSEEDKRRVRGTLNCCSHFFCFSCIMEWAKVESRCPLCKQRFQTISKPARSTAGIDLREVVIQVPERDQVYQPSEEELRSYLDPYENVICTECHEGGDDNLML
LCDLCDSPAHTYCVGLGREVPEGNWYCVDCTTIAQGSSSPQPPSRLSDRRTTNNLFNRTFPVANRDGLDLNSLSSPRTPNIQGFGNIPSPRLPVEVQSTSPMSQAVAPTL
VRRRILRLHINHLRSSNQMGLVNNRADGVSAANPCGGGTLSPQMGQGRETTAEHTITQETAIPSQTMFGESLLHDNPSPLMQHVVFLGSETSHLPRQVMQDPHLTMPMER
TSGNGTILNTLRGLEVEDSTTVDRNLNSALWHEHAVVNSLSNCEQIHQYNNVLNTGSDNSLLPPVGEEKDYCAAREHLQPMIEHHLKNLSRDVDLGQSTANDIATKAAGT
ILAACGFKHSITEACTLPPSRCSHIELSMGEGQRSLIKGLCSTCFDSYVRDVVKKITDDAWLNLTL