; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC03g0653 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC03g0653
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionAuxilin-related protein 2 isoform X2
Genome locationMC03:13510530..13513785
RNA-Seq ExpressionMC03g0653
SyntenyMC03g0653
Gene Ontology termsNA
InterPro domainsIPR001623 - DnaJ domain
IPR036869 - Chaperone J-domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7012546.1 Auxilin-related protein 1 [Cucurbita argyrosperma subsp. argyrosperma]2.29e-25579.92Show/hide
Query:  MDQTWRMRFGIPRFRSRRSEDET----LHPKPTTFHPDDFSDVFGGPLRTVISRQFSD------SSSFYEEVFRSTEFDSRPRKAGRSLPAFRIPVKEDR
        M+Q WR+R GIPRFRSRR E+ET          TF  ++FSDVFGGP RT++ RQFSD      S+SFYEE+FRSTEF S+P+K GRSLPAFRIPVKEDR
Subjt:  MDQTWRMRFGIPRFRSRRSEDET----LHPKPTTFHPDDFSDVFGGPLRTVISRQFSD------SSSFYEEVFRSTEFDSRPRKAGRSLPAFRIPVKEDR

Query:  FYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQYVEDEYDDGFR
        FYRDVFGSDD RRSRDRSEP SKEFTRSNSSSDLSPLRP++G+DVAFPSSS NHR SNV  QWNSY +MFKEQE+PQFPPDLSAHIDN YVED+YDD  R
Subjt:  FYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQYVEDEYDDGFR

Query:  SSDHGFGDPMSSPETVGLEPNSYRSIKIPVDDLELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDD--EDAMSSYVIEINSINREEYREEVSIDDAIA
        SS HGF + +SS ETV + PNS+RSIKI V+DLELNSPSSA SSLC+DPVYYG IH NVLPEDDDDDD  EDAMSSYVIEINSINREEYREEVSID+AIA
Subjt:  SSDHGFGDPMSSPETVGLEPNSYRSIKIPVDDLELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDD--EDAMSSYVIEINSINREEYREEVSIDDAIA

Query:  WAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLN--GNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHY
        WAKS  QSP SETDLS R QESEQSGEEEGRPV FEFADQQL+  G+LQT ETQ+RDVKV+EG P VDI+RELEGLDEKIKLWS GKETNIRLLLSTLHY
Subjt:  WAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLN--GNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHY

Query:  ILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN
        ILWSSSGWS ISL NLIG SQVKKAYQKARLCLHPDKLQQRGATMLQKYVA+KAF ILQDAW+ YISQDVFLN
Subjt:  ILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN

XP_008444679.1 PREDICTED: uncharacterized protein LOC103487941 [Cucumis melo]6.99e-25980.21Show/hide
Query:  MDQTWRMRFGIPRFRSRRSEDETLHPKPT------TFHPDDFSDVFGGPLRTVISRQFSD-------SSSFYEEVFRSTEFDSRPRKAGRSLPAFRIPVK
        MD TWR+RFGIPRFRSRRSE ++L PKPT      TF  DDFSDVFGGP +T++ RQFS+       ++SFY+EVFRS++  SRP+KAGRSLPAFRIPVK
Subjt:  MDQTWRMRFGIPRFRSRRSEDETLHPKPT------TFHPDDFSDVFGGPLRTVISRQFSD-------SSSFYEEVFRSTEFDSRPRKAGRSLPAFRIPVK

Query:  EDRFYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQYVEDEYDD
        EDRFYRDVFGS+DGRRSRDRSEPSSKEFTRSNSSSD +PL P++GDDVAFPSSSSNHR SNV  QWNSYRTMFKEQE+PQF PDLS H DN+YVEDEYDD
Subjt:  EDRFYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQYVEDEYDD

Query:  GFRSSDHGFGDPMSSPETVGLEPNSYRSIKIPVDD-LELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDA
         ++SSDHGFG+P+SSPETV LEPNS+RSIKI VDD LE+NSPSS  SSLCEDPVYYG  +CNVLPEDD DD ED MSSYVIEI SINREEYREEVSID+A
Subjt:  GFRSSDHGFGDPMSSPETVGLEPNSYRSIKIPVDD-LELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDA

Query:  IAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGN--LQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTL
        IAWAKSKYQS  SETDLS R QESEQSGEEEGRPVAFE +DQQ NGN   QTAETQQR+VKVEE  PQ++ DRELEGLDEKIKLWSAGKETNIRLLLSTL
Subjt:  IAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGN--LQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTL

Query:  HYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN
        HYILWSSSGWS ISL NLIGG+QVKKAYQKARLCLHPDKLQQRGATMLQKYVA+KAFTILQ+AW+ YISQD F+N
Subjt:  HYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN

XP_022139887.1 uncharacterized protein LOC111010691 [Momordica charantia]0.0100Show/hide
Query:  MDQTWRMRFGIPRFRSRRSEDETLHPKPTTFHPDDFSDVFGGPLRTVISRQFSDSSSFYEEVFRSTEFDSRPRKAGRSLPAFRIPVKEDRFYRDVFGSDD
        MDQTWRMRFGIPRFRSRRSEDETLHPKPTTFHPDDFSDVFGGPLRTVISRQFSDSSSFYEEVFRSTEFDSRPRKAGRSLPAFRIPVKEDRFYRDVFGSDD
Subjt:  MDQTWRMRFGIPRFRSRRSEDETLHPKPTTFHPDDFSDVFGGPLRTVISRQFSDSSSFYEEVFRSTEFDSRPRKAGRSLPAFRIPVKEDRFYRDVFGSDD

Query:  GRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQYVEDEYDDGFRSSDHGFGDPM
        GRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQYVEDEYDDGFRSSDHGFGDPM
Subjt:  GRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQYVEDEYDDGFRSSDHGFGDPM

Query:  SSPETVGLEPNSYRSIKIPVDDLELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDAIAWAKSKYQSPTSE
        SSPETVGLEPNSYRSIKIPVDDLELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDAIAWAKSKYQSPTSE
Subjt:  SSPETVGLEPNSYRSIKIPVDDLELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDAIAWAKSKYQSPTSE

Query:  TDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSAISLA
        TDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSAISLA
Subjt:  TDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSAISLA

Query:  NLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN
        NLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN
Subjt:  NLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN

XP_023541267.1 uncharacterized protein LOC111801489 [Cucurbita pepo subsp. pepo]1.55e-25580.08Show/hide
Query:  MDQTWRMRFGIPRFRSRRSEDET----LHPKPTTFHPDDFSDVFGGPLRTVISRQFSD------SSSFYEEVFRSTEFDSRPRKAGRSLPAFRIPVKEDR
        M+Q WR+R GIPRFRSRR E+ET          TF  ++FSDVFGGP RT++ RQFSD      S+SFYEE+FRSTEF S+P+K GRSLPAFRIPVKEDR
Subjt:  MDQTWRMRFGIPRFRSRRSEDET----LHPKPTTFHPDDFSDVFGGPLRTVISRQFSD------SSSFYEEVFRSTEFDSRPRKAGRSLPAFRIPVKEDR

Query:  FYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQYVEDEYDDGFR
        FYRDVFGSDD RRSRDRSEP SKEFTRSNSSSDLSPLRP++G+DVAFPSSS NHR SNV  QWNSY TMFKEQE+PQFPPDLSA IDN YVEDEYDD  R
Subjt:  FYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQYVEDEYDDGFR

Query:  SSDHGFGDPMSSPETVGLEPNSYRSIKIPVDDLELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDD-EDAMSSYVIEINSINREEYREEVSIDDAIAW
        SS HGF + +SS ETV + PNS+RSIKI V+DLELNSPSSA SSLCEDPVYYG IH NVLPEDDDDDD ED MSSYVIEINSINREEYREEVSID+AIAW
Subjt:  SSDHGFGDPMSSPETVGLEPNSYRSIKIPVDDLELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDD-EDAMSSYVIEINSINREEYREEVSIDDAIAW

Query:  AKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLN--GNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYI
        AKS YQSP SETDLS R QESEQSGEEEGRPV FEFADQQL+  G+LQT ETQ+RDVKV+EG P VDI+RE+EGLDEKIKLWS GKETNIRLLLSTLHYI
Subjt:  AKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLN--GNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYI

Query:  LWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN
        LWSSSGWS ISL NLIG SQVKKAYQKARLCLHPDKLQQRGATMLQKYVA+KAF ILQDAW+ YIS+DVFLN
Subjt:  LWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN

XP_038895184.1 uncharacterized protein LOC120083484 [Benincasa hispida]2.58e-26981.25Show/hide
Query:  SSTESHISSFFQFLHQILA---RIMDQTWRMRFGIPRFRSRRSEDETLHPKPT-----TFHPDDFSDVFGGPLRTVISRQFSD------SSSFYEEVFRS
        S + S   S   FLH   +   R MD +WR+RFGIPRFRSRRSEDETL  KPT     TFH DDFSDVFGGP +T++ RQFSD      S+SFYEEVFRS
Subjt:  SSTESHISSFFQFLHQILA---RIMDQTWRMRFGIPRFRSRRSEDETLHPKPT-----TFHPDDFSDVFGGPLRTVISRQFSD------SSSFYEEVFRS

Query:  TEFDSRPRKAGRSLPAFRIPVKEDRFYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEV
        TE  SRP+K GRSLPAFRIPVKEDRFYRDVFGS+DGRRSRDRSEPSSKEFTRSNSSSD SPLRP++ DDVAFPSSSSNHR SNV  QWNSYRTMFKEQE+
Subjt:  TEFDSRPRKAGRSLPAFRIPVKEDRFYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEV

Query:  PQFPPDLSAHIDNQYVEDEYDDGFRSSDHGFGDPMSSPETVGLEPNSYRSIKIPVDDLELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSY
        PQF PDL+ HIDN+YVEDEY+D +RSSDHGFG+PMSSPETV LEPNS+RSI+I VDDLELNSPSSA SSLCEDPVY G I+CNVLPEDDDDD EDAMSSY
Subjt:  PQFPPDLSAHIDNQYVEDEYDDGFRSSDHGFGDPMSSPETVGLEPNSYRSIKIPVDDLELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSY

Query:  VIEINSINREEYREEVSIDDAIAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGN--LQTAETQQRDVKVEEGTPQVDIDRELEGLD
        VIEI SINREEYREEVSID+AIAWAKSKYQS  SETDLS R QESEQSGEEEGRPVAFE++DQQ NGN   QTAETQQRDVKVEE  PQVD DRELEGLD
Subjt:  VIEINSINREEYREEVSIDDAIAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGN--LQTAETQQRDVKVEEGTPQVDIDRELEGLD

Query:  EKIKLWSAGKETNIRLLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN
        EKIKLWSAGKETNIRLLLSTLHYILWSSSGWS ISL NLIGGSQVKKAYQKARLCLHPDKLQQRGAT LQKYVA+KAFTILQ+AW+ YISQD FLN
Subjt:  EKIKLWSAGKETNIRLLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN

TrEMBL top hitse value%identityAlignment
A0A0A0M107 Uncharacterized protein1.33e-25780.64Show/hide
Query:  MDQTWRMRFGIPRFRSRRSEDETLHPKPTT-FHPDDFSDVFGGPLRTVISRQFSD-------SSSFYEEVFRSTEFDSRPRKAGRSLPAFRIPVKEDRFY
        MD TWR+RFGIPRFRSRRSE +TL PKPT+ F  DDFSDVFGGP +T++ RQFS+       ++SFYEEVFRS+E  SRP+K GRSLPAFRIPVKEDRFY
Subjt:  MDQTWRMRFGIPRFRSRRSEDETLHPKPTT-FHPDDFSDVFGGPLRTVISRQFSD-------SSSFYEEVFRSTEFDSRPRKAGRSLPAFRIPVKEDRFY

Query:  RDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQYVEDEYDDGFRSS
        RDVFGS+DGRRSRDRSEPSSKEFTRSNSSSD + LRP++GDDVAFPSSSSNHR +NV  QWNSY TMFKEQE+PQF P LS H+DN+YVEDEYDD ++SS
Subjt:  RDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQYVEDEYDDGFRSS

Query:  DHGFGDPMSSPETVGLEPNSYRSIKIPVDD-LELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDAIAWAK
        DHGFG P+SSPETV LEPNS+RSIKI VDD LE+NSPSS  SSLCEDPVYY   +CNVLPEDDDDD EDAMSSYVIEI SINREEYREEVSID+AIAWAK
Subjt:  DHGFGDPMSSPETVGLEPNSYRSIKIPVDD-LELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDAIAWAK

Query:  SKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGN--LQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILW
        SKYQS  SETDLS R QESEQSGEEEGRPVAFE +DQQ NGN   QTAETQQR+VKVEE  PQ++IDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILW
Subjt:  SKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGN--LQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILW

Query:  SSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN
        SSSGWS ISL NLIGG+QVKKAYQKARLCLHPDKLQQRGAT LQK+VA+KAFTILQ+AW+ YISQD F+N
Subjt:  SSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN

A0A1S3BAV2 uncharacterized protein LOC1034879413.38e-25980.21Show/hide
Query:  MDQTWRMRFGIPRFRSRRSEDETLHPKPT------TFHPDDFSDVFGGPLRTVISRQFSD-------SSSFYEEVFRSTEFDSRPRKAGRSLPAFRIPVK
        MD TWR+RFGIPRFRSRRSE ++L PKPT      TF  DDFSDVFGGP +T++ RQFS+       ++SFY+EVFRS++  SRP+KAGRSLPAFRIPVK
Subjt:  MDQTWRMRFGIPRFRSRRSEDETLHPKPT------TFHPDDFSDVFGGPLRTVISRQFSD-------SSSFYEEVFRSTEFDSRPRKAGRSLPAFRIPVK

Query:  EDRFYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQYVEDEYDD
        EDRFYRDVFGS+DGRRSRDRSEPSSKEFTRSNSSSD +PL P++GDDVAFPSSSSNHR SNV  QWNSYRTMFKEQE+PQF PDLS H DN+YVEDEYDD
Subjt:  EDRFYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQYVEDEYDD

Query:  GFRSSDHGFGDPMSSPETVGLEPNSYRSIKIPVDD-LELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDA
         ++SSDHGFG+P+SSPETV LEPNS+RSIKI VDD LE+NSPSS  SSLCEDPVYYG  +CNVLPEDD DD ED MSSYVIEI SINREEYREEVSID+A
Subjt:  GFRSSDHGFGDPMSSPETVGLEPNSYRSIKIPVDD-LELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDA

Query:  IAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGN--LQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTL
        IAWAKSKYQS  SETDLS R QESEQSGEEEGRPVAFE +DQQ NGN   QTAETQQR+VKVEE  PQ++ DRELEGLDEKIKLWSAGKETNIRLLLSTL
Subjt:  IAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGN--LQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTL

Query:  HYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN
        HYILWSSSGWS ISL NLIGG+QVKKAYQKARLCLHPDKLQQRGATMLQKYVA+KAFTILQ+AW+ YISQD F+N
Subjt:  HYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN

A0A5A7UJY3 Auxilin-related protein 2 isoform X23.38e-25980.21Show/hide
Query:  MDQTWRMRFGIPRFRSRRSEDETLHPKPT------TFHPDDFSDVFGGPLRTVISRQFSD-------SSSFYEEVFRSTEFDSRPRKAGRSLPAFRIPVK
        MD TWR+RFGIPRFRSRRSE ++L PKPT      TF  DDFSDVFGGP +T++ RQFS+       ++SFY+EVFRS++  SRP+KAGRSLPAFRIPVK
Subjt:  MDQTWRMRFGIPRFRSRRSEDETLHPKPT------TFHPDDFSDVFGGPLRTVISRQFSD-------SSSFYEEVFRSTEFDSRPRKAGRSLPAFRIPVK

Query:  EDRFYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQYVEDEYDD
        EDRFYRDVFGS+DGRRSRDRSEPSSKEFTRSNSSSD +PL P++GDDVAFPSSSSNHR SNV  QWNSYRTMFKEQE+PQF PDLS H DN+YVEDEYDD
Subjt:  EDRFYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQYVEDEYDD

Query:  GFRSSDHGFGDPMSSPETVGLEPNSYRSIKIPVDD-LELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDA
         ++SSDHGFG+P+SSPETV LEPNS+RSIKI VDD LE+NSPSS  SSLCEDPVYYG  +CNVLPEDD DD ED MSSYVIEI SINREEYREEVSID+A
Subjt:  GFRSSDHGFGDPMSSPETVGLEPNSYRSIKIPVDD-LELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDA

Query:  IAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGN--LQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTL
        IAWAKSKYQS  SETDLS R QESEQSGEEEGRPVAFE +DQQ NGN   QTAETQQR+VKVEE  PQ++ DRELEGLDEKIKLWSAGKETNIRLLLSTL
Subjt:  IAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGN--LQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTL

Query:  HYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN
        HYILWSSSGWS ISL NLIGG+QVKKAYQKARLCLHPDKLQQRGATMLQKYVA+KAFTILQ+AW+ YISQD F+N
Subjt:  HYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN

A0A6J1CF64 uncharacterized protein LOC1110106910.0100Show/hide
Query:  MDQTWRMRFGIPRFRSRRSEDETLHPKPTTFHPDDFSDVFGGPLRTVISRQFSDSSSFYEEVFRSTEFDSRPRKAGRSLPAFRIPVKEDRFYRDVFGSDD
        MDQTWRMRFGIPRFRSRRSEDETLHPKPTTFHPDDFSDVFGGPLRTVISRQFSDSSSFYEEVFRSTEFDSRPRKAGRSLPAFRIPVKEDRFYRDVFGSDD
Subjt:  MDQTWRMRFGIPRFRSRRSEDETLHPKPTTFHPDDFSDVFGGPLRTVISRQFSDSSSFYEEVFRSTEFDSRPRKAGRSLPAFRIPVKEDRFYRDVFGSDD

Query:  GRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQYVEDEYDDGFRSSDHGFGDPM
        GRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQYVEDEYDDGFRSSDHGFGDPM
Subjt:  GRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQYVEDEYDDGFRSSDHGFGDPM

Query:  SSPETVGLEPNSYRSIKIPVDDLELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDAIAWAKSKYQSPTSE
        SSPETVGLEPNSYRSIKIPVDDLELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDAIAWAKSKYQSPTSE
Subjt:  SSPETVGLEPNSYRSIKIPVDDLELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDAIAWAKSKYQSPTSE

Query:  TDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSAISLA
        TDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSAISLA
Subjt:  TDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSAISLA

Query:  NLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN
        NLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN
Subjt:  NLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN

A0A6J1KL61 uncharacterized protein LOC1114955612.00e-25579.03Show/hide
Query:  MDQTWRMRFGIPRFRSRRSEDETLHPKPT-----TFHPDDFSDVFGGPLRTVISRQFSD------SSSFYEEVFRSTEFDSRPRKAGRSLPAFRIPVKED
        MD  WR+RFGIP+FRSRRSE ET+  KPT     TF  DDFSDVFGGP RT++ RQFSD      S+SFYEEVF+S E  S+P+K GRSLPAFRIP+KED
Subjt:  MDQTWRMRFGIPRFRSRRSEDETLHPKPT-----TFHPDDFSDVFGGPLRTVISRQFSD------SSSFYEEVFRSTEFDSRPRKAGRSLPAFRIPVKED

Query:  RFYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQYVEDEYDDGF
        RFYR +FGS+DGR+SRDRSEP+SKEFTRSNSSS  SP RP++GDDVAFPSSSSN R SNV  +W+SYRTMFKEQE+PQFPPD   HIDN YVE+E++D +
Subjt:  RFYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQYVEDEYDDGF

Query:  RSSDHGFGDPMSSPETVGLEPNSYRSIKIPVDDLELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDAIAW
        RSS H FG PMSSPET+ LEPNS+RSIKI VDDLE NSPSSA SS CEDPV YG I+CNVLPEDD+DD EDAMSSYVIEI SINREEYREEVSID+AIAW
Subjt:  RSSDHGFGDPMSSPETVGLEPNSYRSIKIPVDDLELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDAIAW

Query:  AKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGN--LQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYI
        AKSKYQSP SETDLSGR QESEQSGEEEGRPV+FE + QQLNGN   Q AET Q+DVK+EEG P+VDID+ELEGLDEKIKLWSAGKETNIRLLLSTLHYI
Subjt:  AKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGN--LQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYI

Query:  LWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN
        LWSSSGWS ISL NLIGGSQ+KKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQ+AWA Y+SQDVFLN
Subjt:  LWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN

SwissProt top hitse value%identityAlignment
O75061 Putative tyrosine-protein phosphatase auxilin2.2e-1132.45Show/hide
Query:  GRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWS-SSGWSAISLANLI
        G+ +  + S   EG+  A +F D  L+G    A   ++  +      + ++ +E++    KI  W  GKE NIR LLST+H +LW+  + W  + +A+L+
Subjt:  GRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWS-SSGWSAISLANLI

Query:  GGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQ
           QVKK Y+KA L +HPDK   +      K +    F  L DAW+ + +Q
Subjt:  GGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQ

Q0WQ57 Auxilin-related protein 26.3e-2740.44Show/hide
Query:  EEYREEVSIDDAIAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKE
        E  R+  S  + +    S + +P S+   SG  Q+ +   EE  R         Q       AE  +RD++V+    Q + DR    LD +I+ W AGKE
Subjt:  EEYREEVSIDDAIAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKE

Query:  TNIRLLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVF
         N+R LLSTL Y+LW   GW  +SL +LI G+ VKK Y+KA LC+HPDK+QQ+GA + QKY+AEK F +L++AW  + S+++F
Subjt:  TNIRLLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVF

Q9C9Q4 J domain-containing protein required for chloroplast accumulation response 12.0e-2540.59Show/hide
Query:  AKSKYQSPTSETDLSGRPQES-----------EQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIR
        A ++ Q P+  T      +ES           E+  E        E  D+  + N    +  Q + K+EE     +   E++ +D KI+ WS+GK  NIR
Subjt:  AKSKYQSPTSETDLSGRPQES-----------EQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIR

Query:  LLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAW
         LLSTL YILWS SGW  + L ++I G+ V+K+YQ+A L LHPDKLQQ+GA+  QKY+AEK F +LQ+AW
Subjt:  LLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAW

Q9FWS1 Auxilin-like protein 12.2e-2450.89Show/hide
Query:  AETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYV
        AE + RD+K ++   Q + +R  E LD  +K WS+GKE N+R L+STL YIL + SGW  I L +L+  + V+KAY+KA L +HPDKLQQRGA+  QKY+
Subjt:  AETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYV

Query:  AEKAFTILQDAW
         EK F +L++AW
Subjt:  AEKAFTILQDAW

Q9SU08 Auxilin-related protein 11.4e-2640.44Show/hide
Query:  EEYREEVSIDDAIAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKE
        E  R+  S+ + +    S + +  S+   SG  Q+ +   EE  R         Q       AE  +RD++V+    QV+ DR    LD +IK W AGKE
Subjt:  EEYREEVSIDDAIAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKE

Query:  TNIRLLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVF
         N+R LLSTL Y+LW   GW  +SL +LI  + VKK Y+KA LC+HPDK+QQ+GA + QKY+AEK F +L++AW  + S+++F
Subjt:  TNIRLLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVF

Arabidopsis top hitse value%identityAlignment
AT1G30280.1 Chaperone DnaJ-domain superfamily protein1.1e-6638.89Show/hide
Query:  MDQTWRMRFGI---PRFR-SRRSEDETLHPKPTTFHPDDFSDVFGGPLRTVISRQFSD----SSSFYEEVFRST-EFDS-----RPRKAGRSLPAFRIPV
        MD++WRM+ G+   P F  +R+S D  +         +DF+DVFGGP R+V++R+FS     S  FY+E+F+    F S       +  GR+LPAFRIP 
Subjt:  MDQTWRMRFGI---PRFR-SRRSEDETLHPKPTTFHPDDFSDVFGGPLRTVISRQFSD----SSSFYEEVFRST-EFDS-----RPRKAGRSLPAFRIPV

Query:  KEDRFYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLS---------PLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHID
          + FY  VFG   G       + SS    RSNSSS LS         P     GDD  F S +S  R  NV ++  S++   K+Q     P    +   
Subjt:  KEDRFYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLS---------PLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHID

Query:  NQYVEDEYDDGFRSSDHGFGDPMSSPETVGLEPNSYRSIKIPVDDLELNSPSSA--YSSLC--EDPVYYGDIHCN-------VLPEDDDDDDEDAMSSYV
         Q    E  D +    H  G   +SPET+ L+PNS+R     +DD   +SP+S+   S +C  ED   +     N       V+ ED++D++E+ MSSYV
Subjt:  NQYVEDEYDDGFRSSDHGFGDPMSSPETVGLEPNSYRSIKIPVDDLELNSPSSA--YSSLC--EDPVYYGDIHCN-------VLPEDDDDDDEDAMSSYV

Query:  IEINSINREEYREE----------VSIDDAIAWAKSKYQSP----TSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQ
        IEINS   + YREE            +D+AIAWAK + Q P    T E  +  R  E E   EEE                                   
Subjt:  IEINSINREEYREE----------VSIDDAIAWAKSKYQSP----TSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQ

Query:  VDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATM-LQKYVAEKAFTILQDAWAAY
             E+E  DE+I++W  GKETNIRLLLSTLH++LWS+S W +I LANL  GSQVKKAYQ+ARLCLHPDKLQQRG T  +QK VA + F ILQ+AWA Y
Subjt:  VDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATM-LQKYVAEKAFTILQDAWAAY

Query:  ISQD
        ++ +
Subjt:  ISQD

AT1G75100.1 J-domain protein required for chloroplast accumulation response 11.4e-2640.59Show/hide
Query:  AKSKYQSPTSETDLSGRPQES-----------EQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIR
        A ++ Q P+  T      +ES           E+  E        E  D+  + N    +  Q + K+EE     +   E++ +D KI+ WS+GK  NIR
Subjt:  AKSKYQSPTSETDLSGRPQES-----------EQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIR

Query:  LLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAW
         LLSTL YILWS SGW  + L ++I G+ V+K+YQ+A L LHPDKLQQ+GA+  QKY+AEK F +LQ+AW
Subjt:  LLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAW

AT4G12770.1 Chaperone DnaJ-domain superfamily protein4.5e-2840.44Show/hide
Query:  EEYREEVSIDDAIAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKE
        E  R+  S  + +    S + +P S+   SG  Q+ +   EE  R         Q       AE  +RD++V+    Q + DR    LD +I+ W AGKE
Subjt:  EEYREEVSIDDAIAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKE

Query:  TNIRLLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVF
         N+R LLSTL Y+LW   GW  +SL +LI G+ VKK Y+KA LC+HPDK+QQ+GA + QKY+AEK F +L++AW  + S+++F
Subjt:  TNIRLLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVF

AT4G12770.2 Chaperone DnaJ-domain superfamily protein4.5e-2844.16Show/hide
Query:  SGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSAISLANLI
        SG  Q+ +   EE  R         Q       AE  +RD++V+    Q + DR    LD +I+ W AGKE N+R LLSTL Y+LW   GW  +SL +LI
Subjt:  SGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSAISLANLI

Query:  GGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVF
         G+ VKK Y+KA LC+HPDK+QQ+GA + QKY+AEK F +L++AW  + S+++F
Subjt:  GGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVF

AT4G12780.1 Chaperone DnaJ-domain superfamily protein1.0e-2740.44Show/hide
Query:  EEYREEVSIDDAIAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKE
        E  R+  S+ + +    S + +  S+   SG  Q+ +   EE  R         Q       AE  +RD++V+    QV+ DR    LD +IK W AGKE
Subjt:  EEYREEVSIDDAIAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKE

Query:  TNIRLLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVF
         N+R LLSTL Y+LW   GW  +SL +LI  + VKK Y+KA LC+HPDK+QQ+GA + QKY+AEK F +L++AW  + S+++F
Subjt:  TNIRLLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CGTAAAAGAAAAAATAAAATAAGCTTTTTGGTGTATAAATATGTGAAGGCAGAGCTTCCGTTTGTTCCCACGGTATACACCACCACCTTCACTTCATCTACTGAATCTCA
TATTTCTTCATTCTTTCAGTTTCTGCATCAGATTCTTGCCCGAATAATGGACCAAACCTGGCGAATGCGCTTTGGGATTCCGCGGTTCCGTTCCCGGAGATCGGAAGACG
AAACCTTGCACCCCAAACCCACCACTTTCCACCCAGACGACTTCTCCGACGTCTTCGGCGGTCCGCTGCGGACAGTCATCTCCAGGCAATTTTCCGACTCCAGTTCCTTC
TACGAAGAAGTATTCCGATCCACGGAGTTCGATTCTCGGCCGCGGAAGGCCGGGCGGAGCTTGCCGGCGTTCAGAATCCCGGTTAAGGAGGATAGGTTTTACCGCGATGT
ATTTGGATCCGACGACGGCCGGCGGTCGAGAGATAGGTCGGAGCCGAGCTCCAAGGAATTCACCAGATCGAACTCATCCTCCGACCTCAGCCCCCTCCGGCCGCTCGTCG
GAGATGACGTGGCATTCCCTTCATCTTCTTCAAATCACAGGCAAAGCAATGTCTCAGCTCAATGGAATTCATACAGAACCATGTTCAAGGAACAAGAAGTGCCTCAATTT
CCACCCGATCTCTCTGCCCATATAGATAACCAGTATGTGGAAGATGAATACGATGATGGTTTCAGAAGCTCGGACCATGGATTCGGCGACCCAATGTCGTCACCAGAAAC
CGTTGGTCTGGAACCAAATTCATACAGAAGCATCAAAATCCCTGTGGATGATTTAGAACTCAACTCCCCGTCATCTGCTTATTCTTCACTCTGTGAAGATCCGGTTTATT
ATGGTGATATTCATTGTAATGTCTTACCAGAAGATGACGATGACGATGACGAAGATGCTATGAGCTCTTATGTCATTGAGATAAATTCCATCAATAGAGAAGAATATAGA
GAAGAAGTTTCTATCGATGATGCAATTGCTTGGGCTAAATCGAAGTATCAAAGTCCCACGTCCGAGACAGATTTGAGCGGTAGACCACAAGAAAGTGAGCAATCTGGTGA
AGAAGAAGGAAGACCTGTTGCGTTTGAATTTGCAGATCAGCAGTTGAATGGAAATTTGCAAACAGCAGAGACACAACAGAGAGATGTAAAAGTTGAAGAAGGAACGCCAC
AGGTGGACATCGATAGAGAACTGGAAGGACTGGATGAAAAAATCAAGTTATGGTCAGCCGGCAAGGAGACCAATATCCGTTTGCTACTTTCCACGCTGCATTATATCTTG
TGGTCGAGTAGTGGGTGGTCTGCAATATCCTTGGCCAACCTGATAGGAGGCTCACAAGTGAAAAAGGCTTACCAAAAAGCAAGATTATGCCTCCACCCAGACAAGCTGCA
GCAAAGAGGAGCAACAATGCTGCAAAAATATGTTGCTGAGAAGGCTTTTACCATTCTTCAGGACGCATGGGCTGCATATATATCTCAAGATGTCTTCCTTAACTAG
mRNA sequenceShow/hide mRNA sequence
CCGTAAAAGAAAAAATAAAATAAGCTTTTTGGTGTATAAATATGTGAAGGCAGAGCTTCCGTTTGTTCCCACGGTATACACCACCACCTTCACTTCATCTACTGAATCTC
ATATTTCTTCATTCTTTCAGTTTCTGCATCAGATTCTTGCCCGAATAATGGACCAAACCTGGCGAATGCGCTTTGGGATTCCGCGGTTCCGTTCCCGGAGATCGGAAGAC
GAAACCTTGCACCCCAAACCCACCACTTTCCACCCAGACGACTTCTCCGACGTCTTCGGCGGTCCGCTGCGGACAGTCATCTCCAGGCAATTTTCCGACTCCAGTTCCTT
CTACGAAGAAGTATTCCGATCCACGGAGTTCGATTCTCGGCCGCGGAAGGCCGGGCGGAGCTTGCCGGCGTTCAGAATCCCGGTTAAGGAGGATAGGTTTTACCGCGATG
TATTTGGATCCGACGACGGCCGGCGGTCGAGAGATAGGTCGGAGCCGAGCTCCAAGGAATTCACCAGATCGAACTCATCCTCCGACCTCAGCCCCCTCCGGCCGCTCGTC
GGAGATGACGTGGCATTCCCTTCATCTTCTTCAAATCACAGGCAAAGCAATGTCTCAGCTCAATGGAATTCATACAGAACCATGTTCAAGGAACAAGAAGTGCCTCAATT
TCCACCCGATCTCTCTGCCCATATAGATAACCAGTATGTGGAAGATGAATACGATGATGGTTTCAGAAGCTCGGACCATGGATTCGGCGACCCAATGTCGTCACCAGAAA
CCGTTGGTCTGGAACCAAATTCATACAGAAGCATCAAAATCCCTGTGGATGATTTAGAACTCAACTCCCCGTCATCTGCTTATTCTTCACTCTGTGAAGATCCGGTTTAT
TATGGTGATATTCATTGTAATGTCTTACCAGAAGATGACGATGACGATGACGAAGATGCTATGAGCTCTTATGTCATTGAGATAAATTCCATCAATAGAGAAGAATATAG
AGAAGAAGTTTCTATCGATGATGCAATTGCTTGGGCTAAATCGAAGTATCAAAGTCCCACGTCCGAGACAGATTTGAGCGGTAGACCACAAGAAAGTGAGCAATCTGGTG
AAGAAGAAGGAAGACCTGTTGCGTTTGAATTTGCAGATCAGCAGTTGAATGGAAATTTGCAAACAGCAGAGACACAACAGAGAGATGTAAAAGTTGAAGAAGGAACGCCA
CAGGTGGACATCGATAGAGAACTGGAAGGACTGGATGAAAAAATCAAGTTATGGTCAGCCGGCAAGGAGACCAATATCCGTTTGCTACTTTCCACGCTGCATTATATCTT
GTGGTCGAGTAGTGGGTGGTCTGCAATATCCTTGGCCAACCTGATAGGAGGCTCACAAGTGAAAAAGGCTTACCAAAAAGCAAGATTATGCCTCCACCCAGACAAGCTGC
AGCAAAGAGGAGCAACAATGCTGCAAAAATATGTTGCTGAGAAGGCTTTTACCATTCTTCAGGACGCATGGGCTGCATATATATCTCAAGATGTCTTCCTTAACTAGAGC
CATTTTGACTGTGAAGCTCAAGAAAGTAGCAAGTCTGGAACATTCAGAAGCTGATACCCATTTTTGCATCAAGGATTTGAGAGTGAGCCTGATTACACATATGAAGAAAG
TTGTGTATAAAACAACAATCAGGTTGACTACGGCCTGTATGTAAGATATATTTACTTATGTAGTGTATGTGCAGATTTATATTTCCTTGTGCAGCTACGTTCTAAAGATG
ATTGCAGAACTCTTGCGCTTTGCATATGTACAAACACATTTGAATAGATTTATATACATATTAATACAAGTTCAGAGTCTTCAGATGCATGTTCTTTCAATGTAATGGAA
ATGTACTTTCAAGTTCTTTGGCAAAATATCTAGTTTGATAAGGAT
Protein sequenceShow/hide protein sequence
RKRKNKISFLVYKYVKAELPFVPTVYTTTFTSSTESHISSFFQFLHQILARIMDQTWRMRFGIPRFRSRRSEDETLHPKPTTFHPDDFSDVFGGPLRTVISRQFSDSSSF
YEEVFRSTEFDSRPRKAGRSLPAFRIPVKEDRFYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQF
PPDLSAHIDNQYVEDEYDDGFRSSDHGFGDPMSSPETVGLEPNSYRSIKIPVDDLELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYR
EEVSIDDAIAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYIL
WSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN