; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS013451 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS013451
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionAuxilin-related protein 2 isoform X2
Genome locationscaffold402:1651084..1653799
RNA-Seq ExpressionMS013451
SyntenyMS013451
Gene Ontology termsGO:0016192 - vesicle-mediated transport (biological process)
GO:0005622 - intracellular (cellular component)
GO:0043227 - membrane-bounded organelle (cellular component)
InterPro domainsIPR001623 - DnaJ domain
IPR036869 - Chaperone J-domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7012546.1 Auxilin-related protein 1 [Cucurbita argyrosperma subsp. argyrosperma]5.7e-20280.13Show/hide
Query:  MDQTWRMRFGIPRFRSRRSEDET----LHPKPTTFHPDDFSDVFGGPPRTVISRQFS------DSSSFYEEVFRSTEFVSRPRKAGRSLPAFRIPVKEDR
        M+Q WR+R GIPRFRSRR E+ET          TF  ++FSDVFGGPPRT++ RQFS      DS+SFYEE+FRSTEFVS+P+K GRSLPAFRIPVKEDR
Subjt:  MDQTWRMRFGIPRFRSRRSEDET----LHPKPTTFHPDDFSDVFGGPPRTVISRQFS------DSSSFYEEVFRSTEFVSRPRKAGRSLPAFRIPVKEDR

Query:  FYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQHVEDEYDDGFR
        FYRDVFGSDD RRSRDRSEP SKEFTRSNSSSDLSPLRP++G+DVAFPSSS NHR SNV  QWNSY +MFKEQE+PQFPPDLSAHIDN +VED+YDD  R
Subjt:  FYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQHVEDEYDDGFR

Query:  SSDHGFGDPMSSPETVGLEPNSYRSIKIPVDDLELNSPSSAYSSLCEDPVYYGDIHCNVLPE--DDDDDDEDAMSSYVIEINSINREEYREEVSIDDAIA
        SS HGF + +SS ETV + PNS+RSIKI V+DLELNSPSSA SSLC+DPVYYG IH NVLPE  DDDDDDEDAMSSYVIEINSINREEYREEVSID+AIA
Subjt:  SSDHGFGDPMSSPETVGLEPNSYRSIKIPVDDLELNSPSSAYSSLCEDPVYYGDIHCNVLPE--DDDDDDEDAMSSYVIEINSINREEYREEVSIDDAIA

Query:  WAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQL--NGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHY
        WAKS  QSP SETDLS R QESEQSGEEEGRPV FEFADQQL  +G+LQT ETQ+RDVKV+EG P VDI+RELEGLDEKIKLWS GKETNIRLLLSTLHY
Subjt:  WAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQL--NGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHY

Query:  ILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN
        ILWSSSGWS ISL NLIG SQVKKAYQKARLCLHPDKLQQRGATMLQKYVA+KAF ILQDAW+ YISQDVFLN
Subjt:  ILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN

XP_008444679.1 PREDICTED: uncharacterized protein LOC103487941 [Cucumis melo]9.4e-20580.42Show/hide
Query:  MDQTWRMRFGIPRFRSRRSEDETLHPKPT------TFHPDDFSDVFGGPPRTVISRQFSD-------SSSFYEEVFRSTEFVSRPRKAGRSLPAFRIPVK
        MD TWR+RFGIPRFRSRRSE ++L PKPT      TF  DDFSDVFGGPP+T++ RQFS+       ++SFY+EVFRS++ VSRP+KAGRSLPAFRIPVK
Subjt:  MDQTWRMRFGIPRFRSRRSEDETLHPKPT------TFHPDDFSDVFGGPPRTVISRQFSD-------SSSFYEEVFRSTEFVSRPRKAGRSLPAFRIPVK

Query:  EDRFYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQHVEDEYDD
        EDRFYRDVFGS+DGRRSRDRSEPSSKEFTRSNSSSD +PL P++GDDVAFPSSSSNHR SNV  QWNSYRTMFKEQE+PQF PDLS H DN++VEDEYDD
Subjt:  EDRFYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQHVEDEYDD

Query:  GFRSSDHGFGDPMSSPETVGLEPNSYRSIKIPVDD-LELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDA
         ++SSDHGFG+P+SSPETV LEPNS+RSIKI VDD LE+NSPSS  SSLCEDPVYYG  +CNVLPE DD DDED MSSYVIEI SINREEYREEVSID+A
Subjt:  GFRSSDHGFGDPMSSPETVGLEPNSYRSIKIPVDD-LELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDA

Query:  IAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGN--LQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTL
        IAWAKSKYQS  SETDLS R QESEQSGEEEGRPVAFE +DQQ NGN   QTAETQQR+VKVEE  PQ++ DRELEGLDEKIKLWSAGKETNIRLLLSTL
Subjt:  IAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGN--LQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTL

Query:  HYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN
        HYILWSSSGWS ISL NLIGG+QVKKAYQKARLCLHPDKLQQRGATMLQKYVA+KAFTILQ+AW+ YISQD F+N
Subjt:  HYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN

XP_022139887.1 uncharacterized protein LOC111010691 [Momordica charantia]1.3e-25799.35Show/hide
Query:  MDQTWRMRFGIPRFRSRRSEDETLHPKPTTFHPDDFSDVFGGPPRTVISRQFSDSSSFYEEVFRSTEFVSRPRKAGRSLPAFRIPVKEDRFYRDVFGSDD
        MDQTWRMRFGIPRFRSRRSEDETLHPKPTTFHPDDFSDVFGGP RTVISRQFSDSSSFYEEVFRSTEF SRPRKAGRSLPAFRIPVKEDRFYRDVFGSDD
Subjt:  MDQTWRMRFGIPRFRSRRSEDETLHPKPTTFHPDDFSDVFGGPPRTVISRQFSDSSSFYEEVFRSTEFVSRPRKAGRSLPAFRIPVKEDRFYRDVFGSDD

Query:  GRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQHVEDEYDDGFRSSDHGFGDPM
        GRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQ+VEDEYDDGFRSSDHGFGDPM
Subjt:  GRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQHVEDEYDDGFRSSDHGFGDPM

Query:  SSPETVGLEPNSYRSIKIPVDDLELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDAIAWAKSKYQSPTSE
        SSPETVGLEPNSYRSIKIPVDDLELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDAIAWAKSKYQSPTSE
Subjt:  SSPETVGLEPNSYRSIKIPVDDLELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDAIAWAKSKYQSPTSE

Query:  TDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSAISLA
        TDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSAISLA
Subjt:  TDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSAISLA

Query:  NLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN
        NLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN
Subjt:  NLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN

XP_023541267.1 uncharacterized protein LOC111801489 [Cucurbita pepo subsp. pepo]4.4e-20280.3Show/hide
Query:  MDQTWRMRFGIPRFRSRRSEDET----LHPKPTTFHPDDFSDVFGGPPRTVISRQFS------DSSSFYEEVFRSTEFVSRPRKAGRSLPAFRIPVKEDR
        M+Q WR+R GIPRFRSRR E+ET          TF  ++FSDVFGGPPRT++ RQFS      DS+SFYEE+FRSTEFVS+P+K GRSLPAFRIPVKEDR
Subjt:  MDQTWRMRFGIPRFRSRRSEDET----LHPKPTTFHPDDFSDVFGGPPRTVISRQFS------DSSSFYEEVFRSTEFVSRPRKAGRSLPAFRIPVKEDR

Query:  FYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQHVEDEYDDGFR
        FYRDVFGSDD RRSRDRSEP SKEFTRSNSSSDLSPLRP++G+DVAFPSSS NHR SNV  QWNSY TMFKEQE+PQFPPDLSA IDN +VEDEYDD  R
Subjt:  FYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQHVEDEYDDGFR

Query:  SSDHGFGDPMSSPETVGLEPNSYRSIKIPVDDLELNSPSSAYSSLCEDPVYYGDIHCNVLPE-DDDDDDEDAMSSYVIEINSINREEYREEVSIDDAIAW
        SS HGF + +SS ETV + PNS+RSIKI V+DLELNSPSSA SSLCEDPVYYG IH NVLPE DDDDDDED MSSYVIEINSINREEYREEVSID+AIAW
Subjt:  SSDHGFGDPMSSPETVGLEPNSYRSIKIPVDDLELNSPSSAYSSLCEDPVYYGDIHCNVLPE-DDDDDDEDAMSSYVIEINSINREEYREEVSIDDAIAW

Query:  AKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQL--NGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYI
        AKS YQSP SETDLS R QESEQSGEEEGRPV FEFADQQL  +G+LQT ETQ+RDVKV+EG P VDI+RE+EGLDEKIKLWS GKETNIRLLLSTLHYI
Subjt:  AKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQL--NGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYI

Query:  LWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN
        LWSSSGWS ISL NLIG SQVKKAYQKARLCLHPDKLQQRGATMLQKYVA+KAF ILQDAW+ YIS+DVFLN
Subjt:  LWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN

XP_038895184.1 uncharacterized protein LOC120083484 [Benincasa hispida]2.1e-21284.11Show/hide
Query:  MDQTWRMRFGIPRFRSRRSEDETLHPKPT-----TFHPDDFSDVFGGPPRTVISRQFS------DSSSFYEEVFRSTEFVSRPRKAGRSLPAFRIPVKED
        MD +WR+RFGIPRFRSRRSEDETL  KPT     TFH DDFSDVFGGPP+T++ RQFS      DS+SFYEEVFRSTE VSRP+K GRSLPAFRIPVKED
Subjt:  MDQTWRMRFGIPRFRSRRSEDETLHPKPT-----TFHPDDFSDVFGGPPRTVISRQFS------DSSSFYEEVFRSTEFVSRPRKAGRSLPAFRIPVKED

Query:  RFYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQHVEDEYDDGF
        RFYRDVFGS+DGRRSRDRSEPSSKEFTRSNSSSD SPLRP++ DDVAFPSSSSNHR SNV  QWNSYRTMFKEQE+PQF PDL+ HIDN++VEDEY+D +
Subjt:  RFYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQHVEDEYDDGF

Query:  RSSDHGFGDPMSSPETVGLEPNSYRSIKIPVDDLELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDAIAW
        RSSDHGFG+PMSSPETV LEPNS+RSI+I VDDLELNSPSSA SSLCEDPV YG I+CNVLPE DDDDDEDAMSSYVIEI SINREEYREEVSID+AIAW
Subjt:  RSSDHGFGDPMSSPETVGLEPNSYRSIKIPVDDLELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDAIAW

Query:  AKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGN--LQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYI
        AKSKYQS  SETDLS R QESEQSGEEEGRPVAFE++DQQ NGN   QTAETQQRDVKVEE  PQVD DRELEGLDEKIKLWSAGKETNIRLLLSTLHYI
Subjt:  AKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGN--LQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYI

Query:  LWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN
        LWSSSGWS ISL NLIGGSQVKKAYQKARLCLHPDKLQQRGAT LQKYVA+KAFTILQ+AW+ YISQD FLN
Subjt:  LWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN

TrEMBL top hitse value%identityAlignment
A0A0A0M107 Uncharacterized protein8.6e-20480.85Show/hide
Query:  MDQTWRMRFGIPRFRSRRSEDETLHPKPTT-FHPDDFSDVFGGPPRTVISRQFSD-------SSSFYEEVFRSTEFVSRPRKAGRSLPAFRIPVKEDRFY
        MD TWR+RFGIPRFRSRRSE +TL PKPT+ F  DDFSDVFGGPP+T++ RQFS+       ++SFYEEVFRS+E VSRP+K GRSLPAFRIPVKEDRFY
Subjt:  MDQTWRMRFGIPRFRSRRSEDETLHPKPTT-FHPDDFSDVFGGPPRTVISRQFSD-------SSSFYEEVFRSTEFVSRPRKAGRSLPAFRIPVKEDRFY

Query:  RDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQHVEDEYDDGFRSS
        RDVFGS+DGRRSRDRSEPSSKEFTRSNSSSD + LRP++GDDVAFPSSSSNHR +NV  QWNSY TMFKEQE+PQF P LS H+DN++VEDEYDD ++SS
Subjt:  RDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQHVEDEYDDGFRSS

Query:  DHGFGDPMSSPETVGLEPNSYRSIKIPVDD-LELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDAIAWAK
        DHGFG P+SSPETV LEPNS+RSIKI VDD LE+NSPSS  SSLCEDPVYY   +CNVLPE DDDDDEDAMSSYVIEI SINREEYREEVSID+AIAWAK
Subjt:  DHGFGDPMSSPETVGLEPNSYRSIKIPVDD-LELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDAIAWAK

Query:  SKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGN--LQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILW
        SKYQS  SETDLS R QESEQSGEEEGRPVAFE +DQQ NGN   QTAETQQR+VKVEE  PQ++IDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILW
Subjt:  SKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGN--LQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILW

Query:  SSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN
        SSSGWS ISL NLIGG+QVKKAYQKARLCLHPDKLQQRGAT LQK+VA+KAFTILQ+AW+ YISQD F+N
Subjt:  SSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN

A0A1S3BAV2 uncharacterized protein LOC1034879414.6e-20580.42Show/hide
Query:  MDQTWRMRFGIPRFRSRRSEDETLHPKPT------TFHPDDFSDVFGGPPRTVISRQFSD-------SSSFYEEVFRSTEFVSRPRKAGRSLPAFRIPVK
        MD TWR+RFGIPRFRSRRSE ++L PKPT      TF  DDFSDVFGGPP+T++ RQFS+       ++SFY+EVFRS++ VSRP+KAGRSLPAFRIPVK
Subjt:  MDQTWRMRFGIPRFRSRRSEDETLHPKPT------TFHPDDFSDVFGGPPRTVISRQFSD-------SSSFYEEVFRSTEFVSRPRKAGRSLPAFRIPVK

Query:  EDRFYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQHVEDEYDD
        EDRFYRDVFGS+DGRRSRDRSEPSSKEFTRSNSSSD +PL P++GDDVAFPSSSSNHR SNV  QWNSYRTMFKEQE+PQF PDLS H DN++VEDEYDD
Subjt:  EDRFYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQHVEDEYDD

Query:  GFRSSDHGFGDPMSSPETVGLEPNSYRSIKIPVDD-LELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDA
         ++SSDHGFG+P+SSPETV LEPNS+RSIKI VDD LE+NSPSS  SSLCEDPVYYG  +CNVLPE DD DDED MSSYVIEI SINREEYREEVSID+A
Subjt:  GFRSSDHGFGDPMSSPETVGLEPNSYRSIKIPVDD-LELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDA

Query:  IAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGN--LQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTL
        IAWAKSKYQS  SETDLS R QESEQSGEEEGRPVAFE +DQQ NGN   QTAETQQR+VKVEE  PQ++ DRELEGLDEKIKLWSAGKETNIRLLLSTL
Subjt:  IAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGN--LQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTL

Query:  HYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN
        HYILWSSSGWS ISL NLIGG+QVKKAYQKARLCLHPDKLQQRGATMLQKYVA+KAFTILQ+AW+ YISQD F+N
Subjt:  HYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN

A0A5A7UJY3 Auxilin-related protein 2 isoform X24.6e-20580.42Show/hide
Query:  MDQTWRMRFGIPRFRSRRSEDETLHPKPT------TFHPDDFSDVFGGPPRTVISRQFSD-------SSSFYEEVFRSTEFVSRPRKAGRSLPAFRIPVK
        MD TWR+RFGIPRFRSRRSE ++L PKPT      TF  DDFSDVFGGPP+T++ RQFS+       ++SFY+EVFRS++ VSRP+KAGRSLPAFRIPVK
Subjt:  MDQTWRMRFGIPRFRSRRSEDETLHPKPT------TFHPDDFSDVFGGPPRTVISRQFSD-------SSSFYEEVFRSTEFVSRPRKAGRSLPAFRIPVK

Query:  EDRFYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQHVEDEYDD
        EDRFYRDVFGS+DGRRSRDRSEPSSKEFTRSNSSSD +PL P++GDDVAFPSSSSNHR SNV  QWNSYRTMFKEQE+PQF PDLS H DN++VEDEYDD
Subjt:  EDRFYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQHVEDEYDD

Query:  GFRSSDHGFGDPMSSPETVGLEPNSYRSIKIPVDD-LELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDA
         ++SSDHGFG+P+SSPETV LEPNS+RSIKI VDD LE+NSPSS  SSLCEDPVYYG  +CNVLPE DD DDED MSSYVIEI SINREEYREEVSID+A
Subjt:  GFRSSDHGFGDPMSSPETVGLEPNSYRSIKIPVDD-LELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDA

Query:  IAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGN--LQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTL
        IAWAKSKYQS  SETDLS R QESEQSGEEEGRPVAFE +DQQ NGN   QTAETQQR+VKVEE  PQ++ DRELEGLDEKIKLWSAGKETNIRLLLSTL
Subjt:  IAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGN--LQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTL

Query:  HYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN
        HYILWSSSGWS ISL NLIGG+QVKKAYQKARLCLHPDKLQQRGATMLQKYVA+KAFTILQ+AW+ YISQD F+N
Subjt:  HYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN

A0A6J1CF64 uncharacterized protein LOC1110106916.1e-25899.35Show/hide
Query:  MDQTWRMRFGIPRFRSRRSEDETLHPKPTTFHPDDFSDVFGGPPRTVISRQFSDSSSFYEEVFRSTEFVSRPRKAGRSLPAFRIPVKEDRFYRDVFGSDD
        MDQTWRMRFGIPRFRSRRSEDETLHPKPTTFHPDDFSDVFGGP RTVISRQFSDSSSFYEEVFRSTEF SRPRKAGRSLPAFRIPVKEDRFYRDVFGSDD
Subjt:  MDQTWRMRFGIPRFRSRRSEDETLHPKPTTFHPDDFSDVFGGPPRTVISRQFSDSSSFYEEVFRSTEFVSRPRKAGRSLPAFRIPVKEDRFYRDVFGSDD

Query:  GRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQHVEDEYDDGFRSSDHGFGDPM
        GRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQ+VEDEYDDGFRSSDHGFGDPM
Subjt:  GRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQHVEDEYDDGFRSSDHGFGDPM

Query:  SSPETVGLEPNSYRSIKIPVDDLELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDAIAWAKSKYQSPTSE
        SSPETVGLEPNSYRSIKIPVDDLELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDAIAWAKSKYQSPTSE
Subjt:  SSPETVGLEPNSYRSIKIPVDDLELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDAIAWAKSKYQSPTSE

Query:  TDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSAISLA
        TDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSAISLA
Subjt:  TDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSAISLA

Query:  NLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN
        NLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN
Subjt:  NLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN

A0A6J1KL61 uncharacterized protein LOC1114955613.6e-20279.24Show/hide
Query:  MDQTWRMRFGIPRFRSRRSEDETLHPKPT-----TFHPDDFSDVFGGPPRTVISRQFS------DSSSFYEEVFRSTEFVSRPRKAGRSLPAFRIPVKED
        MD  WR+RFGIP+FRSRRSE ET+  KPT     TF  DDFSDVFGGPPRT++ RQFS      DS+SFYEEVF+S E VS+P+K GRSLPAFRIP+KED
Subjt:  MDQTWRMRFGIPRFRSRRSEDETLHPKPT-----TFHPDDFSDVFGGPPRTVISRQFS------DSSSFYEEVFRSTEFVSRPRKAGRSLPAFRIPVKED

Query:  RFYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQHVEDEYDDGF
        RFYR +FGS+DGR+SRDRSEP+SKEFTRSNSSS  SP RP++GDDVAFPSSSSN R SNV  +W+SYRTMFKEQE+PQFPPD   HIDN +VE+E++D +
Subjt:  RFYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQHVEDEYDDGF

Query:  RSSDHGFGDPMSSPETVGLEPNSYRSIKIPVDDLELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDAIAW
        RSS H FG PMSSPET+ LEPNS+RSIKI VDDLE NSPSSA SS CEDPV YG I+CNVLPE DD+DDEDAMSSYVIEI SINREEYREEVSID+AIAW
Subjt:  RSSDHGFGDPMSSPETVGLEPNSYRSIKIPVDDLELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDAIAW

Query:  AKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGN--LQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYI
        AKSKYQSP SETDLSGR QESEQSGEEEGRPV+FE + QQLNGN   Q AET Q+DVK+EEG P+VDID+ELEGLDEKIKLWSAGKETNIRLLLSTLHYI
Subjt:  AKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGN--LQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYI

Query:  LWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN
        LWSSSGWS ISL NLIGGSQ+KKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQ+AWA Y+SQDVFLN
Subjt:  LWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVFLN

SwissProt top hitse value%identityAlignment
O75061 Putative tyrosine-protein phosphatase auxilin2.0e-1132.45Show/hide
Query:  GRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWS-SSGWSAISLANLI
        G+ +  + S   EG+  A +F D  L+G    A   ++  +      + ++ +E++    KI  W  GKE NIR LLST+H +LW+  + W  + +A+L+
Subjt:  GRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWS-SSGWSAISLANLI

Query:  GGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQ
           QVKK Y+KA L +HPDK   +      K +    F  L DAW+ + +Q
Subjt:  GGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQ

Q0WQ57 Auxilin-related protein 25.7e-2740.44Show/hide
Query:  EEYREEVSIDDAIAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKE
        E  R+  S  + +    S + +P S+   SG  Q+ +   EE  R         Q       AE  +RD++V+    Q + DR    LD +I+ W AGKE
Subjt:  EEYREEVSIDDAIAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKE

Query:  TNIRLLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVF
         N+R LLSTL Y+LW   GW  +SL +LI G+ VKK Y+KA LC+HPDK+QQ+GA + QKY+AEK F +L++AW  + S+++F
Subjt:  TNIRLLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVF

Q9C9Q4 J domain-containing protein required for chloroplast accumulation response 11.8e-2540.59Show/hide
Query:  AKSKYQSPTSETDLSGRPQES-----------EQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIR
        A ++ Q P+  T      +ES           E+  E        E  D+  + N    +  Q + K+EE     +   E++ +D KI+ WS+GK  NIR
Subjt:  AKSKYQSPTSETDLSGRPQES-----------EQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIR

Query:  LLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAW
         LLSTL YILWS SGW  + L ++I G+ V+K+YQ+A L LHPDKLQQ+GA+  QKY+AEK F +LQ+AW
Subjt:  LLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAW

Q9FWS1 Auxilin-like protein 12.0e-2450.89Show/hide
Query:  AETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYV
        AE + RD+K ++   Q + +R  E LD  +K WS+GKE N+R L+STL YIL + SGW  I L +L+  + V+KAY+KA L +HPDKLQQRGA+  QKY+
Subjt:  AETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYV

Query:  AEKAFTILQDAW
         EK F +L++AW
Subjt:  AEKAFTILQDAW

Q9SU08 Auxilin-related protein 11.3e-2640.44Show/hide
Query:  EEYREEVSIDDAIAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKE
        E  R+  S+ + +    S + +  S+   SG  Q+ +   EE  R         Q       AE  +RD++V+    QV+ DR    LD +IK W AGKE
Subjt:  EEYREEVSIDDAIAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKE

Query:  TNIRLLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVF
         N+R LLSTL Y+LW   GW  +SL +LI  + VKK Y+KA LC+HPDK+QQ+GA + QKY+AEK F +L++AW  + S+++F
Subjt:  TNIRLLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVF

Arabidopsis top hitse value%identityAlignment
AT1G30280.1 Chaperone DnaJ-domain superfamily protein1.0e-6838.89Show/hide
Query:  MDQTWRMRFGI---PRFR-SRRSEDETLHPKPTTFHPDDFSDVFGGPPRTVISRQFSD----SSSFYEEVFR------STEFVSRPRKAGRSLPAFRIPV
        MD++WRM+ G+   P F  +R+S D  +         +DF+DVFGGPPR+V++R+FS     S  FY+E+F+      S   ++  +  GR+LPAFRIP 
Subjt:  MDQTWRMRFGI---PRFR-SRRSEDETLHPKPTTFHPDDFSDVFGGPPRTVISRQFSD----SSSFYEEVFR------STEFVSRPRKAGRSLPAFRIPV

Query:  KEDRFYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLS---------PLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHID
          + FY  VFG   G       + SS    RSNSSS LS         P     GDD  F S +S  R  NV ++  S++   K+Q     P    +   
Subjt:  KEDRFYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLS---------PLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHID

Query:  NQHVEDEYDDGFRSSDHGFGDPMSSPETVGLEPNSYRSIKIPVDDLELNSPSSA--YSSLC--EDPVYYGDIHCN-------VLPEDDDDDDEDAMSSYV
         Q+   E  D +    H  G   +SPET+ L+PNS+R     +DD   +SP+S+   S +C  ED   +     N       V+ ED++D++E+ MSSYV
Subjt:  NQHVEDEYDDGFRSSDHGFGDPMSSPETVGLEPNSYRSIKIPVDDLELNSPSSA--YSSLC--EDPVYYGDIHCN-------VLPEDDDDDDEDAMSSYV

Query:  IEINSINREEYREE----------VSIDDAIAWAKSKYQSP----TSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQ
        IEINS   + YREE            +D+AIAWAK + Q P    T E  +  R  E E   EEE                                   
Subjt:  IEINSINREEYREE----------VSIDDAIAWAKSKYQSP----TSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQ

Query:  VDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATM-LQKYVAEKAFTILQDAWAAY
             E+E  DE+I++W  GKETNIRLLLSTLH++LWS+S W +I LANL  GSQVKKAYQ+ARLCLHPDKLQQRG T  +QK VA + F ILQ+AWA Y
Subjt:  VDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATM-LQKYVAEKAFTILQDAWAAY

Query:  ISQD
        ++ +
Subjt:  ISQD

AT1G75100.1 J-domain protein required for chloroplast accumulation response 11.3e-2640.59Show/hide
Query:  AKSKYQSPTSETDLSGRPQES-----------EQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIR
        A ++ Q P+  T      +ES           E+  E        E  D+  + N    +  Q + K+EE     +   E++ +D KI+ WS+GK  NIR
Subjt:  AKSKYQSPTSETDLSGRPQES-----------EQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIR

Query:  LLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAW
         LLSTL YILWS SGW  + L ++I G+ V+K+YQ+A L LHPDKLQQ+GA+  QKY+AEK F +LQ+AW
Subjt:  LLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAW

AT4G12770.1 Chaperone DnaJ-domain superfamily protein4.0e-2840.44Show/hide
Query:  EEYREEVSIDDAIAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKE
        E  R+  S  + +    S + +P S+   SG  Q+ +   EE  R         Q       AE  +RD++V+    Q + DR    LD +I+ W AGKE
Subjt:  EEYREEVSIDDAIAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKE

Query:  TNIRLLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVF
         N+R LLSTL Y+LW   GW  +SL +LI G+ VKK Y+KA LC+HPDK+QQ+GA + QKY+AEK F +L++AW  + S+++F
Subjt:  TNIRLLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVF

AT4G12770.2 Chaperone DnaJ-domain superfamily protein4.0e-2844.16Show/hide
Query:  SGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSAISLANLI
        SG  Q+ +   EE  R         Q       AE  +RD++V+    Q + DR    LD +I+ W AGKE N+R LLSTL Y+LW   GW  +SL +LI
Subjt:  SGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSAISLANLI

Query:  GGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVF
         G+ VKK Y+KA LC+HPDK+QQ+GA + QKY+AEK F +L++AW  + S+++F
Subjt:  GGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVF

AT4G12780.1 Chaperone DnaJ-domain superfamily protein9.0e-2840.44Show/hide
Query:  EEYREEVSIDDAIAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKE
        E  R+  S+ + +    S + +  S+   SG  Q+ +   EE  R         Q       AE  +RD++V+    QV+ DR    LD +IK W AGKE
Subjt:  EEYREEVSIDDAIAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQLNGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKE

Query:  TNIRLLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVF
         N+R LLSTL Y+LW   GW  +SL +LI  + VKK Y+KA LC+HPDK+QQ+GA + QKY+AEK F +L++AW  + S+++F
Subjt:  TNIRLLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAAYISQDVF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACCAAACCTGGCGAATGCGCTTTGGGATTCCGCGGTTCCGTTCCCGGAGATCGGAAGACGAAACCTTGCACCCCAAACCCACCACTTTCCACCCCGACGACTTCTC
CGACGTCTTCGGCGGTCCGCCGCGGACAGTCATCTCCAGGCAATTTTCCGACTCCAGTTCCTTCTACGAAGAAGTATTCCGATCCACGGAGTTCGTTTCTCGGCCGCGGA
AGGCCGGGCGGAGCTTGCCGGCGTTCAGAATCCCGGTTAAGGAGGATAGATTTTACCGCGATGTATTTGGATCCGACGACGGCCGGCGGTCGAGAGATAGGTCGGAGCCG
AGCTCCAAGGAATTCACCAGATCGAACTCATCCTCCGACCTCAGCCCCCTCCGGCCGCTCGTCGGAGATGACGTGGCATTCCCTTCATCTTCTTCAAATCACAGGCAAAG
CAATGTCTCAGCTCAATGGAATTCATACAGAACCATGTTCAAGGAACAAGAAGTGCCTCAATTTCCACCCGATCTCTCTGCCCATATAGATAACCAGCATGTGGAAGATG
AATACGATGATGGTTTCAGAAGCTCGGACCATGGATTCGGAGACCCAATGTCGTCACCAGAAACCGTTGGTCTGGAACCAAATTCATACAGAAGCATCAAAATCCCTGTG
GATGATTTAGAACTCAACTCCCCGTCATCTGCTTATTCTTCACTCTGTGAAGATCCGGTTTATTATGGTGATATTCATTGTAATGTCTTACCAGAAGATGACGATGACGA
TGACGAAGATGCTATGAGCTCTTATGTCATTGAGATAAATTCCATCAATAGAGAAGAATATAGAGAAGAAGTTTCTATCGATGATGCAATTGCTTGGGCTAAATCGAAGT
ATCAAAGTCCCACGTCCGAGACAGATTTGAGCGGTAGACCACAAGAAAGTGAGCAATCTGGTGAAGAAGAAGGAAGACCTGTTGCGTTTGAATTTGCAGATCAGCAGTTG
AATGGAAATTTGCAAACAGCAGAGACACAACAGAGAGATGTAAAAGTTGAAGAAGGAACGCCACAGGTGGACATCGATAGAGAACTGGAAGGACTGGATGAAAAAATAAA
GTTATGGTCAGCCGGCAAGGAGACCAATATCCGTTTGCTACTTTCCACGCTGCATTATATCTTGTGGTCGAGTAGTGGGTGGTCTGCAATATCCTTGGCCAACCTGATAG
GAGGCTCACAAGTGAAAAAGGCTTACCAAAAAGCAAGATTATGCCTCCACCCAGACAAGCTGCAGCAAAGAGGAGCAACAATGCTGCAAAAATATGTTGCTGAGAAGGCT
TTTACCATTCTTCAGGACGCATGGGCTGCATATATATCTCAAGATGTCTTCCTTAAC
mRNA sequenceShow/hide mRNA sequence
ATGGACCAAACCTGGCGAATGCGCTTTGGGATTCCGCGGTTCCGTTCCCGGAGATCGGAAGACGAAACCTTGCACCCCAAACCCACCACTTTCCACCCCGACGACTTCTC
CGACGTCTTCGGCGGTCCGCCGCGGACAGTCATCTCCAGGCAATTTTCCGACTCCAGTTCCTTCTACGAAGAAGTATTCCGATCCACGGAGTTCGTTTCTCGGCCGCGGA
AGGCCGGGCGGAGCTTGCCGGCGTTCAGAATCCCGGTTAAGGAGGATAGATTTTACCGCGATGTATTTGGATCCGACGACGGCCGGCGGTCGAGAGATAGGTCGGAGCCG
AGCTCCAAGGAATTCACCAGATCGAACTCATCCTCCGACCTCAGCCCCCTCCGGCCGCTCGTCGGAGATGACGTGGCATTCCCTTCATCTTCTTCAAATCACAGGCAAAG
CAATGTCTCAGCTCAATGGAATTCATACAGAACCATGTTCAAGGAACAAGAAGTGCCTCAATTTCCACCCGATCTCTCTGCCCATATAGATAACCAGCATGTGGAAGATG
AATACGATGATGGTTTCAGAAGCTCGGACCATGGATTCGGAGACCCAATGTCGTCACCAGAAACCGTTGGTCTGGAACCAAATTCATACAGAAGCATCAAAATCCCTGTG
GATGATTTAGAACTCAACTCCCCGTCATCTGCTTATTCTTCACTCTGTGAAGATCCGGTTTATTATGGTGATATTCATTGTAATGTCTTACCAGAAGATGACGATGACGA
TGACGAAGATGCTATGAGCTCTTATGTCATTGAGATAAATTCCATCAATAGAGAAGAATATAGAGAAGAAGTTTCTATCGATGATGCAATTGCTTGGGCTAAATCGAAGT
ATCAAAGTCCCACGTCCGAGACAGATTTGAGCGGTAGACCACAAGAAAGTGAGCAATCTGGTGAAGAAGAAGGAAGACCTGTTGCGTTTGAATTTGCAGATCAGCAGTTG
AATGGAAATTTGCAAACAGCAGAGACACAACAGAGAGATGTAAAAGTTGAAGAAGGAACGCCACAGGTGGACATCGATAGAGAACTGGAAGGACTGGATGAAAAAATAAA
GTTATGGTCAGCCGGCAAGGAGACCAATATCCGTTTGCTACTTTCCACGCTGCATTATATCTTGTGGTCGAGTAGTGGGTGGTCTGCAATATCCTTGGCCAACCTGATAG
GAGGCTCACAAGTGAAAAAGGCTTACCAAAAAGCAAGATTATGCCTCCACCCAGACAAGCTGCAGCAAAGAGGAGCAACAATGCTGCAAAAATATGTTGCTGAGAAGGCT
TTTACCATTCTTCAGGACGCATGGGCTGCATATATATCTCAAGATGTCTTCCTTAAC
Protein sequenceShow/hide protein sequence
MDQTWRMRFGIPRFRSRRSEDETLHPKPTTFHPDDFSDVFGGPPRTVISRQFSDSSSFYEEVFRSTEFVSRPRKAGRSLPAFRIPVKEDRFYRDVFGSDDGRRSRDRSEP
SSKEFTRSNSSSDLSPLRPLVGDDVAFPSSSSNHRQSNVSAQWNSYRTMFKEQEVPQFPPDLSAHIDNQHVEDEYDDGFRSSDHGFGDPMSSPETVGLEPNSYRSIKIPV
DDLELNSPSSAYSSLCEDPVYYGDIHCNVLPEDDDDDDEDAMSSYVIEINSINREEYREEVSIDDAIAWAKSKYQSPTSETDLSGRPQESEQSGEEEGRPVAFEFADQQL
NGNLQTAETQQRDVKVEEGTPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSAISLANLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKA
FTILQDAWAAYISQDVFLN