; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg035771 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg035771
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionAuxilin-related protein 2 isoform X2
Genome locationscaffold5:39493975..39497275
RNA-Seq ExpressionSpg035771
SyntenySpg035771
Gene Ontology termsGO:0072318 - clathrin coat disassembly (biological process)
GO:0072583 - clathrin-dependent endocytosis (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0031982 - vesicle (cellular component)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0030276 - clathrin binding (molecular function)
InterPro domainsIPR001623 - DnaJ domain
IPR036869 - Chaperone J-domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7012546.1 Auxilin-related protein 1 [Cucurbita argyrosperma subsp. argyrosperma]9.0e-22786.47Show/hide
Query:  MEQTWRVRLGIPRFRSRRSEEEALAKATCSRAGTFDADDFSDVFGGPPRTLLFRQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR
        MEQ WRVRLGIPRFRSRR EEE  AK TCS AGTFD ++FSDVFGGPPRT+LFRQFSD FEGKDSTSFYEE+FRSTE VS+PQK GRSLPAFRIPVKEDR
Subjt:  MEQTWRVRLGIPRFRSRRSEEEALAKATCSRAGTFDADDFSDVFGGPPRTLLFRQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR

Query:  FYRDIFGSNDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVVGDDVAFPSSSSHHRPSDVPTQWNSYRTMFKEQEMPQFPPDLSAHIDNRYVEDEYDDSYR
        FYRD+FGS+D RRSRDRSEP SKEFTRSNSSSDLSPLRP++G+DVAFPSSS +HRPS+VP QWNSY +MFKEQEMPQFPPDLSAHIDN YVED+YDD  R
Subjt:  FYRDIFGSNDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVVGDDVAFPSSSSHHRPSDVPTQWNSYRTMFKEQEMPQFPPDLSAHIDNRYVEDEYDDSYR

Query:  SSDHGFGEPMSSPETVILAPNSFRSIKICVDDLELNSPSSADSSLCEDPVYYGGIYGNVLPE---DDDDDEDAMSSYVIEITSINREEYREEVSIDEAIA
        SS HGF E +SS ETVI+APNSFRSIKICV+DLELNSPSSA SSLC+DPVYYGGI+GNVLPE   DDDDDEDAMSSYVIEI SINREEYREEVSIDEAIA
Subjt:  SSDHGFGEPMSSPETVILAPNSFRSIKICVDDLELNSPSSADSSLCEDPVYYGGIYGNVLPE---DDDDDEDAMSSYVIEITSINREEYREEVSIDEAIA

Query:  WAKSKYQSPASETDLS-RQPESEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHY
        WAKS  QSPASETDLS RQ ESEQSGEEEGRPV FEF DQQL+ +G LQTEETQ+RDVKV+EGKP VDINRELEGLDEKIKLWS GKETNIRLLLSTLHY
Subjt:  WAKSKYQSPASETDLS-RQPESEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHY

Query:  ILWSSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVFLN
        ILWSSSGWSPISLTNLIG SQVKKAYQKARLCLHPDKLQQRGATMLQKYVA+KAF ILQDAW+VYISQDVFLN
Subjt:  ILWSSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVFLN

XP_022954668.1 uncharacterized protein LOC111456858 isoform X1 [Cucurbita moschata]4.0e-22786.89Show/hide
Query:  MEQTWRVRLGIPRFRSRRSEEEALAKATCSRAGTFDADDFSDVFGGPPRTLLFRQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR
        MEQ WRVRLGIPRFRSRR EEE  AK TCS AGTFD ++FSDVFGGPPRT+LFRQFSD FEGKDSTSFYEE+FRSTE VS+PQK GRSLPAFRIPVKEDR
Subjt:  MEQTWRVRLGIPRFRSRRSEEEALAKATCSRAGTFDADDFSDVFGGPPRTLLFRQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR

Query:  FYRDIFGSNDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVVGDDVAFPSSSSHHRPSDVPTQWNSYRTMFKEQEMPQFPPDLSAHIDNRYVEDEYDDSYR
        FYRD+FGS+D R SRDRSEP SKEFTRSNSSSDLSPLRP++ +DVAFPSSS +HRPS+VP QWNSY +MFKEQEMPQFPPDLSAHIDN YVEDEYDD  R
Subjt:  FYRDIFGSNDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVVGDDVAFPSSSSHHRPSDVPTQWNSYRTMFKEQEMPQFPPDLSAHIDNRYVEDEYDDSYR

Query:  SSDHGFGEPMSSPETVILAPNSFRSIKICVDDLELNSPSSADSSLCEDPVYYGGIYGNVLPE---DDDDDEDAMSSYVIEITSINREEYREEVSIDEAIA
        SS HGF E MSS ETVI+APNSFRSIKICV+DLELNSPSSADSSLCEDPVYYGGI+GNVLPE   DDDDDEDAMSSYVIEI SINREEYREEVSIDEAIA
Subjt:  SSDHGFGEPMSSPETVILAPNSFRSIKICVDDLELNSPSSADSSLCEDPVYYGGIYGNVLPE---DDDDDEDAMSSYVIEITSINREEYREEVSIDEAIA

Query:  WAKSKYQSPASETDLS-RQPESEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHY
        WAKS  QSPASETDLS RQ ESEQSGEEEGRPV FEF DQQL+ +G LQTEETQ+RDVKV+EGKP VDINRELEGLDEKIKLWS GKETNIRLLLSTLHY
Subjt:  WAKSKYQSPASETDLS-RQPESEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHY

Query:  ILWSSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVFLN
        ILWSSSGWSPISLTNLIG SQVKKAYQKARLCLHPDKLQQRGATMLQKYVA+KAF ILQDAW+VYISQDVFLN
Subjt:  ILWSSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVFLN

XP_022994795.1 uncharacterized protein LOC111490417 [Cucurbita maxima]2.6e-22686.23Show/hide
Query:  MEQTWRVRLGIPRFRSRRSEEEALAKATCSRAGTFDADDFSDVFGGPPRTLLFRQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR
        MEQ WRVRLGIPRFRSRR E+E   K TCS AGTFD ++FSDVFGGPPRT++FRQFSD FEGKDSTSFYEE+FRSTE VS+PQK GRSLPAFRIPVKEDR
Subjt:  MEQTWRVRLGIPRFRSRRSEEEALAKATCSRAGTFDADDFSDVFGGPPRTLLFRQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR

Query:  FYRDIFGSNDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVVGDDVAFPSSSSHHRPSDVPTQWNSYRTMFKEQEMPQFPPDLSAHIDNRYVEDEYDDSYR
        FYRD+FG +D RRSRDRSEP SKEFTRSNSSSDLSPL+P++G+DVAFPSSS +HRPS+VP QWNSY TMFKEQEMPQFPPDLSAHIDN YVEDEYDD  R
Subjt:  FYRDIFGSNDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVVGDDVAFPSSSSHHRPSDVPTQWNSYRTMFKEQEMPQFPPDLSAHIDNRYVEDEYDDSYR

Query:  SSDHGFGEPMSSPETVILAPNSFRSIKICVDDLELNSPSSADSSLCEDPVYYGGIYGNVLPEDDDDD--EDAMSSYVIEITSINREEYREEVSIDEAIAW
        SS HGF E +SS ETVI+APNSFRSIKICV+DLELNSPSSADSSLCEDPVYYGGI+GNVLPEDDDDD  EDAMSSYVIEI SINREEYREEVSIDEAIAW
Subjt:  SSDHGFGEPMSSPETVILAPNSFRSIKICVDDLELNSPSSADSSLCEDPVYYGGIYGNVLPEDDDDD--EDAMSSYVIEITSINREEYREEVSIDEAIAW

Query:  AKSKYQSPASETDLS-RQPESEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHYI
        AKS YQSPASETDLS RQ ESEQSGEEEGRPV FEF DQQL+ +G LQTEETQ+RDVKV+EGKP VDINRELE LDEKIKLWS GKETNIRLLLSTLHYI
Subjt:  AKSKYQSPASETDLS-RQPESEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHYI

Query:  LWSSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVFLN
        LWSSSGW PISLTNLIG SQVKKAYQKARLCLHPDKLQQRGATMLQKYVA+KAF ILQDAW+VYISQDVFLN
Subjt:  LWSSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVFLN

XP_023541267.1 uncharacterized protein LOC111801489 [Cucurbita pepo subsp. pepo]5.3e-22786.65Show/hide
Query:  MEQTWRVRLGIPRFRSRRSEEEALAKATCSRAGTFDADDFSDVFGGPPRTLLFRQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR
        MEQ WRVRLGIPRFRSRR EEE  AK TCS AGTFD ++FSDVFGGPPRT+LFRQFSD FE KDSTSFYEE+FRSTE VS+PQK GRSLPAFRIPVKEDR
Subjt:  MEQTWRVRLGIPRFRSRRSEEEALAKATCSRAGTFDADDFSDVFGGPPRTLLFRQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR

Query:  FYRDIFGSNDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVVGDDVAFPSSSSHHRPSDVPTQWNSYRTMFKEQEMPQFPPDLSAHIDNRYVEDEYDDSYR
        FYRD+FGS+D RRSRDRSEP SKEFTRSNSSSDLSPLRP++G+DVAFPSSS +HRPS+VP QWNSY TMFKEQEMPQFPPDLSA IDN YVEDEYDD  R
Subjt:  FYRDIFGSNDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVVGDDVAFPSSSSHHRPSDVPTQWNSYRTMFKEQEMPQFPPDLSAHIDNRYVEDEYDDSYR

Query:  SSDHGFGEPMSSPETVILAPNSFRSIKICVDDLELNSPSSADSSLCEDPVYYGGIYGNVLPE--DDDDDEDAMSSYVIEITSINREEYREEVSIDEAIAW
        SS HGF E +SS ETVI+APNSFRSIKICV+DLELNSPSSADSSLCEDPVYYGGI+GNVLPE  DDDDDED MSSYVIEI SINREEYREEVSIDEAIAW
Subjt:  SSDHGFGEPMSSPETVILAPNSFRSIKICVDDLELNSPSSADSSLCEDPVYYGGIYGNVLPE--DDDDDEDAMSSYVIEITSINREEYREEVSIDEAIAW

Query:  AKSKYQSPASETDLS-RQPESEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHYI
        AKS YQSPASETDLS RQ ESEQSGEEEGRPV FEF DQQL+ +G LQTEETQ+RDVKV+EGKP VDINRE+EGLDEKIKLWS GKETNIRLLLSTLHYI
Subjt:  AKSKYQSPASETDLS-RQPESEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHYI

Query:  LWSSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVFLN
        LWSSSGWSPISLTNLIG SQVKKAYQKARLCLHPDKLQQRGATMLQKYVA+KAF ILQDAW+VYIS+DVFLN
Subjt:  LWSSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVFLN

XP_038895184.1 uncharacterized protein LOC120083484 [Benincasa hispida]1.1e-23790.64Show/hide
Query:  MEQTWRVRLGIPRFRSRRSEEEALAKATCSRAGTFDADDFSDVFGGPPRTLLFRQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR
        M+ +WRVR GIPRFRSRRSE+E L K TCSRAGTF ADDFSDVFGGPP+T+LFRQFSDRFEGKDSTSFYEEVFRSTE+VSRPQKGGRSLPAFRIPVKEDR
Subjt:  MEQTWRVRLGIPRFRSRRSEEEALAKATCSRAGTFDADDFSDVFGGPPRTLLFRQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR

Query:  FYRDIFGSNDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVVGDDVAFPSSSSHHRPSDVPTQWNSYRTMFKEQEMPQFPPDLSAHIDNRYVEDEYDDSYR
        FYRD+FGS DGRRSRDRSEPSSKEFTRSNSSSD SPLRPV+ DDVAFPSSSS+HRPS+VPTQWNSYRTMFKEQEMPQF PDL+ HIDNRYVEDEY+DSYR
Subjt:  FYRDIFGSNDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVVGDDVAFPSSSSHHRPSDVPTQWNSYRTMFKEQEMPQFPPDLSAHIDNRYVEDEYDDSYR

Query:  SSDHGFGEPMSSPETVILAPNSFRSIKICVDDLELNSPSSADSSLCEDPVYYGGIYGNVLPEDDDDDEDAMSSYVIEITSINREEYREEVSIDEAIAWAK
        SSDHGFGEPMSSPETV+L PNSFRSI+ICVDDLELNSPSSADSSLCEDPV YGGIY NVLPEDDDDDEDAMSSYVIEITSINREEYREEVSIDEAIAWAK
Subjt:  SSDHGFGEPMSSPETVILAPNSFRSIKICVDDLELNSPSSADSSLCEDPVYYGGIYGNVLPEDDDDDEDAMSSYVIEITSINREEYREEVSIDEAIAWAK

Query:  SKYQSPASETDLS-RQPESEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHYILW
        SKYQS ASETDLS RQ ESEQSGEEEGRPVAFE++DQQ NGNGL QT ETQQRDVKVEE KPQVD +RELEGLDEKIKLWSAGKETNIRLLLSTLHYILW
Subjt:  SKYQSPASETDLS-RQPESEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHYILW

Query:  SSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVFLN
        SSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGAT LQKYVA+KAFTILQ+AW+VYISQD FLN
Subjt:  SSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVFLN

TrEMBL top hitse value%identityAlignment
A0A6J1EN77 uncharacterized protein LOC1114341152.6e-22485.53Show/hide
Query:  MEQTWRVRLGIPRFRSRRSEEEALAKATCSRAGTFDADDFSDVFGGPPRTLLFRQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR
        M+  WRVR GIP+FRSRRSE E +AK TCSRA TFDADDFSDVFGGPPRT+LFRQFSD FEGKDSTSFYEEVF+S E+VS+PQKGGRSLPAFRIP+KEDR
Subjt:  MEQTWRVRLGIPRFRSRRSEEEALAKATCSRAGTFDADDFSDVFGGPPRTLLFRQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR

Query:  FYRDIFGSNDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVVGDDVAFPSSSSHHRPSDVPTQWNSYRTMFKEQEMPQFPPDLSAHIDNRYVEDEYDDSYR
        FYR IFGS DGR+SRDRSEP+SKEFTRSNSSS  SP RP++GDDVAFPSSSS+ RPS+ PT+W+SYRTMFKEQEMPQFPPDL  HIDN YVE+E+ DSYR
Subjt:  FYRDIFGSNDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVVGDDVAFPSSSSHHRPSDVPTQWNSYRTMFKEQEMPQFPPDLSAHIDNRYVEDEYDDSYR

Query:  SSDHGFGEPMSSPETVILAPNSFRSIKICVDDLELNSPSSADSSLCEDPVYYGGIYGNVLPEDDDDDEDAMSSYVIEITSINREEYREEVSIDEAIAWAK
        SS H FG P+SSPET+ L PNSFRSIKICVDDLE NSPSSA SS CEDPV Y GIY NVLPEDD+DDEDAMSSYVIEITSINREEYREEVSIDEAIAWAK
Subjt:  SSDHGFGEPMSSPETVILAPNSFRSIKICVDDLELNSPSSADSSLCEDPVYYGGIYGNVLPEDDDDDEDAMSSYVIEITSINREEYREEVSIDEAIAWAK

Query:  SKYQSPASETDLS-RQPESEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHYILW
        SKYQSP SETDLS RQ ESEQSGEEEGRPV+FE + QQLNGNGL Q  ET QRDVKVEEGKP+VDI++ELEGLDEKIKLWSAGKETNIRLLLSTLHYILW
Subjt:  SKYQSPASETDLS-RQPESEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHYILW

Query:  SSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVFLN
        SSSGWSPISLTNLIGGSQ+KKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQ+AWAVYISQDVFLN
Subjt:  SSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVFLN

A0A6J1GRR2 uncharacterized protein LOC111456858 isoform X11.9e-22786.89Show/hide
Query:  MEQTWRVRLGIPRFRSRRSEEEALAKATCSRAGTFDADDFSDVFGGPPRTLLFRQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR
        MEQ WRVRLGIPRFRSRR EEE  AK TCS AGTFD ++FSDVFGGPPRT+LFRQFSD FEGKDSTSFYEE+FRSTE VS+PQK GRSLPAFRIPVKEDR
Subjt:  MEQTWRVRLGIPRFRSRRSEEEALAKATCSRAGTFDADDFSDVFGGPPRTLLFRQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR

Query:  FYRDIFGSNDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVVGDDVAFPSSSSHHRPSDVPTQWNSYRTMFKEQEMPQFPPDLSAHIDNRYVEDEYDDSYR
        FYRD+FGS+D R SRDRSEP SKEFTRSNSSSDLSPLRP++ +DVAFPSSS +HRPS+VP QWNSY +MFKEQEMPQFPPDLSAHIDN YVEDEYDD  R
Subjt:  FYRDIFGSNDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVVGDDVAFPSSSSHHRPSDVPTQWNSYRTMFKEQEMPQFPPDLSAHIDNRYVEDEYDDSYR

Query:  SSDHGFGEPMSSPETVILAPNSFRSIKICVDDLELNSPSSADSSLCEDPVYYGGIYGNVLPE---DDDDDEDAMSSYVIEITSINREEYREEVSIDEAIA
        SS HGF E MSS ETVI+APNSFRSIKICV+DLELNSPSSADSSLCEDPVYYGGI+GNVLPE   DDDDDEDAMSSYVIEI SINREEYREEVSIDEAIA
Subjt:  SSDHGFGEPMSSPETVILAPNSFRSIKICVDDLELNSPSSADSSLCEDPVYYGGIYGNVLPE---DDDDDEDAMSSYVIEITSINREEYREEVSIDEAIA

Query:  WAKSKYQSPASETDLS-RQPESEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHY
        WAKS  QSPASETDLS RQ ESEQSGEEEGRPV FEF DQQL+ +G LQTEETQ+RDVKV+EGKP VDINRELEGLDEKIKLWS GKETNIRLLLSTLHY
Subjt:  WAKSKYQSPASETDLS-RQPESEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHY

Query:  ILWSSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVFLN
        ILWSSSGWSPISLTNLIG SQVKKAYQKARLCLHPDKLQQRGATMLQKYVA+KAF ILQDAW+VYISQDVFLN
Subjt:  ILWSSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVFLN

A0A6J1GTM9 uncharacterized protein LOC111456858 isoform X21.4e-22586.68Show/hide
Query:  MEQTWRVRLGIPRFRSRRSEEEALAKATCSRAGTFDADDFSDVFGGPPRTLLFRQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR
        MEQ WRVRLGIPRFRSRR EEE  AK TCS AGTFD ++FSDVFGGPPRT+LFRQFSD FEGKDSTSFYEE+FRSTE VS+PQK GRSLPAFRIPVKEDR
Subjt:  MEQTWRVRLGIPRFRSRRSEEEALAKATCSRAGTFDADDFSDVFGGPPRTLLFRQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR

Query:  FYRDIFGSNDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVVGDDVAFPSSSSHHRPSDVPTQWNSYRTMFKEQEMPQFPPDLSAHIDNRYVEDEYDDSYR
        FYRD+FGS+D R SRDRSEP SKEFTRSNSSSDLSPLRP++ +DVAFPSSS +HRPS+VP QWNSY +MFKEQEMPQFPPDLSAHIDN YVEDEYDD  R
Subjt:  FYRDIFGSNDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVVGDDVAFPSSSSHHRPSDVPTQWNSYRTMFKEQEMPQFPPDLSAHIDNRYVEDEYDDSYR

Query:  SSDHGFGEPMSSPETVILAPNSFRSIKICVDDLELNSPSSADSSLCEDPVYYGGIYGNVLPE---DDDDDEDAMSSYVIEITSINREEYREEVSIDEAIA
        SS HGF E MSS ETVI+APNSFRSIKICV+DLELNSPSSADSSLCEDPVYYGGI+GNVLPE   DDDDDEDAMSSYVIEI SINREEYREEVSIDEAIA
Subjt:  SSDHGFGEPMSSPETVILAPNSFRSIKICVDDLELNSPSSADSSLCEDPVYYGGIYGNVLPE---DDDDDEDAMSSYVIEITSINREEYREEVSIDEAIA

Query:  WAKSKYQSPASETDLS-RQPESEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHY
        WAKS  QSPASETDLS RQ ESEQSG EEGRPV FEF DQQL+ +G LQTEETQ+RDVKV+EGKP VDINRELEGLDEKIKLWS GKETNIRLLLSTLHY
Subjt:  WAKSKYQSPASETDLS-RQPESEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHY

Query:  ILWSSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVFLN
        ILWSSSGWSPISLTNLIG SQVKKAYQKARLCLHPDKLQQRGATMLQKYVA+KAF ILQDAW+VYISQDVFLN
Subjt:  ILWSSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVFLN

A0A6J1K2C2 uncharacterized protein LOC1114904171.3e-22686.23Show/hide
Query:  MEQTWRVRLGIPRFRSRRSEEEALAKATCSRAGTFDADDFSDVFGGPPRTLLFRQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR
        MEQ WRVRLGIPRFRSRR E+E   K TCS AGTFD ++FSDVFGGPPRT++FRQFSD FEGKDSTSFYEE+FRSTE VS+PQK GRSLPAFRIPVKEDR
Subjt:  MEQTWRVRLGIPRFRSRRSEEEALAKATCSRAGTFDADDFSDVFGGPPRTLLFRQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR

Query:  FYRDIFGSNDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVVGDDVAFPSSSSHHRPSDVPTQWNSYRTMFKEQEMPQFPPDLSAHIDNRYVEDEYDDSYR
        FYRD+FG +D RRSRDRSEP SKEFTRSNSSSDLSPL+P++G+DVAFPSSS +HRPS+VP QWNSY TMFKEQEMPQFPPDLSAHIDN YVEDEYDD  R
Subjt:  FYRDIFGSNDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVVGDDVAFPSSSSHHRPSDVPTQWNSYRTMFKEQEMPQFPPDLSAHIDNRYVEDEYDDSYR

Query:  SSDHGFGEPMSSPETVILAPNSFRSIKICVDDLELNSPSSADSSLCEDPVYYGGIYGNVLPEDDDDD--EDAMSSYVIEITSINREEYREEVSIDEAIAW
        SS HGF E +SS ETVI+APNSFRSIKICV+DLELNSPSSADSSLCEDPVYYGGI+GNVLPEDDDDD  EDAMSSYVIEI SINREEYREEVSIDEAIAW
Subjt:  SSDHGFGEPMSSPETVILAPNSFRSIKICVDDLELNSPSSADSSLCEDPVYYGGIYGNVLPEDDDDD--EDAMSSYVIEITSINREEYREEVSIDEAIAW

Query:  AKSKYQSPASETDLS-RQPESEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHYI
        AKS YQSPASETDLS RQ ESEQSGEEEGRPV FEF DQQL+ +G LQTEETQ+RDVKV+EGKP VDINRELE LDEKIKLWS GKETNIRLLLSTLHYI
Subjt:  AKSKYQSPASETDLS-RQPESEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHYI

Query:  LWSSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVFLN
        LWSSSGW PISLTNLIG SQVKKAYQKARLCLHPDKLQQRGATMLQKYVA+KAF ILQDAW+VYISQDVFLN
Subjt:  LWSSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVFLN

A0A6J1KL61 uncharacterized protein LOC1114955613.1e-22585.32Show/hide
Query:  MEQTWRVRLGIPRFRSRRSEEEALAKATCSRAGTFDADDFSDVFGGPPRTLLFRQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR
        M+  WRVR GIP+FRSRRSE E +AK TCSRA TFDADDFSDVFGGPPRT+LFRQFSD FEGKDSTSFYEEVF+S E+VS+PQKGGRSLPAFRIP+KEDR
Subjt:  MEQTWRVRLGIPRFRSRRSEEEALAKATCSRAGTFDADDFSDVFGGPPRTLLFRQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR

Query:  FYRDIFGSNDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVVGDDVAFPSSSSHHRPSDVPTQWNSYRTMFKEQEMPQFPPDLSAHIDNRYVEDEYDDSYR
        FYR IFGS DGR+SRDRSEP+SKEFTRSNSSS  SP RP++GDDVAFPSSSS+ RPS+VPT+W+SYRTMFKEQEMPQFPPD   HIDN YVE+E++DSYR
Subjt:  FYRDIFGSNDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVVGDDVAFPSSSSHHRPSDVPTQWNSYRTMFKEQEMPQFPPDLSAHIDNRYVEDEYDDSYR

Query:  SSDHGFGEPMSSPETVILAPNSFRSIKICVDDLELNSPSSADSSLCEDPVYYGGIYGNVLPEDDDDDEDAMSSYVIEITSINREEYREEVSIDEAIAWAK
        SS H FG PMSSPET+ L PNSFRSIKICVDDLE NSPSSA SS CEDPV YGGIY NVLPEDD+DDEDAMSSYVIEITSINREEYREEVSIDEAIAWAK
Subjt:  SSDHGFGEPMSSPETVILAPNSFRSIKICVDDLELNSPSSADSSLCEDPVYYGGIYGNVLPEDDDDDEDAMSSYVIEITSINREEYREEVSIDEAIAWAK

Query:  SKYQSPASETDLS-RQPESEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHYILW
        SKYQSP SETDLS RQ ESEQSGEEEGRPV+FE + QQLNGNGL Q  ET Q+DVK+EEGKP+VDI++ELEGLDEKIKLWSAGKETNIRLLLSTLHYILW
Subjt:  SKYQSPASETDLS-RQPESEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHYILW

Query:  SSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVFLN
        SSSGWSPISLTNLIGGSQ+KKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQ+AWAVY+SQDVFLN
Subjt:  SSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVFLN

SwissProt top hitse value%identityAlignment
O14976 Cyclin-G-associated kinase1.8e-1240.2Show/hide
Query:  DINRELEGLDEKIKLWSAGKETNIRLLLSTLHYILW-SSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYI
        D+ ++ + L  K+  W  GKE NIR LLSTLH +LW   S W+P+ + +L+   QVKK Y++A L +HPDK     A    +  A+  F  L DAW+ + 
Subjt:  DINRELEGLDEKIKLWSAGKETNIRLLLSTLHYILW-SSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYI

Query:  SQ
        +Q
Subjt:  SQ

Q0WQ57 Auxilin-related protein 22.6e-2740.22Show/hide
Query:  EEYREEVSIDEAIAWAKSKYQSPASETDLSRQPESEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGK
        E  R+  S    +    S + +PAS++        +  GE E R  A     Q+          E  +RD++V+  + Q + +R    LD +I+ W AGK
Subjt:  EEYREEVSIDEAIAWAKSKYQSPASETDLSRQPESEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGK

Query:  ETNIRLLLSTLHYILWSSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVF
        E N+R LLSTL Y+LW   GW P+SLT+LI G+ VKK Y+KA LC+HPDK+QQ+GA + QKY+AEK F +L++AW  + S+++F
Subjt:  ETNIRLLLSTLHYILWSSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVF

Q9C9Q4 J domain-containing protein required for chloroplast accumulation response 13.2e-2539.01Show/hide
Query:  EITSINREEYREEVSIDEAIAWAKSKYQSPASETDLSRQPESEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKI
        +I ++NR++   + +  +        Y++P       RQ  S      E         D+  + N     E+  Q + K+EE     +   E++ +D KI
Subjt:  EITSINREEYREEVSIDEAIAWAKSKYQSPASETDLSRQPESEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKI

Query:  KLWSAGKETNIRLLLSTLHYILWSSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAW
        + WS+GK  NIR LLSTL YILWS SGW P+ L ++I G+ V+K+YQ+A L LHPDKLQQ+GA+  QKY+AEK F +LQ+AW
Subjt:  KLWSAGKETNIRLLLSTLHYILWSSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAW

Q9FWS1 Auxilin-like protein 18.4e-2642.41Show/hide
Query:  SPASETDLSRQPESEQSGEEEGRPV----AFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWS
        S +   + S  P +  +   +G P+    A     Q+ +        E + RD+K +  K Q + NR  E LD  +K WS+GKE N+R L+STL YIL +
Subjt:  SPASETDLSRQPESEQSGEEEGRPV----AFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWS

Query:  SSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAW
         SGW PI LT+L+  + V+KAY+KA L +HPDKLQQRGA+  QKY+ EK F +L++AW
Subjt:  SSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAW

Q9SU08 Auxilin-related protein 11.4e-2846.58Show/hide
Query:  GEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSPISLTNLIGGSQVKKA
        GE E R  A     Q+          E  +RD++V+  + QV+ +R    LD +IK W AGKE N+R LLSTL Y+LW   GW P+SLT+LI  + VKK 
Subjt:  GEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSPISLTNLIGGSQVKKA

Query:  YQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVF
        Y+KA LC+HPDK+QQ+GA + QKY+AEK F +L++AW  + S+++F
Subjt:  YQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVF

Arabidopsis top hitse value%identityAlignment
AT1G30280.1 Chaperone DnaJ-domain superfamily protein2.0e-7037.97Show/hide
Query:  MEQTWRVRLGIPRFRSRRSEEEALAKATCSRAGTFDADDFSDVFGGPPRTLLFRQFSDRFEGKDSTSFYEEVFR------STEVVSRPQKGGRSLPAFRI
        M+++WR+++G+       S +   + A  S     DA+DF+DVFGGPPR++L R+FS  F   D   FY+E+F+      S   ++  +  GR+LPAFRI
Subjt:  MEQTWRVRLGIPRFRSRRSEEEALAKATCSRAGTFDADDFSDVFGGPPRTLLFRQFSDRFEGKDSTSFYEEVFR------STEVVSRPQKGGRSLPAFRI

Query:  PVKEDRFYRDIFGSNDGRRSRDRSEPSSKEFTRSNSSSDLS---------PLRPVVGDDVAFPSSSSHHRPSDVPTQWNSYRTMFKEQEMPQFPPDLSAH
        P   + FY  +FG   G       + SS    RSNSSS LS         P     GDD  F S +S  RP +VP++  S++   K+Q     P    + 
Subjt:  PVKEDRFYRDIFGSNDGRRSRDRSEPSSKEFTRSNSSSDLS---------PLRPVVGDDVAFPSSSSHHRPSDVPTQWNSYRTMFKEQEMPQFPPDLSAH

Query:  IDNRYVEDEYDDSYRSSDHGFGEPMSSPETVILAPNSFRSIKICVDDLELNSPSSA--DSSLC--EDPVYYGGIYGN--------VLPEDDDDDEDAMSS
               ++ D  Y+ +  G G   +SPET+ L PNSFR     +DD   +SP+S+   S +C  ED   +     N        V+ +++D++E+ MSS
Subjt:  IDNRYVEDEYDDSYRSSDHGFGEPMSSPETVILAPNSFRSIKICVDDLELNSPSSA--DSSLC--EDPVYYGGIYGN--------VLPEDDDDDEDAMSS

Query:  YVIEITSINREEYREE----------VSIDEAIAWAKSKYQSPASETDLSRQPESEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQV
        YVIEI S   + YREE            +DEAIAWAK +          S++PE++Q+ E+                            D +  E +P+ 
Subjt:  YVIEITSINREEYREE----------VSIDEAIAWAKSKYQSPASETDLSRQPESEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQV

Query:  DINRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATM-LQKYVAEKAFTILQDAWAVYI
        +   E+E  DE+I++W  GKETNIRLLLSTLH++LWS+S W  I L NL  GSQVKKAYQ+ARLCLHPDKLQQRG T  +QK VA + F ILQ+AWAVY+
Subjt:  DINRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATM-LQKYVAEKAFTILQDAWAVYI

Query:  SQD
        + +
Subjt:  SQD

AT1G75310.1 auxin-like 1 protein3.1e-2847.52Show/hide
Query:  SEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSPISLTNLIGGSQ
        S Q+GE   R  A     Q+ +        E + RD+K +  K Q + NR  E LD  +K WS+GKE N+R L+STL YIL + SGW PI LT+L+  + 
Subjt:  SEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSPISLTNLIGGSQ

Query:  VKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAW
        V+KAY+KA L +HPDKLQQRGA+  QKY+ EK F +L++AW
Subjt:  VKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAW

AT4G12770.1 Chaperone DnaJ-domain superfamily protein1.8e-2840.22Show/hide
Query:  EEYREEVSIDEAIAWAKSKYQSPASETDLSRQPESEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGK
        E  R+  S    +    S + +PAS++        +  GE E R  A     Q+          E  +RD++V+  + Q + +R    LD +I+ W AGK
Subjt:  EEYREEVSIDEAIAWAKSKYQSPASETDLSRQPESEQSGEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGK

Query:  ETNIRLLLSTLHYILWSSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVF
        E N+R LLSTL Y+LW   GW P+SLT+LI G+ VKK Y+KA LC+HPDK+QQ+GA + QKY+AEK F +L++AW  + S+++F
Subjt:  ETNIRLLLSTLHYILWSSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVF

AT4G12770.2 Chaperone DnaJ-domain superfamily protein1.3e-2945.89Show/hide
Query:  GEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSPISLTNLIGGSQVKKA
        GE E R  A     Q+          E  +RD++V+  + Q + +R    LD +I+ W AGKE N+R LLSTL Y+LW   GW P+SLT+LI G+ VKK 
Subjt:  GEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSPISLTNLIGGSQVKKA

Query:  YQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVF
        Y+KA LC+HPDK+QQ+GA + QKY+AEK F +L++AW  + S+++F
Subjt:  YQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVF

AT4G12780.1 Chaperone DnaJ-domain superfamily protein9.8e-3046.58Show/hide
Query:  GEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSPISLTNLIGGSQVKKA
        GE E R  A     Q+          E  +RD++V+  + QV+ +R    LD +IK W AGKE N+R LLSTL Y+LW   GW P+SLT+LI  + VKK 
Subjt:  GEEEGRPVAFEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSPISLTNLIGGSQVKKA

Query:  YQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVF
        Y+KA LC+HPDK+QQ+GA + QKY+AEK F +L++AW  + S+++F
Subjt:  YQKARLCLHPDKLQQRGATMLQKYVAEKAFTILQDAWAVYISQDVF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCAGACCTGGCGAGTTCGTCTAGGGATTCCACGGTTCCGTTCCCGGAGATCAGAAGAGGAAGCGCTGGCCAAAGCCACGTGTTCCCGCGCCGGAACTTTCGACGC
CGATGACTTCTCCGACGTCTTCGGCGGTCCGCCGCGGACCCTCCTCTTCAGGCAATTTTCCGACAGGTTTGAGGGTAAAGACTCCACCTCCTTCTACGAAGAAGTATTCC
GATCCACTGAGGTAGTTTCTCGGCCGCAGAAGGGCGGCCGGAGCTTGCCGGCCTTCAGAATCCCTGTTAAGGAGGATAGGTTTTACCGCGATATTTTTGGATCGAACGAC
GGCCGGCGGTCGAGAGATAGGTCGGAGCCGAGCTCCAAGGAATTCACCAGATCGAACTCCTCTTCCGACCTCAGCCCTCTCCGGCCGGTCGTCGGAGATGACGTGGCATT
CCCTTCGTCGTCTTCACATCACAGGCCAAGCGATGTCCCAACACAATGGAACTCATACAGAACCATGTTCAAGGAACAAGAAATGCCACAGTTTCCACCTGATCTCTCTG
CTCATATCGATAACCGGTATGTGGAAGATGAATATGATGATAGTTATAGAAGCTCAGATCATGGATTTGGAGAACCCATGTCATCCCCGGAAACCGTTATTCTGGCACCA
AATTCATTCAGAAGCATCAAAATCTGTGTGGATGATCTAGAACTAAACTCCCCATCATCTGCTGATTCTTCACTCTGTGAAGATCCGGTTTATTATGGTGGTATTTATGG
TAATGTTTTACCAGAAGATGACGACGATGACGAAGATGCTATGAGCTCTTATGTCATTGAGATAACTTCCATCAATAGAGAAGAATATAGAGAAGAAGTTTCTATCGATG
AAGCAATTGCTTGGGCTAAATCAAAGTATCAAAGTCCTGCATCTGAGACAGATTTGAGTAGACAACCAGAGAGTGAGCAATCTGGGGAAGAAGAAGGAAGACCTGTTGCG
TTTGAATTTACAGATCAGCAGTTGAATGGAAATGGATTGTTGCAGACAGAAGAGACACAACAGAGAGATGTTAAAGTTGAAGAAGGAAAGCCGCAGGTGGACATCAATAG
AGAATTGGAAGGATTAGATGAAAAAATAAAGTTGTGGTCAGCTGGCAAGGAGACCAACATCCGCTTGCTACTTTCTACGCTTCATTATATACTGTGGTCAAGTAGTGGGT
GGTCTCCGATATCCTTGACAAACCTGATTGGAGGCTCACAAGTGAAGAAGGCTTATCAAAAAGCTAGATTATGTCTCCACCCAGACAAGCTGCAGCAAAGAGGAGCAACG
ATGCTGCAAAAATATGTTGCCGAGAAGGCTTTTACCATCCTTCAGGACGCGTGGGCTGTATATATATCCCAAGATGTCTTCCTTAACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGCAGACCTGGCGAGTTCGTCTAGGGATTCCACGGTTCCGTTCCCGGAGATCAGAAGAGGAAGCGCTGGCCAAAGCCACGTGTTCCCGCGCCGGAACTTTCGACGC
CGATGACTTCTCCGACGTCTTCGGCGGTCCGCCGCGGACCCTCCTCTTCAGGCAATTTTCCGACAGGTTTGAGGGTAAAGACTCCACCTCCTTCTACGAAGAAGTATTCC
GATCCACTGAGGTAGTTTCTCGGCCGCAGAAGGGCGGCCGGAGCTTGCCGGCCTTCAGAATCCCTGTTAAGGAGGATAGGTTTTACCGCGATATTTTTGGATCGAACGAC
GGCCGGCGGTCGAGAGATAGGTCGGAGCCGAGCTCCAAGGAATTCACCAGATCGAACTCCTCTTCCGACCTCAGCCCTCTCCGGCCGGTCGTCGGAGATGACGTGGCATT
CCCTTCGTCGTCTTCACATCACAGGCCAAGCGATGTCCCAACACAATGGAACTCATACAGAACCATGTTCAAGGAACAAGAAATGCCACAGTTTCCACCTGATCTCTCTG
CTCATATCGATAACCGGTATGTGGAAGATGAATATGATGATAGTTATAGAAGCTCAGATCATGGATTTGGAGAACCCATGTCATCCCCGGAAACCGTTATTCTGGCACCA
AATTCATTCAGAAGCATCAAAATCTGTGTGGATGATCTAGAACTAAACTCCCCATCATCTGCTGATTCTTCACTCTGTGAAGATCCGGTTTATTATGGTGGTATTTATGG
TAATGTTTTACCAGAAGATGACGACGATGACGAAGATGCTATGAGCTCTTATGTCATTGAGATAACTTCCATCAATAGAGAAGAATATAGAGAAGAAGTTTCTATCGATG
AAGCAATTGCTTGGGCTAAATCAAAGTATCAAAGTCCTGCATCTGAGACAGATTTGAGTAGACAACCAGAGAGTGAGCAATCTGGGGAAGAAGAAGGAAGACCTGTTGCG
TTTGAATTTACAGATCAGCAGTTGAATGGAAATGGATTGTTGCAGACAGAAGAGACACAACAGAGAGATGTTAAAGTTGAAGAAGGAAAGCCGCAGGTGGACATCAATAG
AGAATTGGAAGGATTAGATGAAAAAATAAAGTTGTGGTCAGCTGGCAAGGAGACCAACATCCGCTTGCTACTTTCTACGCTTCATTATATACTGTGGTCAAGTAGTGGGT
GGTCTCCGATATCCTTGACAAACCTGATTGGAGGCTCACAAGTGAAGAAGGCTTATCAAAAAGCTAGATTATGTCTCCACCCAGACAAGCTGCAGCAAAGAGGAGCAACG
ATGCTGCAAAAATATGTTGCCGAGAAGGCTTTTACCATCCTTCAGGACGCGTGGGCTGTATATATATCCCAAGATGTCTTCCTTAACTAG
Protein sequenceShow/hide protein sequence
MEQTWRVRLGIPRFRSRRSEEEALAKATCSRAGTFDADDFSDVFGGPPRTLLFRQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDRFYRDIFGSND
GRRSRDRSEPSSKEFTRSNSSSDLSPLRPVVGDDVAFPSSSSHHRPSDVPTQWNSYRTMFKEQEMPQFPPDLSAHIDNRYVEDEYDDSYRSSDHGFGEPMSSPETVILAP
NSFRSIKICVDDLELNSPSSADSSLCEDPVYYGGIYGNVLPEDDDDDEDAMSSYVIEITSINREEYREEVSIDEAIAWAKSKYQSPASETDLSRQPESEQSGEEEGRPVA
FEFTDQQLNGNGLLQTEETQQRDVKVEEGKPQVDINRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSPISLTNLIGGSQVKKAYQKARLCLHPDKLQQRGAT
MLQKYVAEKAFTILQDAWAVYISQDVFLN