; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014326 (gene) of Snake gourd v1 genome

Gene IDTan0014326
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionAuxilin-related protein 2 isoform X2
Genome locationLG05:78101094..78104290
RNA-Seq ExpressionTan0014326
SyntenyTan0014326
Gene Ontology termsGO:0016192 - vesicle-mediated transport (biological process)
GO:0005622 - intracellular (cellular component)
GO:0043227 - membrane-bounded organelle (cellular component)
InterPro domainsIPR001623 - DnaJ domain
IPR036869 - Chaperone J-domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573379.1 Auxilin-related protein 1, partial [Cucurbita argyrosperma subsp. sororia]1.5e-22686.89Show/hide
Query:  MEQTWRVRLGIPRFRSRRSEEETLPKATCSRAGTFDVDDFSDVFGGPPRTILFKQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR
        MEQ WRVRLGIPRFRSRR EEET  K TCS AGTFDV++FSDVFGGPPRTILF+QFSD FEGKDSTSFYEE+FRSTE VS+PQK GRSLPAFRIPVKEDR
Subjt:  MEQTWRVRLGIPRFRSRRSEEETLPKATCSRAGTFDVDDFSDVFGGPPRTILFKQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR

Query:  FYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVIGDDVAFPSSSSNHRPSNVPTQWNSYKTMFKEQEMPQFPPDLSAHIDNRYVEDE------
        FYRDVFGSDD  RSRDRSEP SKEFTRSNSSSDLSPLRP+IG+DVAFPSSS NHRPSNVP QWNSY +MFKEQEMPQFPPDLSAHIDN YVEDE      
Subjt:  FYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVIGDDVAFPSSSSNHRPSNVPTQWNSYKTMFKEQEMPQFPPDLSAHIDNRYVEDE------

Query:  --YHGFGEHVSSPETVILAPNSFRSIKICVDDLELNSPSSADSALCKDPVYYGGIYGSVLPE---DDDDDEDAMSSYVIEITSISREEYREEVSIDEAIA
          YHGF E+VSS ETVI+APNSFRSIKICV+DLELNSPSSA S+LC DPVYYGGI+G+VLPE   DDDDDEDAMSSYVIEI SI+REEYREEVSIDEAIA
Subjt:  --YHGFGEHVSSPETVILAPNSFRSIKICVDDLELNSPSSADSALCKDPVYYGGIYGSVLPE---DDDDDEDAMSSYVIEITSISREEYREEVSIDEAIA

Query:  WAKSKYQSPASETDLSVRQQESEQSGEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHY
        WAKS  QSPASETDLSVRQQESEQSGEEEGRPV FEFADQQL+ DG LQTEETQ+RDVKV+EGKP VDI+RELEGLDEKIKLWS GKETNIRLLLSTLHY
Subjt:  WAKSKYQSPASETDLSVRQQESEQSGEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHY

Query:  ILWSSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVFLN
        ILWSSSGWSP+SLTNLIG SQVKKAYQKARLCLHPDKLQQRGATMLQKYVA+KAFAILQDAW+VYISQDVFLN
Subjt:  ILWSSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVFLN

KAG7012546.1 Auxilin-related protein 1 [Cucurbita argyrosperma subsp. argyrosperma]4.0e-22786.89Show/hide
Query:  MEQTWRVRLGIPRFRSRRSEEETLPKATCSRAGTFDVDDFSDVFGGPPRTILFKQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR
        MEQ WRVRLGIPRFRSRR EEET  K TCS AGTFDV++FSDVFGGPPRTILF+QFSD FEGKDSTSFYEE+FRSTE VS+PQK GRSLPAFRIPVKEDR
Subjt:  MEQTWRVRLGIPRFRSRRSEEETLPKATCSRAGTFDVDDFSDVFGGPPRTILFKQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR

Query:  FYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVIGDDVAFPSSSSNHRPSNVPTQWNSYKTMFKEQEMPQFPPDLSAHIDNRYVEDE------
        FYRDVFGSDD RRSRDRSEP SKEFTRSNSSSDLSPLRP+IG+DVAFPSSS NHRPSNVP QWNSY +MFKEQEMPQFPPDLSAHIDN YVED+      
Subjt:  FYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVIGDDVAFPSSSSNHRPSNVPTQWNSYKTMFKEQEMPQFPPDLSAHIDNRYVEDE------

Query:  --YHGFGEHVSSPETVILAPNSFRSIKICVDDLELNSPSSADSALCKDPVYYGGIYGSVLPE---DDDDDEDAMSSYVIEITSISREEYREEVSIDEAIA
          YHGF E+VSS ETVI+APNSFRSIKICV+DLELNSPSSA S+LC DPVYYGGI+G+VLPE   DDDDDEDAMSSYVIEI SI+REEYREEVSIDEAIA
Subjt:  --YHGFGEHVSSPETVILAPNSFRSIKICVDDLELNSPSSADSALCKDPVYYGGIYGSVLPE---DDDDDEDAMSSYVIEITSISREEYREEVSIDEAIA

Query:  WAKSKYQSPASETDLSVRQQESEQSGEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHY
        WAKS  QSPASETDLSVRQQESEQSGEEEGRPV FEFADQQL+ DG LQTEETQ+RDVKV+EGKP VDI+RELEGLDEKIKLWS GKETNIRLLLSTLHY
Subjt:  WAKSKYQSPASETDLSVRQQESEQSGEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHY

Query:  ILWSSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVFLN
        ILWSSSGWSP+SLTNLIG SQVKKAYQKARLCLHPDKLQQRGATMLQKYVA+KAFAILQDAW+VYISQDVFLN
Subjt:  ILWSSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVFLN

XP_022994795.1 uncharacterized protein LOC111490417 [Cucurbita maxima]6.8e-22786.65Show/hide
Query:  MEQTWRVRLGIPRFRSRRSEEETLPKATCSRAGTFDVDDFSDVFGGPPRTILFKQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR
        MEQ WRVRLGIPRFRSRR E+ET  K TCS AGTFDV++FSDVFGGPPRTI+F+QFSD FEGKDSTSFYEE+FRSTE VS+PQK GRSLPAFRIPVKEDR
Subjt:  MEQTWRVRLGIPRFRSRRSEEETLPKATCSRAGTFDVDDFSDVFGGPPRTILFKQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR

Query:  FYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVIGDDVAFPSSSSNHRPSNVPTQWNSYKTMFKEQEMPQFPPDLSAHIDNRYVEDE------
        FYRDVFG DD RRSRDRSEP SKEFTRSNSSSDLSPL+P+IG+DVAFPSSS NHRPSNVP QWNSY TMFKEQEMPQFPPDLSAHIDN YVEDE      
Subjt:  FYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVIGDDVAFPSSSSNHRPSNVPTQWNSYKTMFKEQEMPQFPPDLSAHIDNRYVEDE------

Query:  --YHGFGEHVSSPETVILAPNSFRSIKICVDDLELNSPSSADSALCKDPVYYGGIYGSVLPEDDDDD--EDAMSSYVIEITSISREEYREEVSIDEAIAW
          YHGF E+VSS ETVI+APNSFRSIKICV+DLELNSPSSADS+LC+DPVYYGGI+G+VLPEDDDDD  EDAMSSYVIEI SI+REEYREEVSIDEAIAW
Subjt:  --YHGFGEHVSSPETVILAPNSFRSIKICVDDLELNSPSSADSALCKDPVYYGGIYGSVLPEDDDDD--EDAMSSYVIEITSISREEYREEVSIDEAIAW

Query:  AKSKYQSPASETDLSVRQQESEQSGEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYI
        AKS YQSPASETDLSVRQQESEQSGEEEGRPV FEFADQQL+ DG LQTEETQ+RDVKV+EGKP VDI+RELE LDEKIKLWS GKETNIRLLLSTLHYI
Subjt:  AKSKYQSPASETDLSVRQQESEQSGEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYI

Query:  LWSSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVFLN
        LWSSSGW P+SLTNLIG SQVKKAYQKARLCLHPDKLQQRGATMLQKYVA+KAFAILQDAW+VYISQDVFLN
Subjt:  LWSSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVFLN

XP_023541267.1 uncharacterized protein LOC111801489 [Cucurbita pepo subsp. pepo]3.0e-22786.86Show/hide
Query:  MEQTWRVRLGIPRFRSRRSEEETLPKATCSRAGTFDVDDFSDVFGGPPRTILFKQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR
        MEQ WRVRLGIPRFRSRR EEET  K TCS AGTFDV++FSDVFGGPPRTILF+QFSD FE KDSTSFYEE+FRSTE VS+PQK GRSLPAFRIPVKEDR
Subjt:  MEQTWRVRLGIPRFRSRRSEEETLPKATCSRAGTFDVDDFSDVFGGPPRTILFKQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR

Query:  FYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVIGDDVAFPSSSSNHRPSNVPTQWNSYKTMFKEQEMPQFPPDLSAHIDNRYVEDE------
        FYRDVFGSDD RRSRDRSEP SKEFTRSNSSSDLSPLRP+IG+DVAFPSSS NHRPSNVP QWNSY TMFKEQEMPQFPPDLSA IDN YVEDE      
Subjt:  FYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVIGDDVAFPSSSSNHRPSNVPTQWNSYKTMFKEQEMPQFPPDLSAHIDNRYVEDE------

Query:  --YHGFGEHVSSPETVILAPNSFRSIKICVDDLELNSPSSADSALCKDPVYYGGIYGSVLPE--DDDDDEDAMSSYVIEITSISREEYREEVSIDEAIAW
          YHGF E+VSS ETVI+APNSFRSIKICV+DLELNSPSSADS+LC+DPVYYGGI+G+VLPE  DDDDDED MSSYVIEI SI+REEYREEVSIDEAIAW
Subjt:  --YHGFGEHVSSPETVILAPNSFRSIKICVDDLELNSPSSADSALCKDPVYYGGIYGSVLPE--DDDDDEDAMSSYVIEITSISREEYREEVSIDEAIAW

Query:  AKSKYQSPASETDLSVRQQESEQSGEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYI
        AKS YQSPASETDLSVRQQESEQSGEEEGRPV FEFADQQL+ DG LQTEETQ+RDVKV+EGKP VDI+RE+EGLDEKIKLWS GKETNIRLLLSTLHYI
Subjt:  AKSKYQSPASETDLSVRQQESEQSGEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYI

Query:  LWSSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVFLN
        LWSSSGWSP+SLTNLIG SQVKKAYQKARLCLHPDKLQQRGATMLQKYVA+KAFAILQDAW+VYIS+DVFLN
Subjt:  LWSSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVFLN

XP_038895184.1 uncharacterized protein LOC120083484 [Benincasa hispida]1.9e-22988.3Show/hide
Query:  MEQTWRVRLGIPRFRSRRSEEETLPKATCSRAGTFDVDDFSDVFGGPPRTILFKQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR
        M+ +WRVR GIPRFRSRRSE+ETL K TCSRAGTF  DDFSDVFGGPP+TILF+QFSDRFEGKDSTSFYEEVFRSTE+VSRPQKGGRSLPAFRIPVKEDR
Subjt:  MEQTWRVRLGIPRFRSRRSEEETLPKATCSRAGTFDVDDFSDVFGGPPRTILFKQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR

Query:  FYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVIGDDVAFPSSSSNHRPSNVPTQWNSYKTMFKEQEMPQFPPDLSAHIDNRYVEDEY-----
        FYRDVFGS+DGRRSRDRSEPSSKEFTRSNSSSD SPLRPVI DDVAFPSSSSNHRPSNVPTQWNSY+TMFKEQEMPQF PDL+ HIDNRYVEDEY     
Subjt:  FYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVIGDDVAFPSSSSNHRPSNVPTQWNSYKTMFKEQEMPQFPPDLSAHIDNRYVEDEY-----

Query:  ---HGFGEHVSSPETVILAPNSFRSIKICVDDLELNSPSSADSALCKDPVYYGGIYGSVLPEDDDDDEDAMSSYVIEITSISREEYREEVSIDEAIAWAK
           HGFGE +SSPETV+L PNSFRSI+ICVDDLELNSPSSADS+LC+DPV YGGIY +VLPEDDDDDEDAMSSYVIEITSI+REEYREEVSIDEAIAWAK
Subjt:  ---HGFGEHVSSPETVILAPNSFRSIKICVDDLELNSPSSADSALCKDPVYYGGIYGSVLPEDDDDDEDAMSSYVIEITSISREEYREEVSIDEAIAWAK

Query:  SKYQSPASETDLSVRQQESEQSGEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILW
        SKYQS ASETDLSVRQQESEQSGEEEGRPVAFE++DQQ N +GL QT ETQQRDVKVEE KPQVD DRELEGLDEKIKLWSAGKETNIRLLLSTLHYILW
Subjt:  SKYQSPASETDLSVRQQESEQSGEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILW

Query:  SSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVFLN
        SSSGWSP+SLTNLIGGSQVKKAYQKARLCLHPDKLQQRGAT LQKYVA+KAF ILQ+AW+VYISQD FLN
Subjt:  SSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVFLN

TrEMBL top hitse value%identityAlignment
A0A5A7UJY3 Auxilin-related protein 2 isoform X28.9e-21783.93Show/hide
Query:  MEQTWRVRLGIPRFRSRRSEEETLPKATC-SRAGTFDVDDFSDVFGGPPRTILFKQFSDRFEGKDS-TSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKE
        M+ TWR+R GIPRFRSRRSE ++LPK T  S + TF  DDFSDVFGGPP+TILF+QFS+RFEG DS TSFY+EVFRS+++VSRPQK GRSLPAFRIPVKE
Subjt:  MEQTWRVRLGIPRFRSRRSEEETLPKATC-SRAGTFDVDDFSDVFGGPPRTILFKQFSDRFEGKDS-TSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKE

Query:  DRFYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVIGDDVAFPSSSSNHRPSNVPTQWNSYKTMFKEQEMPQFPPDLSAHIDNRYVEDEY---
        DRFYRDVFGS+DGRRSRDRSEPSSKEFTRSNSSSD +PL PVIGDDVAFPSSSSNHRPSNVPTQWNSY+TMFKEQEMPQF PDLS H DNRYVEDEY   
Subjt:  DRFYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVIGDDVAFPSSSSNHRPSNVPTQWNSYKTMFKEQEMPQFPPDLSAHIDNRYVEDEY---

Query:  -----HGFGEHVSSPETVILAPNSFRSIKICVDD-LELNSPSSADSALCKDPVYYGGIYGSVLPEDDDDDEDAMSSYVIEITSISREEYREEVSIDEAIA
             HGFGE VSSPETVIL PNSFRSIKICVDD LE+NSPSS +S+LC+DPVYYGG Y +VLPEDD DDED MSSYVIEITSI+REEYREEVSIDEAIA
Subjt:  -----HGFGEHVSSPETVILAPNSFRSIKICVDD-LELNSPSSADSALCKDPVYYGGIYGSVLPEDDDDDEDAMSSYVIEITSISREEYREEVSIDEAIA

Query:  WAKSKYQSPASETDLSVRQQESEQSGEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHY
        WAKSKYQS ASETDLSVRQQESEQSGEEEGRPVAFE +DQQ N +GL QT ETQQR+VKVEE KPQ++ DRELEGLDEKIKLWSAGKETNIRLLLSTLHY
Subjt:  WAKSKYQSPASETDLSVRQQESEQSGEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHY

Query:  ILWSSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVFLN
        ILWSSSGWSP+SLTNLIGG+QVKKAYQKARLCLHPDKLQQRGATMLQKYVA+KAF ILQ+AW+VYISQD F+N
Subjt:  ILWSSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVFLN

A0A6J1EN77 uncharacterized protein LOC1114341158.9e-21782.98Show/hide
Query:  MEQTWRVRLGIPRFRSRRSEEETLPKATCSRAGTFDVDDFSDVFGGPPRTILFKQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR
        M+  WRVR GIP+FRSRRSE ET+ K TCSRA TFD DDFSDVFGGPPRTILF+QFSD FEGKDSTSFYEEVF+S E+VS+PQKGGRSLPAFRIP+KEDR
Subjt:  MEQTWRVRLGIPRFRSRRSEEETLPKATCSRAGTFDVDDFSDVFGGPPRTILFKQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR

Query:  FYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVIGDDVAFPSSSSNHRPSNVPTQWNSYKTMFKEQEMPQFPPDLSAHIDNRYVEDEY-----
        FYR +FGS+DGR+SRDRSEP+SKEFTRSNSSS  SP RP+IGDDVAFPSSSSN RPSN PT+W+SY+TMFKEQEMPQFPPDL  HIDN YVE+E+     
Subjt:  FYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVIGDDVAFPSSSSNHRPSNVPTQWNSYKTMFKEQEMPQFPPDLSAHIDNRYVEDEY-----

Query:  ---HGFGEHVSSPETVILAPNSFRSIKICVDDLELNSPSSADSALCKDPVYYGGIYGSVLPEDDDDDEDAMSSYVIEITSISREEYREEVSIDEAIAWAK
           H FG  VSSPET+ L PNSFRSIKICVDDLE NSPSSA S+ C+DPV Y GIY +VLPEDD+DDEDAMSSYVIEITSI+REEYREEVSIDEAIAWAK
Subjt:  ---HGFGEHVSSPETVILAPNSFRSIKICVDDLELNSPSSADSALCKDPVYYGGIYGSVLPEDDDDDEDAMSSYVIEITSISREEYREEVSIDEAIAWAK

Query:  SKYQSPASETDLSVRQQESEQSGEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILW
        SKYQSP SETDLS RQQESEQSGEEEGRPV+FE + QQLN +GL Q  ET QRDVKVEEGKP+VDID+ELEGLDEKIKLWSAGKETNIRLLLSTLHYILW
Subjt:  SKYQSPASETDLSVRQQESEQSGEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILW

Query:  SSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVFLN
        SSSGWSP+SLTNLIGGSQ+KKAYQKARLCLHPDKLQQRGATMLQKYVAEKAF ILQ+AWAVYISQDVFLN
Subjt:  SSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVFLN

A0A6J1GRR2 uncharacterized protein LOC111456858 isoform X17.3e-22786.68Show/hide
Query:  MEQTWRVRLGIPRFRSRRSEEETLPKATCSRAGTFDVDDFSDVFGGPPRTILFKQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR
        MEQ WRVRLGIPRFRSRR EEET  K TCS AGTFDV++FSDVFGGPPRTILF+QFSD FEGKDSTSFYEE+FRSTE VS+PQK GRSLPAFRIPVKEDR
Subjt:  MEQTWRVRLGIPRFRSRRSEEETLPKATCSRAGTFDVDDFSDVFGGPPRTILFKQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR

Query:  FYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVIGDDVAFPSSSSNHRPSNVPTQWNSYKTMFKEQEMPQFPPDLSAHIDNRYVEDE------
        FYRDVFGSDD R SRDRSEP SKEFTRSNSSSDLSPLRP+I +DVAFPSSS NHRPSNVP QWNSY +MFKEQEMPQFPPDLSAHIDN YVEDE      
Subjt:  FYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVIGDDVAFPSSSSNHRPSNVPTQWNSYKTMFKEQEMPQFPPDLSAHIDNRYVEDE------

Query:  --YHGFGEHVSSPETVILAPNSFRSIKICVDDLELNSPSSADSALCKDPVYYGGIYGSVLPE---DDDDDEDAMSSYVIEITSISREEYREEVSIDEAIA
          YHGF E++SS ETVI+APNSFRSIKICV+DLELNSPSSADS+LC+DPVYYGGI+G+VLPE   DDDDDEDAMSSYVIEI SI+REEYREEVSIDEAIA
Subjt:  --YHGFGEHVSSPETVILAPNSFRSIKICVDDLELNSPSSADSALCKDPVYYGGIYGSVLPE---DDDDDEDAMSSYVIEITSISREEYREEVSIDEAIA

Query:  WAKSKYQSPASETDLSVRQQESEQSGEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHY
        WAKS  QSPASETDLSVRQQESEQSGEEEGRPV FEFADQQL+ DG LQTEETQ+RDVKV+EGKP VDI+RELEGLDEKIKLWS GKETNIRLLLSTLHY
Subjt:  WAKSKYQSPASETDLSVRQQESEQSGEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHY

Query:  ILWSSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVFLN
        ILWSSSGWSP+SLTNLIG SQVKKAYQKARLCLHPDKLQQRGATMLQKYVA+KAFAILQDAW+VYISQDVFLN
Subjt:  ILWSSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVFLN

A0A6J1GTM9 uncharacterized protein LOC111456858 isoform X25.2e-22586.47Show/hide
Query:  MEQTWRVRLGIPRFRSRRSEEETLPKATCSRAGTFDVDDFSDVFGGPPRTILFKQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR
        MEQ WRVRLGIPRFRSRR EEET  K TCS AGTFDV++FSDVFGGPPRTILF+QFSD FEGKDSTSFYEE+FRSTE VS+PQK GRSLPAFRIPVKEDR
Subjt:  MEQTWRVRLGIPRFRSRRSEEETLPKATCSRAGTFDVDDFSDVFGGPPRTILFKQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR

Query:  FYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVIGDDVAFPSSSSNHRPSNVPTQWNSYKTMFKEQEMPQFPPDLSAHIDNRYVEDE------
        FYRDVFGSDD R SRDRSEP SKEFTRSNSSSDLSPLRP+I +DVAFPSSS NHRPSNVP QWNSY +MFKEQEMPQFPPDLSAHIDN YVEDE      
Subjt:  FYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVIGDDVAFPSSSSNHRPSNVPTQWNSYKTMFKEQEMPQFPPDLSAHIDNRYVEDE------

Query:  --YHGFGEHVSSPETVILAPNSFRSIKICVDDLELNSPSSADSALCKDPVYYGGIYGSVLPE---DDDDDEDAMSSYVIEITSISREEYREEVSIDEAIA
          YHGF E++SS ETVI+APNSFRSIKICV+DLELNSPSSADS+LC+DPVYYGGI+G+VLPE   DDDDDEDAMSSYVIEI SI+REEYREEVSIDEAIA
Subjt:  --YHGFGEHVSSPETVILAPNSFRSIKICVDDLELNSPSSADSALCKDPVYYGGIYGSVLPE---DDDDDEDAMSSYVIEITSISREEYREEVSIDEAIA

Query:  WAKSKYQSPASETDLSVRQQESEQSGEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHY
        WAKS  QSPASETDLSVRQQESEQSG EEGRPV FEFADQQL+ DG LQTEETQ+RDVKV+EGKP VDI+RELEGLDEKIKLWS GKETNIRLLLSTLHY
Subjt:  WAKSKYQSPASETDLSVRQQESEQSGEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHY

Query:  ILWSSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVFLN
        ILWSSSGWSP+SLTNLIG SQVKKAYQKARLCLHPDKLQQRGATMLQKYVA+KAFAILQDAW+VYISQDVFLN
Subjt:  ILWSSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVFLN

A0A6J1K2C2 uncharacterized protein LOC1114904173.3e-22786.65Show/hide
Query:  MEQTWRVRLGIPRFRSRRSEEETLPKATCSRAGTFDVDDFSDVFGGPPRTILFKQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR
        MEQ WRVRLGIPRFRSRR E+ET  K TCS AGTFDV++FSDVFGGPPRTI+F+QFSD FEGKDSTSFYEE+FRSTE VS+PQK GRSLPAFRIPVKEDR
Subjt:  MEQTWRVRLGIPRFRSRRSEEETLPKATCSRAGTFDVDDFSDVFGGPPRTILFKQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDR

Query:  FYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVIGDDVAFPSSSSNHRPSNVPTQWNSYKTMFKEQEMPQFPPDLSAHIDNRYVEDE------
        FYRDVFG DD RRSRDRSEP SKEFTRSNSSSDLSPL+P+IG+DVAFPSSS NHRPSNVP QWNSY TMFKEQEMPQFPPDLSAHIDN YVEDE      
Subjt:  FYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLSPLRPVIGDDVAFPSSSSNHRPSNVPTQWNSYKTMFKEQEMPQFPPDLSAHIDNRYVEDE------

Query:  --YHGFGEHVSSPETVILAPNSFRSIKICVDDLELNSPSSADSALCKDPVYYGGIYGSVLPEDDDDD--EDAMSSYVIEITSISREEYREEVSIDEAIAW
          YHGF E+VSS ETVI+APNSFRSIKICV+DLELNSPSSADS+LC+DPVYYGGI+G+VLPEDDDDD  EDAMSSYVIEI SI+REEYREEVSIDEAIAW
Subjt:  --YHGFGEHVSSPETVILAPNSFRSIKICVDDLELNSPSSADSALCKDPVYYGGIYGSVLPEDDDDD--EDAMSSYVIEITSISREEYREEVSIDEAIAW

Query:  AKSKYQSPASETDLSVRQQESEQSGEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYI
        AKS YQSPASETDLSVRQQESEQSGEEEGRPV FEFADQQL+ DG LQTEETQ+RDVKV+EGKP VDI+RELE LDEKIKLWS GKETNIRLLLSTLHYI
Subjt:  AKSKYQSPASETDLSVRQQESEQSGEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYI

Query:  LWSSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVFLN
        LWSSSGW P+SLTNLIG SQVKKAYQKARLCLHPDKLQQRGATMLQKYVA+KAFAILQDAW+VYISQDVFLN
Subjt:  LWSSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVFLN

SwissProt top hitse value%identityAlignment
O14976 Cyclin-G-associated kinase4.0e-1240.2Show/hide
Query:  DIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILW-SSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYI
        D+ ++ + L  K+  W  GKE NIR LLSTLH +LW   S W+P+ + +L+   QVKK Y++A L +HPDK     A    +  A+  F  L DAW+ + 
Subjt:  DIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILW-SSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYI

Query:  SQ
        +Q
Subjt:  SQ

Q0WQ57 Auxilin-related protein 22.6e-2741.08Show/hide
Query:  EEYREEVSIDEAIAWAKSKYQSPASETDLSVRQQESEQSGEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAG
        E  R+  S    +    S + +PAS++         +  GE E R  A     Q+  E       E  +RD++V+  + Q + DR    LD +I+ W AG
Subjt:  EEYREEVSIDEAIAWAKSKYQSPASETDLSVRQQESEQSGEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAG

Query:  KETNIRLLLSTLHYILWSSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVF
        KE N+R LLSTL Y+LW   GW P+SLT+LI G+ VKK Y+KA LC+HPDK+QQ+GA + QKY+AEK F +L++AW  + S+++F
Subjt:  KETNIRLLLSTLHYILWSSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVF

Q9C9Q4 J domain-containing protein required for chloroplast accumulation response 17.0e-2541.18Show/hide
Query:  AKSKYQSPASETDLSVRQQES-------EQSGEEEGRPVAFEFADQQLNEDGLL--QTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAGKETNIR
        A ++ Q P+  T      +ES       +   EE   P       + ++E   +    E+  Q + K+EE     +   E++ +D KI+ WS+GK  NIR
Subjt:  AKSKYQSPASETDLSVRQQES-------EQSGEEEGRPVAFEFADQQLNEDGLL--QTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAGKETNIR

Query:  LLLSTLHYILWSSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAW
         LLSTL YILWS SGW P+ L ++I G+ V+K+YQ+A L LHPDKLQQ+GA+  QKY+AEK F +LQ+AW
Subjt:  LLLSTLHYILWSSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAW

Q9FWS1 Auxilin-like protein 12.2e-2337.37Show/hide
Query:  ITSISREEYREEVSIDEAIAWAKS-----KYQSPASETDLSVRQQESEQSGEE-------EGRPV----AFEFADQQLNEDGLLQTEETQQRDVKVEEGK
        IT +      + +S   A + AKS      + S       S    E++ SG         +G P+    A     Q+ ++       E + RD+K +  K
Subjt:  ITSISREEYREEVSIDEAIAWAKS-----KYQSPASETDLSVRQQESEQSGEE-------EGRPV----AFEFADQQLNEDGLLQTEETQQRDVKVEEGK

Query:  PQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAW
         Q + +R  E LD  +K WS+GKE N+R L+STL YIL + SGW P+ LT+L+  + V+KAY+KA L +HPDKLQQRGA+  QKY+ EK F +L++AW
Subjt:  PQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAW

Q9SU08 Auxilin-related protein 11.0e-2847.95Show/hide
Query:  GEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSPMSLTNLIGGSQVKKA
        GE E R  A     Q+  E       E  +RD++V+  + QV+ DR    LD +IK W AGKE N+R LLSTL Y+LW   GW P+SLT+LI  + VKK 
Subjt:  GEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSPMSLTNLIGGSQVKKA

Query:  YQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVF
        Y+KA LC+HPDK+QQ+GA + QKY+AEK F +L++AW  + S+++F
Subjt:  YQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVF

Arabidopsis top hitse value%identityAlignment
AT1G30280.1 Chaperone DnaJ-domain superfamily protein1.1e-6838.17Show/hide
Query:  MEQTWRVRLGIPRFRSRRSEEETLPKATCSRAGTFDVDDFSDVFGGPPRTILFKQFSDRFEGKDSTSFYEEVFR------STEVVSRPQKGGRSLPAFRI
        M+++WR+++G+       S +     A  S     D +DF+DVFGGPPR++L ++FS  F   D   FY+E+F+      S   ++  +  GR+LPAFRI
Subjt:  MEQTWRVRLGIPRFRSRRSEEETLPKATCSRAGTFDVDDFSDVFGGPPRTILFKQFSDRFEGKDSTSFYEEVFR------STEVVSRPQKGGRSLPAFRI

Query:  PVKEDRFYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLS---------PLRPVIGDDVAFPSSSSNHRPSNVPTQWNSYKTMFKEQE---MPQFPPDL
        P   + FY  VFG   G       + SS    RSNSSS LS         P     GDD  F S +S  RP NVP++  S+K   K+Q    +P+     
Subjt:  PVKEDRFYRDVFGSDDGRRSRDRSEPSSKEFTRSNSSSDLS---------PLRPVIGDDVAFPSSSSNHRPSNVPTQWNSYKTMFKEQE---MPQFPPDL

Query:  SAHIDNRYVEDEY----HGFGEHVSSPETVILAPNSFRSIKICVDDLELNSPSSA--DSALCK---DPVYYG-------GIYGSVLPEDDDDDEDAMSSY
        S   +     D Y    H  G   +SPET+ L PNSFR     +DD   +SP+S+   S +C+   D  +          +   V+ +++D++E+ MSSY
Subjt:  SAHIDNRYVEDEY----HGFGEHVSSPETVILAPNSFRSIKICVDDLELNSPSSA--DSALCK---DPVYYG-------GIYGSVLPEDDDDDEDAMSSY

Query:  VIEITSISREEYREE----------VSIDEAIAWAKSKYQSPASETDLSVRQQESEQSGEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQV
        VIEI S   + YREE            +DEAIAWAK + Q P ++                                    QTEE    D +  E +P+ 
Subjt:  VIEITSISREEYREE----------VSIDEAIAWAKSKYQSPASETDLSVRQQESEQSGEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQV

Query:  DIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATM-LQKYVAEKAFAILQDAWAVYI
        + + E+E  DE+I++W  GKETNIRLLLSTLH++LWS+S W  + L NL  GSQVKKAYQ+ARLCLHPDKLQQRG T  +QK VA + FAILQ+AWAVY+
Subjt:  DIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATM-LQKYVAEKAFAILQDAWAVYI

Query:  SQD
        + +
Subjt:  SQD

AT4G12770.1 Chaperone DnaJ-domain superfamily protein1.8e-2841.08Show/hide
Query:  EEYREEVSIDEAIAWAKSKYQSPASETDLSVRQQESEQSGEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAG
        E  R+  S    +    S + +PAS++         +  GE E R  A     Q+  E       E  +RD++V+  + Q + DR    LD +I+ W AG
Subjt:  EEYREEVSIDEAIAWAKSKYQSPASETDLSVRQQESEQSGEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAG

Query:  KETNIRLLLSTLHYILWSSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVF
        KE N+R LLSTL Y+LW   GW P+SLT+LI G+ VKK Y+KA LC+HPDK+QQ+GA + QKY+AEK F +L++AW  + S+++F
Subjt:  KETNIRLLLSTLHYILWSSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVF

AT4G12770.2 Chaperone DnaJ-domain superfamily protein9.6e-3047.26Show/hide
Query:  GEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSPMSLTNLIGGSQVKKA
        GE E R  A     Q+  E       E  +RD++V+  + Q + DR    LD +I+ W AGKE N+R LLSTL Y+LW   GW P+SLT+LI G+ VKK 
Subjt:  GEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSPMSLTNLIGGSQVKKA

Query:  YQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVF
        Y+KA LC+HPDK+QQ+GA + QKY+AEK F +L++AW  + S+++F
Subjt:  YQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVF

AT4G12780.1 Chaperone DnaJ-domain superfamily protein7.4e-3047.95Show/hide
Query:  GEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSPMSLTNLIGGSQVKKA
        GE E R  A     Q+  E       E  +RD++V+  + QV+ DR    LD +IK W AGKE N+R LLSTL Y+LW   GW P+SLT+LI  + VKK 
Subjt:  GEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSPMSLTNLIGGSQVKKA

Query:  YQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVF
        Y+KA LC+HPDK+QQ+GA + QKY+AEK F +L++AW  + S+++F
Subjt:  YQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVF

AT4G12780.2 Chaperone DnaJ-domain superfamily protein7.6e-2743.56Show/hide
Query:  GEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDID----RELE-------------GLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGW
        GE E R  A     Q+  E       E  +RD++V+  + QV+ D    R L               LD +IK W AGKE N+R LLSTL Y+LW   GW
Subjt:  GEEEGRPVAFEFADQQLNEDGLLQTEETQQRDVKVEEGKPQVDID----RELE-------------GLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGW

Query:  SPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVF
         P+SLT+LI  + VKK Y+KA LC+HPDK+QQ+GA + QKY+AEK F +L++AW  + S+++F
Subjt:  SPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVAEKAFAILQDAWAVYISQDVF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCAAACCTGGCGAGTCCGTTTAGGGATTCCACGGTTCCGTTCCCGGAGATCGGAAGAGGAAACCCTGCCCAAAGCCACATGTTCCCGCGCCGGAACTTTCGACGT
CGATGACTTCTCCGACGTCTTTGGCGGCCCGCCGAGGACCATCCTATTCAAGCAATTTTCCGACAGGTTTGAGGGTAAAGACTCCACTTCTTTCTACGAAGAAGTATTCC
GATCCACTGAGGTAGTTTCTCGCCCGCAGAAGGGCGGCCGGAGCTTGCCGGCCTTCAGAATCCCCGTTAAGGAGGATAGGTTTTACCGCGATGTTTTTGGATCGGACGAT
GGCCGGCGGTCAAGAGATAGGTCGGAGCCGAGCTCCAAGGAATTCACCAGATCGAATTCGTCTTCCGACCTCAGCCCCCTCCGGCCGGTCATCGGAGATGATGTGGCATT
CCCTTCGTCGTCTTCAAATCACAGGCCAAGCAATGTCCCAACACAATGGAACTCATACAAAACCATGTTCAAGGAACAAGAAATGCCACAGTTTCCGCCTGATCTCTCCG
CCCATATCGATAACCGGTATGTGGAAGATGAGTACCATGGATTTGGAGAGCACGTGTCGTCCCCAGAAACCGTTATTCTGGCACCGAATTCATTCAGAAGCATCAAAATC
TGTGTGGATGATTTGGAACTAAACTCCCCATCATCTGCTGATTCTGCACTCTGTAAAGATCCGGTTTATTATGGTGGTATTTATGGTAGTGTTTTACCAGAAGATGACGA
TGACGATGAAGATGCTATGAGCTCTTATGTCATTGAGATAACTTCCATCAGTAGAGAAGAATATAGAGAAGAGGTTTCTATCGATGAAGCAATTGCTTGGGCTAAATCAA
AGTATCAAAGTCCTGCATCTGAGACAGATTTGAGTGTTAGACAACAAGAAAGTGAGCAATCTGGTGAAGAAGAAGGAAGACCTGTTGCGTTTGAATTTGCAGATCAGCAG
TTGAATGAAGATGGATTGTTGCAGACAGAAGAGACACAACAGAGAGATGTAAAAGTTGAAGAAGGAAAGCCGCAGGTGGACATCGATAGAGAATTGGAAGGATTAGATGA
AAAAATAAAGTTATGGTCAGCTGGAAAGGAGACCAACATCCGCTTGCTACTTTCTACGCTTCATTATATATTGTGGTCAAGTAGTGGGTGGTCTCCAATGTCCTTGACAA
ACCTGATTGGAGGCTCACAAGTGAAGAAGGCTTATCAAAAAGCAAGGTTATGTCTCCACCCAGACAAGCTCCAGCAAAGAGGAGCCACAATGCTGCAAAAATATGTTGCA
GAGAAGGCTTTTGCCATCCTTCAGGACGCGTGGGCTGTATATATATCCCAAGATGTCTTCCTTAACTAG
mRNA sequenceShow/hide mRNA sequence
CAACTTTCTTGTATAAATATGTAAAGGAAGAACTTCCATTTGTTTCCATGGTGTATTGGTGTTCCACCACCTCTCCTTCACAAACCTCATATTTCTTCACTCTCTTTCTG
CACAAAACATCGCCGGAAAATGGAGCAAACCTGGCGAGTCCGTTTAGGGATTCCACGGTTCCGTTCCCGGAGATCGGAAGAGGAAACCCTGCCCAAAGCCACATGTTCCC
GCGCCGGAACTTTCGACGTCGATGACTTCTCCGACGTCTTTGGCGGCCCGCCGAGGACCATCCTATTCAAGCAATTTTCCGACAGGTTTGAGGGTAAAGACTCCACTTCT
TTCTACGAAGAAGTATTCCGATCCACTGAGGTAGTTTCTCGCCCGCAGAAGGGCGGCCGGAGCTTGCCGGCCTTCAGAATCCCCGTTAAGGAGGATAGGTTTTACCGCGA
TGTTTTTGGATCGGACGATGGCCGGCGGTCAAGAGATAGGTCGGAGCCGAGCTCCAAGGAATTCACCAGATCGAATTCGTCTTCCGACCTCAGCCCCCTCCGGCCGGTCA
TCGGAGATGATGTGGCATTCCCTTCGTCGTCTTCAAATCACAGGCCAAGCAATGTCCCAACACAATGGAACTCATACAAAACCATGTTCAAGGAACAAGAAATGCCACAG
TTTCCGCCTGATCTCTCCGCCCATATCGATAACCGGTATGTGGAAGATGAGTACCATGGATTTGGAGAGCACGTGTCGTCCCCAGAAACCGTTATTCTGGCACCGAATTC
ATTCAGAAGCATCAAAATCTGTGTGGATGATTTGGAACTAAACTCCCCATCATCTGCTGATTCTGCACTCTGTAAAGATCCGGTTTATTATGGTGGTATTTATGGTAGTG
TTTTACCAGAAGATGACGATGACGATGAAGATGCTATGAGCTCTTATGTCATTGAGATAACTTCCATCAGTAGAGAAGAATATAGAGAAGAGGTTTCTATCGATGAAGCA
ATTGCTTGGGCTAAATCAAAGTATCAAAGTCCTGCATCTGAGACAGATTTGAGTGTTAGACAACAAGAAAGTGAGCAATCTGGTGAAGAAGAAGGAAGACCTGTTGCGTT
TGAATTTGCAGATCAGCAGTTGAATGAAGATGGATTGTTGCAGACAGAAGAGACACAACAGAGAGATGTAAAAGTTGAAGAAGGAAAGCCGCAGGTGGACATCGATAGAG
AATTGGAAGGATTAGATGAAAAAATAAAGTTATGGTCAGCTGGAAAGGAGACCAACATCCGCTTGCTACTTTCTACGCTTCATTATATATTGTGGTCAAGTAGTGGGTGG
TCTCCAATGTCCTTGACAAACCTGATTGGAGGCTCACAAGTGAAGAAGGCTTATCAAAAAGCAAGGTTATGTCTCCACCCAGACAAGCTCCAGCAAAGAGGAGCCACAAT
GCTGCAAAAATATGTTGCAGAGAAGGCTTTTGCCATCCTTCAGGACGCGTGGGCTGTATATATATCCCAAGATGTCTTCCTTAACTAGAGCCTAATTTGGGATCTTTTGG
GAAGTCATCGTGAGTGTAAAGCTCAAGAAAGTAGCAAGCAAGTCTGGAACATTCAGAAGCTGATACCCATTTTTGCATTAAGAATTTGAGACTGAACCTGATTACACATG
AAAAAAGTTGTGTATAAAACAGAAATCAGGTTAATACGGCCTGTATGTATGTAAGATATATCTACTTATGTAGTGTTATGTGTAGGATTTATATCTCATTGTGAAGCTGC
GTTCAGAATATGATTGCAGAACACTTGTGCTTTGCATATGTACATACACATTTGGGAATAGATTTCAATACATATTAATACATTGCTCAGACTTCAGA
Protein sequenceShow/hide protein sequence
MEQTWRVRLGIPRFRSRRSEEETLPKATCSRAGTFDVDDFSDVFGGPPRTILFKQFSDRFEGKDSTSFYEEVFRSTEVVSRPQKGGRSLPAFRIPVKEDRFYRDVFGSDD
GRRSRDRSEPSSKEFTRSNSSSDLSPLRPVIGDDVAFPSSSSNHRPSNVPTQWNSYKTMFKEQEMPQFPPDLSAHIDNRYVEDEYHGFGEHVSSPETVILAPNSFRSIKI
CVDDLELNSPSSADSALCKDPVYYGGIYGSVLPEDDDDDEDAMSSYVIEITSISREEYREEVSIDEAIAWAKSKYQSPASETDLSVRQQESEQSGEEEGRPVAFEFADQQ
LNEDGLLQTEETQQRDVKVEEGKPQVDIDRELEGLDEKIKLWSAGKETNIRLLLSTLHYILWSSSGWSPMSLTNLIGGSQVKKAYQKARLCLHPDKLQQRGATMLQKYVA
EKAFAILQDAWAVYISQDVFLN