; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr014730 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr014730
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionDNA-directed RNA polymerase II protein
Genome locationtig00001047:467180..472599
RNA-Seq ExpressionSgr014730
SyntenySgr014730
Gene Ontology termsGO:0035493 - SNARE complex assembly (biological process)
GO:0000323 - lytic vacuole (cellular component)
GO:0005768 - endosome (cellular component)
GO:0000149 - SNARE binding (molecular function)
InterPro domainsIPR018791 - UV radiation resistance protein/autophagy-related protein 14


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7015913.1 hypothetical protein SDJN02_21017 [Cucurbita argyrosperma subsp. argyrosperma]7.2e-24090.76Show/hide
Query:  MNRKFCNCAICENSNQAFICTICVNYRLNDYNSSLKSLKARRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLEQGKAEIEMTSYD
        MNRKFCNCAICENSNQA IC  CVN+RLNDYNS+LKSL+ RRDSLYSRLSDVLVAKGKADDQLNWR+TRNEKL+RLREKL+R REQLEQGKAEIEMTSYD
Subjt:  MNRKFCNCAICENSNQAFICTICVNYRLNDYNSSLKSLKARRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLEQGKAEIEMTSYD

Query:  LRLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGENNEGSGEQFDQICNVRLPRRLDPHSVPPNE
        L+LK+AMLESARSVLEKQRVEQLEK+YPDLISTK LGHMAITSERLHKQSVVVKQICKLFPQRRVLV GEN EG GEQFDQICNV LPRRLDPHSV P+E
Subjt:  LRLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGENNEGSGEQFDQICNVRLPRRLDPHSVPPNE

Query:  LAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESERKPHLGSLESSS
        L+ASLGYMVQLLNLIV NLAAPALHNSGFAGSCSRIWQR+SYWDA PSS+SNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVAS+ESERKPHL SLE+ S
Subjt:  LAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESERKPHLGSLESSS

Query:  FNYTSASPHSIETHKDLQKGIALLKKSVACITAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSEISSS
        FNY+SAS HSIETHKDLQ GIALLKKSVACITAYCYNSL LDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNS ISSS
Subjt:  FNYTSASPHSIETHKDLQKGIALLKKSVACITAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSEISSS

Query:  MLLDSAHTLVMKTNCESNLPSSAASYLYATEFSDAGKNDYTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK
        MLL+SAH+ +MKTN ESN PSSA+SYLYATEFSDA KND TIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK
Subjt:  MLLDSAHTLVMKTNCESNLPSSAASYLYATEFSDAGKNDYTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK

XP_022152465.1 uncharacterized protein LOC111020186 isoform X1 [Momordica charantia]4.6e-25594.74Show/hide
Query:  NRKFCNCAICENSNQAFICTICVNYRLNDYNSSLKSLKARRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLEQGKAEIEMTSYDL
        NRKFCNCAICENSNQAFICTICVNYRLNDYNS+LKSLKARRD LYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLE+GKAEIEMTSYDL
Subjt:  NRKFCNCAICENSNQAFICTICVNYRLNDYNSSLKSLKARRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLEQGKAEIEMTSYDL

Query:  RLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGENNEGSGEQFDQICNVRLPRRLDPHSVPPNEL
        +LKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDG+ NEG+GEQFDQICNVRLPRRLDPHSVPPNEL
Subjt:  RLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGENNEGSGEQFDQICNVRLPRRLDPHSVPPNEL

Query:  AASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESERKPHLGSLESSSF
        AASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYW+ARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESE+KPHLGSLESSSF
Subjt:  AASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESERKPHLGSLESSSF

Query:  NYTSASPHSIETHKDLQKGIALLKKSVACITAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSEISSSM
        NY+SASPHSIETHKDLQKGIALLKKSVACITAY YNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSV+SLKM SSRSPKHVQKLNKSAWNV+S+  SSM
Subjt:  NYTSASPHSIETHKDLQKGIALLKKSVACITAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSEISSSM

Query:  LLDSAHTLVMKTNCESNLPSSAASYLYATEFSDAGKNDYTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK
        LLDS HTL+MK NCESNLPSSAASYLYATEFSD GKND +IEGWDL+EHPTFPPPPSQAEDIEHWTRAMFIDAT+
Subjt:  LLDSAHTLVMKTNCESNLPSSAASYLYATEFSDAGKNDYTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK

XP_022939132.1 uncharacterized protein LOC111445131 [Cucurbita moschata]3.2e-24090.97Show/hide
Query:  MNRKFCNCAICENSNQAFICTICVNYRLNDYNSSLKSLKARRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLEQGKAEIEMTSYD
        MNRKFCNCAICENSNQA IC  CVN+RLNDYNS+LKSL+ RRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKL+RLREKL+R REQLEQGKAEIEMTSYD
Subjt:  MNRKFCNCAICENSNQAFICTICVNYRLNDYNSSLKSLKARRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLEQGKAEIEMTSYD

Query:  LRLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGENNEGSGEQFDQICNVRLPRRLDPHSVPPNE
        L+LK+AMLESARSVLEKQRVEQLEK+YPDLISTK LGHMAITSERLHKQSVVVKQICKLFPQRRVLV GEN EG GEQFDQICNV LPRRLDPHSV P+E
Subjt:  LRLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGENNEGSGEQFDQICNVRLPRRLDPHSVPPNE

Query:  LAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESERKPHLGSLESSS
        L+ASLGYMVQLLNLIV NLAAPALHNSGFAGSCSRIWQR+SYWDA PSS+SNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVAS+ESERKPHL SLE+ S
Subjt:  LAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESERKPHLGSLESSS

Query:  FNYTSASPHSIETHKDLQKGIALLKKSVACITAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSEISSS
        FNY+SAS HSIETHKDLQ GIALLKKSVACITAYCYNSL LDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNS ISSS
Subjt:  FNYTSASPHSIETHKDLQKGIALLKKSVACITAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSEISSS

Query:  MLLDSAHTLVMKTNCESNLPSSAASYLYATEFSDAGKNDYTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK
        MLL+SAH+ +MKTN ESN PSSA+SYLYATEFSDA KND TIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK
Subjt:  MLLDSAHTLVMKTNCESNLPSSAASYLYATEFSDAGKNDYTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK

XP_022993038.1 uncharacterized protein LOC111489176 [Cucurbita maxima]2.6e-23789.92Show/hide
Query:  MNRKFCNCAICENSNQAFICTICVNYRLNDYNSSLKSLKARRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLEQGKAEIEMTSYD
        MNR+FCNCAICENSNQA IC  CVN+RLNDYNS+LKSL+ARRD LYSRLSDVLVAKGKADDQLNWRVTRNEKL+ LREKL+R REQLEQGK EIEMTSYD
Subjt:  MNRKFCNCAICENSNQAFICTICVNYRLNDYNSSLKSLKARRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLEQGKAEIEMTSYD

Query:  LRLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGENNEGSGEQFDQICNVRLPRRLDPHSVPPNE
        L+LK+AMLESARSVLEKQRVEQLEK+YPDLISTK LGHMAITSERLHKQSVVVKQICKLFPQRRVLV GEN EG GEQFDQICNV LPRRLDPHSV P+E
Subjt:  LRLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGENNEGSGEQFDQICNVRLPRRLDPHSVPPNE

Query:  LAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESERKPHLGSLESSS
        L+ASLGYMVQLLNLIV NLAAPALHNSGFAGSCSRIWQR+SYWDA PSS+ NEYPLFIPRQNYCSTSGENSWSDKSSSNFGVAS+ESERKPHL SLE+ S
Subjt:  LAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESERKPHLGSLESSS

Query:  FNYTSASPHSIETHKDLQKGIALLKKSVACITAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSEISSS
        FNY+SAS HSIETHKDLQ GIALLKKSVACITAYCYNSL LDVPSEASTFEAFAKLLA LSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNS ISSS
Subjt:  FNYTSASPHSIETHKDLQKGIALLKKSVACITAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSEISSS

Query:  MLLDSAHTLVMKTNCESNLPSSAASYLYATEFSDAGKNDYTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK
        MLL+SAH+ +MKTN ESN PSSA+SYLYATEFSDA KND TIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK
Subjt:  MLLDSAHTLVMKTNCESNLPSSAASYLYATEFSDAGKNDYTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK

XP_023549965.1 uncharacterized protein LOC111808299 [Cucurbita pepo subsp. pepo]5.5e-24090.97Show/hide
Query:  MNRKFCNCAICENSNQAFICTICVNYRLNDYNSSLKSLKARRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLEQGKAEIEMTSYD
        MNRKFCNCAICENSNQA IC  CVN+RLNDYNS+LKSL+ARRD LYSRLSDVLVAKGKADDQLNWRVTRNEKL+RLREKL+R REQLEQGKAEIEMTSYD
Subjt:  MNRKFCNCAICENSNQAFICTICVNYRLNDYNSSLKSLKARRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLEQGKAEIEMTSYD

Query:  LRLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGENNEGSGEQFDQICNVRLPRRLDPHSVPPNE
        L+LK+AMLESARSVLEKQRVEQLEK+YPDLISTK LGHMAITSERLHKQSVVVKQICKLFPQRRVLV GEN EG GEQFDQICNV LPRRLDPHSV P+E
Subjt:  LRLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGENNEGSGEQFDQICNVRLPRRLDPHSVPPNE

Query:  LAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESERKPHLGSLESSS
        L+ASLGYMVQLLNLIV NLAAPALHNSGFAGSCSRIWQR+SYWDA PSS+SNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVAS+ESERKPHL SLE+ S
Subjt:  LAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESERKPHLGSLESSS

Query:  FNYTSASPHSIETHKDLQKGIALLKKSVACITAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSEISSS
        FNY+SAS HSIETHKDLQ GIALLKKSVACITAYCYNSL LDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNS ISSS
Subjt:  FNYTSASPHSIETHKDLQKGIALLKKSVACITAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSEISSS

Query:  MLLDSAHTLVMKTNCESNLPSSAASYLYATEFSDAGKNDYTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK
        MLL+SAH+ +MKTN ESN PSSA+SYLYATEFSDA KND TIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK
Subjt:  MLLDSAHTLVMKTNCESNLPSSAASYLYATEFSDAGKNDYTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK

TrEMBL top hitse value%identityAlignment
A0A1S3B2Y9 uncharacterized protein LOC103485428 isoform X14.1e-23387.82Show/hide
Query:  MNRKFCNCAICENSNQAFICTICVNYRLNDYNSSLKSLKARRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLEQGKAEIEMTSYD
        MNRKFCNCAICENSNQA ICT CVN RLNDYN+SLKSL+ARRD LYSRLSDVLVAKGKADDQLNWRVTRNEKL RLREKL+RSREQLEQGKAEIEM S+D
Subjt:  MNRKFCNCAICENSNQAFICTICVNYRLNDYNSSLKSLKARRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLEQGKAEIEMTSYD

Query:  LRLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGENNEGSGEQFDQICNVRLPRRLDPHSVPPNE
        L+LKYAMLESARSVLEKQRVEQLEK+YPDLISTKNLGHMAITSERLHKQSVV+KQ+CKLFPQRRVLV G+   G GE FDQICNV LPR LDPHSV P E
Subjt:  LRLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGENNEGSGEQFDQICNVRLPRRLDPHSVPPNE

Query:  LAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESERKPHLGSLESSS
        L+ASLGYMVQLLNL+VQ LAAPALHNSGFAGSCSRIWQRDSYW+A PSSRSNEYP+F+PRQ+YCSTSGENSWSDKSSSNFGVAS+ESERKPHL SLE+ S
Subjt:  LAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESERKPHLGSLESSS

Query:  FNYTSASPHSIETHKDLQKGIALLKKSVACITAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSEISSS
        FNY+SASPHSIE+HKDLQKGIALLKKSVAC+TAY YNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKM SSRS KH+QK  KS WNVNS ISSS
Subjt:  FNYTSASPHSIETHKDLQKGIALLKKSVACITAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSEISSS

Query:  MLLDSAHTLVMKTNCESNLPSSAASYLYATEFSDAGKNDYTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK
        ML +S H+ +MKTN ESNLPSSA+SYLYATEFSDAGKND TIEGWDL+EHPTFPPPPSQAEDIEHWTRAMFIDATK
Subjt:  MLLDSAHTLVMKTNCESNLPSSAASYLYATEFSDAGKNDYTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK

A0A5A7T0W2 UV radiation resistance protein/autophagy-related protein 144.1e-23387.82Show/hide
Query:  MNRKFCNCAICENSNQAFICTICVNYRLNDYNSSLKSLKARRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLEQGKAEIEMTSYD
        MNRKFCNCAICENSNQA ICT CVN RLNDYN+SLKSL+ARRD LYSRLSDVLVAKGKADDQLNWRVTRNEKL RLREKL+RSREQLEQGKAEIEM S+D
Subjt:  MNRKFCNCAICENSNQAFICTICVNYRLNDYNSSLKSLKARRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLEQGKAEIEMTSYD

Query:  LRLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGENNEGSGEQFDQICNVRLPRRLDPHSVPPNE
        L+LKYAMLESARSVLEKQRVEQLEK+YPDLISTKNLGHMAITSERLHKQSVV+KQ+CKLFPQRRVLV G+   G GE FDQICNV LPR LDPHSV P E
Subjt:  LRLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGENNEGSGEQFDQICNVRLPRRLDPHSVPPNE

Query:  LAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESERKPHLGSLESSS
        L+ASLGYMVQLLNL+VQ LAAPALHNSGFAGSCSRIWQRDSYW+A PSSRSNEYP+F+PRQ+YCSTSGENSWSDKSSSNFGVAS+ESERKPHL SLE+ S
Subjt:  LAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESERKPHLGSLESSS

Query:  FNYTSASPHSIETHKDLQKGIALLKKSVACITAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSEISSS
        FNY+SASPHSIE+HKDLQKGIALLKKSVAC+TAY YNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKM SSRS KH+QK  KS WNVNS ISSS
Subjt:  FNYTSASPHSIETHKDLQKGIALLKKSVACITAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSEISSS

Query:  MLLDSAHTLVMKTNCESNLPSSAASYLYATEFSDAGKNDYTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK
        ML +S H+ +MKTN ESNLPSSA+SYLYATEFSDAGKND TIEGWDL+EHPTFPPPPSQAEDIEHWTRAMFIDATK
Subjt:  MLLDSAHTLVMKTNCESNLPSSAASYLYATEFSDAGKNDYTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK

A0A6J1DE02 uncharacterized protein LOC111020186 isoform X12.2e-25594.74Show/hide
Query:  NRKFCNCAICENSNQAFICTICVNYRLNDYNSSLKSLKARRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLEQGKAEIEMTSYDL
        NRKFCNCAICENSNQAFICTICVNYRLNDYNS+LKSLKARRD LYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLE+GKAEIEMTSYDL
Subjt:  NRKFCNCAICENSNQAFICTICVNYRLNDYNSSLKSLKARRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLEQGKAEIEMTSYDL

Query:  RLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGENNEGSGEQFDQICNVRLPRRLDPHSVPPNEL
        +LKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDG+ NEG+GEQFDQICNVRLPRRLDPHSVPPNEL
Subjt:  RLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGENNEGSGEQFDQICNVRLPRRLDPHSVPPNEL

Query:  AASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESERKPHLGSLESSSF
        AASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYW+ARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESE+KPHLGSLESSSF
Subjt:  AASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESERKPHLGSLESSSF

Query:  NYTSASPHSIETHKDLQKGIALLKKSVACITAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSEISSSM
        NY+SASPHSIETHKDLQKGIALLKKSVACITAY YNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSV+SLKM SSRSPKHVQKLNKSAWNV+S+  SSM
Subjt:  NYTSASPHSIETHKDLQKGIALLKKSVACITAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSEISSSM

Query:  LLDSAHTLVMKTNCESNLPSSAASYLYATEFSDAGKNDYTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK
        LLDS HTL+MK NCESNLPSSAASYLYATEFSD GKND +IEGWDL+EHPTFPPPPSQAEDIEHWTRAMFIDAT+
Subjt:  LLDSAHTLVMKTNCESNLPSSAASYLYATEFSDAGKNDYTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK

A0A6J1FFY6 uncharacterized protein LOC1114451311.6e-24090.97Show/hide
Query:  MNRKFCNCAICENSNQAFICTICVNYRLNDYNSSLKSLKARRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLEQGKAEIEMTSYD
        MNRKFCNCAICENSNQA IC  CVN+RLNDYNS+LKSL+ RRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKL+RLREKL+R REQLEQGKAEIEMTSYD
Subjt:  MNRKFCNCAICENSNQAFICTICVNYRLNDYNSSLKSLKARRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLEQGKAEIEMTSYD

Query:  LRLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGENNEGSGEQFDQICNVRLPRRLDPHSVPPNE
        L+LK+AMLESARSVLEKQRVEQLEK+YPDLISTK LGHMAITSERLHKQSVVVKQICKLFPQRRVLV GEN EG GEQFDQICNV LPRRLDPHSV P+E
Subjt:  LRLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGENNEGSGEQFDQICNVRLPRRLDPHSVPPNE

Query:  LAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESERKPHLGSLESSS
        L+ASLGYMVQLLNLIV NLAAPALHNSGFAGSCSRIWQR+SYWDA PSS+SNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVAS+ESERKPHL SLE+ S
Subjt:  LAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESERKPHLGSLESSS

Query:  FNYTSASPHSIETHKDLQKGIALLKKSVACITAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSEISSS
        FNY+SAS HSIETHKDLQ GIALLKKSVACITAYCYNSL LDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNS ISSS
Subjt:  FNYTSASPHSIETHKDLQKGIALLKKSVACITAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSEISSS

Query:  MLLDSAHTLVMKTNCESNLPSSAASYLYATEFSDAGKNDYTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK
        MLL+SAH+ +MKTN ESN PSSA+SYLYATEFSDA KND TIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK
Subjt:  MLLDSAHTLVMKTNCESNLPSSAASYLYATEFSDAGKNDYTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK

A0A6J1K104 uncharacterized protein LOC1114891761.2e-23789.92Show/hide
Query:  MNRKFCNCAICENSNQAFICTICVNYRLNDYNSSLKSLKARRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLEQGKAEIEMTSYD
        MNR+FCNCAICENSNQA IC  CVN+RLNDYNS+LKSL+ARRD LYSRLSDVLVAKGKADDQLNWRVTRNEKL+ LREKL+R REQLEQGK EIEMTSYD
Subjt:  MNRKFCNCAICENSNQAFICTICVNYRLNDYNSSLKSLKARRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLEQGKAEIEMTSYD

Query:  LRLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGENNEGSGEQFDQICNVRLPRRLDPHSVPPNE
        L+LK+AMLESARSVLEKQRVEQLEK+YPDLISTK LGHMAITSERLHKQSVVVKQICKLFPQRRVLV GEN EG GEQFDQICNV LPRRLDPHSV P+E
Subjt:  LRLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGENNEGSGEQFDQICNVRLPRRLDPHSVPPNE

Query:  LAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESERKPHLGSLESSS
        L+ASLGYMVQLLNLIV NLAAPALHNSGFAGSCSRIWQR+SYWDA PSS+ NEYPLFIPRQNYCSTSGENSWSDKSSSNFGVAS+ESERKPHL SLE+ S
Subjt:  LAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESERKPHLGSLESSS

Query:  FNYTSASPHSIETHKDLQKGIALLKKSVACITAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSEISSS
        FNY+SAS HSIETHKDLQ GIALLKKSVACITAYCYNSL LDVPSEASTFEAFAKLLA LSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNS ISSS
Subjt:  FNYTSASPHSIETHKDLQKGIALLKKSVACITAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSEISSS

Query:  MLLDSAHTLVMKTNCESNLPSSAASYLYATEFSDAGKNDYTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK
        MLL+SAH+ +MKTN ESN PSSA+SYLYATEFSDA KND TIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK
Subjt:  MLLDSAHTLVMKTNCESNLPSSAASYLYATEFSDAGKNDYTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G77890.1 DNA-directed RNA polymerase II protein3.8e-11451.88Show/hide
Query:  KFCNCAICENSNQAFICTICVNYRLNDYNSSLKSLKARRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLEQGKAEIEMTSYDLRL
        K   CA+C  S +  IC  CVN  LN+Y   L SLK+ R+  Y RLS +LV K KA  Q  W+  +NEKLA+LREKLQ   E+L+Q K      S +L+ 
Subjt:  KFCNCAICENSNQAFICTICVNYRLNDYNSSLKSLKARRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLEQGKAEIEMTSYDLRL

Query:  KYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGENNEGSGEQFDQICNVRLPRRLDPHSVPPNELAA
        +Y ++ES    LE+ RV QLE  Y D I    L ++ +TSERL+KQ++V+KQICKLFP  RV V+G+N +GS  Q+DQICN  LP+ L+P SVPP ELAA
Subjt:  KYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGENNEGSGEQFDQICNVRLPRRLDPHSVPPNELAA

Query:  SLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESER--KPHLGSLESSSF
        SLGYMVQLLNL+V  L+ PALHN GFAGSCSRIW+RDSYW++ PSS SN YPLF+P  ++ S   ++SW+ + ++NFGV S++S+   +     L+    
Subjt:  SLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESER--KPHLGSLESSSF

Query:  NYTSASPHSIETHKDLQKGIALLKKSVACITAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSEISSSM
        + +SASPHS+ET ++LQ+GIA LK+SVA +T Y Y SLSL+VPS  STFE FAKLLATLSS KEV+S  SL ++SS   +H  + NKS WN+NS  SSS 
Subjt:  NYTSASPHSIETHKDLQKGIALLKKSVACITAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSEISSSM

Query:  LLDSAHTLVMKTNCE-SNLPSSAASYLYATEFSDAGKNDYTIEGWDLIEHP
        LL+S+HT     N    N+P+   SY+   EF D  K+  +I  W+L+E+P
Subjt:  LLDSAHTLVMKTNCE-SNLPSSAASYLYATEFSDAGKNDYTIEGWDLIEHP

AT1G77890.2 DNA-directed RNA polymerase II protein2.0e-10750.33Show/hide
Query:  KFCNCAICENSNQAFICTICVNYRLNDYNSSLKSLKARRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLEQGKAEIEMTSYDLRL
        K   CA+C  S +  IC  CVN  LN+Y   L SLK+ R+  Y RLS +LV K KA  Q  W+  +NEKLA+LREKLQ   E+L+Q K      S +L+ 
Subjt:  KFCNCAICENSNQAFICTICVNYRLNDYNSSLKSLKARRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLEQGKAEIEMTSYDLRL

Query:  KYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGENNEGSGEQFDQICNVRLPRRLDPHSVPPNELAA
        +Y ++ES    LE+ RV QLE  Y D I    L +           ++V+KQICKLFP  RV V+G+N +GS  Q+DQICN  LP+ L+P SVPP ELAA
Subjt:  KYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGENNEGSGEQFDQICNVRLPRRLDPHSVPPNELAA

Query:  SLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESER--KPHLGSLESSSF
        SLGYMVQLLNL+V  L+ PALHN GFAGSCSRIW+RDSYW++ PSS SN YPLF+P  ++ S   ++SW+ + ++NFGV S++S+   +     L+    
Subjt:  SLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESER--KPHLGSLESSSF

Query:  NYTSASPHSIETHKDLQKGIALLKKSVACITAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSEISSSM
        + +SASPHS+ET ++LQ+GIA LK+SVA +T Y Y SLSL+VPS  STFE FAKLLATLSS KEV+S  SL ++SS   +H  + NKS WN+NS  SSS 
Subjt:  NYTSASPHSIETHKDLQKGIALLKKSVACITAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSEISSSM

Query:  LLDSAHTLVMKTNCE-SNLPSSAASYLYATEFSDAGKNDYTIEGWDLIEHP
        LL+S+HT     N    N+P+   SY+   EF D  K+  +I  W+L+E+P
Subjt:  LLDSAHTLVMKTNCE-SNLPSSAASYLYATEFSDAGKNDYTIEGWDLIEHP

AT1G77890.3 DNA-directed RNA polymerase II protein9.4e-11351.88Show/hide
Query:  KFCNCAICENSNQAFICTICVNYRLNDYNSSLKSLKARRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLEQGKAEIEMTSYDLRL
        K   CA+C  S +  IC  CVN  LN+Y   L SLK+ R+  Y RLS +LV K KA  Q  W+  +NEKLA+LREKLQ   E+L+Q K      S +L+ 
Subjt:  KFCNCAICENSNQAFICTICVNYRLNDYNSSLKSLKARRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLEQGKAEIEMTSYDLRL

Query:  KYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGENNEGSGEQFDQICNVRLPRRLDPHSVPPNELAA
        +Y ++ES    LE+ RV QLE  Y D I    L  + +TSERL+KQ++V+KQICKLFP  RV V+G+N +GS  Q+DQICN  LP+ L+P SVPP ELAA
Subjt:  KYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGENNEGSGEQFDQICNVRLPRRLDPHSVPPNELAA

Query:  SLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESER--KPHLGSLESSSF
        SLGYMVQLLNL+V  L+ PALHN GFAGSCSRIW+RDSYW++ PSS SN YPLF+P  ++ S   ++SW+ + ++NFGV S++S+   +     L+    
Subjt:  SLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESER--KPHLGSLESSSF

Query:  NYTSASPHSIETHKDLQKGIALLKKSVACITAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSEISSSM
        + +SASPHS+ET ++LQ+GIA LK+SVA +T Y Y SLSL+VPS  STFE FAKLLATLSS KEV+S  SL ++SS   +H  + NKS WN+NS  SSS 
Subjt:  NYTSASPHSIETHKDLQKGIALLKKSVACITAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSEISSSM

Query:  LLDSAHTLVMKTNCE-SNLPSSAASYLYATEFSDAGKNDYTIEGWDLIEHP
        LL+S+HT     N    N+P+   SY+   EF D  K+  +I  W+L+E+P
Subjt:  LLDSAHTLVMKTNCE-SNLPSSAASYLYATEFSDAGKNDYTIEGWDLIEHP

AT4G08540.1 DNA-directed RNA polymerase II protein1.1e-18570.23Show/hide
Query:  MNRKFCNCAICENSNQAFICTICVNYRLNDYNSSLKSLKARRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLEQGKAEIEMTSYD
        M ++  NCAIC+N+N+  ICT CVN+RL +YN+ LKSLK RRDSL SR +++L +KGKADDQ NWR+ +NEK+++L++KL+ ++E + QGK +IE  S D
Subjt:  MNRKFCNCAICENSNQAFICTICVNYRLNDYNSSLKSLKARRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLEQGKAEIEMTSYD

Query:  LRLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGENNEGSGEQFDQICNVRLPRRLDPHSVPPNE
        L++KY +L+SARS LEK RVEQ+EK +P+LI T++LGHMAI+SERLHKQSVVVKQICKLFP RRV  DGE+  GS  Q+D ICN RLP  LDPHS+P  E
Subjt:  LRLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGENNEGSGEQFDQICNVRLPRRLDPHSVPPNE

Query:  LAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESERK-PHLGSLESS
        LA SLGYMVQLLNL+V NLAAPALH+SGFAGSCSRIWQRDSYWD R S+RSNEYPLFIPR+NYCSTS ENSW+DK+SSNFGVASMES+RK P L S  S+
Subjt:  LAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESERK-PHLGSLESS

Query:  SFNYTSASPHSIETHKDLQKGIALLKKSVACITAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSEISS
        SF Y+SASPHSIE+H+DLQKGIALLKKSVAC+TAYCYNSL L+VP EASTFEAFAKLLATLSSSKEVRSVFSLKMASSRS K  Q+LNKS WN +S ISS
Subjt:  SFNYTSASPHSIETHKDLQKGIALLKKSVACITAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSEISS

Query:  SMLLDSAHTLVMKTNCESNLPSSAASYLYATEFSDAGKNDYTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK
        S LL+SAH  + +    +  P+S ASYL ATE S    ND  + GWDL+EHP +PPPPSQ+ED+EHWTRAMFIDA K
Subjt:  SMLLDSAHTLVMKTNCESNLPSSAASYLYATEFSDAGKNDYTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCGGAAATTCTGCAACTGCGCTATCTGTGAGAATTCAAATCAAGCTTTCATTTGCACCATTTGCGTCAATTACAGATTGAATGACTACAACTCTTCGTTAAAATC
ATTGAAAGCTCGGCGGGATTCGTTATATTCGAGGCTGAGTGACGTGCTTGTGGCCAAGGGGAAGGCAGACGATCAATTAAATTGGAGAGTGACTCGGAATGAGAAACTTG
CCAGGTTAAGGGAGAAACTCCAACGCAGTAGAGAGCAACTCGAGCAAGGAAAGGCAGAGATTGAGATGACGTCCTATGATCTGAGATTGAAATATGCGATGCTTGAATCA
GCCCGTTCAGTGTTGGAAAAGCAGCGAGTTGAACAACTGGAAAAGTCCTATCCTGACCTTATTAGCACCAAGAATCTTGGACATATGGCAATTACCTCTGAACGCCTTCA
CAAGCAATCTGTGGTCGTAAAACAAATATGCAAATTGTTTCCACAACGTCGGGTTTTAGTTGATGGAGAGAATAATGAGGGATCTGGTGAGCAATTTGATCAAATCTGTA
ATGTGCGCTTGCCAAGAAGACTGGATCCCCACTCTGTTCCGCCAAATGAGCTTGCTGCTTCTTTGGGATACATGGTGCAACTTCTAAATCTTATTGTTCAAAATTTGGCT
GCTCCAGCGCTTCACAACTCAGGTTTTGCAGGTTCTTGTTCACGCATATGGCAAAGGGATTCATATTGGGATGCTCGTCCATCTTCTCGAAGCAATGAATATCCACTTTT
TATTCCACGTCAAAACTATTGTTCAACAAGTGGGGAAAATTCGTGGTCTGATAAAAGCTCTAGTAACTTTGGTGTTGCTTCGATGGAATCAGAGAGGAAACCACATTTAG
GTTCATTAGAAAGTAGTAGTTTCAATTATACTTCAGCTTCTCCACATTCTATTGAAACACACAAGGATTTGCAGAAAGGGATTGCTCTCCTCAAAAAAAGTGTAGCATGC
ATCACCGCATACTGCTATAACTCTCTGTCTCTAGATGTTCCTTCCGAAGCATCTACTTTTGAAGCATTTGCCAAATTGTTGGCTACTCTTTCTTCATCCAAGGAAGTGCG
TTCTGTTTTTTCCCTCAAAATGGCTTCTTCGAGGTCCCCTAAGCACGTTCAGAAACTGAACAAATCTGCATGGAATGTGAATTCTGAGATTTCATCAAGCATGCTGCTCG
ATAGTGCACATACACTAGTAATGAAAACCAATTGTGAGAGTAACCTTCCAAGTTCTGCTGCGAGTTATCTTTATGCCACTGAATTTTCTGATGCCGGAAAGAATGATTAC
ACCATTGAAGGATGGGATCTCATCGAGCATCCGACTTTTCCTCCTCCACCTTCCCAAGCGGAGGATATTGAGCATTGGACTCGAGCAATGTTCATTGATGCCACCAAAGA
mRNA sequenceShow/hide mRNA sequence
ATGAATCGGAAATTCTGCAACTGCGCTATCTGTGAGAATTCAAATCAAGCTTTCATTTGCACCATTTGCGTCAATTACAGATTGAATGACTACAACTCTTCGTTAAAATC
ATTGAAAGCTCGGCGGGATTCGTTATATTCGAGGCTGAGTGACGTGCTTGTGGCCAAGGGGAAGGCAGACGATCAATTAAATTGGAGAGTGACTCGGAATGAGAAACTTG
CCAGGTTAAGGGAGAAACTCCAACGCAGTAGAGAGCAACTCGAGCAAGGAAAGGCAGAGATTGAGATGACGTCCTATGATCTGAGATTGAAATATGCGATGCTTGAATCA
GCCCGTTCAGTGTTGGAAAAGCAGCGAGTTGAACAACTGGAAAAGTCCTATCCTGACCTTATTAGCACCAAGAATCTTGGACATATGGCAATTACCTCTGAACGCCTTCA
CAAGCAATCTGTGGTCGTAAAACAAATATGCAAATTGTTTCCACAACGTCGGGTTTTAGTTGATGGAGAGAATAATGAGGGATCTGGTGAGCAATTTGATCAAATCTGTA
ATGTGCGCTTGCCAAGAAGACTGGATCCCCACTCTGTTCCGCCAAATGAGCTTGCTGCTTCTTTGGGATACATGGTGCAACTTCTAAATCTTATTGTTCAAAATTTGGCT
GCTCCAGCGCTTCACAACTCAGGTTTTGCAGGTTCTTGTTCACGCATATGGCAAAGGGATTCATATTGGGATGCTCGTCCATCTTCTCGAAGCAATGAATATCCACTTTT
TATTCCACGTCAAAACTATTGTTCAACAAGTGGGGAAAATTCGTGGTCTGATAAAAGCTCTAGTAACTTTGGTGTTGCTTCGATGGAATCAGAGAGGAAACCACATTTAG
GTTCATTAGAAAGTAGTAGTTTCAATTATACTTCAGCTTCTCCACATTCTATTGAAACACACAAGGATTTGCAGAAAGGGATTGCTCTCCTCAAAAAAAGTGTAGCATGC
ATCACCGCATACTGCTATAACTCTCTGTCTCTAGATGTTCCTTCCGAAGCATCTACTTTTGAAGCATTTGCCAAATTGTTGGCTACTCTTTCTTCATCCAAGGAAGTGCG
TTCTGTTTTTTCCCTCAAAATGGCTTCTTCGAGGTCCCCTAAGCACGTTCAGAAACTGAACAAATCTGCATGGAATGTGAATTCTGAGATTTCATCAAGCATGCTGCTCG
ATAGTGCACATACACTAGTAATGAAAACCAATTGTGAGAGTAACCTTCCAAGTTCTGCTGCGAGTTATCTTTATGCCACTGAATTTTCTGATGCCGGAAAGAATGATTAC
ACCATTGAAGGATGGGATCTCATCGAGCATCCGACTTTTCCTCCTCCACCTTCCCAAGCGGAGGATATTGAGCATTGGACTCGAGCAATGTTCATTGATGCCACCAAAGA
Protein sequenceShow/hide protein sequence
MNRKFCNCAICENSNQAFICTICVNYRLNDYNSSLKSLKARRDSLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLEQGKAEIEMTSYDLRLKYAMLES
ARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGENNEGSGEQFDQICNVRLPRRLDPHSVPPNELAASLGYMVQLLNLIVQNLA
APALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESERKPHLGSLESSSFNYTSASPHSIETHKDLQKGIALLKKSVAC
ITAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSEISSSMLLDSAHTLVMKTNCESNLPSSAASYLYATEFSDAGKNDY
TIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATKX