; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g1978 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g1978
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionDNA-directed RNA polymerase II protein
Genome locationMC04:26555858..26563603
RNA-Seq ExpressionMC04g1978
SyntenyMC04g1978
Gene Ontology termsGO:0035493 - SNARE complex assembly (biological process)
GO:0000323 - lytic vacuole (cellular component)
GO:0005768 - endosome (cellular component)
GO:0000149 - SNARE binding (molecular function)
InterPro domainsIPR018791 - UV radiation resistance protein/autophagy-related protein 14


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7015913.1 hypothetical protein SDJN02_21017 [Cucurbita argyrosperma subsp. argyrosperma]1.56e-29387.79Show/hide
Query:  NRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDL
        NRKFCNCAICENSNQA IC  CVN+RLNDYNSTLKSL+ RRD LYSRLSDVLVAKGKADDQLNWR+TRNEKL+RLREKL+R REQLE+GKAEIEMTSYDL
Subjt:  NRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDL

Query:  KLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNEL
        KLK+AMLESARSVLEKQRVEQLEK+YPDLISTK LGHMAITSERLHKQSVVVKQICKLFPQRRVLV G+  EG GEQFDQICNV LPRRLDPHSV P+EL
Subjt:  KLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNEL

Query:  AASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESSSF
        +ASLGYMVQLLNLIV NLAAPALHNSGFAGSCSRIWQR+SYW+A PSS+SNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVAS+ESE+KPHL SLE+ SF
Subjt:  AASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESSSF

Query:  NYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFPSSM
        NYSSAS HSIETHKDLQ GIALLKKSVACITAY YNSL LDVPSEASTFEAFAKLLATLSSSKEVRSV+SLKM SSRSPKHVQKLNKSAWNV+S   SSM
Subjt:  NYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFPSSM

Query:  LLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATR
        LL+S H+ IMK N ESN PSSA+SYLYATEFSD  KND +IEGWDL+EHPTFPPPPSQAEDIEHWTRAMFIDAT+
Subjt:  LLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATR

XP_008441240.1 PREDICTED: uncharacterized protein LOC103485428 isoform X2 [Cucumis melo]6.15e-29286.19Show/hide
Query:  MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY
        M NRKFCNCAICENSNQA ICT CVN RLNDYN++LKSL+ARRD LYSRLSDVLVAKGKADDQLNWRVTRNEKL RLREKL+RSREQLE+GKAEIEM S+
Subjt:  MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY

Query:  DLKLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN
        DL+LKYAMLESARSVLEKQRVEQLEK+YPDLISTKNLGHMAITSERLHKQSVV+KQ+CKLFPQRRVLV G K  G GE FDQICNV LPR LDPHSV P 
Subjt:  DLKLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN

Query:  ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESS
        EL+ASLGYMVQLLNL+VQ LAAPALHNSGFAGSCSRIWQRDSYWNA PSSRSNEYP+F+PRQ+YCSTSGENSWSDKSSSNFGVAS+ESE+KPHL SLE+ 
Subjt:  ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESS

Query:  SFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFPS
        SFNYSSASPHSIE+HKDLQKGIALLKKSVAC+TAY YNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSV+SLKM+SSRS KH+QK  KS WNV+S   S
Subjt:  SFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFPS

Query:  SMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK
        SML +S H+ IMK N ESNLPSSA+SYLYATEFSD GKND +IEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDAT++
Subjt:  SMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK

XP_022152465.1 uncharacterized protein LOC111020186 isoform X1 [Momordica charantia]0.0100Show/hide
Query:  MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY
        MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY
Subjt:  MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY

Query:  DLKLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN
        DLKLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN
Subjt:  DLKLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN

Query:  ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESS
        ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESS
Subjt:  ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESS

Query:  SFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFPS
        SFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFPS
Subjt:  SFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFPS

Query:  SMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK
        SMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK
Subjt:  SMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK

XP_022939132.1 uncharacterized protein LOC111445131 [Cucurbita moschata]1.52e-29488Show/hide
Query:  NRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDL
        NRKFCNCAICENSNQA IC  CVN+RLNDYNSTLKSL+ RRD LYSRLSDVLVAKGKADDQLNWRVTRNEKL+RLREKL+R REQLE+GKAEIEMTSYDL
Subjt:  NRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDL

Query:  KLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNEL
        KLK+AMLESARSVLEKQRVEQLEK+YPDLISTK LGHMAITSERLHKQSVVVKQICKLFPQRRVLV G+  EG GEQFDQICNV LPRRLDPHSV P+EL
Subjt:  KLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNEL

Query:  AASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESSSF
        +ASLGYMVQLLNLIV NLAAPALHNSGFAGSCSRIWQR+SYW+A PSS+SNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVAS+ESE+KPHL SLE+ SF
Subjt:  AASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESSSF

Query:  NYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFPSSM
        NYSSAS HSIETHKDLQ GIALLKKSVACITAY YNSL LDVPSEASTFEAFAKLLATLSSSKEVRSV+SLKM SSRSPKHVQKLNKSAWNV+S   SSM
Subjt:  NYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFPSSM

Query:  LLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATR
        LL+S H+ IMK N ESN PSSA+SYLYATEFSD  KND +IEGWDL+EHPTFPPPPSQAEDIEHWTRAMFIDAT+
Subjt:  LLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATR

XP_023549965.1 uncharacterized protein LOC111808299 [Cucurbita pepo subsp. pepo]2.64e-29588.21Show/hide
Query:  NRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDL
        NRKFCNCAICENSNQA IC  CVN+RLNDYNSTLKSL+ARRD LYSRLSDVLVAKGKADDQLNWRVTRNEKL+RLREKL+R REQLE+GKAEIEMTSYDL
Subjt:  NRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDL

Query:  KLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNEL
        KLK+AMLESARSVLEKQRVEQLEK+YPDLISTK LGHMAITSERLHKQSVVVKQICKLFPQRRVLV G+  EG GEQFDQICNV LPRRLDPHSV P+EL
Subjt:  KLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNEL

Query:  AASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESSSF
        +ASLGYMVQLLNLIV NLAAPALHNSGFAGSCSRIWQR+SYW+A PSS+SNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVAS+ESE+KPHL SLE+ SF
Subjt:  AASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESSSF

Query:  NYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFPSSM
        NYSSAS HSIETHKDLQ GIALLKKSVACITAY YNSL LDVPSEASTFEAFAKLLATLSSSKEVRSV+SLKM SSRSPKHVQKLNKSAWNV+S   SSM
Subjt:  NYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFPSSM

Query:  LLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATR
        LL+S H+ IMK N ESN PSSA+SYLYATEFSD  KND +IEGWDL+EHPTFPPPPSQAEDIEHWTRAMFIDAT+
Subjt:  LLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATR

TrEMBL top hitse value%identityAlignment
A0A1S3B2Y9 uncharacterized protein LOC103485428 isoform X14.02e-29286.19Show/hide
Query:  MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY
        M NRKFCNCAICENSNQA ICT CVN RLNDYN++LKSL+ARRD LYSRLSDVLVAKGKADDQLNWRVTRNEKL RLREKL+RSREQLE+GKAEIEM S+
Subjt:  MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY

Query:  DLKLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN
        DL+LKYAMLESARSVLEKQRVEQLEK+YPDLISTKNLGHMAITSERLHKQSVV+KQ+CKLFPQRRVLV G K  G GE FDQICNV LPR LDPHSV P 
Subjt:  DLKLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN

Query:  ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESS
        EL+ASLGYMVQLLNL+VQ LAAPALHNSGFAGSCSRIWQRDSYWNA PSSRSNEYP+F+PRQ+YCSTSGENSWSDKSSSNFGVAS+ESE+KPHL SLE+ 
Subjt:  ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESS

Query:  SFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFPS
        SFNYSSASPHSIE+HKDLQKGIALLKKSVAC+TAY YNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSV+SLKM+SSRS KH+QK  KS WNV+S   S
Subjt:  SFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFPS

Query:  SMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK
        SML +S H+ IMK N ESNLPSSA+SYLYATEFSD GKND +IEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDAT++
Subjt:  SMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK

A0A1S3B3M2 uncharacterized protein LOC103485428 isoform X22.98e-29286.19Show/hide
Query:  MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY
        M NRKFCNCAICENSNQA ICT CVN RLNDYN++LKSL+ARRD LYSRLSDVLVAKGKADDQLNWRVTRNEKL RLREKL+RSREQLE+GKAEIEM S+
Subjt:  MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY

Query:  DLKLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN
        DL+LKYAMLESARSVLEKQRVEQLEK+YPDLISTKNLGHMAITSERLHKQSVV+KQ+CKLFPQRRVLV G K  G GE FDQICNV LPR LDPHSV P 
Subjt:  DLKLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN

Query:  ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESS
        EL+ASLGYMVQLLNL+VQ LAAPALHNSGFAGSCSRIWQRDSYWNA PSSRSNEYP+F+PRQ+YCSTSGENSWSDKSSSNFGVAS+ESE+KPHL SLE+ 
Subjt:  ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESS

Query:  SFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFPS
        SFNYSSASPHSIE+HKDLQKGIALLKKSVAC+TAY YNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSV+SLKM+SSRS KH+QK  KS WNV+S   S
Subjt:  SFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFPS

Query:  SMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK
        SML +S H+ IMK N ESNLPSSA+SYLYATEFSD GKND +IEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDAT++
Subjt:  SMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK

A0A5A7T0W2 UV radiation resistance protein/autophagy-related protein 142.98e-29286.19Show/hide
Query:  MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY
        M NRKFCNCAICENSNQA ICT CVN RLNDYN++LKSL+ARRD LYSRLSDVLVAKGKADDQLNWRVTRNEKL RLREKL+RSREQLE+GKAEIEM S+
Subjt:  MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY

Query:  DLKLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN
        DL+LKYAMLESARSVLEKQRVEQLEK+YPDLISTKNLGHMAITSERLHKQSVV+KQ+CKLFPQRRVLV G K  G GE FDQICNV LPR LDPHSV P 
Subjt:  DLKLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN

Query:  ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESS
        EL+ASLGYMVQLLNL+VQ LAAPALHNSGFAGSCSRIWQRDSYWNA PSSRSNEYP+F+PRQ+YCSTSGENSWSDKSSSNFGVAS+ESE+KPHL SLE+ 
Subjt:  ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESS

Query:  SFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFPS
        SFNYSSASPHSIE+HKDLQKGIALLKKSVAC+TAY YNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSV+SLKM+SSRS KH+QK  KS WNV+S   S
Subjt:  SFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFPS

Query:  SMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK
        SML +S H+ IMK N ESNLPSSA+SYLYATEFSD GKND +IEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDAT++
Subjt:  SMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK

A0A6J1DE02 uncharacterized protein LOC111020186 isoform X10.0100Show/hide
Query:  MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY
        MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY
Subjt:  MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY

Query:  DLKLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN
        DLKLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN
Subjt:  DLKLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN

Query:  ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESS
        ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESS
Subjt:  ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESS

Query:  SFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFPS
        SFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFPS
Subjt:  SFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFPS

Query:  SMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK
        SMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK
Subjt:  SMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK

A0A6J1FFY6 uncharacterized protein LOC1114451317.38e-29588Show/hide
Query:  NRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDL
        NRKFCNCAICENSNQA IC  CVN+RLNDYNSTLKSL+ RRD LYSRLSDVLVAKGKADDQLNWRVTRNEKL+RLREKL+R REQLE+GKAEIEMTSYDL
Subjt:  NRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDL

Query:  KLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNEL
        KLK+AMLESARSVLEKQRVEQLEK+YPDLISTK LGHMAITSERLHKQSVVVKQICKLFPQRRVLV G+  EG GEQFDQICNV LPRRLDPHSV P+EL
Subjt:  KLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNEL

Query:  AASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESSSF
        +ASLGYMVQLLNLIV NLAAPALHNSGFAGSCSRIWQR+SYW+A PSS+SNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVAS+ESE+KPHL SLE+ SF
Subjt:  AASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESSSF

Query:  NYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFPSSM
        NYSSAS HSIETHKDLQ GIALLKKSVACITAY YNSL LDVPSEASTFEAFAKLLATLSSSKEVRSV+SLKM SSRSPKHVQKLNKSAWNV+S   SSM
Subjt:  NYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFPSSM

Query:  LLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATR
        LL+S H+ IMK N ESN PSSA+SYLYATEFSD  KND +IEGWDL+EHPTFPPPPSQAEDIEHWTRAMFIDAT+
Subjt:  LLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G77890.1 DNA-directed RNA polymerase II protein8.5e-11452.33Show/hide
Query:  KFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDLKL
        K   CA+C  S +  IC  CVN  LN+Y   L SLK+ R+  Y RLS +LV K KA  Q  W+  +NEKLA+LREKLQ   E+L++ K      S +LK 
Subjt:  KFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDLKL

Query:  KYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNELAA
        +Y ++ES    LE+ RV QLE  Y D I    L ++ +TSERL+KQ++V+KQICKLFP  RV V+GQ  +G+  Q+DQICN  LP+ L+P SVPP ELAA
Subjt:  KYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNELAA

Query:  SLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEK--KPHLGSLESSSF
        SLGYMVQLLNL+V  L+ PALHN GFAGSCSRIW+RDSYWN+ PSS SN YPLF+P  ++ S   ++SW+ + ++NFGV S++S+   +     L+    
Subjt:  SLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEK--KPHLGSLESSSF

Query:  NYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFPSSM
        + SSASPHS+ET ++LQ+GIA LK+SVA +T Y Y SLSL+VPS  STFE FAKLLATLSS KEV+S  SL + SS   +H  + NKS WN++S   SS 
Subjt:  NYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFPSSM

Query:  LLDSVHTL-IMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHP
        LL+S HT     N+   N+P+   SY+   EF DV K+  SI  W+LVE+P
Subjt:  LLDSVHTL-IMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHP

AT1G77890.2 DNA-directed RNA polymerase II protein3.5e-10750.78Show/hide
Query:  KFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDLKL
        K   CA+C  S +  IC  CVN  LN+Y   L SLK+ R+  Y RLS +LV K KA  Q  W+  +NEKLA+LREKLQ   E+L++ K      S +LK 
Subjt:  KFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDLKL

Query:  KYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNELAA
        +Y ++ES    LE+ RV QLE  Y D I    L +           ++V+KQICKLFP  RV V+GQ  +G+  Q+DQICN  LP+ L+P SVPP ELAA
Subjt:  KYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNELAA

Query:  SLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEK--KPHLGSLESSSF
        SLGYMVQLLNL+V  L+ PALHN GFAGSCSRIW+RDSYWN+ PSS SN YPLF+P  ++ S   ++SW+ + ++NFGV S++S+   +     L+    
Subjt:  SLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEK--KPHLGSLESSSF

Query:  NYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFPSSM
        + SSASPHS+ET ++LQ+GIA LK+SVA +T Y Y SLSL+VPS  STFE FAKLLATLSS KEV+S  SL + SS   +H  + NKS WN++S   SS 
Subjt:  NYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFPSSM

Query:  LLDSVHTL-IMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHP
        LL+S HT     N+   N+P+   SY+   EF DV K+  SI  W+LVE+P
Subjt:  LLDSVHTL-IMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHP

AT1G77890.3 DNA-directed RNA polymerase II protein2.1e-11252.33Show/hide
Query:  KFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDLKL
        K   CA+C  S +  IC  CVN  LN+Y   L SLK+ R+  Y RLS +LV K KA  Q  W+  +NEKLA+LREKLQ   E+L++ K      S +LK 
Subjt:  KFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDLKL

Query:  KYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNELAA
        +Y ++ES    LE+ RV QLE  Y D I    L  + +TSERL+KQ++V+KQICKLFP  RV V+GQ  +G+  Q+DQICN  LP+ L+P SVPP ELAA
Subjt:  KYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNELAA

Query:  SLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEK--KPHLGSLESSSF
        SLGYMVQLLNL+V  L+ PALHN GFAGSCSRIW+RDSYWN+ PSS SN YPLF+P  ++ S   ++SW+ + ++NFGV S++S+   +     L+    
Subjt:  SLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEK--KPHLGSLESSSF

Query:  NYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFPSSM
        + SSASPHS+ET ++LQ+GIA LK+SVA +T Y Y SLSL+VPS  STFE FAKLLATLSS KEV+S  SL + SS   +H  + NKS WN++S   SS 
Subjt:  NYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFPSSM

Query:  LLDSVHTL-IMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHP
        LL+S HT     N+   N+P+   SY+   EF DV K+  SI  W+LVE+P
Subjt:  LLDSVHTL-IMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHP

AT4G08540.1 DNA-directed RNA polymerase II protein2.4e-18068.68Show/hide
Query:  MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY
        MT R   NCAIC+N+N+  ICT CVN+RL +YN+ LKSLK RRD L SR +++L +KGKADDQ NWR+ +NEK+++L++KL+ ++E + +GK +IE  S 
Subjt:  MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY

Query:  DLKLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN
        DLK+KY +L+SARS LEK RVEQ+EK +P+LI T++LGHMAI+SERLHKQSVVVKQICKLFP RRV  DG+   G+  Q+D ICN RLP  LDPHS+P  
Subjt:  DLKLKYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN

Query:  ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKK-PHLGSLES
        ELA SLGYMVQLLNL+V NLAAPALH+SGFAGSCSRIWQRDSYW+ R S+RSNEYPLFIPR+NYCSTS ENSW+DK+SSNFGVASMES++K P L S  S
Subjt:  ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKK-PHLGSLES

Query:  SSFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFP
        +SF YSSASPHSIE+H+DLQKGIALLKKSVAC+TAY YNSL L+VP EASTFEAFAKLLATLSSSKEVRSV+SLKM SSRS K  Q+LNKS WN  S   
Subjt:  SSFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFP

Query:  SSMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK
        SS LL+S H  + +N   +  P+S ASYL ATE S    ND  + GWDLVEHP +PPPPSQ+ED+EHWTRAMFIDA +K
Subjt:  SSMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGAATCGAAAGTTCTGCAACTGTGCTATCTGTGAGAATTCAAATCAAGCTTTCATTTGCACTATTTGCGTTAATTACAGATTGAATGACTACAACTCAACGTTAAA
ATCATTGAAAGCTCGGCGGGATTGGTTGTATTCGAGGCTGAGTGACGTGCTTGTGGCAAAGGGTAAGGCAGACGATCAATTAAACTGGAGAGTGACTCGGAATGAGAAAC
TTGCAAGGTTAAGGGAGAAACTCCAACGTAGTAGAGAGCAACTCGAGCGAGGGAAGGCTGAGATTGAGATGACGTCCTATGATCTCAAGTTGAAATATGCAATGCTTGAA
TCAGCCCGTTCAGTGTTGGAAAAACAGCGAGTTGAACAACTGGAAAAGTCCTATCCTGACCTTATTAGCACCAAGAATCTGGGACATATGGCAATTACCTCCGAACGCCT
TCACAAGCAGTCTGTGGTTGTTAAACAAATATGCAAATTGTTTCCACAACGGCGAGTGTTGGTTGATGGACAGAAAAATGAGGGAACTGGTGAGCAATTTGATCAAATCT
GTAATGTGCGCTTACCAAGAAGACTGGATCCCCACTCTGTTCCACCAAATGAACTTGCTGCTTCTTTGGGATACATGGTGCAACTTCTAAATCTTATTGTTCAAAATTTG
GCTGCTCCTGCACTTCACAACTCAGGTTTTGCGGGTTCTTGTTCACGCATATGGCAAAGGGACTCGTATTGGAATGCTCGTCCATCTTCTCGTAGCAATGAGTATCCACT
TTTTATACCACGCCAAAACTACTGTTCAACAAGTGGGGAAAATTCATGGTCTGATAAAAGCTCTAGTAACTTTGGTGTTGCTTCGATGGAATCAGAGAAGAAACCACATT
TAGGTTCACTAGAAAGTAGTAGCTTCAATTATTCTTCAGCTTCTCCGCATTCTATTGAAACACACAAGGATTTGCAGAAAGGGATTGCCCTCCTCAAGAAAAGTGTAGCA
TGCATCACTGCATACTTTTATAACTCTCTGTCTTTAGATGTTCCTTCTGAAGCTTCTACTTTTGAAGCATTTGCTAAATTATTGGCTACTCTTTCTTCATCCAAGGAAGT
GCGTTCTGTTTATTCCCTCAAAATGGATTCTTCCAGGTCCCCTAAGCACGTTCAGAAACTGAACAAATCTGCATGGAACGTGGATTCTGACTTTCCATCAAGCATGCTGC
TCGATAGCGTGCATACGTTAATAATGAAAAACAATTGTGAGAGTAACCTTCCAAGTTCTGCTGCGAGTTACCTTTATGCCACTGAATTTTCCGATGTTGGAAAGAATGAT
TGCAGCATAGAAGGATGGGATCTCGTGGAGCATCCAACTTTTCCTCCTCCACCTTCCCAAGCTGAAGATATTGAGCATTGGACTCGAGCAATGTTCATCGATGCCACCAG
AAAGTAA
mRNA sequenceShow/hide mRNA sequence
CACGAAAGCTTATAATTTAATCTAGGATTTGTAATAAACCTTTTAAACCATATTTACACGATACCAACTCCCAAAGTTACGGGACAAAACGTACGGGCTATAATTTAACC
TACTATTTATTTATTTGAAGAAAGAACGGCGATCCAGAGTTTTCTTGTTAATGCTAAATTCCAGTTTCCAGACGATCAGCTCCCGCGGTTCAAGCTTTCCCCCACCCTTC
TGGCCCAACTTCGAATTCAGAATTTGAAGTTATCATTCCCAAAGCTTCAGCATCATCATCTGATTCAATTCCTCCGAGTAGCATGATTAGAAATTGATCGATGACGAATC
GAAAGTTCTGCAACTGTGCTATCTGTGAGAATTCAAATCAAGCTTTCATTTGCACTATTTGCGTTAATTACAGATTGAATGACTACAACTCAACGTTAAAATCATTGAAA
GCTCGGCGGGATTGGTTGTATTCGAGGCTGAGTGACGTGCTTGTGGCAAAGGGTAAGGCAGACGATCAATTAAACTGGAGAGTGACTCGGAATGAGAAACTTGCAAGGTT
AAGGGAGAAACTCCAACGTAGTAGAGAGCAACTCGAGCGAGGGAAGGCTGAGATTGAGATGACGTCCTATGATCTCAAGTTGAAATATGCAATGCTTGAATCAGCCCGTT
CAGTGTTGGAAAAACAGCGAGTTGAACAACTGGAAAAGTCCTATCCTGACCTTATTAGCACCAAGAATCTGGGACATATGGCAATTACCTCCGAACGCCTTCACAAGCAG
TCTGTGGTTGTTAAACAAATATGCAAATTGTTTCCACAACGGCGAGTGTTGGTTGATGGACAGAAAAATGAGGGAACTGGTGAGCAATTTGATCAAATCTGTAATGTGCG
CTTACCAAGAAGACTGGATCCCCACTCTGTTCCACCAAATGAACTTGCTGCTTCTTTGGGATACATGGTGCAACTTCTAAATCTTATTGTTCAAAATTTGGCTGCTCCTG
CACTTCACAACTCAGGTTTTGCGGGTTCTTGTTCACGCATATGGCAAAGGGACTCGTATTGGAATGCTCGTCCATCTTCTCGTAGCAATGAGTATCCACTTTTTATACCA
CGCCAAAACTACTGTTCAACAAGTGGGGAAAATTCATGGTCTGATAAAAGCTCTAGTAACTTTGGTGTTGCTTCGATGGAATCAGAGAAGAAACCACATTTAGGTTCACT
AGAAAGTAGTAGCTTCAATTATTCTTCAGCTTCTCCGCATTCTATTGAAACACACAAGGATTTGCAGAAAGGGATTGCCCTCCTCAAGAAAAGTGTAGCATGCATCACTG
CATACTTTTATAACTCTCTGTCTTTAGATGTTCCTTCTGAAGCTTCTACTTTTGAAGCATTTGCTAAATTATTGGCTACTCTTTCTTCATCCAAGGAAGTGCGTTCTGTT
TATTCCCTCAAAATGGATTCTTCCAGGTCCCCTAAGCACGTTCAGAAACTGAACAAATCTGCATGGAACGTGGATTCTGACTTTCCATCAAGCATGCTGCTCGATAGCGT
GCATACGTTAATAATGAAAAACAATTGTGAGAGTAACCTTCCAAGTTCTGCTGCGAGTTACCTTTATGCCACTGAATTTTCCGATGTTGGAAAGAATGATTGCAGCATAG
AAGGATGGGATCTCGTGGAGCATCCAACTTTTCCTCCTCCACCTTCCCAAGCTGAAGATATTGAGCATTGGACTCGAGCAATGTTCATCGATGCCACCAGAAAGTAATTT
AATGGGTGCCATGGAGATAGCGCGAGAGAAGTTCTTGGAACAACTCAATGCATCTCTGGTGCATGATCGATCAGCTTTGGACCTGTAAATATGGAGACTTAAAAAGGGTA
CAGATCTGAACATATTTAGATTTAGACGAACTACTACTATTATTCCATCATCTTATATTCTGGATATATTTTAGATTTAGGAAAATTTCTATTGTTGTTCTATAATCTTA
GACAATAGCCATAGTATTCGGAAAATTCTTAAATGGGTGCACATGGAGGCAGGAGTTGGGTTGTGGGGATGTGATGTCTTTCAAATGTACATATAAATTGGCTAGAACTG
GTTGAATTGAACAAAGGAACTTCAATATAATATAAAACCATCTGCATACAAACGCTATGACTCTAAGATAGCGATGCATTGTTTAGTCAAATTGTCTTGTTGAGAGCTTT
TAGTCTCCAAAACGTACAATTGGAGGCATGTCTCATTAGTCCGGAAACATAATTAGAAAGTGCTTTCACAGTTAGCATATTTCTTGCCTCTTCATTACTCTTTTACTTCT
TTCTCTTCTTTGGTGGTGGGAGGGTTTCTTCGTCCTCTTCTCCTTCAAAGTTGTCTTCTTCTTCTGCATCTGCATCATCTTCATCAACGTCTTCATCGTCCTCATTATTG
TCTTCATCGTTATCGTCGCTCCCACCATCATCATTATCGTCGTCACCACCATCACCGCTGTTATCACTTCCACCATCGTCATTATCGTTGCTGTTGCCAGCATCTCCTCC
TCCATTAGCGCCTTCTTTAATGGGGGCATTTTTACCTTCCTCTTCTCCATTTTCAAGCACATCTTCATCTTCTCCGAGATCATTATCACTGTAGTTGTTTTCATTTTCAT
TTCCTTCTCCATCCACTGGTTTCCTCTGGCATTTCCTTTTCTCGTTGGCATTCGAATTCAGGAACTGTACGCAAAAGAAATTTGAATTTCACGGGGAAGAAAAAGCTACA
ATGAATTATCAAAAAAACCAAAAGGGGACCAAAATATAAAACCAGAAAGTGACCAAATAAAATGGAACTACGGAAAGTGGGGCGTTGTCCCCTCCCTGAATCTGCATGGG
ATTCTTTCCTGGTCCACTATGGTCTGGTTAGGACAAATTTGGCCACACAGAGCTTATAGATAAAAGAAAATTGCAGCGTATCACCCATCTTATAATACTAAATGATACTT
ATCTAGTTTCAAAATTAACTATAAACGAGAAACATATCATCTGTCACTCTT
Protein sequenceShow/hide protein sequence
MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDLKLKYAMLE
SARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNELAASLGYMVQLLNLIVQNL
AAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESSSFNYSSASPHSIETHKDLQKGIALLKKSVA
CITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSDFPSSMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKND
CSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK