; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS003535 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS003535
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionDNA-directed RNA polymerase II protein
Genome locationscaffold234:3352389..3358091
RNA-Seq ExpressionMS003535
SyntenyMS003535
Gene Ontology termsGO:0035493 - SNARE complex assembly (biological process)
GO:0000323 - lytic vacuole (cellular component)
GO:0005768 - endosome (cellular component)
GO:0000149 - SNARE binding (molecular function)
InterPro domainsIPR018791 - UV radiation resistance protein/autophagy-related protein 14


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7015913.1 hypothetical protein SDJN02_21017 [Cucurbita argyrosperma subsp. argyrosperma]2.7e-23187.58Show/hide
Query:  NRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDL
        NRKFCNCAICENSNQA IC  CVN+RLNDYNSTLKSL+ RRD LYSRLSDVLVAKGKADDQLNWR+TRNEKL+RLREKL+R REQLE+GKAEIEMTSYDL
Subjt:  NRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDL

Query:  KLTYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNEL
        KL +AMLESARSVLEKQRVEQLEK+YPDLISTK LGHMAITSERLHKQSVVVKQICKLFPQRRVLV G+  EG GEQFDQICNV LPRRLDPHSV P+EL
Subjt:  KLTYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNEL

Query:  AASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESSSF
        +ASLGYMVQLLNLIV NLAAPALHNSGFAGSCSRIWQR+SYW+A PSS+SNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVAS+ESE+KPHL SLE+ SF
Subjt:  AASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESSSF

Query:  NYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSEFPSSM
        NYSSAS HSIETHKDLQ GIALLKKSVACITAY YNSL LDVPSEASTFEAFAKLLATLSSSKEVRSV+SLKM SSRSPKHVQKLNKSAWNV+S   SSM
Subjt:  NYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSEFPSSM

Query:  LLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATR
        LL+S H+ IMK N ESN PSSA+SYLYATEFSD  KND +IEGWDL+EHPTFPPPPSQAEDIEHWTRAMFIDAT+
Subjt:  LLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATR

XP_008441240.1 PREDICTED: uncharacterized protein LOC103485428 isoform X2 [Cucumis melo]1.2e-22985.98Show/hide
Query:  MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY
        M NRKFCNCAICENSNQA ICT CVN RLNDYN++LKSL+ARRD LYSRLSDVLVAKGKADDQLNWRVTRNEKL RLREKL+RSREQLE+GKAEIEM S+
Subjt:  MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY

Query:  DLKLTYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN
        DL+L YAMLESARSVLEKQRVEQLEK+YPDLISTKNLGHMAITSERLHKQSVV+KQ+CKLFPQRRVLV G K  G GE FDQICNV LPR LDPHSV P 
Subjt:  DLKLTYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN

Query:  ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESS
        EL+ASLGYMVQLLNL+VQ LAAPALHNSGFAGSCSRIWQRDSYWNA PSSRSNEYP+F+PRQ+YCSTSGENSWSDKSSSNFGVAS+ESE+KPHL SLE+ 
Subjt:  ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESS

Query:  SFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSEFPS
        SFNYSSASPHSIE+HKDLQKGIALLKKSVAC+TAY YNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSV+SLKM+SSRS KH+QK  KS WNV+S   S
Subjt:  SFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSEFPS

Query:  SMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK
        SML +S H+ IMK N ESNLPSSA+SYLYATEFSD GKND +IEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDAT++
Subjt:  SMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK

XP_022152465.1 uncharacterized protein LOC111020186 isoform X1 [Momordica charantia]6.0e-27199.58Show/hide
Query:  MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY
        MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY
Subjt:  MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY

Query:  DLKLTYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN
        DLKL YAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN
Subjt:  DLKLTYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN

Query:  ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESS
        ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESS
Subjt:  ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESS

Query:  SFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSEFPS
        SFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDS+FPS
Subjt:  SFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSEFPS

Query:  SMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK
        SMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK
Subjt:  SMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK

XP_022939132.1 uncharacterized protein LOC111445131 [Cucurbita moschata]1.2e-23187.79Show/hide
Query:  NRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDL
        NRKFCNCAICENSNQA IC  CVN+RLNDYNSTLKSL+ RRD LYSRLSDVLVAKGKADDQLNWRVTRNEKL+RLREKL+R REQLE+GKAEIEMTSYDL
Subjt:  NRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDL

Query:  KLTYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNEL
        KL +AMLESARSVLEKQRVEQLEK+YPDLISTK LGHMAITSERLHKQSVVVKQICKLFPQRRVLV G+  EG GEQFDQICNV LPRRLDPHSV P+EL
Subjt:  KLTYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNEL

Query:  AASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESSSF
        +ASLGYMVQLLNLIV NLAAPALHNSGFAGSCSRIWQR+SYW+A PSS+SNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVAS+ESE+KPHL SLE+ SF
Subjt:  AASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESSSF

Query:  NYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSEFPSSM
        NYSSAS HSIETHKDLQ GIALLKKSVACITAY YNSL LDVPSEASTFEAFAKLLATLSSSKEVRSV+SLKM SSRSPKHVQKLNKSAWNV+S   SSM
Subjt:  NYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSEFPSSM

Query:  LLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATR
        LL+S H+ IMK N ESN PSSA+SYLYATEFSD  KND +IEGWDL+EHPTFPPPPSQAEDIEHWTRAMFIDAT+
Subjt:  LLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATR

XP_023549965.1 uncharacterized protein LOC111808299 [Cucurbita pepo subsp. pepo]3.2e-23288Show/hide
Query:  NRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDL
        NRKFCNCAICENSNQA IC  CVN+RLNDYNSTLKSL+ARRD LYSRLSDVLVAKGKADDQLNWRVTRNEKL+RLREKL+R REQLE+GKAEIEMTSYDL
Subjt:  NRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDL

Query:  KLTYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNEL
        KL +AMLESARSVLEKQRVEQLEK+YPDLISTK LGHMAITSERLHKQSVVVKQICKLFPQRRVLV G+  EG GEQFDQICNV LPRRLDPHSV P+EL
Subjt:  KLTYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNEL

Query:  AASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESSSF
        +ASLGYMVQLLNLIV NLAAPALHNSGFAGSCSRIWQR+SYW+A PSS+SNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVAS+ESE+KPHL SLE+ SF
Subjt:  AASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESSSF

Query:  NYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSEFPSSM
        NYSSAS HSIETHKDLQ GIALLKKSVACITAY YNSL LDVPSEASTFEAFAKLLATLSSSKEVRSV+SLKM SSRSPKHVQKLNKSAWNV+S   SSM
Subjt:  NYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSEFPSSM

Query:  LLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATR
        LL+S H+ IMK N ESN PSSA+SYLYATEFSD  KND +IEGWDL+EHPTFPPPPSQAEDIEHWTRAMFIDAT+
Subjt:  LLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATR

TrEMBL top hitse value%identityAlignment
A0A1S3B2Y9 uncharacterized protein LOC103485428 isoform X15.6e-23085.98Show/hide
Query:  MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY
        M NRKFCNCAICENSNQA ICT CVN RLNDYN++LKSL+ARRD LYSRLSDVLVAKGKADDQLNWRVTRNEKL RLREKL+RSREQLE+GKAEIEM S+
Subjt:  MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY

Query:  DLKLTYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN
        DL+L YAMLESARSVLEKQRVEQLEK+YPDLISTKNLGHMAITSERLHKQSVV+KQ+CKLFPQRRVLV G K  G GE FDQICNV LPR LDPHSV P 
Subjt:  DLKLTYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN

Query:  ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESS
        EL+ASLGYMVQLLNL+VQ LAAPALHNSGFAGSCSRIWQRDSYWNA PSSRSNEYP+F+PRQ+YCSTSGENSWSDKSSSNFGVAS+ESE+KPHL SLE+ 
Subjt:  ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESS

Query:  SFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSEFPS
        SFNYSSASPHSIE+HKDLQKGIALLKKSVAC+TAY YNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSV+SLKM+SSRS KH+QK  KS WNV+S   S
Subjt:  SFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSEFPS

Query:  SMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK
        SML +S H+ IMK N ESNLPSSA+SYLYATEFSD GKND +IEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDAT++
Subjt:  SMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK

A0A1S3B3M2 uncharacterized protein LOC103485428 isoform X25.6e-23085.98Show/hide
Query:  MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY
        M NRKFCNCAICENSNQA ICT CVN RLNDYN++LKSL+ARRD LYSRLSDVLVAKGKADDQLNWRVTRNEKL RLREKL+RSREQLE+GKAEIEM S+
Subjt:  MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY

Query:  DLKLTYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN
        DL+L YAMLESARSVLEKQRVEQLEK+YPDLISTKNLGHMAITSERLHKQSVV+KQ+CKLFPQRRVLV G K  G GE FDQICNV LPR LDPHSV P 
Subjt:  DLKLTYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN

Query:  ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESS
        EL+ASLGYMVQLLNL+VQ LAAPALHNSGFAGSCSRIWQRDSYWNA PSSRSNEYP+F+PRQ+YCSTSGENSWSDKSSSNFGVAS+ESE+KPHL SLE+ 
Subjt:  ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESS

Query:  SFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSEFPS
        SFNYSSASPHSIE+HKDLQKGIALLKKSVAC+TAY YNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSV+SLKM+SSRS KH+QK  KS WNV+S   S
Subjt:  SFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSEFPS

Query:  SMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK
        SML +S H+ IMK N ESNLPSSA+SYLYATEFSD GKND +IEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDAT++
Subjt:  SMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK

A0A5A7T0W2 UV radiation resistance protein/autophagy-related protein 145.6e-23085.98Show/hide
Query:  MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY
        M NRKFCNCAICENSNQA ICT CVN RLNDYN++LKSL+ARRD LYSRLSDVLVAKGKADDQLNWRVTRNEKL RLREKL+RSREQLE+GKAEIEM S+
Subjt:  MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY

Query:  DLKLTYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN
        DL+L YAMLESARSVLEKQRVEQLEK+YPDLISTKNLGHMAITSERLHKQSVV+KQ+CKLFPQRRVLV G K  G GE FDQICNV LPR LDPHSV P 
Subjt:  DLKLTYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN

Query:  ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESS
        EL+ASLGYMVQLLNL+VQ LAAPALHNSGFAGSCSRIWQRDSYWNA PSSRSNEYP+F+PRQ+YCSTSGENSWSDKSSSNFGVAS+ESE+KPHL SLE+ 
Subjt:  ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESS

Query:  SFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSEFPS
        SFNYSSASPHSIE+HKDLQKGIALLKKSVAC+TAY YNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSV+SLKM+SSRS KH+QK  KS WNV+S   S
Subjt:  SFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSEFPS

Query:  SMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK
        SML +S H+ IMK N ESNLPSSA+SYLYATEFSD GKND +IEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDAT++
Subjt:  SMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK

A0A6J1DE02 uncharacterized protein LOC111020186 isoform X12.9e-27199.58Show/hide
Query:  MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY
        MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY
Subjt:  MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY

Query:  DLKLTYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN
        DLKL YAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN
Subjt:  DLKLTYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN

Query:  ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESS
        ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESS
Subjt:  ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESS

Query:  SFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSEFPS
        SFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDS+FPS
Subjt:  SFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSEFPS

Query:  SMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK
        SMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK
Subjt:  SMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK

A0A6J1FFY6 uncharacterized protein LOC1114451316.0e-23287.79Show/hide
Query:  NRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDL
        NRKFCNCAICENSNQA IC  CVN+RLNDYNSTLKSL+ RRD LYSRLSDVLVAKGKADDQLNWRVTRNEKL+RLREKL+R REQLE+GKAEIEMTSYDL
Subjt:  NRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDL

Query:  KLTYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNEL
        KL +AMLESARSVLEKQRVEQLEK+YPDLISTK LGHMAITSERLHKQSVVVKQICKLFPQRRVLV G+  EG GEQFDQICNV LPRRLDPHSV P+EL
Subjt:  KLTYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNEL

Query:  AASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESSSF
        +ASLGYMVQLLNLIV NLAAPALHNSGFAGSCSRIWQR+SYW+A PSS+SNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVAS+ESE+KPHL SLE+ SF
Subjt:  AASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESSSF

Query:  NYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSEFPSSM
        NYSSAS HSIETHKDLQ GIALLKKSVACITAY YNSL LDVPSEASTFEAFAKLLATLSSSKEVRSV+SLKM SSRSPKHVQKLNKSAWNV+S   SSM
Subjt:  NYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSEFPSSM

Query:  LLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATR
        LL+S H+ IMK N ESN PSSA+SYLYATEFSD  KND +IEGWDL+EHPTFPPPPSQAEDIEHWTRAMFIDAT+
Subjt:  LLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G77890.1 DNA-directed RNA polymerase II protein1.1e-11352.33Show/hide
Query:  KFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDLKL
        K   CA+C  S +  IC  CVN  LN+Y   L SLK+ R+  Y RLS +LV K KA  Q  W+  +NEKLA+LREKLQ   E+L++ K      S +LK 
Subjt:  KFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDLKL

Query:  TYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNELAA
         Y ++ES    LE+ RV QLE  Y D I    L ++ +TSERL+KQ++V+KQICKLFP  RV V+GQ  +G+  Q+DQICN  LP+ L+P SVPP ELAA
Subjt:  TYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNELAA

Query:  SLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEK--KPHLGSLESSSF
        SLGYMVQLLNL+V  L+ PALHN GFAGSCSRIW+RDSYWN+ PSS SN YPLF+P  ++ S   ++SW+ + ++NFGV S++S+   +     L+    
Subjt:  SLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEK--KPHLGSLESSSF

Query:  NYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSEFPSSM
        + SSASPHS+ET ++LQ+GIA LK+SVA +T Y Y SLSL+VPS  STFE FAKLLATLSS KEV+S  SL + SS   +H  + NKS WN++S   SS 
Subjt:  NYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSEFPSSM

Query:  LLDSVHTL-IMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHP
        LL+S HT     N+   N+P+   SY+   EF DV K+  SI  W+LVE+P
Subjt:  LLDSVHTL-IMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHP

AT1G77890.2 DNA-directed RNA polymerase II protein5.9e-10750.78Show/hide
Query:  KFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDLKL
        K   CA+C  S +  IC  CVN  LN+Y   L SLK+ R+  Y RLS +LV K KA  Q  W+  +NEKLA+LREKLQ   E+L++ K      S +LK 
Subjt:  KFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDLKL

Query:  TYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNELAA
         Y ++ES    LE+ RV QLE  Y D I    L +           ++V+KQICKLFP  RV V+GQ  +G+  Q+DQICN  LP+ L+P SVPP ELAA
Subjt:  TYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNELAA

Query:  SLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEK--KPHLGSLESSSF
        SLGYMVQLLNL+V  L+ PALHN GFAGSCSRIW+RDSYWN+ PSS SN YPLF+P  ++ S   ++SW+ + ++NFGV S++S+   +     L+    
Subjt:  SLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEK--KPHLGSLESSSF

Query:  NYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSEFPSSM
        + SSASPHS+ET ++LQ+GIA LK+SVA +T Y Y SLSL+VPS  STFE FAKLLATLSS KEV+S  SL + SS   +H  + NKS WN++S   SS 
Subjt:  NYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSEFPSSM

Query:  LLDSVHTL-IMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHP
        LL+S HT     N+   N+P+   SY+   EF DV K+  SI  W+LVE+P
Subjt:  LLDSVHTL-IMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHP

AT1G77890.3 DNA-directed RNA polymerase II protein3.6e-11252.33Show/hide
Query:  KFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDLKL
        K   CA+C  S +  IC  CVN  LN+Y   L SLK+ R+  Y RLS +LV K KA  Q  W+  +NEKLA+LREKLQ   E+L++ K      S +LK 
Subjt:  KFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDLKL

Query:  TYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNELAA
         Y ++ES    LE+ RV QLE  Y D I    L  + +TSERL+KQ++V+KQICKLFP  RV V+GQ  +G+  Q+DQICN  LP+ L+P SVPP ELAA
Subjt:  TYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNELAA

Query:  SLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEK--KPHLGSLESSSF
        SLGYMVQLLNL+V  L+ PALHN GFAGSCSRIW+RDSYWN+ PSS SN YPLF+P  ++ S   ++SW+ + ++NFGV S++S+   +     L+    
Subjt:  SLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEK--KPHLGSLESSSF

Query:  NYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSEFPSSM
        + SSASPHS+ET ++LQ+GIA LK+SVA +T Y Y SLSL+VPS  STFE FAKLLATLSS KEV+S  SL + SS   +H  + NKS WN++S   SS 
Subjt:  NYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSEFPSSM

Query:  LLDSVHTL-IMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHP
        LL+S HT     N+   N+P+   SY+   EF DV K+  SI  W+LVE+P
Subjt:  LLDSVHTL-IMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHP

AT4G08540.1 DNA-directed RNA polymerase II protein5.2e-18068.48Show/hide
Query:  MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY
        MT R   NCAIC+N+N+  ICT CVN+RL +YN+ LKSLK RRD L SR +++L +KGKADDQ NWR+ +NEK+++L++KL+ ++E + +GK +IE  S 
Subjt:  MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSY

Query:  DLKLTYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN
        DLK+ Y +L+SARS LEK RVEQ+EK +P+LI T++LGHMAI+SERLHKQSVVVKQICKLFP RRV  DG+   G+  Q+D ICN RLP  LDPHS+P  
Subjt:  DLKLTYAMLESARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPN

Query:  ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKK-PHLGSLES
        ELA SLGYMVQLLNL+V NLAAPALH+SGFAGSCSRIWQRDSYW+ R S+RSNEYPLFIPR+NYCSTS ENSW+DK+SSNFGVASMES++K P L S  S
Subjt:  ELAASLGYMVQLLNLIVQNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKK-PHLGSLES

Query:  SSFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSEFP
        +SF YSSASPHSIE+H+DLQKGIALLKKSVAC+TAY YNSL L+VP EASTFEAFAKLLATLSSSKEVRSV+SLKM SSRS K  Q+LNKS WN  S   
Subjt:  SSFNYSSASPHSIETHKDLQKGIALLKKSVACITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSEFP

Query:  SSMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK
        SS LL+S H  + +N   +  P+S ASYL ATE S    ND  + GWDLVEHP +PPPPSQ+ED+EHWTRAMFIDA +K
Subjt:  SSMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKNDCSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGAATCGAAAGTTCTGCAACTGTGCTATCTGTGAGAATTCAAATCAAGCCTTCATTTGCACTATTTGCGTTAATTACAGATTGAATGACTACAACTCAACGTTAAA
ATCATTGAAAGCTCGGCGGGATTGGTTGTATTCGAGGCTGAGTGACGTGCTTGTGGCAAAGGGTAAGGCAGACGATCAATTAAACTGGAGAGTGACTCGGAATGAGAAAC
TTGCAAGGTTAAGGGAGAAACTCCAACGCAGTAGAGAGCAACTCGAGCGAGGGAAGGCTGAGATTGAGATGACGTCCTATGATCTCAAGTTGACATATGCAATGCTTGAA
TCAGCCCGTTCAGTGTTGGAAAAACAGCGAGTTGAACAACTGGAGAAGTCCTATCCTGACCTTATTAGCACCAAGAATCTGGGACATATGGCAATTACCTCCGAACGCCT
TCACAAGCAGTCTGTGGTTGTTAAACAAATATGCAAATTGTTTCCACAACGGCGAGTGTTGGTTGATGGACAGAAAAATGAGGGAACTGGTGAGCAATTTGATCAAATCT
GTAATGTGCGCTTACCAAGAAGACTGGATCCCCACTCTGTTCCACCAAATGAACTTGCTGCTTCTTTGGGATACATGGTGCAACTTCTAAATCTTATTGTTCAAAATTTG
GCTGCTCCTGCACTTCACAACTCAGGTTTTGCGGGTTCTTGTTCACGCATATGGCAAAGGGACTCATATTGGAATGCTCGTCCATCTTCTCGTAGCAATGAGTATCCACT
TTTTATACCACGCCAAAACTACTGTTCAACAAGTGGGGAAAATTCATGGTCTGATAAAAGCTCTAGTAACTTTGGTGTTGCTTCGATGGAATCAGAGAAGAAACCACATT
TAGGTTCACTAGAAAGTAGTAGCTTCAATTATTCTTCAGCTTCTCCACATTCTATTGAAACACACAAGGATTTGCAGAAAGGGATTGCCCTCCTCAAGAAAAGTGTAGCA
TGCATCACTGCATACTTTTATAACTCTCTGTCTTTAGATGTTCCTTCTGAAGCTTCTACTTTTGAAGCATTTGCTAAATTATTGGCTACTCTTTCTTCATCCAAGGAAGT
GCGTTCTGTTTATTCCCTCAAAATGGATTCTTCCAGGTCCCCTAAGCACGTTCAGAAACTGAACAAATCTGCATGGAACGTGGATTCTGAGTTTCCATCAAGCATGCTGC
TCGATAGCGTGCATACGTTAATAATGAAAAACAATTGTGAGAGTAACCTTCCAAGTTCTGCTGCGAGTTACCTTTATGCCACTGAATTTTCCGATGTTGGAAAGAATGAT
TGCAGCATAGAAGGATGGGATCTCGTGGAGCATCCAACTTTTCCTCCTCCACCTTCCCAAGCTGAAGATATTGAGCATTGGACTCGAGCAATGTTCATCGATGCCACCAG
AAAG
mRNA sequenceShow/hide mRNA sequence
ATGACGAATCGAAAGTTCTGCAACTGTGCTATCTGTGAGAATTCAAATCAAGCCTTCATTTGCACTATTTGCGTTAATTACAGATTGAATGACTACAACTCAACGTTAAA
ATCATTGAAAGCTCGGCGGGATTGGTTGTATTCGAGGCTGAGTGACGTGCTTGTGGCAAAGGGTAAGGCAGACGATCAATTAAACTGGAGAGTGACTCGGAATGAGAAAC
TTGCAAGGTTAAGGGAGAAACTCCAACGCAGTAGAGAGCAACTCGAGCGAGGGAAGGCTGAGATTGAGATGACGTCCTATGATCTCAAGTTGACATATGCAATGCTTGAA
TCAGCCCGTTCAGTGTTGGAAAAACAGCGAGTTGAACAACTGGAGAAGTCCTATCCTGACCTTATTAGCACCAAGAATCTGGGACATATGGCAATTACCTCCGAACGCCT
TCACAAGCAGTCTGTGGTTGTTAAACAAATATGCAAATTGTTTCCACAACGGCGAGTGTTGGTTGATGGACAGAAAAATGAGGGAACTGGTGAGCAATTTGATCAAATCT
GTAATGTGCGCTTACCAAGAAGACTGGATCCCCACTCTGTTCCACCAAATGAACTTGCTGCTTCTTTGGGATACATGGTGCAACTTCTAAATCTTATTGTTCAAAATTTG
GCTGCTCCTGCACTTCACAACTCAGGTTTTGCGGGTTCTTGTTCACGCATATGGCAAAGGGACTCATATTGGAATGCTCGTCCATCTTCTCGTAGCAATGAGTATCCACT
TTTTATACCACGCCAAAACTACTGTTCAACAAGTGGGGAAAATTCATGGTCTGATAAAAGCTCTAGTAACTTTGGTGTTGCTTCGATGGAATCAGAGAAGAAACCACATT
TAGGTTCACTAGAAAGTAGTAGCTTCAATTATTCTTCAGCTTCTCCACATTCTATTGAAACACACAAGGATTTGCAGAAAGGGATTGCCCTCCTCAAGAAAAGTGTAGCA
TGCATCACTGCATACTTTTATAACTCTCTGTCTTTAGATGTTCCTTCTGAAGCTTCTACTTTTGAAGCATTTGCTAAATTATTGGCTACTCTTTCTTCATCCAAGGAAGT
GCGTTCTGTTTATTCCCTCAAAATGGATTCTTCCAGGTCCCCTAAGCACGTTCAGAAACTGAACAAATCTGCATGGAACGTGGATTCTGAGTTTCCATCAAGCATGCTGC
TCGATAGCGTGCATACGTTAATAATGAAAAACAATTGTGAGAGTAACCTTCCAAGTTCTGCTGCGAGTTACCTTTATGCCACTGAATTTTCCGATGTTGGAAAGAATGAT
TGCAGCATAGAAGGATGGGATCTCGTGGAGCATCCAACTTTTCCTCCTCCACCTTCCCAAGCTGAAGATATTGAGCATTGGACTCGAGCAATGTTCATCGATGCCACCAG
AAAG
Protein sequenceShow/hide protein sequence
MTNRKFCNCAICENSNQAFICTICVNYRLNDYNSTLKSLKARRDWLYSRLSDVLVAKGKADDQLNWRVTRNEKLARLREKLQRSREQLERGKAEIEMTSYDLKLTYAMLE
SARSVLEKQRVEQLEKSYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVDGQKNEGTGEQFDQICNVRLPRRLDPHSVPPNELAASLGYMVQLLNLIVQNL
AAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASMESEKKPHLGSLESSSFNYSSASPHSIETHKDLQKGIALLKKSVA
CITAYFYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVYSLKMDSSRSPKHVQKLNKSAWNVDSEFPSSMLLDSVHTLIMKNNCESNLPSSAASYLYATEFSDVGKND
CSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMFIDATRK