; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh04G030030 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh04G030030
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionDNA-directed RNA polymerase II protein
Genome locationCma_Chr04:19636909..19643278
RNA-Seq ExpressionCmaCh04G030030
SyntenyCmaCh04G030030
Gene Ontology termsGO:0035493 - SNARE complex assembly (biological process)
GO:0000323 - lytic vacuole (cellular component)
GO:0005768 - endosome (cellular component)
GO:0000149 - SNARE binding (molecular function)
InterPro domainsIPR018791 - UV radiation resistance protein/autophagy-related protein 14


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602709.1 hypothetical protein SDJN03_07942, partial [Cucurbita argyrosperma subsp. sororia]2.5e-26198.31Show/hide
Query:  MNRKFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
        MNRKFCNCAICENSSQASIC VCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
Subjt:  MNRKFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
        LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQR+VLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE

Query:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLSSLESSS
        LSVSLGYMVQLLNLVVQYLAAP+LHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERK HLSSLESSS
Subjt:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLSSLESSS

Query:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSM
        FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLS DVPSEASTFEAFAKLL TLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISS+M
Subjt:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSM

Query:  LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDLIEHPTLPPPPSQSENIEHWTRAMITDATKQ
        LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWD IEHPTLPPPPSQSENIEHWTRAMITDATKQ
Subjt:  LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDLIEHPTLPPPPSQSENIEHWTRAMITDATKQ

KAG7033396.1 hypothetical protein SDJN02_07452, partial [Cucurbita argyrosperma subsp. argyrosperma]3.0e-26298.73Show/hide
Query:  MNRKFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
        MNRKFCNCAICENSSQASIC VCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
Subjt:  MNRKFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
        LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQR+VLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE

Query:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLSSLESSS
        LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHL SLESSS
Subjt:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLSSLESSS

Query:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSM
        FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLS DVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISS+M
Subjt:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSM

Query:  LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDLIEHPTLPPPPSQSENIEHWTRAMITDATKQ
        LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWD IEHPTLPPPPSQSENIEHWTRAMITDATKQ
Subjt:  LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDLIEHPTLPPPPSQSENIEHWTRAMITDATKQ

XP_022962750.1 uncharacterized protein LOC111463144 isoform X1 [Cucurbita moschata]2.3e-26298.73Show/hide
Query:  MNRKFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
        MNRKFCNCAICENSSQASIC VCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
Subjt:  MNRKFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
        LKLKY MLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQR+VLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE

Query:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLSSLESSS
        LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLSSLESSS
Subjt:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLSSLESSS

Query:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSM
        FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLS DVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISS+M
Subjt:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSM

Query:  LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDLIEHPTLPPPPSQSENIEHWTRAMITDATKQ
        LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWD IEHPTLPPPPSQSENIEHWTRAMITDATKQ
Subjt:  LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDLIEHPTLPPPPSQSENIEHWTRAMITDATKQ

XP_022990571.1 uncharacterized protein LOC111487411 isoform X1 [Cucurbita maxima]2.2e-265100Show/hide
Query:  MNRKFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
        MNRKFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
Subjt:  MNRKFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
        LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE

Query:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLSSLESSS
        LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLSSLESSS
Subjt:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLSSLESSS

Query:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSM
        FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSM
Subjt:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSM

Query:  LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDLIEHPTLPPPPSQSENIEHWTRAMITDATKQ
        LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDLIEHPTLPPPPSQSENIEHWTRAMITDATKQ
Subjt:  LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDLIEHPTLPPPPSQSENIEHWTRAMITDATKQ

XP_023518763.1 uncharacterized protein LOC111782150 isoform X1 [Cucurbita pepo subsp. pepo]9.2e-26499.36Show/hide
Query:  MNRKFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
        MNRKFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
Subjt:  MNRKFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
        LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQR+VLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE

Query:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLSSLESSS
        LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLSSLESSS
Subjt:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLSSLESSS

Query:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSM
        FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLS DVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSM
Subjt:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSM

Query:  LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDLIEHPTLPPPPSQSENIEHWTRAMITDATKQ
        LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWD IEHPTLPPPPSQSENIEHWTRAMITDATKQ
Subjt:  LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDLIEHPTLPPPPSQSENIEHWTRAMITDATKQ

TrEMBL top hitse value%identityAlignment
A0A5A7T0W2 UV radiation resistance protein/autophagy-related protein 146.5e-22384.87Show/hide
Query:  MNRKFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
        MNRKFCNCAICENS+QASICT CVN RLNDYN++LK L+ARRD+LYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSRE+LEQGK EIE+ S+D
Subjt:  MNRKFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
        L+LKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQS+V+KQ+CKLFPQR+VLV G+ + GP E FDQIC VSLPR LDPHSVEP+E
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE

Query:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLSSLESSS
        LS SLGYMVQLLNLVVQYLAAPALH+SGFAGSCSRIWQRDSYW A PSSRSNEYP+F+PRQ+Y STSGENSWSDK SS+FGVASLESERK HLSSLE+ S
Subjt:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLSSLESSS

Query:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSM
        F+Y SA PHSIE+HK+LQKGIALLKKSVACVT+Y YNSLS DVPSEASTFEAFAKLLATLSSSKEVRSVFSLKM SSR  KH+QK  KS W VNSISSSM
Subjt:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSM

Query:  LFESAHSQIMKTNCESNLP----SYLYATEFSDAGKNDCTIEEWDLIEHPTLPPPPSQSENIEHWTRAMITDATKQ
        LFES HSQIMKTN ESNLP    SYLYATEFSDAGKND TIE WDL+EHPT PPPPSQ+E+IEHWTRAM  DATK+
Subjt:  LFESAHSQIMKTNCESNLP----SYLYATEFSDAGKNDCTIEEWDLIEHPTLPPPPSQSENIEHWTRAMITDATKQ

A0A6J1FFY6 uncharacterized protein LOC1114451314.1e-22586.32Show/hide
Query:  MNRKFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
        MNRKFCNCAICENS+QASIC  CVNHRLNDYNS LK L+ RRD LYSRLSDVLVAKGKADDQLNWRVTRNEKL+RLREKLRR RE+LEQGK EIE+TSYD
Subjt:  MNRKFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
        LKLK+AMLESARSVLEKQRVEQLEKAYPDLISTK LGHMAITSERLHKQS+VVKQ+CKLFPQR+VLVHGENKEGP EQFDQIC VSLPR LDPHSV+PHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE

Query:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLSSLESSS
        LS SLGYMVQLLNL+V  LAAPALH+SGFAGSCSRIWQR+SYW A PSS+SNEYPLFIPRQ Y STSGENSWSDK SS+FGVASLESERK HLSSLE+ S
Subjt:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLSSLESSS

Query:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSM
        F+Y SA  HSIETHK+LQ GIALLKKSVAC+T+YCYNSL  DVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSR PKHVQKLNKSAW VNSISSSM
Subjt:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSM

Query:  LFESAHSQIMKTNCESNLP----SYLYATEFSDAGKNDCTIEEWDLIEHPTLPPPPSQSENIEHWTRAMITDATK
        L ESAHSQIMKTN ESN P    SYLYATEFSDA KND TIE WDLIEHPT PPPPSQ+E+IEHWTRAM  DATK
Subjt:  LFESAHSQIMKTNCESNLP----SYLYATEFSDAGKNDCTIEEWDLIEHPTLPPPPSQSENIEHWTRAMITDATK

A0A6J1HFZ3 uncharacterized protein LOC111463144 isoform X11.1e-26298.73Show/hide
Query:  MNRKFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
        MNRKFCNCAICENSSQASIC VCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
Subjt:  MNRKFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
        LKLKY MLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQR+VLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE

Query:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLSSLESSS
        LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLSSLESSS
Subjt:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLSSLESSS

Query:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSM
        FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLS DVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISS+M
Subjt:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSM

Query:  LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDLIEHPTLPPPPSQSENIEHWTRAMITDATKQ
        LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWD IEHPTLPPPPSQSENIEHWTRAMITDATKQ
Subjt:  LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDLIEHPTLPPPPSQSENIEHWTRAMITDATKQ

A0A6J1JNB3 uncharacterized protein LOC111487411 isoform X11.1e-265100Show/hide
Query:  MNRKFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
        MNRKFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
Subjt:  MNRKFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
        LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE

Query:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLSSLESSS
        LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLSSLESSS
Subjt:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLSSLESSS

Query:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSM
        FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSM
Subjt:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSM

Query:  LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDLIEHPTLPPPPSQSENIEHWTRAMITDATKQ
        LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDLIEHPTLPPPPSQSENIEHWTRAMITDATKQ
Subjt:  LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDLIEHPTLPPPPSQSENIEHWTRAMITDATKQ

A0A6J1K104 uncharacterized protein LOC1114891762.0e-22486.11Show/hide
Query:  MNRKFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
        MNR+FCNCAICENS+QASIC  CVNHRLNDYNS LK L+ARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKL+ LREKLRR RE+LEQGKTEIE+TSYD
Subjt:  MNRKFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
        LKLK+AMLESARSVLEKQRVEQLEKAYPDLISTK LGHMAITSERLHKQS+VVKQ+CKLFPQR+VLVHGENKEGP EQFDQIC VSLPR LDPHSV+PHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE

Query:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLSSLESSS
        LS SLGYMVQLLNL+V  LAAPALH+SGFAGSCSRIWQR+SYW A PSS+ NEYPLFIPRQ Y STSGENSWSDK SS+FGVASLESERK HLSSLE+ S
Subjt:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLSSLESSS

Query:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSM
        F+Y SA  HSIETHK+LQ GIALLKKSVAC+T+YCYNSL  DVPSEASTFEAFAKLLA LSSSKEVRSVFSLKMASSR PKHVQKLNKSAW VNSISSSM
Subjt:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSM

Query:  LFESAHSQIMKTNCESNLP----SYLYATEFSDAGKNDCTIEEWDLIEHPTLPPPPSQSENIEHWTRAMITDATK
        L ESAHSQIMKTN ESN P    SYLYATEFSDA KND TIE WDLIEHPT PPPPSQ+E+IEHWTRAM  DATK
Subjt:  LFESAHSQIMKTNCESNLP----SYLYATEFSDAGKNDCTIEEWDLIEHPTLPPPPSQSENIEHWTRAMITDATK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G77890.1 DNA-directed RNA polymerase II protein1.6e-10951.12Show/hide
Query:  KFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYDLKL
        K   CA+C  S + SIC  CVN  LN+Y   L  LK+ R++ Y RLS +LV K KA  Q  W+  +NEKL +LREKL+   EKL+Q K      S +LK 
Subjt:  KFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYDLKL

Query:  KYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHELSV
        +Y ++ES    LE+ RV QLE  Y D I    L ++ +TSERL+KQ+LV+KQ+CKLFP  +V V G+NK+G S Q+DQIC   LP+GL+P SV P EL+ 
Subjt:  KYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHELSV

Query:  SLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESER--KSHLSSLESSSF
        SLGYMVQLLNLVV  L+ PALH+ GFAGSCSRIW+RDSYW + PSS SN YPLF+P   + S   ++SW+ + +++FGV SL+S+   +     L+    
Subjt:  SLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESER--KSHLSSLESSSF

Query:  SYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSML
        S  SA PHS+ET +NLQ+GIA LK+SVA +T Y Y SLS +VPS  STFE FAKLLATLSS KEV+S  SL ++SS   +H  + NKS W +NS SSS L
Subjt:  SYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSML

Query:  FESAHSQIMKTNCE-SNLPSY--LYATEFSDAGKNDCTIEEWDLIEHP
          S+H+Q    N    N+P+    Y  EF D  K+  +I EW+L+E+P
Subjt:  FESAHSQIMKTNCE-SNLPSY--LYATEFSDAGKNDCTIEEWDLIEHP

AT1G77890.2 DNA-directed RNA polymerase II protein6.7e-10349.55Show/hide
Query:  KFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYDLKL
        K   CA+C  S + SIC  CVN  LN+Y   L  LK+ R++ Y RLS +LV K KA  Q  W+  +NEKL +LREKL+   EKL+Q K      S +LK 
Subjt:  KFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYDLKL

Query:  KYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHELSV
        +Y ++ES    LE+ RV QLE  Y D I    L +           +LV+KQ+CKLFP  +V V G+NK+G S Q+DQIC   LP+GL+P SV P EL+ 
Subjt:  KYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHELSV

Query:  SLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESER--KSHLSSLESSSF
        SLGYMVQLLNLVV  L+ PALH+ GFAGSCSRIW+RDSYW + PSS SN YPLF+P   + S   ++SW+ + +++FGV SL+S+   +     L+    
Subjt:  SLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESER--KSHLSSLESSSF

Query:  SYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSML
        S  SA PHS+ET +NLQ+GIA LK+SVA +T Y Y SLS +VPS  STFE FAKLLATLSS KEV+S  SL ++SS   +H  + NKS W +NS SSS L
Subjt:  SYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSML

Query:  FESAHSQIMKTNCE-SNLPSY--LYATEFSDAGKNDCTIEEWDLIEHP
          S+H+Q    N    N+P+    Y  EF D  K+  +I EW+L+E+P
Subjt:  FESAHSQIMKTNCE-SNLPSY--LYATEFSDAGKNDCTIEEWDLIEHP

AT1G77890.3 DNA-directed RNA polymerase II protein4.0e-10851.12Show/hide
Query:  KFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYDLKL
        K   CA+C  S + SIC  CVN  LN+Y   L  LK+ R++ Y RLS +LV K KA  Q  W+  +NEKL +LREKL+   EKL+Q K      S +LK 
Subjt:  KFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYDLKL

Query:  KYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHELSV
        +Y ++ES    LE+ RV QLE  Y D I    L  + +TSERL+KQ+LV+KQ+CKLFP  +V V G+NK+G S Q+DQIC   LP+GL+P SV P EL+ 
Subjt:  KYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHELSV

Query:  SLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESER--KSHLSSLESSSF
        SLGYMVQLLNLVV  L+ PALH+ GFAGSCSRIW+RDSYW + PSS SN YPLF+P   + S   ++SW+ + +++FGV SL+S+   +     L+    
Subjt:  SLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESER--KSHLSSLESSSF

Query:  SYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSML
        S  SA PHS+ET +NLQ+GIA LK+SVA +T Y Y SLS +VPS  STFE FAKLLATLSS KEV+S  SL ++SS   +H  + NKS W +NS SSS L
Subjt:  SYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSML

Query:  FESAHSQIMKTNCE-SNLPSY--LYATEFSDAGKNDCTIEEWDLIEHP
          S+H+Q    N    N+P+    Y  EF D  K+  +I EW+L+E+P
Subjt:  FESAHSQIMKTNCE-SNLPSY--LYATEFSDAGKNDCTIEEWDLIEHP

AT4G08540.1 DNA-directed RNA polymerase II protein5.0e-16765.26Show/hide
Query:  MNRKFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
        M ++  NCAIC+N+++  ICT CVNHRL +YN+ LK LK RRD L SR +++L +KGKADDQ NWR+ +NEK+++L++KL+ ++E + QGK +IE  S D
Subjt:  MNRKFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
        LK+KY +L+SARS LEK RVEQ+EK +P+LI T++LGHMAI+SERLHKQS+VVKQ+CKLFP R+V   GE++ G   Q+D IC   LP GLDPHS+   E
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE

Query:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKS-HLSSLESS
        L+VSLGYMVQLLNLVV  LAAPALH SGFAGSCSRIWQRDSYW  R S+RSNEYPLFIPR+ Y STS ENSW+DK SS+FGVAS+ES+RK   L S  S+
Subjt:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKS-HLSSLESS

Query:  SFSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSS
        SF Y SA PHSIE+H++LQKGIALLKKSVAC+T+YCYNSL  +VP EASTFEAFAKLLATLSSSKEVRSVFSLKMASSR  K  Q+LNKS W  +S+ SS
Subjt:  SFSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSS

Query:  MLFESAH-SQIMKTNCESNLP-SYLYATEFSDAGKNDCTIEEWDLIEHPTLPPPPSQSENIEHWTRAMITDATKQ
         L ESAH  +    N + N P SYL ATE S    ND  +  WDL+EHP  PPPPSQSE++EHWTRAM  DA K+
Subjt:  MLFESAH-SQIMKTNCESNLP-SYLYATEFSDAGKNDCTIEEWDLIEHPTLPPPPSQSENIEHWTRAMITDATKQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCGGAAATTCTGCAATTGCGCTATCTGTGAGAATTCTAGTCAAGCTTCCATTTGCACCGTTTGCGTCAATCACAGATTGAATGACTACAACTCTGCGTTA
AAATTATTGAAAGCTCGGCGCGATTTGTTGTATTCGAGGTTGAGCGACGTACTTGTGGCAAAGGGGAAGGCAGACGATCAATTAAATTGGAGAGTTACTCGGAAT
GAGAAACTTACTAGGTTAAGGGAGAAACTCCGGCGCAGTAGAGAGAAACTCGAGCAAGGAAAGACAGAGATTGAGTTGACGTCCTATGATCTCAAATTGAAATAT
GCGATGCTTGAATCAGCTCGTTCTGTGTTGGAAAAACAACGAGTTGAACAACTGGAGAAAGCCTATCCCGACCTTATTAGCACCAAGAATCTGGGACATATGGCA
ATTACCTCTGAACGCCTTCACAAACAATCCTTGGTTGTAAAACAAATGTGCAAATTGTTTCCACAACGTCAGGTTTTAGTTCATGGAGAGAACAAAGAGGGACCA
AGCGAGCAATTTGATCAAATCTGTTATGTGAGCTTACCAAGAGGACTGGATCCGCATTCTGTTGAGCCACACGAGCTTTCAGTTTCTTTGGGATACATGGTGCAA
CTTCTGAATCTTGTCGTTCAATATCTGGCTGCTCCTGCACTTCACCACTCGGGTTTTGCAGGTTCTTGTTCACGCATATGGCAAAGGGATTCATATTGGAAAGCT
CGTCCATCTTCCCGAAGCAATGAGTATCCACTTTTTATACCACGTCAAACCTATAGTTCAACAAGTGGGGAAAATTCATGGTCTGATAAATGCTCTAGCAGCTTC
GGTGTTGCTTCCTTGGAATCGGAGAGGAAATCACATTTAAGTTCACTAGAAAGTAGTAGCTTCAGTTATCCGTCCGCTCCTCCACATTCTATAGAAACGCACAAG
AATTTGCAGAAAGGGATTGCCCTCCTCAAGAAAAGTGTAGCATGTGTCACCTCATACTGCTATAACTCTCTCTCTTCAGATGTTCCTTCTGAAGCTTCTACTTTT
GAAGCATTTGCTAAATTATTGGCTACTCTTTCTTCATCCAAGGAAGTGCGTTCTGTTTTTTCCCTCAAAATGGCTTCTTCCAGGCCCCCTAAGCACGTTCAGAAA
CTGAACAAGTCTGCATGGACTGTTAATTCGATTTCATCCAGCATGCTGTTCGAGAGTGCACATTCACAAATAATGAAAACCAATTGTGAGAGTAACCTTCCAAGT
TATCTTTATGCCACCGAGTTTTCTGATGCTGGAAAGAATGATTGCACCATTGAGGAATGGGACCTCATAGAGCATCCAACGCTTCCTCCACCACCATCTCAATCT
GAAAATATTGAGCATTGGACTCGAGCAATGATCACGGATGCCACCAAACAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATCGGAAATTCTGCAATTGCGCTATCTGTGAGAATTCTAGTCAAGCTTCCATTTGCACCGTTTGCGTCAATCACAGATTGAATGACTACAACTCTGCGTTA
AAATTATTGAAAGCTCGGCGCGATTTGTTGTATTCGAGGTTGAGCGACGTACTTGTGGCAAAGGGGAAGGCAGACGATCAATTAAATTGGAGAGTTACTCGGAAT
GAGAAACTTACTAGGTTAAGGGAGAAACTCCGGCGCAGTAGAGAGAAACTCGAGCAAGGAAAGACAGAGATTGAGTTGACGTCCTATGATCTCAAATTGAAATAT
GCGATGCTTGAATCAGCTCGTTCTGTGTTGGAAAAACAACGAGTTGAACAACTGGAGAAAGCCTATCCCGACCTTATTAGCACCAAGAATCTGGGACATATGGCA
ATTACCTCTGAACGCCTTCACAAACAATCCTTGGTTGTAAAACAAATGTGCAAATTGTTTCCACAACGTCAGGTTTTAGTTCATGGAGAGAACAAAGAGGGACCA
AGCGAGCAATTTGATCAAATCTGTTATGTGAGCTTACCAAGAGGACTGGATCCGCATTCTGTTGAGCCACACGAGCTTTCAGTTTCTTTGGGATACATGGTGCAA
CTTCTGAATCTTGTCGTTCAATATCTGGCTGCTCCTGCACTTCACCACTCGGGTTTTGCAGGTTCTTGTTCACGCATATGGCAAAGGGATTCATATTGGAAAGCT
CGTCCATCTTCCCGAAGCAATGAGTATCCACTTTTTATACCACGTCAAACCTATAGTTCAACAAGTGGGGAAAATTCATGGTCTGATAAATGCTCTAGCAGCTTC
GGTGTTGCTTCCTTGGAATCGGAGAGGAAATCACATTTAAGTTCACTAGAAAGTAGTAGCTTCAGTTATCCGTCCGCTCCTCCACATTCTATAGAAACGCACAAG
AATTTGCAGAAAGGGATTGCCCTCCTCAAGAAAAGTGTAGCATGTGTCACCTCATACTGCTATAACTCTCTCTCTTCAGATGTTCCTTCTGAAGCTTCTACTTTT
GAAGCATTTGCTAAATTATTGGCTACTCTTTCTTCATCCAAGGAAGTGCGTTCTGTTTTTTCCCTCAAAATGGCTTCTTCCAGGCCCCCTAAGCACGTTCAGAAA
CTGAACAAGTCTGCATGGACTGTTAATTCGATTTCATCCAGCATGCTGTTCGAGAGTGCACATTCACAAATAATGAAAACCAATTGTGAGAGTAACCTTCCAAGT
TATCTTTATGCCACCGAGTTTTCTGATGCTGGAAAGAATGATTGCACCATTGAGGAATGGGACCTCATAGAGCATCCAACGCTTCCTCCACCACCATCTCAATCT
GAAAATATTGAGCATTGGACTCGAGCAATGATCACGGATGCCACCAAACAGTAATTTAATGGTATATGGAGATAGCATGAGAAATAGAAGAGGATTTGGAACTAC
TCAATACAATCAGCTTTCGACCTGTAAATATGAAGACTTCCAAAAGTGGAAGGGCAGAGATTTATTCTTCTTCCAGTCCTTCAAAGGTGTCTTCCTCTCCATCTT
CTTCCACGATTACTTATAATCTTTATTATCTTCATCCTTCTTATCACTCCCACCATCATCCTTATCTTTATCACCACCACCATCGCCACTGTTCTCACTTCCACC
ACGGTGGTCATCATTGCCGTCATCTCTTCCTCCTTTGGTGATATCATCTTTTTACCATTTTTCTTTCTAGTAATGTATGCATCTGTATCATCGTAAAAAAAGATA
TTTTTCGATGCATGAGGTCCT
Protein sequenceShow/hide protein sequence
MNRKFCNCAICENSSQASICTVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYDLKLKY
AMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRQVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHELSVSLGYMVQ
LLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLSSLESSSFSYPSAPPHSIETHK
NLQKGIALLKKSVACVTSYCYNSLSSDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSSMLFESAHSQIMKTNCESNLPS
YLYATEFSDAGKNDCTIEEWDLIEHPTLPPPPSQSENIEHWTRAMITDATKQ