CuGenDBv2

Carg06076 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg06076
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: DNA-directed RNA polymerase II protein
Genome location: Carg_Chr04:21790652..21796901
RNA-Seq Expression: Carg06076
Synteny: Carg06076
Gene Ontology terms:
GO:0035493 - SNARE complex assembly (biological process)
GO:0000323 - lytic vacuole (cellular component)
GO:0005768 - endosome (cellular component)
GO:0000149 - SNARE binding (molecular function)
InterPro domains: IPR018791 - UV radiation resistance protein/autophagy-related protein 14


Homology
GenBank top hits (e value, % identity, alignment)
KAG6602709.1 hypothetical protein SDJN03_07942, partial [Cucurbita argyrosperma subsp. sororia] (e value: 4.1e-264; % identity: 99.15)
Query:  MNRKFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
        MNRKFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
Subjt:  MNRKFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
        LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE

Query:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLCSLESSS
        LSVSLGYMVQLLNLVVQYLAAP+LHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERK HL SLESSS
Subjt:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLCSLESSS

Query:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSTM
        FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLL TLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSTM
Subjt:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSTM

Query:  LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDFIEHPTLPPPPSQSENIEHWTRAMITDATKQ
        LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDFIEHPTLPPPPSQSENIEHWTRAMITDATKQ
Subjt:  LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDFIEHPTLPPPPSQSENIEHWTRAMITDATKQ

KAG7033396.1 hypothetical protein SDJN02_07452, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 8.9e-267; % identity: 100)
Query:  MNRKFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
        MNRKFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
Subjt:  MNRKFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
        LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE

Query:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLCSLESSS
        LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLCSLESSS
Subjt:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLCSLESSS

Query:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSTM
        FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSTM
Subjt:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSTM

Query:  LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDFIEHPTLPPPPSQSENIEHWTRAMITDATKQ
        LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDFIEHPTLPPPPSQSENIEHWTRAMITDATKQ
Subjt:  LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDFIEHPTLPPPPSQSENIEHWTRAMITDATKQ

XP_022962750.1 uncharacterized protein LOC111463144 isoform X1 [Cucurbita moschata] (e value: 3.7e-265; % identity: 99.58)
Query:  MNRKFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
        MNRKFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
Subjt:  MNRKFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
        LKLKY MLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE

Query:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLCSLESSS
        LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHL SLESSS
Subjt:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLCSLESSS

Query:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSTM
        FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSTM
Subjt:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSTM

Query:  LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDFIEHPTLPPPPSQSENIEHWTRAMITDATKQ
        LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDFIEHPTLPPPPSQSENIEHWTRAMITDATKQ
Subjt:  LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDFIEHPTLPPPPSQSENIEHWTRAMITDATKQ

XP_022990571.1 uncharacterized protein LOC111487411 isoform X1 [Cucurbita maxima] (e value: 7.8e-263; % identity: 98.73)
Query:  MNRKFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
        MNRKFCNCAICENSSQASIC VCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
Subjt:  MNRKFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
        LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQR+VLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE

Query:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLCSLESSS
        LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHL SLESSS
Subjt:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLCSLESSS

Query:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSTM
        FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLS DVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISS+M
Subjt:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSTM

Query:  LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDFIEHPTLPPPPSQSENIEHWTRAMITDATKQ
        LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWD IEHPTLPPPPSQSENIEHWTRAMITDATKQ
Subjt:  LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDFIEHPTLPPPPSQSENIEHWTRAMITDATKQ

XP_023518763.1 uncharacterized protein LOC111782150 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 1.1e-264; % identity: 99.36)
Query:  MNRKFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
        MNRKFCNCAICENSSQASIC VCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
Subjt:  MNRKFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
        LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE

Query:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLCSLESSS
        LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHL SLESSS
Subjt:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLCSLESSS

Query:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSTM
        FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISS+M
Subjt:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSTM

Query:  LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDFIEHPTLPPPPSQSENIEHWTRAMITDATKQ
        LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDFIEHPTLPPPPSQSENIEHWTRAMITDATKQ
Subjt:  LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDFIEHPTLPPPPSQSENIEHWTRAMITDATKQ

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T0W2 UV radiation resistance protein/autophagy-related protein 14 (e value: 1.1e-222; % identity: 84.45)
Query:  MNRKFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
        MNRKFCNCAICENS+QASIC  CVN RLNDYN++LK L+ARRD+LYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSRE+LEQGK EIE+ S+D
Subjt:  MNRKFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
        L+LKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQS+V+KQ+CKLFPQRRVLV G+ + GP E FDQIC VSLPR LDPHSVEP+E
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE

Query:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLCSLESSS
        LS SLGYMVQLLNLVVQYLAAPALH+SGFAGSCSRIWQRDSYW A PSSRSNEYP+F+PRQ+Y STSGENSWSDK SS+FGVASLESERK HL SLE+ S
Subjt:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLCSLESSS

Query:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSTM
        F+Y SA PHSIE+HK+LQKGIALLKKSVACVT+Y YNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKM SSR  KH+QK  KS W VNSISS+M
Subjt:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSTM

Query:  LFESAHSQIMKTNCESNLP----SYLYATEFSDAGKNDCTIEEWDFIEHPTLPPPPSQSENIEHWTRAMITDATKQ
        LFES HSQIMKTN ESNLP    SYLYATEFSDAGKND TIE WD +EHPT PPPPSQ+E+IEHWTRAM  DATK+
Subjt:  LFESAHSQIMKTNCESNLP----SYLYATEFSDAGKNDCTIEEWDFIEHPTLPPPPSQSENIEHWTRAMITDATKQ

A0A6J1FFY6 uncharacterized protein LOC111445131 (e value: 1.8e-225; % identity: 86.11)
Query:  MNRKFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
        MNRKFCNCAICENS+QASIC  CVNHRLNDYNS LK L+ RRD LYSRLSDVLVAKGKADDQLNWRVTRNEKL+RLREKLRR RE+LEQGK EIE+TSYD
Subjt:  MNRKFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
        LKLK+AMLESARSVLEKQRVEQLEKAYPDLISTK LGHMAITSERLHKQS+VVKQ+CKLFPQRRVLVHGENKEGP EQFDQIC VSLPR LDPHSV+PHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE

Query:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLCSLESSS
        LS SLGYMVQLLNL+V  LAAPALH+SGFAGSCSRIWQR+SYW A PSS+SNEYPLFIPRQ Y STSGENSWSDK SS+FGVASLESERK HL SLE+ S
Subjt:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLCSLESSS

Query:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSTM
        F+Y SA  HSIETHK+LQ GIALLKKSVAC+T+YCYNSL LDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSR PKHVQKLNKSAW VNSISS+M
Subjt:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSTM

Query:  LFESAHSQIMKTNCESNLP----SYLYATEFSDAGKNDCTIEEWDFIEHPTLPPPPSQSENIEHWTRAMITDATK
        L ESAHSQIMKTN ESN P    SYLYATEFSDA KND TIE WD IEHPT PPPPSQ+E+IEHWTRAM  DATK
Subjt:  LFESAHSQIMKTNCESNLP----SYLYATEFSDAGKNDCTIEEWDFIEHPTLPPPPSQSENIEHWTRAMITDATK

A0A6J1HFZ3 uncharacterized protein LOC111463144 isoform X1 (e value: 1.8e-265; % identity: 99.58)
Query:  MNRKFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
        MNRKFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
Subjt:  MNRKFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
        LKLKY MLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE

Query:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLCSLESSS
        LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHL SLESSS
Subjt:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLCSLESSS

Query:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSTM
        FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSTM
Subjt:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSTM

Query:  LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDFIEHPTLPPPPSQSENIEHWTRAMITDATKQ
        LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDFIEHPTLPPPPSQSENIEHWTRAMITDATKQ
Subjt:  LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDFIEHPTLPPPPSQSENIEHWTRAMITDATKQ

A0A6J1JNB3 uncharacterized protein LOC111487411 isoform X1 (e value: 3.8e-263; % identity: 98.73)
Query:  MNRKFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
        MNRKFCNCAICENSSQASIC VCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
Subjt:  MNRKFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
        LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQR+VLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE

Query:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLCSLESSS
        LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHL SLESSS
Subjt:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLCSLESSS

Query:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSTM
        FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLS DVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISS+M
Subjt:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSTM

Query:  LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDFIEHPTLPPPPSQSENIEHWTRAMITDATKQ
        LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWD IEHPTLPPPPSQSENIEHWTRAMITDATKQ
Subjt:  LFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEWDFIEHPTLPPPPSQSENIEHWTRAMITDATKQ

A0A6J1K104 uncharacterized protein LOC111489176 (e value: 9.1e-225; % identity: 85.89)
Query:  MNRKFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
        MNR+FCNCAICENS+QASIC  CVNHRLNDYNS LK L+ARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKL+ LREKLRR RE+LEQGKTEIE+TSYD
Subjt:  MNRKFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
        LKLK+AMLESARSVLEKQRVEQLEKAYPDLISTK LGHMAITSERLHKQS+VVKQ+CKLFPQRRVLVHGENKEGP EQFDQIC VSLPR LDPHSV+PHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE

Query:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLCSLESSS
        LS SLGYMVQLLNL+V  LAAPALH+SGFAGSCSRIWQR+SYW A PSS+ NEYPLFIPRQ Y STSGENSWSDK SS+FGVASLESERK HL SLE+ S
Subjt:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLCSLESSS

Query:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSTM
        F+Y SA  HSIETHK+LQ GIALLKKSVAC+T+YCYNSL LDVPSEASTFEAFAKLLA LSSSKEVRSVFSLKMASSR PKHVQKLNKSAW VNSISS+M
Subjt:  FSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSTM

Query:  LFESAHSQIMKTNCESNLP----SYLYATEFSDAGKNDCTIEEWDFIEHPTLPPPPSQSENIEHWTRAMITDATK
        L ESAHSQIMKTN ESN P    SYLYATEFSDA KND TIE WD IEHPT PPPPSQ+E+IEHWTRAM  DATK
Subjt:  LFESAHSQIMKTNCESNLP----SYLYATEFSDAGKNDCTIEEWDFIEHPTLPPPPSQSENIEHWTRAMITDATK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G77890.1 DNA-directed RNA polymerase II protein (e value: 2.5e-110; % identity: 51.56)
Query:  KFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYDLKL
        K   CA+C  S + SIC  CVN  LN+Y   L  LK+ R++ Y RLS +LV K KA  Q  W+  +NEKL +LREKL+   EKL+Q K      S +LK 
Subjt:  KFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYDLKL

Query:  KYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHELSV
        +Y ++ES    LE+ RV QLE  Y D I    L ++ +TSERL+KQ+LV+KQ+CKLFP  RV V G+NK+G S Q+DQIC   LP+GL+P SV P EL+ 
Subjt:  KYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHELSV

Query:  SLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLES----ERKSHLCSLESS
        SLGYMVQLLNLVV  L+ PALH+ GFAGSCSRIW+RDSYW + PSS SN YPLF+P   + S   ++SW+ + +++FGV SL+S    E   H   L+  
Subjt:  SLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLES----ERKSHLCSLESS

Query:  SFSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISST
        S S  SA PHS+ET +NLQ+GIA LK+SVA +T Y Y SLSL+VPS  STFE FAKLLATLSS KEV+S  SL ++SS   +H  + NKS W +NS SS+
Subjt:  SFSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISST

Query:  MLFESAHSQIMKTNCE-SNLPSY--LYATEFSDAGKNDCTIEEWDFIEHP
         L  S+H+Q    N    N+P+    Y  EF D  K+  +I EW+ +E+P
Subjt:  MLFESAHSQIMKTNCE-SNLPSY--LYATEFSDAGKNDCTIEEWDFIEHP

AT1G77890.2 DNA-directed RNA polymerase II protein (e value: 1.3e-103; % identity: 50)
Query:  KFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYDLKL
        K   CA+C  S + SIC  CVN  LN+Y   L  LK+ R++ Y RLS +LV K KA  Q  W+  +NEKL +LREKL+   EKL+Q K      S +LK 
Subjt:  KFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYDLKL

Query:  KYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHELSV
        +Y ++ES    LE+ RV QLE  Y D I    L +           +LV+KQ+CKLFP  RV V G+NK+G S Q+DQIC   LP+GL+P SV P EL+ 
Subjt:  KYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHELSV

Query:  SLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLES----ERKSHLCSLESS
        SLGYMVQLLNLVV  L+ PALH+ GFAGSCSRIW+RDSYW + PSS SN YPLF+P   + S   ++SW+ + +++FGV SL+S    E   H   L+  
Subjt:  SLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLES----ERKSHLCSLESS

Query:  SFSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISST
        S S  SA PHS+ET +NLQ+GIA LK+SVA +T Y Y SLSL+VPS  STFE FAKLLATLSS KEV+S  SL ++SS   +H  + NKS W +NS SS+
Subjt:  SFSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISST

Query:  MLFESAHSQIMKTNCE-SNLPSY--LYATEFSDAGKNDCTIEEWDFIEHP
         L  S+H+Q    N    N+P+    Y  EF D  K+  +I EW+ +E+P
Subjt:  MLFESAHSQIMKTNCE-SNLPSY--LYATEFSDAGKNDCTIEEWDFIEHP

AT1G77890.3 DNA-directed RNA polymerase II protein (e value: 8.2e-109; % identity: 51.56)
Query:  KFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYDLKL
        K   CA+C  S + SIC  CVN  LN+Y   L  LK+ R++ Y RLS +LV K KA  Q  W+  +NEKL +LREKL+   EKL+Q K      S +LK 
Subjt:  KFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYDLKL

Query:  KYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHELSV
        +Y ++ES    LE+ RV QLE  Y D I    L  + +TSERL+KQ+LV+KQ+CKLFP  RV V G+NK+G S Q+DQIC   LP+GL+P SV P EL+ 
Subjt:  KYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHELSV

Query:  SLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLES----ERKSHLCSLESS
        SLGYMVQLLNLVV  L+ PALH+ GFAGSCSRIW+RDSYW + PSS SN YPLF+P   + S   ++SW+ + +++FGV SL+S    E   H   L+  
Subjt:  SLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLES----ERKSHLCSLESS

Query:  SFSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISST
        S S  SA PHS+ET +NLQ+GIA LK+SVA +T Y Y SLSL+VPS  STFE FAKLLATLSS KEV+S  SL ++SS   +H  + NKS W +NS SS+
Subjt:  SFSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISST

Query:  MLFESAHSQIMKTNCE-SNLPSY--LYATEFSDAGKNDCTIEEWDFIEHP
         L  S+H+Q    N    N+P+    Y  EF D  K+  +I EW+ +E+P
Subjt:  MLFESAHSQIMKTNCE-SNLPSY--LYATEFSDAGKNDCTIEEWDFIEHP

AT4G08540.1 DNA-directed RNA polymerase II protein (e value: 5.0e-167; % identity: 65.05)
Query:  MNRKFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD
        M ++  NCAIC+N+++  IC  CVNHRL +YN+ LK LK RRD L SR +++L +KGKADDQ NWR+ +NEK+++L++KL+ ++E + QGK +IE  S D
Subjt:  MNRKFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE
        LK+KY +L+SARS LEK RVEQ+EK +P+LI T++LGHMAI+SERLHKQS+VVKQ+CKLFP RRV   GE++ G   Q+D IC   LP GLDPHS+   E
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHE

Query:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKS-HLCSLESS
        L+VSLGYMVQLLNLVV  LAAPALH SGFAGSCSRIWQRDSYW  R S+RSNEYPLFIPR+ Y STS ENSW+DK SS+FGVAS+ES+RK   L S  S+
Subjt:  LSVSLGYMVQLLNLVVQYLAAPALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKS-HLCSLESS

Query:  SFSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISST
        SF Y SA PHSIE+H++LQKGIALLKKSVAC+T+YCYNSL L+VP EASTFEAFAKLLATLSSSKEVRSVFSLKMASSR  K  Q+LNKS W  +S+ S+
Subjt:  SFSYPSAPPHSIETHKNLQKGIALLKKSVACVTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISST

Query:  MLFESAH-SQIMKTNCESNLP-SYLYATEFSDAGKNDCTIEEWDFIEHPTLPPPPSQSENIEHWTRAMITDATKQ
         L ESAH  +    N + N P SYL ATE S    ND  +  WD +EHP  PPPPSQSE++EHWTRAM  DA K+
Subjt:  MLFESAH-SQIMKTNCESNLP-SYLYATEFSDAGKNDCTIEEWDFIEHPTLPPPPSQSENIEHWTRAMITDATKQ


Sequences
CDS sequence
ATGAATCGGAAATTCTGCAATTGCGCTATCTGTGAGAATTCTAGTCAAGCTTCCATTTGCGCCGTTTGCGTCAATCACAGGTTGAATGACTACAACTCTGCGTTAAAATT
ATTGAAAGCTCGGCGCGATTTGTTGTATTCGAGGTTGAGCGACGTACTTGTGGCAAAGGGGAAGGCAGACGATCAATTAAATTGGAGAGTTACTCGGAATGAGAAACTTA
CTAGGTTAAGGGAGAAACTCCGGCGCAGTAGAGAGAAACTCGAGCAAGGAAAGACAGAGATTGAGTTGACGTCCTATGATCTCAAATTGAAATATGCGATGCTTGAATCA
GCTCGTTCTGTGTTGGAAAAACAACGAGTTGAACAACTGGAGAAAGCCTATCCCGACCTTATTAGCACCAAGAATCTGGGACATATGGCAATTACCTCTGAACGCCTTCA
CAAACAATCCTTGGTTGTAAAACAAATGTGCAAATTGTTTCCACAACGTCGGGTTTTAGTTCATGGAGAGAACAAAGAGGGACCAAGCGAGCAATTTGATCAAATCTGTT
ATGTGAGCTTACCAAGAGGACTGGATCCGCATTCTGTTGAGCCACACGAGCTTTCAGTTTCTTTGGGATACATGGTGCAACTTCTAAATCTTGTCGTTCAATATCTGGCT
GCTCCTGCACTTCACCACTCGGGTTTTGCAGGTTCTTGTTCACGCATATGGCAAAGGGATTCATATTGGAAAGCTCGTCCATCTTCCCGAAGCAATGAGTATCCACTTTT
TATACCACGTCAAACCTATAGTTCAACAAGTGGGGAAAATTCATGGTCTGATAAATGCTCTAGCAGCTTCGGTGTTGCTTCCTTGGAATCAGAGAGGAAATCACATTTAT
GTTCACTAGAAAGTAGTAGCTTCAGTTATCCGTCCGCTCCTCCACATTCTATAGAAACGCACAAGAATTTGCAGAAGGGGATTGCCCTCCTCAAGAAAAGTGTAGCATGT
GTCACCTCATACTGCTATAACTCTCTCTCTTTAGATGTTCCTTCTGAAGCTTCTACTTTTGAAGCATTTGCTAAATTATTGGCTACTCTTTCTTCATCCAAGGAAGTGCG
TTCTGTTTTTTCCCTCAAAATGGCTTCTTCCAGGCCCCCTAAGCACGTTCAGAAACTGAACAAGTCTGCATGGACTGTTAATTCGATTTCATCCACCATGCTGTTCGAGA
GTGCACATTCACAAATAATGAAAACCAATTGTGAGAGTAACCTTCCAAGTTATCTTTATGCCACCGAGTTTTCTGATGCTGGAAAGAATGATTGCACCATCGAGGAATGG
GACTTTATAGAGCATCCAACGCTTCCTCCACCACCATCTCAATCTGAAAATATTGAGCATTGGACTCGAGCAATGATCACCGATGCCACCAAACAGTAA
mRNA sequence
ATGAATCGGAAATTCTGCAATTGCGCTATCTGTGAGAATTCTAGTCAAGCTTCCATTTGCGCCGTTTGCGTCAATCACAGGTTGAATGACTACAACTCTGCGTTAAAATT
ATTGAAAGCTCGGCGCGATTTGTTGTATTCGAGGTTGAGCGACGTACTTGTGGCAAAGGGGAAGGCAGACGATCAATTAAATTGGAGAGTTACTCGGAATGAGAAACTTA
CTAGGTTAAGGGAGAAACTCCGGCGCAGTAGAGAGAAACTCGAGCAAGGAAAGACAGAGATTGAGTTGACGTCCTATGATCTCAAATTGAAATATGCGATGCTTGAATCA
GCTCGTTCTGTGTTGGAAAAACAACGAGTTGAACAACTGGAGAAAGCCTATCCCGACCTTATTAGCACCAAGAATCTGGGACATATGGCAATTACCTCTGAACGCCTTCA
CAAACAATCCTTGGTTGTAAAACAAATGTGCAAATTGTTTCCACAACGTCGGGTTTTAGTTCATGGAGAGAACAAAGAGGGACCAAGCGAGCAATTTGATCAAATCTGTT
ATGTGAGCTTACCAAGAGGACTGGATCCGCATTCTGTTGAGCCACACGAGCTTTCAGTTTCTTTGGGATACATGGTGCAACTTCTAAATCTTGTCGTTCAATATCTGGCT
GCTCCTGCACTTCACCACTCGGGTTTTGCAGGTTCTTGTTCACGCATATGGCAAAGGGATTCATATTGGAAAGCTCGTCCATCTTCCCGAAGCAATGAGTATCCACTTTT
TATACCACGTCAAACCTATAGTTCAACAAGTGGGGAAAATTCATGGTCTGATAAATGCTCTAGCAGCTTCGGTGTTGCTTCCTTGGAATCAGAGAGGAAATCACATTTAT
GTTCACTAGAAAGTAGTAGCTTCAGTTATCCGTCCGCTCCTCCACATTCTATAGAAACGCACAAGAATTTGCAGAAGGGGATTGCCCTCCTCAAGAAAAGTGTAGCATGT
GTCACCTCATACTGCTATAACTCTCTCTCTTTAGATGTTCCTTCTGAAGCTTCTACTTTTGAAGCATTTGCTAAATTATTGGCTACTCTTTCTTCATCCAAGGAAGTGCG
TTCTGTTTTTTCCCTCAAAATGGCTTCTTCCAGGCCCCCTAAGCACGTTCAGAAACTGAACAAGTCTGCATGGACTGTTAATTCGATTTCATCCACCATGCTGTTCGAGA
GTGCACATTCACAAATAATGAAAACCAATTGTGAGAGTAACCTTCCAAGTTATCTTTATGCCACCGAGTTTTCTGATGCTGGAAAGAATGATTGCACCATCGAGGAATGG
GACTTTATAGAGCATCCAACGCTTCCTCCACCACCATCTCAATCTGAAAATATTGAGCATTGGACTCGAGCAATGATCACCGATGCCACCAAACAGTAATTTAATGGTAT
ATGGAGATAGCATGAGAAATAGAAGGGGATTTGGAACTACTCAATACACAATCAGCTTTCGACCTGTAAATATGAAGACTTCCAAAAGGTGTACATCCCTCAACATGTTT
AGGTTTAGGATAACTTTTATTATTCCATCATACTAGCCAATAGTATTTTCGGATCTTTTTTTTCATTATTATTTGTTTTTAAAATGGGTGCACATGGAGGAGATGGGCTG
TGGGGGCGTCTTTCAAATATCTGTAGTAATTGGTCTTTAAATTTGCTCATCCCGTATGTTGTCATGATTTTATTGGTTTGAATCATCACTCCACTTAATAACTAGATTTT
ACTGAACACAAACACTAATTGATAACAACTATTCGATAAATTAGCATATTTTCTTGATCACTCTTTTACTTCTGTCTTTTTTGGTAGTGGAAGGGCAGAGGTTTATTCTT
CTTCCTGTCCTTCAAAGGTGTCTTCCTCTCCATCTTCTTCC
Protein sequence
MNRKFCNCAICENSSQASICAVCVNHRLNDYNSALKLLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREKLEQGKTEIELTSYDLKLKYAMLES
ARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSLVVKQMCKLFPQRRVLVHGENKEGPSEQFDQICYVSLPRGLDPHSVEPHELSVSLGYMVQLLNLVVQYLA
APALHHSGFAGSCSRIWQRDSYWKARPSSRSNEYPLFIPRQTYSSTSGENSWSDKCSSSFGVASLESERKSHLCSLESSSFSYPSAPPHSIETHKNLQKGIALLKKSVAC
VTSYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRPPKHVQKLNKSAWTVNSISSTMLFESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEEW
DFIEHPTLPPPPSQSENIEHWTRAMITDATKQ