CuGenDBv2

Sed0003415 (gene) of Chayote v1 genome

Gene ID: Sed0003415
Organism: Sechium edule (Chayote v1)
Description: DNA-directed RNA polymerase II protein
Genome location: LG03:4016070..4031082
RNA-Seq Expression: Sed0003415
Synteny: Sed0003415
Gene Ontology terms: GO:0035493 - SNARE complex assembly (biological process)
GO:0000323 - lytic vacuole (cellular component)
GO:0005768 - endosome (cellular component)
GO:0000149 - SNARE binding (molecular function)
InterPro domains: IPR018791 - UV radiation resistance protein/autophagy-related protein 14
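
The genome location above follows a `seqid:start..end` pattern. A minimal sketch of how such a string could be split into its parts is given here; the names `Locus` and `parse_location` are hypothetical, and the span length assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval such as LG03:4016070..4031082 (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Split 'seqid:start..end' into its three components."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(match.group(1), int(match.group(2)), int(match.group(3)))


locus = parse_location("LG03:4016070..4031082")
print(locus.seqid, locus.start, locus.end, locus.length)
```

Under that coordinate assumption the gene spans 15013 bp on LG03.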


Homology
GenBank top hits (e value, % identity, alignment)
KAG7015913.1 hypothetical protein SDJN02_21017 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 2.1e-231; % identity: 88.26)
Query:  MNRRFFNCAICANSNQASICTVCVNHRLNDYNSTLKSLKARRDFLYSRLSDVLVAKGKADDRLNWRVTRNEKLTRLTEKLRRSREQLEQGKAEIEVKSYD
        MNR+F NCAIC NSNQASIC  CVNHRLNDYNSTLKSL+ RRD LYSRLSDVLVAKGKADD+LNWR+TRNEKL+RL EKLRR REQLEQGKAEIE+ SYD
Subjt:  MNRRFFNCAICANSNQASICTVCVNHRLNDYNSTLKSLKARRDFLYSRLSDVLVAKGKADDRLNWRVTRNEKLTRLTEKLRRSREQLEQGKAEIEVKSYD

Query:  LKLKHAMLESARSVLEKQRIEQLEKAYPDLISTKNLGYMAITSERLHKQSVIVKQMCKLFPQRRVLVHGENKEGPGEQLDQICYVSLPRRLDPHSVQPHE
        LKLKHAMLESARSVLEKQR+EQLEKAYPDLISTK LG+MAITSERLHKQSV+VKQ+CKLFPQRRVLVHGENKEGPGEQ DQIC VSLPRRLDPHSVQPHE
Subjt:  LKLKHAMLESARSVLEKQRIEQLEKAYPDLISTKNLGYMAITSERLHKQSVIVKQMCKLFPQRRVLVHGENKEGPGEQLDQICYVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAVPALHNSGFAGSCSRIWQRDSYWNAGPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESSS
        LSASLGYMVQLLNLIV  LA PALHNSGFAGSCSRIWQR+SYW+A PSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLE + 
Subjt:  LSASLGYMVQLLNLIVQYLAVPALHNSGFAGSCSRIWQRDSYWNAGPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESSS

Query:  SFNYSSASPHSIETHKDLEKGIALLKKSVACLTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSREVRSAVSLKMASSRSHKHVQKLNKSAWNVNSVSSS
        SFNYSSAS HSIETHKDL+ GIALLKKSVAC+TAYCYNSL LDVPSEASTFEAFAKLLATLSSS+EVRS  SLKMASSRS KHVQKLNKSAWNVNS+SSS
Subjt:  SFNYSSASPHSIETHKDLEKGIALLKKSVACLTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSREVRSAVSLKMASSRSHKHVQKLNKSAWNVNSVSSS

Query:  MLLESAHSQILKTNCES---SNLPSYLYATEFSDAGKNDCTIDGWDLIEHPTFPPPPSQTEDVEHWTRAMITDATKL
        MLLESAHSQI+KTN ES   S+  SYLYATEFSDA KND TI+GWDLIEHPTFPPPPSQ ED+EHWTRAM  DATKL
Subjt:  MLLESAHSQILKTNCES---SNLPSYLYATEFSDAGKNDCTIDGWDLIEHPTFPPPPSQTEDVEHWTRAMITDATKL

XP_022939132.1 uncharacterized protein LOC111445131 [Cucurbita moschata] (e value: 9.4e-232; % identity: 88.47)
Query:  MNRRFFNCAICANSNQASICTVCVNHRLNDYNSTLKSLKARRDFLYSRLSDVLVAKGKADDRLNWRVTRNEKLTRLTEKLRRSREQLEQGKAEIEVKSYD
        MNR+F NCAIC NSNQASIC  CVNHRLNDYNSTLKSL+ RRD LYSRLSDVLVAKGKADD+LNWRVTRNEKL+RL EKLRR REQLEQGKAEIE+ SYD
Subjt:  MNRRFFNCAICANSNQASICTVCVNHRLNDYNSTLKSLKARRDFLYSRLSDVLVAKGKADDRLNWRVTRNEKLTRLTEKLRRSREQLEQGKAEIEVKSYD

Query:  LKLKHAMLESARSVLEKQRIEQLEKAYPDLISTKNLGYMAITSERLHKQSVIVKQMCKLFPQRRVLVHGENKEGPGEQLDQICYVSLPRRLDPHSVQPHE
        LKLKHAMLESARSVLEKQR+EQLEKAYPDLISTK LG+MAITSERLHKQSV+VKQ+CKLFPQRRVLVHGENKEGPGEQ DQIC VSLPRRLDPHSVQPHE
Subjt:  LKLKHAMLESARSVLEKQRIEQLEKAYPDLISTKNLGYMAITSERLHKQSVIVKQMCKLFPQRRVLVHGENKEGPGEQLDQICYVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAVPALHNSGFAGSCSRIWQRDSYWNAGPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESSS
        LSASLGYMVQLLNLIV  LA PALHNSGFAGSCSRIWQR+SYW+A PSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLE + 
Subjt:  LSASLGYMVQLLNLIVQYLAVPALHNSGFAGSCSRIWQRDSYWNAGPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESSS

Query:  SFNYSSASPHSIETHKDLEKGIALLKKSVACLTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSREVRSAVSLKMASSRSHKHVQKLNKSAWNVNSVSSS
        SFNYSSAS HSIETHKDL+ GIALLKKSVAC+TAYCYNSL LDVPSEASTFEAFAKLLATLSSS+EVRS  SLKMASSRS KHVQKLNKSAWNVNS+SSS
Subjt:  SFNYSSASPHSIETHKDLEKGIALLKKSVACLTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSREVRSAVSLKMASSRSHKHVQKLNKSAWNVNSVSSS

Query:  MLLESAHSQILKTNCES---SNLPSYLYATEFSDAGKNDCTIDGWDLIEHPTFPPPPSQTEDVEHWTRAMITDATKL
        MLLESAHSQI+KTN ES   S+  SYLYATEFSDA KND TI+GWDLIEHPTFPPPPSQ ED+EHWTRAM  DATKL
Subjt:  MLLESAHSQILKTNCES---SNLPSYLYATEFSDAGKNDCTIDGWDLIEHPTFPPPPSQTEDVEHWTRAMITDATKL

XP_022990571.1 uncharacterized protein LOC111487411 isoform X1 [Cucurbita maxima] (e value: 3.0e-230; % identity: 87.32)
Query:  MNRRFFNCAICANSNQASICTVCVNHRLNDYNSTLKSLKARRDFLYSRLSDVLVAKGKADDRLNWRVTRNEKLTRLTEKLRRSREQLEQGKAEIEVKSYD
        MNR+F NCAIC NS+QASICTVCVNHRLNDYNS LK LKARRD LYSRLSDVLVAKGKADD+LNWRVTRNEKLTRL EKLRRSRE+LEQGK EIE+ SYD
Subjt:  MNRRFFNCAICANSNQASICTVCVNHRLNDYNSTLKSLKARRDFLYSRLSDVLVAKGKADDRLNWRVTRNEKLTRLTEKLRRSREQLEQGKAEIEVKSYD

Query:  LKLKHAMLESARSVLEKQRIEQLEKAYPDLISTKNLGYMAITSERLHKQSVIVKQMCKLFPQRRVLVHGENKEGPGEQLDQICYVSLPRRLDPHSVQPHE
        LKLK+AMLESARSVLEKQR+EQLEKAYPDLISTKNLG+MAITSERLHKQS++VKQMCKLFPQR+VLVHGENKEGP EQ DQICYVSLPR LDPHSV+PHE
Subjt:  LKLKHAMLESARSVLEKQRIEQLEKAYPDLISTKNLGYMAITSERLHKQSVIVKQMCKLFPQRRVLVHGENKEGPGEQLDQICYVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAVPALHNSGFAGSCSRIWQRDSYWNAGPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESSS
        LS SLGYMVQLLNL+VQYLA PALH+SGFAGSCSRIWQRDSYW A PSS+SNEYPLFIPRQ Y STSGENSWSDK SS+FGVASLESERK HLSSLE SS
Subjt:  LSASLGYMVQLLNLIVQYLAVPALHNSGFAGSCSRIWQRDSYWNAGPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESSS

Query:  SFNYSSASPHSIETHKDLEKGIALLKKSVACLTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSREVRSAVSLKMASSRSHKHVQKLNKSAWNVNSVSSS
        SF+Y SA PHSIETHK+L+KGIALLKKSVAC+T+YCYNSLS DVPSEASTFEAFAKLLATLSSS+EVRS  SLKMASSR  KHVQKLNKSAW VNS+SSS
Subjt:  SFNYSSASPHSIETHKDLEKGIALLKKSVACLTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSREVRSAVSLKMASSRSHKHVQKLNKSAWNVNSVSSS

Query:  MLLESAHSQILKTNCESSNLPSYLYATEFSDAGKNDCTIDGWDLIEHPTFPPPPSQTEDVEHWTRAMITDATK
        ML ESAHSQI+KTNCE SNLPSYLYATEFSDAGKNDCTI+ WDLIEHPT PPPPSQ+E++EHWTRAMITDATK
Subjt:  MLLESAHSQILKTNCESSNLPSYLYATEFSDAGKNDCTIDGWDLIEHPTFPPPPSQTEDVEHWTRAMITDATK

XP_023518763.1 uncharacterized protein LOC111782150 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 6.1e-231; % identity: 87.53)
Query:  MNRRFFNCAICANSNQASICTVCVNHRLNDYNSTLKSLKARRDFLYSRLSDVLVAKGKADDRLNWRVTRNEKLTRLTEKLRRSREQLEQGKAEIEVKSYD
        MNR+F NCAIC NS+QASICTVCVNHRLNDYNS LK LKARRD LYSRLSDVLVAKGKADD+LNWRVTRNEKLTRL EKLRRSRE+LEQGK EIE+ SYD
Subjt:  MNRRFFNCAICANSNQASICTVCVNHRLNDYNSTLKSLKARRDFLYSRLSDVLVAKGKADDRLNWRVTRNEKLTRLTEKLRRSREQLEQGKAEIEVKSYD

Query:  LKLKHAMLESARSVLEKQRIEQLEKAYPDLISTKNLGYMAITSERLHKQSVIVKQMCKLFPQRRVLVHGENKEGPGEQLDQICYVSLPRRLDPHSVQPHE
        LKLK+AMLESARSVLEKQR+EQLEKAYPDLISTKNLG+MAITSERLHKQS++VKQMCKLFPQRRVLVHGENKEGP EQ DQICYVSLPR LDPHSV+PHE
Subjt:  LKLKHAMLESARSVLEKQRIEQLEKAYPDLISTKNLGYMAITSERLHKQSVIVKQMCKLFPQRRVLVHGENKEGPGEQLDQICYVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAVPALHNSGFAGSCSRIWQRDSYWNAGPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESSS
        LS SLGYMVQLLNL+VQYLA PALH+SGFAGSCSRIWQRDSYW A PSS+SNEYPLFIPRQ Y STSGENSWSDK SS+FGVASLESERK HLSSLE SS
Subjt:  LSASLGYMVQLLNLIVQYLAVPALHNSGFAGSCSRIWQRDSYWNAGPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESSS

Query:  SFNYSSASPHSIETHKDLEKGIALLKKSVACLTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSREVRSAVSLKMASSRSHKHVQKLNKSAWNVNSVSSS
        SF+Y SA PHSIETHK+L+KGIALLKKSVAC+T+YCYNSLSLDVPSEASTFEAFAKLLATLSSS+EVRS  SLKMASSR  KHVQKLNKSAW VNS+SSS
Subjt:  SFNYSSASPHSIETHKDLEKGIALLKKSVACLTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSREVRSAVSLKMASSRSHKHVQKLNKSAWNVNSVSSS

Query:  MLLESAHSQILKTNCESSNLPSYLYATEFSDAGKNDCTIDGWDLIEHPTFPPPPSQTEDVEHWTRAMITDATK
        ML ESAHSQI+KTNCE SNLPSYLYATEFSDAGKNDCTI+ WD IEHPT PPPPSQ+E++EHWTRAMITDATK
Subjt:  MLLESAHSQILKTNCESSNLPSYLYATEFSDAGKNDCTIDGWDLIEHPTFPPPPSQTEDVEHWTRAMITDATK

XP_023549965.1 uncharacterized protein LOC111808299 [Cucurbita pepo subsp. pepo] (e value: 1.9e-232; % identity: 88.68)
Query:  MNRRFFNCAICANSNQASICTVCVNHRLNDYNSTLKSLKARRDFLYSRLSDVLVAKGKADDRLNWRVTRNEKLTRLTEKLRRSREQLEQGKAEIEVKSYD
        MNR+F NCAIC NSNQASIC  CVNHRLNDYNSTLKSL+ARRD LYSRLSDVLVAKGKADD+LNWRVTRNEKL+RL EKLRR REQLEQGKAEIE+ SYD
Subjt:  MNRRFFNCAICANSNQASICTVCVNHRLNDYNSTLKSLKARRDFLYSRLSDVLVAKGKADDRLNWRVTRNEKLTRLTEKLRRSREQLEQGKAEIEVKSYD

Query:  LKLKHAMLESARSVLEKQRIEQLEKAYPDLISTKNLGYMAITSERLHKQSVIVKQMCKLFPQRRVLVHGENKEGPGEQLDQICYVSLPRRLDPHSVQPHE
        LKLKHAMLESARSVLEKQR+EQLEKAYPDLISTK LG+MAITSERLHKQSV+VKQ+CKLFPQRRVLVHGENKEGPGEQ DQIC VSLPRRLDPHSVQPHE
Subjt:  LKLKHAMLESARSVLEKQRIEQLEKAYPDLISTKNLGYMAITSERLHKQSVIVKQMCKLFPQRRVLVHGENKEGPGEQLDQICYVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAVPALHNSGFAGSCSRIWQRDSYWNAGPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESSS
        LSASLGYMVQLLNLIV  LA PALHNSGFAGSCSRIWQR+SYW+A PSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLE + 
Subjt:  LSASLGYMVQLLNLIVQYLAVPALHNSGFAGSCSRIWQRDSYWNAGPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESSS

Query:  SFNYSSASPHSIETHKDLEKGIALLKKSVACLTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSREVRSAVSLKMASSRSHKHVQKLNKSAWNVNSVSSS
        SFNYSSAS HSIETHKDL+ GIALLKKSVAC+TAYCYNSL LDVPSEASTFEAFAKLLATLSSS+EVRS  SLKMASSRS KHVQKLNKSAWNVNS+SSS
Subjt:  SFNYSSASPHSIETHKDLEKGIALLKKSVACLTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSREVRSAVSLKMASSRSHKHVQKLNKSAWNVNSVSSS

Query:  MLLESAHSQILKTNCES---SNLPSYLYATEFSDAGKNDCTIDGWDLIEHPTFPPPPSQTEDVEHWTRAMITDATKL
        MLLESAHSQI+KTN ES   S+  SYLYATEFSDA KND TI+GWDLIEHPTFPPPPSQ ED+EHWTRAM  DATKL
Subjt:  MLLESAHSQILKTNCES---SNLPSYLYATEFSDAGKNDCTIDGWDLIEHPTFPPPPSQTEDVEHWTRAMITDATKL

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T0W2 UV radiation resistance protein/autophagy-related protein 14 (e value: 1.6e-224; % identity: 85.12)
Query:  MNRRFFNCAICANSNQASICTVCVNHRLNDYNSTLKSLKARRDFLYSRLSDVLVAKGKADDRLNWRVTRNEKLTRLTEKLRRSREQLEQGKAEIEVKSYD
        MNR+F NCAIC NSNQASICT CVN RLNDYN++LKSL+ARRD LYSRLSDVLVAKGKADD+LNWRVTRNEKLTRL EKLRRSREQLEQGKAEIE+KS+D
Subjt:  MNRRFFNCAICANSNQASICTVCVNHRLNDYNSTLKSLKARRDFLYSRLSDVLVAKGKADDRLNWRVTRNEKLTRLTEKLRRSREQLEQGKAEIEVKSYD

Query:  LKLKHAMLESARSVLEKQRIEQLEKAYPDLISTKNLGYMAITSERLHKQSVIVKQMCKLFPQRRVLVHGENKEGPGEQLDQICYVSLPRRLDPHSVQPHE
        L+LK+AMLESARSVLEKQR+EQLEKAYPDLISTKNLG+MAITSERLHKQSV++KQ+CKLFPQRRVLV G+ + GPGE  DQIC VSLPR LDPHSV+P+E
Subjt:  LKLKHAMLESARSVLEKQRIEQLEKAYPDLISTKNLGYMAITSERLHKQSVIVKQMCKLFPQRRVLVHGENKEGPGEQLDQICYVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAVPALHNSGFAGSCSRIWQRDSYWNAGPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESSS
        LSASLGYMVQLLNL+VQYLA PALHNSGFAGSCSRIWQRDSYWNA PSS+SNEYP+F+PRQ+YCSTSGENSWSDKSSSNFGVASLESERKPHLSSLE + 
Subjt:  LSASLGYMVQLLNLIVQYLAVPALHNSGFAGSCSRIWQRDSYWNAGPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESSS

Query:  SFNYSSASPHSIETHKDLEKGIALLKKSVACLTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSREVRSAVSLKMASSRSHKHVQKLNKSAWNVNSVSSS
        SFNYSSASPHSIE+HKDL+KGIALLKKSVAC+TAY YNSLSLDVPSEASTFEAFAKLLATLSSS+EVRS  SLKM SSRS KH+QK  KS WNVNS+SSS
Subjt:  SFNYSSASPHSIETHKDLEKGIALLKKSVACLTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSREVRSAVSLKMASSRSHKHVQKLNKSAWNVNSVSSS

Query:  MLLESAHSQILKTNCESSNLP----SYLYATEFSDAGKNDCTIDGWDLIEHPTFPPPPSQTEDVEHWTRAMITDATK
        ML ES HSQI+KTN E SNLP    SYLYATEFSDAGKND TI+GWDL+EHPTFPPPPSQ ED+EHWTRAM  DATK
Subjt:  MLLESAHSQILKTNCESSNLP----SYLYATEFSDAGKNDCTIDGWDLIEHPTFPPPPSQTEDVEHWTRAMITDATK

A0A6J1FFY6 uncharacterized protein LOC111445131 (e value: 4.5e-232; % identity: 88.47)
Query:  MNRRFFNCAICANSNQASICTVCVNHRLNDYNSTLKSLKARRDFLYSRLSDVLVAKGKADDRLNWRVTRNEKLTRLTEKLRRSREQLEQGKAEIEVKSYD
        MNR+F NCAIC NSNQASIC  CVNHRLNDYNSTLKSL+ RRD LYSRLSDVLVAKGKADD+LNWRVTRNEKL+RL EKLRR REQLEQGKAEIE+ SYD
Subjt:  MNRRFFNCAICANSNQASICTVCVNHRLNDYNSTLKSLKARRDFLYSRLSDVLVAKGKADDRLNWRVTRNEKLTRLTEKLRRSREQLEQGKAEIEVKSYD

Query:  LKLKHAMLESARSVLEKQRIEQLEKAYPDLISTKNLGYMAITSERLHKQSVIVKQMCKLFPQRRVLVHGENKEGPGEQLDQICYVSLPRRLDPHSVQPHE
        LKLKHAMLESARSVLEKQR+EQLEKAYPDLISTK LG+MAITSERLHKQSV+VKQ+CKLFPQRRVLVHGENKEGPGEQ DQIC VSLPRRLDPHSVQPHE
Subjt:  LKLKHAMLESARSVLEKQRIEQLEKAYPDLISTKNLGYMAITSERLHKQSVIVKQMCKLFPQRRVLVHGENKEGPGEQLDQICYVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAVPALHNSGFAGSCSRIWQRDSYWNAGPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESSS
        LSASLGYMVQLLNLIV  LA PALHNSGFAGSCSRIWQR+SYW+A PSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLE + 
Subjt:  LSASLGYMVQLLNLIVQYLAVPALHNSGFAGSCSRIWQRDSYWNAGPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESSS

Query:  SFNYSSASPHSIETHKDLEKGIALLKKSVACLTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSREVRSAVSLKMASSRSHKHVQKLNKSAWNVNSVSSS
        SFNYSSAS HSIETHKDL+ GIALLKKSVAC+TAYCYNSL LDVPSEASTFEAFAKLLATLSSS+EVRS  SLKMASSRS KHVQKLNKSAWNVNS+SSS
Subjt:  SFNYSSASPHSIETHKDLEKGIALLKKSVACLTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSREVRSAVSLKMASSRSHKHVQKLNKSAWNVNSVSSS

Query:  MLLESAHSQILKTNCES---SNLPSYLYATEFSDAGKNDCTIDGWDLIEHPTFPPPPSQTEDVEHWTRAMITDATKL
        MLLESAHSQI+KTN ES   S+  SYLYATEFSDA KND TI+GWDLIEHPTFPPPPSQ ED+EHWTRAM  DATKL
Subjt:  MLLESAHSQILKTNCES---SNLPSYLYATEFSDAGKNDCTIDGWDLIEHPTFPPPPSQTEDVEHWTRAMITDATKL

A0A6J1HFZ3 uncharacterized protein LOC111463144 isoform X1 (e value: 7.2e-230; % identity: 86.89)
Query:  MNRRFFNCAICANSNQASICTVCVNHRLNDYNSTLKSLKARRDFLYSRLSDVLVAKGKADDRLNWRVTRNEKLTRLTEKLRRSREQLEQGKAEIEVKSYD
        MNR+F NCAIC NS+QASIC VCVNHRLNDYNS LK LKARRD LYSRLSDVLVAKGKADD+LNWRVTRNEKLTRL EKLRRSRE+LEQGK EIE+ SYD
Subjt:  MNRRFFNCAICANSNQASICTVCVNHRLNDYNSTLKSLKARRDFLYSRLSDVLVAKGKADDRLNWRVTRNEKLTRLTEKLRRSREQLEQGKAEIEVKSYD

Query:  LKLKHAMLESARSVLEKQRIEQLEKAYPDLISTKNLGYMAITSERLHKQSVIVKQMCKLFPQRRVLVHGENKEGPGEQLDQICYVSLPRRLDPHSVQPHE
        LKLK+ MLESARSVLEKQR+EQLEKAYPDLISTKNLG+MAITSERLHKQS++VKQMCKLFPQRRVLVHGENKEGP EQ DQICYVSLPR LDPHSV+PHE
Subjt:  LKLKHAMLESARSVLEKQRIEQLEKAYPDLISTKNLGYMAITSERLHKQSVIVKQMCKLFPQRRVLVHGENKEGPGEQLDQICYVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAVPALHNSGFAGSCSRIWQRDSYWNAGPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESSS
        LS SLGYMVQLLNL+VQYLA PALH+SGFAGSCSRIWQRDSYW A PSS+SNEYPLFIPRQ Y STSGENSWSDK SS+FGVASLESERK HLSSLE SS
Subjt:  LSASLGYMVQLLNLIVQYLAVPALHNSGFAGSCSRIWQRDSYWNAGPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESSS

Query:  SFNYSSASPHSIETHKDLEKGIALLKKSVACLTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSREVRSAVSLKMASSRSHKHVQKLNKSAWNVNSVSSS
        SF+Y SA PHSIETHK+L+KGIALLKKSVAC+T+YCYNSLSLDVPSEASTFEAFAKLLATLSSS+EVRS  SLKMASSR  KHVQKLNKSAW VNS+SS+
Subjt:  SFNYSSASPHSIETHKDLEKGIALLKKSVACLTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSREVRSAVSLKMASSRSHKHVQKLNKSAWNVNSVSSS

Query:  MLLESAHSQILKTNCESSNLPSYLYATEFSDAGKNDCTIDGWDLIEHPTFPPPPSQTEDVEHWTRAMITDATK
        ML ESAHSQI+KTNCE SNLPSYLYATEFSDAGKNDCTI+ WD IEHPT PPPPSQ+E++EHWTRAMITDATK
Subjt:  MLLESAHSQILKTNCESSNLPSYLYATEFSDAGKNDCTIDGWDLIEHPTFPPPPSQTEDVEHWTRAMITDATK

A0A6J1JNB3 uncharacterized protein LOC111487411 isoform X1 (e value: 1.5e-230; % identity: 87.32)
Query:  MNRRFFNCAICANSNQASICTVCVNHRLNDYNSTLKSLKARRDFLYSRLSDVLVAKGKADDRLNWRVTRNEKLTRLTEKLRRSREQLEQGKAEIEVKSYD
        MNR+F NCAIC NS+QASICTVCVNHRLNDYNS LK LKARRD LYSRLSDVLVAKGKADD+LNWRVTRNEKLTRL EKLRRSRE+LEQGK EIE+ SYD
Subjt:  MNRRFFNCAICANSNQASICTVCVNHRLNDYNSTLKSLKARRDFLYSRLSDVLVAKGKADDRLNWRVTRNEKLTRLTEKLRRSREQLEQGKAEIEVKSYD

Query:  LKLKHAMLESARSVLEKQRIEQLEKAYPDLISTKNLGYMAITSERLHKQSVIVKQMCKLFPQRRVLVHGENKEGPGEQLDQICYVSLPRRLDPHSVQPHE
        LKLK+AMLESARSVLEKQR+EQLEKAYPDLISTKNLG+MAITSERLHKQS++VKQMCKLFPQR+VLVHGENKEGP EQ DQICYVSLPR LDPHSV+PHE
Subjt:  LKLKHAMLESARSVLEKQRIEQLEKAYPDLISTKNLGYMAITSERLHKQSVIVKQMCKLFPQRRVLVHGENKEGPGEQLDQICYVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAVPALHNSGFAGSCSRIWQRDSYWNAGPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESSS
        LS SLGYMVQLLNL+VQYLA PALH+SGFAGSCSRIWQRDSYW A PSS+SNEYPLFIPRQ Y STSGENSWSDK SS+FGVASLESERK HLSSLE SS
Subjt:  LSASLGYMVQLLNLIVQYLAVPALHNSGFAGSCSRIWQRDSYWNAGPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESSS

Query:  SFNYSSASPHSIETHKDLEKGIALLKKSVACLTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSREVRSAVSLKMASSRSHKHVQKLNKSAWNVNSVSSS
        SF+Y SA PHSIETHK+L+KGIALLKKSVAC+T+YCYNSLS DVPSEASTFEAFAKLLATLSSS+EVRS  SLKMASSR  KHVQKLNKSAW VNS+SSS
Subjt:  SFNYSSASPHSIETHKDLEKGIALLKKSVACLTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSREVRSAVSLKMASSRSHKHVQKLNKSAWNVNSVSSS

Query:  MLLESAHSQILKTNCESSNLPSYLYATEFSDAGKNDCTIDGWDLIEHPTFPPPPSQTEDVEHWTRAMITDATK
        ML ESAHSQI+KTNCE SNLPSYLYATEFSDAGKNDCTI+ WDLIEHPT PPPPSQ+E++EHWTRAMITDATK
Subjt:  MLLESAHSQILKTNCESSNLPSYLYATEFSDAGKNDCTIDGWDLIEHPTFPPPPSQTEDVEHWTRAMITDATK

A0A6J1K104 uncharacterized protein LOC111489176 (e value: 3.2e-230; % identity: 87.84)
Query:  MNRRFFNCAICANSNQASICTVCVNHRLNDYNSTLKSLKARRDFLYSRLSDVLVAKGKADDRLNWRVTRNEKLTRLTEKLRRSREQLEQGKAEIEVKSYD
        MNRRF NCAIC NSNQASIC  CVNHRLNDYNS LKSL+ARRD LYSRLSDVLVAKGKADD+LNWRVTRNEKL+ L EKLRR REQLEQGK EIE+ SYD
Subjt:  MNRRFFNCAICANSNQASICTVCVNHRLNDYNSTLKSLKARRDFLYSRLSDVLVAKGKADDRLNWRVTRNEKLTRLTEKLRRSREQLEQGKAEIEVKSYD

Query:  LKLKHAMLESARSVLEKQRIEQLEKAYPDLISTKNLGYMAITSERLHKQSVIVKQMCKLFPQRRVLVHGENKEGPGEQLDQICYVSLPRRLDPHSVQPHE
        LKLKHAMLESARSVLEKQR+EQLEKAYPDLISTK LG+MAITSERLHKQSV+VKQ+CKLFPQRRVLVHGENKEGPGEQ DQIC VSLPRRLDPHSVQPHE
Subjt:  LKLKHAMLESARSVLEKQRIEQLEKAYPDLISTKNLGYMAITSERLHKQSVIVKQMCKLFPQRRVLVHGENKEGPGEQLDQICYVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAVPALHNSGFAGSCSRIWQRDSYWNAGPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESSS
        LSASLGYMVQLLNLIV  LA PALHNSGFAGSCSRIWQR+SYW+A PSSQ NEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLE + 
Subjt:  LSASLGYMVQLLNLIVQYLAVPALHNSGFAGSCSRIWQRDSYWNAGPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESSS

Query:  SFNYSSASPHSIETHKDLEKGIALLKKSVACLTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSREVRSAVSLKMASSRSHKHVQKLNKSAWNVNSVSSS
        SFNYSSAS HSIETHKDL+ GIALLKKSVAC+TAYCYNSL LDVPSEASTFEAFAKLLA LSSS+EVRS  SLKMASSRS KHVQKLNKSAWNVNS+SSS
Subjt:  SFNYSSASPHSIETHKDLEKGIALLKKSVACLTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSREVRSAVSLKMASSRSHKHVQKLNKSAWNVNSVSSS

Query:  MLLESAHSQILKTNCES---SNLPSYLYATEFSDAGKNDCTIDGWDLIEHPTFPPPPSQTEDVEHWTRAMITDATKL
        MLLESAHSQI+KTN ES   S+  SYLYATEFSDA KND TI+GWDLIEHPTFPPPPSQ ED+EHWTRAM  DATKL
Subjt:  MLLESAHSQILKTNCES---SNLPSYLYATEFSDAGKNDCTIDGWDLIEHPTFPPPPSQTEDVEHWTRAMITDATKL

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G77890.1 DNA-directed RNA polymerase II protein (e value: 3.0e-111; % identity: 51.45)
Query:  CAICANSNQASICTVCVNHRLNDYNSTLKSLKARRDFLYSRLSDVLVAKGKADDRLNWRVTRNEKLTRLTEKLRRSREQLEQGKAEIEVKSYDLKLKHAM
        CA+C  S + SIC  CVN  LN+Y   L SLK+ R+  Y RLS +LV K KA  +  W+  +NEKL +L EKL+   E+L+Q K      S +LK ++ +
Subjt:  CAICANSNQASICTVCVNHRLNDYNSTLKSLKARRDFLYSRLSDVLVAKGKADDRLNWRVTRNEKLTRLTEKLRRSREQLEQGKAEIEVKSYDLKLKHAM

Query:  LESARSVLEKQRIEQLEKAYPDLISTKNLGYMAITSERLHKQSVIVKQMCKLFPQRRVLVHGENKEGPGEQLDQICYVSLPRRLDPHSVQPHELSASLGY
        +ES    LE+ R+ QLE  Y D I    L Y+ +TSERL+KQ++++KQ+CKLFP  RV V G+NK+G   Q DQIC   LP+ L+P SV P EL+ASLGY
Subjt:  LESARSVLEKQRIEQLEKAYPDLISTKNLGYMAITSERLHKQSVIVKQMCKLFPQRRVLVHGENKEGPGEQLDQICYVSLPRRLDPHSVQPHELSASLGY

Query:  MVQLLNLIVQYLAVPALHNSGFAGSCSRIWQRDSYWNAGPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLES----ERKPHLSSLESSSSFN
        MVQLLNL+V  L+VPALHN GFAGSCSRIW+RDSYWN+ PSS SN YPLF+P  ++ S   ++SW+ + ++NFGV SL+S    E   H   L+  S   
Subjt:  MVQLLNLIVQYLAVPALHNSGFAGSCSRIWQRDSYWNAGPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLES----ERKPHLSSLESSSSFN

Query:  YSSASPHSIETHKDLEKGIALLKKSVACLTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSREVRSAVSLKMASSRSHKHVQKLNKSAWNVNSVSSSMLL
         SSASPHS+ET ++L++GIA LK+SVA LT Y Y SLSL+VPS  STFE FAKLLATLSS +EV+S +SL ++SS   +H  + NKS WN+NS SSS LL
Subjt:  YSSASPHSIETHKDLEKGIALLKKSVACLTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSREVRSAVSLKMASSRSHKHVQKLNKSAWNVNSVSSSMLL

Query:  ESAHSQILKTNCESSNLPSY--LYATEFSDAGKNDCTIDGWDLIEHP
         S+H+Q    N    N+P+    Y  EF D  K+  +I  W+L+E+P
Subjt:  ESAHSQILKTNCESSNLPSY--LYATEFSDAGKNDCTIDGWDLIEHP

AT1G77890.2 DNA-directed RNA polymerase II protein (e value: 1.6e-104; % identity: 49.89)
Query:  CAICANSNQASICTVCVNHRLNDYNSTLKSLKARRDFLYSRLSDVLVAKGKADDRLNWRVTRNEKLTRLTEKLRRSREQLEQGKAEIEVKSYDLKLKHAM
        CA+C  S + SIC  CVN  LN+Y   L SLK+ R+  Y RLS +LV K KA  +  W+  +NEKL +L EKL+   E+L+Q K      S +LK ++ +
Subjt:  CAICANSNQASICTVCVNHRLNDYNSTLKSLKARRDFLYSRLSDVLVAKGKADDRLNWRVTRNEKLTRLTEKLRRSREQLEQGKAEIEVKSYDLKLKHAM

Query:  LESARSVLEKQRIEQLEKAYPDLISTKNLGYMAITSERLHKQSVIVKQMCKLFPQRRVLVHGENKEGPGEQLDQICYVSLPRRLDPHSVQPHELSASLGY
        +ES    LE+ R+ QLE  Y D I    L Y           ++++KQ+CKLFP  RV V G+NK+G   Q DQIC   LP+ L+P SV P EL+ASLGY
Subjt:  LESARSVLEKQRIEQLEKAYPDLISTKNLGYMAITSERLHKQSVIVKQMCKLFPQRRVLVHGENKEGPGEQLDQICYVSLPRRLDPHSVQPHELSASLGY

Query:  MVQLLNLIVQYLAVPALHNSGFAGSCSRIWQRDSYWNAGPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLES----ERKPHLSSLESSSSFN
        MVQLLNL+V  L+VPALHN GFAGSCSRIW+RDSYWN+ PSS SN YPLF+P  ++ S   ++SW+ + ++NFGV SL+S    E   H   L+  S   
Subjt:  MVQLLNLIVQYLAVPALHNSGFAGSCSRIWQRDSYWNAGPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLES----ERKPHLSSLESSSSFN

Query:  YSSASPHSIETHKDLEKGIALLKKSVACLTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSREVRSAVSLKMASSRSHKHVQKLNKSAWNVNSVSSSMLL
         SSASPHS+ET ++L++GIA LK+SVA LT Y Y SLSL+VPS  STFE FAKLLATLSS +EV+S +SL ++SS   +H  + NKS WN+NS SSS LL
Subjt:  YSSASPHSIETHKDLEKGIALLKKSVACLTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSREVRSAVSLKMASSRSHKHVQKLNKSAWNVNSVSSSMLL

Query:  ESAHSQILKTNCESSNLPSY--LYATEFSDAGKNDCTIDGWDLIEHP
         S+H+Q    N    N+P+    Y  EF D  K+  +I  W+L+E+P
Subjt:  ESAHSQILKTNCESSNLPSY--LYATEFSDAGKNDCTIDGWDLIEHP

AT1G77890.3 DNA-directed RNA polymerase II protein (e value: 3.7e-109; % identity: 51.23)
Query:  CAICANSNQASICTVCVNHRLNDYNSTLKSLKARRDFLYSRLSDVLVAKGKADDRLNWRVTRNEKLTRLTEKLRRSREQLEQGKAEIEVKSYDLKLKHAM
        CA+C  S + SIC  CVN  LN+Y   L SLK+ R+  Y RLS +LV K KA  +  W+  +NEKL +L EKL+   E+L+Q K      S +LK ++ +
Subjt:  CAICANSNQASICTVCVNHRLNDYNSTLKSLKARRDFLYSRLSDVLVAKGKADDRLNWRVTRNEKLTRLTEKLRRSREQLEQGKAEIEVKSYDLKLKHAM

Query:  LESARSVLEKQRIEQLEKAYPDLISTKNLGYMAITSERLHKQSVIVKQMCKLFPQRRVLVHGENKEGPGEQLDQICYVSLPRRLDPHSVQPHELSASLGY
        +ES    LE+ R+ QLE  Y D I    L  + +TSERL+KQ++++KQ+CKLFP  RV V G+NK+G   Q DQIC   LP+ L+P SV P EL+ASLGY
Subjt:  LESARSVLEKQRIEQLEKAYPDLISTKNLGYMAITSERLHKQSVIVKQMCKLFPQRRVLVHGENKEGPGEQLDQICYVSLPRRLDPHSVQPHELSASLGY

Query:  MVQLLNLIVQYLAVPALHNSGFAGSCSRIWQRDSYWNAGPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLES----ERKPHLSSLESSSSFN
        MVQLLNL+V  L+VPALHN GFAGSCSRIW+RDSYWN+ PSS SN YPLF+P  ++ S   ++SW+ + ++NFGV SL+S    E   H   L+  S   
Subjt:  MVQLLNLIVQYLAVPALHNSGFAGSCSRIWQRDSYWNAGPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLES----ERKPHLSSLESSSSFN

Query:  YSSASPHSIETHKDLEKGIALLKKSVACLTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSREVRSAVSLKMASSRSHKHVQKLNKSAWNVNSVSSSMLL
         SSASPHS+ET ++L++GIA LK+SVA LT Y Y SLSL+VPS  STFE FAKLLATLSS +EV+S +SL ++SS   +H  + NKS WN+NS SSS LL
Subjt:  YSSASPHSIETHKDLEKGIALLKKSVACLTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSREVRSAVSLKMASSRSHKHVQKLNKSAWNVNSVSSSMLL

Query:  ESAHSQILKTNCESSNLPSY--LYATEFSDAGKNDCTIDGWDLIEHP
         S+H+Q    N    N+P+    Y  EF D  K+  +I  W+L+E+P
Subjt:  ESAHSQILKTNCESSNLPSY--LYATEFSDAGKNDCTIDGWDLIEHP

AT4G08540.1 DNA-directed RNA polymerase II protein (e value: 1.3e-170; % identity: 65.82)
Query:  MNRRFFNCAICANSNQASICTVCVNHRLNDYNSTLKSLKARRDFLYSRLSDVLVAKGKADDRLNWRVTRNEKLTRLTEKLRRSREQLEQGKAEIEVKSYD
        M +R  NCAIC N+N+  ICT CVNHRL +YN+ LKSLK RRD L SR +++L +KGKADD+ NWR+ +NEK+++L +KL+ ++E + QGK +IE  S D
Subjt:  MNRRFFNCAICANSNQASICTVCVNHRLNDYNSTLKSLKARRDFLYSRLSDVLVAKGKADDRLNWRVTRNEKLTRLTEKLRRSREQLEQGKAEIEVKSYD

Query:  LKLKHAMLESARSVLEKQRIEQLEKAYPDLISTKNLGYMAITSERLHKQSVIVKQMCKLFPQRRVLVHGENKEGPGEQLDQICYVSLPRRLDPHSVQPHE
        LK+K+ +L+SARS LEK R+EQ+EK +P+LI T++LG+MAI+SERLHKQSV+VKQ+CKLFP RRV   GE++ G   Q D IC   LP  LDPHS+   E
Subjt:  LKLKHAMLESARSVLEKQRIEQLEKAYPDLISTKNLGYMAITSERLHKQSVIVKQMCKLFPQRRVLVHGENKEGPGEQLDQICYVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAVPALHNSGFAGSCSRIWQRDSYWNAGPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESSS
        L+ SLGYMVQLLNL+V  LA PALH+SGFAGSCSRIWQRDSYW+   S++SNEYPLFIPR+NYCSTS ENSW+DK+SSNFGVAS+ES+RK        S+
Subjt:  LSASLGYMVQLLNLIVQYLAVPALHNSGFAGSCSRIWQRDSYWNAGPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESSS

Query:  SFNYSSASPHSIETHKDLEKGIALLKKSVACLTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSREVRSAVSLKMASSRSHKHVQKLNKSAWNVNSVSSS
        SF YSSASPHSIE+H+DL+KGIALLKKSVACLTAYCYNSL L+VP EASTFEAFAKLLATLSSS+EVRS  SLKMASSRS K  Q+LNKS WN +SV SS
Subjt:  SFNYSSASPHSIETHKDLEKGIALLKKSVACLTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSREVRSAVSLKMASSRSHKHVQKLNKSAWNVNSVSSS

Query:  MLLESAHSQILKTNCESSNLP-SYLYATEFSDAGKNDCTIDGWDLIEHPTFPPPPSQTEDVEHWTRAMITDATK
         LLESAH     +  +  N P SYL ATE S    ND  ++GWDL+EHP +PPPPSQ+EDVEHWTRAM  DA K
Subjt:  MLLESAHSQILKTNCESSNLP-SYLYATEFSDAGKNDCTIDGWDLIEHPTFPPPPSQTEDVEHWTRAMITDATK
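
In the alignments above, the unlabeled line between Query and Subjt is the BLAST midline: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The Subjt lines in this record repeat the query text, so the sketch below reads the counts from the midline alone. The function name `midline_stats` is hypothetical, and the example covers only the first ten columns of the first GenBank hit, so its percentage is not comparable to the whole-alignment % identity listed for each hit.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST alignment block.

    The query segment sets the block width, because trailing spaces
    on the midline are often lost when a page is copied as text.
    """
    width = len(query)
    padded = midline.ljust(width)[:width]
    identical = sum(ch.isalpha() for ch in padded)   # letters = identical residues
    positive = identical + padded.count("+")         # '+' = conservative substitution
    return {
        "length": width,
        "identical": identical,
        "positive": positive,
        "pct_identity": round(100.0 * identical / width, 2) if width else 0.0,
    }


# First ten alignment columns of the KAG7015913.1 hit.
stats = midline_stats("MNRRFFNCAI", "MNR+F NCAI")
print(stats)
```

Summing `identical` and `length` over every block of a hit before dividing would give a per-hit figure, provided the gap columns in the query lines are preserved.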


Sequences
CDS sequence
ATGAATCGGAGATTCTTTAACTGCGCTATCTGTGCGAATTCAAATCAAGCTTCTATTTGCACCGTTTGCGTCAATCACAGATTGAATGACTACAACTCGACCTTG
AAATCGTTGAAAGCTCGGCGGGATTTTTTGTATTCGAGGTTGAGTGACGTGCTTGTGGCCAAGGGGAAGGCTGACGATCGATTAAATTGGAGAGTGACTCGGAAT
GAGAAACTTACAAGGTTAACGGAGAAACTCCGGCGCAGTAGAGAGCAACTCGAGCAAGGAAAGGCAGAGATTGAGGTGAAGTCCTATGATCTCAAATTGAAACAT
GCTATGCTCGAGTCAGCACGTTCGGTGTTGGAAAAACAACGAATTGAACAACTGGAGAAGGCCTATCCTGACCTTATTAGCACCAAGAATCTGGGATATATGGCA
ATTACCTCTGAACGGCTTCACAAACAATCAGTGATTGTAAAACAAATGTGCAAATTGTTTCCACAACGACGGGTTCTAGTTCATGGAGAGAACAAAGAGGGACCT
GGCGAGCAATTGGACCAAATCTGCTATGTAAGCTTACCAAGAAGACTGGATCCCCATTCTGTTCAGCCACATGAGCTTTCGGCTTCTTTGGGATACATGGTGCAA
CTTCTAAATCTTATTGTTCAATATTTGGCTGTTCCTGCTCTTCACAACTCTGGTTTTGCAGGTTCTTGTTCACGCATATGGCAAAGGGATTCATATTGGAATGCT
GGCCCATCTTCTCAAAGCAATGAGTATCCACTTTTTATACCACGTCAAAACTATTGTTCAACAAGTGGGGAAAATTCATGGTCTGATAAAAGCTCTAGTAACTTC
GGTGTTGCTTCGTTGGAATCAGAGAGGAAACCTCATTTAAGTTCACTAGAAAGTAGTAGTAGCTTCAATTATTCGTCCGCTTCTCCACATTCGATCGAAACGCAC
AAGGATTTGGAGAAAGGGATTGCCCTCCTCAAGAAAAGTGTAGCATGTCTCACCGCATACTGCTATAATTCTCTCTCCTTGGACGTTCCTTCCGAAGCTTCTACT
TTTGAAGCATTTGCTAAATTATTGGCCACTCTTTCTTCATCCCGGGAAGTGCGTTCGGCTGTTTCTCTCAAAATGGCTTCTTCCAGGTCCCATAAGCACGTTCAG
AAACTGAACAAATCTGCATGGAATGTAAATTCGGTTTCATCCAGCATGCTGCTCGAGAGTGCACATTCACAAATATTGAAAACCAATTGTGAGAGTAGTAACCTT
CCCAGTTATCTTTATGCCACTGAATTTTCTGATGCTGGAAAAAATGATTGCACCATTGACGGATGGGACCTCATAGAGCATCCAACCTTTCCCCCTCCACCATCT
CAAACTGAAGATGTCGAGCATTGGACTCGAGCAATGATAACCGATGCCACCAAACTGTAA
mRNA sequence
GATTAAAAATCTGCCGAACTAAAAAAAACTTCTAATCCCTTTTATTTATCTACGCGAAGAAAGAGATTTAAACTTTACGGCGTTCAAGAGTTTCCTTGTTAAAGC
TTCATCCCAATTATCGGCTCCGGCGATAGTTTTTCCCCCTTCTTCACCGATTTCTCAATCCAAACTTCAAATCATCGCTCTCAAAGCTCGAAGATCATCATCGGA
TCAAATTCCTTCGATATCATGATTACAAATTGATCGACGATGAATCGGAGATTCTTTAACTGCGCTATCTGTGCGAATTCAAATCAAGCTTCTATTTGCACCGTT
TGCGTCAATCACAGATTGAATGACTACAACTCGACCTTGAAATCGTTGAAAGCTCGGCGGGATTTTTTGTATTCGAGGTTGAGTGACGTGCTTGTGGCCAAGGGG
AAGGCTGACGATCGATTAAATTGGAGAGTGACTCGGAATGAGAAACTTACAAGGTTAACGGAGAAACTCCGGCGCAGTAGAGAGCAACTCGAGCAAGGAAAGGCA
GAGATTGAGGTGAAGTCCTATGATCTCAAATTGAAACATGCTATGCTCGAGTCAGCACGTTCGGTGTTGGAAAAACAACGAATTGAACAACTGGAGAAGGCCTAT
CCTGACCTTATTAGCACCAAGAATCTGGGATATATGGCAATTACCTCTGAACGGCTTCACAAACAATCAGTGATTGTAAAACAAATGTGCAAATTGTTTCCACAA
CGACGGGTTCTAGTTCATGGAGAGAACAAAGAGGGACCTGGCGAGCAATTGGACCAAATCTGCTATGTAAGCTTACCAAGAAGACTGGATCCCCATTCTGTTCAG
CCACATGAGCTTTCGGCTTCTTTGGGATACATGGTGCAACTTCTAAATCTTATTGTTCAATATTTGGCTGTTCCTGCTCTTCACAACTCTGGTTTTGCAGGTTCT
TGTTCACGCATATGGCAAAGGGATTCATATTGGAATGCTGGCCCATCTTCTCAAAGCAATGAGTATCCACTTTTTATACCACGTCAAAACTATTGTTCAACAAGT
GGGGAAAATTCATGGTCTGATAAAAGCTCTAGTAACTTCGGTGTTGCTTCGTTGGAATCAGAGAGGAAACCTCATTTAAGTTCACTAGAAAGTAGTAGTAGCTTC
AATTATTCGTCCGCTTCTCCACATTCGATCGAAACGCACAAGGATTTGGAGAAAGGGATTGCCCTCCTCAAGAAAAGTGTAGCATGTCTCACCGCATACTGCTAT
AATTCTCTCTCCTTGGACGTTCCTTCCGAAGCTTCTACTTTTGAAGCATTTGCTAAATTATTGGCCACTCTTTCTTCATCCCGGGAAGTGCGTTCGGCTGTTTCT
CTCAAAATGGCTTCTTCCAGGTCCCATAAGCACGTTCAGAAACTGAACAAATCTGCATGGAATGTAAATTCGGTTTCATCCAGCATGCTGCTCGAGAGTGCACAT
TCACAAATATTGAAAACCAATTGTGAGAGTAGTAACCTTCCCAGTTATCTTTATGCCACTGAATTTTCTGATGCTGGAAAAAATGATTGCACCATTGACGGATGG
GACCTCATAGAGCATCCAACCTTTCCCCCTCCACCATCTCAAACTGAAGATGTCGAGCATTGGACTCGAGCAATGATAACCGATGCCACCAAACTGTAATTCAAT
GGTGTAGGACTTGGAACTACTCAATGCAGCTTATGACCTGTAAATATGAAGACTTCAAAAAGGTGACATCCTTGAACATATTTAGGTTTAGGATAACTTTTCTTA
TTCCATTATCTTAGCCAATAGTATATTTGGACATTTTTTCAAATGTGTGCATATGGGGGAAATGGGTTGTGGAAATGTCTTTCAAAATGTAAATAGAAATTCCAT
TCCCTATGATGTCATGATTTCATTAGTTTGAAACAATCCACAAAGATGATCTCACTCTACACAACACCAACCCAAACACTAATACCTTATGT
Protein sequence
MNRRFFNCAICANSNQASICTVCVNHRLNDYNSTLKSLKARRDFLYSRLSDVLVAKGKADDRLNWRVTRNEKLTRLTEKLRRSREQLEQGKAEIEVKSYDLKLKH
AMLESARSVLEKQRIEQLEKAYPDLISTKNLGYMAITSERLHKQSVIVKQMCKLFPQRRVLVHGENKEGPGEQLDQICYVSLPRRLDPHSVQPHELSASLGYMVQ
LLNLIVQYLAVPALHNSGFAGSCSRIWQRDSYWNAGPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESSSSFNYSSASPHSIETH
KDLEKGIALLKKSVACLTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSREVRSAVSLKMASSRSHKHVQKLNKSAWNVNSVSSSMLLESAHSQILKTNCESSNL
PSYLYATEFSDAGKNDCTIDGWDLIEHPTFPPPPSQTEDVEHWTRAMITDATKL
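
The protein sequence corresponds to the CDS read codon by codon. As a rough illustration, the sketch below translates the first 30 and the last 12 nucleotides of the CDS with the standard genetic code. The `translate` helper is hypothetical, and the use of the standard nuclear code is an assumption, since this page does not name a translation table.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()   # drop line breaks from wrapped sequence
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")   # X for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGAATCGGAGATTCTTTAACTGCGCTATC"   # first 30 nt of the CDS
cds_end = "ACCAAACTGTAA"                       # last 12 nt of the CDS
print(translate(cds_start))  # MNRRFFNCAI
print(translate(cds_end))    # TKL
```

Both fragments agree with the start (MNRRFFNCAI) and end (TKL) of the protein sequence listed above.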