; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0030413 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0030413
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDNA-directed RNA polymerase II protein
Genome locationchr8:47042107..47047917
RNA-Seq ExpressionLag0030413
SyntenyLag0030413
Gene Ontology termsGO:0035493 - SNARE complex assembly (biological process)
GO:0000323 - lytic vacuole (cellular component)
GO:0005768 - endosome (cellular component)
GO:0000149 - SNARE binding (molecular function)
InterPro domainsIPR018791 - UV radiation resistance protein/autophagy-related protein 14


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7015913.1 hypothetical protein SDJN02_21017 [Cucurbita argyrosperma subsp. argyrosperma]1.9e-24091.37Show/hide
Query:  MNRRFCNCAICENSNQAFICTVCVNHRLNDYNSSLKSLRARRDLFYSRLSDVLVAKGKADDQFNWRVTRNEKLTRLREKLRRSREQLEQGKAEIEMMSYD
        MNR+FCNCAICENSNQA IC  CVNHRLNDYNS+LKSLR RRD  YSRLSDVLVAKGKADDQ NWR+TRNEKL+RLREKLRR REQLEQGKAEIEM SYD
Subjt:  MNRRFCNCAICENSNQAFICTVCVNHRLNDYNSSLKSLRARRDLFYSRLSDVLVAKGKADDQFNWRVTRNEKLTRLREKLRRSREQLEQGKAEIEMMSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVYGENKEGPGEQFDQICNVSLPKRLDPHSVQPHE
        LKLK+AMLESARSVLEKQRVEQLEKAYPDLISTK LGHMAITSERLHKQSVVVKQICKLFPQRRVLV+GENKEGPGEQFDQICNVSLP+RLDPHSVQPHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVYGENKEGPGEQFDQICNVSLPKRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRIWQRDSYCDARPSSRSNEYPLFIPRQNYCSTNGENSWSDKSSSNFGVASLESERKPHIGSLENSS
        LSASLGYMVQLLNLIV  LAAPALHNSGFAGSCSRIWQR+SY DA PSS+SNEYPLFIPRQNYCST+GENSWSDKSSSNFGVASLESERKPH+ SLEN S
Subjt:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRIWQRDSYCDARPSSRSNEYPLFIPRQNYCSTNGENSWSDKSSSNFGVASLESERKPHIGSLENSS

Query:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
        FNYSSAS HSIETHKDLQ GIALLKKSVAC+TAYCYNSL LDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
Subjt:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM

Query:  LLESAHSQIMKTNCESNLP----SYLYATEFSDAGKNDCTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMITDATK
        LLESAHSQIMKTN ESN P    SYLYATEFSDA KND TIEGWDLIEHPTFPPPPSQAEDIEHWTRAM  DATK
Subjt:  LLESAHSQIMKTNCESNLP----SYLYATEFSDAGKNDCTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMITDATK

XP_022939132.1 uncharacterized protein LOC111445131 [Cucurbita moschata]8.4e-24191.58Show/hide
Query:  MNRRFCNCAICENSNQAFICTVCVNHRLNDYNSSLKSLRARRDLFYSRLSDVLVAKGKADDQFNWRVTRNEKLTRLREKLRRSREQLEQGKAEIEMMSYD
        MNR+FCNCAICENSNQA IC  CVNHRLNDYNS+LKSLR RRD  YSRLSDVLVAKGKADDQ NWRVTRNEKL+RLREKLRR REQLEQGKAEIEM SYD
Subjt:  MNRRFCNCAICENSNQAFICTVCVNHRLNDYNSSLKSLRARRDLFYSRLSDVLVAKGKADDQFNWRVTRNEKLTRLREKLRRSREQLEQGKAEIEMMSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVYGENKEGPGEQFDQICNVSLPKRLDPHSVQPHE
        LKLK+AMLESARSVLEKQRVEQLEKAYPDLISTK LGHMAITSERLHKQSVVVKQICKLFPQRRVLV+GENKEGPGEQFDQICNVSLP+RLDPHSVQPHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVYGENKEGPGEQFDQICNVSLPKRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRIWQRDSYCDARPSSRSNEYPLFIPRQNYCSTNGENSWSDKSSSNFGVASLESERKPHIGSLENSS
        LSASLGYMVQLLNLIV  LAAPALHNSGFAGSCSRIWQR+SY DA PSS+SNEYPLFIPRQNYCST+GENSWSDKSSSNFGVASLESERKPH+ SLEN S
Subjt:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRIWQRDSYCDARPSSRSNEYPLFIPRQNYCSTNGENSWSDKSSSNFGVASLESERKPHIGSLENSS

Query:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
        FNYSSAS HSIETHKDLQ GIALLKKSVAC+TAYCYNSL LDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
Subjt:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM

Query:  LLESAHSQIMKTNCESNLP----SYLYATEFSDAGKNDCTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMITDATK
        LLESAHSQIMKTN ESN P    SYLYATEFSDA KND TIEGWDLIEHPTFPPPPSQAEDIEHWTRAM  DATK
Subjt:  LLESAHSQIMKTNCESNLP----SYLYATEFSDAGKNDCTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMITDATK

XP_022993038.1 uncharacterized protein LOC111489176 [Cucurbita maxima]5.5e-24091.37Show/hide
Query:  MNRRFCNCAICENSNQAFICTVCVNHRLNDYNSSLKSLRARRDLFYSRLSDVLVAKGKADDQFNWRVTRNEKLTRLREKLRRSREQLEQGKAEIEMMSYD
        MNRRFCNCAICENSNQA IC  CVNHRLNDYNS+LKSLRARRDL YSRLSDVLVAKGKADDQ NWRVTRNEKL+ LREKLRR REQLEQGK EIEM SYD
Subjt:  MNRRFCNCAICENSNQAFICTVCVNHRLNDYNSSLKSLRARRDLFYSRLSDVLVAKGKADDQFNWRVTRNEKLTRLREKLRRSREQLEQGKAEIEMMSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVYGENKEGPGEQFDQICNVSLPKRLDPHSVQPHE
        LKLK+AMLESARSVLEKQRVEQLEKAYPDLISTK LGHMAITSERLHKQSVVVKQICKLFPQRRVLV+GENKEGPGEQFDQICNVSLP+RLDPHSVQPHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVYGENKEGPGEQFDQICNVSLPKRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRIWQRDSYCDARPSSRSNEYPLFIPRQNYCSTNGENSWSDKSSSNFGVASLESERKPHIGSLENSS
        LSASLGYMVQLLNLIV  LAAPALHNSGFAGSCSRIWQR+SY DA PSS+ NEYPLFIPRQNYCST+GENSWSDKSSSNFGVASLESERKPH+ SLEN S
Subjt:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRIWQRDSYCDARPSSRSNEYPLFIPRQNYCSTNGENSWSDKSSSNFGVASLESERKPHIGSLENSS

Query:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
        FNYSSAS HSIETHKDLQ GIALLKKSVAC+TAYCYNSL LDVPSEASTFEAFAKLLA LSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
Subjt:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM

Query:  LLESAHSQIMKTNCESNLP----SYLYATEFSDAGKNDCTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMITDATK
        LLESAHSQIMKTN ESN P    SYLYATEFSDA KND TIEGWDLIEHPTFPPPPSQAEDIEHWTRAM  DATK
Subjt:  LLESAHSQIMKTNCESNLP----SYLYATEFSDAGKNDCTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMITDATK

XP_023518763.1 uncharacterized protein LOC111782150 isoform X1 [Cucurbita pepo subsp. pepo]1.2e-23989.83Show/hide
Query:  MNRRFCNCAICENSNQAFICTVCVNHRLNDYNSSLKSLRARRDLFYSRLSDVLVAKGKADDQFNWRVTRNEKLTRLREKLRRSREQLEQGKAEIEMMSYD
        MNR+FCNCAICENS+QA ICTVCVNHRLNDYNS+LK L+ARRDL YSRLSDVLVAKGKADDQ NWRVTRNEKLTRLREKLRRSRE+LEQGK EIE+ SYD
Subjt:  MNRRFCNCAICENSNQAFICTVCVNHRLNDYNSSLKSLRARRDLFYSRLSDVLVAKGKADDQFNWRVTRNEKLTRLREKLRRSREQLEQGKAEIEMMSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVYGENKEGPGEQFDQICNVSLPKRLDPHSVQPHE
        LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQS+VVKQ+CKLFPQRRVLV+GENKEGP EQFDQIC VSLP+ LDPHSV+PHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVYGENKEGPGEQFDQICNVSLPKRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRIWQRDSYCDARPSSRSNEYPLFIPRQNYCSTNGENSWSDKSSSNFGVASLESERKPHIGSLENSS
        LS SLGYMVQLLNL+VQYLAAPALH+SGFAGSCSRIWQRDSY  ARPSSRSNEYPLFIPRQ Y ST+GENSWSDK SS+FGVASLESERK H+ SLE+SS
Subjt:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRIWQRDSYCDARPSSRSNEYPLFIPRQNYCSTNGENSWSDKSSSNFGVASLESERKPHIGSLENSS

Query:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
        F+Y SA PHSIETHK+LQKGIALLKKSVACVT+YCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSR PKHVQKLNKSAW VNSISSSM
Subjt:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM

Query:  LLESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMITDATKK
        L ESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIE WD IEHPT PPPPSQ+E+IEHWTRAMITDATK+
Subjt:  LLESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMITDATKK

XP_023549965.1 uncharacterized protein LOC111808299 [Cucurbita pepo subsp. pepo]5.8e-24292Show/hide
Query:  MNRRFCNCAICENSNQAFICTVCVNHRLNDYNSSLKSLRARRDLFYSRLSDVLVAKGKADDQFNWRVTRNEKLTRLREKLRRSREQLEQGKAEIEMMSYD
        MNR+FCNCAICENSNQA IC  CVNHRLNDYNS+LKSLRARRDL YSRLSDVLVAKGKADDQ NWRVTRNEKL+RLREKLRR REQLEQGKAEIEM SYD
Subjt:  MNRRFCNCAICENSNQAFICTVCVNHRLNDYNSSLKSLRARRDLFYSRLSDVLVAKGKADDQFNWRVTRNEKLTRLREKLRRSREQLEQGKAEIEMMSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVYGENKEGPGEQFDQICNVSLPKRLDPHSVQPHE
        LKLK+AMLESARSVLEKQRVEQLEKAYPDLISTK LGHMAITSERLHKQSVVVKQICKLFPQRRVLV+GENKEGPGEQFDQICNVSLP+RLDPHSVQPHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVYGENKEGPGEQFDQICNVSLPKRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRIWQRDSYCDARPSSRSNEYPLFIPRQNYCSTNGENSWSDKSSSNFGVASLESERKPHIGSLENSS
        LSASLGYMVQLLNLIV  LAAPALHNSGFAGSCSRIWQR+SY DA PSS+SNEYPLFIPRQNYCST+GENSWSDKSSSNFGVASLESERKPH+ SLEN S
Subjt:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRIWQRDSYCDARPSSRSNEYPLFIPRQNYCSTNGENSWSDKSSSNFGVASLESERKPHIGSLENSS

Query:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
        FNYSSAS HSIETHKDLQ GIALLKKSVAC+TAYCYNSL LDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
Subjt:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM

Query:  LLESAHSQIMKTNCESNLP----SYLYATEFSDAGKNDCTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMITDATK
        LLESAHSQIMKTN ESN P    SYLYATEFSDA KND TIEGWDLIEHPTFPPPPSQAEDIEHWTRAM  DATK
Subjt:  LLESAHSQIMKTNCESNLP----SYLYATEFSDAGKNDCTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMITDATK

TrEMBL top hitse value%identityAlignment
A0A6J1DE02 uncharacterized protein LOC111020186 isoform X12.7e-23788.66Show/hide
Query:  NRRFCNCAICENSNQAFICTVCVNHRLNDYNSSLKSLRARRDLFYSRLSDVLVAKGKADDQFNWRVTRNEKLTRLREKLRRSREQLEQGKAEIEMMSYDL
        NR+FCNCAICENSNQAFICT+CVN+RLNDYNS+LKSL+ARRD  YSRLSDVLVAKGKADDQ NWRVTRNEKL RLREKL+RSREQLE+GKAEIEM SYDL
Subjt:  NRRFCNCAICENSNQAFICTVCVNHRLNDYNSSLKSLRARRDLFYSRLSDVLVAKGKADDQFNWRVTRNEKLTRLREKLRRSREQLEQGKAEIEMMSYDL

Query:  KLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVYGENKEGPGEQFDQICNVSLPKRLDPHSVQPHEL
        KLKYAMLESARSVLEKQRVEQLEK+YPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLV G+  EG GEQFDQICNV LP+RLDPHSV P+EL
Subjt:  KLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVYGENKEGPGEQFDQICNVSLPKRLDPHSVQPHEL

Query:  SASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRIWQRDSYCDARPSSRSNEYPLFIPRQNYCSTNGENSWSDKSSSNFGVASLESERKPHIGSLENSSF
        +ASLGYMVQLLNLIVQ LAAPALHNSGFAGSCSRIWQRDSY +ARPSSRSNEYPLFIPRQNYCST+GENSWSDKSSSNFGVAS+ESE+KPH+GSLE+SSF
Subjt:  SASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRIWQRDSYCDARPSSRSNEYPLFIPRQNYCSTNGENSWSDKSSSNFGVASLESERKPHIGSLENSSF

Query:  NYSSASPHSIETHKDLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNS-ISSSM
        NYSSASPHSIETHKDLQKGIALLKKSVAC+TAY YNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSV+SLKM SSRSPKHVQKLNKSAWNV+S   SSM
Subjt:  NYSSASPHSIETHKDLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNS-ISSSM

Query:  LLESAHSQIMKTNCESNLP----SYLYATEFSDAGKNDCTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMITDATKK
        LL+S H+ IMK NCESNLP    SYLYATEFSD GKNDC+IEGWDL+EHPTFPPPPSQAEDIEHWTRAM  DAT+K
Subjt:  LLESAHSQIMKTNCESNLP----SYLYATEFSDAGKNDCTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMITDATKK

A0A6J1FFY6 uncharacterized protein LOC1114451314.1e-24191.58Show/hide
Query:  MNRRFCNCAICENSNQAFICTVCVNHRLNDYNSSLKSLRARRDLFYSRLSDVLVAKGKADDQFNWRVTRNEKLTRLREKLRRSREQLEQGKAEIEMMSYD
        MNR+FCNCAICENSNQA IC  CVNHRLNDYNS+LKSLR RRD  YSRLSDVLVAKGKADDQ NWRVTRNEKL+RLREKLRR REQLEQGKAEIEM SYD
Subjt:  MNRRFCNCAICENSNQAFICTVCVNHRLNDYNSSLKSLRARRDLFYSRLSDVLVAKGKADDQFNWRVTRNEKLTRLREKLRRSREQLEQGKAEIEMMSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVYGENKEGPGEQFDQICNVSLPKRLDPHSVQPHE
        LKLK+AMLESARSVLEKQRVEQLEKAYPDLISTK LGHMAITSERLHKQSVVVKQICKLFPQRRVLV+GENKEGPGEQFDQICNVSLP+RLDPHSVQPHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVYGENKEGPGEQFDQICNVSLPKRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRIWQRDSYCDARPSSRSNEYPLFIPRQNYCSTNGENSWSDKSSSNFGVASLESERKPHIGSLENSS
        LSASLGYMVQLLNLIV  LAAPALHNSGFAGSCSRIWQR+SY DA PSS+SNEYPLFIPRQNYCST+GENSWSDKSSSNFGVASLESERKPH+ SLEN S
Subjt:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRIWQRDSYCDARPSSRSNEYPLFIPRQNYCSTNGENSWSDKSSSNFGVASLESERKPHIGSLENSS

Query:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
        FNYSSAS HSIETHKDLQ GIALLKKSVAC+TAYCYNSL LDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
Subjt:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM

Query:  LLESAHSQIMKTNCESNLP----SYLYATEFSDAGKNDCTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMITDATK
        LLESAHSQIMKTN ESN P    SYLYATEFSDA KND TIEGWDLIEHPTFPPPPSQAEDIEHWTRAM  DATK
Subjt:  LLESAHSQIMKTNCESNLP----SYLYATEFSDAGKNDCTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMITDATK

A0A6J1HFZ3 uncharacterized protein LOC111463144 isoform X11.4e-23889.19Show/hide
Query:  MNRRFCNCAICENSNQAFICTVCVNHRLNDYNSSLKSLRARRDLFYSRLSDVLVAKGKADDQFNWRVTRNEKLTRLREKLRRSREQLEQGKAEIEMMSYD
        MNR+FCNCAICENS+QA IC VCVNHRLNDYNS+LK L+ARRDL YSRLSDVLVAKGKADDQ NWRVTRNEKLTRLREKLRRSRE+LEQGK EIE+ SYD
Subjt:  MNRRFCNCAICENSNQAFICTVCVNHRLNDYNSSLKSLRARRDLFYSRLSDVLVAKGKADDQFNWRVTRNEKLTRLREKLRRSREQLEQGKAEIEMMSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVYGENKEGPGEQFDQICNVSLPKRLDPHSVQPHE
        LKLKY MLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQS+VVKQ+CKLFPQRRVLV+GENKEGP EQFDQIC VSLP+ LDPHSV+PHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVYGENKEGPGEQFDQICNVSLPKRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRIWQRDSYCDARPSSRSNEYPLFIPRQNYCSTNGENSWSDKSSSNFGVASLESERKPHIGSLENSS
        LS SLGYMVQLLNL+VQYLAAPALH+SGFAGSCSRIWQRDSY  ARPSSRSNEYPLFIPRQ Y ST+GENSWSDK SS+FGVASLESERK H+ SLE+SS
Subjt:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRIWQRDSYCDARPSSRSNEYPLFIPRQNYCSTNGENSWSDKSSSNFGVASLESERKPHIGSLENSS

Query:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
        F+Y SA PHSIETHK+LQKGIALLKKSVACVT+YCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSR PKHVQKLNKSAW VNSISS+M
Subjt:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM

Query:  LLESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMITDATKK
        L ESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIE WD IEHPT PPPPSQ+E+IEHWTRAMITDATK+
Subjt:  LLESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMITDATKK

A0A6J1JNB3 uncharacterized protein LOC111487411 isoform X12.9e-23989.62Show/hide
Query:  MNRRFCNCAICENSNQAFICTVCVNHRLNDYNSSLKSLRARRDLFYSRLSDVLVAKGKADDQFNWRVTRNEKLTRLREKLRRSREQLEQGKAEIEMMSYD
        MNR+FCNCAICENS+QA ICTVCVNHRLNDYNS+LK L+ARRDL YSRLSDVLVAKGKADDQ NWRVTRNEKLTRLREKLRRSRE+LEQGK EIE+ SYD
Subjt:  MNRRFCNCAICENSNQAFICTVCVNHRLNDYNSSLKSLRARRDLFYSRLSDVLVAKGKADDQFNWRVTRNEKLTRLREKLRRSREQLEQGKAEIEMMSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVYGENKEGPGEQFDQICNVSLPKRLDPHSVQPHE
        LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQS+VVKQ+CKLFPQR+VLV+GENKEGP EQFDQIC VSLP+ LDPHSV+PHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVYGENKEGPGEQFDQICNVSLPKRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRIWQRDSYCDARPSSRSNEYPLFIPRQNYCSTNGENSWSDKSSSNFGVASLESERKPHIGSLENSS
        LS SLGYMVQLLNL+VQYLAAPALH+SGFAGSCSRIWQRDSY  ARPSSRSNEYPLFIPRQ Y ST+GENSWSDK SS+FGVASLESERK H+ SLE+SS
Subjt:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRIWQRDSYCDARPSSRSNEYPLFIPRQNYCSTNGENSWSDKSSSNFGVASLESERKPHIGSLENSS

Query:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
        F+Y SA PHSIETHK+LQKGIALLKKSVACVT+YCYNSLS DVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSR PKHVQKLNKSAW VNSISSSM
Subjt:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM

Query:  LLESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMITDATKK
        L ESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIE WDLIEHPT PPPPSQ+E+IEHWTRAMITDATK+
Subjt:  LLESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMITDATKK

A0A6J1K104 uncharacterized protein LOC1114891762.6e-24091.37Show/hide
Query:  MNRRFCNCAICENSNQAFICTVCVNHRLNDYNSSLKSLRARRDLFYSRLSDVLVAKGKADDQFNWRVTRNEKLTRLREKLRRSREQLEQGKAEIEMMSYD
        MNRRFCNCAICENSNQA IC  CVNHRLNDYNS+LKSLRARRDL YSRLSDVLVAKGKADDQ NWRVTRNEKL+ LREKLRR REQLEQGK EIEM SYD
Subjt:  MNRRFCNCAICENSNQAFICTVCVNHRLNDYNSSLKSLRARRDLFYSRLSDVLVAKGKADDQFNWRVTRNEKLTRLREKLRRSREQLEQGKAEIEMMSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVYGENKEGPGEQFDQICNVSLPKRLDPHSVQPHE
        LKLK+AMLESARSVLEKQRVEQLEKAYPDLISTK LGHMAITSERLHKQSVVVKQICKLFPQRRVLV+GENKEGPGEQFDQICNVSLP+RLDPHSVQPHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVYGENKEGPGEQFDQICNVSLPKRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRIWQRDSYCDARPSSRSNEYPLFIPRQNYCSTNGENSWSDKSSSNFGVASLESERKPHIGSLENSS
        LSASLGYMVQLLNLIV  LAAPALHNSGFAGSCSRIWQR+SY DA PSS+ NEYPLFIPRQNYCST+GENSWSDKSSSNFGVASLESERKPH+ SLEN S
Subjt:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRIWQRDSYCDARPSSRSNEYPLFIPRQNYCSTNGENSWSDKSSSNFGVASLESERKPHIGSLENSS

Query:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
        FNYSSAS HSIETHKDLQ GIALLKKSVAC+TAYCYNSL LDVPSEASTFEAFAKLLA LSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
Subjt:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM

Query:  LLESAHSQIMKTNCESNLP----SYLYATEFSDAGKNDCTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMITDATK
        LLESAHSQIMKTN ESN P    SYLYATEFSDA KND TIEGWDLIEHPTFPPPPSQAEDIEHWTRAM  DATK
Subjt:  LLESAHSQIMKTNCESNLP----SYLYATEFSDAGKNDCTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMITDATK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G77890.1 DNA-directed RNA polymerase II protein1.3e-11151.58Show/hide
Query:  CAICENSNQAFICTVCVNHRLNDYNSSLKSLRARRDLFYSRLSDVLVAKGKADDQFNWRVTRNEKLTRLREKLRRSREQLEQGKAEIEMMSYDLKLKYAM
        CA+C  S +  IC  CVN  LN+Y   L SL++ R++ Y RLS +LV K KA  Q  W+  +NEKL +LREKL+   E+L+Q K     +S +LK +Y +
Subjt:  CAICENSNQAFICTVCVNHRLNDYNSSLKSLRARRDLFYSRLSDVLVAKGKADDQFNWRVTRNEKLTRLREKLRRSREQLEQGKAEIEMMSYDLKLKYAM

Query:  LESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVYGENKEGPGEQFDQICNVSLPKRLDPHSVQPHELSASLGY
        +ES    LE+ RV QLE  Y D I    L ++ +TSERL+KQ++V+KQICKLFP  RV V G+NK+G   Q+DQICN  LP+ L+P SV P EL+ASLGY
Subjt:  LESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVYGENKEGPGEQFDQICNVSLPKRLDPHSVQPHELSASLGY

Query:  MVQLLNLIVQYLAAPALHNSGFAGSCSRIWQRDSYCDARPSSRSNEYPLFIPRQNYCSTNGENSWSDKSSSNFGVASLESER--KPHIGSLENSSFNYSS
        MVQLLNL+V  L+ PALHN GFAGSCSRIW+RDSY ++ PSS SN YPLF+P  ++ S   ++SW+ + ++NFGV SL+S+   +     L+    + SS
Subjt:  MVQLLNLIVQYLAAPALHNSGFAGSCSRIWQRDSYCDARPSSRSNEYPLFIPRQNYCSTNGENSWSDKSSSNFGVASLESER--KPHIGSLENSSFNYSS

Query:  ASPHSIETHKDLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSMLLESA
        ASPHS+ET ++LQ+GIA LK+SVA +T Y Y SLSL+VPS  STFE FAKLLATLSS KEV+S  SL ++SS   +H  + NKS WN+NS SSS LL S+
Subjt:  ASPHSIETHKDLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSMLLESA

Query:  HSQIMKTNCE-SNLPSY--LYATEFSDAGKNDCTIEGWDLIEHP
        H+Q    N    N+P+    Y  EF D  K+  +I  W+L+E+P
Subjt:  HSQIMKTNCE-SNLPSY--LYATEFSDAGKNDCTIEGWDLIEHP

AT1G77890.2 DNA-directed RNA polymerase II protein7.1e-10550Show/hide
Query:  CAICENSNQAFICTVCVNHRLNDYNSSLKSLRARRDLFYSRLSDVLVAKGKADDQFNWRVTRNEKLTRLREKLRRSREQLEQGKAEIEMMSYDLKLKYAM
        CA+C  S +  IC  CVN  LN+Y   L SL++ R++ Y RLS +LV K KA  Q  W+  +NEKL +LREKL+   E+L+Q K     +S +LK +Y +
Subjt:  CAICENSNQAFICTVCVNHRLNDYNSSLKSLRARRDLFYSRLSDVLVAKGKADDQFNWRVTRNEKLTRLREKLRRSREQLEQGKAEIEMMSYDLKLKYAM

Query:  LESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVYGENKEGPGEQFDQICNVSLPKRLDPHSVQPHELSASLGY
        +ES    LE+ RV QLE  Y D I    L +           ++V+KQICKLFP  RV V G+NK+G   Q+DQICN  LP+ L+P SV P EL+ASLGY
Subjt:  LESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVYGENKEGPGEQFDQICNVSLPKRLDPHSVQPHELSASLGY

Query:  MVQLLNLIVQYLAAPALHNSGFAGSCSRIWQRDSYCDARPSSRSNEYPLFIPRQNYCSTNGENSWSDKSSSNFGVASLESER--KPHIGSLENSSFNYSS
        MVQLLNL+V  L+ PALHN GFAGSCSRIW+RDSY ++ PSS SN YPLF+P  ++ S   ++SW+ + ++NFGV SL+S+   +     L+    + SS
Subjt:  MVQLLNLIVQYLAAPALHNSGFAGSCSRIWQRDSYCDARPSSRSNEYPLFIPRQNYCSTNGENSWSDKSSSNFGVASLESER--KPHIGSLENSSFNYSS

Query:  ASPHSIETHKDLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSMLLESA
        ASPHS+ET ++LQ+GIA LK+SVA +T Y Y SLSL+VPS  STFE FAKLLATLSS KEV+S  SL ++SS   +H  + NKS WN+NS SSS LL S+
Subjt:  ASPHSIETHKDLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSMLLESA

Query:  HSQIMKTNCE-SNLPSY--LYATEFSDAGKNDCTIEGWDLIEHP
        H+Q    N    N+P+    Y  EF D  K+  +I  W+L+E+P
Subjt:  HSQIMKTNCE-SNLPSY--LYATEFSDAGKNDCTIEGWDLIEHP

AT1G77890.3 DNA-directed RNA polymerase II protein4.3e-11051.58Show/hide
Query:  CAICENSNQAFICTVCVNHRLNDYNSSLKSLRARRDLFYSRLSDVLVAKGKADDQFNWRVTRNEKLTRLREKLRRSREQLEQGKAEIEMMSYDLKLKYAM
        CA+C  S +  IC  CVN  LN+Y   L SL++ R++ Y RLS +LV K KA  Q  W+  +NEKL +LREKL+   E+L+Q K     +S +LK +Y +
Subjt:  CAICENSNQAFICTVCVNHRLNDYNSSLKSLRARRDLFYSRLSDVLVAKGKADDQFNWRVTRNEKLTRLREKLRRSREQLEQGKAEIEMMSYDLKLKYAM

Query:  LESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVYGENKEGPGEQFDQICNVSLPKRLDPHSVQPHELSASLGY
        +ES    LE+ RV QLE  Y D I    L  + +TSERL+KQ++V+KQICKLFP  RV V G+NK+G   Q+DQICN  LP+ L+P SV P EL+ASLGY
Subjt:  LESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVYGENKEGPGEQFDQICNVSLPKRLDPHSVQPHELSASLGY

Query:  MVQLLNLIVQYLAAPALHNSGFAGSCSRIWQRDSYCDARPSSRSNEYPLFIPRQNYCSTNGENSWSDKSSSNFGVASLESER--KPHIGSLENSSFNYSS
        MVQLLNL+V  L+ PALHN GFAGSCSRIW+RDSY ++ PSS SN YPLF+P  ++ S   ++SW+ + ++NFGV SL+S+   +     L+    + SS
Subjt:  MVQLLNLIVQYLAAPALHNSGFAGSCSRIWQRDSYCDARPSSRSNEYPLFIPRQNYCSTNGENSWSDKSSSNFGVASLESER--KPHIGSLENSSFNYSS

Query:  ASPHSIETHKDLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSMLLESA
        ASPHS+ET ++LQ+GIA LK+SVA +T Y Y SLSL+VPS  STFE FAKLLATLSS KEV+S  SL ++SS   +H  + NKS WN+NS SSS LL S+
Subjt:  ASPHSIETHKDLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSMLLESA

Query:  HSQIMKTNCE-SNLPSY--LYATEFSDAGKNDCTIEGWDLIEHP
        H+Q    N    N+P+    Y  EF D  K+  +I  W+L+E+P
Subjt:  HSQIMKTNCE-SNLPSY--LYATEFSDAGKNDCTIEGWDLIEHP

AT4G08540.1 DNA-directed RNA polymerase II protein8.3e-17868Show/hide
Query:  MNRRFCNCAICENSNQAFICTVCVNHRLNDYNSSLKSLRARRDLFYSRLSDVLVAKGKADDQFNWRVTRNEKLTRLREKLRRSREQLEQGKAEIEMMSYD
        M +R  NCAIC+N+N+  ICT CVNHRL +YN+ LKSL+ RRD   SR +++L +KGKADDQ NWR+ +NEK+++L++KL+ ++E + QGK +IE  S D
Subjt:  MNRRFCNCAICENSNQAFICTVCVNHRLNDYNSSLKSLRARRDLFYSRLSDVLVAKGKADDQFNWRVTRNEKLTRLREKLRRSREQLEQGKAEIEMMSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVYGENKEGPGEQFDQICNVSLPKRLDPHSVQPHE
        LK+KY +L+SARS LEK RVEQ+EK +P+LI T++LGHMAI+SERLHKQSVVVKQICKLFP RRV   GE++ G   Q+D ICN  LP  LDPHS+   E
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVYGENKEGPGEQFDQICNVSLPKRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRIWQRDSYCDARPSSRSNEYPLFIPRQNYCSTNGENSWSDKSSSNFGVASLESERK-PHIGSLENS
        L+ SLGYMVQLLNL+V  LAAPALH+SGFAGSCSRIWQRDSY D R S+RSNEYPLFIPR+NYCST+ ENSW+DK+SSNFGVAS+ES+RK P + S  ++
Subjt:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRIWQRDSYCDARPSSRSNEYPLFIPRQNYCSTNGENSWSDKSSSNFGVASLESERK-PHIGSLENS

Query:  SFNYSSASPHSIETHKDLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSS
        SF YSSASPHSIE+H+DLQKGIALLKKSVAC+TAYCYNSL L+VP EASTFEAFAKLLATLSSSKEVRSVFSLKMASSRS K  Q+LNKS WN +S+ SS
Subjt:  SFNYSSASPHSIETHKDLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSS

Query:  MLLESAH-SQIMKTNCESNLP-SYLYATEFSDAGKNDCTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMITDATKK
         LLESAH  +    N + N P SYL ATE S    ND  + GWDL+EHP +PPPPSQ+ED+EHWTRAM  DA KK
Subjt:  MLLESAH-SQIMKTNCESNLP-SYLYATEFSDAGKNDCTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMITDATKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCGGAGATTCTGCAACTGCGCTATCTGTGAGAATTCGAATCAAGCTTTCATTTGCACCGTTTGCGTGAATCACAGATTGAATGACTACAACTCTTCGTTAAAATC
GTTGAGAGCTCGGCGGGATTTGTTTTATTCGAGGTTGAGTGACGTGCTCGTGGCAAAGGGGAAGGCAGACGATCAATTCAATTGGAGAGTGACTCGGAATGAGAAACTTA
CGAGGTTAAGGGAGAAACTCCGACGTAGTAGAGAGCAACTCGAGCAAGGAAAGGCAGAGATTGAGATGATGTCCTACGATCTCAAATTGAAATATGCGATGCTTGAATCA
GCCCGTTCAGTGTTGGAAAAACAACGAGTTGAACAACTGGAGAAGGCCTATCCGGACCTTATTAGCACCAAGAATCTGGGACATATGGCAATTACCTCTGAACGCCTTCA
CAAACAATCTGTGGTTGTAAAACAAATATGCAAATTGTTTCCACAACGTCGGGTTTTAGTTTATGGAGAGAATAAAGAGGGACCCGGTGAGCAATTTGATCAAATCTGTA
ATGTGAGCTTACCAAAAAGGCTGGATCCCCATTCTGTTCAGCCACACGAGCTTTCAGCTTCTTTGGGATACATGGTGCAACTTCTAAATCTTATTGTTCAATATTTGGCT
GCTCCTGCACTTCACAACTCGGGTTTTGCAGGTTCTTGTTCACGTATATGGCAAAGGGATTCATATTGTGATGCTCGTCCATCTTCTCGAAGCAATGAGTATCCACTTTT
TATACCACGTCAAAACTATTGTTCAACAAATGGGGAAAATTCATGGTCTGATAAAAGCTCTAGTAACTTTGGTGTTGCTTCGCTGGAATCAGAGAGGAAACCACATATAG
GCTCACTAGAAAATAGTAGCTTCAATTATTCCTCCGCTTCTCCACATTCTATTGAAACGCACAAGGATTTGCAGAAAGGGATTGCCCTCCTAAAGAAAAGTGTAGCATGT
GTCACCGCATATTGCTATAACTCTCTTTCTTTAGATGTTCCTTCTGAAGCTTCTACTTTTGAAGCATTTGCTAAATTATTGGCCACTCTTTCTTCATCCAAGGAAGTGCG
TTCTGTCTTTTCCCTCAAAATGGCTTCTTCCAGATCCCCTAAGCACGTTCAGAAACTGAACAAATCTGCATGGAATGTGAATTCAATTTCATCCAGCATGCTGCTCGAGA
GTGCACATTCACAAATAATGAAAACCAATTGTGAGAGTAACCTTCCAAGTTATCTCTATGCCACTGAATTTTCTGATGCCGGAAAGAATGATTGCACCATTGAAGGATGG
GACCTCATAGAGCATCCAACTTTTCCTCCTCCACCTTCCCAGGCTGAAGATATTGAGCATTGGACTCGAGCAATGATCACCGATGCCACCAAAAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATCGGAGATTCTGCAACTGCGCTATCTGTGAGAATTCGAATCAAGCTTTCATTTGCACCGTTTGCGTGAATCACAGATTGAATGACTACAACTCTTCGTTAAAATC
GTTGAGAGCTCGGCGGGATTTGTTTTATTCGAGGTTGAGTGACGTGCTCGTGGCAAAGGGGAAGGCAGACGATCAATTCAATTGGAGAGTGACTCGGAATGAGAAACTTA
CGAGGTTAAGGGAGAAACTCCGACGTAGTAGAGAGCAACTCGAGCAAGGAAAGGCAGAGATTGAGATGATGTCCTACGATCTCAAATTGAAATATGCGATGCTTGAATCA
GCCCGTTCAGTGTTGGAAAAACAACGAGTTGAACAACTGGAGAAGGCCTATCCGGACCTTATTAGCACCAAGAATCTGGGACATATGGCAATTACCTCTGAACGCCTTCA
CAAACAATCTGTGGTTGTAAAACAAATATGCAAATTGTTTCCACAACGTCGGGTTTTAGTTTATGGAGAGAATAAAGAGGGACCCGGTGAGCAATTTGATCAAATCTGTA
ATGTGAGCTTACCAAAAAGGCTGGATCCCCATTCTGTTCAGCCACACGAGCTTTCAGCTTCTTTGGGATACATGGTGCAACTTCTAAATCTTATTGTTCAATATTTGGCT
GCTCCTGCACTTCACAACTCGGGTTTTGCAGGTTCTTGTTCACGTATATGGCAAAGGGATTCATATTGTGATGCTCGTCCATCTTCTCGAAGCAATGAGTATCCACTTTT
TATACCACGTCAAAACTATTGTTCAACAAATGGGGAAAATTCATGGTCTGATAAAAGCTCTAGTAACTTTGGTGTTGCTTCGCTGGAATCAGAGAGGAAACCACATATAG
GCTCACTAGAAAATAGTAGCTTCAATTATTCCTCCGCTTCTCCACATTCTATTGAAACGCACAAGGATTTGCAGAAAGGGATTGCCCTCCTAAAGAAAAGTGTAGCATGT
GTCACCGCATATTGCTATAACTCTCTTTCTTTAGATGTTCCTTCTGAAGCTTCTACTTTTGAAGCATTTGCTAAATTATTGGCCACTCTTTCTTCATCCAAGGAAGTGCG
TTCTGTCTTTTCCCTCAAAATGGCTTCTTCCAGATCCCCTAAGCACGTTCAGAAACTGAACAAATCTGCATGGAATGTGAATTCAATTTCATCCAGCATGCTGCTCGAGA
GTGCACATTCACAAATAATGAAAACCAATTGTGAGAGTAACCTTCCAAGTTATCTCTATGCCACTGAATTTTCTGATGCCGGAAAGAATGATTGCACCATTGAAGGATGG
GACCTCATAGAGCATCCAACTTTTCCTCCTCCACCTTCCCAGGCTGAAGATATTGAGCATTGGACTCGAGCAATGATCACCGATGCCACCAAAAAGTAA
Protein sequenceShow/hide protein sequence
MNRRFCNCAICENSNQAFICTVCVNHRLNDYNSSLKSLRARRDLFYSRLSDVLVAKGKADDQFNWRVTRNEKLTRLREKLRRSREQLEQGKAEIEMMSYDLKLKYAMLES
ARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQICKLFPQRRVLVYGENKEGPGEQFDQICNVSLPKRLDPHSVQPHELSASLGYMVQLLNLIVQYLA
APALHNSGFAGSCSRIWQRDSYCDARPSSRSNEYPLFIPRQNYCSTNGENSWSDKSSSNFGVASLESERKPHIGSLENSSFNYSSASPHSIETHKDLQKGIALLKKSVAC
VTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSMLLESAHSQIMKTNCESNLPSYLYATEFSDAGKNDCTIEGW
DLIEHPTFPPPPSQAEDIEHWTRAMITDATKK