CuGenDBv2

CSPI02G25050 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI02G25050
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: DNA-directed RNA polymerase II, putative
Genome location: Chr2:21424161..21432172
RNA-Seq Expression: CSPI02G25050
Synteny: CSPI02G25050
Gene Ontology terms:
GO:0035493 - SNARE complex assembly (biological process)
GO:0000323 - lytic vacuole (cellular component)
GO:0005768 - endosome (cellular component)
GO:0000149 - SNARE binding (molecular function)
InterPro domains: IPR018791 - UV radiation resistance protein/autophagy-related protein 14


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_004138644.2 uncharacterized protein LOC101217421 [Cucumis sativus] (e value: 6.5e-265; % identity: 99.79)
Query:  MMNRKFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSF
        MMNRKFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSF
Subjt:  MMNRKFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSF

Query:  DLQLKYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICIVSLPRSLDPHSVEPY
        DLQLKYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQIC VSLPRSLDPHSVEPY
Subjt:  DLQLKYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICIVSLPRSLDPHSVEPY

Query:  ELSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKPQLSSLENR
        ELSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKPQLSSLENR
Subjt:  ELSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKPQLSSLENR

Query:  SFNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASS
        SFNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASS
Subjt:  SFNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASS

Query:  MLFESGHSQIMKTNYESNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATKQ
        MLFESGHSQIMKTNYESNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATKQ
Subjt:  MLFESGHSQIMKTNYESNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATKQ

XP_008441239.1 PREDICTED: uncharacterized protein LOC103485428 isoform X1 [Cucumis melo] (e value: 4.1e-259; % identity: 97.06)
Query:  MMNRKFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSF
        MMNRKFCNCAICENSNQASICTGCVNLRLNDYN+SLKSLRARRD+LYSRLSDVLVAKGKADDQLNWRVTRNEKLT LREKLRRSREQLEQGKAEIEMKSF
Subjt:  MMNRKFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSF

Query:  DLQLKYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICIVSLPRSLDPHSVEPY
        DLQLKYAMLESARSVLEKQR+EQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRG+KEVGPGEPFDQIC VSLPRSLDPHSVEPY
Subjt:  DLQLKYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICIVSLPRSLDPHSVEPY

Query:  ELSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKPQLSSLENR
        ELSASLGYMVQLLNLVVQYLAAPALH SGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKP LSSLENR
Subjt:  ELSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKPQLSSLENR

Query:  SFNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASS
        SFNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKM SSRSTKHIQKPIKSTWNVNSI+SS
Subjt:  SFNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASS

Query:  MLFESGHSQIMKTNYESNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATKQ
        MLFESGHSQIMKTNYESNLPSSASSYLYATEFSD GKNDS+IEGWDLVEHPTFPPPPSQAEDIEHWTRAM IDATK+
Subjt:  MLFESGHSQIMKTNYESNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATKQ

XP_008441240.1 PREDICTED: uncharacterized protein LOC103485428 isoform X2 [Cucumis melo] (e value: 4.1e-259; % identity: 97.06)
Query:  MMNRKFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSF
        MMNRKFCNCAICENSNQASICTGCVNLRLNDYN+SLKSLRARRD+LYSRLSDVLVAKGKADDQLNWRVTRNEKLT LREKLRRSREQLEQGKAEIEMKSF
Subjt:  MMNRKFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSF

Query:  DLQLKYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICIVSLPRSLDPHSVEPY
        DLQLKYAMLESARSVLEKQR+EQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRG+KEVGPGEPFDQIC VSLPRSLDPHSVEPY
Subjt:  DLQLKYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICIVSLPRSLDPHSVEPY

Query:  ELSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKPQLSSLENR
        ELSASLGYMVQLLNLVVQYLAAPALH SGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKP LSSLENR
Subjt:  ELSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKPQLSSLENR

Query:  SFNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASS
        SFNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKM SSRSTKHIQKPIKSTWNVNSI+SS
Subjt:  SFNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASS

Query:  MLFESGHSQIMKTNYESNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATKQ
        MLFESGHSQIMKTNYESNLPSSASSYLYATEFSD GKNDS+IEGWDLVEHPTFPPPPSQAEDIEHWTRAM IDATK+
Subjt:  MLFESGHSQIMKTNYESNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATKQ

XP_023549965.1 uncharacterized protein LOC111808299 [Cucurbita pepo subsp. pepo] (e value: 5.0e-233; % identity: 88.21)
Query:  MNRKFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSFD
        MNRKFCNCAICENSNQASIC GCVN RLNDYNS+LKSLRARRD+LYSRLSDVLVAKGKADDQLNWRVTRNEKL+ LREKLRR REQLEQGKAEIEM S+D
Subjt:  MNRKFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSFD

Query:  LQLKYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICIVSLPRSLDPHSVEPYE
        L+LK+AMLESARSVLEKQR+EQLEKAYPDLISTK LGHMAITSERLHKQSVV+KQ+CKLFPQRRVLV GE + GPGE FDQIC VSLPR LDPHSV+P+E
Subjt:  LQLKYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICIVSLPRSLDPHSVEPYE

Query:  LSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKPQLSSLENRS
        LSASLGYMVQLLNL+V  LAAPALH SGFAGSCSRIWQR+SYW+ACPSS+SNEYP+F+PRQ+YCSTSGENSWSDKSSSNFGVASLESERKP LSSLEN+S
Subjt:  LSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKPQLSSLENRS

Query:  FNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASSM
        FNYSSAS HSIE+HKDLQ GIALLKKSVAC+TAY YNSL LDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRS KH+QK  KS WNVNSI+SSM
Subjt:  FNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASSM

Query:  LFESGHSQIMKTNYESNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATK
        L ES HSQIMKTNYESN PSSASSYLYATEFSD  KNDS+IEGWDL+EHPTFPPPPSQAEDIEHWTRAM IDATK
Subjt:  LFESGHSQIMKTNYESNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATK

XP_038886525.1 uncharacterized protein LOC120076698 [Benincasa hispida] (e value: 4.1e-243; % identity: 91.39)
Query:  MNRKFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSFD
        MNRKFCNCA CENSNQAS CTGCVN RLNDYNS+LKSLRARRD+LYSRLSDVLVAKGKADDQLNWRVTRNEKLT LREKLRRSREQL+QGKAEIEM SFD
Subjt:  MNRKFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSFD

Query:  LQLKYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICIVSLPRSLDPHSVEPYE
        LQLKYAMLESARSVLEKQR+EQLEKAYPDLISTKNLG MAITSERLHKQSVVIKQ+CKLFPQRRVLV GEK+VGPGEPFDQIC VSLPR LDPHSVEPYE
Subjt:  LQLKYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICIVSLPRSLDPHSVEPYE

Query:  LSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKPQLSSLENRS
        LSASLGYMVQLLNLVVQYLAAPALH SGFAGSCSRIWQR SYW+ CPSS+S+EYPVF+PRQSYCSTSGENSWSDKSSSNFGVASLESE+KP LS LE+RS
Subjt:  LSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKPQLSSLENRS

Query:  FNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASSM
        FNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLD+PSEASTFEAFAKLLATLSSSKEVRSVF+LKMASSRS KH+QK  KS WN NS++ SM
Subjt:  FNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASSM

Query:  LFESGHSQIMKTNYESNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATKQ
        LFES HSQIMKTNYESNLPSSASSYLYATEFSD GKNDS+IEGWDLVEHPTFPPPPSQAEDIEHWTRAM IDATKQ
Subjt:  LFESGHSQIMKTNYESNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATKQ

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0LQP7 Uncharacterized protein (e value: 3.1e-265; % identity: 99.79)
Query:  MMNRKFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSF
        MMNRKFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSF
Subjt:  MMNRKFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSF

Query:  DLQLKYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICIVSLPRSLDPHSVEPY
        DLQLKYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQIC VSLPRSLDPHSVEPY
Subjt:  DLQLKYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICIVSLPRSLDPHSVEPY

Query:  ELSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKPQLSSLENR
        ELSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKPQLSSLENR
Subjt:  ELSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKPQLSSLENR

Query:  SFNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASS
        SFNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASS
Subjt:  SFNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASS

Query:  MLFESGHSQIMKTNYESNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATKQ
        MLFESGHSQIMKTNYESNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATKQ
Subjt:  MLFESGHSQIMKTNYESNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATKQ

A0A1S3B2Y9 uncharacterized protein LOC103485428 isoform X1 (e value: 2.0e-259; % identity: 97.06)
Query:  MMNRKFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSF
        MMNRKFCNCAICENSNQASICTGCVNLRLNDYN+SLKSLRARRD+LYSRLSDVLVAKGKADDQLNWRVTRNEKLT LREKLRRSREQLEQGKAEIEMKSF
Subjt:  MMNRKFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSF

Query:  DLQLKYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICIVSLPRSLDPHSVEPY
        DLQLKYAMLESARSVLEKQR+EQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRG+KEVGPGEPFDQIC VSLPRSLDPHSVEPY
Subjt:  DLQLKYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICIVSLPRSLDPHSVEPY

Query:  ELSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKPQLSSLENR
        ELSASLGYMVQLLNLVVQYLAAPALH SGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKP LSSLENR
Subjt:  ELSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKPQLSSLENR

Query:  SFNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASS
        SFNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKM SSRSTKHIQKPIKSTWNVNSI+SS
Subjt:  SFNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASS

Query:  MLFESGHSQIMKTNYESNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATKQ
        MLFESGHSQIMKTNYESNLPSSASSYLYATEFSD GKNDS+IEGWDLVEHPTFPPPPSQAEDIEHWTRAM IDATK+
Subjt:  MLFESGHSQIMKTNYESNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATKQ

A0A1S3B3M2 uncharacterized protein LOC103485428 isoform X2 (e value: 2.0e-259; % identity: 97.06)
Query:  MMNRKFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSF
        MMNRKFCNCAICENSNQASICTGCVNLRLNDYN+SLKSLRARRD+LYSRLSDVLVAKGKADDQLNWRVTRNEKLT LREKLRRSREQLEQGKAEIEMKSF
Subjt:  MMNRKFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSF

Query:  DLQLKYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICIVSLPRSLDPHSVEPY
        DLQLKYAMLESARSVLEKQR+EQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRG+KEVGPGEPFDQIC VSLPRSLDPHSVEPY
Subjt:  DLQLKYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICIVSLPRSLDPHSVEPY

Query:  ELSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKPQLSSLENR
        ELSASLGYMVQLLNLVVQYLAAPALH SGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKP LSSLENR
Subjt:  ELSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKPQLSSLENR

Query:  SFNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASS
        SFNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKM SSRSTKHIQKPIKSTWNVNSI+SS
Subjt:  SFNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASS

Query:  MLFESGHSQIMKTNYESNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATKQ
        MLFESGHSQIMKTNYESNLPSSASSYLYATEFSD GKNDS+IEGWDLVEHPTFPPPPSQAEDIEHWTRAM IDATK+
Subjt:  MLFESGHSQIMKTNYESNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATKQ

A0A5A7T0W2 UV radiation resistance protein/autophagy-related protein 14 (e value: 2.0e-259; % identity: 97.06)
Query:  MMNRKFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSF
        MMNRKFCNCAICENSNQASICTGCVNLRLNDYN+SLKSLRARRD+LYSRLSDVLVAKGKADDQLNWRVTRNEKLT LREKLRRSREQLEQGKAEIEMKSF
Subjt:  MMNRKFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSF

Query:  DLQLKYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICIVSLPRSLDPHSVEPY
        DLQLKYAMLESARSVLEKQR+EQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRG+KEVGPGEPFDQIC VSLPRSLDPHSVEPY
Subjt:  DLQLKYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICIVSLPRSLDPHSVEPY

Query:  ELSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKPQLSSLENR
        ELSASLGYMVQLLNLVVQYLAAPALH SGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKP LSSLENR
Subjt:  ELSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKPQLSSLENR

Query:  SFNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASS
        SFNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKM SSRSTKHIQKPIKSTWNVNSI+SS
Subjt:  SFNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASS

Query:  MLFESGHSQIMKTNYESNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATKQ
        MLFESGHSQIMKTNYESNLPSSASSYLYATEFSD GKNDS+IEGWDLVEHPTFPPPPSQAEDIEHWTRAM IDATK+
Subjt:  MLFESGHSQIMKTNYESNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATKQ

A0A6J1FFY6 uncharacterized protein LOC111445131 (e value: 1.6e-232; % identity: 88)
Query:  MNRKFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSFD
        MNRKFCNCAICENSNQASIC GCVN RLNDYNS+LKSLR RRD LYSRLSDVLVAKGKADDQLNWRVTRNEKL+ LREKLRR REQLEQGKAEIEM S+D
Subjt:  MNRKFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSFD

Query:  LQLKYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICIVSLPRSLDPHSVEPYE
        L+LK+AMLESARSVLEKQR+EQLEKAYPDLISTK LGHMAITSERLHKQSVV+KQ+CKLFPQRRVLV GE + GPGE FDQIC VSLPR LDPHSV+P+E
Subjt:  LQLKYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICIVSLPRSLDPHSVEPYE

Query:  LSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKPQLSSLENRS
        LSASLGYMVQLLNL+V  LAAPALH SGFAGSCSRIWQR+SYW+ACPSS+SNEYP+F+PRQ+YCSTSGENSWSDKSSSNFGVASLESERKP LSSLEN+S
Subjt:  LSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKPQLSSLENRS

Query:  FNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASSM
        FNYSSAS HSIE+HKDLQ GIALLKKSVAC+TAY YNSL LDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRS KH+QK  KS WNVNSI+SSM
Subjt:  FNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASSM

Query:  LFESGHSQIMKTNYESNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATK
        L ES HSQIMKTNYESN PSSASSYLYATEFSD  KNDS+IEGWDL+EHPTFPPPPSQAEDIEHWTRAM IDATK
Subjt:  LFESGHSQIMKTNYESNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATK

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G77890.1 DNA-directed RNA polymerase II protein (e value: 2.4e-108; % identity: 50.44)
Query:  KFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSFDLQL
        K   CA+C  S + SIC  CVN  LN+Y   L SL++ R+V Y RLS +LV K KA  Q  W+  +NEKL  LREKL+   E+L+Q K      S +L+ 
Subjt:  KFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSFDLQL

Query:  KYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICIVSLPRSLDPHSVEPYELSA
        +Y ++ES    LE+ R+ QLE  Y D I    L ++ +TSERL+KQ++V+KQ+CKLFP  RV V G+ + G    +DQIC   LP+ L+P SV P EL+A
Subjt:  KYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICIVSLPRSLDPHSVEPYELSA

Query:  SLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESER--KPQLSSLENRSF
        SLGYMVQLLNLVV  L+ PALH  GFAGSCSRIW+RDSYWN+ PSS SN YP+F+P   + S   ++SW+ + ++NFGV SL+S+   +     L+    
Subjt:  SLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESER--KPQLSSLENRSF

Query:  NYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASSML
        + SSASPHS+E+ ++LQ+GIA LK+SVA +T YGY SLSL+VPS  STFE FAKLLATLSS KEV+S  SL ++SS   +H  +P KS WN+NS +SS L
Subjt:  NYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASSML

Query:  FESGHSQIMKTNYE-SNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHP
          S H+Q    N    N+P+   SY+   EF D  K+ +SI  W+LVE+P
Subjt:  FESGHSQIMKTNYE-SNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHP

AT1G77890.2 DNA-directed RNA polymerase II protein (e value: 1.3e-101; % identity: 48.89)
Query:  KFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSFDLQL
        K   CA+C  S + SIC  CVN  LN+Y   L SL++ R+V Y RLS +LV K KA  Q  W+  +NEKL  LREKL+   E+L+Q K      S +L+ 
Subjt:  KFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSFDLQL

Query:  KYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICIVSLPRSLDPHSVEPYELSA
        +Y ++ES    LE+ R+ QLE  Y D I    L +           ++V+KQ+CKLFP  RV V G+ + G    +DQIC   LP+ L+P SV P EL+A
Subjt:  KYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICIVSLPRSLDPHSVEPYELSA

Query:  SLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESER--KPQLSSLENRSF
        SLGYMVQLLNLVV  L+ PALH  GFAGSCSRIW+RDSYWN+ PSS SN YP+F+P   + S   ++SW+ + ++NFGV SL+S+   +     L+    
Subjt:  SLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESER--KPQLSSLENRSF

Query:  NYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASSML
        + SSASPHS+E+ ++LQ+GIA LK+SVA +T YGY SLSL+VPS  STFE FAKLLATLSS KEV+S  SL ++SS   +H  +P KS WN+NS +SS L
Subjt:  NYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASSML

Query:  FESGHSQIMKTNYE-SNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHP
          S H+Q    N    N+P+   SY+   EF D  K+ +SI  W+LVE+P
Subjt:  FESGHSQIMKTNYE-SNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHP

AT1G77890.3 DNA-directed RNA polymerase II protein (e value: 7.7e-107; % identity: 50.44)
Query:  KFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSFDLQL
        K   CA+C  S + SIC  CVN  LN+Y   L SL++ R+V Y RLS +LV K KA  Q  W+  +NEKL  LREKL+   E+L+Q K      S +L+ 
Subjt:  KFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSFDLQL

Query:  KYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICIVSLPRSLDPHSVEPYELSA
        +Y ++ES    LE+ R+ QLE  Y D I    L  + +TSERL+KQ++V+KQ+CKLFP  RV V G+ + G    +DQIC   LP+ L+P SV P EL+A
Subjt:  KYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICIVSLPRSLDPHSVEPYELSA

Query:  SLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESER--KPQLSSLENRSF
        SLGYMVQLLNLVV  L+ PALH  GFAGSCSRIW+RDSYWN+ PSS SN YP+F+P   + S   ++SW+ + ++NFGV SL+S+   +     L+    
Subjt:  SLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESER--KPQLSSLENRSF

Query:  NYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASSML
        + SSASPHS+E+ ++LQ+GIA LK+SVA +T YGY SLSL+VPS  STFE FAKLLATLSS KEV+S  SL ++SS   +H  +P KS WN+NS +SS L
Subjt:  NYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASSML

Query:  FESGHSQIMKTNYE-SNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHP
          S H+Q    N    N+P+   SY+   EF D  K+ +SI  W+LVE+P
Subjt:  FESGHSQIMKTNYE-SNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHP

AT4G08540.1 DNA-directed RNA polymerase II protein (e value: 1.4e-169; % identity: 65.62)
Query:  MNRKFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSFD
        M ++  NCAIC+N+N+  ICT CVN RL +YN+ LKSL+ RRD L SR +++L +KGKADDQ NWR+ +NEK++ L++KL+ ++E + QGK +IE  S D
Subjt:  MNRKFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSFD

Query:  LQLKYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICIVSLPRSLDPHSVEPYE
        L++KY +L+SARS LEK R+EQ+EK +P+LI T++LGHMAI+SERLHKQSVV+KQ+CKLFP RRV   GE + G    +D IC   LP  LDPHS+   E
Subjt:  LQLKYAMLESARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICIVSLPRSLDPHSVEPYE

Query:  LSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERK-PQLSSLENR
        L+ SLGYMVQLLNLVV  LAAPALH+SGFAGSCSRIWQRDSYW+   S+RSNEYP+F+PR++YCSTS ENSW+DK+SSNFGVAS+ES+RK P+L S  + 
Subjt:  LSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERK-PQLSSLENR

Query:  SFNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASS
        SF YSSASPHSIESH+DLQKGIALLKKSVAC+TAY YNSL L+VP EASTFEAFAKLLATLSSSKEVRSVFSLKMASSRS K  Q+  KS WN +S+ SS
Subjt:  SFNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASS

Query:  MLFESGHSQIMKTNYESNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATKQ
         L ES H     T+Y  + P+S +SYL ATE S    ND  + GWDLVEHP +PPPPSQ+ED+EHWTRAM IDA K+
Subjt:  MLFESGHSQIMKTNYESNLPSSASSYLYATEFSDTGKNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATKQ
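
In the alignments above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal sketch of how a percent identity could be recomputed from one aligned segment. The function name `percent_identity` is hypothetical and not part of CuGenDB, and the length is taken from the query line because trailing spaces in the middle line may have been lost.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity of one aligned segment.

    query   -- the aligned query line (gaps included), used for the length
    midline -- the BLAST-style middle line for the same segment
    """
    if not query:
        raise ValueError("query segment is empty")
    # Only letters in the middle line count as identical positions;
    # '+' (conservative substitution) and ' ' (mismatch or gap) do not.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Toy segment (not taken from the record): 5 identical positions out of 7.
print(round(percent_identity("MKSFDLQ", "MK FD+Q"), 2))
```

For a multi-line alignment, the query lines and the middle lines would each be concatenated before calling the function, so that the identity is computed over the whole alignment rather than averaged per line.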


Sequences
CDS sequence
ATGATGAATCGAAAATTCTGCAACTGCGCTATCTGTGAGAATTCAAATCAAGCTTCCATTTGCACCGGTTGCGTCAATCTCAGATTGAATGACTACAATTCTTCGTTGAA
ATCATTGAGAGCTCGGCGAGATGTGTTGTATTCGAGGTTGAGTGATGTGCTTGTGGCAAAGGGGAAGGCAGACGACCAATTAAACTGGAGAGTGACAAGGAATGAGAAAC
TTACGAGCTTAAGGGAGAAACTCAGGCGCAGTAGAGAGCAACTCGAGCAAGGAAAGGCAGAGATTGAGATGAAATCCTTTGATCTCCAATTGAAATATGCAATGCTTGAA
TCAGCCCGTTCAGTGTTGGAAAAACAACGACTCGAACAACTGGAGAAGGCATATCCTGACCTTATTAGCACCAAGAATCTGGGACATATGGCAATCACCTCTGAACGCCT
TCACAAACAATCTGTGGTCATAAAACAACTATGCAAATTGTTTCCACAACGTCGGGTACTAGTTCGTGGGGAGAAAGAAGTGGGACCCGGTGAACCATTTGATCAAATCT
GTATTGTGAGCTTACCAAGAAGTCTGGATCCCCATTCTGTTGAACCATATGAGCTTTCAGCTTCTTTGGGATATATGGTGCAACTTCTAAATCTTGTTGTTCAATATTTG
GCTGCACCTGCTCTTCACACCTCGGGTTTTGCAGGTTCTTGTTCACGCATATGGCAAAGGGATTCATATTGGAATGCTTGCCCATCTTCTCGAAGCAATGAGTATCCAGT
CTTTATGCCACGTCAAAGTTATTGTTCAACAAGCGGGGAGAATTCATGGTCTGATAAAAGCTCTAGTAACTTTGGGGTTGCTTCGTTGGAATCAGAGAGGAAACCACAAT
TAAGTTCACTTGAAAATAGAAGTTTCAATTACTCCTCCGCTTCTCCACATTCTATTGAATCGCACAAGGATTTGCAGAAAGGGATTGCCCTCCTAAAGAAAAGTGTAGCA
TGTGTTACTGCATATGGCTATAACTCCCTTTCTTTAGACGTTCCGTCTGAAGCTTCAACTTTTGAAGCATTTGCTAAATTATTGGCTACTCTTTCTTCGTCAAAGGAAGT
GCGTTCTGTTTTTTCCCTCAAAATGGCTTCTTCCAGGTCCACCAAGCACATTCAGAAACCGATCAAATCAACATGGAATGTGAATTCCATCGCATCCAGCATGCTGTTCG
AGAGTGGACATTCACAAATAATGAAAACCAATTATGAGAGTAACCTTCCAAGTTCTGCATCGAGTTATCTTTATGCCACTGAGTTCTCTGATACCGGAAAGAATGATTCT
TCTATTGAAGGATGGGACCTCGTAGAGCATCCAACTTTTCCTCCTCCACCTTCCCAAGCCGAAGACATCGAGCATTGGACTCGAGCAATGATCATTGATGCCACCAAACA
GTAA
mRNA sequence
TGGCATTTCAGTTTCCACACCATCGGCTCCCGCGGTTCAAACTTTCCCCCTTCCGCTCCGACTTCGAAATCCAAATTCAGATTATCGTAACCACAATCTGATTCCTCCCA
CCAGATTCATGACTAGGAATTGATCCATGATGAATCGAAAATTCTGCAACTGCGCTATCTGTGAGAATTCAAATCAAGCTTCCATTTGCACCGGTTGCGTCAATCTCAGA
TTGAATGACTACAATTCTTCGTTGAAATCATTGAGAGCTCGGCGAGATGTGTTGTATTCGAGGTTGAGTGATGTGCTTGTGGCAAAGGGGAAGGCAGACGACCAATTAAA
CTGGAGAGTGACAAGGAATGAGAAACTTACGAGCTTAAGGGAGAAACTCAGGCGCAGTAGAGAGCAACTCGAGCAAGGAAAGGCAGAGATTGAGATGAAATCCTTTGATC
TCCAATTGAAATATGCAATGCTTGAATCAGCCCGTTCAGTGTTGGAAAAACAACGACTCGAACAACTGGAGAAGGCATATCCTGACCTTATTAGCACCAAGAATCTGGGA
CATATGGCAATCACCTCTGAACGCCTTCACAAACAATCTGTGGTCATAAAACAACTATGCAAATTGTTTCCACAACGTCGGGTACTAGTTCGTGGGGAGAAAGAAGTGGG
ACCCGGTGAACCATTTGATCAAATCTGTATTGTGAGCTTACCAAGAAGTCTGGATCCCCATTCTGTTGAACCATATGAGCTTTCAGCTTCTTTGGGATATATGGTGCAAC
TTCTAAATCTTGTTGTTCAATATTTGGCTGCACCTGCTCTTCACACCTCGGGTTTTGCAGGTTCTTGTTCACGCATATGGCAAAGGGATTCATATTGGAATGCTTGCCCA
TCTTCTCGAAGCAATGAGTATCCAGTCTTTATGCCACGTCAAAGTTATTGTTCAACAAGCGGGGAGAATTCATGGTCTGATAAAAGCTCTAGTAACTTTGGGGTTGCTTC
GTTGGAATCAGAGAGGAAACCACAATTAAGTTCACTTGAAAATAGAAGTTTCAATTACTCCTCCGCTTCTCCACATTCTATTGAATCGCACAAGGATTTGCAGAAAGGGA
TTGCCCTCCTAAAGAAAAGTGTAGCATGTGTTACTGCATATGGCTATAACTCCCTTTCTTTAGACGTTCCGTCTGAAGCTTCAACTTTTGAAGCATTTGCTAAATTATTG
GCTACTCTTTCTTCGTCAAAGGAAGTGCGTTCTGTTTTTTCCCTCAAAATGGCTTCTTCCAGGTCCACCAAGCACATTCAGAAACCGATCAAATCAACATGGAATGTGAA
TTCCATCGCATCCAGCATGCTGTTCGAGAGTGGACATTCACAAATAATGAAAACCAATTATGAGAGTAACCTTCCAAGTTCTGCATCGAGTTATCTTTATGCCACTGAGT
TCTCTGATACCGGAAAGAATGATTCTTCTATTGAAGGATGGGACCTCGTAGAGCATCCAACTTTTCCTCCTCCACCTTCCCAAGCCGAAGACATCGAGCATTGGACTCGA
GCAATGATCATTGATGCCACCAAACAGTAATTTAATGGGTTCATGAAGATAGCGCGAGAAAGAGAAGTAGTCGGAACTACCCAATGCTCGACCAGCTTTCGACATGTAAA
TAATTATGGAGACTTCATAAGGTGTACATCTCTGAACATATTTAGGTTTAGGATAACTTTTATTATTCCATCATACCCGGATTTATTTAGAATTAGGATAACTTGCAGCT
ATTTCATCATCTTAGCCAATATTATTCTTGGATTATTATTATTGTTTCCCCCCCTTTATTAATGGGTGCACATATAGGAGGAGTTTGTCGATGGGGACGTTTTTCAAATG
TACATAGAAACTGGCCTTTAAAATTAGTTCATTCCCTTTGGTGTTACGATTTTACTAGACATGAACACTAAGGTTCCCCCC
Protein sequence
MMNRKFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSFDLQLKYAMLE
SARSVLEKQRLEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICIVSLPRSLDPHSVEPYELSASLGYMVQLLNLVVQYL
AAPALHTSGFAGSCSRIWQRDSYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKPQLSSLENRSFNYSSASPHSIESHKDLQKGIALLKKSVA
CVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSIASSMLFESGHSQIMKTNYESNLPSSASSYLYATEFSDTGKNDS
SIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATKQ
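
The CDS and protein sequences listed above are related by translation with the standard genetic code. As a minimal sketch (the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of CuGenDB), the relationship can be checked as follows. The sketch assumes an unambiguous DNA sequence in frame from its first base; ambiguous bases such as `N` are not handled.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS to protein, stopping at the first stop codon."""
    # Drop line breaks and spaces so a multi-line listing can be pasted in.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]  # KeyError on ambiguous bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 27 bases of the CDS sequence listed in this record.
print(translate("ATGATGAATCGAAAATTCTGCAACTGC"))  # MMNRKFCNC
```

The printed peptide matches the first nine residues of the protein sequence in this record; pasting the full CDS listing into `translate` would allow the whole protein to be compared the same way.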