; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10018554 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10018554
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionDNA-directed RNA polymerase II protein
Genome locationChr04:5220171..5225537
RNA-Seq ExpressionHG10018554
SyntenyHG10018554
Gene Ontology termsGO:0035493 - SNARE complex assembly (biological process)
GO:0000323 - lytic vacuole (cellular component)
GO:0005768 - endosome (cellular component)
GO:0000149 - SNARE binding (molecular function)
InterPro domainsIPR018791 - UV radiation resistance protein/autophagy-related protein 14


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004138644.2 uncharacterized protein LOC101217421 [Cucumis sativus]6.9e-24392.24Show/hide
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKVGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSF
        MNRKFCNCAICENSNQASIC GCVN RLNDYNS+LKSLRARRD+LYSRLSDVLVAK GKADDQLNWRVTRNEKLT LREKLRRSREQLEQGK+EIEM SF
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKVGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSF

Query:  DLQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPY
        DLQLKYAMLESARSVLEKQR+EQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQR+VLVRGEK+VGPGEPFDQICNVSLPR LDPHSVEPY
Subjt:  DLQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPY

Query:  ELSASLGYMVQVLNLVVQYLAAPALHNSGFAVSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENR
        ELSASLGYMVQ+LNLVVQYLAAPALH SGFA SCSRIWQRDSYW+ACPSSRSNEYPVF+PRQSYCSTSGENSWSDKSSSNFGVASLESERKP LSSLENR
Subjt:  ELSASLGYMVQVLNLVVQYLAAPALHNSGFAVSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENR

Query:  SFNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSS
        SFNYSSASPHSIE+HKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRS K++QK  KS WN+NSI+SS
Subjt:  SFNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSS

Query:  MLLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ
        ML ES HSQIMKTNYE NLPSSASSYLYAT+FSD GKNDS+IEGWDL+EHPT  PPPSQAEDIEHWTRAM IDATKQ
Subjt:  MLLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ

XP_008441239.1 PREDICTED: uncharacterized protein LOC103485428 isoform X1 [Cucumis melo]1.1e-24593.08Show/hide
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKVGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSF
        MNRKFCNCAICENSNQASIC GCVN RLNDYN++LKSLRARRD+LYSRLSDVLVAK GKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGK+EIEM SF
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKVGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSF

Query:  DLQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPY
        DLQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQR+VLVRG+K+VGPGEPFDQICNVSLPR LDPHSVEPY
Subjt:  DLQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPY

Query:  ELSASLGYMVQVLNLVVQYLAAPALHNSGFAVSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENR
        ELSASLGYMVQ+LNLVVQYLAAPALHNSGFA SCSRIWQRDSYW+ACPSSRSNEYPVF+PRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENR
Subjt:  ELSASLGYMVQVLNLVVQYLAAPALHNSGFAVSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENR

Query:  SFNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSS
        SFNYSSASPHSIE+HKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKM SSRS K++QK  KS WN+NSISSS
Subjt:  SFNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSS

Query:  MLLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ
        ML ES HSQIMKTNYE NLPSSASSYLYAT+FSDAGKNDSTIEGWDL+EHPT  PPPSQAEDIEHWTRAMFIDATK+
Subjt:  MLLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ

XP_008441240.1 PREDICTED: uncharacterized protein LOC103485428 isoform X2 [Cucumis melo]1.1e-24593.08Show/hide
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKVGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSF
        MNRKFCNCAICENSNQASIC GCVN RLNDYN++LKSLRARRD+LYSRLSDVLVAK GKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGK+EIEM SF
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKVGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSF

Query:  DLQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPY
        DLQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQR+VLVRG+K+VGPGEPFDQICNVSLPR LDPHSVEPY
Subjt:  DLQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPY

Query:  ELSASLGYMVQVLNLVVQYLAAPALHNSGFAVSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENR
        ELSASLGYMVQ+LNLVVQYLAAPALHNSGFA SCSRIWQRDSYW+ACPSSRSNEYPVF+PRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENR
Subjt:  ELSASLGYMVQVLNLVVQYLAAPALHNSGFAVSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENR

Query:  SFNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSS
        SFNYSSASPHSIE+HKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKM SSRS K++QK  KS WN+NSISSS
Subjt:  SFNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSS

Query:  MLLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ
        ML ES HSQIMKTNYE NLPSSASSYLYAT+FSDAGKNDSTIEGWDL+EHPT  PPPSQAEDIEHWTRAMFIDATK+
Subjt:  MLLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ

XP_023549965.1 uncharacterized protein LOC111808299 [Cucurbita pepo subsp. pepo]2.5e-24091.39Show/hide
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKVGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSF
        MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRD+LYSRLSDVLVAK GKADDQLNWRVTRNEKL+RLREKLRR REQLEQGK+EIEMTS+
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKVGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSF

Query:  DLQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPY
        DL+LK+AMLESARSVLEKQRVEQLEKAYPDLISTK LGHMAITSERLHKQSVV+KQ+CKLFPQR+VLV GE K GPGE FDQICNVSLPRRLDPHSV+P+
Subjt:  DLQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPY

Query:  ELSASLGYMVQVLNLVVQYLAAPALHNSGFAVSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENR
        ELSASLGYMVQ+LNL+V  LAAPALHNSGFA SCSRIWQR+SYWDACPSS+SNEYP+FIPRQ+YCSTSGENSWSDKSSSNFGVASLESERKPHLSSLEN+
Subjt:  ELSASLGYMVQVLNLVVQYLAAPALHNSGFAVSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENR

Query:  SFNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSS
        SFNYSSAS HSIETHKDLQ GIALLKKSVAC+TAY YNSL LDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPK+VQKLNKSAWN+NSISSS
Subjt:  SFNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSS

Query:  MLLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATK
        MLLESAHSQIMKTNYE N PSSASSYLYAT+FSDA KNDSTIEGWDLIEHPT  PPPSQAEDIEHWTRAMFIDATK
Subjt:  MLLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATK

XP_038886525.1 uncharacterized protein LOC120076698 [Benincasa hispida]1.6e-24793.29Show/hide
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKVGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSF
        MNRKFCNCA CENSNQAS C GCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAK GKADDQLNWRVTRNEKLTRLREKLRRSREQL+QGK+EIEMTSF
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKVGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSF

Query:  DLQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPY
        DLQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLG MAITSERLHKQSVVIKQ+CKLFPQR+VLV GEKKVGPGEPFDQICNVSLPRRLDPHSVEPY
Subjt:  DLQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPY

Query:  ELSASLGYMVQVLNLVVQYLAAPALHNSGFAVSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENR
        ELSASLGYMVQ+LNLVVQYLAAPALHNSGFA SCSRIWQR SYWD CPSS+S+EYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESE+KPHLS LE+R
Subjt:  ELSASLGYMVQVLNLVVQYLAAPALHNSGFAVSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENR

Query:  SFNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSS
        SFNYSSASPHSIE+HKDLQKGIALLKKSVACVTAYGYNSLSLD+PSEASTFEAFAKLLATLSSSKEVRSVF+LKMASSRSPK+VQKLNKSAWN NS+S S
Subjt:  SFNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSS

Query:  MLLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ
        ML ESAHSQIMKTNYE NLPSSASSYLYAT+FSDAGKNDSTIEGWDL+EHPT  PPPSQAEDIEHWTRAMFIDATKQ
Subjt:  MLLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ

TrEMBL top hitse value%identityAlignment
A0A0A0LQP7 Uncharacterized protein3.4e-24392.24Show/hide
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKVGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSF
        MNRKFCNCAICENSNQASIC GCVN RLNDYNS+LKSLRARRD+LYSRLSDVLVAK GKADDQLNWRVTRNEKLT LREKLRRSREQLEQGK+EIEM SF
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKVGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSF

Query:  DLQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPY
        DLQLKYAMLESARSVLEKQR+EQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQR+VLVRGEK+VGPGEPFDQICNVSLPR LDPHSVEPY
Subjt:  DLQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPY

Query:  ELSASLGYMVQVLNLVVQYLAAPALHNSGFAVSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENR
        ELSASLGYMVQ+LNLVVQYLAAPALH SGFA SCSRIWQRDSYW+ACPSSRSNEYPVF+PRQSYCSTSGENSWSDKSSSNFGVASLESERKP LSSLENR
Subjt:  ELSASLGYMVQVLNLVVQYLAAPALHNSGFAVSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENR

Query:  SFNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSS
        SFNYSSASPHSIE+HKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRS K++QK  KS WN+NSI+SS
Subjt:  SFNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSS

Query:  MLLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ
        ML ES HSQIMKTNYE NLPSSASSYLYAT+FSD GKNDS+IEGWDL+EHPT  PPPSQAEDIEHWTRAM IDATKQ
Subjt:  MLLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ

A0A1S3B2Y9 uncharacterized protein LOC103485428 isoform X15.5e-24693.08Show/hide
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKVGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSF
        MNRKFCNCAICENSNQASIC GCVN RLNDYN++LKSLRARRD+LYSRLSDVLVAK GKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGK+EIEM SF
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKVGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSF

Query:  DLQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPY
        DLQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQR+VLVRG+K+VGPGEPFDQICNVSLPR LDPHSVEPY
Subjt:  DLQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPY

Query:  ELSASLGYMVQVLNLVVQYLAAPALHNSGFAVSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENR
        ELSASLGYMVQ+LNLVVQYLAAPALHNSGFA SCSRIWQRDSYW+ACPSSRSNEYPVF+PRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENR
Subjt:  ELSASLGYMVQVLNLVVQYLAAPALHNSGFAVSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENR

Query:  SFNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSS
        SFNYSSASPHSIE+HKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKM SSRS K++QK  KS WN+NSISSS
Subjt:  SFNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSS

Query:  MLLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ
        ML ES HSQIMKTNYE NLPSSASSYLYAT+FSDAGKNDSTIEGWDL+EHPT  PPPSQAEDIEHWTRAMFIDATK+
Subjt:  MLLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ

A0A1S3B3M2 uncharacterized protein LOC103485428 isoform X25.5e-24693.08Show/hide
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKVGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSF
        MNRKFCNCAICENSNQASIC GCVN RLNDYN++LKSLRARRD+LYSRLSDVLVAK GKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGK+EIEM SF
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKVGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSF

Query:  DLQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPY
        DLQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQR+VLVRG+K+VGPGEPFDQICNVSLPR LDPHSVEPY
Subjt:  DLQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPY

Query:  ELSASLGYMVQVLNLVVQYLAAPALHNSGFAVSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENR
        ELSASLGYMVQ+LNLVVQYLAAPALHNSGFA SCSRIWQRDSYW+ACPSSRSNEYPVF+PRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENR
Subjt:  ELSASLGYMVQVLNLVVQYLAAPALHNSGFAVSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENR

Query:  SFNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSS
        SFNYSSASPHSIE+HKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKM SSRS K++QK  KS WN+NSISSS
Subjt:  SFNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSS

Query:  MLLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ
        ML ES HSQIMKTNYE NLPSSASSYLYAT+FSDAGKNDSTIEGWDL+EHPT  PPPSQAEDIEHWTRAMFIDATK+
Subjt:  MLLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ

A0A5A7T0W2 UV radiation resistance protein/autophagy-related protein 145.5e-24693.08Show/hide
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKVGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSF
        MNRKFCNCAICENSNQASIC GCVN RLNDYN++LKSLRARRD+LYSRLSDVLVAK GKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGK+EIEM SF
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKVGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSF

Query:  DLQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPY
        DLQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQR+VLVRG+K+VGPGEPFDQICNVSLPR LDPHSVEPY
Subjt:  DLQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPY

Query:  ELSASLGYMVQVLNLVVQYLAAPALHNSGFAVSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENR
        ELSASLGYMVQ+LNLVVQYLAAPALHNSGFA SCSRIWQRDSYW+ACPSSRSNEYPVF+PRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENR
Subjt:  ELSASLGYMVQVLNLVVQYLAAPALHNSGFAVSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENR

Query:  SFNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSS
        SFNYSSASPHSIE+HKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKM SSRS K++QK  KS WN+NSISSS
Subjt:  SFNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSS

Query:  MLLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ
        ML ES HSQIMKTNYE NLPSSASSYLYAT+FSDAGKNDSTIEGWDL+EHPT  PPPSQAEDIEHWTRAMFIDATK+
Subjt:  MLLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ

A0A6J1FFY6 uncharacterized protein LOC1114451317.7e-24091.18Show/hide
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKVGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSF
        MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLR RRD LYSRLSDVLVAK GKADDQLNWRVTRNEKL+RLREKLRR REQLEQGK+EIEMTS+
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKVGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSF

Query:  DLQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPY
        DL+LK+AMLESARSVLEKQRVEQLEKAYPDLISTK LGHMAITSERLHKQSVV+KQ+CKLFPQR+VLV GE K GPGE FDQICNVSLPRRLDPHSV+P+
Subjt:  DLQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPY

Query:  ELSASLGYMVQVLNLVVQYLAAPALHNSGFAVSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENR
        ELSASLGYMVQ+LNL+V  LAAPALHNSGFA SCSRIWQR+SYWDACPSS+SNEYP+FIPRQ+YCSTSGENSWSDKSSSNFGVASLESERKPHLSSLEN+
Subjt:  ELSASLGYMVQVLNLVVQYLAAPALHNSGFAVSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENR

Query:  SFNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSS
        SFNYSSAS HSIETHKDLQ GIALLKKSVAC+TAY YNSL LDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPK+VQKLNKSAWN+NSISSS
Subjt:  SFNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSS

Query:  MLLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATK
        MLLESAHSQIMKTNYE N PSSASSYLYAT+FSDA KNDSTIEGWDLIEHPT  PPPSQAEDIEHWTRAMFIDATK
Subjt:  MLLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G77890.1 DNA-directed RNA polymerase II protein3.1e-10850.33Show/hide
Query:  KFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKVGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFDLQ
        K   CA+C  S + SIC+ CVN  LN+Y   L SL++ R++ Y RLS +LV K  KA  Q  W+  +NEKL +LREKL+   E+L+Q K+     S +L+
Subjt:  KFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKVGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFDLQ

Query:  LKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYELS
         +Y ++ES    LE+ RV QLE  Y D I    L ++ +TSERL+KQ++V+KQ+CKLFP  +V V G+ K G    +DQICN  LP+ L+P SV P EL+
Subjt:  LKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYELS

Query:  ASLGYMVQVLNLVVQYLAAPALHNSGFAVSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESER--KPHLSSLENRS
        ASLGYMVQ+LNLVV  L+ PALHN GFA SCSRIW+RDSYW++ PSS SN YP+F+P   + S   ++SW+ + ++NFGV SL+S+   +     L+   
Subjt:  ASLGYMVQVLNLVVQYLAAPALHNSGFAVSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESER--KPHLSSLENRS

Query:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSM
         + SSASPHS+ET ++LQ+GIA LK+SVA +T YGY SLSL+VPS  STFE FAKLLATLSS KEV+S  SL ++SS   K   + NKS WNLNS SSS 
Subjt:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSM

Query:  LLESAHSQIMKTNYE-INLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHP
        LL S+H+Q    N    N+P+   SY+   +F D  K+ ++I  W+L+E+P
Subjt:  LLESAHSQIMKTNYE-INLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHP

AT1G77890.2 DNA-directed RNA polymerase II protein1.7e-10148.78Show/hide
Query:  KFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKVGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFDLQ
        K   CA+C  S + SIC+ CVN  LN+Y   L SL++ R++ Y RLS +LV K  KA  Q  W+  +NEKL +LREKL+   E+L+Q K+     S +L+
Subjt:  KFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKVGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFDLQ

Query:  LKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYELS
         +Y ++ES    LE+ RV QLE  Y D I    L +           ++V+KQ+CKLFP  +V V G+ K G    +DQICN  LP+ L+P SV P EL+
Subjt:  LKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYELS

Query:  ASLGYMVQVLNLVVQYLAAPALHNSGFAVSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESER--KPHLSSLENRS
        ASLGYMVQ+LNLVV  L+ PALHN GFA SCSRIW+RDSYW++ PSS SN YP+F+P   + S   ++SW+ + ++NFGV SL+S+   +     L+   
Subjt:  ASLGYMVQVLNLVVQYLAAPALHNSGFAVSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESER--KPHLSSLENRS

Query:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSM
         + SSASPHS+ET ++LQ+GIA LK+SVA +T YGY SLSL+VPS  STFE FAKLLATLSS KEV+S  SL ++SS   K   + NKS WNLNS SSS 
Subjt:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSM

Query:  LLESAHSQIMKTNYE-INLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHP
        LL S+H+Q    N    N+P+   SY+   +F D  K+ ++I  W+L+E+P
Subjt:  LLESAHSQIMKTNYE-INLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHP

AT1G77890.3 DNA-directed RNA polymerase II protein1.0e-10650.33Show/hide
Query:  KFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKVGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFDLQ
        K   CA+C  S + SIC+ CVN  LN+Y   L SL++ R++ Y RLS +LV K  KA  Q  W+  +NEKL +LREKL+   E+L+Q K+     S +L+
Subjt:  KFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKVGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFDLQ

Query:  LKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYELS
         +Y ++ES    LE+ RV QLE  Y D I    L  + +TSERL+KQ++V+KQ+CKLFP  +V V G+ K G    +DQICN  LP+ L+P SV P EL+
Subjt:  LKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYELS

Query:  ASLGYMVQVLNLVVQYLAAPALHNSGFAVSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESER--KPHLSSLENRS
        ASLGYMVQ+LNLVV  L+ PALHN GFA SCSRIW+RDSYW++ PSS SN YP+F+P   + S   ++SW+ + ++NFGV SL+S+   +     L+   
Subjt:  ASLGYMVQVLNLVVQYLAAPALHNSGFAVSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESER--KPHLSSLENRS

Query:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSM
         + SSASPHS+ET ++LQ+GIA LK+SVA +T YGY SLSL+VPS  STFE FAKLLATLSS KEV+S  SL ++SS   K   + NKS WNLNS SSS 
Subjt:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSM

Query:  LLESAHSQIMKTNYE-INLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHP
        LL S+H+Q    N    N+P+   SY+   +F D  K+ ++I  W+L+E+P
Subjt:  LLESAHSQIMKTNYE-INLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHP

AT4G08540.1 DNA-directed RNA polymerase II protein7.1e-16965.9Show/hide
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKVGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSF
        M ++  NCAIC+N+N+  IC  CVNHRL +YN+ LKSL+ RRD L SR +++L +K GKADDQ NWR+ +NEK+++L++KL+ ++E + QGK +IE  S 
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKVGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSF

Query:  DLQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPY
        DL++KY +L+SARS LEK RVEQ+EK +P+LI T++LGHMAI+SERLHKQSVV+KQ+CKLFP R+V   GE + G    +D ICN  LP  LDPHS+   
Subjt:  DLQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPY

Query:  ELSASLGYMVQVLNLVVQYLAAPALHNSGFAVSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERK-PHLSSLEN
        EL+ SLGYMVQ+LNLVV  LAAPALH+SGFA SCSRIWQRDSYWD   S+RSNEYP+FIPR++YCSTS ENSW+DK+SSNFGVAS+ES+RK P L S  +
Subjt:  ELSASLGYMVQVLNLVVQYLAAPALHNSGFAVSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERK-PHLSSLEN

Query:  RSFNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISS
         SF YSSASPHSIE+H+DLQKGIALLKKSVAC+TAY YNSL L+VP EASTFEAFAKLLATLSSSKEVRSVFSLKMASSRS K  Q+LNKS WN +S+ S
Subjt:  RSFNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISS

Query:  SMLLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ
        S LLESAH     T+Y  + P+S +SYL AT+ S    ND  + GWDL+EHP   PPPSQ+ED+EHWTRAMFIDA K+
Subjt:  SMLLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCGGAAATTCTGCAACTGCGCTATCTGTGAGAATTCAAATCAAGCTTCCATTTGCATAGGTTGCGTGAATCACAGATTGAATGACTACAACTCTACGTTAAAATC
ATTGAGAGCTCGGCGAGATATGCTGTATTCGAGGTTGAGTGACGTGCTTGTGGCAAAGGTTGGGAAGGCCGACGATCAATTAAACTGGAGAGTGACTCGGAATGAGAAAC
TTACGAGGTTAAGGGAGAAACTCCGACGCAGCAGAGAGCAACTCGAGCAAGGAAAGTCAGAGATTGAGATGACATCCTTTGATCTCCAATTGAAATATGCAATGCTTGAA
TCAGCCCGTTCAGTGTTGGAAAAACAACGAGTCGAACAACTGGAGAAGGCCTATCCTGACCTTATTAGCACCAAGAATCTGGGACATATGGCAATCACCTCTGAACGCCT
TCACAAACAATCTGTGGTTATAAAACAACTATGCAAGTTGTTTCCACAACGTCAGGTTTTAGTTCGTGGAGAGAAGAAAGTGGGACCTGGTGAACCATTTGATCAAATCT
GTAATGTGAGCTTACCAAGAAGACTAGATCCCCATTCTGTTGAGCCGTATGAGCTTTCAGCTTCTTTAGGATATATGGTGCAAGTTCTAAATCTTGTTGTTCAATATTTG
GCTGCACCTGCACTTCACAACTCGGGTTTTGCAGTTTCTTGTTCACGCATATGGCAAAGGGATTCATATTGGGACGCTTGTCCATCTTCTCGAAGCAATGAGTATCCAGT
TTTTATACCACGTCAAAGTTATTGTTCAACAAGTGGGGAAAATTCATGGTCTGATAAAAGCTCTAGTAACTTTGGTGTTGCTTCGTTGGAATCAGAGAGGAAACCACATT
TAAGTTCACTTGAAAATAGAAGCTTCAATTATTCCTCCGCTTCTCCACATTCTATTGAAACGCACAAGGATTTGCAGAAAGGGATTGCCCTCCTAAAGAAAAGTGTAGCA
TGTGTCACTGCATACGGGTATAACTCCCTTTCTTTAGACGTTCCGTCTGAAGCTTCCACTTTTGAAGCATTTGCTAAATTATTAGCTACTCTTTCTTCATCAAAGGAAGT
GCGTTCTGTTTTTTCCCTCAAAATGGCTTCTTCCAGGTCCCCCAAGAACGTTCAGAAACTGAACAAATCTGCATGGAATCTGAATTCCATCTCATCCAGCATGCTGTTAG
AGAGTGCACATTCACAAATAATGAAAACCAATTATGAGATTAACCTTCCAAGTTCTGCTTCGAGTTATCTTTATGCCACCCAATTCTCTGATGCTGGAAAGAATGATTCC
ACCATTGAAGGATGGGACCTCATAGAGCATCCAACTTTGCCTCCTCCTTCCCAAGCTGAAGACATTGAGCATTGGACTCGAGCAATGTTCATCGATGCAACCAAACAGTA
A
mRNA sequenceShow/hide mRNA sequence
ATGAATCGGAAATTCTGCAACTGCGCTATCTGTGAGAATTCAAATCAAGCTTCCATTTGCATAGGTTGCGTGAATCACAGATTGAATGACTACAACTCTACGTTAAAATC
ATTGAGAGCTCGGCGAGATATGCTGTATTCGAGGTTGAGTGACGTGCTTGTGGCAAAGGTTGGGAAGGCCGACGATCAATTAAACTGGAGAGTGACTCGGAATGAGAAAC
TTACGAGGTTAAGGGAGAAACTCCGACGCAGCAGAGAGCAACTCGAGCAAGGAAAGTCAGAGATTGAGATGACATCCTTTGATCTCCAATTGAAATATGCAATGCTTGAA
TCAGCCCGTTCAGTGTTGGAAAAACAACGAGTCGAACAACTGGAGAAGGCCTATCCTGACCTTATTAGCACCAAGAATCTGGGACATATGGCAATCACCTCTGAACGCCT
TCACAAACAATCTGTGGTTATAAAACAACTATGCAAGTTGTTTCCACAACGTCAGGTTTTAGTTCGTGGAGAGAAGAAAGTGGGACCTGGTGAACCATTTGATCAAATCT
GTAATGTGAGCTTACCAAGAAGACTAGATCCCCATTCTGTTGAGCCGTATGAGCTTTCAGCTTCTTTAGGATATATGGTGCAAGTTCTAAATCTTGTTGTTCAATATTTG
GCTGCACCTGCACTTCACAACTCGGGTTTTGCAGTTTCTTGTTCACGCATATGGCAAAGGGATTCATATTGGGACGCTTGTCCATCTTCTCGAAGCAATGAGTATCCAGT
TTTTATACCACGTCAAAGTTATTGTTCAACAAGTGGGGAAAATTCATGGTCTGATAAAAGCTCTAGTAACTTTGGTGTTGCTTCGTTGGAATCAGAGAGGAAACCACATT
TAAGTTCACTTGAAAATAGAAGCTTCAATTATTCCTCCGCTTCTCCACATTCTATTGAAACGCACAAGGATTTGCAGAAAGGGATTGCCCTCCTAAAGAAAAGTGTAGCA
TGTGTCACTGCATACGGGTATAACTCCCTTTCTTTAGACGTTCCGTCTGAAGCTTCCACTTTTGAAGCATTTGCTAAATTATTAGCTACTCTTTCTTCATCAAAGGAAGT
GCGTTCTGTTTTTTCCCTCAAAATGGCTTCTTCCAGGTCCCCCAAGAACGTTCAGAAACTGAACAAATCTGCATGGAATCTGAATTCCATCTCATCCAGCATGCTGTTAG
AGAGTGCACATTCACAAATAATGAAAACCAATTATGAGATTAACCTTCCAAGTTCTGCTTCGAGTTATCTTTATGCCACCCAATTCTCTGATGCTGGAAAGAATGATTCC
ACCATTGAAGGATGGGACCTCATAGAGCATCCAACTTTGCCTCCTCCTTCCCAAGCTGAAGACATTGAGCATTGGACTCGAGCAATGTTCATCGATGCAACCAAACAGTA
A
Protein sequenceShow/hide protein sequence
MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKVGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFDLQLKYAMLE
SARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYELSASLGYMVQVLNLVVQYL
AAPALHNSGFAVSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENRSFNYSSASPHSIETHKDLQKGIALLKKSVA
CVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSMLLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDS
TIEGWDLIEHPTLPPPSQAEDIEHWTRAMFIDATKQ