CuGenDBv2

Lsi03G004500 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi03G004500
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: DNA-directed RNA polymerase II protein
Genome location: chr03:5103845..5110720
RNA-Seq Expression: Lsi03G004500
Synteny: Lsi03G004500
Gene Ontology terms:
    GO:0035493 - SNARE complex assembly (biological process)
    GO:0000323 - lytic vacuole (cellular component)
    GO:0005768 - endosome (cellular component)
    GO:0000149 - SNARE binding (molecular function)
InterPro domains: IPR018791 - UV radiation resistance protein/autophagy-related protein 14
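
The genome location in the record above is written as `chromosome:start..end`. A minimal sketch of how such a string might be parsed is given here; the function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which is a common convention for this notation but is not stated on this page.

```python
import re


def parse_location(location):
    """Split a 'chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr03:5103845..5110720")
# Assumption: both ends are inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption the gene region covers 6876 bp; with half-open coordinates it would be one base shorter.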


Homology
GenBank top hits (columns: hit; e value; % identity; alignment)
XP_004138644.2 uncharacterized protein LOC101217421 [Cucumis sativus]; e value: 2.5e-245; % identity: 92.65
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFD
        MNRKFCNCAICENSNQASIC GCVN RLNDYNS+LKSLRARRD+LYSRLSDVLVAKGKADDQLNWRVTRNEKLT LREKLRRSREQLEQGK+EIEM SFD
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFD

Query:  LQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYE
        LQLKYAMLESARSVLEKQR+EQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQR+VLVRGEK+VGPGEPFDQICNVSLPR LDPHSVEPYE
Subjt:  LQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYE

Query:  LSASLGYMVQVLNLVVQYLAAPALHNSGFAGSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENRS
        LSASLGYMVQ+LNLVVQYLAAPALH SGFAGSCSRIWQRDSYW+ACPSSRSNEYPVF+PRQSYCSTSGENSWSDKSSSNFGVASLESERKP LSSLENRS
Subjt:  LSASLGYMVQVLNLVVQYLAAPALHNSGFAGSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENRS

Query:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSM
        FNYSSASPHSIE+HKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRS K++QK  KS WN+NSI+SSM
Subjt:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSM

Query:  LLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ
        L ES HSQIMKTNYE NLPSSASSYLYAT+FSD GKNDS+IEGWDL+EHPT  PPPSQAEDIEHWTRAM IDATKQ
Subjt:  LLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ

XP_008441239.1 PREDICTED: uncharacterized protein LOC103485428 isoform X1 [Cucumis melo]; e value: 4.2e-248; % identity: 93.49
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFD
        MNRKFCNCAICENSNQASIC GCVN RLNDYN++LKSLRARRD+LYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGK+EIEM SFD
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFD

Query:  LQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYE
        LQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQR+VLVRG+K+VGPGEPFDQICNVSLPR LDPHSVEPYE
Subjt:  LQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYE

Query:  LSASLGYMVQVLNLVVQYLAAPALHNSGFAGSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENRS
        LSASLGYMVQ+LNLVVQYLAAPALHNSGFAGSCSRIWQRDSYW+ACPSSRSNEYPVF+PRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENRS
Subjt:  LSASLGYMVQVLNLVVQYLAAPALHNSGFAGSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENRS

Query:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSM
        FNYSSASPHSIE+HKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKM SSRS K++QK  KS WN+NSISSSM
Subjt:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSM

Query:  LLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ
        L ES HSQIMKTNYE NLPSSASSYLYAT+FSDAGKNDSTIEGWDL+EHPT  PPPSQAEDIEHWTRAMFIDATK+
Subjt:  LLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ

XP_008441240.1 PREDICTED: uncharacterized protein LOC103485428 isoform X2 [Cucumis melo]; e value: 4.2e-248; % identity: 93.49
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFD
        MNRKFCNCAICENSNQASIC GCVN RLNDYN++LKSLRARRD+LYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGK+EIEM SFD
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFD

Query:  LQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYE
        LQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQR+VLVRG+K+VGPGEPFDQICNVSLPR LDPHSVEPYE
Subjt:  LQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYE

Query:  LSASLGYMVQVLNLVVQYLAAPALHNSGFAGSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENRS
        LSASLGYMVQ+LNLVVQYLAAPALHNSGFAGSCSRIWQRDSYW+ACPSSRSNEYPVF+PRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENRS
Subjt:  LSASLGYMVQVLNLVVQYLAAPALHNSGFAGSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENRS

Query:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSM
        FNYSSASPHSIE+HKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKM SSRS K++QK  KS WN+NSISSSM
Subjt:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSM

Query:  LLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ
        L ES HSQIMKTNYE NLPSSASSYLYAT+FSDAGKNDSTIEGWDL+EHPT  PPPSQAEDIEHWTRAMFIDATK+
Subjt:  LLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ

XP_023549965.1 uncharacterized protein LOC111808299 [Cucurbita pepo subsp. pepo]; e value: 6.9e-243; % identity: 91.79
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFD
        MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRD+LYSRLSDVLVAKGKADDQLNWRVTRNEKL+RLREKLRR REQLEQGK+EIEMTS+D
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFD

Query:  LQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYE
        L+LK+AMLESARSVLEKQRVEQLEKAYPDLISTK LGHMAITSERLHKQSVV+KQ+CKLFPQR+VLV GE K GPGE FDQICNVSLPRRLDPHSV+P+E
Subjt:  LQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYE

Query:  LSASLGYMVQVLNLVVQYLAAPALHNSGFAGSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENRS
        LSASLGYMVQ+LNL+V  LAAPALHNSGFAGSCSRIWQR+SYWDACPSS+SNEYP+FIPRQ+YCSTSGENSWSDKSSSNFGVASLESERKPHLSSLEN+S
Subjt:  LSASLGYMVQVLNLVVQYLAAPALHNSGFAGSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENRS

Query:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSM
        FNYSSAS HSIETHKDLQ GIALLKKSVAC+TAY YNSL LDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPK+VQKLNKSAWN+NSISSSM
Subjt:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSM

Query:  LLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATK
        LLESAHSQIMKTNYE N PSSASSYLYAT+FSDA KNDSTIEGWDLIEHPT  PPPSQAEDIEHWTRAMFIDATK
Subjt:  LLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATK

XP_038886525.1 uncharacterized protein LOC120076698 [Benincasa hispida]; e value: 5.8e-250; % identity: 93.7
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFD
        MNRKFCNCA CENSNQAS C GCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQL+QGK+EIEMTSFD
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFD

Query:  LQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYE
        LQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLG MAITSERLHKQSVVIKQ+CKLFPQR+VLV GEKKVGPGEPFDQICNVSLPRRLDPHSVEPYE
Subjt:  LQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYE

Query:  LSASLGYMVQVLNLVVQYLAAPALHNSGFAGSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENRS
        LSASLGYMVQ+LNLVVQYLAAPALHNSGFAGSCSRIWQR SYWD CPSS+S+EYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESE+KPHLS LE+RS
Subjt:  LSASLGYMVQVLNLVVQYLAAPALHNSGFAGSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENRS

Query:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSM
        FNYSSASPHSIE+HKDLQKGIALLKKSVACVTAYGYNSLSLD+PSEASTFEAFAKLLATLSSSKEVRSVF+LKMASSRSPK+VQKLNKSAWN NS+S SM
Subjt:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSM

Query:  LLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ
        L ESAHSQIMKTNYE NLPSSASSYLYAT+FSDAGKNDSTIEGWDL+EHPT  PPPSQAEDIEHWTRAMFIDATKQ
Subjt:  LLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ
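
The % identity values listed with each hit can be related to the middle line of each alignment block. The following is a sketch under stated assumptions, not the database's own calculation: it assumes the usual BLAST-style midline, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The function name `identity_from_midline` and its `aligned_length` parameter are hypothetical.

```python
def identity_from_midline(midline, aligned_length=None):
    """Return (identical, length, percent) for a BLAST-style midline.

    Letters count as identities; '+' and spaces do not. Pass
    aligned_length when trailing spaces may have been stripped from the
    midline, so the denominator still reflects the full alignment.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    length = aligned_length if aligned_length is not None else len(midline)
    percent = 100.0 * identical / length if length else 0.0
    return identical, length, percent


# First 24 columns of the midline from the Cucumis sativus hit above.
midline = "MNRKFCNCAICENSNQASIC GCV"
identical, length, percent = identity_from_midline(midline)
print(identical, length, round(percent, 2))
```

Applied to a whole hit, the midlines of all blocks would be concatenated first; the result may differ slightly from the reported figure if the tool counts gap columns differently.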

TrEMBL top hits (columns: hit; e value; % identity; alignment)
A0A0A0LQP7 Uncharacterized protein; e value: 1.2e-245; % identity: 92.65
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFD
        MNRKFCNCAICENSNQASIC GCVN RLNDYNS+LKSLRARRD+LYSRLSDVLVAKGKADDQLNWRVTRNEKLT LREKLRRSREQLEQGK+EIEM SFD
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFD

Query:  LQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYE
        LQLKYAMLESARSVLEKQR+EQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQR+VLVRGEK+VGPGEPFDQICNVSLPR LDPHSVEPYE
Subjt:  LQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYE

Query:  LSASLGYMVQVLNLVVQYLAAPALHNSGFAGSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENRS
        LSASLGYMVQ+LNLVVQYLAAPALH SGFAGSCSRIWQRDSYW+ACPSSRSNEYPVF+PRQSYCSTSGENSWSDKSSSNFGVASLESERKP LSSLENRS
Subjt:  LSASLGYMVQVLNLVVQYLAAPALHNSGFAGSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENRS

Query:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSM
        FNYSSASPHSIE+HKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRS K++QK  KS WN+NSI+SSM
Subjt:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSM

Query:  LLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ
        L ES HSQIMKTNYE NLPSSASSYLYAT+FSD GKNDS+IEGWDL+EHPT  PPPSQAEDIEHWTRAM IDATKQ
Subjt:  LLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ

A0A1S3B2Y9 uncharacterized protein LOC103485428 isoform X1; e value: 2.0e-248; % identity: 93.49
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFD
        MNRKFCNCAICENSNQASIC GCVN RLNDYN++LKSLRARRD+LYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGK+EIEM SFD
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFD

Query:  LQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYE
        LQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQR+VLVRG+K+VGPGEPFDQICNVSLPR LDPHSVEPYE
Subjt:  LQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYE

Query:  LSASLGYMVQVLNLVVQYLAAPALHNSGFAGSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENRS
        LSASLGYMVQ+LNLVVQYLAAPALHNSGFAGSCSRIWQRDSYW+ACPSSRSNEYPVF+PRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENRS
Subjt:  LSASLGYMVQVLNLVVQYLAAPALHNSGFAGSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENRS

Query:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSM
        FNYSSASPHSIE+HKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKM SSRS K++QK  KS WN+NSISSSM
Subjt:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSM

Query:  LLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ
        L ES HSQIMKTNYE NLPSSASSYLYAT+FSDAGKNDSTIEGWDL+EHPT  PPPSQAEDIEHWTRAMFIDATK+
Subjt:  LLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ

A0A1S3B3M2 uncharacterized protein LOC103485428 isoform X2; e value: 2.0e-248; % identity: 93.49
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFD
        MNRKFCNCAICENSNQASIC GCVN RLNDYN++LKSLRARRD+LYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGK+EIEM SFD
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFD

Query:  LQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYE
        LQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQR+VLVRG+K+VGPGEPFDQICNVSLPR LDPHSVEPYE
Subjt:  LQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYE

Query:  LSASLGYMVQVLNLVVQYLAAPALHNSGFAGSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENRS
        LSASLGYMVQ+LNLVVQYLAAPALHNSGFAGSCSRIWQRDSYW+ACPSSRSNEYPVF+PRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENRS
Subjt:  LSASLGYMVQVLNLVVQYLAAPALHNSGFAGSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENRS

Query:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSM
        FNYSSASPHSIE+HKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKM SSRS K++QK  KS WN+NSISSSM
Subjt:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSM

Query:  LLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ
        L ES HSQIMKTNYE NLPSSASSYLYAT+FSDAGKNDSTIEGWDL+EHPT  PPPSQAEDIEHWTRAMFIDATK+
Subjt:  LLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ

A0A5A7T0W2 UV radiation resistance protein/autophagy-related protein 14; e value: 2.0e-248; % identity: 93.49
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFD
        MNRKFCNCAICENSNQASIC GCVN RLNDYN++LKSLRARRD+LYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGK+EIEM SFD
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFD

Query:  LQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYE
        LQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQR+VLVRG+K+VGPGEPFDQICNVSLPR LDPHSVEPYE
Subjt:  LQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYE

Query:  LSASLGYMVQVLNLVVQYLAAPALHNSGFAGSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENRS
        LSASLGYMVQ+LNLVVQYLAAPALHNSGFAGSCSRIWQRDSYW+ACPSSRSNEYPVF+PRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENRS
Subjt:  LSASLGYMVQVLNLVVQYLAAPALHNSGFAGSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENRS

Query:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSM
        FNYSSASPHSIE+HKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKM SSRS K++QK  KS WN+NSISSSM
Subjt:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSM

Query:  LLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ
        L ES HSQIMKTNYE NLPSSASSYLYAT+FSDAGKNDSTIEGWDL+EHPT  PPPSQAEDIEHWTRAMFIDATK+
Subjt:  LLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ

A0A6J1FFY6 uncharacterized protein LOC111445131; e value: 2.2e-242; % identity: 91.58
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFD
        MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLR RRD LYSRLSDVLVAKGKADDQLNWRVTRNEKL+RLREKLRR REQLEQGK+EIEMTS+D
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFD

Query:  LQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYE
        L+LK+AMLESARSVLEKQRVEQLEKAYPDLISTK LGHMAITSERLHKQSVV+KQ+CKLFPQR+VLV GE K GPGE FDQICNVSLPRRLDPHSV+P+E
Subjt:  LQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYE

Query:  LSASLGYMVQVLNLVVQYLAAPALHNSGFAGSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENRS
        LSASLGYMVQ+LNL+V  LAAPALHNSGFAGSCSRIWQR+SYWDACPSS+SNEYP+FIPRQ+YCSTSGENSWSDKSSSNFGVASLESERKPHLSSLEN+S
Subjt:  LSASLGYMVQVLNLVVQYLAAPALHNSGFAGSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENRS

Query:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSM
        FNYSSAS HSIETHKDLQ GIALLKKSVAC+TAY YNSL LDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPK+VQKLNKSAWN+NSISSSM
Subjt:  FNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSM

Query:  LLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATK
        LLESAHSQIMKTNYE N PSSASSYLYAT+FSDA KNDSTIEGWDLIEHPT  PPPSQAEDIEHWTRAMFIDATK
Subjt:  LLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATK

SwissProt top hits (columns: hit; e value; % identity; alignment)
No hits found
Arabidopsis top hits (columns: hit; e value; % identity; alignment)
AT1G77890.1 DNA-directed RNA polymerase II protein; e value: 1.1e-110; % identity: 50.67
Query:  KFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFDLQL
        K   CA+C  S + SIC+ CVN  LN+Y   L SL++ R++ Y RLS +LV K KA  Q  W+  +NEKL +LREKL+   E+L+Q K+     S +L+ 
Subjt:  KFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFDLQL

Query:  KYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYELSA
        +Y ++ES    LE+ RV QLE  Y D I    L ++ +TSERL+KQ++V+KQ+CKLFP  +V V G+ K G    +DQICN  LP+ L+P SV P EL+A
Subjt:  KYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYELSA

Query:  SLGYMVQVLNLVVQYLAAPALHNSGFAGSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESER--KPHLSSLENRSF
        SLGYMVQ+LNLVV  L+ PALHN GFAGSCSRIW+RDSYW++ PSS SN YP+F+P   + S   ++SW+ + ++NFGV SL+S+   +     L+    
Subjt:  SLGYMVQVLNLVVQYLAAPALHNSGFAGSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESER--KPHLSSLENRSF

Query:  NYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSML
        + SSASPHS+ET ++LQ+GIA LK+SVA +T YGY SLSL+VPS  STFE FAKLLATLSS KEV+S  SL ++SS   K   + NKS WNLNS SSS L
Subjt:  NYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSML

Query:  LESAHSQIMKTNYE-INLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHP
        L S+H+Q    N    N+P+   SY+   +F D  K+ ++I  W+L+E+P
Subjt:  LESAHSQIMKTNYE-INLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHP

AT1G77890.2 DNA-directed RNA polymerase II protein; e value: 4.7e-104; % identity: 49.11
Query:  KFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFDLQL
        K   CA+C  S + SIC+ CVN  LN+Y   L SL++ R++ Y RLS +LV K KA  Q  W+  +NEKL +LREKL+   E+L+Q K+     S +L+ 
Subjt:  KFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFDLQL

Query:  KYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYELSA
        +Y ++ES    LE+ RV QLE  Y D I    L +           ++V+KQ+CKLFP  +V V G+ K G    +DQICN  LP+ L+P SV P EL+A
Subjt:  KYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYELSA

Query:  SLGYMVQVLNLVVQYLAAPALHNSGFAGSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESER--KPHLSSLENRSF
        SLGYMVQ+LNLVV  L+ PALHN GFAGSCSRIW+RDSYW++ PSS SN YP+F+P   + S   ++SW+ + ++NFGV SL+S+   +     L+    
Subjt:  SLGYMVQVLNLVVQYLAAPALHNSGFAGSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESER--KPHLSSLENRSF

Query:  NYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSML
        + SSASPHS+ET ++LQ+GIA LK+SVA +T YGY SLSL+VPS  STFE FAKLLATLSS KEV+S  SL ++SS   K   + NKS WNLNS SSS L
Subjt:  NYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSML

Query:  LESAHSQIMKTNYE-INLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHP
        L S+H+Q    N    N+P+   SY+   +F D  K+ ++I  W+L+E+P
Subjt:  LESAHSQIMKTNYE-INLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHP

AT1G77890.3 DNA-directed RNA polymerase II protein; e value: 2.8e-109; % identity: 50.67
Query:  KFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFDLQL
        K   CA+C  S + SIC+ CVN  LN+Y   L SL++ R++ Y RLS +LV K KA  Q  W+  +NEKL +LREKL+   E+L+Q K+     S +L+ 
Subjt:  KFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFDLQL

Query:  KYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYELSA
        +Y ++ES    LE+ RV QLE  Y D I    L  + +TSERL+KQ++V+KQ+CKLFP  +V V G+ K G    +DQICN  LP+ L+P SV P EL+A
Subjt:  KYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYELSA

Query:  SLGYMVQVLNLVVQYLAAPALHNSGFAGSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESER--KPHLSSLENRSF
        SLGYMVQ+LNLVV  L+ PALHN GFAGSCSRIW+RDSYW++ PSS SN YP+F+P   + S   ++SW+ + ++NFGV SL+S+   +     L+    
Subjt:  SLGYMVQVLNLVVQYLAAPALHNSGFAGSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESER--KPHLSSLENRSF

Query:  NYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSML
        + SSASPHS+ET ++LQ+GIA LK+SVA +T YGY SLSL+VPS  STFE FAKLLATLSS KEV+S  SL ++SS   K   + NKS WNLNS SSS L
Subjt:  NYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSML

Query:  LESAHSQIMKTNYE-INLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHP
        L S+H+Q    N    N+P+   SY+   +F D  K+ ++I  W+L+E+P
Subjt:  LESAHSQIMKTNYE-INLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHP

AT4G08540.1 DNA-directed RNA polymerase II protein; e value: 2.6e-171; % identity: 66.25
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFD
        M ++  NCAIC+N+N+  IC  CVNHRL +YN+ LKSL+ RRD L SR +++L +KGKADDQ NWR+ +NEK+++L++KL+ ++E + QGK +IE  S D
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFD

Query:  LQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYE
        L++KY +L+SARS LEK RVEQ+EK +P+LI T++LGHMAI+SERLHKQSVV+KQ+CKLFP R+V   GE + G    +D ICN  LP  LDPHS+   E
Subjt:  LQLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYE

Query:  LSASLGYMVQVLNLVVQYLAAPALHNSGFAGSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERK-PHLSSLENR
        L+ SLGYMVQ+LNLVV  LAAPALH+SGFAGSCSRIWQRDSYWD   S+RSNEYP+FIPR++YCSTS ENSW+DK+SSNFGVAS+ES+RK P L S  + 
Subjt:  LSASLGYMVQVLNLVVQYLAAPALHNSGFAGSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERK-PHLSSLENR

Query:  SFNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSS
        SF YSSASPHSIE+H+DLQKGIALLKKSVAC+TAY YNSL L+VP EASTFEAFAKLLATLSSSKEVRSVFSLKMASSRS K  Q+LNKS WN +S+ SS
Subjt:  SFNYSSASPHSIETHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSS

Query:  MLLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ
         LLESAH     T+Y  + P+S +SYL AT+ S    ND  + GWDL+EHP   PPPSQ+ED+EHWTRAMFIDA K+
Subjt:  MLLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDSTIEGWDLIEHPTL-PPPSQAEDIEHWTRAMFIDATKQ


Sequences
CDS sequence
ATGAATCGGAAATTCTGCAACTGCGCTATCTGTGAGAATTCAAATCAAGCTTCCATTTGCATAGGTTGCGTGAATCACAGATTGAATGACTACAACTCTACGTTAAAATC
ATTGAGAGCTCGGCGAGATATGCTGTATTCGAGGTTGAGTGACGTGCTTGTGGCAAAGGGGAAGGCCGACGATCAATTAAACTGGAGAGTGACTCGGAATGAGAAACTTA
CGAGGTTAAGGGAGAAACTCCGACGCAGCAGAGAGCAACTCGAGCAAGGAAAGTCAGAGATTGAGATGACATCCTTTGATCTCCAATTGAAATATGCAATGCTTGAATCA
GCCCGTTCAGTGTTGGAAAAACAACGAGTCGAACAACTGGAGAAGGCCTATCCTGACCTTATTAGCACCAAGAATCTGGGACATATGGCAATCACCTCTGAACGCCTTCA
CAAACAATCTGTGGTTATAAAACAACTATGCAAGTTGTTTCCACAACGTCAGGTTTTAGTTCGTGGAGAGAAGAAAGTGGGACCTGGTGAACCATTTGATCAAATCTGTA
ATGTGAGCTTACCAAGAAGACTAGATCCCCATTCTGTTGAGCCGTATGAGCTTTCAGCTTCTTTAGGATATATGGTGCAAGTTCTAAATCTTGTTGTTCAATATTTGGCT
GCACCTGCACTTCACAACTCGGGTTTTGCAGGTTCTTGTTCACGCATATGGCAAAGGGATTCATATTGGGACGCTTGTCCATCTTCTCGAAGCAATGAGTATCCAGTTTT
TATACCACGTCAAAGTTATTGTTCAACAAGTGGGGAAAATTCATGGTCTGATAAAAGCTCTAGTAACTTTGGTGTTGCTTCGTTGGAATCAGAGAGGAAACCACATTTAA
GTTCACTTGAAAATAGAAGCTTCAATTATTCCTCCGCTTCTCCACATTCTATTGAAACGCACAAGGATTTGCAGAAAGGGATTGCCCTCCTAAAGAAAAGTGTAGCATGT
GTCACTGCATACGGGTATAACTCCCTTTCTTTAGACGTTCCGTCTGAAGCTTCCACTTTTGAAGCATTTGCTAAATTATTAGCTACTCTTTCTTCATCAAAGGAAGTGCG
TTCTGTTTTTTCCCTCAAAATGGCTTCTTCCAGGTCCCCCAAGAACGTTCAGAAACTGAACAAATCTGCATGGAATCTGAATTCCATCTCATCCAGCATGCTGTTAGAGA
GTGCACATTCACAAATAATGAAAACCAATTATGAGATTAACCTTCCAAGTTCTGCTTCGAGTTATCTTTATGCCACCCAATTCTCTGATGCTGGAAAGAATGATTCCACC
ATTGAAGGATGGGACCTCATAGAGCATCCAACTTTGCCTCCTCCTTCCCAAGCTGAAGACATTGAGCATTGGACTCGAGCAATGTTCATCGATGCAACCAAACAGTAA
mRNA sequence
CTTGGAATAATTAAATTTTAAACTTTACGGCGTTCAGGAGTTCCCTTGTTAAGGAGGCTGCAATTCCAGTTTCCAGACGATCGGCTCCCGCGGTTCAAGCTTTACCCGCT
TCTGACCCGAATTCAAAATCCAAATTTCAGATTATCGTTCCCAAAATTATAGCGTAATCATCTGATCCAATTCCTCCGAGTAGACTCATGATTAGAAATTGATCGACGAT
GAATCGGAAATTCTGCAACTGCGCTATCTGTGAGAATTCAAATCAAGCTTCCATTTGCATAGGTTGCGTGAATCACAGATTGAATGACTACAACTCTACGTTAAAATCAT
TGAGAGCTCGGCGAGATATGCTGTATTCGAGGTTGAGTGACGTGCTTGTGGCAAAGGGGAAGGCCGACGATCAATTAAACTGGAGAGTGACTCGGAATGAGAAACTTACG
AGGTTAAGGGAGAAACTCCGACGCAGCAGAGAGCAACTCGAGCAAGGAAAGTCAGAGATTGAGATGACATCCTTTGATCTCCAATTGAAATATGCAATGCTTGAATCAGC
CCGTTCAGTGTTGGAAAAACAACGAGTCGAACAACTGGAGAAGGCCTATCCTGACCTTATTAGCACCAAGAATCTGGGACATATGGCAATCACCTCTGAACGCCTTCACA
AACAATCTGTGGTTATAAAACAACTATGCAAGTTGTTTCCACAACGTCAGGTTTTAGTTCGTGGAGAGAAGAAAGTGGGACCTGGTGAACCATTTGATCAAATCTGTAAT
GTGAGCTTACCAAGAAGACTAGATCCCCATTCTGTTGAGCCGTATGAGCTTTCAGCTTCTTTAGGATATATGGTGCAAGTTCTAAATCTTGTTGTTCAATATTTGGCTGC
ACCTGCACTTCACAACTCGGGTTTTGCAGGTTCTTGTTCACGCATATGGCAAAGGGATTCATATTGGGACGCTTGTCCATCTTCTCGAAGCAATGAGTATCCAGTTTTTA
TACCACGTCAAAGTTATTGTTCAACAAGTGGGGAAAATTCATGGTCTGATAAAAGCTCTAGTAACTTTGGTGTTGCTTCGTTGGAATCAGAGAGGAAACCACATTTAAGT
TCACTTGAAAATAGAAGCTTCAATTATTCCTCCGCTTCTCCACATTCTATTGAAACGCACAAGGATTTGCAGAAAGGGATTGCCCTCCTAAAGAAAAGTGTAGCATGTGT
CACTGCATACGGGTATAACTCCCTTTCTTTAGACGTTCCGTCTGAAGCTTCCACTTTTGAAGCATTTGCTAAATTATTAGCTACTCTTTCTTCATCAAAGGAAGTGCGTT
CTGTTTTTTCCCTCAAAATGGCTTCTTCCAGGTCCCCCAAGAACGTTCAGAAACTGAACAAATCTGCATGGAATCTGAATTCCATCTCATCCAGCATGCTGTTAGAGAGT
GCACATTCACAAATAATGAAAACCAATTATGAGATTAACCTTCCAAGTTCTGCTTCGAGTTATCTTTATGCCACCCAATTCTCTGATGCTGGAAAGAATGATTCCACCAT
TGAAGGATGGGACCTCATAGAGCATCCAACTTTGCCTCCTCCTTCCCAAGCTGAAGACATTGAGCATTGGACTCGAGCAATGTTCATCGATGCAACCAAACAGTAATTTA
ATGGTTAGATATTATCATGGCGGTATTCGTATTTCTGTGAAACTGTTGTCAGTGTGTTTTGCAACAACCAAAACAAAAACTCACAAGGAAAAAGTATCAAATGAGACAGA
ATAAAGTGCATCGGCCCCAAGCTCTCACTCATTAACCCCTTCGTTTTATTAATTAATTATGGGACAAATTACAAAAACTACCCTCAAAGTATGGTGGTAGTTGCAATTAT
ACCTTCAAACTTTCAAATTGTAAAAACTAAGCTTTTAAACTTATACAAACATTAGAATTGGACCCTCAAACTTATATAATTGTATAAGTTTGAGAGTTCAATTTTTATCA
TTTGGGGTCCAATTTTTACAATTATATGGTAAGTTTGAGGATTCAATTTTAACACTTCTACAAGTTTGAGGGCTGAATTTTTACATTGAAAGTTTGAGGGTGTAATTGCA
ATTATTACCATACTTCAAGGGTGTGTTTTTGCAATTTTTGCCCTGATTCTTTTTAACTCAATAGGGTCATTGTAATATTATCATTTTTAGATACTGAATGCATATTTCCC
TGTTCTCCGATTTTAATAGCATTTGGAATCGCATCAGGTAAGCGCGAGAAAGAGAAGCACTTGGAACTACTCAATGCACGACCAGCTTTCGACCTGTAAATATGGAGTCT
TCAAAAGGTGTACATCCCTGAACATATTTAGGTTTAGGATTATTTTTATTGTTCCATCATCTTATCTGGAGATTTATTAGAGTTAGGATAACTTATGGCTATTCCATCAT
CTTAGCCAATATTATTCTTGGATTTTTTTTTCTTTCTTTTTCTTTTAATGGGTGCACATAGTGGAGTTGGTTTGTGAGGATGTTTTTCAAATGTATATAGAATTTGGCCT
TTAAAATTAGCTCATTCCCTTTGATGTCA
Protein sequence
MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRARRDMLYSRLSDVLVAKGKADDQLNWRVTRNEKLTRLREKLRRSREQLEQGKSEIEMTSFDLQLKYAMLES
ARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRQVLVRGEKKVGPGEPFDQICNVSLPRRLDPHSVEPYELSASLGYMVQVLNLVVQYLA
APALHNSGFAGSCSRIWQRDSYWDACPSSRSNEYPVFIPRQSYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENRSFNYSSASPHSIETHKDLQKGIALLKKSVAC
VTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKNVQKLNKSAWNLNSISSSMLLESAHSQIMKTNYEINLPSSASSYLYATQFSDAGKNDST
IEGWDLIEHPTLPPPSQAEDIEHWTRAMFIDATKQ
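
The protein sequence above corresponds to a codon-by-codon translation of the CDS. As a rough illustration, the sketch below translates a CDS string, assuming the standard nuclear genetic code (NCBI translation table 1); the page does not say which table was used, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
# Standard genetic code (NCBI table 1), laid out in TCAG order for
# the first, second and third codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS from its first base, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X = unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 and last 12 bases of the CDS listed above.
print(translate("ATGAATCGGAAATTCTGCAACTGCGCTATC"))
print(translate("ACCAAACAGTAA"))
```

The two fragments translate to the first ten and last three residues of the protein sequence shown above; passing the full CDS (with its line breaks) should work the same way.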