CuGenDBv2

Carg22856 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg22856
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: DNA-directed RNA polymerase II protein
Genome location: Carg_Chr15:348722..356894
RNA-Seq Expression: Carg22856
Synteny: Carg22856
Gene Ontology terms:
GO:0035493 - SNARE complex assembly (biological process)
GO:0000323 - lytic vacuole (cellular component)
GO:0005768 - endosome (cellular component)
GO:0000149 - SNARE binding (molecular function)
InterPro domains: IPR018791 - UV radiation resistance protein/autophagy-related protein 14
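
The genome location field above uses a `chromosome:start..end` form. As a minimal sketch, the snippet below splits such a string and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re

def parse_location(loc):
    """Split 'chrom:start..end' into (chrom, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end = parse_location("Carg_Chr15:348722..356894")
print(chrom, span_length(start, end))
```

Under that assumption the gene span is longer than the mRNA sequence listed later on the page, which would be expected if the locus contains introns.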


Homology
GenBank top hits    e value    % identity    Alignment
KAG7015913.1 hypothetical protein SDJN02_21017 [Cucurbita argyrosperma subsp. argyrosperma]    2.1e-290    100
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYD
        MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYD
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYD

Query:  LKLKHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHE
        LKLKHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHE
Subjt:  LKLKHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENKS
        LSASLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENKS
Subjt:  LSASLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENKS

Query:  FNYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
        FNYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
Subjt:  FNYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM

Query:  LLESAHSQIMKTNYESNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATKLHMEITREREALGTTRCVICFQPQW
        LLESAHSQIMKTNYESNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATKLHMEITREREALGTTRCVICFQPQW
Subjt:  LLESAHSQIMKTNYESNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATKLHMEITREREALGTTRCVICFQPQW

Query:  HPSHSEPGTR
        HPSHSEPGTR
Subjt:  HPSHSEPGTR

XP_022939132.1 uncharacterized protein LOC111445131 [Cucurbita moschata]    7.4e-267    99.79
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYD
        MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWR+TRNEKLSRLREKLRRRREQLEQGKAEIEMTSYD
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYD

Query:  LKLKHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHE
        LKLKHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHE
Subjt:  LKLKHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENKS
        LSASLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENKS
Subjt:  LSASLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENKS

Query:  FNYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
        FNYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
Subjt:  FNYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM

Query:  LLESAHSQIMKTNYESNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATKL
        LLESAHSQIMKTNYESNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATKL
Subjt:  LLESAHSQIMKTNYESNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATKL

XP_022993038.1 uncharacterized protein LOC111489176 [Cucurbita maxima]    2.5e-262    98.11
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYD
        MNR+FCNCAICENSNQASICIGCVNHRLNDYNS LKSLR RRD LYSRLSDVLVAKGKADDQLNWR+TRNEKLS LREKLRRRREQLEQGK EIEMTSYD
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYD

Query:  LKLKHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHE
        LKLKHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHE
Subjt:  LKLKHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENKS
        LSASLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQ NEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENKS
Subjt:  LSASLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENKS

Query:  FNYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
        FNYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLA LSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
Subjt:  FNYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM

Query:  LLESAHSQIMKTNYESNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATKL
        LLESAHSQIMKTNYESNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATKL
Subjt:  LLESAHSQIMKTNYESNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATKL

XP_023549965.1 uncharacterized protein LOC111808299 [Cucurbita pepo subsp. pepo]    1.4e-265    99.37
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYD
        MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLR RRD LYSRLSDVLVAKGKADDQLNWR+TRNEKLSRLREKLRRRREQLEQGKAEIEMTSYD
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYD

Query:  LKLKHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHE
        LKLKHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHE
Subjt:  LKLKHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENKS
        LSASLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENKS
Subjt:  LSASLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENKS

Query:  FNYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
        FNYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
Subjt:  FNYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM

Query:  LLESAHSQIMKTNYESNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATKL
        LLESAHSQIMKTNYESNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATKL
Subjt:  LLESAHSQIMKTNYESNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATKL

XP_038886525.1 uncharacterized protein LOC120076698 [Benincasa hispida]    5.9e-240    90.11
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYD
        MNRKFCNCA CENSNQAS C GCVNHRLNDYNSTLKSLR RRD LYSRLSDVLVAKGKADDQLNWR+TRNEKL+RLREKLRR REQL+QGKAEIEMTS+D
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYD

Query:  LKLKHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHE
        L+LK+AMLESARSVLEKQRVEQLEKAYPDLISTK LG MAITSERLHKQSVV+KQICKLFPQRRVLVHGE K GPGE FDQICNVSLPRRLDPHSV+P+E
Subjt:  LKLKHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENKS
        LSASLGYMVQLLNL+V  LAAPALHNSGFAGSCSRIWQR SYWD CPSSQS+EYP+FIPRQ+YCSTSGENSWSDKSSSNFGVASLESE+KPHLS LE++S
Subjt:  LSASLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENKS

Query:  FNYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
        FNYSSAS HSIE+HKDLQ GIALLKKSVAC+TAY YNSL LD+PSEASTFEAFAKLLATLSSSKEVRSVF+LKMASSRSPKHVQKLNKSAWN NS+S SM
Subjt:  FNYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM

Query:  LLESAHSQIMKTNYESNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK
        L ESAHSQIMKTNYESN PSSASSYLYATEFSDA KNDSTIEGWDL+EHPTFPPPPSQAEDIEHWTRAMFIDATK
Subjt:  LLESAHSQIMKTNYESNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK
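
The % identity column can be related to the middle line of each alignment block. The sketch below assumes the usual BLAST-style midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and divides identical columns by alignment length. The function name `percent_identity` is hypothetical, and the input is a toy string, not data from this record.

```python
from typing import Optional

def percent_identity(midline: str, aligned_length: Optional[int] = None) -> float:
    """Percent identity from a BLAST-style midline.

    Letters count as identical columns; '+' and spaces do not.
    Pass aligned_length when trailing spaces may have been stripped
    from the midline, so the denominator stays correct.
    """
    length = len(midline) if aligned_length is None else aligned_length
    if length == 0:
        raise ValueError("empty alignment")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / length

# Toy midline: 10 columns, 8 identical, one '+', one mismatch.
toy_midline = "MNRK+CNC A"
print(percent_identity(toy_midline))
```

To apply this to a hit on this page, the midline segments of all blocks for that hit would be concatenated first, keeping their full column width.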

TrEMBL top hits    e value    % identity    Alignment
A0A1S3B2Y9 uncharacterized protein LOC103485428 isoform X1    1.5e-236    88.52
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYD
        MNRKFCNCAICENSNQASIC GCVN RLNDYN++LKSLR RRD LYSRLSDVLVAKGKADDQLNWR+TRNEKL+RLREKLRR REQLEQGKAEIEM S+D
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYD

Query:  LKLKHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHE
        L+LK+AMLESARSVLEKQRVEQLEKAYPDLISTK LGHMAITSERLHKQSVV+KQ+CKLFPQRRVLV G+ + GPGE FDQICNVSLPR LDPHSV+P+E
Subjt:  LKLKHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENKS
        LSASLGYMVQLLNL+V  LAAPALHNSGFAGSCSRIWQR+SYW+ACPSS+SNEYP+F+PRQ+YCSTSGENSWSDKSSSNFGVASLESERKPHLSSLEN+S
Subjt:  LSASLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENKS

Query:  FNYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
        FNYSSAS HSIE+HKDLQ GIALLKKSVAC+TAY YNSL LDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKM SSRS KH+QK  KS WNVNSISSSM
Subjt:  FNYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM

Query:  LLESAHSQIMKTNYESNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATKLHME
        L ES HSQIMKTNYESN PSSASSYLYATEFSDA KNDSTIEGWDL+EHPTFPPPPSQAEDIEHWTRAMFIDATK +M+
Subjt:  LLESAHSQIMKTNYESNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATKLHME

A0A1S3B3M2 uncharacterized protein LOC103485428 isoform X2    7.3e-236    89.05
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYD
        MNRKFCNCAICENSNQASIC GCVN RLNDYN++LKSLR RRD LYSRLSDVLVAKGKADDQLNWR+TRNEKL+RLREKLRR REQLEQGKAEIEM S+D
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYD

Query:  LKLKHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHE
        L+LK+AMLESARSVLEKQRVEQLEKAYPDLISTK LGHMAITSERLHKQSVV+KQ+CKLFPQRRVLV G+ + GPGE FDQICNVSLPR LDPHSV+P+E
Subjt:  LKLKHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENKS
        LSASLGYMVQLLNL+V  LAAPALHNSGFAGSCSRIWQR+SYW+ACPSS+SNEYP+F+PRQ+YCSTSGENSWSDKSSSNFGVASLESERKPHLSSLEN+S
Subjt:  LSASLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENKS

Query:  FNYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
        FNYSSAS HSIE+HKDLQ GIALLKKSVAC+TAY YNSL LDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKM SSRS KH+QK  KS WNVNSISSSM
Subjt:  FNYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM

Query:  LLESAHSQIMKTNYESNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK
        L ES HSQIMKTNYESN PSSASSYLYATEFSDA KNDSTIEGWDL+EHPTFPPPPSQAEDIEHWTRAMFIDATK
Subjt:  LLESAHSQIMKTNYESNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK

A0A5A7T0W2 UV radiation resistance protein/autophagy-related protein 14    7.3e-236    89.05
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYD
        MNRKFCNCAICENSNQASIC GCVN RLNDYN++LKSLR RRD LYSRLSDVLVAKGKADDQLNWR+TRNEKL+RLREKLRR REQLEQGKAEIEM S+D
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYD

Query:  LKLKHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHE
        L+LK+AMLESARSVLEKQRVEQLEKAYPDLISTK LGHMAITSERLHKQSVV+KQ+CKLFPQRRVLV G+ + GPGE FDQICNVSLPR LDPHSV+P+E
Subjt:  LKLKHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENKS
        LSASLGYMVQLLNL+V  LAAPALHNSGFAGSCSRIWQR+SYW+ACPSS+SNEYP+F+PRQ+YCSTSGENSWSDKSSSNFGVASLESERKPHLSSLEN+S
Subjt:  LSASLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENKS

Query:  FNYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
        FNYSSAS HSIE+HKDLQ GIALLKKSVAC+TAY YNSL LDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKM SSRS KH+QK  KS WNVNSISSSM
Subjt:  FNYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM

Query:  LLESAHSQIMKTNYESNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK
        L ES HSQIMKTNYESN PSSASSYLYATEFSDA KNDSTIEGWDL+EHPTFPPPPSQAEDIEHWTRAMFIDATK
Subjt:  LLESAHSQIMKTNYESNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK

A0A6J1FFY6 uncharacterized protein LOC111445131    3.6e-267    99.79
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYD
        MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWR+TRNEKLSRLREKLRRRREQLEQGKAEIEMTSYD
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYD

Query:  LKLKHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHE
        LKLKHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHE
Subjt:  LKLKHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENKS
        LSASLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENKS
Subjt:  LSASLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENKS

Query:  FNYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
        FNYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
Subjt:  FNYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM

Query:  LLESAHSQIMKTNYESNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATKL
        LLESAHSQIMKTNYESNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATKL
Subjt:  LLESAHSQIMKTNYESNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATKL

A0A6J1K104 uncharacterized protein LOC111489176    1.2e-262    98.11
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYD
        MNR+FCNCAICENSNQASICIGCVNHRLNDYNS LKSLR RRD LYSRLSDVLVAKGKADDQLNWR+TRNEKLS LREKLRRRREQLEQGK EIEMTSYD
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYD

Query:  LKLKHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHE
        LKLKHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHE
Subjt:  LKLKHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENKS
        LSASLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQ NEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENKS
Subjt:  LSASLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENKS

Query:  FNYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
        FNYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLA LSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM
Subjt:  FNYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSM

Query:  LLESAHSQIMKTNYESNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATKL
        LLESAHSQIMKTNYESNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATKL
Subjt:  LLESAHSQIMKTNYESNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATKL

SwissProt top hits    e value    % identity    Alignment
No hits found
Arabidopsis top hits    e value    % identity    Alignment
AT1G77890.1 DNA-directed RNA polymerase II protein    1.2e-110    50.89
Query:  KFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYDLKL
        K   CA+C  S + SIC+ CVN  LN+Y   L SL++ R+  Y RLS +LV K KA  Q  W+  +NEKL++LREKL+ + E+L+Q K      S +LK 
Subjt:  KFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYDLKL

Query:  KHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHELSA
        ++ ++ES    LE+ RV QLE  Y D I    L ++ +TSERL+KQ++V+KQICKLFP  RV V G+NK+G   Q+DQICN  LP+ L+P SV P EL+A
Subjt:  KHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHELSA

Query:  SLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESER--KPHLSSLENKSF
        SLGYMVQLLNL+V  L+ PALHN GFAGSCSRIW+R+SYW++ PSS SN YPLF+P  ++ S   ++SW+ + ++NFGV SL+S+   +     L+    
Subjt:  SLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESER--KPHLSSLENKSF

Query:  NYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSML
        + SSAS HS+ET ++LQ GIA LK+SVA +T Y Y SL L+VPS  STFE FAKLLATLSS KEV+S  SL ++SS   +H  + NKS WN+NS SSS L
Subjt:  NYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSML

Query:  LESAHSQIMKTNYE-SNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHP
        L S+H+Q    N    N P+   SY+   EF D +K+ ++I  W+L+E+P
Subjt:  LESAHSQIMKTNYE-SNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHP

AT1G77890.2 DNA-directed RNA polymerase II protein    6.5e-104    49.33
Query:  KFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYDLKL
        K   CA+C  S + SIC+ CVN  LN+Y   L SL++ R+  Y RLS +LV K KA  Q  W+  +NEKL++LREKL+ + E+L+Q K      S +LK 
Subjt:  KFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYDLKL

Query:  KHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHELSA
        ++ ++ES    LE+ RV QLE  Y D I    L +           ++V+KQICKLFP  RV V G+NK+G   Q+DQICN  LP+ L+P SV P EL+A
Subjt:  KHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHELSA

Query:  SLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESER--KPHLSSLENKSF
        SLGYMVQLLNL+V  L+ PALHN GFAGSCSRIW+R+SYW++ PSS SN YPLF+P  ++ S   ++SW+ + ++NFGV SL+S+   +     L+    
Subjt:  SLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESER--KPHLSSLENKSF

Query:  NYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSML
        + SSAS HS+ET ++LQ GIA LK+SVA +T Y Y SL L+VPS  STFE FAKLLATLSS KEV+S  SL ++SS   +H  + NKS WN+NS SSS L
Subjt:  NYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSML

Query:  LESAHSQIMKTNYE-SNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHP
        L S+H+Q    N    N P+   SY+   EF D +K+ ++I  W+L+E+P
Subjt:  LESAHSQIMKTNYE-SNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHP

AT1G77890.3 DNA-directed RNA polymerase II protein    4.0e-109    50.89
Query:  KFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYDLKL
        K   CA+C  S + SIC+ CVN  LN+Y   L SL++ R+  Y RLS +LV K KA  Q  W+  +NEKL++LREKL+ + E+L+Q K      S +LK 
Subjt:  KFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYDLKL

Query:  KHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHELSA
        ++ ++ES    LE+ RV QLE  Y D I    L  + +TSERL+KQ++V+KQICKLFP  RV V G+NK+G   Q+DQICN  LP+ L+P SV P EL+A
Subjt:  KHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHELSA

Query:  SLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESER--KPHLSSLENKSF
        SLGYMVQLLNL+V  L+ PALHN GFAGSCSRIW+R+SYW++ PSS SN YPLF+P  ++ S   ++SW+ + ++NFGV SL+S+   +     L+    
Subjt:  SLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESER--KPHLSSLENKSF

Query:  NYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSML
        + SSAS HS+ET ++LQ GIA LK+SVA +T Y Y SL L+VPS  STFE FAKLLATLSS KEV+S  SL ++SS   +H  + NKS WN+NS SSS L
Subjt:  NYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSML

Query:  LESAHSQIMKTNYE-SNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHP
        L S+H+Q    N    N P+   SY+   EF D +K+ ++I  W+L+E+P
Subjt:  LESAHSQIMKTNYE-SNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHP

AT4G08540.1 DNA-directed RNA polymerase II protein    1.8e-178    68.28
Query:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYD
        M ++  NCAIC+N+N+  IC  CVNHRL +YN+ LKSL+TRRDSL SR +++L +KGKADDQ NWR+ +NEK+S+L++KL+  +E + QGK +IE  S D
Subjt:  MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYD

Query:  LKLKHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHE
        LK+K+ +L+SARS LEK RVEQ+EK +P+LI T+ LGHMAI+SERLHKQSVVVKQICKLFP RRV   GE++ G   Q+D ICN  LP  LDPHS+   E
Subjt:  LKLKHAMLESARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERK-PHLSSLENK
        L+ SLGYMVQLLNL+V NLAAPALH+SGFAGSCSRIWQR+SYWD   S++SNEYPLFIPR+NYCSTS ENSW+DK+SSNFGVAS+ES+RK P L S  + 
Subjt:  LSASLGYMVQLLNLIVPNLAAPALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERK-PHLSSLENK

Query:  SFNYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSS
        SF YSSAS HSIE+H+DLQ GIALLKKSVAC+TAYCYNSL L+VP EASTFEAFAKLLATLSSSKEVRSVFSLKMASSRS K  Q+LNKS WN +S+ SS
Subjt:  SFNYSSASAHSIETHKDLQTGIALLKKSVACITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSS

Query:  MLLESAHSQIMKTNYESNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK
         LLESAH     T+Y  + P+S +SYL ATE S  + ND  + GWDL+EHP +PPPPSQ+ED+EHWTRAMFIDA K
Subjt:  MLLESAHSQIMKTNYESNHPSSASSYLYATEFSDARKNDSTIEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATK


Sequences
CDS sequence
ATGAATCGGAAATTTTGCAACTGCGCTATCTGTGAGAATTCAAATCAGGCTTCCATTTGCATCGGTTGCGTGAATCACAGATTGAATGACTACAACTCTACTTTAAAATC
ATTGAGAACTCGACGGGATTCGTTATATTCGAGGTTGAGTGACGTGCTTGTGGCAAAGGGGAAGGCAGACGATCAATTAAACTGGAGAATGACTAGGAATGAGAAACTTT
CGAGGTTAAGGGAGAAACTCCGACGCCGTAGAGAGCAACTCGAGCAAGGAAAGGCAGAGATTGAAATGACATCCTATGATCTCAAATTGAAACATGCAATGCTTGAATCG
GCCCGTTCAGTGCTGGAAAAACAACGAGTTGAACAACTCGAGAAGGCCTATCCAGACCTTATCAGCACCAAGATTCTGGGACATATGGCAATCACCTCTGAACGCCTTCA
CAAACAATCTGTGGTTGTAAAACAAATATGCAAATTGTTTCCACAACGTCGGGTTTTAGTTCATGGAGAAAATAAAGAGGGTCCCGGTGAGCAATTTGATCAAATCTGTA
ATGTGAGCTTACCAAGAAGACTGGATCCCCATTCTGTTCAGCCACATGAGCTTTCAGCTTCTTTGGGATACATGGTGCAGCTTTTAAATCTTATTGTTCCAAACTTGGCT
GCACCTGCACTTCACAACTCGGGTTTTGCAGGTTCTTGTTCACGCATATGGCAAAGGAATTCATATTGGGACGCTTGTCCATCTTCACAAAGCAATGAGTATCCACTTTT
TATACCACGTCAAAACTATTGTTCAACAAGTGGGGAAAATTCATGGTCTGATAAAAGCTCTAGTAACTTTGGTGTTGCTTCATTGGAATCAGAGAGGAAACCACATTTAA
GTTCACTAGAAAATAAGAGCTTCAATTATTCCTCCGCTTCTGCACATTCTATTGAAACACACAAGGATTTGCAGACAGGGATTGCCCTCCTCAAGAAAAGTGTAGCATGT
ATCACCGCATACTGCTATAACTCTCTTTATTTAGACGTTCCTTCTGAAGCTTCTACTTTTGAAGCATTTGCTAAATTATTGGCTACTCTTTCTTCATCCAAGGAAGTGCG
TTCTGTTTTTTCCCTCAAAATGGCTTCGTCCAGGTCCCCTAAGCACGTTCAGAAGCTGAACAAATCTGCATGGAACGTGAATTCCATTTCATCCAGCATGCTGCTAGAGA
GTGCACATTCACAAATAATGAAAACCAATTATGAGAGTAACCATCCAAGTTCTGCTTCGAGTTATCTTTATGCCACTGAATTTTCTGATGCCAGAAAGAATGATTCCACC
ATTGAAGGGTGGGACCTCATAGAACATCCAACTTTTCCTCCTCCACCTTCCCAAGCTGAAGACATTGAGCATTGGACTCGTGCAATGTTCATTGATGCCACCAAACTGCA
CATGGAGATAACGCGAGAAAGAGAAGCACTTGGAACTACCCGATGTGTGATCTGCTTTCAACCTCAATGGCATCCATCTCATTCGGAGCCAGGAACTAGATGA
mRNA sequence
CATTCCAGTTTCCAGACGATCAGTTCCCGCGGTTCAAGCTTTTCCCCTCCTGGCTCGACCGCGAAATGCAAACTTAAGATCATTGATCTGAAAATTAGAGCATAATCATC
TGATCTATTTCCTCCGAGTAGCATGATTAGAAATTGATCGACGATGAATCGGAAATTTTGCAACTGCGCTATCTGTGAGAATTCAAATCAGGCTTCCATTTGCATCGGTT
GCGTGAATCACAGATTGAATGACTACAACTCTACTTTAAAATCATTGAGAACTCGACGGGATTCGTTATATTCGAGGTTGAGTGACGTGCTTGTGGCAAAGGGGAAGGCA
GACGATCAATTAAACTGGAGAATGACTAGGAATGAGAAACTTTCGAGGTTAAGGGAGAAACTCCGACGCCGTAGAGAGCAACTCGAGCAAGGAAAGGCAGAGATTGAAAT
GACATCCTATGATCTCAAATTGAAACATGCAATGCTTGAATCGGCCCGTTCAGTGCTGGAAAAACAACGAGTTGAACAACTCGAGAAGGCCTATCCAGACCTTATCAGCA
CCAAGATTCTGGGACATATGGCAATCACCTCTGAACGCCTTCACAAACAATCTGTGGTTGTAAAACAAATATGCAAATTGTTTCCACAACGTCGGGTTTTAGTTCATGGA
GAAAATAAAGAGGGTCCCGGTGAGCAATTTGATCAAATCTGTAATGTGAGCTTACCAAGAAGACTGGATCCCCATTCTGTTCAGCCACATGAGCTTTCAGCTTCTTTGGG
ATACATGGTGCAGCTTTTAAATCTTATTGTTCCAAACTTGGCTGCACCTGCACTTCACAACTCGGGTTTTGCAGGTTCTTGTTCACGCATATGGCAAAGGAATTCATATT
GGGACGCTTGTCCATCTTCACAAAGCAATGAGTATCCACTTTTTATACCACGTCAAAACTATTGTTCAACAAGTGGGGAAAATTCATGGTCTGATAAAAGCTCTAGTAAC
TTTGGTGTTGCTTCATTGGAATCAGAGAGGAAACCACATTTAAGTTCACTAGAAAATAAGAGCTTCAATTATTCCTCCGCTTCTGCACATTCTATTGAAACACACAAGGA
TTTGCAGACAGGGATTGCCCTCCTCAAGAAAAGTGTAGCATGTATCACCGCATACTGCTATAACTCTCTTTATTTAGACGTTCCTTCTGAAGCTTCTACTTTTGAAGCAT
TTGCTAAATTATTGGCTACTCTTTCTTCATCCAAGGAAGTGCGTTCTGTTTTTTCCCTCAAAATGGCTTCGTCCAGGTCCCCTAAGCACGTTCAGAAGCTGAACAAATCT
GCATGGAACGTGAATTCCATTTCATCCAGCATGCTGCTAGAGAGTGCACATTCACAAATAATGAAAACCAATTATGAGAGTAACCATCCAAGTTCTGCTTCGAGTTATCT
TTATGCCACTGAATTTTCTGATGCCAGAAAGAATGATTCCACCATTGAAGGGTGGGACCTCATAGAACATCCAACTTTTCCTCCTCCACCTTCCCAAGCTGAAGACATTG
AGCATTGGACTCGTGCAATGTTCATTGATGCCACCAAACTGCACATGGAGATAACGCGAGAAAGAGAAGCACTTGGAACTACCCGATGTGTGATCTGCTTTCAACCTCAA
TGGCATCCATCTCATTCGGAGCCAGGAACTAGATGATGAAAAAGGAATTTGAATCACGCCAGAAAGATATGCTCCATCAGCGAGTTAAACCAAATACAAGAAAGAGCAAC
TACTGTGGGTCACCAAAAAATAGGAACAAGGATGACAAGAACGAATAAAAAAGAAATATTAAACAGTAACTCCGGAAACCGGTGCATTGCCCCTGCATGTGATTCCTTCC
TTAGTGGCCTCAATAAAGCCAGATGGGGGAGGAAAGGCACTGTTGAGCCACAAATTATATAGATATAAAACAAAATTGCAGGGTATCAAAGTATAATAACCATATTACTT
TCTTAAATTTAGGATGAACGCACTAGGATTGATTGGTTAGGTCGTCTGCAGAAGACTTCCGCATCAGTCTTGAAGTACATGTATATTATTATGCATTGCACCATGCATAC
CCAACATTTCTAGAAATGTACATTTTATTCGAAGGGTTCGACTCTATGGAAACCCACCCAAGACGATGCAGAAATTCCCCGTTTATACAGGGTTCAAGGTGGGGATTCCC
TGTATACATATAGGTAGAGTTTCTAAAGCAATTGTAATCTCCATTCTACTTTTTGGAACCTTCAATCTCCCTCGTCCCAACTCATAGCAACATTTTACTTTACAAAATTC
ATACTATTACAATAATATTCAAGTTAGAAATTAGGTTGTATATTTTTTTTTAATCGTTAGTTCATTTTTTTACCATTTTTTATGAAGACAGTTGAAAGTTGAAAACCTTA
AAATAATTATTTAGAAAAAATATTAAATGAAGAAAAAATTTAGCCGGTAAAATGATCCCACAAAAATTATTGAAAAATTTTGCAAGACAGGGAATGCAATAGGTCCCATT
TCCACCCCTCCCATAGACATCTCTAAACCTTTCTAACTATTTTTTCTTTTAGGATAAGTTTAACCGATATTTTGTCTATCTTTATTCTTTGAAAAGTTTATTTCTGAAAT
TCAAATTCTAAAAGTTCGTTGTATTTAAGAACCCAACATATATTAACATTCATTGTTTGTGAATTCAAATAATGATTTTATTATCAAGTTTAACGAAATTAAATGTAGTA
Protein sequence
MNRKFCNCAICENSNQASICIGCVNHRLNDYNSTLKSLRTRRDSLYSRLSDVLVAKGKADDQLNWRMTRNEKLSRLREKLRRRREQLEQGKAEIEMTSYDLKLKHAMLES
ARSVLEKQRVEQLEKAYPDLISTKILGHMAITSERLHKQSVVVKQICKLFPQRRVLVHGENKEGPGEQFDQICNVSLPRRLDPHSVQPHELSASLGYMVQLLNLIVPNLA
APALHNSGFAGSCSRIWQRNSYWDACPSSQSNEYPLFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLENKSFNYSSASAHSIETHKDLQTGIALLKKSVAC
ITAYCYNSLYLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSPKHVQKLNKSAWNVNSISSSMLLESAHSQIMKTNYESNHPSSASSYLYATEFSDARKNDST
IEGWDLIEHPTFPPPPSQAEDIEHWTRAMFIDATKLHMEITREREALGTTRCVICFQPQWHPSHSEPGTR