CuGenDBv2

Tan0017040 (gene) of Snake gourd v1 genome

Gene ID: Tan0017040
Organism: Trichosanthes anguina (Snake gourd v1)
Description: DNA-directed RNA polymerase II protein
Genome location: LG11:940337..945676
RNA-Seq Expression: Tan0017040
Synteny: Tan0017040
Gene Ontology terms:
  GO:0035493 - SNARE complex assembly (biological process)
  GO:0000323 - lytic vacuole (cellular component)
  GO:0005768 - endosome (cellular component)
  GO:0000149 - SNARE binding (molecular function)
InterPro domains: IPR018791 - UV radiation resistance protein/autophagy-related protein 14


Homology
GenBank top hits (hit | e-value | % identity | alignment)
KAG7033396.1 hypothetical protein SDJN02_07452, partial [Cucurbita argyrosperma subsp. argyrosperma] | e-value: 2.3e-238 | % identity: 89.62
Query:  MNRKFCNCAICENSNQASICTVCVNHRLNDYNSSLKSLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLMRLREKLRRSREQLKQGKAEIEMTSYD
        MNRKFCNCAICENS+QASIC VCVNHRLNDYNS+LK LKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKL RLREKLRRSRE+L+QGK EIE+TSYD
Subjt:  MNRKFCNCAICENSNQASICTVCVNHRLNDYNSSLKSLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLMRLREKLRRSREQLKQGKAEIEMTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQLCKLFPQRRVLVYGENKEGPGEQFDQICYVSLPRRLDPHSVQPHE
        LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQS+VVKQ+CKLFPQRRVLV+GENKEGP EQFDQICYVSLPR LDPHSV+PHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQLCKLFPQRRVLVYGENKEGPGEQFDQICYVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRVWQRDSYWNAGPSSRSNEYPVFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESNS
        LS SLGYMVQLLNL+VQYLAAPALH+SGFAGSCSR+WQRDSYW A PSSRSNEYP+FIPRQ Y STSGENSWSDK SS+FGVASLESERK HL SLES+S
Subjt:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRVWQRDSYWNAGPSSRSNEYPVFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESNS

Query:  FNYSSPSSHSIEMHKNLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEMRSVFSLKMASSRSPKHVQKLNKSAWNVDSISSSM
        F+Y S   HSIE HKNLQKGIALLKKSVACVT+YCYNSLSLDVPSEASTFEAFAKLLATLSSSKE+RSVFSLKMASSR PKHVQKLNKSAW V+SISS+M
Subjt:  FNYSSPSSHSIEMHKNLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEMRSVFSLKMASSRSPKHVQKLNKSAWNVDSISSSM

Query:  LLESAHSQIMKTICESNLPSYLYATEFSDAGKNDCTLDGWDLIEHPTFPPPPSQSEDIEHWTRAMITDATKQ
        L ESAHSQIMKT CESNLPSYLYATEFSDAGKNDCT++ WD IEHPT PPPPSQSE+IEHWTRAMITDATKQ
Subjt:  LLESAHSQIMKTICESNLPSYLYATEFSDAGKNDCTLDGWDLIEHPTFPPPPSQSEDIEHWTRAMITDATKQ

XP_022962750.1 uncharacterized protein LOC111463144 isoform X1 [Cucurbita moschata] | e-value: 1.8e-238 | % identity: 89.62
Query:  MNRKFCNCAICENSNQASICTVCVNHRLNDYNSSLKSLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLMRLREKLRRSREQLKQGKAEIEMTSYD
        MNRKFCNCAICENS+QASIC VCVNHRLNDYNS+LK LKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKL RLREKLRRSRE+L+QGK EIE+TSYD
Subjt:  MNRKFCNCAICENSNQASICTVCVNHRLNDYNSSLKSLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLMRLREKLRRSREQLKQGKAEIEMTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQLCKLFPQRRVLVYGENKEGPGEQFDQICYVSLPRRLDPHSVQPHE
        LKLKY MLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQS+VVKQ+CKLFPQRRVLV+GENKEGP EQFDQICYVSLPR LDPHSV+PHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQLCKLFPQRRVLVYGENKEGPGEQFDQICYVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRVWQRDSYWNAGPSSRSNEYPVFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESNS
        LS SLGYMVQLLNL+VQYLAAPALH+SGFAGSCSR+WQRDSYW A PSSRSNEYP+FIPRQ Y STSGENSWSDK SS+FGVASLESERK HLSSLES+S
Subjt:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRVWQRDSYWNAGPSSRSNEYPVFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESNS

Query:  FNYSSPSSHSIEMHKNLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEMRSVFSLKMASSRSPKHVQKLNKSAWNVDSISSSM
        F+Y S   HSIE HKNLQKGIALLKKSVACVT+YCYNSLSLDVPSEASTFEAFAKLLATLSSSKE+RSVFSLKMASSR PKHVQKLNKSAW V+SISS+M
Subjt:  FNYSSPSSHSIEMHKNLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEMRSVFSLKMASSRSPKHVQKLNKSAWNVDSISSSM

Query:  LLESAHSQIMKTICESNLPSYLYATEFSDAGKNDCTLDGWDLIEHPTFPPPPSQSEDIEHWTRAMITDATKQ
        L ESAHSQIMKT CESNLPSYLYATEFSDAGKNDCT++ WD IEHPT PPPPSQSE+IEHWTRAMITDATKQ
Subjt:  LLESAHSQIMKTICESNLPSYLYATEFSDAGKNDCTLDGWDLIEHPTFPPPPSQSEDIEHWTRAMITDATKQ

XP_022990571.1 uncharacterized protein LOC111487411 isoform X1 [Cucurbita maxima] | e-value: 3.5e-239 | % identity: 90.04
Query:  MNRKFCNCAICENSNQASICTVCVNHRLNDYNSSLKSLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLMRLREKLRRSREQLKQGKAEIEMTSYD
        MNRKFCNCAICENS+QASICTVCVNHRLNDYNS+LK LKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKL RLREKLRRSRE+L+QGK EIE+TSYD
Subjt:  MNRKFCNCAICENSNQASICTVCVNHRLNDYNSSLKSLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLMRLREKLRRSREQLKQGKAEIEMTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQLCKLFPQRRVLVYGENKEGPGEQFDQICYVSLPRRLDPHSVQPHE
        LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQS+VVKQ+CKLFPQR+VLV+GENKEGP EQFDQICYVSLPR LDPHSV+PHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQLCKLFPQRRVLVYGENKEGPGEQFDQICYVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRVWQRDSYWNAGPSSRSNEYPVFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESNS
        LS SLGYMVQLLNL+VQYLAAPALH+SGFAGSCSR+WQRDSYW A PSSRSNEYP+FIPRQ Y STSGENSWSDK SS+FGVASLESERK HLSSLES+S
Subjt:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRVWQRDSYWNAGPSSRSNEYPVFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESNS

Query:  FNYSSPSSHSIEMHKNLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEMRSVFSLKMASSRSPKHVQKLNKSAWNVDSISSSM
        F+Y S   HSIE HKNLQKGIALLKKSVACVT+YCYNSLS DVPSEASTFEAFAKLLATLSSSKE+RSVFSLKMASSR PKHVQKLNKSAW V+SISSSM
Subjt:  FNYSSPSSHSIEMHKNLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEMRSVFSLKMASSRSPKHVQKLNKSAWNVDSISSSM

Query:  LLESAHSQIMKTICESNLPSYLYATEFSDAGKNDCTLDGWDLIEHPTFPPPPSQSEDIEHWTRAMITDATKQ
        L ESAHSQIMKT CESNLPSYLYATEFSDAGKNDCT++ WDLIEHPT PPPPSQSE+IEHWTRAMITDATKQ
Subjt:  LLESAHSQIMKTICESNLPSYLYATEFSDAGKNDCTLDGWDLIEHPTFPPPPSQSEDIEHWTRAMITDATKQ

XP_023518763.1 uncharacterized protein LOC111782150 isoform X1 [Cucurbita pepo subsp. pepo] | e-value: 7.1e-240 | % identity: 90.25
Query:  MNRKFCNCAICENSNQASICTVCVNHRLNDYNSSLKSLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLMRLREKLRRSREQLKQGKAEIEMTSYD
        MNRKFCNCAICENS+QASICTVCVNHRLNDYNS+LK LKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKL RLREKLRRSRE+L+QGK EIE+TSYD
Subjt:  MNRKFCNCAICENSNQASICTVCVNHRLNDYNSSLKSLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLMRLREKLRRSREQLKQGKAEIEMTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQLCKLFPQRRVLVYGENKEGPGEQFDQICYVSLPRRLDPHSVQPHE
        LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQS+VVKQ+CKLFPQRRVLV+GENKEGP EQFDQICYVSLPR LDPHSV+PHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQLCKLFPQRRVLVYGENKEGPGEQFDQICYVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRVWQRDSYWNAGPSSRSNEYPVFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESNS
        LS SLGYMVQLLNL+VQYLAAPALH+SGFAGSCSR+WQRDSYW A PSSRSNEYP+FIPRQ Y STSGENSWSDK SS+FGVASLESERK HLSSLES+S
Subjt:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRVWQRDSYWNAGPSSRSNEYPVFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESNS

Query:  FNYSSPSSHSIEMHKNLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEMRSVFSLKMASSRSPKHVQKLNKSAWNVDSISSSM
        F+Y S   HSIE HKNLQKGIALLKKSVACVT+YCYNSLSLDVPSEASTFEAFAKLLATLSSSKE+RSVFSLKMASSR PKHVQKLNKSAW V+SISSSM
Subjt:  FNYSSPSSHSIEMHKNLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEMRSVFSLKMASSRSPKHVQKLNKSAWNVDSISSSM

Query:  LLESAHSQIMKTICESNLPSYLYATEFSDAGKNDCTLDGWDLIEHPTFPPPPSQSEDIEHWTRAMITDATKQ
        L ESAHSQIMKT CESNLPSYLYATEFSDAGKNDCT++ WD IEHPT PPPPSQSE+IEHWTRAMITDATKQ
Subjt:  LLESAHSQIMKTICESNLPSYLYATEFSDAGKNDCTLDGWDLIEHPTFPPPPSQSEDIEHWTRAMITDATKQ

XP_023549965.1 uncharacterized protein LOC111808299 [Cucurbita pepo subsp. pepo] | e-value: 5.1e-238 | % identity: 90.53
Query:  MNRKFCNCAICENSNQASICTVCVNHRLNDYNSSLKSLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLMRLREKLRRSREQLKQGKAEIEMTSYD
        MNRKFCNCAICENSNQASIC  CVNHRLNDYNS+LKSL+ARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKL RLREKLRR REQL+QGKAEIEMTSYD
Subjt:  MNRKFCNCAICENSNQASICTVCVNHRLNDYNSSLKSLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLMRLREKLRRSREQLKQGKAEIEMTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQLCKLFPQRRVLVYGENKEGPGEQFDQICYVSLPRRLDPHSVQPHE
        LKLK+AMLESARSVLEKQRVEQLEKAYPDLISTK LGHMAITSERLHKQSVVVKQ+CKLFPQRRVLV+GENKEGPGEQFDQIC VSLPRRLDPHSVQPHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQLCKLFPQRRVLVYGENKEGPGEQFDQICYVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRVWQRDSYWNAGPSSRSNEYPVFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESNS
        LSASLGYMVQLLNLIV  LAAPALHNSGFAGSCSR+WQR+SYW+A PSS+SNEYP+FIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLE+ S
Subjt:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRVWQRDSYWNAGPSSRSNEYPVFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESNS

Query:  FNYSSPSSHSIEMHKNLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEMRSVFSLKMASSRSPKHVQKLNKSAWNVDSISSSM
        FNYSS S+HSIE HK+LQ GIALLKKSVAC+TAYCYNSL LDVPSEASTFEAFAKLLATLSSSKE+RSVFSLKMASSRSPKHVQKLNKSAWNV+SISSSM
Subjt:  FNYSSPSSHSIEMHKNLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEMRSVFSLKMASSRSPKHVQKLNKSAWNVDSISSSM

Query:  LLESAHSQIMKTICESNLP----SYLYATEFSDAGKNDCTLDGWDLIEHPTFPPPPSQSEDIEHWTRAMITDATK
        LLESAHSQIMKT  ESN P    SYLYATEFSDA KND T++GWDLIEHPTFPPPPSQ+EDIEHWTRAM  DATK
Subjt:  LLESAHSQIMKTICESNLP----SYLYATEFSDAGKNDCTLDGWDLIEHPTFPPPPSQSEDIEHWTRAMITDATK

TrEMBL top hits (hit | e-value | % identity | alignment)
A0A5A7T0W2 UV radiation resistance protein/autophagy-related protein 14 | e-value: 2.6e-232 | % identity: 88.03
Query:  MNRKFCNCAICENSNQASICTVCVNHRLNDYNSSLKSLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLMRLREKLRRSREQLKQGKAEIEMTSYD
        MNRKFCNCAICENSNQASICT CVN RLNDYN+SLKSL+ARRD+LYSRLSDVLVAKGKADDQLNWRVTRNEKL RLREKLRRSREQL+QGKAEIEM S+D
Subjt:  MNRKFCNCAICENSNQASICTVCVNHRLNDYNSSLKSLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLMRLREKLRRSREQLKQGKAEIEMTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQLCKLFPQRRVLVYGENKEGPGEQFDQICYVSLPRRLDPHSVQPHE
        L+LKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVV+KQLCKLFPQRRVLV G+ + GPGE FDQIC VSLPR LDPHSV+P+E
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQLCKLFPQRRVLVYGENKEGPGEQFDQICYVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRVWQRDSYWNAGPSSRSNEYPVFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESNS
        LSASLGYMVQLLNL+VQYLAAPALHNSGFAGSCSR+WQRDSYWNA PSSRSNEYPVF+PRQ+YCSTSGENSWSDKSSSNFGVASLESERKPHLSSLE+ S
Subjt:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRVWQRDSYWNAGPSSRSNEYPVFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESNS

Query:  FNYSSPSSHSIEMHKNLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEMRSVFSLKMASSRSPKHVQKLNKSAWNVDSISSSM
        FNYSS S HSIE HK+LQKGIALLKKSVACVTAY YNSLSLDVPSEASTFEAFAKLLATLSSSKE+RSVFSLKM SSRS KH+QK  KS WNV+SISSSM
Subjt:  FNYSSPSSHSIEMHKNLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEMRSVFSLKMASSRSPKHVQKLNKSAWNVDSISSSM

Query:  LLESAHSQIMKTICESNLP----SYLYATEFSDAGKNDCTLDGWDLIEHPTFPPPPSQSEDIEHWTRAMITDATKQ
        L ES HSQIMKT  ESNLP    SYLYATEFSDAGKND T++GWDL+EHPTFPPPPSQ+EDIEHWTRAM  DATK+
Subjt:  LLESAHSQIMKTICESNLP----SYLYATEFSDAGKNDCTLDGWDLIEHPTFPPPPSQSEDIEHWTRAMITDATKQ

A0A6J1FFY6 uncharacterized protein LOC111445131 | e-value: 3.6e-237 | % identity: 90.11
Query:  MNRKFCNCAICENSNQASICTVCVNHRLNDYNSSLKSLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLMRLREKLRRSREQLKQGKAEIEMTSYD
        MNRKFCNCAICENSNQASIC  CVNHRLNDYNS+LKSL+ RRD LYSRLSDVLVAKGKADDQLNWRVTRNEKL RLREKLRR REQL+QGKAEIEMTSYD
Subjt:  MNRKFCNCAICENSNQASICTVCVNHRLNDYNSSLKSLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLMRLREKLRRSREQLKQGKAEIEMTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQLCKLFPQRRVLVYGENKEGPGEQFDQICYVSLPRRLDPHSVQPHE
        LKLK+AMLESARSVLEKQRVEQLEKAYPDLISTK LGHMAITSERLHKQSVVVKQ+CKLFPQRRVLV+GENKEGPGEQFDQIC VSLPRRLDPHSVQPHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQLCKLFPQRRVLVYGENKEGPGEQFDQICYVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRVWQRDSYWNAGPSSRSNEYPVFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESNS
        LSASLGYMVQLLNLIV  LAAPALHNSGFAGSCSR+WQR+SYW+A PSS+SNEYP+FIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLE+ S
Subjt:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRVWQRDSYWNAGPSSRSNEYPVFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESNS

Query:  FNYSSPSSHSIEMHKNLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEMRSVFSLKMASSRSPKHVQKLNKSAWNVDSISSSM
        FNYSS S+HSIE HK+LQ GIALLKKSVAC+TAYCYNSL LDVPSEASTFEAFAKLLATLSSSKE+RSVFSLKMASSRSPKHVQKLNKSAWNV+SISSSM
Subjt:  FNYSSPSSHSIEMHKNLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEMRSVFSLKMASSRSPKHVQKLNKSAWNVDSISSSM

Query:  LLESAHSQIMKTICESNLP----SYLYATEFSDAGKNDCTLDGWDLIEHPTFPPPPSQSEDIEHWTRAMITDATK
        LLESAHSQIMKT  ESN P    SYLYATEFSDA KND T++GWDLIEHPTFPPPPSQ+EDIEHWTRAM  DATK
Subjt:  LLESAHSQIMKTICESNLP----SYLYATEFSDAGKNDCTLDGWDLIEHPTFPPPPSQSEDIEHWTRAMITDATK

A0A6J1HFZ3 uncharacterized protein LOC111463144 isoform X1 | e-value: 8.5e-239 | % identity: 89.62
Query:  MNRKFCNCAICENSNQASICTVCVNHRLNDYNSSLKSLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLMRLREKLRRSREQLKQGKAEIEMTSYD
        MNRKFCNCAICENS+QASIC VCVNHRLNDYNS+LK LKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKL RLREKLRRSRE+L+QGK EIE+TSYD
Subjt:  MNRKFCNCAICENSNQASICTVCVNHRLNDYNSSLKSLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLMRLREKLRRSREQLKQGKAEIEMTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQLCKLFPQRRVLVYGENKEGPGEQFDQICYVSLPRRLDPHSVQPHE
        LKLKY MLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQS+VVKQ+CKLFPQRRVLV+GENKEGP EQFDQICYVSLPR LDPHSV+PHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQLCKLFPQRRVLVYGENKEGPGEQFDQICYVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRVWQRDSYWNAGPSSRSNEYPVFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESNS
        LS SLGYMVQLLNL+VQYLAAPALH+SGFAGSCSR+WQRDSYW A PSSRSNEYP+FIPRQ Y STSGENSWSDK SS+FGVASLESERK HLSSLES+S
Subjt:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRVWQRDSYWNAGPSSRSNEYPVFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESNS

Query:  FNYSSPSSHSIEMHKNLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEMRSVFSLKMASSRSPKHVQKLNKSAWNVDSISSSM
        F+Y S   HSIE HKNLQKGIALLKKSVACVT+YCYNSLSLDVPSEASTFEAFAKLLATLSSSKE+RSVFSLKMASSR PKHVQKLNKSAW V+SISS+M
Subjt:  FNYSSPSSHSIEMHKNLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEMRSVFSLKMASSRSPKHVQKLNKSAWNVDSISSSM

Query:  LLESAHSQIMKTICESNLPSYLYATEFSDAGKNDCTLDGWDLIEHPTFPPPPSQSEDIEHWTRAMITDATKQ
        L ESAHSQIMKT CESNLPSYLYATEFSDAGKNDCT++ WD IEHPT PPPPSQSE+IEHWTRAMITDATKQ
Subjt:  LLESAHSQIMKTICESNLPSYLYATEFSDAGKNDCTLDGWDLIEHPTFPPPPSQSEDIEHWTRAMITDATKQ

A0A6J1JNB3 uncharacterized protein LOC111487411 isoform X1 | e-value: 1.7e-239 | % identity: 90.04
Query:  MNRKFCNCAICENSNQASICTVCVNHRLNDYNSSLKSLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLMRLREKLRRSREQLKQGKAEIEMTSYD
        MNRKFCNCAICENS+QASICTVCVNHRLNDYNS+LK LKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKL RLREKLRRSRE+L+QGK EIE+TSYD
Subjt:  MNRKFCNCAICENSNQASICTVCVNHRLNDYNSSLKSLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLMRLREKLRRSREQLKQGKAEIEMTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQLCKLFPQRRVLVYGENKEGPGEQFDQICYVSLPRRLDPHSVQPHE
        LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQS+VVKQ+CKLFPQR+VLV+GENKEGP EQFDQICYVSLPR LDPHSV+PHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQLCKLFPQRRVLVYGENKEGPGEQFDQICYVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRVWQRDSYWNAGPSSRSNEYPVFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESNS
        LS SLGYMVQLLNL+VQYLAAPALH+SGFAGSCSR+WQRDSYW A PSSRSNEYP+FIPRQ Y STSGENSWSDK SS+FGVASLESERK HLSSLES+S
Subjt:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRVWQRDSYWNAGPSSRSNEYPVFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESNS

Query:  FNYSSPSSHSIEMHKNLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEMRSVFSLKMASSRSPKHVQKLNKSAWNVDSISSSM
        F+Y S   HSIE HKNLQKGIALLKKSVACVT+YCYNSLS DVPSEASTFEAFAKLLATLSSSKE+RSVFSLKMASSR PKHVQKLNKSAW V+SISSSM
Subjt:  FNYSSPSSHSIEMHKNLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEMRSVFSLKMASSRSPKHVQKLNKSAWNVDSISSSM

Query:  LLESAHSQIMKTICESNLPSYLYATEFSDAGKNDCTLDGWDLIEHPTFPPPPSQSEDIEHWTRAMITDATKQ
        L ESAHSQIMKT CESNLPSYLYATEFSDAGKNDCT++ WDLIEHPT PPPPSQSE+IEHWTRAMITDATKQ
Subjt:  LLESAHSQIMKTICESNLPSYLYATEFSDAGKNDCTLDGWDLIEHPTFPPPPSQSEDIEHWTRAMITDATKQ

A0A6J1K104 uncharacterized protein LOC111489176 | e-value: 1.1e-235 | % identity: 89.47
Query:  MNRKFCNCAICENSNQASICTVCVNHRLNDYNSSLKSLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLMRLREKLRRSREQLKQGKAEIEMTSYD
        MNR+FCNCAICENSNQASIC  CVNHRLNDYNS+LKSL+ARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKL  LREKLRR REQL+QGK EIEMTSYD
Subjt:  MNRKFCNCAICENSNQASICTVCVNHRLNDYNSSLKSLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLMRLREKLRRSREQLKQGKAEIEMTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQLCKLFPQRRVLVYGENKEGPGEQFDQICYVSLPRRLDPHSVQPHE
        LKLK+AMLESARSVLEKQRVEQLEKAYPDLISTK LGHMAITSERLHKQSVVVKQ+CKLFPQRRVLV+GENKEGPGEQFDQIC VSLPRRLDPHSVQPHE
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQLCKLFPQRRVLVYGENKEGPGEQFDQICYVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRVWQRDSYWNAGPSSRSNEYPVFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESNS
        LSASLGYMVQLLNLIV  LAAPALHNSGFAGSCSR+WQR+SYW+A PSS+ NEYP+FIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLE+ S
Subjt:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRVWQRDSYWNAGPSSRSNEYPVFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESNS

Query:  FNYSSPSSHSIEMHKNLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEMRSVFSLKMASSRSPKHVQKLNKSAWNVDSISSSM
        FNYSS S+HSIE HK+LQ GIALLKKSVAC+TAYCYNSL LDVPSEASTFEAFAKLLA LSSSKE+RSVFSLKMASSRSPKHVQKLNKSAWNV+SISSSM
Subjt:  FNYSSPSSHSIEMHKNLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEMRSVFSLKMASSRSPKHVQKLNKSAWNVDSISSSM

Query:  LLESAHSQIMKTICESNLP----SYLYATEFSDAGKNDCTLDGWDLIEHPTFPPPPSQSEDIEHWTRAMITDATK
        LLESAHSQIMKT  ESN P    SYLYATEFSDA KND T++GWDLIEHPTFPPPPSQ+EDIEHWTRAM  DATK
Subjt:  LLESAHSQIMKTICESNLP----SYLYATEFSDAGKNDCTLDGWDLIEHPTFPPPPSQSEDIEHWTRAMITDATK

SwissProt top hits (hit | e-value | % identity | alignment)
No hits found

Arabidopsis top hits (hit | e-value | % identity | alignment)
AT1G77890.1 DNA-directed RNA polymerase II protein | e-value: 1.4e-108 | % identity: 50.22
Query:  KFCNCAICENSNQASICTVCVNHRLNDYNSSLKSLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLMRLREKLRRSREQLKQGKAEIEMTSYDLKL
        K   CA+C  S + SIC  CVN  LN+Y   L SLK+ R++ Y RLS +LV K KA  Q  W+  +NEKL +LREKL+   E+L+Q K      S +LK 
Subjt:  KFCNCAICENSNQASICTVCVNHRLNDYNSSLKSLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLMRLREKLRRSREQLKQGKAEIEMTSYDLKL

Query:  KYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQLCKLFPQRRVLVYGENKEGPGEQFDQICYVSLPRRLDPHSVQPHELSA
        +Y ++ES    LE+ RV QLE  Y D I    L ++ +TSERL+KQ++V+KQ+CKLFP  RV V G+NK+G   Q+DQIC   LP+ L+P SV P EL+A
Subjt:  KYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQLCKLFPQRRVLVYGENKEGPGEQFDQICYVSLPRRLDPHSVQPHELSA

Query:  SLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRVWQRDSYWNAGPSSRSNEYPVFIPRQNYCSTSGENSWSDKSSSNFGVASLESER--KPHLSSLESNSF
        SLGYMVQLLNL+V  L+ PALHN GFAGSCSR+W+RDSYWN+ PSS SN YP+F+P  ++ S   ++SW+ + ++NFGV SL+S+   +     L+ +  
Subjt:  SLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRVWQRDSYWNAGPSSRSNEYPVFIPRQNYCSTSGENSWSDKSSSNFGVASLESER--KPHLSSLESNSF

Query:  NYSSPSSHSIEMHKNLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEMRSVFSLKMASSRSPKHVQKLNKSAWNVDSISSSML
        + SS S HS+E  +NLQ+GIA LK+SVA +T Y Y SLSL+VPS  STFE FAKLLATLSS KE++S  SL ++SS   +H  + NKS WN++S SSS L
Subjt:  NYSSPSSHSIEMHKNLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEMRSVFSLKMASSRSPKHVQKLNKSAWNVDSISSSML

Query:  LESAHSQIMKTICES---NLPSY--LYATEFSDAGKNDCTLDGWDLIEHP
        L S+H+Q   T   S   N+P+    Y  EF D  K+  ++  W+L+E+P
Subjt:  LESAHSQIMKTICES---NLPSY--LYATEFSDAGKNDCTLDGWDLIEHP

AT1G77890.2 DNA-directed RNA polymerase II protein | e-value: 7.4e-102 | % identity: 48.67
Query:  KFCNCAICENSNQASICTVCVNHRLNDYNSSLKSLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLMRLREKLRRSREQLKQGKAEIEMTSYDLKL
        K   CA+C  S + SIC  CVN  LN+Y   L SLK+ R++ Y RLS +LV K KA  Q  W+  +NEKL +LREKL+   E+L+Q K      S +LK 
Subjt:  KFCNCAICENSNQASICTVCVNHRLNDYNSSLKSLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLMRLREKLRRSREQLKQGKAEIEMTSYDLKL

Query:  KYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQLCKLFPQRRVLVYGENKEGPGEQFDQICYVSLPRRLDPHSVQPHELSA
        +Y ++ES    LE+ RV QLE  Y D I    L +           ++V+KQ+CKLFP  RV V G+NK+G   Q+DQIC   LP+ L+P SV P EL+A
Subjt:  KYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQLCKLFPQRRVLVYGENKEGPGEQFDQICYVSLPRRLDPHSVQPHELSA

Query:  SLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRVWQRDSYWNAGPSSRSNEYPVFIPRQNYCSTSGENSWSDKSSSNFGVASLESER--KPHLSSLESNSF
        SLGYMVQLLNL+V  L+ PALHN GFAGSCSR+W+RDSYWN+ PSS SN YP+F+P  ++ S   ++SW+ + ++NFGV SL+S+   +     L+ +  
Subjt:  SLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRVWQRDSYWNAGPSSRSNEYPVFIPRQNYCSTSGENSWSDKSSSNFGVASLESER--KPHLSSLESNSF

Query:  NYSSPSSHSIEMHKNLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEMRSVFSLKMASSRSPKHVQKLNKSAWNVDSISSSML
        + SS S HS+E  +NLQ+GIA LK+SVA +T Y Y SLSL+VPS  STFE FAKLLATLSS KE++S  SL ++SS   +H  + NKS WN++S SSS L
Subjt:  NYSSPSSHSIEMHKNLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEMRSVFSLKMASSRSPKHVQKLNKSAWNVDSISSSML

Query:  LESAHSQIMKTICES---NLPSY--LYATEFSDAGKNDCTLDGWDLIEHP
        L S+H+Q   T   S   N+P+    Y  EF D  K+  ++  W+L+E+P
Subjt:  LESAHSQIMKTICES---NLPSY--LYATEFSDAGKNDCTLDGWDLIEHP

AT1G77890.3 DNA-directed RNA polymerase II protein | e-value: 4.5e-107 | % identity: 50.22
Query:  KFCNCAICENSNQASICTVCVNHRLNDYNSSLKSLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLMRLREKLRRSREQLKQGKAEIEMTSYDLKL
        K   CA+C  S + SIC  CVN  LN+Y   L SLK+ R++ Y RLS +LV K KA  Q  W+  +NEKL +LREKL+   E+L+Q K      S +LK 
Subjt:  KFCNCAICENSNQASICTVCVNHRLNDYNSSLKSLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLMRLREKLRRSREQLKQGKAEIEMTSYDLKL

Query:  KYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQLCKLFPQRRVLVYGENKEGPGEQFDQICYVSLPRRLDPHSVQPHELSA
        +Y ++ES    LE+ RV QLE  Y D I    L  + +TSERL+KQ++V+KQ+CKLFP  RV V G+NK+G   Q+DQIC   LP+ L+P SV P EL+A
Subjt:  KYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQLCKLFPQRRVLVYGENKEGPGEQFDQICYVSLPRRLDPHSVQPHELSA

Query:  SLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRVWQRDSYWNAGPSSRSNEYPVFIPRQNYCSTSGENSWSDKSSSNFGVASLESER--KPHLSSLESNSF
        SLGYMVQLLNL+V  L+ PALHN GFAGSCSR+W+RDSYWN+ PSS SN YP+F+P  ++ S   ++SW+ + ++NFGV SL+S+   +     L+ +  
Subjt:  SLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRVWQRDSYWNAGPSSRSNEYPVFIPRQNYCSTSGENSWSDKSSSNFGVASLESER--KPHLSSLESNSF

Query:  NYSSPSSHSIEMHKNLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEMRSVFSLKMASSRSPKHVQKLNKSAWNVDSISSSML
        + SS S HS+E  +NLQ+GIA LK+SVA +T Y Y SLSL+VPS  STFE FAKLLATLSS KE++S  SL ++SS   +H  + NKS WN++S SSS L
Subjt:  NYSSPSSHSIEMHKNLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEMRSVFSLKMASSRSPKHVQKLNKSAWNVDSISSSML

Query:  LESAHSQIMKTICES---NLPSY--LYATEFSDAGKNDCTLDGWDLIEHP
        L S+H+Q   T   S   N+P+    Y  EF D  K+  ++  W+L+E+P
Subjt:  LESAHSQIMKTICES---NLPSY--LYATEFSDAGKNDCTLDGWDLIEHP

AT4G08540.1 DNA-directed RNA polymerase II protein | e-value: 1.6e-173 | % identity: 66.95
Query:  MNRKFCNCAICENSNQASICTVCVNHRLNDYNSSLKSLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLMRLREKLRRSREQLKQGKAEIEMTSYD
        M ++  NCAIC+N+N+  ICT CVNHRL +YN+ LKSLK RRD L SR +++L +KGKADDQ NWR+ +NEK+ +L++KL+ ++E + QGK +IE  S D
Subjt:  MNRKFCNCAICENSNQASICTVCVNHRLNDYNSSLKSLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLMRLREKLRRSREQLKQGKAEIEMTSYD

Query:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQLCKLFPQRRVLVYGENKEGPGEQFDQICYVSLPRRLDPHSVQPHE
        LK+KY +L+SARS LEK RVEQ+EK +P+LI T++LGHMAI+SERLHKQSVVVKQ+CKLFP RRV   GE++ G   Q+D IC   LP  LDPHS+   E
Subjt:  LKLKYAMLESARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQLCKLFPQRRVLVYGENKEGPGEQFDQICYVSLPRRLDPHSVQPHE

Query:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRVWQRDSYWNAGPSSRSNEYPVFIPRQNYCSTSGENSWSDKSSSNFGVASLESERK-PHLSSLESN
        L+ SLGYMVQLLNL+V  LAAPALH+SGFAGSCSR+WQRDSYW+   S+RSNEYP+FIPR+NYCSTS ENSW+DK+SSNFGVAS+ES+RK P L S  SN
Subjt:  LSASLGYMVQLLNLIVQYLAAPALHNSGFAGSCSRVWQRDSYWNAGPSSRSNEYPVFIPRQNYCSTSGENSWSDKSSSNFGVASLESERK-PHLSSLESN

Query:  SFNYSSPSSHSIEMHKNLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEMRSVFSLKMASSRSPKHVQKLNKSAWNVDSISSS
        SF YSS S HSIE H++LQKGIALLKKSVAC+TAYCYNSL L+VP EASTFEAFAKLLATLSSSKE+RSVFSLKMASSRS K  Q+LNKS WN  S+ SS
Subjt:  SFNYSSPSSHSIEMHKNLQKGIALLKKSVACVTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEMRSVFSLKMASSRSPKHVQKLNKSAWNVDSISSS

Query:  MLLESAHSQIMKTICES-NLP-SYLYATEFSDAGKNDCTLDGWDLIEHPTFPPPPSQSEDIEHWTRAMITDATKQ
         LLESAH     +  +  N P SYL ATE S    ND  ++GWDL+EHP +PPPPSQSED+EHWTRAM  DA K+
Subjt:  MLLESAHSQIMKTICES-NLP-SYLYATEFSDAGKNDCTLDGWDLIEHPTFPPPPSQSEDIEHWTRAMITDATKQ


Sequences
CDS sequence
ATGAATCGGAAATTCTGCAACTGCGCTATCTGTGAGAATTCAAATCAAGCTTCCATTTGCACCGTTTGCGTCAATCACAGATTGAATGATTACAACTCTTCGTTAAAATC
ATTGAAAGCTCGGCGGGATTTGTTATATTCGAGGTTGAGTGACGTGCTTGTGGCAAAGGGGAAGGCAGACGATCAATTAAATTGGAGAGTAACTCGGAATGAGAAACTTA
TGAGGTTAAGGGAGAAACTCCGGCGCAGTAGAGAGCAACTCAAGCAAGGAAAGGCAGAGATTGAGATGACATCCTACGATCTCAAATTGAAATATGCGATGCTTGAATCA
GCCCGTTCAGTGTTGGAAAAACAACGAGTTGAACAACTTGAGAAGGCCTATCCTGACCTTATTAGCACCAAGAATCTGGGACATATGGCAATTACCTCTGAGCGCCTTCA
CAAACAATCCGTGGTTGTTAAACAACTGTGCAAATTGTTTCCTCAACGTCGGGTTTTAGTTTATGGAGAGAATAAAGAGGGACCTGGTGAGCAATTTGATCAAATCTGTT
ATGTGAGCTTACCAAGAAGACTGGATCCCCATTCTGTTCAGCCACATGAGCTTTCAGCTTCGTTGGGATACATGGTGCAACTTCTAAATCTTATTGTTCAATATTTGGCT
GCTCCTGCCCTTCACAACTCGGGTTTTGCAGGTTCTTGTTCACGCGTATGGCAAAGGGATTCATATTGGAACGCTGGTCCATCTTCTCGGAGCAATGAGTATCCAGTTTT
TATACCACGTCAAAACTATTGTTCAACAAGTGGGGAAAATTCATGGTCTGATAAAAGCTCTAGCAACTTTGGTGTTGCTTCGTTGGAATCAGAGAGGAAACCACATTTAA
GTTCACTAGAAAGTAATAGCTTCAATTATTCGTCCCCTTCTTCACATTCTATTGAAATGCACAAGAATTTGCAGAAAGGGATTGCCCTCCTCAAGAAAAGTGTAGCATGT
GTCACCGCATACTGCTATAACTCTCTCTCTTTAGATGTTCCTTCTGAAGCTTCTACTTTTGAAGCATTCGCTAAATTATTGGCTACTCTTTCTTCATCCAAGGAAATGCG
TTCTGTTTTTTCCCTCAAAATGGCTTCTTCCAGGTCCCCTAAGCATGTTCAGAAACTGAACAAATCTGCATGGAATGTGGATTCGATTTCATCCAGCATGCTGCTCGAGA
GTGCACATTCACAAATAATGAAAACCATTTGTGAGAGTAACCTTCCAAGTTATCTTTATGCCACTGAATTTTCTGATGCCGGAAAGAATGATTGCACCCTTGACGGATGG
GACCTCATAGAGCATCCAACTTTTCCCCCTCCACCATCTCAATCTGAAGATATTGAGCATTGGACTCGAGCAATGATCACCGATGCCACCAAACAGTAA
mRNA sequence
ATGAATCGGAAATTCTGCAACTGCGCTATCTGTGAGAATTCAAATCAAGCTTCCATTTGCACCGTTTGCGTCAATCACAGATTGAATGATTACAACTCTTCGTTAAAATC
ATTGAAAGCTCGGCGGGATTTGTTATATTCGAGGTTGAGTGACGTGCTTGTGGCAAAGGGGAAGGCAGACGATCAATTAAATTGGAGAGTAACTCGGAATGAGAAACTTA
TGAGGTTAAGGGAGAAACTCCGGCGCAGTAGAGAGCAACTCAAGCAAGGAAAGGCAGAGATTGAGATGACATCCTACGATCTCAAATTGAAATATGCGATGCTTGAATCA
GCCCGTTCAGTGTTGGAAAAACAACGAGTTGAACAACTTGAGAAGGCCTATCCTGACCTTATTAGCACCAAGAATCTGGGACATATGGCAATTACCTCTGAGCGCCTTCA
CAAACAATCCGTGGTTGTTAAACAACTGTGCAAATTGTTTCCTCAACGTCGGGTTTTAGTTTATGGAGAGAATAAAGAGGGACCTGGTGAGCAATTTGATCAAATCTGTT
ATGTGAGCTTACCAAGAAGACTGGATCCCCATTCTGTTCAGCCACATGAGCTTTCAGCTTCGTTGGGATACATGGTGCAACTTCTAAATCTTATTGTTCAATATTTGGCT
GCTCCTGCCCTTCACAACTCGGGTTTTGCAGGTTCTTGTTCACGCGTATGGCAAAGGGATTCATATTGGAACGCTGGTCCATCTTCTCGGAGCAATGAGTATCCAGTTTT
TATACCACGTCAAAACTATTGTTCAACAAGTGGGGAAAATTCATGGTCTGATAAAAGCTCTAGCAACTTTGGTGTTGCTTCGTTGGAATCAGAGAGGAAACCACATTTAA
GTTCACTAGAAAGTAATAGCTTCAATTATTCGTCCCCTTCTTCACATTCTATTGAAATGCACAAGAATTTGCAGAAAGGGATTGCCCTCCTCAAGAAAAGTGTAGCATGT
GTCACCGCATACTGCTATAACTCTCTCTCTTTAGATGTTCCTTCTGAAGCTTCTACTTTTGAAGCATTCGCTAAATTATTGGCTACTCTTTCTTCATCCAAGGAAATGCG
TTCTGTTTTTTCCCTCAAAATGGCTTCTTCCAGGTCCCCTAAGCATGTTCAGAAACTGAACAAATCTGCATGGAATGTGGATTCGATTTCATCCAGCATGCTGCTCGAGA
GTGCACATTCACAAATAATGAAAACCATTTGTGAGAGTAACCTTCCAAGTTATCTTTATGCCACTGAATTTTCTGATGCCGGAAAGAATGATTGCACCCTTGACGGATGG
GACCTCATAGAGCATCCAACTTTTCCCCCTCCACCATCTCAATCTGAAGATATTGAGCATTGGACTCGAGCAATGATCACCGATGCCACCAAACAGTAA
Protein sequence
MNRKFCNCAICENSNQASICTVCVNHRLNDYNSSLKSLKARRDLLYSRLSDVLVAKGKADDQLNWRVTRNEKLMRLREKLRRSREQLKQGKAEIEMTSYDLKLKYAMLES
ARSVLEKQRVEQLEKAYPDLISTKNLGHMAITSERLHKQSVVVKQLCKLFPQRRVLVYGENKEGPGEQFDQICYVSLPRRLDPHSVQPHELSASLGYMVQLLNLIVQYLA
APALHNSGFAGSCSRVWQRDSYWNAGPSSRSNEYPVFIPRQNYCSTSGENSWSDKSSSNFGVASLESERKPHLSSLESNSFNYSSPSSHSIEMHKNLQKGIALLKKSVAC
VTAYCYNSLSLDVPSEASTFEAFAKLLATLSSSKEMRSVFSLKMASSRSPKHVQKLNKSAWNVDSISSSMLLESAHSQIMKTICESNLPSYLYATEFSDAGKNDCTLDGW
DLIEHPTFPPPPSQSEDIEHWTRAMITDATKQ