; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005764 (gene) of Snake gourd v1 genome

Gene IDTan0005764
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionserine-aspartate repeat-containing protein I-like isoform X3
Genome locationLG07:16486002..16489271
RNA-Seq ExpressionTan0005764
SyntenyTan0005764
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022947032.1 uncharacterized protein LOC111451030 isoform X2 [Cucurbita moschata]8.6e-9163.93Show/hide
Query:  MGACATKPKVDGAKASAPEPAPAPDKKVEDKDVVVDAVVAVEAEKKVEVPAEEVAGEGNQSDKGKEVAVDEDKVDDQSVKRRSLSHLFKEKEGGEPIECE
        MGACATKPKVD  KA    PAP P+K VE+KDV VD V +VEAEK           E NQSDKGKEV  D+DKVDDQSVKRRSLS LFKEKEG   + CE
Subjt:  MGACATKPKVDGAKASAPEPAPAPDKKVEDKDVVVDAVVAVEAEKKVEVPAEEVAGEGNQSDKGKEVAVDEDKVDDQSVKRRSLSHLFKEKEGGEPIECE

Query:  GPAGETETLESKETEKDTKEAETKAPQTEVETEKCIEEPETKVPQTVVETEKCIEEPETKAPQT------------------------------------
        GPAGETE LES ETEKD KE+ TK PQTEVET+KC +EPETKVPQTVVETEKCIEEPETKAPQT                                    
Subjt:  GPAGETETLESKETEKDTKEAETKAPQTEVETEKCIEEPETKVPQTVVETEKCIEEPETKAPQT------------------------------------

Query:  VVETEKHIEEAEIKVPQIVVEPGKRTEEAEPKAAQTEVETEKLSEVSTEKITITDVPTTSETIPEEKVILPSPSDVTPTSETPKEKTSEDVKLPEKVEKP
        VVE EK +EE EIKVP+ VVEP K  EE+E KA QTEVETEK SE+  EKI ITDVPTTS T+P+EKV + SPSDV P SETP EKTSE+VKLP+KVEKP
Subjt:  VVETEKHIEEAEIKVPQIVVEPGKRTEEAEPKAAQTEVETEKLSEVSTEKITITDVPTTSETIPEEKVILPSPSDVTPTSETPKEKTSEDVKLPEKVEKP

Query:  KAVTLVEATPSKDESKTSEQKKEDISDVAKTETETPKETEPKPAPTETSTEPAQKHDEVKVTAEEK
        +AVTLVEATP K ES TSEQKKEDIS++ KTE ET K         E STEPAQK++E KV +EEK
Subjt:  KAVTLVEATPSKDESKTSEQKKEDISDVAKTETETPKETEPKPAPTETSTEPAQKHDEVKVTAEEK

XP_023007404.1 serine-aspartate repeat-containing protein I-like isoform X2 [Cucurbita maxima]1.4e-8558Show/hide
Query:  MGACATKPKVDGAKASAPEPAPAPDKKVEDKDVVVDAVVAVEAEKKVEVPAEEVAGEGNQSDKGKEVAVDEDKVDDQSVKRRSLSHLFKEKEGGEPIECE
        MGACATKPKVD  K     PAP P+K VE+KDV VD V  VEAEK           E NQSDKGKEV VD+DKVDDQSVKRRSLSHLFKEKEG   + CE
Subjt:  MGACATKPKVDGAKASAPEPAPAPDKKVEDKDVVVDAVVAVEAEKKVEVPAEEVAGEGNQSDKGKEVAVDEDKVDDQSVKRRSLSHLFKEKEGGEPIECE

Query:  GPAGETETLESKETEKDTKEAETKAPQTEVETEKCIEEPETKVPQTVVETEKCIEEPETKAPQT------------------------------------
        GPAGETE LESKETEKD KE+ TK PQTEVET+KC +EPETKVPQTVVETEKC+EEPETKAPQT                                    
Subjt:  GPAGETETLESKETEKDTKEAETKAPQTEVETEKCIEEPETKVPQTVVETEKCIEEPETKAPQT------------------------------------

Query:  VVETEKHIEEAEIKVPQIVVEPGKRTEEAEPKAAQTEVETEK----------------------------------LSEVSTEKITITDVPTTSETIPEE
        VVET+K +EE EIKVP+ VV+P K  EE+E KA QTEVETEK                                   SE+  EKI ITDVPTTS T+P+E
Subjt:  VVETEKHIEEAEIKVPQIVVEPGKRTEEAEPKAAQTEVETEK----------------------------------LSEVSTEKITITDVPTTSETIPEE

Query:  KVILPSPSDVTPTSETPKEKTSEDVKLPEKVEKPKAVTLVEATPSKDESKTSEQKKEDISDVAKTETETPKETEPKPAPTETSTEPAQKHDEVKVTAEEK
        KV + SPS V P SETP EKTSE+VKLP+KVEKP+AVTLVEA P K ES TSEQKKEDIS++ KTE ET K         E STEPAQK+ E KV++EEK
Subjt:  KVILPSPSDVTPTSETPKEKTSEDVKLPEKVEKPKAVTLVEATPSKDESKTSEQKKEDISDVAKTETETPKETEPKPAPTETSTEPAQKHDEVKVTAEEK

XP_023007405.1 serine-aspartate repeat-containing protein I-like isoform X3 [Cucurbita maxima]1.2e-8963.39Show/hide
Query:  MGACATKPKVDGAKASAPEPAPAPDKKVEDKDVVVDAVVAVEAEKKVEVPAEEVAGEGNQSDKGKEVAVDEDKVDDQSVKRRSLSHLFKEKEGGEPIECE
        MGACATKPKVD  K     PAP P+K VE+KDV VD V  VEAEK           E NQSDKGKEV VD+DKVDDQSVKRRSLSHLFKEKEG   + CE
Subjt:  MGACATKPKVDGAKASAPEPAPAPDKKVEDKDVVVDAVVAVEAEKKVEVPAEEVAGEGNQSDKGKEVAVDEDKVDDQSVKRRSLSHLFKEKEGGEPIECE

Query:  GPAGETETLESKETEKDTKEAETKAPQTEVETEKCIEEPETKVPQTVVETEKCIEEPETKAPQT------------------------------------
        GPAGETE LESKETEKD KE+ TK PQTEVET+KC +EPETKVPQTVVETEKC+EEPETKAPQT                                    
Subjt:  GPAGETETLESKETEKDTKEAETKAPQTEVETEKCIEEPETKVPQTVVETEKCIEEPETKAPQT------------------------------------

Query:  VVETEKHIEEAEIKVPQIVVEPGKRTEEAEPKAAQTEVETEKLSEVSTEKITITDVPTTSETIPEEKVILPSPSDVTPTSETPKEKTSEDVKLPEKVEKP
        VVET+K +EE EIK+P+ VVEP K  EE+E KA QTEVETEK SE+  EKI ITDVPTTS T+P+EKV + SPS V P SETP EKTSE+VKLP+KVEKP
Subjt:  VVETEKHIEEAEIKVPQIVVEPGKRTEEAEPKAAQTEVETEKLSEVSTEKITITDVPTTSETIPEEKVILPSPSDVTPTSETPKEKTSEDVKLPEKVEKP

Query:  KAVTLVEATPSKDESKTSEQKKEDISDVAKTETETPKETEPKPAPTETSTEPAQKHDEVKVTAEEK
        +AVTLVEA P K ES TSEQKKEDIS++ KTE ET K         E STEPAQK+ E KV++EEK
Subjt:  KAVTLVEATPSKDESKTSEQKKEDISDVAKTETETPKETEPKPAPTETSTEPAQKHDEVKVTAEEK

XP_023007406.1 serine-aspartate repeat-containing protein I-like isoform X4 [Cucurbita maxima]1.6e-8963.39Show/hide
Query:  MGACATKPKVDGAKASAPEPAPAPDKKVEDKDVVVDAVVAVEAEKKVEVPAEEVAGEGNQSDKGKEVAVDEDKVDDQSVKRRSLSHLFKEKEGGEPIECE
        MGACATKPKVD  K     PAP P+K VE+KDV VD V  VEAEK           E NQSDKGKEV VD+DKVDDQSVKRRSLSHLFKEKEG   + CE
Subjt:  MGACATKPKVDGAKASAPEPAPAPDKKVEDKDVVVDAVVAVEAEKKVEVPAEEVAGEGNQSDKGKEVAVDEDKVDDQSVKRRSLSHLFKEKEGGEPIECE

Query:  GPAGETETLESKETEKDTKEAETKAPQTEVETEKCIEEPETKVPQTVVETEKCIEEPETKAPQT------------------------------------
        GPAGETE LESKETEKD KE+ TK PQTEVET+KC +EPETKVPQTVVETEKC+EEPETKAPQT                                    
Subjt:  GPAGETETLESKETEKDTKEAETKAPQTEVETEKCIEEPETKVPQTVVETEKCIEEPETKAPQT------------------------------------

Query:  VVETEKHIEEAEIKVPQIVVEPGKRTEEAEPKAAQTEVETEKLSEVSTEKITITDVPTTSETIPEEKVILPSPSDVTPTSETPKEKTSEDVKLPEKVEKP
        VVET+K +EE EIKVP+ VV+P K  EE+E KA QTEVETEK SE+  EKI ITDVPTTS T+P+EKV + SPS V P SETP EKTSE+VKLP+KVEKP
Subjt:  VVETEKHIEEAEIKVPQIVVEPGKRTEEAEPKAAQTEVETEKLSEVSTEKITITDVPTTSETIPEEKVILPSPSDVTPTSETPKEKTSEDVKLPEKVEKP

Query:  KAVTLVEATPSKDESKTSEQKKEDISDVAKTETETPKETEPKPAPTETSTEPAQKHDEVKVTAEEK
        +AVTLVEA P K ES TSEQKKEDIS++ KTE ET K         E STEPAQK+ E KV++EEK
Subjt:  KAVTLVEATPSKDESKTSEQKKEDISDVAKTETETPKETEPKPAPTETSTEPAQKHDEVKVTAEEK

XP_038902634.1 probable serine/threonine-protein kinase kinX [Benincasa hispida]1.1e-9368.03Show/hide
Query:  MGACATKPKVDGAKASAPEPAPAPDKKVEDKDVVVDAVVAVEAEKKVEVPA-EEVAGEGNQSDKGKEVA-VDEDKVDDQSVKRRSLSHLFKEKEGGEPIE
        MGACATKPK DG+ A APEP        E KD  VDAVVAVE + KV+VPA EEV+GEGNQSDKGKEV  VD+DKVDDQSVKRRSLS+LFKEKEG E +E
Subjt:  MGACATKPKVDGAKASAPEPAPAPDKKVEDKDVVVDAVVAVEAEKKVEVPA-EEVAGEGNQSDKGKEVA-VDEDKVDDQSVKRRSLSHLFKEKEGGEPIE

Query:  CEGPAGETETLESKETEKDTKEAETKAPQTEVETEKCIEEPETKV--------------PQTVVETEKCIEEPETKAPQTVVETEKHIEEAEIK------
        CE PAGETET+ESKETE  TKE E KAPQTEVE E C E  ETKV              PQTVVET+K  EE ETK PQTVVET++H EEAE K      
Subjt:  CEGPAGETETLESKETEKDTKEAETKAPQTEVETEKCIEEPETKV--------------PQTVVETEKCIEEPETKAPQTVVETEKHIEEAEIK------

Query:  ------------VPQIVVEPGKRTEEAEPKAAQTEVETEKLSEVSTEKITITDVPTTSETIPEEKVILPSPSDVTPTSETPKEKTSEDVKLPEKVEKPKA
                     PQIVVE  K TEE E K  QT VE EK SE+  E+I ITDVPTTSETI  EKVI PSPSDVTPTSET +EK SEDVKLPEKVEK   
Subjt:  ------------VPQIVVEPGKRTEEAEPKAAQTEVETEKLSEVSTEKITITDVPTTSETIPEEKVILPSPSDVTPTSETPKEKTSEDVKLPEKVEKPKA

Query:  VTLVEATPSKDESKTSEQKKEDISDVAKTETETPKETEPKP-APTETSTEPAQKHDE-VKVTAEEK
        VT+VEATP+KDES TSE KKEDISDV KTETETPKETEPKP  PTE+ST+PAQ++DE VKVTAEEK
Subjt:  VTLVEATPSKDESKTSEQKKEDISDVAKTETETPKETEPKP-APTETSTEPAQKHDE-VKVTAEEK

TrEMBL top hitse value%identityAlignment
A0A0A0LQ67 Zonadhesin1.2e-8565.32Show/hide
Query:  MGACATKPKVDGAKASAPEPAPAPDKKVEDKDVVVDAVVAVEAEKKVEVPAEEVAGEGNQSDKGKEVA-VDEDKVDDQSVKRRSLSHLFKEKEGGEPIEC
        MGACATKPK DGA A    PAP P+KK  D D  V  + AV+ +K VEV A EV+GEG+QSDKGKEV  VD+DKVDDQSVKRRSLS+LFKEKEG E I+ 
Subjt:  MGACATKPKVDGAKASAPEPAPAPDKKVEDKDVVVDAVVAVEAEKKVEVPAEEVAGEGNQSDKGKEVA-VDEDKVDDQSVKRRSLSHLFKEKEGGEPIEC

Query:  EGPAGETETLESKETEKDTKEAETKAPQTEVETEKCIEEPETKVPQTVVETEKCIEEPETKAPQTVVETEKHIEEAEIKVPQIVVEPGKRTEEAEPKAAQ
        E P GET      ETE  TKE + KAPQTEVETEKCIEEPE KVPQTVV  EK IEE + K PQT+ ETEKH EE+E K+PQ VVE  K+TEE E +   
Subjt:  EGPAGETETLESKETEKDTKEAETKAPQTEVETEKCIEEPETKVPQTVVETEKCIEEPETKAPQTVVETEKHIEEAEIKVPQIVVEPGKRTEEAEPKAAQ

Query:  TEVETEKL-------------SEVSTEKITITDVPTTSETIPEEKVILPSPSDVTPTSETPKEKTSEDVKLPEKVEKPKAVTLVEATPSKDESKTSEQKK
        T VET++              SE+  E+I +TDV TTSETI  EKVI PSPSDVTPTSET +EK SE+VK+PEKVEK + VTLVEATP+ DES TSE+KK
Subjt:  TEVETEKL-------------SEVSTEKITITDVPTTSETIPEEKVILPSPSDVTPTSETPKEKTSEDVKLPEKVEKPKAVTLVEATPSKDESKTSEQKK

Query:  EDISDVAKTETETPKETEPKP-APTETSTEPAQ-KHDEVKVTAEEK
        +D SDV KTETETPKETEPKP APTETS EPA+ K++ VKV+AEEK
Subjt:  EDISDVAKTETETPKETEPKP-APTETSTEPAQ-KHDEVKVTAEEK

A0A6J1G5B7 uncharacterized protein LOC111451030 isoform X24.2e-9163.93Show/hide
Query:  MGACATKPKVDGAKASAPEPAPAPDKKVEDKDVVVDAVVAVEAEKKVEVPAEEVAGEGNQSDKGKEVAVDEDKVDDQSVKRRSLSHLFKEKEGGEPIECE
        MGACATKPKVD  KA    PAP P+K VE+KDV VD V +VEAEK           E NQSDKGKEV  D+DKVDDQSVKRRSLS LFKEKEG   + CE
Subjt:  MGACATKPKVDGAKASAPEPAPAPDKKVEDKDVVVDAVVAVEAEKKVEVPAEEVAGEGNQSDKGKEVAVDEDKVDDQSVKRRSLSHLFKEKEGGEPIECE

Query:  GPAGETETLESKETEKDTKEAETKAPQTEVETEKCIEEPETKVPQTVVETEKCIEEPETKAPQT------------------------------------
        GPAGETE LES ETEKD KE+ TK PQTEVET+KC +EPETKVPQTVVETEKCIEEPETKAPQT                                    
Subjt:  GPAGETETLESKETEKDTKEAETKAPQTEVETEKCIEEPETKVPQTVVETEKCIEEPETKAPQT------------------------------------

Query:  VVETEKHIEEAEIKVPQIVVEPGKRTEEAEPKAAQTEVETEKLSEVSTEKITITDVPTTSETIPEEKVILPSPSDVTPTSETPKEKTSEDVKLPEKVEKP
        VVE EK +EE EIKVP+ VVEP K  EE+E KA QTEVETEK SE+  EKI ITDVPTTS T+P+EKV + SPSDV P SETP EKTSE+VKLP+KVEKP
Subjt:  VVETEKHIEEAEIKVPQIVVEPGKRTEEAEPKAAQTEVETEKLSEVSTEKITITDVPTTSETIPEEKVILPSPSDVTPTSETPKEKTSEDVKLPEKVEKP

Query:  KAVTLVEATPSKDESKTSEQKKEDISDVAKTETETPKETEPKPAPTETSTEPAQKHDEVKVTAEEK
        +AVTLVEATP K ES TSEQKKEDIS++ KTE ET K         E STEPAQK++E KV +EEK
Subjt:  KAVTLVEATPSKDESKTSEQKKEDISDVAKTETETPKETEPKPAPTETSTEPAQKHDEVKVTAEEK

A0A6J1L0F6 serine-aspartate repeat-containing protein I-like isoform X47.8e-9063.39Show/hide
Query:  MGACATKPKVDGAKASAPEPAPAPDKKVEDKDVVVDAVVAVEAEKKVEVPAEEVAGEGNQSDKGKEVAVDEDKVDDQSVKRRSLSHLFKEKEGGEPIECE
        MGACATKPKVD  K     PAP P+K VE+KDV VD V  VEAEK           E NQSDKGKEV VD+DKVDDQSVKRRSLSHLFKEKEG   + CE
Subjt:  MGACATKPKVDGAKASAPEPAPAPDKKVEDKDVVVDAVVAVEAEKKVEVPAEEVAGEGNQSDKGKEVAVDEDKVDDQSVKRRSLSHLFKEKEGGEPIECE

Query:  GPAGETETLESKETEKDTKEAETKAPQTEVETEKCIEEPETKVPQTVVETEKCIEEPETKAPQT------------------------------------
        GPAGETE LESKETEKD KE+ TK PQTEVET+KC +EPETKVPQTVVETEKC+EEPETKAPQT                                    
Subjt:  GPAGETETLESKETEKDTKEAETKAPQTEVETEKCIEEPETKVPQTVVETEKCIEEPETKAPQT------------------------------------

Query:  VVETEKHIEEAEIKVPQIVVEPGKRTEEAEPKAAQTEVETEKLSEVSTEKITITDVPTTSETIPEEKVILPSPSDVTPTSETPKEKTSEDVKLPEKVEKP
        VVET+K +EE EIKVP+ VV+P K  EE+E KA QTEVETEK SE+  EKI ITDVPTTS T+P+EKV + SPS V P SETP EKTSE+VKLP+KVEKP
Subjt:  VVETEKHIEEAEIKVPQIVVEPGKRTEEAEPKAAQTEVETEKLSEVSTEKITITDVPTTSETIPEEKVILPSPSDVTPTSETPKEKTSEDVKLPEKVEKP

Query:  KAVTLVEATPSKDESKTSEQKKEDISDVAKTETETPKETEPKPAPTETSTEPAQKHDEVKVTAEEK
        +AVTLVEA P K ES TSEQKKEDIS++ KTE ET K         E STEPAQK+ E KV++EEK
Subjt:  KAVTLVEATPSKDESKTSEQKKEDISDVAKTETETPKETEPKPAPTETSTEPAQKHDEVKVTAEEK

A0A6J1L2V7 serine-aspartate repeat-containing protein I-like isoform X36.0e-9063.39Show/hide
Query:  MGACATKPKVDGAKASAPEPAPAPDKKVEDKDVVVDAVVAVEAEKKVEVPAEEVAGEGNQSDKGKEVAVDEDKVDDQSVKRRSLSHLFKEKEGGEPIECE
        MGACATKPKVD  K     PAP P+K VE+KDV VD V  VEAEK           E NQSDKGKEV VD+DKVDDQSVKRRSLSHLFKEKEG   + CE
Subjt:  MGACATKPKVDGAKASAPEPAPAPDKKVEDKDVVVDAVVAVEAEKKVEVPAEEVAGEGNQSDKGKEVAVDEDKVDDQSVKRRSLSHLFKEKEGGEPIECE

Query:  GPAGETETLESKETEKDTKEAETKAPQTEVETEKCIEEPETKVPQTVVETEKCIEEPETKAPQT------------------------------------
        GPAGETE LESKETEKD KE+ TK PQTEVET+KC +EPETKVPQTVVETEKC+EEPETKAPQT                                    
Subjt:  GPAGETETLESKETEKDTKEAETKAPQTEVETEKCIEEPETKVPQTVVETEKCIEEPETKAPQT------------------------------------

Query:  VVETEKHIEEAEIKVPQIVVEPGKRTEEAEPKAAQTEVETEKLSEVSTEKITITDVPTTSETIPEEKVILPSPSDVTPTSETPKEKTSEDVKLPEKVEKP
        VVET+K +EE EIK+P+ VVEP K  EE+E KA QTEVETEK SE+  EKI ITDVPTTS T+P+EKV + SPS V P SETP EKTSE+VKLP+KVEKP
Subjt:  VVETEKHIEEAEIKVPQIVVEPGKRTEEAEPKAAQTEVETEKLSEVSTEKITITDVPTTSETIPEEKVILPSPSDVTPTSETPKEKTSEDVKLPEKVEKP

Query:  KAVTLVEATPSKDESKTSEQKKEDISDVAKTETETPKETEPKPAPTETSTEPAQKHDEVKVTAEEK
        +AVTLVEA P K ES TSEQKKEDIS++ KTE ET K         E STEPAQK+ E KV++EEK
Subjt:  KAVTLVEATPSKDESKTSEQKKEDISDVAKTETETPKETEPKPAPTETSTEPAQKHDEVKVTAEEK

A0A6J1L4V0 serine-aspartate repeat-containing protein I-like isoform X26.9e-8658Show/hide
Query:  MGACATKPKVDGAKASAPEPAPAPDKKVEDKDVVVDAVVAVEAEKKVEVPAEEVAGEGNQSDKGKEVAVDEDKVDDQSVKRRSLSHLFKEKEGGEPIECE
        MGACATKPKVD  K     PAP P+K VE+KDV VD V  VEAEK           E NQSDKGKEV VD+DKVDDQSVKRRSLSHLFKEKEG   + CE
Subjt:  MGACATKPKVDGAKASAPEPAPAPDKKVEDKDVVVDAVVAVEAEKKVEVPAEEVAGEGNQSDKGKEVAVDEDKVDDQSVKRRSLSHLFKEKEGGEPIECE

Query:  GPAGETETLESKETEKDTKEAETKAPQTEVETEKCIEEPETKVPQTVVETEKCIEEPETKAPQT------------------------------------
        GPAGETE LESKETEKD KE+ TK PQTEVET+KC +EPETKVPQTVVETEKC+EEPETKAPQT                                    
Subjt:  GPAGETETLESKETEKDTKEAETKAPQTEVETEKCIEEPETKVPQTVVETEKCIEEPETKAPQT------------------------------------

Query:  VVETEKHIEEAEIKVPQIVVEPGKRTEEAEPKAAQTEVETEK----------------------------------LSEVSTEKITITDVPTTSETIPEE
        VVET+K +EE EIKVP+ VV+P K  EE+E KA QTEVETEK                                   SE+  EKI ITDVPTTS T+P+E
Subjt:  VVETEKHIEEAEIKVPQIVVEPGKRTEEAEPKAAQTEVETEK----------------------------------LSEVSTEKITITDVPTTSETIPEE

Query:  KVILPSPSDVTPTSETPKEKTSEDVKLPEKVEKPKAVTLVEATPSKDESKTSEQKKEDISDVAKTETETPKETEPKPAPTETSTEPAQKHDEVKVTAEEK
        KV + SPS V P SETP EKTSE+VKLP+KVEKP+AVTLVEA P K ES TSEQKKEDIS++ KTE ET K         E STEPAQK+ E KV++EEK
Subjt:  KVILPSPSDVTPTSETPKEKTSEDVKLPEKVEKPKAVTLVEATPSKDESKTSEQKKEDISDVAKTETETPKETEPKPAPTETSTEPAQKHDEVKVTAEEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGCTTGTGCGACTAAGCCCAAGGTCGACGGCGCCAAGGCTTCGGCTCCGGAACCGGCGCCAGCGCCGGATAAGAAGGTGGAGGATAAGGATGTTGTTGTTGATGC
TGTAGTCGCTGTTGAAGCTGAGAAGAAAGTTGAGGTTCCGGCGGAGGAAGTCGCCGGAGAGGGGAACCAGAGCGATAAAGGCAAGGAAGTTGCTGTTGACGAGGATAAGG
TGGATGATCAGAGTGTTAAACGCCGCTCTCTTAGCCACTTGTTTAAGGAGAAAGAAGGGGGTGAACCAATTGAATGTGAGGGGCCAGCAGGGGAAACAGAGACACTGGAG
TCTAAAGAGACAGAAAAAGATACAAAAGAAGCCGAGACAAAGGCGCCTCAAACTGAAGTAGAAACCGAAAAATGTATAGAAGAACCCGAGACAAAGGTGCCTCAAACCGT
AGTTGAAACCGAAAAATGTATAGAAGAACCCGAGACAAAGGCGCCTCAAACTGTAGTTGAAACCGAAAAGCATATAGAAGAAGCTGAGATAAAGGTTCCTCAAATTGTAG
TAGAGCCTGGAAAACGAACAGAAGAAGCTGAACCAAAGGCAGCTCAAACCGAAGTAGAGACTGAAAAATTGTCTGAAGTTTCAACAGAAAAGATAACAATCACTGATGTT
CCTACAACTTCTGAGACCATTCCTGAGGAGAAAGTAATTTTGCCTTCACCATCTGATGTTACGCCAACAAGCGAAACACCCAAGGAAAAGACATCAGAGGATGTAAAATT
GCCCGAAAAAGTCGAGAAACCTAAAGCAGTGACATTAGTTGAAGCAACACCATCAAAAGATGAGAGTAAAACATCTGAACAGAAGAAAGAAGATATCAGCGATGTCGCGA
AGACCGAGACGGAGACACCAAAAGAAACTGAACCGAAGCCCGCTCCAACTGAAACGAGTACTGAACCAGCACAGAAACACGACGAAGTAAAGGTAACTGCTGAAGAAAAA
TAA
mRNA sequenceShow/hide mRNA sequence
ACCATTTCTTCTCCTCTCTCTCTCTCTGTTTCGCCGTAGAAAAATTGTCGGAACAACGAACACTCAAACAGTTTCATTCCCTATAAAATCCTAAATTCTATCGCAAATTT
CTCTCTATCTCTCTTTGTCAAACTCTTGTTGCAAAGAGTAAAGGAAAGACGTAATCCAATCAACAGGAGGAGAAATGGGAGCTTGTGCGACTAAGCCCAAGGTCGACGGC
GCCAAGGCTTCGGCTCCGGAACCGGCGCCAGCGCCGGATAAGAAGGTGGAGGATAAGGATGTTGTTGTTGATGCTGTAGTCGCTGTTGAAGCTGAGAAGAAAGTTGAGGT
TCCGGCGGAGGAAGTCGCCGGAGAGGGGAACCAGAGCGATAAAGGCAAGGAAGTTGCTGTTGACGAGGATAAGGTGGATGATCAGAGTGTTAAACGCCGCTCTCTTAGCC
ACTTGTTTAAGGAGAAAGAAGGGGGTGAACCAATTGAATGTGAGGGGCCAGCAGGGGAAACAGAGACACTGGAGTCTAAAGAGACAGAAAAAGATACAAAAGAAGCCGAG
ACAAAGGCGCCTCAAACTGAAGTAGAAACCGAAAAATGTATAGAAGAACCCGAGACAAAGGTGCCTCAAACCGTAGTTGAAACCGAAAAATGTATAGAAGAACCCGAGAC
AAAGGCGCCTCAAACTGTAGTTGAAACCGAAAAGCATATAGAAGAAGCTGAGATAAAGGTTCCTCAAATTGTAGTAGAGCCTGGAAAACGAACAGAAGAAGCTGAACCAA
AGGCAGCTCAAACCGAAGTAGAGACTGAAAAATTGTCTGAAGTTTCAACAGAAAAGATAACAATCACTGATGTTCCTACAACTTCTGAGACCATTCCTGAGGAGAAAGTA
ATTTTGCCTTCACCATCTGATGTTACGCCAACAAGCGAAACACCCAAGGAAAAGACATCAGAGGATGTAAAATTGCCCGAAAAAGTCGAGAAACCTAAAGCAGTGACATT
AGTTGAAGCAACACCATCAAAAGATGAGAGTAAAACATCTGAACAGAAGAAAGAAGATATCAGCGATGTCGCGAAGACCGAGACGGAGACACCAAAAGAAACTGAACCGA
AGCCCGCTCCAACTGAAACGAGTACTGAACCAGCACAGAAACACGACGAAGTAAAGGTAACTGCTGAAGAAAAATAATAAGTTGATGAAAAGTGAGGCATTTAAGGTGAA
GTATAGAAAGTAAAAAGGAAATGCTTTGTTCTTGGCTGTTTAAATTTATTCTGTGGGAGAAACTTTATCCAAAGCAGTGGCAGCAATCTGCCATGATGAAAGATTAGATT
TGTTATGTGGTTTACTAAGGCAACTTCTTAGAGAGGCTTTGTTTCTATTGTTTTGTTGTATATCATTTGTTGGTTTTTCATTGAGTTGCACATTCTGTTTCCTTTGCTTT
GCATGTAGACATATGAGTTTGTCAGATTAATATGAACATAGAATGTCCTTTTTTAAATGTGAAATTTACAATTCTTCATCATTTTGATTC
Protein sequenceShow/hide protein sequence
MGACATKPKVDGAKASAPEPAPAPDKKVEDKDVVVDAVVAVEAEKKVEVPAEEVAGEGNQSDKGKEVAVDEDKVDDQSVKRRSLSHLFKEKEGGEPIECEGPAGETETLE
SKETEKDTKEAETKAPQTEVETEKCIEEPETKVPQTVVETEKCIEEPETKAPQTVVETEKHIEEAEIKVPQIVVEPGKRTEEAEPKAAQTEVETEKLSEVSTEKITITDV
PTTSETIPEEKVILPSPSDVTPTSETPKEKTSEDVKLPEKVEKPKAVTLVEATPSKDESKTSEQKKEDISDVAKTETETPKETEPKPAPTETSTEPAQKHDEVKVTAEEK