; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0018042 (gene) of Chayote v1 genome

Gene IDSed0018042
OrganismSechium edule (Chayote v1)
DescriptionTranslation initiation factor IF-2 like
Genome locationLG09:38479408..38484139
RNA-Seq ExpressionSed0018042
SyntenySed0018042
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0058721.1 uncharacterized protein E6C27_scaffold339G001910 [Cucumis melo var. makuwa]2.8e-6365.95Show/hide
Query:  MSDEEWVNVALSDDSLVVDVLLRLN----SPLPRLRWSLRSPRS---AVKNSDAAAARASPTTPLTWTSSGGAG-----------KSKNADESEVVATMK
        MSD EWV+VALSDDSLVVD+LLRLN     PLP L WS+R PRS     +++  +AARASPTTPLTW+SSGG G           +SK A +SEVVATMK
Subjt:  MSDEEWVNVALSDDSLVVDVLLRLN----SPLPRLRWSLRSPRS---AVKNSDAAAARASPTTPLTWTSSGGAG-----------KSKNADESEVVATMK

Query:  RPRKKKTLGELKEEEVLLLKEKKGLKDALATLRLTFEKQRSINGSLKKMKLDFQSQQAIDMVVTSPVRKEAISDDCQP-KMPARSICNTTPVDDDASHQL
        RPRKKKTLGELKEEEVLLLKE++ LKDALATLRL+ EKQR++NGSLKK+KLD +SQQAI+MVVTS V  EA S+  Q  + P+RSIC+TTP+  DAS+QL
Subjt:  RPRKKKTLGELKEEEVLLLKEKKGLKDALATLRLTFEKQRSINGSLKKMKLDFQSQQAIDMVVTSPVRKEAISDDCQP-KMPARSICNTTPVDDDASHQL

Query:  PLTNISCKLQE---MVTVHVLPDLNLPFQEDS
         + N+SCKLQE   + TV +LPDLNLPFQEDS
Subjt:  PLTNISCKLQE---MVTVHVLPDLNLPFQEDS

KAG6593372.1 hypothetical protein SDJN03_12848, partial [Cucurbita argyrosperma subsp. sororia]5.7e-6465.84Show/hide
Query:  MSDEEWVNVALSDDSLVVDVLLRLN--------SPLPRLRWSLRSPR-----------SAVKNSDAAAARASPTTPLTWTSSGGAG----------KSKN
        MSD++WV+VALSDDSLVVD+LLRLN        SP  RL WS+R PR           SA KNSD  AARASPTTPLTW+S GGA           +SKN
Subjt:  MSDEEWVNVALSDDSLVVDVLLRLN--------SPLPRLRWSLRSPR-----------SAVKNSDAAAARASPTTPLTWTSSGGAG----------KSKN

Query:  ADESEVVATMKRPRKKKTLGELKEEEVLLLKEKKGLKDALATLRLTFEKQRSINGSLKKMKLDFQSQQAIDMVVTSPVRKEAISDDCQPKMPARSICNTT
        A +SEVVATMKRPRKKKTLGELKEEEVLLLKE++ LKDALA LRLT EKQRSINGSLKKMK+DF SQQAI+M VTS + KEA SD  Q +    SICNT 
Subjt:  ADESEVVATMKRPRKKKTLGELKEEEVLLLKEKKGLKDALATLRLTFEKQRSINGSLKKMKLDFQSQQAIDMVVTSPVRKEAISDDCQPKMPARSICNTT

Query:  PVDDDA-SHQLPLTNISCKLQEM---VTVHVLPDLNLPFQEDS
        PV  DA S+QLPL N+SCKLQEM    TV  +PDLN+PFQEDS
Subjt:  PVDDDA-SHQLPLTNISCKLQEM---VTVHVLPDLNLPFQEDS

XP_008461167.1 PREDICTED: uncharacterized protein LOC103499830 [Cucumis melo]2.8e-6365.95Show/hide
Query:  MSDEEWVNVALSDDSLVVDVLLRLN----SPLPRLRWSLRSPRS---AVKNSDAAAARASPTTPLTWTSSGGAG-----------KSKNADESEVVATMK
        MSD EWV+VALSDDSLVVD+LLRLN     PLP L WS+R PRS     +++  +AARASPTTPLTW+SSGG G           +SK A +SEVVATMK
Subjt:  MSDEEWVNVALSDDSLVVDVLLRLN----SPLPRLRWSLRSPRS---AVKNSDAAAARASPTTPLTWTSSGGAG-----------KSKNADESEVVATMK

Query:  RPRKKKTLGELKEEEVLLLKEKKGLKDALATLRLTFEKQRSINGSLKKMKLDFQSQQAIDMVVTSPVRKEAISDDCQP-KMPARSICNTTPVDDDASHQL
        RPRKKKTLGELKEEEVLLLKE++ LKDALATLRL+ EKQR++NGSLKK+KLD +SQQAI+MVVTS V  EA S+  Q  + P+RSIC+TTP+  DAS+QL
Subjt:  RPRKKKTLGELKEEEVLLLKEKKGLKDALATLRLTFEKQRSINGSLKKMKLDFQSQQAIDMVVTSPVRKEAISDDCQP-KMPARSICNTTPVDDDASHQL

Query:  PLTNISCKLQE---MVTVHVLPDLNLPFQEDS
         + N+SCKLQE   + TV +LPDLNLPFQEDS
Subjt:  PLTNISCKLQE---MVTVHVLPDLNLPFQEDS

XP_022959761.1 uncharacterized protein LOC111460736 isoform X1 [Cucurbita moschata]3.4e-6466.38Show/hide
Query:  MSDEEWVNVALSDDSLVVDVLLRLN--------SPLPRLRWSLRSPRS---AVKNSDAAAARASPTTPLTWTSSGGAG----------KSKNADESEVVA
        MSD++WV+VALSDDSLVVD+LLRLN        SP  RL WS+R PRS     + +   AARASPTTPLTW+S GGA           +SKNA +SEVVA
Subjt:  MSDEEWVNVALSDDSLVVDVLLRLN--------SPLPRLRWSLRSPRS---AVKNSDAAAARASPTTPLTWTSSGGAG----------KSKNADESEVVA

Query:  TMKRPRKKKTLGELKEEEVLLLKEKKGLKDALATLRLTFEKQRSINGSLKKMKLDFQSQQAIDMVVTSPVRKEAISDDCQPKMPARSICNTTPVDDDA-S
        TMKRPRKKKTLGELKEEEVLLLKE++ LKDALA LRLT EKQRSINGSLKKMK+DF SQQAI+M VTS + KEA SD  QP+    SICNT PV  DA S
Subjt:  TMKRPRKKKTLGELKEEEVLLLKEKKGLKDALATLRLTFEKQRSINGSLKKMKLDFQSQQAIDMVVTSPVRKEAISDDCQPKMPARSICNTTPVDDDA-S

Query:  HQLPLTNISCKLQEM---VTVHVLPDLNLPFQEDS
        +QLPL N+SCKLQEM    TV  +PDLN+PFQEDS
Subjt:  HQLPLTNISCKLQEM---VTVHVLPDLNLPFQEDS

XP_038896159.1 uncharacterized protein LOC120084450 isoform X1 [Benincasa hispida]1.5e-6465.98Show/hide
Query:  MSDEEWVNVALSDDSLVVDVLLRLN----SPLPRLRWSLRSPR-----------SAVKNSDAAAARASPTTPLTWTSSG-----GAGKSKNADESEVVAT
        MSDEEWV+VALSDDSLVVD+LLRLN     PLP L WS+R PR           SA+KNSD +AARASPTTPLTW+S G      A +SK A +SEVVAT
Subjt:  MSDEEWVNVALSDDSLVVDVLLRLN----SPLPRLRWSLRSPR-----------SAVKNSDAAAARASPTTPLTWTSSG-----GAGKSKNADESEVVAT

Query:  MKRPRKKKTLGELKEEEVLLLKEKKGLKDALATLRLTFEKQRSINGSLKKMKLDFQSQQAIDMVVTSPVRKEAISDDCQPKMPARSICNTTPV-------
        +KRPRKKKTLGELKEEEVLLLKE++ LKDALATLRL+ EKQR++NGSLKKMKLD +SQQAI+ +VTS V +EA SD  Q +MP RSICNTT +       
Subjt:  MKRPRKKKTLGELKEEEVLLLKEKKGLKDALATLRLTFEKQRSINGSLKKMKLDFQSQQAIDMVVTSPVRKEAISDDCQPKMPARSICNTTPV-------

Query:  -DDDASHQLPLTNISCKLQE---MVTVHVLPDLNLPFQEDS
         D DAS+QL L N+SCKLQE   + TV +LPDLNLPFQEDS
Subjt:  -DDDASHQLPLTNISCKLQE---MVTVHVLPDLNLPFQEDS

TrEMBL top hitse value%identityAlignment
A0A0A0K8M7 Uncharacterized protein8.9e-6365.24Show/hide
Query:  MSDEEWVNVALSDDSLVVDVLLRLN----SPLPRLRWSLRSPRS---AVKNSDAAAARASPTTPLTWTSSGGAG------------KSKNADESEVVATM
        MSD EWV+VALSDDSLVVD+LLRLN     PLP L WS+R PRS     +++  +AARASPTTPLTW+SSGG G            +SK A +SEVVATM
Subjt:  MSDEEWVNVALSDDSLVVDVLLRLN----SPLPRLRWSLRSPRS---AVKNSDAAAARASPTTPLTWTSSGGAG------------KSKNADESEVVATM

Query:  KRPRKKKTLGELKEEEVLLLKEKKGLKDALATLRLTFEKQRSINGSLKKMKLDFQSQQAIDMVVTSPVRKEAISDDCQP-KMPARSICNTTPVDDDASHQ
        KRPRKKKTLGELKEEEVLLLKE++ LKDALATLRL+ EKQR++NGSLKKMKLD +SQQA  MVVTS V  EA S+  Q  + P RS+C+TTP+  DAS+Q
Subjt:  KRPRKKKTLGELKEEEVLLLKEKKGLKDALATLRLTFEKQRSINGSLKKMKLDFQSQQAIDMVVTSPVRKEAISDDCQP-KMPARSICNTTPVDDDASHQ

Query:  LPLTNISCKLQEMV---TVHVLPDLNLPFQEDS
        L + N+SCKLQE+    TV +LPDLNLPFQEDS
Subjt:  LPLTNISCKLQEMV---TVHVLPDLNLPFQEDS

A0A1S3CDM0 uncharacterized protein LOC1034998301.4e-6365.95Show/hide
Query:  MSDEEWVNVALSDDSLVVDVLLRLN----SPLPRLRWSLRSPRS---AVKNSDAAAARASPTTPLTWTSSGGAG-----------KSKNADESEVVATMK
        MSD EWV+VALSDDSLVVD+LLRLN     PLP L WS+R PRS     +++  +AARASPTTPLTW+SSGG G           +SK A +SEVVATMK
Subjt:  MSDEEWVNVALSDDSLVVDVLLRLN----SPLPRLRWSLRSPRS---AVKNSDAAAARASPTTPLTWTSSGGAG-----------KSKNADESEVVATMK

Query:  RPRKKKTLGELKEEEVLLLKEKKGLKDALATLRLTFEKQRSINGSLKKMKLDFQSQQAIDMVVTSPVRKEAISDDCQP-KMPARSICNTTPVDDDASHQL
        RPRKKKTLGELKEEEVLLLKE++ LKDALATLRL+ EKQR++NGSLKK+KLD +SQQAI+MVVTS V  EA S+  Q  + P+RSIC+TTP+  DAS+QL
Subjt:  RPRKKKTLGELKEEEVLLLKEKKGLKDALATLRLTFEKQRSINGSLKKMKLDFQSQQAIDMVVTSPVRKEAISDDCQP-KMPARSICNTTPVDDDASHQL

Query:  PLTNISCKLQE---MVTVHVLPDLNLPFQEDS
         + N+SCKLQE   + TV +LPDLNLPFQEDS
Subjt:  PLTNISCKLQE---MVTVHVLPDLNLPFQEDS

A0A5D3CK05 Uncharacterized protein1.4e-6365.95Show/hide
Query:  MSDEEWVNVALSDDSLVVDVLLRLN----SPLPRLRWSLRSPRS---AVKNSDAAAARASPTTPLTWTSSGGAG-----------KSKNADESEVVATMK
        MSD EWV+VALSDDSLVVD+LLRLN     PLP L WS+R PRS     +++  +AARASPTTPLTW+SSGG G           +SK A +SEVVATMK
Subjt:  MSDEEWVNVALSDDSLVVDVLLRLN----SPLPRLRWSLRSPRS---AVKNSDAAAARASPTTPLTWTSSGGAG-----------KSKNADESEVVATMK

Query:  RPRKKKTLGELKEEEVLLLKEKKGLKDALATLRLTFEKQRSINGSLKKMKLDFQSQQAIDMVVTSPVRKEAISDDCQP-KMPARSICNTTPVDDDASHQL
        RPRKKKTLGELKEEEVLLLKE++ LKDALATLRL+ EKQR++NGSLKK+KLD +SQQAI+MVVTS V  EA S+  Q  + P+RSIC+TTP+  DAS+QL
Subjt:  RPRKKKTLGELKEEEVLLLKEKKGLKDALATLRLTFEKQRSINGSLKKMKLDFQSQQAIDMVVTSPVRKEAISDDCQP-KMPARSICNTTPVDDDASHQL

Query:  PLTNISCKLQE---MVTVHVLPDLNLPFQEDS
         + N+SCKLQE   + TV +LPDLNLPFQEDS
Subjt:  PLTNISCKLQE---MVTVHVLPDLNLPFQEDS

A0A6J1H5R9 uncharacterized protein LOC111460736 isoform X11.6e-6466.38Show/hide
Query:  MSDEEWVNVALSDDSLVVDVLLRLN--------SPLPRLRWSLRSPRS---AVKNSDAAAARASPTTPLTWTSSGGAG----------KSKNADESEVVA
        MSD++WV+VALSDDSLVVD+LLRLN        SP  RL WS+R PRS     + +   AARASPTTPLTW+S GGA           +SKNA +SEVVA
Subjt:  MSDEEWVNVALSDDSLVVDVLLRLN--------SPLPRLRWSLRSPRS---AVKNSDAAAARASPTTPLTWTSSGGAG----------KSKNADESEVVA

Query:  TMKRPRKKKTLGELKEEEVLLLKEKKGLKDALATLRLTFEKQRSINGSLKKMKLDFQSQQAIDMVVTSPVRKEAISDDCQPKMPARSICNTTPVDDDA-S
        TMKRPRKKKTLGELKEEEVLLLKE++ LKDALA LRLT EKQRSINGSLKKMK+DF SQQAI+M VTS + KEA SD  QP+    SICNT PV  DA S
Subjt:  TMKRPRKKKTLGELKEEEVLLLKEKKGLKDALATLRLTFEKQRSINGSLKKMKLDFQSQQAIDMVVTSPVRKEAISDDCQPKMPARSICNTTPVDDDA-S

Query:  HQLPLTNISCKLQEM---VTVHVLPDLNLPFQEDS
        +QLPL N+SCKLQEM    TV  +PDLN+PFQEDS
Subjt:  HQLPLTNISCKLQEM---VTVHVLPDLNLPFQEDS

A0A6J1KVJ7 uncharacterized protein LOC111497981 isoform X14.0e-6365.02Show/hide
Query:  MSDEEWVNVALSDDSLVVDVLLRLN--------SPLPRLRWSLRSPR-----------SAVKNSDAAAARASPTTPLTWTSSGGAG----------KSKN
        MSD++WV+VALSDDSLVVD+LLRLN        SP  RL WS+R PR           SA KNSD  AARASPTTPLTW+S GGA           +SKN
Subjt:  MSDEEWVNVALSDDSLVVDVLLRLN--------SPLPRLRWSLRSPR-----------SAVKNSDAAAARASPTTPLTWTSSGGAG----------KSKN

Query:  ADESEVVATMKRPRKKKTLGELKEEEVLLLKEKKGLKDALATLRLTFEKQRSINGSLKKMKLDFQSQQAIDMVVTSPVRKEAISDDCQPKMPARSICNTT
        A +SE VATMKRPRKKKTLGELKEEEVLLLKE++ LKDALA LRLT E+QRSINGSLKKMK+DF SQQAI+M VTS + KE  SD  Q +    SICNT 
Subjt:  ADESEVVATMKRPRKKKTLGELKEEEVLLLKEKKGLKDALATLRLTFEKQRSINGSLKKMKLDFQSQQAIDMVVTSPVRKEAISDDCQPKMPARSICNTT

Query:  PVDDDA-SHQLPLTNISCKLQEM---VTVHVLPDLNLPFQEDS
        PV  DA S+QLPL N+SCKLQEM   VTV  +PDLN+PFQEDS
Subjt:  PVDDDA-SHQLPLTNISCKLQEM---VTVHVLPDLNLPFQEDS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15800.1 unknown protein1.6e-1633.99Show/hide
Query:  MSDEEWVNVALSDDSLVVDVLLRLNSPLP-------------RLRWSLRSPR---SAVKNSDAAAARASPTTPLTWT-----SSGGAG------------
        M+  +W++ A+ DDSLV + L+ L    P             +L+WS+R PR   + ++       RASPTTPL+W+     S GG G            
Subjt:  MSDEEWVNVALSDDSLVVDVLLRLNSPLP-------------RLRWSLRSPR---SAVKNSDAAAARASPTTPLTWT-----SSGGAG------------

Query:  ---------KSKNADESEVVATMKRPRKKKTLGELKEEEVLLLKEKKGLKDALATLRLTFEKQRSINGSLKKMKLDFQSQQAIDMVVTSPVRKEAISDDC
                 +SK    S   +  KR RKKKTL +LKEEE +LLKE+ GL++ LAT++   ++QR+ N SLK  KL  +SQ+  D     P    A+ ++ 
Subjt:  ---------KSKNADESEVVATMKRPRKKKTLGELKEEEVLLLKEKKGLKDALATLRLTFEKQRSINGSLKKMKLDFQSQQAIDMVVTSPVRKEAISDDC

Query:  QPK
         P+
Subjt:  QPK

AT1G80610.1 unknown protein3.8e-2142.62Show/hide
Query:  MSDEEWVNVALSDDSLVVDVLLRLN----------SPLPRLRWSLRSPRSAVKNSDAAAARASPTTPLTWT---------SSGGAG--------------
        MS E W+ VA+SDDS+V + LLRL           SPL +L+WS+R  RS  K  D    RASPTTPL+W+          SGG+G              
Subjt:  MSDEEWVNVALSDDSLVVDVLLRLN----------SPLPRLRWSLRSPRSAVKNSDAAAARASPTTPLTWT---------SSGGAG--------------

Query:  --------KSKNADESEVVAT-----MKRPRKKKTLGELKEEEVLLLKEKKGLKDALATLRLTFEKQRSINGSLKKMKLDFQS
                +SK +  S +  T      KR RKKKTL ELKEEE++LLKE  GLK+ LA +R   E+QR+ N +LKKMK + QS
Subjt:  --------KSKNADESEVVAT-----MKRPRKKKTLGELKEEEVLLLKEKKGLKDALATLRLTFEKQRSINGSLKKMKLDFQS

AT4G32030.1 unknown protein5.3e-1530.86Show/hide
Query:  EEWVNVALSDDSLVVDVLLRL--------NSP---LPRLRWSLRSPRSAVK------------NSDAAAARASPTTPLTW---TSSGGAGKSKNAD----
        ++WV VA++DD LVV++LLRL        ++P   LP LRW +R  RS                 D  + RASP TPL+W   + SGG   S +AD    
Subjt:  EEWVNVALSDDSLVVDVLLRL--------NSP---LPRLRWSLRSPRSAVK------------NSDAAAARASPTTPLTW---TSSGGAGKSKNAD----

Query:  -------------ESEVVAT-------MKRPRKKKTLGELKEEEVLLLKEKKGLKDALATLRLTFEKQRSINGSLKKMKLDFQSQQAIDMVVTSPVRKEA
                      S+V  T        KR +K+K+  ELK EE L LKE+  L+  +A+LR TF++Q   N  LK++KLD  S +  +      +RK  
Subjt:  -------------ESEVVAT-------MKRPRKKKTLGELKEEEVLLLKEKKGLKDALATLRLTFEKQRSINGSLKKMKLDFQSQQAIDMVVTSPVRKEA

Query:  ISDDCQPKMPARSICNTTPVDDDASHQLPLTNISCKLQEMVTVHVLPDLNLPFQED
        +      ++     C T+   +  S                   VLPDLN+   E+
Subjt:  ISDDCQPKMPARSICNTTPVDDDASHQLPLTNISCKLQEMVTVHVLPDLNLPFQED

AT4G32030.2 unknown protein3.1e-0734.84Show/hide
Query:  EEWVNVALSDDSLVVDVLLRL--------NSP---LPRLRWSLRSPRSAVK------------NSDAAAARASPTTPLTW---TSSGGAGKSKNAD----
        ++WV VA++DD LVV++LLRL        ++P   LP LRW +R  RS                 D  + RASP TPL+W   + SGG   S +AD    
Subjt:  EEWVNVALSDDSLVVDVLLRL--------NSP---LPRLRWSLRSPRSAVK------------NSDAAAARASPTTPLTW---TSSGGAGKSKNAD----

Query:  -------------ESEVVAT-------MKRPRKKKTLGELKEEEVLLLKEKKGLK
                      S+V  T        KR +K+K+  ELK EE L LKE+  L+
Subjt:  -------------ESEVVAT-------MKRPRKKKTLGELKEEEVLLLKEKKGLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACATGTCCGACGAAGAATGGGTCAACGTGGCTCTTTCCGACGACTCTCTCGTCGTCGATGTACTCCTCCGCCTCAACTCTCCTCTTCCCCGCCTCCGCTGGTCTCT
CCGGTCTCCTCGCTCCGCCGTCAAAAACTCCGATGCGGCGGCGGCTAGAGCCAGTCCCACCACGCCTCTCACTTGGACTAGCAGCGGCGGCGCCGGAAAATCCAAGAATG
CTGATGAGAGTGAAGTAGTTGCAACAATGAAGAGGCCAAGAAAGAAAAAGACACTGGGAGAACTTAAAGAGGAGGAAGTTTTGCTATTGAAGGAAAAGAAAGGATTGAAA
GATGCCTTGGCTACCTTGCGGCTCACTTTCGAAAAACAGAGGTCTATCAATGGAAGCTTGAAGAAAATGAAGCTTGATTTCCAATCACAACAAGCGATCGACATGGTTGT
AACATCTCCTGTACGGAAGGAGGCGATCTCCGACGATTGTCAACCGAAGATGCCAGCGAGATCGATATGCAACACAACGCCCGTTGATGACGATGCTTCTCACCAATTAC
CTCTGACTAACATTTCTTGCAAACTACAAGAGATGGTTACTGTTCATGTACTACCAGATCTTAATTTGCCATTCCAAGAGGACTCATAG
mRNA sequenceShow/hide mRNA sequence
CCATTTTGCAGTTGTTTGTGTGTGATGGACATGTCCGACGAAGAATGGGTCAACGTGGCTCTTTCCGACGACTCTCTCGTCGTCGATGTACTCCTCCGCCTCAACTCTCC
TCTTCCCCGCCTCCGCTGGTCTCTCCGGTCTCCTCGCTCCGCCGTCAAAAACTCCGATGCGGCGGCGGCTAGAGCCAGTCCCACCACGCCTCTCACTTGGACTAGCAGCG
GCGGCGCCGGAAAATCCAAGAATGCTGATGAGAGTGAAGTAGTTGCAACAATGAAGAGGCCAAGAAAGAAAAAGACACTGGGAGAACTTAAAGAGGAGGAAGTTTTGCTA
TTGAAGGAAAAGAAAGGATTGAAAGATGCCTTGGCTACCTTGCGGCTCACTTTCGAAAAACAGAGGTCTATCAATGGAAGCTTGAAGAAAATGAAGCTTGATTTCCAATC
ACAACAAGCGATCGACATGGTTGTAACATCTCCTGTACGGAAGGAGGCGATCTCCGACGATTGTCAACCGAAGATGCCAGCGAGATCGATATGCAACACAACGCCCGTTG
ATGACGATGCTTCTCACCAATTACCTCTGACTAACATTTCTTGCAAACTACAAGAGATGGTTACTGTTCATGTACTACCAGATCTTAATTTGCCATTCCAAGAGGACTCA
TAGCGCTGAGGACTTATACCGAACGACTTAAGGCAGCAGATTCGAACGTAACGAACCGATATTAAGATTCGATAACTGAGAGGGACGAAAATGGACGACGACGACGACGA
CAACATTTGCTTCTTAATAAGATGCAGTTCGTGAATCCTAACCTTGTCATAGGCTTGAGTTACTTATAACTGTAATTACTAGACAAATTTTCACAGCTGGATTTTCATAT
AGCTTCTTTTTTTCCCCTTGCCATAGTACCCGAAAAAATCGAGCATAGTACCGAGACGAGACCCCGAAAACCCTTTAGCTCTTGAGGTATATTTTTCATTTACAGGCTTC
TGCATCTGCTTTTTATTTTTATTTTTATTTTTTTAAGCCAAACCTTCTCCTTTATTCCAGAGTCAGAGATTAGTAGATTACAAAC
Protein sequenceShow/hide protein sequence
MDMSDEEWVNVALSDDSLVVDVLLRLNSPLPRLRWSLRSPRSAVKNSDAAAARASPTTPLTWTSSGGAGKSKNADESEVVATMKRPRKKKTLGELKEEEVLLLKEKKGLK
DALATLRLTFEKQRSINGSLKKMKLDFQSQQAIDMVVTSPVRKEAISDDCQPKMPARSICNTTPVDDDASHQLPLTNISCKLQEMVTVHVLPDLNLPFQEDS