CuGenDBv2

Tan0011063 (gene) of Snake gourd v1 genome

Gene ID: Tan0011063
Organism: Trichosanthes anguina (Snake gourd v1)
Description: CSL zinc finger domain-containing protein
Genome location: LG05:18826153..18829622
RNA-Seq Expression: Tan0011063
Synteny: Tan0011063
Gene Ontology terms:
GO:0002098 - tRNA wobble uridine modification (biological process)
GO:0017183 - peptidyl-diphthamide biosynthetic process from peptidyl-histidine (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005829 - cytosol (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR044248 - Diphthamide biosynthesis protein 3/4-like
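
The genome location listed above can be unpacked into a sequence name and coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for `seqid:start..end` notation, though this page does not state it). `parse_location` is a hypothetical helper for illustration, not part of CuGenDBv2.

```python
import re

def parse_location(location):
    """Split 'seqid:start..end' into (seqid, start, end, span_length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if end < start:
        raise ValueError("end coordinate is smaller than start coordinate")
    return seqid, start, end, end - start + 1

seqid, start, end, span = parse_location("LG05:18826153..18829622")
print(seqid, start, end, span)
```

Under that assumption the locus on LG05 spans 3,470 bp, which is longer than the 723-nt CDS listed under Sequences.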


Homology
GenBank top hits (e value, % identity, alignment)
KAG6605113.1 hypothetical protein SDJN03_02430, partial [Cucurbita argyrosperma subsp. sororia] | e value: 9.6e-108 | % identity: 84.78
Query:  MRTAVTVALVATALILIAEGANTNDVYSPCLDAKIQRSDGFSFGVAISSKESFFQDQIQFSPCDKRLALASKNAQLAVFRPEVDQLTFLTINTTTFNPAM
        M T VT AL+   LIL+AEG +TNDVYSPCLDAKIQR DGF+FGVA SSKESFF D IQFSPCDKRLAL  K AQLAVFRP+VDQLTFLTIN+T FNPA 
Subjt:  MRTAVTVALVATALILIAEGANTNDVYSPCLDAKIQRSDGFSFGVAISSKESFFQDQIQFSPCDKRLALASKNAQLAVFRPEVDQLTFLTINTTTFNPAM

Query:  YGGYMVAFAGQKYAARSLPVMVTDNSHTITSFTLVFEFQKGTLQNLFWKKFGCDKCTGDFSVCLDNQDCAVQSSKCKYNGGSVDCNLGIQLAFSGTDKNL
        YGGYMVAFAG KYAARSLPVMVTDNSHTITSFTLVFEFQKGTLQNLFWKKFGCDKC+GDFS CLD QDCAV SSKCKYNGGS+DCNLGIQLAFSGTDKNL
Subjt:  YGGYMVAFAGQKYAARSLPVMVTDNSHTITSFTLVFEFQKGTLQNLFWKKFGCDKCTGDFSVCLDNQDCAVQSSKCKYNGGSVDCNLGIQLAFSGTDKNL

Query:  EVLNSWFEIDNLRRFSLYKLFSDVRDTISD
        +VLNSWFE+++LRRFSLYKLFSDVRD I++
Subjt:  EVLNSWFEIDNLRRFSLYKLFSDVRDTISD

XP_008459230.1 PREDICTED: uncharacterized protein LOC103498418 [Cucumis melo] | e value: 1.2e-110 | % identity: 84.62
Query:  VALVATALILIAEGANTNDVYSPCLDAKIQRSDGFSFGVAISSKESFFQDQIQFSPCDKRLALASKNAQLAVFRPEVDQLTFLTINTTTFNPAMYGGYMV
        VALV T L+L+AEG +TNDVYSPCLD+KIQRSDGF+FGVA SSKESFFQDQIQFSPCD RL+LASKNAQLAVFRP+VDQL+FLTI+T+TFNPA+ GGYMV
Subjt:  VALVATALILIAEGANTNDVYSPCLDAKIQRSDGFSFGVAISSKESFFQDQIQFSPCDKRLALASKNAQLAVFRPEVDQLTFLTINTTTFNPAMYGGYMV

Query:  AFAGQKYAARSLPVMVTDNSHTITSFTLVFEFQKGTLQNLFWKKFGCDKCTGDFSVCLDNQDCAVQSSKCKYNGGSVDCNLGIQLAFSGTDKNLEVLNSW
        AFAGQKYAARSLPVMVTDNSHTITSFTLVFEF++GTLQNLFWKKFGCDKC+GDFS+C+DNQDCA+ SSKCKY+GGSVDCNLGIQLAFSGTDKNLEVLNSW
Subjt:  AFAGQKYAARSLPVMVTDNSHTITSFTLVFEFQKGTLQNLFWKKFGCDKCTGDFSVCLDNQDCAVQSSKCKYNGGSVDCNLGIQLAFSGTDKNLEVLNSW

Query:  FEIDNLRRFSLYKLFSDVRDTISDVRDTITDPFG
        +EIDNLRRFSLY+LF       SDVRDT+T+PFG
Subjt:  FEIDNLRRFSLYKLFSDVRDTISDVRDTITDPFG

XP_022148528.1 uncharacterized protein LOC111017155 [Momordica charantia] | e value: 2.1e-115 | % identity: 85
Query:  MRTAVTVALVATALILIAEGANTNDVYSPCLDAKIQRSDGFSFGVAISSKESFFQDQIQFSPCDKRLALASKNAQLAVFRPEVDQLTFLTINTTTFNPAM
        MRTA T+AL A  LIL+AEG +TNDV+SPCLDAK+QRSDGF+FGVA SSKESFFQDQIQFSPCD+RL LASKNAQL VFRP VDQL+ LTIN+TTFNPA+
Subjt:  MRTAVTVALVATALILIAEGANTNDVYSPCLDAKIQRSDGFSFGVAISSKESFFQDQIQFSPCDKRLALASKNAQLAVFRPEVDQLTFLTINTTTFNPAM

Query:  YGGYMVAFAGQKYAARSLPVMVTDNSHTITSFTLVFEFQKGTLQNLFWKKFGCDKCTGDFSVCLDNQDCAVQSSKCKYNGGSVDCNLGIQLAFSGTDKNL
        YGGYMVAFAGQKYAARSLPV++TDNS+TITSFTLV EFQKGTLQNLFWKKFGCDKCTGD+SVCLDNQDCAV ++KCK+NGGS+DCNL IQLAFSGTD+NL
Subjt:  YGGYMVAFAGQKYAARSLPVMVTDNSHTITSFTLVFEFQKGTLQNLFWKKFGCDKCTGDFSVCLDNQDCAVQSSKCKYNGGSVDCNLGIQLAFSGTDKNL

Query:  EVLNSWFEIDNLRRFSLYKLFSDVRDTISDVRDTITDPFG
        EVLNSWFE+DNLRR+SLYKLFSDVRDTISDV DTIT+PFG
Subjt:  EVLNSWFEIDNLRRFSLYKLFSDVRDTISDVRDTITDPFG

XP_023513242.1 uncharacterized protein LOC111777756 [Cucurbita pepo subsp. pepo] | e value: 4.3e-108 | % identity: 81.67
Query:  MRTAVTVALVATALILIAEGANTNDVYSPCLDAKIQRSDGFSFGVAISSKESFFQDQIQFSPCDKRLALASKNAQLAVFRPEVDQLTFLTINTTTFNPAM
        MRT VTV LV T L+L+AEG +TNDVYSPCLDAKIQ+SDGF+FG+A SSKE+FFQDQIQFSPCD RLAL  KN QLA+FRP+VDQL+ LTIN+TTFNPAM
Subjt:  MRTAVTVALVATALILIAEGANTNDVYSPCLDAKIQRSDGFSFGVAISSKESFFQDQIQFSPCDKRLALASKNAQLAVFRPEVDQLTFLTINTTTFNPAM

Query:  YGGYMVAFAGQKYAARSLPVMVTDNSHTITSFTLVFEFQKGTLQNLFWKKFGCDKCTGDFSVCLDNQDCAVQSSKCKYNGGSVDCNLGIQLAFSGTDKNL
         GGYMVAFAG KYAARSLPVM+TDNSHTITSFTLVFEFQKGTLQNLFWKK+GC+KCTGDFSVCLDNQDCAV SSKCKY+GGSVDCN+ IQLAFSGTD+NL
Subjt:  YGGYMVAFAGQKYAARSLPVMVTDNSHTITSFTLVFEFQKGTLQNLFWKKFGCDKCTGDFSVCLDNQDCAVQSSKCKYNGGSVDCNLGIQLAFSGTDKNL

Query:  EVLNSWFEIDNLRRFSLYKLFSDVRDTISDVRDTITDPFG
        EVLNSWFEIDNL RFSL+KLF       +DVRDT+T+PFG
Subjt:  EVLNSWFEIDNLRRFSLYKLFSDVRDTISDVRDTITDPFG

XP_038877911.1 uncharacterized protein LOC120070122 [Benincasa hispida] | e value: 5.4e-111 | % identity: 83.33
Query:  MRTAVTVALVATALILIAEGANTNDVYSPCLDAKIQRSDGFSFGVAISSKESFFQDQIQFSPCDKRLALASKNAQLAVFRPEVDQLTFLTINTTTFNPAM
        MR  +   +V   L+L  EG NTND+YSPCLD+KIQRSDGF+FGVA SSKESFFQDQIQ SPCD RLALASKNAQLAVFRPEVDQL+FLTIN++TFNPAM
Subjt:  MRTAVTVALVATALILIAEGANTNDVYSPCLDAKIQRSDGFSFGVAISSKESFFQDQIQFSPCDKRLALASKNAQLAVFRPEVDQLTFLTINTTTFNPAM

Query:  YGGYMVAFAGQKYAARSLPVMVTDNSHTITSFTLVFEFQKGTLQNLFWKKFGCDKCTGDFSVCLDNQDCAVQSSKCKYNGGSVDCNLGIQLAFSGTDKNL
         GGYMVAFAG+KYAARSLPVM+TDNSHTITSFTLVFEFQKGTLQNL WKKFGCDKC+GDFSVCLDNQDCAVQSSKCKYNGGSVDCNL IQLAFSGTDKNL
Subjt:  YGGYMVAFAGQKYAARSLPVMVTDNSHTITSFTLVFEFQKGTLQNLFWKKFGCDKCTGDFSVCLDNQDCAVQSSKCKYNGGSVDCNLGIQLAFSGTDKNL

Query:  EVLNSWFEIDNLRRFSLYKLFSDVRDTISDVRDTITDPFG
        EVLNSW+E+DNLRRFSLYKLF       SDVRDT+T+PFG
Subjt:  EVLNSWFEIDNLRRFSLYKLFSDVRDTISDVRDTITDPFG

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CA87 uncharacterized protein LOC103498418 | e value: 5.9e-111 | % identity: 84.62
Query:  VALVATALILIAEGANTNDVYSPCLDAKIQRSDGFSFGVAISSKESFFQDQIQFSPCDKRLALASKNAQLAVFRPEVDQLTFLTINTTTFNPAMYGGYMV
        VALV T L+L+AEG +TNDVYSPCLD+KIQRSDGF+FGVA SSKESFFQDQIQFSPCD RL+LASKNAQLAVFRP+VDQL+FLTI+T+TFNPA+ GGYMV
Subjt:  VALVATALILIAEGANTNDVYSPCLDAKIQRSDGFSFGVAISSKESFFQDQIQFSPCDKRLALASKNAQLAVFRPEVDQLTFLTINTTTFNPAMYGGYMV

Query:  AFAGQKYAARSLPVMVTDNSHTITSFTLVFEFQKGTLQNLFWKKFGCDKCTGDFSVCLDNQDCAVQSSKCKYNGGSVDCNLGIQLAFSGTDKNLEVLNSW
        AFAGQKYAARSLPVMVTDNSHTITSFTLVFEF++GTLQNLFWKKFGCDKC+GDFS+C+DNQDCA+ SSKCKY+GGSVDCNLGIQLAFSGTDKNLEVLNSW
Subjt:  AFAGQKYAARSLPVMVTDNSHTITSFTLVFEFQKGTLQNLFWKKFGCDKCTGDFSVCLDNQDCAVQSSKCKYNGGSVDCNLGIQLAFSGTDKNLEVLNSW

Query:  FEIDNLRRFSLYKLFSDVRDTISDVRDTITDPFG
        +EIDNLRRFSLY+LF       SDVRDT+T+PFG
Subjt:  FEIDNLRRFSLYKLFSDVRDTISDVRDTITDPFG

A0A5D3CQC8 Uncharacterized protein | e value: 5.9e-111 | % identity: 84.62
Query:  VALVATALILIAEGANTNDVYSPCLDAKIQRSDGFSFGVAISSKESFFQDQIQFSPCDKRLALASKNAQLAVFRPEVDQLTFLTINTTTFNPAMYGGYMV
        VALV T L+L+AEG +TNDVYSPCLD+KIQRSDGF+FGVA SSKESFFQDQIQFSPCD RL+LASKNAQLAVFRP+VDQL+FLTI+T+TFNPA+ GGYMV
Subjt:  VALVATALILIAEGANTNDVYSPCLDAKIQRSDGFSFGVAISSKESFFQDQIQFSPCDKRLALASKNAQLAVFRPEVDQLTFLTINTTTFNPAMYGGYMV

Query:  AFAGQKYAARSLPVMVTDNSHTITSFTLVFEFQKGTLQNLFWKKFGCDKCTGDFSVCLDNQDCAVQSSKCKYNGGSVDCNLGIQLAFSGTDKNLEVLNSW
        AFAGQKYAARSLPVMVTDNSHTITSFTLVFEF++GTLQNLFWKKFGCDKC+GDFS+C+DNQDCA+ SSKCKY+GGSVDCNLGIQLAFSGTDKNLEVLNSW
Subjt:  AFAGQKYAARSLPVMVTDNSHTITSFTLVFEFQKGTLQNLFWKKFGCDKCTGDFSVCLDNQDCAVQSSKCKYNGGSVDCNLGIQLAFSGTDKNLEVLNSW

Query:  FEIDNLRRFSLYKLFSDVRDTISDVRDTITDPFG
        +EIDNLRRFSLY+LF       SDVRDT+T+PFG
Subjt:  FEIDNLRRFSLYKLFSDVRDTISDVRDTITDPFG

A0A6J1D4C1 uncharacterized protein LOC111017155 | e value: 1.0e-115 | % identity: 85
Query:  MRTAVTVALVATALILIAEGANTNDVYSPCLDAKIQRSDGFSFGVAISSKESFFQDQIQFSPCDKRLALASKNAQLAVFRPEVDQLTFLTINTTTFNPAM
        MRTA T+AL A  LIL+AEG +TNDV+SPCLDAK+QRSDGF+FGVA SSKESFFQDQIQFSPCD+RL LASKNAQL VFRP VDQL+ LTIN+TTFNPA+
Subjt:  MRTAVTVALVATALILIAEGANTNDVYSPCLDAKIQRSDGFSFGVAISSKESFFQDQIQFSPCDKRLALASKNAQLAVFRPEVDQLTFLTINTTTFNPAM

Query:  YGGYMVAFAGQKYAARSLPVMVTDNSHTITSFTLVFEFQKGTLQNLFWKKFGCDKCTGDFSVCLDNQDCAVQSSKCKYNGGSVDCNLGIQLAFSGTDKNL
        YGGYMVAFAGQKYAARSLPV++TDNS+TITSFTLV EFQKGTLQNLFWKKFGCDKCTGD+SVCLDNQDCAV ++KCK+NGGS+DCNL IQLAFSGTD+NL
Subjt:  YGGYMVAFAGQKYAARSLPVMVTDNSHTITSFTLVFEFQKGTLQNLFWKKFGCDKCTGDFSVCLDNQDCAVQSSKCKYNGGSVDCNLGIQLAFSGTDKNL

Query:  EVLNSWFEIDNLRRFSLYKLFSDVRDTISDVRDTITDPFG
        EVLNSWFE+DNLRR+SLYKLFSDVRDTISDV DTIT+PFG
Subjt:  EVLNSWFEIDNLRRFSLYKLFSDVRDTISDVRDTITDPFG

A0A6J1FVK1 uncharacterized protein LOC111448855 | e value: 1.0e-107 | % identity: 80.93
Query:  VTVALVATALILIAEGANTNDVYSPCLDAKIQRSDGFSFGVAISSKESFFQDQIQFSPCDKRLALASKNAQLAVFRPEVDQLTFLTINTTTFNPAMYGGY
        VTV LV T L+L+AEG +TNDVYSPCLDAKIQ+SDGF+FG+A SSKE+FFQDQIQFSPCD RLAL  KN QLA+FRP+VDQL+ LTIN+TTFNPAM GGY
Subjt:  VTVALVATALILIAEGANTNDVYSPCLDAKIQRSDGFSFGVAISSKESFFQDQIQFSPCDKRLALASKNAQLAVFRPEVDQLTFLTINTTTFNPAMYGGY

Query:  MVAFAGQKYAARSLPVMVTDNSHTITSFTLVFEFQKGTLQNLFWKKFGCDKCTGDFSVCLDNQDCAVQSSKCKYNGGSVDCNLGIQLAFSGTDKNLEVLN
        MVAFAG KYAARSLPVM+TDNSHTITSFTLVFEFQ+GTLQNLFWKK+GC+KCTGDFSVCLDNQDCAV SSKCKY+GGSVDCN+ IQLAFSGTD+NLEVLN
Subjt:  MVAFAGQKYAARSLPVMVTDNSHTITSFTLVFEFQKGTLQNLFWKKFGCDKCTGDFSVCLDNQDCAVQSSKCKYNGGSVDCNLGIQLAFSGTDKNLEVLN

Query:  SWFEIDNLRRFSLYKLFSDVRDTISDVRDTITDPFG
        SWFE+DNL RFSL+KLF       +DVRDT+T+PFG
Subjt:  SWFEIDNLRRFSLYKLFSDVRDTISDVRDTITDPFG

A0A6J1G6F2 uncharacterized protein LOC111451295 | e value: 1.8e-107 | % identity: 84.35
Query:  MRTAVTVALVATALILIAEGANTNDVYSPCLDAKIQRSDGFSFGVAISSKESFFQDQIQFSPCDKRLALASKNAQLAVFRPEVDQLTFLTINTTTFNPAM
        M T VT AL+   LIL+AEG +TNDVYSPCLDAKIQR DGF+FGVA SSKESFF D IQFSPCDKRLAL  K AQLAVFRP+VDQLTFLTIN+T FNPA 
Subjt:  MRTAVTVALVATALILIAEGANTNDVYSPCLDAKIQRSDGFSFGVAISSKESFFQDQIQFSPCDKRLALASKNAQLAVFRPEVDQLTFLTINTTTFNPAM

Query:  YGGYMVAFAGQKYAARSLPVMVTDNSHTITSFTLVFEFQKGTLQNLFWKKFGCDKCTGDFSVCLDNQDCAVQSSKCKYNGGSVDCNLGIQLAFSGTDKNL
        YGGYMVAFAG KYAARSLPVMVTDNSHTITSFTLVFEFQKGTLQNLFWKKFGCDKC+GDFS CLD QDCAV SSKCKY+GGS+DCNLGIQLAFSGTDKNL
Subjt:  YGGYMVAFAGQKYAARSLPVMVTDNSHTITSFTLVFEFQKGTLQNLFWKKFGCDKCTGDFSVCLDNQDCAVQSSKCKYNGGSVDCNLGIQLAFSGTDKNL

Query:  EVLNSWFEIDNLRRFSLYKLFSDVRDTISD
        +VLNSWFE+++LRRFSLYKLFSDVRD I++
Subjt:  EVLNSWFEIDNLRRFSLYKLFSDVRDTISD

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G15910.1 CSL zinc finger domain-containing protein | e value: 8.7e-75 | % identity: 55.33
Query:  MRTAVTVALVATALILIAE---GANTNDVYSPCLDAKIQRSDGFSFGVAISSKESFFQDQIQFSPCDKRLALASKNAQLAVFRPEVDQLTFLTINTTTFN
        MR + T+ ++   ++++ +    A+ N VYSPC D +I + DGF+ G+AISSKE+FF DQ+Q SPCD RL LA+K AQLA+FRP+VD+++ L+I+T+ FN
Subjt:  MRTAVTVALVATALILIAE---GANTNDVYSPCLDAKIQRSDGFSFGVAISSKESFFQDQIQFSPCDKRLALASKNAQLAVFRPEVDQLTFLTINTTTFN

Query:  PAMYGGYMVAFAGQKYAARSLPVMVTDNSHTITSFT---------LVFEFQKGTLQNLFWKKFGCDKCTG---DFSVCLDNQDCAVQSSKCKYNGGSVDC
        P+  GG+MV FAG KYAARS PV V D S+TIT+FT         LV EFQKG LQNLFWK FGCD C G     SVCL+  DCAV +SKCK NGG  +C
Subjt:  PAMYGGYMVAFAGQKYAARSLPVMVTDNSHTITSFT---------LVFEFQKGTLQNLFWKKFGCDKCTG---DFSVCLDNQDCAVQSSKCKYNGGSVDC

Query:  NLGIQLAFSGTDKNLEVLNSWFEIDNLRRFSLYKLFSDVRDTIS
        N+GIQ+AFSGTD+NLE LN+W+E++NLR++SL  L+++  D++S
Subjt:  NLGIQLAFSGTDKNLEVLNSWFEIDNLRRFSLYKLFSDVRDTIS

AT3G11800.1 unknown protein | e value: 1.9e-69 | % identity: 54.05
Query:  IAEGANTNDVYSPCLDAKIQRSDGFSFGVAISSKESFF----QDQIQFSPCDKRLALASKNAQLAVFRPEVDQLTFLTINT---TTFNPAMYGGYMVAFA
        + E  + N VYSPC D+ +   DGF+FG+A ++K+SFF       +Q+SPCD R    + N+++AVFRP+VD++T LTINT   ++F P    GYMVAFA
Subjt:  IAEGANTNDVYSPCLDAKIQRSDGFSFGVAISSKESFF----QDQIQFSPCDKRLALASKNAQLAVFRPEVDQLTFLTINT---TTFNPAMYGGYMVAFA

Query:  GQKYAARSLPVMVTDNSHTITSFTLVFEFQKGTLQNLFWKKFGCDKCTGDFS-VCLDNQDCAVQSSKCKYNGGSVDCNLGIQLAFSGTDKNLEVLNSWFE
        G KYAARSLP+MV D++H +TSFTLV EFQKG L+N+FWKK GC KC+GD   VCL+ ++CA++   CK  GG VDC+LGIQLAFSGTDK+   LNSW+E
Subjt:  GQKYAARSLPVMVTDNSHTITSFTLVFEFQKGTLQNLFWKKFGCDKCTGDFS-VCLDNQDCAVQSSKCKYNGGSVDCNLGIQLAFSGTDKNLEVLNSWFE

Query:  IDNLRRFSLYKLFSDVRDTISD
        + NL+++SLY L+S+++D++++
Subjt:  IDNLRRFSLYKLFSDVRDTISD

AT3G44150.1 unknown protein | e value: 3.7e-73 | % identity: 53.85
Query:  VTVALVATALILIAEG------ANTNDVYSPCLDAKIQRSDGFSFGVAISSKESFFQDQ-IQFSPCDKRLALASKNAQLAVFRPEVDQLTFLTINTTTFN
        +T+ + +  ++ +A G       NTN +YSPC D +IQRSDGF+FG+A SS+ SFF +Q +  SPCD+RL+LA+ N+Q +VFRP++D+++ L+INT+ F 
Subjt:  VTVALVATALILIAEG------ANTNDVYSPCLDAKIQRSDGFSFGVAISSKESFFQDQ-IQFSPCDKRLALASKNAQLAVFRPEVDQLTFLTINTTTFN

Query:  PAMYGGYMVAFAGQKYAARSLPVMVTDNSHTITSFTLVFEFQKGTLQNLFWKKFGCDKCTGDFS-VCLDNQDCAVQSSKCKYNGGSVDCNLGIQLAFSGT
        P  YGGYMVAFAG+KYAARS+P  + +++  +TSFTLV EFQKG LQNL+WK+ GC  C G+ + VCL+ QDCA+++  CK  GG+VDC+LGIQLAFSGT
Subjt:  PAMYGGYMVAFAGQKYAARSLPVMVTDNSHTITSFTLVFEFQKGTLQNLFWKKFGCDKCTGDFS-VCLDNQDCAVQSSKCKYNGGSVDCNLGIQLAFSGT

Query:  DKNLEVLNSWFEIDNLRRFSLYKLFSDVRDTISD
        DK+L VLNSW+E++NL+++SLY L+S+++ ++++
Subjt:  DKNLEVLNSWFEIDNLRRFSLYKLFSDVRDTISD

AT3G48630.1 unknown protein | e value: 8.6e-06 | % identity: 42.31
Query:  YMVAFAGQKYAARSLPVMVTDNSHTITSFTLVFEFQKGTLQNLFWKKFGCDK
        Y V   G +  +   P  + +++  +TSFT V EFQKG LQNL+WK+  C K
Subjt:  YMVAFAGQKYAARSLPVMVTDNSHTITSFTLVFEFQKGTLQNLFWKKFGCDK


Sequences
CDS sequence
ATGAGGACGGCGGTGACGGTGGCTCTGGTTGCGACGGCGCTGATTTTGATTGCGGAAGGGGCGAATACGAACGATGTATACAGTCCTTGTTTGGATGCGAAGATTCAGAG
ATCAGACGGATTTAGTTTCGGCGTAGCCATTTCGTCGAAGGAATCGTTCTTTCAGGATCAGATTCAATTTTCGCCTTGCGATAAACGTCTTGCTTTGGCATCCAAAAATG
CTCAACTTGCTGTTTTCAGGCCTGAGGTCGACCAGCTCACTTTCCTCACCATTAATACCACCACCTTCAATCCGGCTATGTATGGTGGGTATATGGTAGCATTTGCTGGG
CAGAAGTATGCAGCAAGATCTCTCCCAGTAATGGTTACTGATAATTCTCACACCATTACCAGTTTCACTTTGGTTTTTGAATTTCAAAAAGGCACCCTTCAAAATCTATT
CTGGAAGAAATTTGGGTGTGATAAATGCACCGGCGATTTTTCAGTTTGCCTGGACAACCAAGACTGTGCAGTTCAAAGCTCCAAATGTAAGTACAATGGTGGATCAGTTG
ACTGCAATTTAGGCATACAACTAGCTTTTTCAGGCACAGACAAGAACCTGGAAGTCCTCAACTCCTGGTTTGAAATTGACAATCTCAGGCGCTTCTCACTCTATAAACTT
TTCTCCGATGTTCGTGATACGATCTCCGATGTTCGTGATACCATCACCGATCCATTCGGGTAA
mRNA sequence
ATGAGGACGGCGGTGACGGTGGCTCTGGTTGCGACGGCGCTGATTTTGATTGCGGAAGGGGCGAATACGAACGATGTATACAGTCCTTGTTTGGATGCGAAGATTCAGAG
ATCAGACGGATTTAGTTTCGGCGTAGCCATTTCGTCGAAGGAATCGTTCTTTCAGGATCAGATTCAATTTTCGCCTTGCGATAAACGTCTTGCTTTGGCATCCAAAAATG
CTCAACTTGCTGTTTTCAGGCCTGAGGTCGACCAGCTCACTTTCCTCACCATTAATACCACCACCTTCAATCCGGCTATGTATGGTGGGTATATGGTAGCATTTGCTGGG
CAGAAGTATGCAGCAAGATCTCTCCCAGTAATGGTTACTGATAATTCTCACACCATTACCAGTTTCACTTTGGTTTTTGAATTTCAAAAAGGCACCCTTCAAAATCTATT
CTGGAAGAAATTTGGGTGTGATAAATGCACCGGCGATTTTTCAGTTTGCCTGGACAACCAAGACTGTGCAGTTCAAAGCTCCAAATGTAAGTACAATGGTGGATCAGTTG
ACTGCAATTTAGGCATACAACTAGCTTTTTCAGGCACAGACAAGAACCTGGAAGTCCTCAACTCCTGGTTTGAAATTGACAATCTCAGGCGCTTCTCACTCTATAAACTT
TTCTCCGATGTTCGTGATACGATCTCCGATGTTCGTGATACCATCACCGATCCATTCGGGTAA
Protein sequence
MRTAVTVALVATALILIAEGANTNDVYSPCLDAKIQRSDGFSFGVAISSKESFFQDQIQFSPCDKRLALASKNAQLAVFRPEVDQLTFLTINTTTFNPAMYGGYMVAFAG
QKYAARSLPVMVTDNSHTITSFTLVFEFQKGTLQNLFWKKFGCDKCTGDFSVCLDNQDCAVQSSKCKYNGGSVDCNLGIQLAFSGTDKNLEVLNSWFEIDNLRRFSLYKL
FSDVRDTISDVRDTITDPFG
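
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies; `translate` is a hypothetical helper written for illustration, and the example input is the final line of the CDS listing, which starts on a codon boundary because the six preceding lines hold 110 nt each.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)

# Final line of the CDS listing (63 nt, including the TAA stop codon).
cds_tail = "TTCTCCGATGTTCGTGATACGATCTCCGATGTTCGTGATACCATCACCGATCCATTCGGGTAA"
print(translate(cds_tail))
```

The printed residues can be compared against the last line of the protein sequence; passing the full CDS, joined across its lines, extends the same check to the whole protein.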