; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014217 (gene) of Snake gourd v1 genome

Gene IDTan0014217
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionENDO3c domain-containing protein
Genome locationLG11:63697974..63699505
RNA-Seq ExpressionTan0014217
SyntenyTan0014217
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585875.1 hypothetical protein SDJN03_18608, partial [Cucurbita argyrosperma subsp. sororia]1.7e-13976.06Show/hide
Query:  LNLNLGVSRSSVFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSKVSVLVSINQSSSFLLTIHIHSSLPLSALDKQAILDQVARMLRISEKDEDDL
        + L LGV R S F+LEKAVCNHG FMM PN+WIPSSKTLQRPLRLSNS  S+LVSINQSSS LLT+ IHS   L   D+ AILDQVARMLR++EKDED++
Subjt:  LNLNLGVSRSSVFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSKVSVLVSINQSSSFLLTIHIHSSLPLSALDKQAILDQVARMLRISEKDEDDL

Query:  TKFQNLHPRAKEIGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMARELCELQAKMMRESKKRKRKGEIREYEGGNFPNATEVCRMGVEVLKKHLLGYRA
         +FQNLHP AK+IGFGRIFRSP+LFED VKSIL+CNTSWRRTL MA +LCE+QAK MRESKKRKRKG     E GNFPNA EVCRMGVE LK H LGYRA
Subjt:  TKFQNLHPRAKEIGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMARELCELQAKMMRESKKRKRKGEIREYEGGNFPNATEVCRMGVEVLKKHLLGYRA

Query:  GYIIKFAQNVQNGRIDLQKFEKALSSPNAFPKIKGFGPFATANILMCLGFYHRLPIDTETIRHLKQVHGREFCTNKTVGEDVKQVYDKYAPFQCLAYWFE
         Y++KFAQ+V++GRI+LQ  EK +SSP+AFPKIKGFGPFATANI MCLGFYH+LPIDTETIRHLKQVHG ++CT KTVGEDVKQ+YD YAP+QCLAYW E
Subjt:  GYIIKFAQNVQNGRIDLQKFEKALSSPNAFPKIKGFGPFATANILMCLGFYHRLPIDTETIRHLKQVHGREFCTNKTVGEDVKQVYDKYAPFQCLAYWFE

Query:  LVEYYESKFGKLSELCSLDYHKISGTTVNL
        LV+YYE+KFGKLSEL S DYHKISG+T++L
Subjt:  LVEYYESKFGKLSELCSLDYHKISGTTVNL

XP_021905122.1 uncharacterized protein LOC110820055 isoform X2 [Carica papaya]5.1e-9153.06Show/hide
Query:  LNLNLNLGVSRSSVFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSKVSVLVSINQ-SSSFLLTIHIHSSLPLSALDKQAILDQVARMLRISEKDE
        +NL L+LG  + S F+LEKAVCNHG FMMPPN W PS KTL+RPLRLSN   SV  SI+  S+S  L I +H    +S+ DK AIL+QV RMLRIS+KDE
Subjt:  LNLNLNLGVSRSSVFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSKVSVLVSINQ-SSSFLLTIHIHSSLPLSALDKQAILDQVARMLRISEKDE

Query:  DDLTKFQNLHPRAKEIGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMARELCELQAKMMR----ESKKRKRKGEIRE---------YEGGNFPNATEVC
        + + +FQ +H  AK  GFGR+FRSP+LFED VKS+LLCN +W RTL MA+ LCELQ +++R    E +K++++   R          +  GNFPNA E+ 
Subjt:  DDLTKFQNLHPRAKEIGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMARELCELQAKMMR----ESKKRKRKGEIRE---------YEGGNFPNATEVC

Query:  RMGVEVLKKHL-LGYRAGYIIKFAQNVQNGRIDLQKFEKALSSPNAFPKIKGFGPFATANILMCLGFYHRLPIDTETIRHLKQVHGREFCTNKTVGEDVK
         +  ++L++   LGYRA Y+I  AQ V++GR+DL   +  +       KIKGFG F  AN+ MC+GFY  +P DTET+RHLKQVHG E C+  T+ +DVK
Subjt:  RMGVEVLKKHL-LGYRAGYIIKFAQNVQNGRIDLQKFEKALSSPNAFPKIKGFGPFATANILMCLGFYHRLPIDTETIRHLKQVHGREFCTNKTVGEDVK

Query:  QVYDKYAPFQCLAYWFELVEYYESKFGKLSELCSLDYHKISGT
         +YDKY+PFQ LAYWFEL+ YYESK GKLSEL    Y  ++G+
Subjt:  QVYDKYAPFQCLAYWFELVEYYESKFGKLSELCSLDYHKISGT

XP_022156993.1 uncharacterized protein LOC111023822 [Momordica charantia]3.1e-12871.68Show/hide
Query:  MKNNPLNLNLNLGVSRSSVFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSKVSVLVSINQSSSFLLTIHIHSSLPLSALDKQAILDQVARMLRIS
        M+ +   ++LNLG + S  FDLE+AVCNHG FMMPPNKWIPSSKTLQRPLRL++S  SVLVSI+Q SS LL I IHSS   S LD+QAILDQV RMLRI+
Subjt:  MKNNPLNLNLNLGVSRSSVFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSKVSVLVSINQSSSFLLTIHIHSSLPLSALDKQAILDQVARMLRIS

Query:  EKDEDDLTKFQNLHPRAKEIGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMARELCELQAKMMR----ESKKRKRKGEIR-EYEGGNFPNATEVCRMGV
        E+DE+++  FQNLH +AKEIGFGR+FRSPTLFEDAVKSILLCN +WRRTLAMA +LCELQAK+ R    + KKRKRKG+   E EGGNFP A E+CRM V
Subjt:  EKDEDDLTKFQNLHPRAKEIGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMARELCELQAKMMR----ESKKRKRKGEIR-EYEGGNFPNATEVCRMGV

Query:  EVLKKHLLGYRAGYIIKFAQNVQNGRIDLQKFEKALSSPNAFPKIKGFGPFATANILMCLGFYHRLPIDTETIRHLKQVHGREFCTNKTVGEDVKQVYDK
         +L+KH +GYRA YII  AQ VQNG+IDLQK E+ALS    FPKIKGFGPF TAN+ MCLG Y RLPIDTETIRHLKQVHGR+ C  KT  E VK VYDK
Subjt:  EVLKKHLLGYRAGYIIKFAQNVQNGRIDLQKFEKALSSPNAFPKIKGFGPFATANILMCLGFYHRLPIDTETIRHLKQVHGREFCTNKTVGEDVKQVYDK

Query:  YAPFQCLAYWFELVEYYESKFGKLSELCSLDYHKISGTT
        YAPFQCLAYW ELVEYYES+FGKLSEL   DY KISGTT
Subjt:  YAPFQCLAYWFELVEYYESKFGKLSELCSLDYHKISGTT

XP_022951918.1 uncharacterized protein LOC111454659 [Cucurbita moschata]3.9e-13975.76Show/hide
Query:  LNLNLGVSRSSVFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSKVSVLVSINQSSSFLLTIHIHSSLPLSALDKQAILDQVARMLRISEKDEDDL
        + L LGV  S  F+LEKAVCNHG FMM PN+WIPSSKTLQRPLRLSNS  S+LVSINQSSS LLT+ IHS   L   D+ AILDQVARMLR++EKDED++
Subjt:  LNLNLGVSRSSVFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSKVSVLVSINQSSSFLLTIHIHSSLPLSALDKQAILDQVARMLRISEKDEDDL

Query:  TKFQNLHPRAKEIGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMARELCELQAKMMRESKKRKRKGEIREYEGGNFPNATEVCRMGVEVLKKHLLGYRA
         +FQNLHP AK+IGFGRIFRSP+LFED VKSIL+CNTSWRRTL MA +LCE+QAK MRESKKRKRKG     E GNFPNA EVCRMGVE LK H LGYRA
Subjt:  TKFQNLHPRAKEIGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMARELCELQAKMMRESKKRKRKGEIREYEGGNFPNATEVCRMGVEVLKKHLLGYRA

Query:  GYIIKFAQNVQNGRIDLQKFEKALSSPNAFPKIKGFGPFATANILMCLGFYHRLPIDTETIRHLKQVHGREFCTNKTVGEDVKQVYDKYAPFQCLAYWFE
         Y++KFAQ+V++GRI+LQ  EK +SSP+AFPKIKGFGPFATANI MCLGFYH+LPIDTETIRHLKQVHG ++CT KTVGEDVKQ+YD YAP+QCLAYW E
Subjt:  GYIIKFAQNVQNGRIDLQKFEKALSSPNAFPKIKGFGPFATANILMCLGFYHRLPIDTETIRHLKQVHGREFCTNKTVGEDVKQVYDKYAPFQCLAYWFE

Query:  LVEYYESKFGKLSELCSLDYHKISGTTVNL
        LV+YYE+KFGKLSEL S DYHKISG+T++L
Subjt:  LVEYYESKFGKLSELCSLDYHKISGTTVNL

XP_038877617.1 uncharacterized protein LOC120069874 [Benincasa hispida]3.0e-12376.49Show/hide
Query:  LNLNLGVSRSSVFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSKVSVLVSINQSSSFLLTIHIH-SSLPLSALDKQAILDQVARMLRISEKDEDD
        ++LNLGVS S  FDLEKAVCNHG FMMPPN+WIPSSKTLQRPLRLS+S  SV VSINQ SS LLTI IH SS PLS  D+QAILDQV RMLR++EKDED+
Subjt:  LNLNLGVSRSSVFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSKVSVLVSINQSSSFLLTIHIH-SSLPLSALDKQAILDQVARMLRISEKDEDD

Query:  LTKFQNLHPRAKEIGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMARELCELQAKMMRE-SKKRKRKGEIREYEGGNFPNATEVCRMGVEVLKKHLLGY
        L KFQ+LHPRAK++GFGR+FRSPTLFEDA+KSILLCNT+W+RTLAMA +LCELQAKM R+ ++KRKRK   +E E GNFPNA EVCRMGVE+LKKH LGY
Subjt:  LTKFQNLHPRAKEIGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMARELCELQAKMMRE-SKKRKRKGEIREYEGGNFPNATEVCRMGVEVLKKHLLGY

Query:  RAGYIIKFAQNVQNGRIDLQKFEKALSSPNAFPKIKGFGPFATANILMCLGFYHRLPIDTETIRHLKQVHGREFCTNKTVGEDVKQVYDKYAPFQCLAYW
        RA YII FA+ VQ+G+IDLQ       +PN FPKIKGFGPFATAN+LMCLG Y +LPIDTETIRHLKQVHGR+FC NKTV EDVKQ+YDKYAPFQCLAYW
Subjt:  RAGYIIKFAQNVQNGRIDLQKFEKALSSPNAFPKIKGFGPFATANILMCLGFYHRLPIDTETIRHLKQVHGREFCTNKTVGEDVKQVYDKYAPFQCLAYW

Query:  FE
         E
Subjt:  FE

TrEMBL top hitse value%identityAlignment
A0A438CJ05 Uncharacterized protein3.7e-8751.87Show/hide
Query:  KNNPLNLNLNLGVSRSSVFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSKVSVLVSINQ-SSSFLLTIHIHSSLPLSALDKQAILDQVARMLRIS
        K   + L++ LG   SS F+LE AVCNHG FMM PN WIPS+KTLQRPLRL++   S+L SI+   +   + + +H +  +S  D++ IL  VARMLRIS
Subjt:  KNNPLNLNLNLGVSRSSVFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSKVSVLVSINQ-SSSFLLTIHIHSSLPLSALDKQAILDQVARMLRIS

Query:  EKDEDDLTKFQNLHPRAKEIGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMARELCELQAKMMRESKKR------KRKGEIREYEG-GNFPNATEVCRM
        ++DE D+ +F  + P AK   FGRIFRSP++FED VKSILLCN  WRRTL MA+ LCELQ ++    +KR      K K    E +  GNFPN+ E+  +
Subjt:  EKDEDDLTKFQNLHPRAKEIGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMARELCELQAKMMRESKKR------KRKGEIREYEG-GNFPNATEVCRM

Query:  GVEVLKKHL-LGYRAGYIIKFAQNVQNGRIDLQKFEKALSSP------NAFPKIKGFGPFATANILMCLGFYHRLPIDTETIRHLKQVHGREFCTNKTVG
          E LKK   LGYRA  I++ A +++NG + LQ FEKAL +       +   K KGFGPFA ANILMC+G+Y R+P D+ET RH+K++HGR     K   
Subjt:  GVEVLKKHL-LGYRAGYIIKFAQNVQNGRIDLQKFEKALSSP------NAFPKIKGFGPFATANILMCLGFYHRLPIDTETIRHLKQVHGREFCTNKTVG

Query:  EDVKQVYDKYAPFQCLAYWFELVEYYESKFGKLSELCSLDYHKISGT
        +DVK++YDKYAPFQCLAYW EL EYY+S+FGKLSEL   +YH I+G+
Subjt:  EDVKQVYDKYAPFQCLAYWFELVEYYESKFGKLSELCSLDYHKISGT

A0A6A1W9S6 Uncharacterized protein3.6e-9051.93Show/hide
Query:  FDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSKVSVLVSINQSSS---FLLTIHIHSSLPLSALDKQAILDQVARMLRISEKDEDDLTKFQNLHPR
        F++EKAVCNHG FMM PN WIPS+KTLQRPLRL+NS VSVLVSI+  +S     + I +H +  +S  D++AIL+QVARMLRISE+DE +L +FQNLHP 
Subjt:  FDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSKVSVLVSINQSSS---FLLTIHIHSSLPLSALDKQAILDQVARMLRISEKDEDDLTKFQNLHPR

Query:  AKEIGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMARELCELQAKM-------------MRESKKR--KRKGEIREYEG--------------------
        AKE GFGR FRSP+LFEDA+KS+LLCN +W RTL MA+ LCELQ ++              + S+KR  KRK   R+                       
Subjt:  AKEIGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMARELCELQAKM-------------MRESKKR--KRKGEIREYEG--------------------

Query:  ----GNFPNATEVCRMGVEVLKKHL-LGYRAGYIIKFAQNVQNGRIDLQKFEKALSSP-----NAFPKIKGFGPFATANILMCLGFYHRLPIDTETIRHL
            GNFP++ EV  +    L+ H  LGYRA YI+K A+ V++G++ L++F+   S+          KIKGFGPFA AN++MC+G+Y  +P+DTET+RHL
Subjt:  ----GNFPNATEVCRMGVEVLKKHL-LGYRAGYIIKFAQNVQNGRIDLQKFEKALSSP-----NAFPKIKGFGPFATANILMCLGFYHRLPIDTETIRHL

Query:  KQVHGREFCTNKTVGEDVKQVYDKYAPFQCLAYWFELVEYYESKFGKLSELCSLDYHKISGT
        +QVHGR+    +TV EDVK VYDK+APFQ LAYWFEL+E+YE KFGKLSEL +  Y  +SG+
Subjt:  KQVHGREFCTNKTVGEDVKQVYDKYAPFQCLAYWFELVEYYESKFGKLSELCSLDYHKISGT

A0A6J1DS88 uncharacterized protein LOC1110238221.5e-12871.68Show/hide
Query:  MKNNPLNLNLNLGVSRSSVFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSKVSVLVSINQSSSFLLTIHIHSSLPLSALDKQAILDQVARMLRIS
        M+ +   ++LNLG + S  FDLE+AVCNHG FMMPPNKWIPSSKTLQRPLRL++S  SVLVSI+Q SS LL I IHSS   S LD+QAILDQV RMLRI+
Subjt:  MKNNPLNLNLNLGVSRSSVFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSKVSVLVSINQSSSFLLTIHIHSSLPLSALDKQAILDQVARMLRIS

Query:  EKDEDDLTKFQNLHPRAKEIGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMARELCELQAKMMR----ESKKRKRKGEIR-EYEGGNFPNATEVCRMGV
        E+DE+++  FQNLH +AKEIGFGR+FRSPTLFEDAVKSILLCN +WRRTLAMA +LCELQAK+ R    + KKRKRKG+   E EGGNFP A E+CRM V
Subjt:  EKDEDDLTKFQNLHPRAKEIGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMARELCELQAKMMR----ESKKRKRKGEIR-EYEGGNFPNATEVCRMGV

Query:  EVLKKHLLGYRAGYIIKFAQNVQNGRIDLQKFEKALSSPNAFPKIKGFGPFATANILMCLGFYHRLPIDTETIRHLKQVHGREFCTNKTVGEDVKQVYDK
         +L+KH +GYRA YII  AQ VQNG+IDLQK E+ALS    FPKIKGFGPF TAN+ MCLG Y RLPIDTETIRHLKQVHGR+ C  KT  E VK VYDK
Subjt:  EVLKKHLLGYRAGYIIKFAQNVQNGRIDLQKFEKALSSPNAFPKIKGFGPFATANILMCLGFYHRLPIDTETIRHLKQVHGREFCTNKTVGEDVKQVYDK

Query:  YAPFQCLAYWFELVEYYESKFGKLSELCSLDYHKISGTT
        YAPFQCLAYW ELVEYYES+FGKLSEL   DY KISGTT
Subjt:  YAPFQCLAYWFELVEYYESKFGKLSELCSLDYHKISGTT

A0A6J1GJ25 uncharacterized protein LOC1114546591.9e-13975.76Show/hide
Query:  LNLNLGVSRSSVFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSKVSVLVSINQSSSFLLTIHIHSSLPLSALDKQAILDQVARMLRISEKDEDDL
        + L LGV  S  F+LEKAVCNHG FMM PN+WIPSSKTLQRPLRLSNS  S+LVSINQSSS LLT+ IHS   L   D+ AILDQVARMLR++EKDED++
Subjt:  LNLNLGVSRSSVFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSKVSVLVSINQSSSFLLTIHIHSSLPLSALDKQAILDQVARMLRISEKDEDDL

Query:  TKFQNLHPRAKEIGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMARELCELQAKMMRESKKRKRKGEIREYEGGNFPNATEVCRMGVEVLKKHLLGYRA
         +FQNLHP AK+IGFGRIFRSP+LFED VKSIL+CNTSWRRTL MA +LCE+QAK MRESKKRKRKG     E GNFPNA EVCRMGVE LK H LGYRA
Subjt:  TKFQNLHPRAKEIGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMARELCELQAKMMRESKKRKRKGEIREYEGGNFPNATEVCRMGVEVLKKHLLGYRA

Query:  GYIIKFAQNVQNGRIDLQKFEKALSSPNAFPKIKGFGPFATANILMCLGFYHRLPIDTETIRHLKQVHGREFCTNKTVGEDVKQVYDKYAPFQCLAYWFE
         Y++KFAQ+V++GRI+LQ  EK +SSP+AFPKIKGFGPFATANI MCLGFYH+LPIDTETIRHLKQVHG ++CT KTVGEDVKQ+YD YAP+QCLAYW E
Subjt:  GYIIKFAQNVQNGRIDLQKFEKALSSPNAFPKIKGFGPFATANILMCLGFYHRLPIDTETIRHLKQVHGREFCTNKTVGEDVKQVYDKYAPFQCLAYWFE

Query:  LVEYYESKFGKLSELCSLDYHKISGTTVNL
        LV+YYE+KFGKLSEL S DYHKISG+T++L
Subjt:  LVEYYESKFGKLSELCSLDYHKISGTTVNL

A0A6P4BPN5 uncharacterized protein LOC1074341919.8e-8851.2Show/hide
Query:  SVFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSKVSVLVSINQ---SSSFLLTIHIHSSLPLSALDKQAILDQVARMLRISEKDEDDLTKFQNLH
        S F+LEKAVCNHG FMM PN WIPS+KTLQRPLRLS+   S +VSI+     S  LL I +HS  P S+ D+ AIL QV RMLRISE+DE D+ +FQ   
Subjt:  SVFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSKVSVLVSINQ---SSSFLLTIHIHSSLPLSALDKQAILDQVARMLRISEKDEDDLTKFQNLH

Query:  PRAKEIGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMARELCELQAKM--MRESKKRKRKGEI-----REYEGGNFPNATEVCRMGVEVLKKH--LLGY
        P+AK  GFGR+FRSP++FEDAVKSILLCN +W ++L MA+ LCELQ ++   R+ K ++++G+       E   GNFP + E+  +    L++   +LGY
Subjt:  PRAKEIGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMARELCELQAKM--MRESKKRKRKGEI-----REYEGGNFPNATEVCRMGVEVLKKH--LLGY

Query:  RAGYIIKFAQNVQNGRIDLQKFEKALS-SPNAFPKI-------KGFGPFATANILMCLGFYHRLPIDTETIRHLKQVHGREFCTNKTVGEDVKQVYDKYA
        RA YI++ A+NV++GR+ L++ E+ ++  P  + ++        GFGP+  AN+ MC+G Y  +P+DTETIRH++QVHGR+ C  KTV + V+++YDK+A
Subjt:  RAGYIIKFAQNVQNGRIDLQKFEKALS-SPNAFPKI-------KGFGPFATANILMCLGFYHRLPIDTETIRHLKQVHGREFCTNKTVGEDVKQVYDKYA

Query:  PFQCLAYWFELVEYYESKFGKLSELCSLDYHKIS
        PFQCLAYW EL++ YE KFGKLSEL    Y  +S
Subjt:  PFQCLAYWFELVEYYESKFGKLSELCSLDYHKIS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAACAATCCTTTGAATTTGAATTTGAATTTGGGAGTTTCAAGGAGTAGTGTTTTCGATCTTGAGAAAGCAGTTTGTAATCATGGAGTGTTTATGATGCCACCAAA
CAAATGGATTCCTTCTTCTAAAACTCTCCAACGTCCACTTCGACTCTCAAATTCTAAAGTTTCTGTTTTGGTCTCTATAAACCAATCTTCTTCTTTTCTCCTCACCATTC
ATATTCACTCTTCTCTCCCTCTCTCTGCTTTAGATAAACAAGCTATATTGGATCAAGTCGCTCGAATGTTGAGAATTTCAGAAAAAGATGAAGATGACCTTACAAAATTT
CAAAATTTGCATCCAAGAGCCAAAGAGATTGGATTTGGTCGCATTTTTCGATCTCCAACTCTTTTTGAAGATGCGGTGAAGTCCATCCTTTTATGCAATACCTCATGGAG
AAGGACGCTGGCAATGGCTAGAGAACTATGTGAACTACAAGCGAAAATGATGAGAGAAAGTAAGAAGAGAAAAAGAAAAGGAGAAATTAGGGAGTATGAGGGAGGCAATT
TTCCAAATGCAACAGAAGTTTGTAGAATGGGCGTTGAAGTCTTGAAGAAACATTTGCTTGGTTATAGAGCCGGTTACATCATTAAATTTGCTCAAAATGTTCAAAACGGG
AGAATTGATCTCCAAAAATTTGAAAAAGCACTTTCCTCTCCTAATGCTTTCCCTAAAATCAAAGGCTTTGGTCCTTTTGCAACTGCCAATATACTCATGTGCCTCGGATT
TTACCATCGACTTCCTATTGATACTGAAACTATAAGGCATTTAAAACAGGTACATGGAAGAGAATTTTGCACCAACAAGACTGTTGGGGAAGATGTCAAACAAGTTTACG
ACAAGTATGCTCCTTTCCAATGCTTGGCCTATTGGTTCGAGCTTGTCGAGTATTATGAGAGCAAATTCGGGAAGCTAAGTGAATTGTGTTCCCTAGATTATCACAAAATT
AGTGGCACTACCGTCAACCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAAACAATCCTTTGAATTTGAATTTGAATTTGGGAGTTTCAAGGAGTAGTGTTTTCGATCTTGAGAAAGCAGTTTGTAATCATGGAGTGTTTATGATGCCACCAAA
CAAATGGATTCCTTCTTCTAAAACTCTCCAACGTCCACTTCGACTCTCAAATTCTAAAGTTTCTGTTTTGGTCTCTATAAACCAATCTTCTTCTTTTCTCCTCACCATTC
ATATTCACTCTTCTCTCCCTCTCTCTGCTTTAGATAAACAAGCTATATTGGATCAAGTCGCTCGAATGTTGAGAATTTCAGAAAAAGATGAAGATGACCTTACAAAATTT
CAAAATTTGCATCCAAGAGCCAAAGAGATTGGATTTGGTCGCATTTTTCGATCTCCAACTCTTTTTGAAGATGCGGTGAAGTCCATCCTTTTATGCAATACCTCATGGAG
AAGGACGCTGGCAATGGCTAGAGAACTATGTGAACTACAAGCGAAAATGATGAGAGAAAGTAAGAAGAGAAAAAGAAAAGGAGAAATTAGGGAGTATGAGGGAGGCAATT
TTCCAAATGCAACAGAAGTTTGTAGAATGGGCGTTGAAGTCTTGAAGAAACATTTGCTTGGTTATAGAGCCGGTTACATCATTAAATTTGCTCAAAATGTTCAAAACGGG
AGAATTGATCTCCAAAAATTTGAAAAAGCACTTTCCTCTCCTAATGCTTTCCCTAAAATCAAAGGCTTTGGTCCTTTTGCAACTGCCAATATACTCATGTGCCTCGGATT
TTACCATCGACTTCCTATTGATACTGAAACTATAAGGCATTTAAAACAGGTACATGGAAGAGAATTTTGCACCAACAAGACTGTTGGGGAAGATGTCAAACAAGTTTACG
ACAAGTATGCTCCTTTCCAATGCTTGGCCTATTGGTTCGAGCTTGTCGAGTATTATGAGAGCAAATTCGGGAAGCTAAGTGAATTGTGTTCCCTAGATTATCACAAAATT
AGTGGCACTACCGTCAACCTTTGA
Protein sequenceShow/hide protein sequence
MKNNPLNLNLNLGVSRSSVFDLEKAVCNHGVFMMPPNKWIPSSKTLQRPLRLSNSKVSVLVSINQSSSFLLTIHIHSSLPLSALDKQAILDQVARMLRISEKDEDDLTKF
QNLHPRAKEIGFGRIFRSPTLFEDAVKSILLCNTSWRRTLAMARELCELQAKMMRESKKRKRKGEIREYEGGNFPNATEVCRMGVEVLKKHLLGYRAGYIIKFAQNVQNG
RIDLQKFEKALSSPNAFPKIKGFGPFATANILMCLGFYHRLPIDTETIRHLKQVHGREFCTNKTVGEDVKQVYDKYAPFQCLAYWFELVEYYESKFGKLSELCSLDYHKI
SGTTVNL