; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0000115 (gene) of Snake gourd v1 genome

Gene IDTan0000115
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCACTA en-spm transposon protein
Genome locationLG08:14433646..14435137
RNA-Seq ExpressionTan0000115
SyntenyTan0000115
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022156286.1 uncharacterized protein LOC111023212 [Momordica charantia]6.7e-5444.16Show/hide
Query:  IPLHYKTWNETYFIVDLTKHHINRYVERLISTTFKEYMAELNQYYLEFDDPKEACEYPLDRIDNPADWNMLCDRWEIAEWKVYFYVVLYYIEKLKKKVEP
        + L     N+ +F VDL+K  +N+++E+ +  +FK+Y ++L+QYY EF+DP EA   P +R+ NP DWN LCDRWE  EWK         I K  KK   
Subjt:  IPLHYKTWNETYFIVDLTKHHINRYVERLISTTFKEYMAELNQYYLEFDDPKEACEYPLDRIDNPADWNMLCDRWEIAEWKVYFYVVLYYIEKLKKKVEP

Query:  ISLTTIELGPSHLFKCRTNWLDTNSCKIQKGHDIGEVDLFDESHFNIKDEWVNDHARDAYLKMQQLLAALSQEGSEPISQSEVCKMVLGTRSDHIKCLGW
              +L  +H    ++     +  KI++G DIG VDLF ESH+N KD  VND+A DAY  MQ L+ A +QEG EP++Q E C+ VLG R DH+K LG+
Subjt:  ISLTTIELGPSHLFKCRTNWLDTNSCKIQKGHDIGEVDLFDESHFNIKDEWVNDHARDAYLKMQQLLAALSQEGSEPISQSEVCKMVLGTRSDHIKCLGW

Query:  DPKPS-----SSSSVTSSFQHEKKLEKKVEQMQAEIGTLTTK-------LSSWEERWVEFTKYMDERQGEGSSN
         P+P+     SSS+VTSS  +EK+LEKKVE M+ E+  + T+       +S+WE+RW E +++M  RQG+G SN
Subjt:  DPKPS-----SSSSVTSSFQHEKKLEKKVEQMQAEIGTLTTK-------LSSWEERWVEFTKYMDERQGEGSSN

XP_038887408.1 poly [ADP-ribose] polymerase 1-like isoform X1 [Benincasa hispida]2.6e-6638.52Show/hide
Query:  MVHPSQDHDVAAVLEEENAP--------EVDLASRSRG----------RSARGHSRRIELERYVNEHGRIPIEIDEKVGKPMCTKGTTLSGAIGTITRDT
        + H  Q+ +   +L+ ++ P        +  +AS SR           R  RGHSR +EL+R+VN HGRI IEIDE+VGKP+C   T  S AIGTI R+T
Subjt:  MVHPSQDHDVAAVLEEENAP--------EVDLASRSRG----------RSARGHSRRIELERYVNEHGRIPIEIDEKVGKPMCTKGTTLSGAIGTITRDT

Query:  IPLHYKTWN-----------------------------------------ETYFIVDLTKHHINRYVERLISTTFKEYMAELNQYYLEFDDPKEACEYPL
        IPL  K W+                                         ++YF  D+ K H+ +YV + +  TFKEY ++L ++Y  F DPKEA   P 
Subjt:  IPLHYKTWN-----------------------------------------ETYFIVDLTKHHINRYVERLISTTFKEYMAELNQYYLEFDDPKEACEYPL

Query:  DRIDNPADWNMLCDRWEIAEWKVYFYVVLYYIEKLKKKVEPISLTTIELGPSHLFKCRTNWLDTNSCKIQKGHDIGEVDLFDESHFNIKDEWVNDHARDA
         RI +  DWN+LC+RWE  EW              KKK E    +  ++   H    ++     +  KI++G D+ +VDLF +SHF  KD WVN++A+DA
Subjt:  DRIDNPADWNMLCDRWEIAEWKVYFYVVLYYIEKLKKKVEPISLTTIELGPSHLFKCRTNWLDTNSCKIQKGHDIGEVDLFDESHFNIKDEWVNDHARDA

Query:  YLKMQQLLAALSQEGSEPISQSEVCKMVLGTRSDHIKCLGWDPKPSSSSSVTSSFQHEKKLEKKVEQMQAEI--------------GTLTTKLSSWEERW
        YL+MQ+L+ A  QE   P+S  EVCK VLG RS +IK LG +PKPSSSSSVTS  Q +K+LEKK+E+M+ E+                LT++LS WE RW
Subjt:  YLKMQQLLAALSQEGSEPISQSEVCKMVLGTRSDHIKCLGWDPKPSSSSSVTSSFQHEKKLEKKVEQMQAEI--------------GTLTTKLSSWEERW

Query:  VEFTKYMDERQG-EGSSN
         E    +   QG +G SN
Subjt:  VEFTKYMDERQG-EGSSN

XP_038887409.1 poly [ADP-ribose] polymerase 1-like isoform X2 [Benincasa hispida]1.9e-6941.18Show/hide
Query:  MVHPSQDHDVAAVLEEENAP--------EVDLASRSRG----------RSARGHSRRIELERYVNEHGRIPIEIDEKVGKPMCTKGTTLSGAIGTITRDT
        + H  Q+ +   +L+ ++ P        +  +AS SR           R  RGHSR +EL+R+VN HGRI IEIDE+VGKP+C   T  S AIGTI R+T
Subjt:  MVHPSQDHDVAAVLEEENAP--------EVDLASRSRG----------RSARGHSRRIELERYVNEHGRIPIEIDEKVGKPMCTKGTTLSGAIGTITRDT

Query:  IPLHYKTWNE--------------TYFIVDLTKHHINRYVERLISTTFKEYMAELNQYYLEFDDPKEACEYPLDRIDNPADWNMLCDRWEIAEWKVYFYV
        IPL  K W++              +YF  D+ K H+ +YV + +  TFKEY ++L ++Y  F DPKEA   P  RI +  DWN+LC+RWE  EW      
Subjt:  IPLHYKTWNE--------------TYFIVDLTKHHINRYVERLISTTFKEYMAELNQYYLEFDDPKEACEYPLDRIDNPADWNMLCDRWEIAEWKVYFYV

Query:  VLYYIEKLKKKVEPISLTTIELGPSHLFKCRTNWLDTNSCKIQKGHDIGEVDLFDESHFNIKDEWVNDHARDAYLKMQQLLAALSQEGSEPISQSEVCKM
                KKK E    +  ++   H    ++     +  KI++G D+ +VDLF +SHF  KD WVN++A+DAYL+MQ+L+ A  QE   P+S  EVCK 
Subjt:  VLYYIEKLKKKVEPISLTTIELGPSHLFKCRTNWLDTNSCKIQKGHDIGEVDLFDESHFNIKDEWVNDHARDAYLKMQQLLAALSQEGSEPISQSEVCKM

Query:  VLGTRSDHIKCLGWDPKPSSSSSVTSSFQHEKKLEKKVEQMQAEI--------------GTLTTKLSSWEERWVEFTKYMDERQG-EGSSN
        VLG RS +IK LG +PKPSSSSSVTS  Q +K+LEKK+E+M+ E+                LT++LS WE RW E    +   QG +G SN
Subjt:  VLGTRSDHIKCLGWDPKPSSSSSVTSSFQHEKKLEKKVEQMQAEI--------------GTLTTKLSSWEERWVEFTKYMDERQG-EGSSN

XP_038887410.1 poly [ADP-ribose] polymerase 1-like isoform X3 [Benincasa hispida]6.5e-4933.25Show/hide
Query:  MVHPSQDHDVAAVLEEENAP--------EVDLASRSRG----------RSARGHSRRIELERYVNEHGRIPIEIDEKVGKPMCTKGTTLSGAIGTITRDT
        + H  Q+ +   +L+ ++ P        +  +AS SR           R  RGHSR +EL+R+VN HGRI IEIDE+VGKP+C   T  S AIGTI R+T
Subjt:  MVHPSQDHDVAAVLEEENAP--------EVDLASRSRG----------RSARGHSRRIELERYVNEHGRIPIEIDEKVGKPMCTKGTTLSGAIGTITRDT

Query:  IPLHYKTWN-----------------------------------------ETYFIVDLTKHHINRYVERLISTTFKEYMAELNQYYLEFDDPKEACEYPL
        IPL  K W+                                         ++YF  D+ K H+ +YV + +  TFKEY ++L ++Y  F DPKEA   P 
Subjt:  IPLHYKTWN-----------------------------------------ETYFIVDLTKHHINRYVERLISTTFKEYMAELNQYYLEFDDPKEACEYPL

Query:  DRIDNPADWNMLCDRWEIAEWKVYFYVVLYYIEKLKKKVEPISLTTIELGPSHLFKCRTNWLDTNSCKIQKGHDIGEVDLFDESHFNIKDEWVNDHARDA
         RI +  DWN+LC+RWE  EW              KKK E    +  ++   H    ++     +  KI++G D+ +VDLF +SHF  KD WVN++A+DA
Subjt:  DRIDNPADWNMLCDRWEIAEWKVYFYVVLYYIEKLKKKVEPISLTTIELGPSHLFKCRTNWLDTNSCKIQKGHDIGEVDLFDESHFNIKDEWVNDHARDA

Query:  YLKMQQLLAALSQEGSEPISQSEVCKMVLGTRSDHIKCLGWDPKPSSSSSVTSSFQHEKKLEKKVEQMQAEI--------------GTLTTKLSSWEERW
        YL+MQ+L+ A  QE                           DP P       SS + +K+LEKK+E+M+ E+                LT++LS WE RW
Subjt:  YLKMQQLLAALSQEGSEPISQSEVCKMVLGTRSDHIKCLGWDPKPSSSSSVTSSFQHEKKLEKKVEQMQAEI--------------GTLTTKLSSWEERW

Query:  VEFTKYMDERQG-EGSSN
         E    +   QG +G SN
Subjt:  VEFTKYMDERQG-EGSSN

XP_038887413.1 uncharacterized protein LOC120077557 isoform X5 [Benincasa hispida]2.6e-6638.52Show/hide
Query:  MVHPSQDHDVAAVLEEENAP--------EVDLASRSRG----------RSARGHSRRIELERYVNEHGRIPIEIDEKVGKPMCTKGTTLSGAIGTITRDT
        + H  Q+ +   +L+ ++ P        +  +AS SR           R  RGHSR +EL+R+VN HGRI IEIDE+VGKP+C   T  S AIGTI R+T
Subjt:  MVHPSQDHDVAAVLEEENAP--------EVDLASRSRG----------RSARGHSRRIELERYVNEHGRIPIEIDEKVGKPMCTKGTTLSGAIGTITRDT

Query:  IPLHYKTWN-----------------------------------------ETYFIVDLTKHHINRYVERLISTTFKEYMAELNQYYLEFDDPKEACEYPL
        IPL  K W+                                         ++YF  D+ K H+ +YV + +  TFKEY ++L ++Y  F DPKEA   P 
Subjt:  IPLHYKTWN-----------------------------------------ETYFIVDLTKHHINRYVERLISTTFKEYMAELNQYYLEFDDPKEACEYPL

Query:  DRIDNPADWNMLCDRWEIAEWKVYFYVVLYYIEKLKKKVEPISLTTIELGPSHLFKCRTNWLDTNSCKIQKGHDIGEVDLFDESHFNIKDEWVNDHARDA
         RI +  DWN+LC+RWE  EW              KKK E    +  ++   H    ++     +  KI++G D+ +VDLF +SHF  KD WVN++A+DA
Subjt:  DRIDNPADWNMLCDRWEIAEWKVYFYVVLYYIEKLKKKVEPISLTTIELGPSHLFKCRTNWLDTNSCKIQKGHDIGEVDLFDESHFNIKDEWVNDHARDA

Query:  YLKMQQLLAALSQEGSEPISQSEVCKMVLGTRSDHIKCLGWDPKPSSSSSVTSSFQHEKKLEKKVEQMQAEI--------------GTLTTKLSSWEERW
        YL+MQ+L+ A  QE   P+S  EVCK VLG RS +IK LG +PKPSSSSSVTS  Q +K+LEKK+E+M+ E+                LT++LS WE RW
Subjt:  YLKMQQLLAALSQEGSEPISQSEVCKMVLGTRSDHIKCLGWDPKPSSSSSVTSSFQHEKKLEKKVEQMQAEI--------------GTLTTKLSSWEERW

Query:  VEFTKYMDERQG-EGSSN
         E    +   QG +G SN
Subjt:  VEFTKYMDERQG-EGSSN

TrEMBL top hitse value%identityAlignment
A0A5A7SPZ3 Transposase5.9e-4031.81Show/hide
Query:  DHDVAAVLEEENAPEVDLASRS--RGRSARGHSRRIELERYVNEHGRIPIEIDEKVGKPMCTKGTTLSGAIGTITRDTIPLHYKTW--------------
        DH      + +  PE    SR+  R R  RG+ R IEL+++V +HG++ IEI+E+ GKP+ T    ++  IGT  R+TI L  + W              
Subjt:  DHDVAAVLEEENAPEVDLASRS--RGRSARGHSRRIELERYVNEHGRIPIEIDEKVGKPMCTKGTTLSGAIGTITRDTIPLHYKTW--------------

Query:  NETYFIVDLTKHHINRYVERLISTTFKEYMAELNQYYLEFDDPKEACEYPLDRIDNPADWNMLCDRWEIAEWKVYFYVVLYYIEKLKKKVEPISLTTIEL
        +ET+F  D T   + +Y++  +   F+E+ A L++YY +FDD  EA   P D+I +  DWNM+CDRWE   W              KKK E    +   +
Subjt:  NETYFIVDLTKHHINRYVERLISTTFKEYMAELNQYYLEFDDPKEACEYPLDRIDNPADWNMLCDRWEIAEWKVYFYVVLYYIEKLKKKVEPISLTTIEL

Query:  GPSHLFKCRTNWLDTNSCKIQKGHDIGEVDLFDESHFNIKDEWVNDHARDAYLKMQQLLAALSQEGSEPISQSEVCKMVLGTRSDHIKCLGWDPKPSSSS
          +H+   ++     +  + +KG D+ EV++F E+HF  K+ W+ND A+DAY    +++A  ++ G + IS ++ CK+VLG+ S  ++ +      S  S
Subjt:  GPSHLFKCRTNWLDTNSCKIQKGHDIGEVDLFDESHFNIKDEWVNDHARDAYLKMQQLLAALSQEGSEPISQSEVCKMVLGTRSDHIKCLGWDPKPSSSS

Query:  SVTSSFQHEKKLEKKVEQMQAEIGTLTTKLSSWEERWVEFTKYMDERQG
        +V+S+ + EK     ++++  +   LT +L+ WE+RW +  K +  R G
Subjt:  SVTSSFQHEKKLEKKVEQMQAEIGTLTTKLSSWEERWVEFTKYMDERQG

A0A5A7TFG0 Transposon protein, putative, CACTA, En/Spm sub-class1.0e-3931.75Show/hide
Query:  LEEENAPEVD--------LASRSRGRSARGHSRRIELERYVNEHGRIPIEIDEKVGKPMCTKGTTLSGAIGTITRDTIPLHYKTWN--------------
        +E+ N P VD        L ++ RGR  RG+ R IEL+++V +HG+I IEI+E+ GKP+ T    ++  IGT  R+TIPL  + W               
Subjt:  LEEENAPEVD--------LASRSRGRSARGHSRRIELERYVNEHGRIPIEIDEKVGKPMCTKGTTLSGAIGTITRDTIPLHYKTWN--------------

Query:  ETYFIVDLTKHHINRYVERLISTTFKEYMAELNQYYLEFDDPKEACEYPLDRIDNPADWNMLCDRWEIAEWKVYFYVVLYYIEKLKKKVEPISLTTIELG
        ET+F  D T   + +Y+E  +   F+E+ A+L++YY +FDD  EA   P +RI    DWNM+CDRWE   WK                            
Subjt:  ETYFIVDLTKHHINRYVERLISTTFKEYMAELNQYYLEFDDPKEACEYPLDRIDNPADWNMLCDRWEIAEWKVYFYVVLYYIEKLKKKVEPISLTTIELG

Query:  PSHLFKCRTNWLDTNSCKIQKGHDIGEVDLFDESHFNIKDEWVNDHARDAYLKMQQLLAALSQEGSEPISQSEVCKMVLGTRSDHIKCLGWDPKPSSSSS
                           +KG D+ E+++F E+HF  K+ W+ND A+DAYL+MQ+++   ++ G + IS ++ C+ VLG+RS           P S  S
Subjt:  PSHLFKCRTNWLDTNSCKIQKGHDIGEVDLFDESHFNIKDEWVNDHARDAYLKMQQLLAALSQEGSEPISQSEVCKMVLGTRSDHIKCLGWDPKPSSSSS

Query:  VTSSFQHEKKLEK-KVEQMQAEIGTLTTKLSSWEERW
        + S+    ++ EK ++  ++     LT +L+ WE+ +
Subjt:  VTSSFQHEKKLEK-KVEQMQAEIGTLTTKLSSWEERW

A0A5A7TRX4 DUF4216 domain-containing protein2.2e-4233.96Show/hide
Query:  ASRSRGRSARGHSRRIELERYVNEHGRIPIEIDEKVGKPMCTKGTTLSGAIGTITRDTIPLHYKTWN--------------ETYFIVDLTKHHINRYVER
        +S S GR  RG+ R IEL+++V +HG+I IEI+E+ GKP+ T    ++  IGT  R+TIPL  + W               ET+F  D T   + +Y+E 
Subjt:  ASRSRGRSARGHSRRIELERYVNEHGRIPIEIDEKVGKPMCTKGTTLSGAIGTITRDTIPLHYKTWN--------------ETYFIVDLTKHHINRYVER

Query:  LISTTFKEYMAELNQYYLEFDDPKEACEYPLDRIDNPADWNMLCDRWEIAEWKVYFYVVLYYIEKLKKKVEPISLTTIELGPSHLFKCRTNWLDTNSCKI
         +  TF+E+ A+L++YY +FDD  EA   P +RI +  DWNM+CDRWE   W              KKK E    +   +  +H+   ++     +  K 
Subjt:  LISTTFKEYMAELNQYYLEFDDPKEACEYPLDRIDNPADWNMLCDRWEIAEWKVYFYVVLYYIEKLKKKVEPISLTTIELGPSHLFKCRTNWLDTNSCKI

Query:  QKGHDIGEVDLFDESHFNIKDEWVNDHARDAYLKMQQLLAALSQEGSEPISQSEVCKMVLGTRSDHIKCLGWDPKPSSSSSVTSSFQHEKKLEK-KVEQM
        +KG+D+ E+++F E+HF  K+ W ND A+DAYL+MQ+++   ++ G + IS ++ C+ VLG+RS           P S  S+ S+    ++ EK ++  +
Subjt:  QKGHDIGEVDLFDESHFNIKDEWVNDHARDAYLKMQQLLAALSQEGSEPISQSEVCKMVLGTRSDHIKCLGWDPKPSSSSSVTSSFQHEKKLEK-KVEQM

Query:  QAEIGTLTTKLSSWEERW
        +     LT +L+ WE+ +
Subjt:  QAEIGTLTTKLSSWEERW

A0A5A7US78 Uncharacterized protein6.4e-4232.94Show/hide
Query:  LEEENAPEVD--------LASRSRGRSARGHSRRIELERYVNEHGRIPIEIDEKVGKPMCTKGTTLSGAIGTITRDTIPLHYKTWN--------------
        +E+ N P +D        L ++ RGR  RG+ R IEL+++V +HG+I IEI+E+ GKP+ T G  ++  IGT  R+TIPL  + W               
Subjt:  LEEENAPEVD--------LASRSRGRSARGHSRRIELERYVNEHGRIPIEIDEKVGKPMCTKGTTLSGAIGTITRDTIPLHYKTWN--------------

Query:  ETYFIVDLTKHHINRYVERLISTTFKEYMAELNQYYLEFDDPKEACEYPLDRIDNPADWNMLCDRWEIAEWKVYFYVVLYYIEKLKKKVEPISLTTIELG
        ET+F  D T   + +Y+E  +  TF+E+ A L++YY +FDD  EA   P +RI +  DWNM+CDRWE   WK             KKK            
Subjt:  ETYFIVDLTKHHINRYVERLISTTFKEYMAELNQYYLEFDDPKEACEYPLDRIDNPADWNMLCDRWEIAEWKVYFYVVLYYIEKLKKKVEPISLTTIELG

Query:  PSHLFKCRTNWLDTNSCKIQKGHDIGEVDLFDESHFNIKDEWVNDHARDAYLKMQQLLAALSQEGSEPISQSEVCKMVLGTRSDHIKCLGWDPKPSSSSS
                            KG D+ E+++F E+HF  K+ W+ND A+DAYL+MQ+++   ++ G + IS ++ CK VLG+RS           P S  S
Subjt:  PSHLFKCRTNWLDTNSCKIQKGHDIGEVDLFDESHFNIKDEWVNDHARDAYLKMQQLLAALSQEGSEPISQSEVCKMVLGTRSDHIKCLGWDPKPSSSSS

Query:  VTSSFQHEKKLEK-KVEQMQAEIGTLTTKLSSWEERW
        + S+    ++ +K ++  ++     LT +L+ WE+ +
Subjt:  VTSSFQHEKKLEK-KVEQMQAEIGTLTTKLSSWEERW

A0A6J1DUH3 uncharacterized protein LOC1110232123.2e-5444.16Show/hide
Query:  IPLHYKTWNETYFIVDLTKHHINRYVERLISTTFKEYMAELNQYYLEFDDPKEACEYPLDRIDNPADWNMLCDRWEIAEWKVYFYVVLYYIEKLKKKVEP
        + L     N+ +F VDL+K  +N+++E+ +  +FK+Y ++L+QYY EF+DP EA   P +R+ NP DWN LCDRWE  EWK         I K  KK   
Subjt:  IPLHYKTWNETYFIVDLTKHHINRYVERLISTTFKEYMAELNQYYLEFDDPKEACEYPLDRIDNPADWNMLCDRWEIAEWKVYFYVVLYYIEKLKKKVEP

Query:  ISLTTIELGPSHLFKCRTNWLDTNSCKIQKGHDIGEVDLFDESHFNIKDEWVNDHARDAYLKMQQLLAALSQEGSEPISQSEVCKMVLGTRSDHIKCLGW
              +L  +H    ++     +  KI++G DIG VDLF ESH+N KD  VND+A DAY  MQ L+ A +QEG EP++Q E C+ VLG R DH+K LG+
Subjt:  ISLTTIELGPSHLFKCRTNWLDTNSCKIQKGHDIGEVDLFDESHFNIKDEWVNDHARDAYLKMQQLLAALSQEGSEPISQSEVCKMVLGTRSDHIKCLGW

Query:  DPKPS-----SSSSVTSSFQHEKKLEKKVEQMQAEIGTLTTK-------LSSWEERWVEFTKYMDERQGEGSSN
         P+P+     SSS+VTSS  +EK+LEKKVE M+ E+  + T+       +S+WE+RW E +++M  RQG+G SN
Subjt:  DPKPS-----SSSSVTSSFQHEKKLEKKVEQMQAEIGTLTTK-------LSSWEERWVEFTKYMDERQGEGSSN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGCATCCAAGTCAAGATCATGATGTAGCTGCTGTGTTAGAAGAGGAGAATGCTCCGGAGGTTGATCTAGCCTCTCGATCGCGAGGCAGGAGCGCGAGAGGGCATAG
CCGAAGGATTGAGTTAGAGCGTTATGTCAATGAACATGGTAGAATACCCATTGAGATCGATGAGAAGGTCGGTAAACCAATGTGTACTAAGGGCACTACGTTAAGTGGAG
CCATTGGTACCATCACCCGAGATACAATTCCACTGCATTATAAAACATGGAACGAGACATACTTCATCGTGGATTTGACCAAACATCATATAAATAGATACGTAGAGCGA
CTGATATCAACCACATTTAAGGAATATATGGCAGAATTGAATCAATACTACCTTGAGTTTGACGACCCTAAAGAGGCTTGTGAATATCCTCTAGACAGAATCGATAATCC
AGCTGATTGGAATATGTTATGTGATCGATGGGAGATCGCTGAATGGAAGGTATACTTTTATGTTGTACTTTATTATATAGAAAAATTAAAAAAAAAAGTCGAGCCAATCT
CTCTCACAACCATCGAACTGGGTCCAAGTCATTTGTTCAAGTGCAGAACGAATTGGTTAGATACAAACTCATGTAAGATACAAAAGGGGCATGACATAGGCGAAGTGGAT
TTGTTCGATGAAAGTCACTTCAACATAAAGGACGAATGGGTGAACGACCATGCGAGGGATGCATATTTGAAAATGCAACAACTTCTTGCAGCATTGTCACAAGAAGGATC
TGAGCCAATCTCACAGTCCGAAGTTTGTAAAATGGTTTTGGGTACTCGATCAGACCACATAAAATGTCTTGGTTGGGACCCAAAACCTAGTTCGTCGTCTAGCGTCACAT
CTTCTTTCCAACATGAAAAAAAGCTTGAAAAGAAGGTGGAGCAAATGCAAGCTGAGATTGGTACCTTAACGACGAAGTTGTCCTCATGGGAAGAAAGATGGGTTGAATTC
ACTAAGTACATGGATGAAAGGCAGGGTGAAGGTTCTTCAAACCCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTGCATCCAAGTCAAGATCATGATGTAGCTGCTGTGTTAGAAGAGGAGAATGCTCCGGAGGTTGATCTAGCCTCTCGATCGCGAGGCAGGAGCGCGAGAGGGCATAG
CCGAAGGATTGAGTTAGAGCGTTATGTCAATGAACATGGTAGAATACCCATTGAGATCGATGAGAAGGTCGGTAAACCAATGTGTACTAAGGGCACTACGTTAAGTGGAG
CCATTGGTACCATCACCCGAGATACAATTCCACTGCATTATAAAACATGGAACGAGACATACTTCATCGTGGATTTGACCAAACATCATATAAATAGATACGTAGAGCGA
CTGATATCAACCACATTTAAGGAATATATGGCAGAATTGAATCAATACTACCTTGAGTTTGACGACCCTAAAGAGGCTTGTGAATATCCTCTAGACAGAATCGATAATCC
AGCTGATTGGAATATGTTATGTGATCGATGGGAGATCGCTGAATGGAAGGTATACTTTTATGTTGTACTTTATTATATAGAAAAATTAAAAAAAAAAGTCGAGCCAATCT
CTCTCACAACCATCGAACTGGGTCCAAGTCATTTGTTCAAGTGCAGAACGAATTGGTTAGATACAAACTCATGTAAGATACAAAAGGGGCATGACATAGGCGAAGTGGAT
TTGTTCGATGAAAGTCACTTCAACATAAAGGACGAATGGGTGAACGACCATGCGAGGGATGCATATTTGAAAATGCAACAACTTCTTGCAGCATTGTCACAAGAAGGATC
TGAGCCAATCTCACAGTCCGAAGTTTGTAAAATGGTTTTGGGTACTCGATCAGACCACATAAAATGTCTTGGTTGGGACCCAAAACCTAGTTCGTCGTCTAGCGTCACAT
CTTCTTTCCAACATGAAAAAAAGCTTGAAAAGAAGGTGGAGCAAATGCAAGCTGAGATTGGTACCTTAACGACGAAGTTGTCCTCATGGGAAGAAAGATGGGTTGAATTC
ACTAAGTACATGGATGAAAGGCAGGGTGAAGGTTCTTCAAACCCCTAG
Protein sequenceShow/hide protein sequence
MVHPSQDHDVAAVLEEENAPEVDLASRSRGRSARGHSRRIELERYVNEHGRIPIEIDEKVGKPMCTKGTTLSGAIGTITRDTIPLHYKTWNETYFIVDLTKHHINRYVER
LISTTFKEYMAELNQYYLEFDDPKEACEYPLDRIDNPADWNMLCDRWEIAEWKVYFYVVLYYIEKLKKKVEPISLTTIELGPSHLFKCRTNWLDTNSCKIQKGHDIGEVD
LFDESHFNIKDEWVNDHARDAYLKMQQLLAALSQEGSEPISQSEVCKMVLGTRSDHIKCLGWDPKPSSSSSVTSSFQHEKKLEKKVEQMQAEIGTLTTKLSSWEERWVEF
TKYMDERQGEGSSNP