CuGenDBv2

Tan0011319 (gene) of Snake gourd v1 genome

Gene ID: Tan0011319
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Retrotransposon protein
Genome location: LG01:53482081..53483005
RNA-Seq Expression: Tan0011319
Synteny: Tan0011319
Gene Ontology terms: NA
InterPro domains: IPR024752 - Myb/SANT-like domain
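The genome location field can be split into its parts with a few lines of Python. This is a minimal sketch, not part of the database record: the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return seqid, start, end


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG01:53482081..53483005")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 925 positions on LG01.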


Homology
GenBank top hits | e value | % identity
KAA0033487.1 uncharacterized protein E6C27_scaffold261G00210 [Cucumis melo var. makuwa] | 3.6e-57 | 51.97
Query:  WRVDNGTFRHWYLVQVQKLFKQKLPESDIQVTPNLESRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNKPFPYFDEL
        WRVDNGTF+  YLVQVQKL K+K+ ES+IQVTPNLES VKILKKQY  IAEMMGP CSGF WN ERKCI+ EK++ +DWVK H +AR L NKPFPYF +L
Subjt:  WRVDNGTFRHWYLVQVQKLFKQKLPESDIQVTPNLESRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNKPFPYFDEL

Query:  SIIFGKDRATGAGAETPHDMASASATDV-DDDINMTFQDLPIPDPPAYDPTSDEDMSATPISRNDGAGSSSGIALWPKEKDELESTRRKRLYAELQAISG
         I+FG+DRATG   +TP +M S +A D  +DD+ +  +D  IP+P   +P S EDM +TP S    AGS                ++++R Y+       
Subjt:  SIIFGKDRATGAGAETPHDMASASATDV-DDDINMTFQDLPIPDPPAYDPTSDEDMSATPISRNDGAGSSSGIALWPKEKDELESTRRKRLYAELQAISG

Query:  IDMDDCLQIAETLLADISKFHSFLDYPAE
         D+ D  +  E+LL D +  H+FLDYP E
Subjt:  IDMDDCLQIAETLLADISKFHSFLDYPAE

KAA0050106.1 retrotransposon protein [Cucumis melo var. makuwa] | 1.1e-58 | 48.34
Query:  GWRVDNGTFRHWYLVQVQKLFKQKLPESDIQVTPNLESRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNKPFPYFDE
        GWR DNGTF+  YLVQVQKL K+K+  S+IQVTPNL+SRVKILKKQY AIAEMMGPACSGFGWN+ERKCI+ EK++FDDWVK                  
Subjt:  GWRVDNGTFRHWYLVQVQKLFKQKLPESDIQVTPNLESRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNKPFPYFDE

Query:  LSIIFGKDRATGAGAETPHDMASASATDV-DDDINMTFQDLPIPDPPAYDPTSDEDMSATPISRNDGAGSS---------SG------------------
                                +A D+ +DD+++  +D  IP+P   +P S EDM +TP S    AGSS         SG                  
Subjt:  LSIIFGKDRATGAGAETPHDMASASATDV-DDDINMTFQDLPIPDPPAYDPTSDEDMSATPISRNDGAGSS---------SG------------------

Query:  -IALWPKEKDELESTRRKRLYAELQAISGIDMDDCLQIAETLLADISKFHSFLDYPAEWKYKCCMRILGRE
         IA W +EK E+ES+  KRLY +LQ I G+D+DDCL +AE+LL D +  H+FLDYPAEWKY+ CMRILGR+
Subjt:  -IALWPKEKDELESTRRKRLYAELQAISGIDMDDCLQIAETLLADISKFHSFLDYPAEWKYKCCMRILGRE

TYK07921.1 hypothetical protein E5676_scaffold265G00330 [Cucumis melo var. makuwa] | 8.8e-56 | 47.27
Query:  GWRVDNGTFRHWYLVQVQKLFKQKLPESDIQVTPNLESRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNKPFPYFDE
        GWR DNGTF+  YL                              KQY AIAEMMGPACSGFGWN+ +KCI+VEK +FDDWVK HP+A+GL NKPFPYF +
Subjt:  GWRVDNGTFRHWYLVQVQKLFKQKLPESDIQVTPNLESRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNKPFPYFDE

Query:  LSIIFGKDRATGAGAETPHDMASASATDV-DDDINMTFQDLPIPDPPAYDPTSDEDMSATPISRNDGAGSS---------SG------------------
        L ++FG+DRATG   +TP +M+S +A D  +DD+++  +D  IP+P   +P S EDM +TP S    AGSS         SG                  
Subjt:  LSIIFGKDRATGAGAETPHDMASASATDV-DDDINMTFQDLPIPDPPAYDPTSDEDMSATPISRNDGAGSS---------SG------------------

Query:  -IALWPKEKDELESTRRKRLYAELQAISGIDMDDCLQIAETLLADISKFHSFLDYP
         IA W +EK E+ES+  KRLYAELQ I G+D+DDCL +AE+LL D +  H+FLDYP
Subjt:  -IALWPKEKDELESTRRKRLYAELQAISGIDMDDCLQIAETLLADISKFHSFLDYP

TYK26842.1 uncharacterized protein E5676_scaffold260G00340 [Cucumis melo var. makuwa] | 1.6e-57 | 52.4
Query:  WRVDNGTFRHWYLVQVQKLFKQKLPESDIQVTPNLESRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNKPFPYFDEL
        WRVDNGTF+  YLVQVQKL K+K+ ES+IQVTPNLES VKILKKQY  IAEMMGP CSGF WN ERKCI+ EK++ +DWVK H +AR L NKPFPYF +L
Subjt:  WRVDNGTFRHWYLVQVQKLFKQKLPESDIQVTPNLESRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNKPFPYFDEL

Query:  SIIFGKDRATGAGAETPHDMASASATDV-DDDINMTFQDLPIPDPPAYDPTSDEDMSATPISRNDGAGSSSGIALWPKEKDELESTRRKRLYAELQAISG
         I+FG+DRATG   +TP +M S +A D  +DD+ +  +D  IP+P   +P S EDM +TP S    AGSS               ++++R Y+       
Subjt:  SIIFGKDRATGAGAETPHDMASASATDV-DDDINMTFQDLPIPDPPAYDPTSDEDMSATPISRNDGAGSSSGIALWPKEKDELESTRRKRLYAELQAISG

Query:  IDMDDCLQIAETLLADISKFHSFLDYPAE
         D+ D  +  E+LL D +  H+FLDYP E
Subjt:  IDMDDCLQIAETLLADISKFHSFLDYPAE

XP_031741735.1 uncharacterized protein At2g29880-like [Cucumis sativus] | 1.6e-52 | 60.95
Query:  GWRVDNGTFRHWYLVQVQKLFKQKLPESDIQVTPNLESRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNKPFPYFDE
        GWR  NGTF+  YLVQVQKL K+K+P S+IQVTPNLE RVKILKKQY AI EMMGP+CS FGWN++RKCI+ EK +FDD VK HP+ARGL NKPFPYF +
Subjt:  GWRVDNGTFRHWYLVQVQKLFKQKLPESDIQVTPNLESRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNKPFPYFDE

Query:  LSIIFGKDRATGAGAETPHDMASASATDV-DDDINMTFQDLPIPDPPAYDPTSDEDMSATPISRNDGAG
        L I+FG+DRATG   +TP +M S +  D+ +DD+++  +D  IP+P   +P S EDMS+T  S    AG
Subjt:  LSIIFGKDRATGAGAETPHDMASASATDV-DDDINMTFQDLPIPDPPAYDPTSDEDMSATPISRNDGAG

TrEMBL top hits | e value | % identity
A0A5A7SW62 Myb_DNA-bind_3 domain-containing protein | 1.7e-57 | 51.97
Query:  WRVDNGTFRHWYLVQVQKLFKQKLPESDIQVTPNLESRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNKPFPYFDEL
        WRVDNGTF+  YLVQVQKL K+K+ ES+IQVTPNLES VKILKKQY  IAEMMGP CSGF WN ERKCI+ EK++ +DWVK H +AR L NKPFPYF +L
Subjt:  WRVDNGTFRHWYLVQVQKLFKQKLPESDIQVTPNLESRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNKPFPYFDEL

Query:  SIIFGKDRATGAGAETPHDMASASATDV-DDDINMTFQDLPIPDPPAYDPTSDEDMSATPISRNDGAGSSSGIALWPKEKDELESTRRKRLYAELQAISG
         I+FG+DRATG   +TP +M S +A D  +DD+ +  +D  IP+P   +P S EDM +TP S    AGS                ++++R Y+       
Subjt:  SIIFGKDRATGAGAETPHDMASASATDV-DDDINMTFQDLPIPDPPAYDPTSDEDMSATPISRNDGAGSSSGIALWPKEKDELESTRRKRLYAELQAISG

Query:  IDMDDCLQIAETLLADISKFHSFLDYPAE
         D+ D  +  E+LL D +  H+FLDYP E
Subjt:  IDMDDCLQIAETLLADISKFHSFLDYPAE

A0A5A7U7F7 Retrotransposon protein | 5.4e-59 | 48.34
Query:  GWRVDNGTFRHWYLVQVQKLFKQKLPESDIQVTPNLESRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNKPFPYFDE
        GWR DNGTF+  YLVQVQKL K+K+  S+IQVTPNL+SRVKILKKQY AIAEMMGPACSGFGWN+ERKCI+ EK++FDDWVK                  
Subjt:  GWRVDNGTFRHWYLVQVQKLFKQKLPESDIQVTPNLESRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNKPFPYFDE

Query:  LSIIFGKDRATGAGAETPHDMASASATDV-DDDINMTFQDLPIPDPPAYDPTSDEDMSATPISRNDGAGSS---------SG------------------
                                +A D+ +DD+++  +D  IP+P   +P S EDM +TP S    AGSS         SG                  
Subjt:  LSIIFGKDRATGAGAETPHDMASASATDV-DDDINMTFQDLPIPDPPAYDPTSDEDMSATPISRNDGAGSS---------SG------------------

Query:  -IALWPKEKDELESTRRKRLYAELQAISGIDMDDCLQIAETLLADISKFHSFLDYPAEWKYKCCMRILGRE
         IA W +EK E+ES+  KRLY +LQ I G+D+DDCL +AE+LL D +  H+FLDYPAEWKY+ CMRILGR+
Subjt:  -IALWPKEKDELESTRRKRLYAELQAISGIDMDDCLQIAETLLADISKFHSFLDYPAEWKYKCCMRILGRE

A0A5D3BC95 Retrotransposon protein | 2.2e-52 | 60.82
Query:  GWRVDNGTFRHWYLVQVQKLFKQKLPESDIQVTPNLESRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNKPFPYFDE
        GWR +N TF+  YLVQVQKL K+K+P S+IQVT NLESRVK LKKQY AIA+MMGPACS FGWN+ERKCI+ EK++FDDWVK HP+ARGL NKPF YF +
Subjt:  GWRVDNGTFRHWYLVQVQKLFKQKLPESDIQVTPNLESRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNKPFPYFDE

Query:  LSIIFGKDRATGAGAETPHDMASASATDV-DDDINMTFQDLPIPDPPAYDPTSDEDMSATPISRNDGAGSS
        L I+FG+D+ATG   +   +MAS +A D  +DD+++  +D  IP+P   +P S EDM +T IS    AGSS
Subjt:  LSIIFGKDRATGAGAETPHDMASASATDV-DDDINMTFQDLPIPDPPAYDPTSDEDMSATPISRNDGAGSS

A0A5D3C7T4 Uncharacterized protein | 4.3e-56 | 47.27
Query:  GWRVDNGTFRHWYLVQVQKLFKQKLPESDIQVTPNLESRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNKPFPYFDE
        GWR DNGTF+  YL                              KQY AIAEMMGPACSGFGWN+ +KCI+VEK +FDDWVK HP+A+GL NKPFPYF +
Subjt:  GWRVDNGTFRHWYLVQVQKLFKQKLPESDIQVTPNLESRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNKPFPYFDE

Query:  LSIIFGKDRATGAGAETPHDMASASATDV-DDDINMTFQDLPIPDPPAYDPTSDEDMSATPISRNDGAGSS---------SG------------------
        L ++FG+DRATG   +TP +M+S +A D  +DD+++  +D  IP+P   +P S EDM +TP S    AGSS         SG                  
Subjt:  LSIIFGKDRATGAGAETPHDMASASATDV-DDDINMTFQDLPIPDPPAYDPTSDEDMSATPISRNDGAGSS---------SG------------------

Query:  -IALWPKEKDELESTRRKRLYAELQAISGIDMDDCLQIAETLLADISKFHSFLDYP
         IA W +EK E+ES+  KRLYAELQ I G+D+DDCL +AE+LL D +  H+FLDYP
Subjt:  -IALWPKEKDELESTRRKRLYAELQAISGIDMDDCLQIAETLLADISKFHSFLDYP

A0A5D3DTL0 Myb_DNA-bind_3 domain-containing protein | 7.8e-58 | 52.4
Query:  WRVDNGTFRHWYLVQVQKLFKQKLPESDIQVTPNLESRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNKPFPYFDEL
        WRVDNGTF+  YLVQVQKL K+K+ ES+IQVTPNLES VKILKKQY  IAEMMGP CSGF WN ERKCI+ EK++ +DWVK H +AR L NKPFPYF +L
Subjt:  WRVDNGTFRHWYLVQVQKLFKQKLPESDIQVTPNLESRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNKPFPYFDEL

Query:  SIIFGKDRATGAGAETPHDMASASATDV-DDDINMTFQDLPIPDPPAYDPTSDEDMSATPISRNDGAGSSSGIALWPKEKDELESTRRKRLYAELQAISG
         I+FG+DRATG   +TP +M S +A D  +DD+ +  +D  IP+P   +P S EDM +TP S    AGSS               ++++R Y+       
Subjt:  SIIFGKDRATGAGAETPHDMASASATDV-DDDINMTFQDLPIPDPPAYDPTSDEDMSATPISRNDGAGSSSGIALWPKEKDELESTRRKRLYAELQAISG

Query:  IDMDDCLQIAETLLADISKFHSFLDYPAE
         D+ D  +  E+LL D +  H+FLDYP E
Subjt:  IDMDDCLQIAETLLADISKFHSFLDYPAE

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT2G24960.1 unknown protein | 2.3e-06 | 28.28
Query:  LESRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNKPFPYFDELSIIFGKDRATGAGAETPHDMASASATDVDDDIN
        L  R   L K Y  +  ++     GF W++ R  I  + A++D ++K HP AR  R K  P +++L  IF      G         A  S T    + N
Subjt:  LESRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNKPFPYFDELSIIFGKDRATGAGAETPHDMASASATDVDDDIN

AT2G24960.2 unknown protein | 2.3e-06 | 28.28
Query:  LESRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNKPFPYFDELSIIFGKDRATGAGAETPHDMASASATDVDDDIN
        L  R   L K Y  +  ++     GF W++ R  I  + A++D ++K HP AR  R K  P +++L  IF      G         A  S T    + N
Subjt:  LESRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNKPFPYFDELSIIFGKDRATGAGAETPHDMASASATDVDDDIN

AT4G02210.1 unknown protein | 9.8e-13 | 26.42
Query:  RGWRVDNGTFRHWYLVQVQKLFKQKLPESDIQVTPNLESRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNKPFPYFD
        RG +++ G FR     ++  LF  K  ES+  V   L++R K L++Q+NAI  ++     GF W++ER+ +  +  ++ D++KAH  AR    +P PY+ 
Subjt:  RGWRVDNGTFRHWYLVQVQKLFKQKLPESDIQVTPNLESRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNKPFPYFD

Query:  ELSIIFGKDRATGAGAETPHDMASASATDVDDDINMTFQDLPIPDPPAYDPTSDEDMSAT----PISRNDGAGSSSGIALWPKEKDELESTRRKRLYAEL
        +L ++ G      +G E      +    D + +    FQ+           +++E+ S +    P ++ D   ++    + PK K  ++ T+   +   +
Subjt:  ELSIIFGKDRATGAGAETPHDMASASATDVDDDINMTFQDLPIPDPPAYDPTSDEDMSAT----PISRNDGAGSSSGIALWPKEKDELESTRRKRLYAEL

Query:  QAISGI-DMDDCLQI-AETLLADISKFHSFLDYPAEWKYKCCMRIL
        +AI  + DMDD L + A  LL D  K  +FL    + + K  +R L
Subjt:  QAISGI-DMDDCLQI-AETLLADISKFHSFLDYPAEWKYKCCMRIL

AT4G02210.2 unknown protein | 9.8e-13 | 26.42
Query:  RGWRVDNGTFRHWYLVQVQKLFKQKLPESDIQVTPNLESRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNKPFPYFD
        RG +++ G FR     ++  LF  K  ES+  V   L++R K L++Q+NAI  ++     GF W++ER+ +  +  ++ D++KAH  AR    +P PY+ 
Subjt:  RGWRVDNGTFRHWYLVQVQKLFKQKLPESDIQVTPNLESRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNKPFPYFD

Query:  ELSIIFGKDRATGAGAETPHDMASASATDVDDDINMTFQDLPIPDPPAYDPTSDEDMSAT----PISRNDGAGSSSGIALWPKEKDELESTRRKRLYAEL
        +L ++ G      +G E      +    D + +    FQ+           +++E+ S +    P ++ D   ++    + PK K  ++ T+   +   +
Subjt:  ELSIIFGKDRATGAGAETPHDMASASATDVDDDINMTFQDLPIPDPPAYDPTSDEDMSAT----PISRNDGAGSSSGIALWPKEKDELESTRRKRLYAEL

Query:  QAISGI-DMDDCLQI-AETLLADISKFHSFLDYPAEWKYKCCMRIL
        +AI  + DMDD L + A  LL D  K  +FL    + + K  +R L
Subjt:  QAISGI-DMDDCLQI-AETLLADISKFHSFLDYPAEWKYKCCMRIL

AT5G27260.1 unknown protein | 5.4e-11 | 26.76
Query:  NRGWRVDNGTFRHWYLVQVQKLFKQKLPESDIQVTPNLE-----SRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNK
        N  WR  NGT      + V+  F   +PE + +   +       SR+K LK QY +  ++     SGFGW+   K       ++ D++KAHP+ + LR  
Subjt:  NRGWRVDNGTFRHWYLVQVQKLFKQKLPESDIQVTPNLE-----SRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNK

Query:  PFPYFDELSIIFGKDRATGAGAETPHDMASASATDVDDDINMTFQDLPIPDPPAYDPTSDEDMSA--TPISRNDGAGSSSGIALWPKEKDELESTRRKRL
         F +FDEL IIFG+  ATG  A    D          ++    + D    +   YD T+  + S    P   +   G+S    L P+++   E +  ++ 
Subjt:  PFPYFDELSIIFGKDRATGAGAETPHDMASASATDVDDDINMTFQDLPIPDPPAYDPTSDEDMSA--TPISRNDGAGSSSGIALWPKEKDELESTRRKRL

Query:  YAELQAISGIDMD
         + +  +S   +D
Subjt:  YAELQAISGIDMD


Sequences
CDS sequence
ATGTCGTGCAACAGGGGGTGGAGAGTCGATAATGGCACATTTCGACACTGGTACTTAGTACAAGTACAAAAATTGTTCAAGCAGAAACTCCCTGAAAGCGACATACAAGT
GACCCCAAACCTAGAATCTAGAGTGAAGATTCTGAAGAAGCAATACAATGCTATAGCTGAGATGATGGGGCCAGCATGTAGTGGGTTTGGGTGGAATGACGAACGTAAGT
GCATTCAGGTAGAGAAAGCAATTTTCGATGACTGGGTTAAGGCACACCCTCATGCTCGAGGCCTTAGGAACAAGCCATTTCCATACTTCGACGAGTTATCAATTATATTC
GGTAAAGACAGGGCAACTGGTGCGGGTGCAGAGACTCCCCATGACATGGCCTCAGCATCGGCCACAGACGTAGATGACGACATCAACATGACTTTTCAAGATCTCCCAAT
CCCTGACCCACCTGCATATGACCCGACATCTGACGAGGATATGTCTGCCACACCTATATCCAGGAACGATGGGGCAGGATCATCAAGTGGAATTGCCTTGTGGCCTAAAG
AGAAGGATGAACTGGAGTCGACCCGACGCAAACGACTATATGCAGAACTTCAAGCTATCTCTGGTATAGATATGGATGATTGTTTACAGATTGCTGAGACTCTGTTGGCC
GATATATCCAAATTCCACTCATTCCTCGACTACCCAGCTGAATGGAAATACAAATGTTGCATGCGTATCTTGGGAAGGGAGGCATGA
mRNA sequence
ATGTCGTGCAACAGGGGGTGGAGAGTCGATAATGGCACATTTCGACACTGGTACTTAGTACAAGTACAAAAATTGTTCAAGCAGAAACTCCCTGAAAGCGACATACAAGT
GACCCCAAACCTAGAATCTAGAGTGAAGATTCTGAAGAAGCAATACAATGCTATAGCTGAGATGATGGGGCCAGCATGTAGTGGGTTTGGGTGGAATGACGAACGTAAGT
GCATTCAGGTAGAGAAAGCAATTTTCGATGACTGGGTTAAGGCACACCCTCATGCTCGAGGCCTTAGGAACAAGCCATTTCCATACTTCGACGAGTTATCAATTATATTC
GGTAAAGACAGGGCAACTGGTGCGGGTGCAGAGACTCCCCATGACATGGCCTCAGCATCGGCCACAGACGTAGATGACGACATCAACATGACTTTTCAAGATCTCCCAAT
CCCTGACCCACCTGCATATGACCCGACATCTGACGAGGATATGTCTGCCACACCTATATCCAGGAACGATGGGGCAGGATCATCAAGTGGAATTGCCTTGTGGCCTAAAG
AGAAGGATGAACTGGAGTCGACCCGACGCAAACGACTATATGCAGAACTTCAAGCTATCTCTGGTATAGATATGGATGATTGTTTACAGATTGCTGAGACTCTGTTGGCC
GATATATCCAAATTCCACTCATTCCTCGACTACCCAGCTGAATGGAAATACAAATGTTGCATGCGTATCTTGGGAAGGGAGGCATGA
Protein sequence
MSCNRGWRVDNGTFRHWYLVQVQKLFKQKLPESDIQVTPNLESRVKILKKQYNAIAEMMGPACSGFGWNDERKCIQVEKAIFDDWVKAHPHARGLRNKPFPYFDELSIIF
GKDRATGAGAETPHDMASASATDVDDDINMTFQDLPIPDPPAYDPTSDEDMSATPISRNDGAGSSSGIALWPKEKDELESTRRKRLYAELQAISGIDMDDCLQIAETLLA
DISKFHSFLDYPAEWKYKCCMRILGREA
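
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch, not part of the database record: it assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene, and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; a single trailing stop codon is dropped."""
    cds = "".join(cds.split()).upper()  # tolerate line breaks in pasted sequence
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' marks codons with ambiguous bases
        for i in range(0, len(cds), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


# First 24 nucleotides of the CDS listed in this record.
print(translate("ATGTCGTGCAACAGGGGGTGGAGA"))
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result with the protein sequence is a quick way to confirm that the two sequence blocks were copied without truncation.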