CuGenDBv2

Tan0011235 (gene) of Snake gourd v1 genome

Gene ID: Tan0011235
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: LG07:62698903..62702915
RNA-Seq Expression: Tan0011235
Synteny: Tan0011235
Gene Ontology terms:
GO:0044237 - cellular metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0003824 - catalytic activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains: NA
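
The genome location string above can be split into a linkage group and a coordinate pair. The following is a minimal sketch, not part of the database; the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into its three parts."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location format: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("LG07:62698903..62702915")
# Assumption: 1-based, inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 4013 bp on LG07.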


Homology
GenBank top hits (e value, % identity, alignment)
KAA0040138.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] (e value: 3.7e-50; % identity: 52.88)
Query:  MTTTNYEIERFNDNNDFSLWCRRTQIILQQQSALKALEDPKSLPDAITDEEKKKMEEIAYGTLLMNVTGNIFRQVMDESAAFSAWKKLQALYVKKDIPNK
        M TT +EIERFN   DF+LW +R Q IL  Q+ALK ++DPK+LP  + + +K+ MEE  Y  L++N+T N+ RQV++E+  +   +KL ALY KKD+P+K
Subjt:  MTTTNYEIERFNDNNDFSLWCRRTQIILQQQSALKALEDPKSLPDAITDEEKKKMEEIAYGTLLMNVTGNIFRQVMDESAAFSAWKKLQALYVKKDIPNK

Query:  ICLRERLFTYWMNSSKTLDENLDEFKKLTAELAAAGERLGSESEATILINYLPEAYKDVKNALKYGRDSISLDSVISTIKCKELELKSKNK
        + +RE+LF++ MN SKTL+ENLDEFKKLT E    GE+L +ESEA I IN L + YK+VK+ LKYGR+S+ +D VI+ +K KELEL+ + K
Subjt:  ICLRERLFTYWMNSSKTLDENLDEFKKLTAELAAAGERLGSESEATILINYLPEAYKDVKNALKYGRDSISLDSVISTIKCKELELKSKNK

KAA0051442.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] (e value: 1.3e-63; % identity: 47.46)
Query:  MTTTNYEIERFNDNNDFSLWCRRTQIILQQQSALKALEDPKSLPDAITDEEKKKMEEIAYGTLLMNVTGNIFRQVMDESAAFSAWKKLQALYVKKDIPNK
        MTTT +EIE+F+ N DF+LW +R   IL  Q ALKALEDPK LP  +T  E++ +EE+AY TL+MN+T N+ RQV++E+ AF+ W+KL++LY KKD+PNK
Subjt:  MTTTNYEIERFNDNNDFSLWCRRTQIILQQQSALKALEDPKSLPDAITDEEKKKMEEIAYGTLLMNVTGNIFRQVMDESAAFSAWKKLQALYVKKDIPNK

Query:  ICLRERLFTYWMNSSKTLDENLDEFKKLTAELAAAGERLGSESEATILINYLPEAYKDVKNALKYGRDSISLDSVISTIKCKELELKSKNKGNGGVESLF
        + ++E+LF++  N +K LDENLDEFKKLT  L   GE+LG+E+EA ILIN + + YK+VK  LKYGR++I+++SVI+T+K KELELK++NK +   E   
Subjt:  ICLRERLFTYWMNSSKTLDENLDEFKKLTAELAAAGERLGSESEATILINYLPEAYKDVKNALKYGRDSISLDSVISTIKCKELELKSKNKGNGGVESLF

Query:  TKGKGHSKKTRFSK-GQKNFKGKAALKCFTCQRDCPQRRKVNLRKGRKHGRGDVSIGENTFEYTEVLATTGEKTIK
         K     +  R+   G+++F                  ++ + R+GR+HGR    +G   FEYTEVLA T +K ++
Subjt:  TKGKGHSKKTRFSK-GQKNFKGKAALKCFTCQRDCPQRRKVNLRKGRKHGRGDVSIGENTFEYTEVLATTGEKTIK

TXG46510.1 hypothetical protein EZV62_027990 [Acer yangbiense] (e value: 1.2e-48; % identity: 49.75)
Query:  YEIERFNDNNDFSLWCRRTQIILQQQSALKALEDPKSLPDAITDEEKKKMEEIAYGTLLMNVTGNIFRQVMDESAAFSAWKKLQALYVKKDIPNKICLRE
        +EI++F+   DF +W R+ + +L QQ  LKA+E P  LPD++ DE+K  M E+A  T+++N++ N+ R+V DE+ A+  W KL++LY+ K + NKI L+E
Subjt:  YEIERFNDNNDFSLWCRRTQIILQQQSALKALEDPKSLPDAITDEEKKKMEEIAYGTLLMNVTGNIFRQVMDESAAFSAWKKLQALYVKKDIPNKICLRE

Query:  RLFTYWMNSSKTLDENLDEFKKLTAELAAA--GERLGSESEATILINYLPEAYKDVKNALKYGRDSISLDSVISTIKCKELELKSKNKGNGGVESLFTKG
        RLF++ M++SK LD+NLDE+KK+T ELA A   E+L  E+EA IL+N LP+++KDVK A+KYGR S+SL+  IS +K KELELK + K NG  E+LF  G
Subjt:  RLFTYWMNSSKTLDENLDEFKKLTAELAAA--GERLGSESEATILINYLPEAYKDVKNALKYGRDSISLDSVISTIKCKELELKSKNKGNGGVESLFTKG

Query:  K
        +
Subjt:  K

TYK27723.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] (e value: 8.2e-66; % identity: 45.34)
Query:  MTTTNYEIERFNDNNDFSLWCRRTQIILQQQSALKALEDPKSLPDAITDEEKKKMEEIAYGTLLMNVTGNIFRQVMDESAAFSAWKKLQALYVKKDIPNK
        MTTT +EIE+F+ N DF+L  +R    L  Q ALKALEDPK LP  +T  E++ +EE+AY TL+MN+T N+ RQV++E+  F+ W+ L++LY KKD+ NK
Subjt:  MTTTNYEIERFNDNNDFSLWCRRTQIILQQQSALKALEDPKSLPDAITDEEKKKMEEIAYGTLLMNVTGNIFRQVMDESAAFSAWKKLQALYVKKDIPNK

Query:  ICLRERLFTYWMNSSKTLDENLDEFKKLTAELAAAGERLGSESEATILINYLPEAYKDVKNALKYGRDSISLDSVISTIKCKELELKSKNKGNGGVESLF
        + +RE+LF++ MN +K LDENLDEFKKLT  L    E+LG+ESEA ILIN + + YK+VK +LKYGR++I+++SVI+ +K KELELK++NK +   ESLF
Subjt:  ICLRERLFTYWMNSSKTLDENLDEFKKLTAELAAAGERLGSESEATILINYLPEAYKDVKNALKYGRDSISLDSVISTIKCKELELKSKNKGNGGVESLF

Query:  TKGKGHSKKTRFSKGQKNFKGKAALKCFTC------QRDCPQRRKVNLR---------------------------KGRKHGRGDVSIGENTFEYTEVLA
        +KG    +K   +K Q++ + K ALKCF C      +R+CP R K N R                           +GR+HG     +G   FEYT+VLA
Subjt:  TKGKGHSKKTRFSKGQKNFKGKAALKCFTC------QRDCPQRRKVNLR---------------------------KGRKHGRGDVSIGENTFEYTEVLA

Query:  TTGEKTIKQDL
         T ++ ++ ++
Subjt:  TTGEKTIKQDL

XP_038885928.1 uncharacterized protein LOC120076236 [Benincasa hispida] (e value: 9.1e-49; % identity: 40.59)
Query:  MTTTNYEIERFNDNNDFSLWCRRTQIILQQQSALKALEDPKSLPDAITDEEKKKMEEIAYGTLLMNVTGNIFRQVMDESAAFSAWKKLQALYVKKDIPNK
        MTTT YEIE+F    DF LW  + +++L++Q AL A+ DP   P  +   EK+ +E  AYGT+++NV  ++ RQ++D   A++ W KL  +Y+ KD+PNK
Subjt:  MTTTNYEIERFNDNNDFSLWCRRTQIILQQQSALKALEDPKSLPDAITDEEKKKMEEIAYGTLLMNVTGNIFRQVMDESAAFSAWKKLQALYVKKDIPNK

Query:  ICLRERLFTYWMNSSKTLDENLDEFKKLTAELAAAGERLGSESEATILINYLPEAYKDVKNALKYGRDSISLDSVISTIKCKELELKSKNKGNGGVESLF
          LRER FTY M+ +K+L +NL+EFK L+++  + G+ +G E+EA IL+N LPE +KDVK ALKYGR+ I+  ++IS +  KELEL+   K     E  F
Subjt:  ICLRERLFTYWMNSSKTLDENLDEFKKLTAELAAAGERLGSESEATILINYLPEAYKDVKNALKYGRDSISLDSVISTIKCKELELKSKNKGNGGVESLF

Query:  TKGKGHSKKTRFSKGQKNFKGKAALKCFTCQRDC-PQRRKVNLR-KGRKHGRGDVSIGENTFEYTEVLATT
         KG         + G+ N   +  ++  + ++DC   +RK+N + KG K  + + ++GEN+  Y++ LA T
Subjt:  TKGKGHSKKTRFSKGQKNFKGKAALKCFTCQRDC-PQRRKVNLR-KGRKHGRGDVSIGENTFEYTEVLATT

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TAZ3 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.8e-50; % identity: 52.88)
Query:  MTTTNYEIERFNDNNDFSLWCRRTQIILQQQSALKALEDPKSLPDAITDEEKKKMEEIAYGTLLMNVTGNIFRQVMDESAAFSAWKKLQALYVKKDIPNK
        M TT +EIERFN   DF+LW +R Q IL  Q+ALK ++DPK+LP  + + +K+ MEE  Y  L++N+T N+ RQV++E+  +   +KL ALY KKD+P+K
Subjt:  MTTTNYEIERFNDNNDFSLWCRRTQIILQQQSALKALEDPKSLPDAITDEEKKKMEEIAYGTLLMNVTGNIFRQVMDESAAFSAWKKLQALYVKKDIPNK

Query:  ICLRERLFTYWMNSSKTLDENLDEFKKLTAELAAAGERLGSESEATILINYLPEAYKDVKNALKYGRDSISLDSVISTIKCKELELKSKNK
        + +RE+LF++ MN SKTL+ENLDEFKKLT E    GE+L +ESEA I IN L + YK+VK+ LKYGR+S+ +D VI+ +K KELEL+ + K
Subjt:  ICLRERLFTYWMNSSKTLDENLDEFKKLTAELAAAGERLGSESEATILINYLPEAYKDVKNALKYGRDSISLDSVISTIKCKELELKSKNK

A0A5A7U6R2 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 6.4e-64; % identity: 47.46)
Query:  MTTTNYEIERFNDNNDFSLWCRRTQIILQQQSALKALEDPKSLPDAITDEEKKKMEEIAYGTLLMNVTGNIFRQVMDESAAFSAWKKLQALYVKKDIPNK
        MTTT +EIE+F+ N DF+LW +R   IL  Q ALKALEDPK LP  +T  E++ +EE+AY TL+MN+T N+ RQV++E+ AF+ W+KL++LY KKD+PNK
Subjt:  MTTTNYEIERFNDNNDFSLWCRRTQIILQQQSALKALEDPKSLPDAITDEEKKKMEEIAYGTLLMNVTGNIFRQVMDESAAFSAWKKLQALYVKKDIPNK

Query:  ICLRERLFTYWMNSSKTLDENLDEFKKLTAELAAAGERLGSESEATILINYLPEAYKDVKNALKYGRDSISLDSVISTIKCKELELKSKNKGNGGVESLF
        + ++E+LF++  N +K LDENLDEFKKLT  L   GE+LG+E+EA ILIN + + YK+VK  LKYGR++I+++SVI+T+K KELELK++NK +   E   
Subjt:  ICLRERLFTYWMNSSKTLDENLDEFKKLTAELAAAGERLGSESEATILINYLPEAYKDVKNALKYGRDSISLDSVISTIKCKELELKSKNKGNGGVESLF

Query:  TKGKGHSKKTRFSK-GQKNFKGKAALKCFTCQRDCPQRRKVNLRKGRKHGRGDVSIGENTFEYTEVLATTGEKTIK
         K     +  R+   G+++F                  ++ + R+GR+HGR    +G   FEYTEVLA T +K ++
Subjt:  TKGKGHSKKTRFSK-GQKNFKGKAALKCFTCQRDCPQRRKVNLRKGRKHGRGDVSIGENTFEYTEVLATTGEKTIK

A0A5A7UJ23 Integrase catalytic domain-containing protein (e value: 1.7e-48; % identity: 39.22)
Query:  MTTTNYEIERFNDNNDFSLWCRRTQIILQQQSALKALEDPKSLPDAITDEEKKKMEEIAYGTLLMNVTGNIFRQVMDESAAFSAWKKLQALYVKKDIPNK
        MTTT +EIE F+ N DF+ W +R   IL  Q ALKA EDPK LP  +T  E++ +EE+AY TL+MN+T N+ RQV++E+ AF+                 
Subjt:  MTTTNYEIERFNDNNDFSLWCRRTQIILQQQSALKALEDPKSLPDAITDEEKKKMEEIAYGTLLMNVTGNIFRQVMDESAAFSAWKKLQALYVKKDIPNK

Query:  ICLRERLFTYWMNSSKTLDENLDEFKKLTAELAAAGERLGSESEATILINYLPEAYKDVKNALKYGRDSISLDSVISTIKCKELELKSKNKGNGGVESLF
                              +EFKKLT      GE+LG+ESEA ILIN + + YK+VK ALKYGR+ I+++ VI+ +K +ELELK++NK +   ESLF
Subjt:  ICLRERLFTYWMNSSKTLDENLDEFKKLTAELAAAGERLGSESEATILINYLPEAYKDVKNALKYGRDSISLDSVISTIKCKELELKSKNKGNGGVESLF

Query:  TKGKGHSKKTRFSKGQKNFKGKAALKCFTC-----QRDCPQR--------------------------RKVNLRKGRKHGRGDVSIGENTFEYTEVLATT
         KGK   +K   +K Q++ + K ALKCF C     +R+CP R                          ++ + R+GR+HGR    +G   FEYTE+L TT
Subjt:  TKGKGHSKKTRFSKGQKNFKGKAALKCFTC-----QRDCPQR--------------------------RKVNLRKGRKHGRGDVSIGENTFEYTEVLATT

Query:  GEKTIK
         ++T++
Subjt:  GEKTIK

A0A5C7GPM1 Uncharacterized protein (e value: 5.8e-49; % identity: 49.75)
Query:  YEIERFNDNNDFSLWCRRTQIILQQQSALKALEDPKSLPDAITDEEKKKMEEIAYGTLLMNVTGNIFRQVMDESAAFSAWKKLQALYVKKDIPNKICLRE
        +EI++F+   DF +W R+ + +L QQ  LKA+E P  LPD++ DE+K  M E+A  T+++N++ N+ R+V DE+ A+  W KL++LY+ K + NKI L+E
Subjt:  YEIERFNDNNDFSLWCRRTQIILQQQSALKALEDPKSLPDAITDEEKKKMEEIAYGTLLMNVTGNIFRQVMDESAAFSAWKKLQALYVKKDIPNKICLRE

Query:  RLFTYWMNSSKTLDENLDEFKKLTAELAAA--GERLGSESEATILINYLPEAYKDVKNALKYGRDSISLDSVISTIKCKELELKSKNKGNGGVESLFTKG
        RLF++ M++SK LD+NLDE+KK+T ELA A   E+L  E+EA IL+N LP+++KDVK A+KYGR S+SL+  IS +K KELELK + K NG  E+LF  G
Subjt:  RLFTYWMNSSKTLDENLDEFKKLTAELAAA--GERLGSESEATILINYLPEAYKDVKNALKYGRDSISLDSVISTIKCKELELKSKNKGNGGVESLFTKG

Query:  K
        +
Subjt:  K

A0A5D3DVM0 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 4.0e-66; % identity: 45.34)
Query:  MTTTNYEIERFNDNNDFSLWCRRTQIILQQQSALKALEDPKSLPDAITDEEKKKMEEIAYGTLLMNVTGNIFRQVMDESAAFSAWKKLQALYVKKDIPNK
        MTTT +EIE+F+ N DF+L  +R    L  Q ALKALEDPK LP  +T  E++ +EE+AY TL+MN+T N+ RQV++E+  F+ W+ L++LY KKD+ NK
Subjt:  MTTTNYEIERFNDNNDFSLWCRRTQIILQQQSALKALEDPKSLPDAITDEEKKKMEEIAYGTLLMNVTGNIFRQVMDESAAFSAWKKLQALYVKKDIPNK

Query:  ICLRERLFTYWMNSSKTLDENLDEFKKLTAELAAAGERLGSESEATILINYLPEAYKDVKNALKYGRDSISLDSVISTIKCKELELKSKNKGNGGVESLF
        + +RE+LF++ MN +K LDENLDEFKKLT  L    E+LG+ESEA ILIN + + YK+VK +LKYGR++I+++SVI+ +K KELELK++NK +   ESLF
Subjt:  ICLRERLFTYWMNSSKTLDENLDEFKKLTAELAAAGERLGSESEATILINYLPEAYKDVKNALKYGRDSISLDSVISTIKCKELELKSKNKGNGGVESLF

Query:  TKGKGHSKKTRFSKGQKNFKGKAALKCFTC------QRDCPQRRKVNLR---------------------------KGRKHGRGDVSIGENTFEYTEVLA
        +KG    +K   +K Q++ + K ALKCF C      +R+CP R K N R                           +GR+HG     +G   FEYT+VLA
Subjt:  TKGKGHSKKTRFSKGQKNFKGKAALKCFTC------QRDCPQRRKVNLR---------------------------KGRKHGRGDVSIGENTFEYTEVLA

Query:  TTGEKTIKQDL
         T ++ ++ ++
Subjt:  TTGEKTIKQDL

SwissProt top hits (e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 7.8e-27; % identity: 26.64)
Query:  MTTTNYEIERFNDNNDFSLWCRRTQIILQQQSALKALEDPKSLPDAITDEEKKKMEEIAYGTLLMNVTGNIFRQVMDESAAFSAWKKLQALYVKKDIPNK
        M+   YE+ +FN +N FS W RR + +L QQ   K L+     PD +  E+   ++E A   + ++++ ++   ++DE  A   W +L++LY+ K + NK
Subjt:  MTTTNYEIERFNDNNDFSLWCRRTQIILQQQSALKALEDPKSLPDAITDEEKKKMEEIAYGTLLMNVTGNIFRQVMDESAAFSAWKKLQALYVKKDIPNK

Query:  ICLRERLFTYWMNSSKTLDENLDEFKKLTAELAAAGERLGSESEATILINYLPEAYKDVKNALKYGRDSISLDSVISTIKCKE-LELKSKNKGNGGVESL
        + L+++L+   M+       +L+ F  L  +LA  G ++  E +A +L+N LP +Y ++   + +G+ +I L  V S +   E +  K +N+G    ++L
Subjt:  ICLRERLFTYWMNSSKTLDENLDEFKKLTAELAAAGERLGSESEATILINYLPEAYKDVKNALKYGRDSISLDSVISTIKCKE-LELKSKNKGNGGVESL

Query:  FTKGKGHS-KKTRFSKGQKNFKGKA-------ALKCFTC------QRDCPQRRKVNLRKGRKHGRGDVSIGENTFEYTEVLATTGEKTI
         T+G+G S +++  + G+   +GK+          C+ C      +RDCP  RK         G+G+ S G+   + T  +    +  +
Subjt:  FTKGKGHS-KKTRFSKGQKNFKGKA-------ALKCFTC------QRDCPQRRKVNLRKGRKHGRGDVSIGENTFEYTEVLATTGEKTI

Arabidopsis top hits (e value, % identity, alignment)
No hits found
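
In the alignments above, the unlabeled line between each Query and Subjt row is the BLAST-style match line: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch of how to read it, the hypothetical helper below counts both for a short fragment taken from the start of the first GenBank alignment. The result describes this 12-residue fragment only and is not the % identity reported for the full hit.

```python
def midline_stats(midline: str):
    """Count identities (letters) and positives (letters plus '+') in a match line."""
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives


# First 12 aligned columns of the KAA0040138.1 alignment.
query = "MTTTNYEIERFN"
midline = "M TT +EIERFN"

identities, positives = midline_stats(midline)
pct_identity = 100.0 * identities / len(query)
print(identities, positives, pct_identity)
```

Leading and trailing spaces in a match line are significant, so a line should be compared against its Query row at full width before counting.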

Sequences
CDS sequence
ATGACTACAACCAACTATGAGATTGAAAGATTCAATGACAATAATGATTTCAGTTTATGGTGCAGGAGAACTCAAATCATTCTTCAACAACAATCTGCACTCAAAGCCTT
GGAGGATCCCAAAAGCCTACCTGATGCAATCACTGATGAGGAGAAAAAGAAAATGGAAGAAATCGCCTATGGTACTCTTCTTATGAATGTTACTGGCAATATTTTTAGAC
AAGTTATGGACGAGTCTGCTGCTTTTAGTGCTTGGAAAAAATTACAAGCCCTTTATGTCAAGAAAGATATACCAAACAAAATTTGTCTTAGAGAAAGATTGTTCACTTAT
TGGATGAATAGCTCTAAAACACTTGATGAAAATCTTGATGAATTCAAGAAACTCACAGCTGAATTGGCAGCTGCTGGAGAAAGATTGGGAAGTGAAAGTGAAGCTACGAT
CTTGATAAATTATCTACCTGAGGCATATAAAGATGTAAAGAATGCACTAAAATATGGAAGAGATTCCATTTCTCTAGATTCTGTCATATCAACAATAAAATGTAAAGAAC
TCGAGCTGAAATCAAAAAACAAAGGAAATGGCGGAGTTGAATCTCTCTTTACAAAAGGTAAGGGGCATTCCAAGAAAACCAGATTCTCAAAGGGGCAAAAGAACTTCAAA
GGAAAGGCTGCTTTAAAGTGTTTTACCTGCCAACGTGATTGCCCTCAGAGAAGGAAGGTAAATCTCAGAAAAGGCAGGAAACATGGTAGAGGGGATGTCTCTATTGGAGA
AAACACTTTTGAATATACAGAAGTGTTAGCAACCACTGGGGAAAAGACCATAAAACAGGATTTAGGAAAATTGAAAGAAGACTAG
mRNA sequence
ATGACTACAACCAACTATGAGATTGAAAGATTCAATGACAATAATGATTTCAGTTTATGGTGCAGGAGAACTCAAATCATTCTTCAACAACAATCTGCACTCAAAGCCTT
GGAGGATCCCAAAAGCCTACCTGATGCAATCACTGATGAGGAGAAAAAGAAAATGGAAGAAATCGCCTATGGTACTCTTCTTATGAATGTTACTGGCAATATTTTTAGAC
AAGTTATGGACGAGTCTGCTGCTTTTAGTGCTTGGAAAAAATTACAAGCCCTTTATGTCAAGAAAGATATACCAAACAAAATTTGTCTTAGAGAAAGATTGTTCACTTAT
TGGATGAATAGCTCTAAAACACTTGATGAAAATCTTGATGAATTCAAGAAACTCACAGCTGAATTGGCAGCTGCTGGAGAAAGATTGGGAAGTGAAAGTGAAGCTACGAT
CTTGATAAATTATCTACCTGAGGCATATAAAGATGTAAAGAATGCACTAAAATATGGAAGAGATTCCATTTCTCTAGATTCTGTCATATCAACAATAAAATGTAAAGAAC
TCGAGCTGAAATCAAAAAACAAAGGAAATGGCGGAGTTGAATCTCTCTTTACAAAAGGTAAGGGGCATTCCAAGAAAACCAGATTCTCAAAGGGGCAAAAGAACTTCAAA
GGAAAGGCTGCTTTAAAGTGTTTTACCTGCCAACGTGATTGCCCTCAGAGAAGGAAGGTAAATCTCAGAAAAGGCAGGAAACATGGTAGAGGGGATGTCTCTATTGGAGA
AAACACTTTTGAATATACAGAAGTGTTAGCAACCACTGGGGAAAAGACCATAAAACAGGATTTAGGAAAATTGAAAGAAGACTAGGTCCTTGACTCAGGGTGTACCTACC
ACATGACCTACTCAAAAGAATGGTTTGTGACTTATAAACCATGTGAGGGAGGTGTTATTTTCATGGGAGATAATCATGGTTGTAAAGTTGTGGGAATTGGTTCAGAGGGA
TTGAAGCTTAAAGACAATAGAGATATTCTATTAAGGAATGTGAGACATGTTCCTGACCTAAGGAGAAATCTAATCTCCATTGGAATCTTAGATGATCAAGGATGCTCTTT
TAATGGGAAATGTGGAGTTTTCAAATTTACAAAAGGTCCCAAAGAAATTTTGACAGGAGAAAAATGCAATGGCCTTTACATCCTAAAAGATGTTACTCCTCCAAGTTCAA
CTCTTATTACAGAAGATGACAAAGCTGAAGAAATCGAATTGTGGCATAAGACGTTGTCTCACATAAGTGAAAATGGGCTAAAAGAACTTCTAAAACAAGGTTTAATTAAA
ACCCGAGGTAATAAACGATTGAGATTTTGTGAACACTGTGTTTTTGGAATGCCCAAGAGGTTGAAATTCACTAAAGGAGAACACACAACAAAATCCATTTTGAGCTATGT
ACATGTTGATCTATGGGGACCCTCACGAACTCCTTCATTAGATGGTTCCAGTAAAGGGAATTTCTTATTGACCAACCTTTAAGAGATAGGGGGACCCAATTCGACGATGA
AGGATAATGTGACTGAGGAAACCACTATTGTTACTTTCAATGAGGCCATTCAGGATGACCGTCGAAATAGAAGTACTTAGAGCTGAGCCGCACACTTTGGATCAAGCCAT
TGCCACTGCCTATAAACAGGCAAATCTTAATGTCAAGCTTGGACTGAAGACTTTTAAGGGTTGCAAACTCTCAAACCCTAGCCAGAACACTACTTCCAAAAACCTTGTGA
ATCTTGTCAACCTAACCATGTCTGACTCCGGGCCAGCAGTGAACGAAGTGTTAAAGTAGAGAAAGCTAGCGTTCCGTACTCAGTCGACCAAGCAAAACTCCACTAAGCTA
ATAGCTCTAATTGTGGGATATCTAACATTCTACTAAATAGTGCATTATATTAATACAATGTGTATATTTTATCATTTCATAA
Protein sequence
MTTTNYEIERFNDNNDFSLWCRRTQIILQQQSALKALEDPKSLPDAITDEEKKKMEEIAYGTLLMNVTGNIFRQVMDESAAFSAWKKLQALYVKKDIPNKICLRERLFTY
WMNSSKTLDENLDEFKKLTAELAAAGERLGSESEATILINYLPEAYKDVKNALKYGRDSISLDSVISTIKCKELELKSKNKGNGGVESLFTKGKGHSKKTRFSKGQKNFK
GKAALKCFTCQRDCPQRRKVNLRKGRKHGRGDVSIGENTFEYTEVLATTGEKTIKQDLGKLKED
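
The protein sequence corresponds to the CDS translated codon by codon. The sketch below illustrates this with the standard genetic code, which is an assumption here because the page does not name the translation table. The helper `translate` is hypothetical, and only the first and last codons of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGACTACAACCAACTATGAGATT"  # first 8 codons of the CDS
cds_end = "GGAAAATTGAAAGAAGACTAG"       # last 7 codons, including the stop codon

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MTTTNYEI and GKLKED, matching the first and last residues of the protein sequence listed above.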