; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0006691 (gene) of Snake gourd v1 genome

Gene IDTan0006691
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPol protein
Genome locationLG09:42668292..42673759
RNA-Seq ExpressionTan0006691
SyntenyTan0006691
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033514.1 pol protein [Cucumis melo var. makuwa]1.3e-6570Show/hide
Query:  GPELVQVTNEAIQKIRARMQTAQSRQKSYVDSRRKDLKFEVGNHVFFKVAPMKGVVRVGRKEKLSPCFIGPFEILKRISPVAYRLALAPSLFSVHNAFHV
        GPELVQ TNEAIQKIR+RM TAQSRQKSY D RRKDL+FEVG+ VF KVAPM+GV+R  R+EKLSP F+GPFEIL+RI PVAYRLAL PSL +VH+ FHV
Subjt:  GPELVQVTNEAIQKIRARMQTAQSRQKSYVDSRRKDLKFEVGNHVFFKVAPMKGVVRVGRKEKLSPCFIGPFEILKRISPVAYRLALAPSLFSVHNAFHV

Query:  SMLRNYVPNPTHMVDFEPLRLDEDLSYEERPVQILARDQKVLRNRTIGLVKVLWRNQQAVEATWQREEEMRANNPQLFQD
        SMLR YVP+P+H+VD+EPL +DE+LSY E+PV++LAR+ K LRN+ I LVKVLWRN +  EATW+RE++MR++ P LF++
Subjt:  SMLRNYVPNPTHMVDFEPLRLDEDLSYEERPVQILARDQKVLRNRTIGLVKVLWRNQQAVEATWQREEEMRANNPQLFQD

KAA0052869.1 pol protein [Cucumis melo var. makuwa]1.3e-6570.56Show/hide
Query:  GPELVQVTNEAIQKIRARMQTAQSRQKSYVDSRRKDLKFEVGNHVFFKVAPMKGVVRVGRKEKLSPCFIGPFEILKRISPVAYRLALAPSLFSVHNAFHV
        GPELVQ TNEAIQKIR+RM TAQSRQKSY D RRKDL+FEVG+ VF KVAPMKGV+R  R+ KLSP F+GPFEIL+RI PVAYRLAL PSL +VHN FHV
Subjt:  GPELVQVTNEAIQKIRARMQTAQSRQKSYVDSRRKDLKFEVGNHVFFKVAPMKGVVRVGRKEKLSPCFIGPFEILKRISPVAYRLALAPSLFSVHNAFHV

Query:  SMLRNYVPNPTHMVDFEPLRLDEDLSYEERPVQILARDQKVLRNRTIGLVKVLWRNQQAVEATWQREEEMRANNPQLFQD
        SMLR YVP+P+H+VD+EPL +DE+LSY E+PV++LAR+ K LRN+ I LVKVLWRN +  EATW+RE++MR+  P+LF++
Subjt:  SMLRNYVPNPTHMVDFEPLRLDEDLSYEERPVQILARDQKVLRNRTIGLVKVLWRNQQAVEATWQREEEMRANNPQLFQD

KAA0061323.1 pol protein [Cucumis melo var. makuwa]2.6e-6670Show/hide
Query:  GPELVQVTNEAIQKIRARMQTAQSRQKSYVDSRRKDLKFEVGNHVFFKVAPMKGVVRVGRKEKLSPCFIGPFEILKRISPVAYRLALAPSLFSVHNAFHV
        GPELVQ TNEAIQKIR+RM TAQSRQKSY D RRKDL+FEVG+ VF KVAPM+GV+R  R+ KLSPCF+GPFEIL+RI PVAYRLAL PSL +VH+ FHV
Subjt:  GPELVQVTNEAIQKIRARMQTAQSRQKSYVDSRRKDLKFEVGNHVFFKVAPMKGVVRVGRKEKLSPCFIGPFEILKRISPVAYRLALAPSLFSVHNAFHV

Query:  SMLRNYVPNPTHMVDFEPLRLDEDLSYEERPVQILARDQKVLRNRTIGLVKVLWRNQQAVEATWQREEEMRANNPQLFQD
        SMLR YVP+P+H+VD+EPL +DE+LSY E+PV++LAR+ K LRN+ I LVKVLWRN +  EATW+RE++MR+  P+LF++
Subjt:  SMLRNYVPNPTHMVDFEPLRLDEDLSYEERPVQILARDQKVLRNRTIGLVKVLWRNQQAVEATWQREEEMRANNPQLFQD

KAA0061618.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]9.7e-6670.56Show/hide
Query:  GPELVQVTNEAIQKIRARMQTAQSRQKSYVDSRRKDLKFEVGNHVFFKVAPMKGVVRVGRKEKLSPCFIGPFEILKRISPVAYRLALAPSLFSVHNAFHV
        GPELVQ TNEAIQKIR+RM TAQSRQKSY D RRKDL+FEVG++VF KVAPM+GVVR  R+ KLSP F+GPFEIL+RI PVAYRLAL PSL +VH+ FHV
Subjt:  GPELVQVTNEAIQKIRARMQTAQSRQKSYVDSRRKDLKFEVGNHVFFKVAPMKGVVRVGRKEKLSPCFIGPFEILKRISPVAYRLALAPSLFSVHNAFHV

Query:  SMLRNYVPNPTHMVDFEPLRLDEDLSYEERPVQILARDQKVLRNRTIGLVKVLWRNQQAVEATWQREEEMRANNPQLFQD
        SMLR YVPNP+H+VD+EPL +DE+LSY E+PV++LAR+ K LRN+ I LVKVLWRN +  EATW+RE++MR+  P+LF++
Subjt:  SMLRNYVPNPTHMVDFEPLRLDEDLSYEERPVQILARDQKVLRNRTIGLVKVLWRNQQAVEATWQREEEMRANNPQLFQD

KAA0065287.1 pol protein [Cucumis melo var. makuwa]7.4e-6667.2Show/hide
Query:  GPELVQVTNEAIQKIRARMQTAQSRQKSYVDSRRKDLKFEVGNHVFFKVAPMKGVVRVGRKEKLSPCFIGPFEILKRISPVAYRLALAPSLFSVHNAFHV
        GPELVQ TNEAIQKIR+ M TAQSRQKSY D RRKDL+FE+G+ VF KVAPMKGV+R  R+ KLSP F+GPFEIL+RI PVAYRLAL PSL +VH+ FHV
Subjt:  GPELVQVTNEAIQKIRARMQTAQSRQKSYVDSRRKDLKFEVGNHVFFKVAPMKGVVRVGRKEKLSPCFIGPFEILKRISPVAYRLALAPSLFSVHNAFHV

Query:  SMLRNYVPNPTHMVDFEPLRLDEDLSYEERPVQILARDQKVLRNRTIGLVKVLWRNQQAVEATWQREEEMRANNPQLFQDWELSGTKVP
        SMLR YVP+P+++VD+EPL +DE+LSY E+PV++LAR+ K LRNR I LVKVLWRN +  EATW+RE++MR+  P+LF++    G K P
Subjt:  SMLRNYVPNPTHMVDFEPLRLDEDLSYEERPVQILARDQKVLRNRTIGLVKVLWRNQQAVEATWQREEEMRANNPQLFQDWELSGTKVP

TrEMBL top hitse value%identityAlignment
A0A5A7SQQ3 Pol protein6.1e-6670Show/hide
Query:  GPELVQVTNEAIQKIRARMQTAQSRQKSYVDSRRKDLKFEVGNHVFFKVAPMKGVVRVGRKEKLSPCFIGPFEILKRISPVAYRLALAPSLFSVHNAFHV
        GPELVQ TNEAIQKIR+RM TAQSRQKSY D RRKDL+FEVG+ VF KVAPM+GV+R  R+EKLSP F+GPFEIL+RI PVAYRLAL PSL +VH+ FHV
Subjt:  GPELVQVTNEAIQKIRARMQTAQSRQKSYVDSRRKDLKFEVGNHVFFKVAPMKGVVRVGRKEKLSPCFIGPFEILKRISPVAYRLALAPSLFSVHNAFHV

Query:  SMLRNYVPNPTHMVDFEPLRLDEDLSYEERPVQILARDQKVLRNRTIGLVKVLWRNQQAVEATWQREEEMRANNPQLFQD
        SMLR YVP+P+H+VD+EPL +DE+LSY E+PV++LAR+ K LRN+ I LVKVLWRN +  EATW+RE++MR++ P LF++
Subjt:  SMLRNYVPNPTHMVDFEPLRLDEDLSYEERPVQILARDQKVLRNRTIGLVKVLWRNQQAVEATWQREEEMRANNPQLFQD

A0A5A7UH56 Pol protein6.1e-6670.56Show/hide
Query:  GPELVQVTNEAIQKIRARMQTAQSRQKSYVDSRRKDLKFEVGNHVFFKVAPMKGVVRVGRKEKLSPCFIGPFEILKRISPVAYRLALAPSLFSVHNAFHV
        GPELVQ TNEAIQKIR+RM TAQSRQKSY D RRKDL+FEVG+ VF KVAPMKGV+R  R+ KLSP F+GPFEIL+RI PVAYRLAL PSL +VHN FHV
Subjt:  GPELVQVTNEAIQKIRARMQTAQSRQKSYVDSRRKDLKFEVGNHVFFKVAPMKGVVRVGRKEKLSPCFIGPFEILKRISPVAYRLALAPSLFSVHNAFHV

Query:  SMLRNYVPNPTHMVDFEPLRLDEDLSYEERPVQILARDQKVLRNRTIGLVKVLWRNQQAVEATWQREEEMRANNPQLFQD
        SMLR YVP+P+H+VD+EPL +DE+LSY E+PV++LAR+ K LRN+ I LVKVLWRN +  EATW+RE++MR+  P+LF++
Subjt:  SMLRNYVPNPTHMVDFEPLRLDEDLSYEERPVQILARDQKVLRNRTIGLVKVLWRNQQAVEATWQREEEMRANNPQLFQD

A0A5A7UZD7 Pol protein1.2e-6670Show/hide
Query:  GPELVQVTNEAIQKIRARMQTAQSRQKSYVDSRRKDLKFEVGNHVFFKVAPMKGVVRVGRKEKLSPCFIGPFEILKRISPVAYRLALAPSLFSVHNAFHV
        GPELVQ TNEAIQKIR+RM TAQSRQKSY D RRKDL+FEVG+ VF KVAPM+GV+R  R+ KLSPCF+GPFEIL+RI PVAYRLAL PSL +VH+ FHV
Subjt:  GPELVQVTNEAIQKIRARMQTAQSRQKSYVDSRRKDLKFEVGNHVFFKVAPMKGVVRVGRKEKLSPCFIGPFEILKRISPVAYRLALAPSLFSVHNAFHV

Query:  SMLRNYVPNPTHMVDFEPLRLDEDLSYEERPVQILARDQKVLRNRTIGLVKVLWRNQQAVEATWQREEEMRANNPQLFQD
        SMLR YVP+P+H+VD+EPL +DE+LSY E+PV++LAR+ K LRN+ I LVKVLWRN +  EATW+RE++MR+  P+LF++
Subjt:  SMLRNYVPNPTHMVDFEPLRLDEDLSYEERPVQILARDQKVLRNRTIGLVKVLWRNQQAVEATWQREEEMRANNPQLFQD

A0A5A7V223 Reverse transcriptase4.7e-6670.56Show/hide
Query:  GPELVQVTNEAIQKIRARMQTAQSRQKSYVDSRRKDLKFEVGNHVFFKVAPMKGVVRVGRKEKLSPCFIGPFEILKRISPVAYRLALAPSLFSVHNAFHV
        GPELVQ TNEAIQKIR+RM TAQSRQKSY D RRKDL+FEVG++VF KVAPM+GVVR  R+ KLSP F+GPFEIL+RI PVAYRLAL PSL +VH+ FHV
Subjt:  GPELVQVTNEAIQKIRARMQTAQSRQKSYVDSRRKDLKFEVGNHVFFKVAPMKGVVRVGRKEKLSPCFIGPFEILKRISPVAYRLALAPSLFSVHNAFHV

Query:  SMLRNYVPNPTHMVDFEPLRLDEDLSYEERPVQILARDQKVLRNRTIGLVKVLWRNQQAVEATWQREEEMRANNPQLFQD
        SMLR YVPNP+H+VD+EPL +DE+LSY E+PV++LAR+ K LRN+ I LVKVLWRN +  EATW+RE++MR+  P+LF++
Subjt:  SMLRNYVPNPTHMVDFEPLRLDEDLSYEERPVQILARDQKVLRNRTIGLVKVLWRNQQAVEATWQREEEMRANNPQLFQD

A0A5A7VDX0 Pol protein3.6e-6667.2Show/hide
Query:  GPELVQVTNEAIQKIRARMQTAQSRQKSYVDSRRKDLKFEVGNHVFFKVAPMKGVVRVGRKEKLSPCFIGPFEILKRISPVAYRLALAPSLFSVHNAFHV
        GPELVQ TNEAIQKIR+ M TAQSRQKSY D RRKDL+FE+G+ VF KVAPMKGV+R  R+ KLSP F+GPFEIL+RI PVAYRLAL PSL +VH+ FHV
Subjt:  GPELVQVTNEAIQKIRARMQTAQSRQKSYVDSRRKDLKFEVGNHVFFKVAPMKGVVRVGRKEKLSPCFIGPFEILKRISPVAYRLALAPSLFSVHNAFHV

Query:  SMLRNYVPNPTHMVDFEPLRLDEDLSYEERPVQILARDQKVLRNRTIGLVKVLWRNQQAVEATWQREEEMRANNPQLFQDWELSGTKVP
        SMLR YVP+P+++VD+EPL +DE+LSY E+PV++LAR+ K LRNR I LVKVLWRN +  EATW+RE++MR+  P+LF++    G K P
Subjt:  SMLRNYVPNPTHMVDFEPLRLDEDLSYEERPVQILARDQKVLRNRTIGLVKVLWRNQQAVEATWQREEEMRANNPQLFQDWELSGTKVP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGTTGGTGAACGCCAGTTATGGCCCCGAGTTAGTCCAAGTTACAAATGAGGCGATACAAAAGATTCGAGCGAGAATGCAAACTGCTCAAAGTAGGCAGAAGAGCTA
TGTTGATTCAAGAAGGAAGGACCTGAAGTTCGAGGTGGGCAACCATGTGTTCTTTAAAGTGGCACCAATGAAGGGGGTTGTGAGGGTTGGTCGTAAAGAAAAGTTGAGTC
CATGCTTTATAGGACCCTTCGAGATTTTGAAGCGGATTAGCCCAGTGGCTTATCGGTTGGCATTGGCGCCGTCCTTATTTTCAGTTCATAATGCCTTCCATGTTTCCATG
TTGAGGAATTACGTGCCTAACCCAACACACATGGTTGATTTTGAACCCTTACGATTGGACGAGGACTTGAGTTATGAGGAACGACCAGTGCAGATCCTCGCCAGGGACCA
AAAGGTTCTCCGTAATCGAACTATCGGTCTGGTCAAGGTTTTATGGCGGAACCAGCAAGCAGTGGAAGCTACTTGGCAACGAGAGGAAGAAATGCGAGCCAATAACCCAC
AGCTGTTCCAGGATTGGGAACTTTCGGGGACGAAAGTTCCTTAG
mRNA sequenceShow/hide mRNA sequence
ATATGAATCACATTCACATTTAAATTTAATATTTGAACTCTTTCAAATATTTAATTCTCTCATTTAATTTAATATGAATCATATTCACATTAAATTTATAATATATAGTT
CCAAAACTATATATTATATCGTATCTATATACATTAAATTTATTCCTATTTATGAATTTGAACATTTCAAATTCAAAACGATCTAAGAATCCTTTACGAGCTAGAAGGTG
GACCTAATGGACCTACAGATCAGAAGCTCCAACGATACAAGATTATTCGGTTAATCTTATTAACCTTCTAATCAACATTCGTTAACTGCGGGAACACTCCACTAAAGTCC
CACAGCTGCACTCTTCTCACTGCAGATATATTTTTGTGTGCACGGATATTGACCAATAATAGCAAATCAAACCTTCACGAATGTTCATAACACTAGCTGGGTCAAATTAC
CATTTTACCCCTGGGTTACATCTTGTGCCTTAAGTACCAGTGCTCCTCTAATGAACAATTTGTTTGTGGTCCTACCAACAAACAGAGTCCCTCTCGGGCAAATGAGAACA
TTCGTTTACTTACCCTAGTAGGCGGGAAGAAGTGAATTTCATCTTGCTAGGGTAAGTCCCCAACTGCTCACTCGGTCTTGTCTCCAAGAAGGTAGGCATATTGAGTCGGC
GAATCTGGCCACTCTCACCCATACTAGTCAAAGGACAATCCCTCGCAAACAGGAGTTCGTAACCTACTCAGGATTGAGATCAAGTTGCCTAGGTCATCCTAGTGAAATAG
AAACCTAACTAGTCAACGGAGTTACATCTAGAGGTTACTATTTCGTGGTCCGGTCTTATGTAATCTCATTACATAGGATATCCCCACTCACATGTCATCTACACGAACAC
GTTAGGATCACAGTGTTTGTGTCATATACAAAGTGGGTCACATCCATAGTTTTACCAGGGTAAGGTACCCAAACCTTATCCCCTTACTATATACCCTTTAGGCTATATCT
CGAACCGAGATCCTTTATATGCACACTACATTCAGTTAAAGATATATTTTACAGCCTTGGATGTTTAGTTTATTGGATTTAGGGTTAAAACAATGGCAATATCGCAAATA
ACGAATAATAACACTTTATTGAATTAATAACAATTTATTACAGTAATTAGAACAAAATTACAAACTATGAGTTTTAGGGCACAAAACCCAACAGACAAACCTCGCTGCAA
CAACTGTGGAAGATCTCACTGGGGCAGGTGTTTGGCCTGAACAAGAGCTTGCTTCAGATGTGGACAGGAGGGACATTTCTCTGGGAATTGCCCAAACAAAGCAATGGACA
GTCACACTCAGGGCTCTAGTTGTGTGGTGCCATTGAAACCAGGGATTCAACTTGGAAGGGCTTAAGCAAGCACATGCCGAGATGCCAACAACTTTGACTCTGTGGTCACA
GGTACACTCCCGATACTTGGGCATCTCGCTTTTGTATTGTTTGATTCAAGGTCAACCCACTCCTTTATTTTTTCTTCTTTTGAGAGCCAAGCTAAGTTAGAATGAGAGCC
TTTATCTTTTATCTTATATGTTGCCACCCCTGCGGGGGTGAGTTTGTATGCCACCGAAAGAGTCAAAGCATGTAAGTTGTCAGTCTCTCGCCACGCGCTGGAGGTAAATT
TGATAGTCTTGGACATGACCGAATTTGATGTCATACTAGGCATGGATTGGTTAGCACGAAACCATACTAGTATAGATTGCTACCTTAAGGAAGTGGTGTTTACACCACCC
TGTCTGAAGAGCTACCGGTTAAGGGGAGTAGGGACAGGGTCACACCCAGAGTGGTGTCTGCACTCAGGGCAAGGAAGTTGATCCACCATGGAGCTTGGGCATGTTAGCTA
GCGTGGTTGACTTGGGGCACGCTAAGGATTCACTATCCTCGGTTCCTGTGGCAAACGAGTTTCCAGACGTTTTTCCTGAGGATCTTTCGGGATTACCTCCTGTGAGGGAG
GTGGATTTCAACATAGAACTTGAGCCAGGGACGACACCTATTTCTAAGGCTCCATACAGAATGGCGCCGGCTGAGTTAAAGGAGGTTAAGTTGCAGCTGCAGGACTTGTT
AGATAAGGGGTATATACGCCCTAGCGTTTCCCCTTGGGGAGCGCCTGTGTTGTTGGTTAAGAAAAAGGATGGGTCTATGCGTCTGTGCATAGACTATAGGGAGCTTAATA
AGGTCACAGTCAAGAATCGGTATCCTCTCCCCTGAATAGATGACCTTTTTGATCAGTTGTGAGGAGCTTCTATTTTCTCAAAGATTGATCTCAGGTCGGGTTACCACCAG
TTGAGAATCAGAGAGGAAGATATTCCCAAGTCTGCCTTAAGGTCACGTTATGGACATTACGAGTTCACTGTAATGTCATTTGGATTTACCAATGCGCCTGCAGTTTTAAT
GGACCTCATGAACCGAGTATTCAAGGAATTCCTAGATACGTTTGTTATAGTATTTATAGACGATATCCTGGTTTATTCAAGGTCCGAAGAATAGCACAGGGAACACCTTC
AAAGGGTTTTAGAGACCTTGCGAGAGAACAAGTTGTTTACAAAGTTCTCGAAGTGCGAGTTTTGGCTTCGTCAGGTGTCATTTTTGGGACACGTGGTGTCCAAGGTAGGG
ATCTTTGTAGATCCTGCTAAAGTTGATGCAGTTATGCAGTGGCCTCGTCTGTCGACGGCTACAGAGGTACGTAGTTTTCTTGGACTTGCAGGCTATTATAGACGCTTTGT
GCAAGATTTCTCCACGTTAGCAATGCCCTTAACATAGTTGACTAGGAAGGGTACCGCGTTTGTTTGGGATGATGTTTGCGAGGAGAGTTTCGAGAAGCTCAAGGAGAGCT
TAGTTACAACACCAGTTCTTACAGTGTCAGATGGTGTGAAAGGTTACGTGATCTACAGTGATGCCTCCTGGAAAGGCTTGGGTTGTGTCCTGATGCAGCATGGGAAGGTG
ATAGCTTATGTGTCTCGCTAGTTGAAGAATCATGAGAGGAATTATCCAACTCACGATTTGGAGTTCGCAGTCGTGGTGTTCGCACTTAAAATATGGAGACATTATCTGTA
GGGAGAGAAAATCCAAATTTTCACAAATCACAAGAGCCTGAAGTATTTATTCACTCAGAAGGAACTGAATATGAGGCAACGCAGGTGGTTAGAACTGGTGAAGGACTATG
ATGTTGAGATCCTGTACCACCCAGGTAAAGCAAATGTAGTTGCAGATGTCCTTAGTAGGAAGCCAGTTCATTCCTCATCTCAAGTCACTGATCAGGAGTTGCAGTCAGAG
TTTGAGCATGCAGAAATTGCAGTTTTGTTGGGCGAAGCAACTGCACGTTTGGCCCGATTGGCAGTTCTACCAACCTTAAGAAAATGTATCATAACTTCTCAACCGACAGA
TCCTTGGCTTATGAAGAAGTTCCGTCAGGTGGGTTCAGAGTAGGAACAAGAGTTTTCCTTGTCCTCAGACGGGGGACTGTTATTTCAGGGAGGGTTGTGTGTCCCTGACG
TTGGAGGACTAAAAAACGAGATTCTAGCAGAGGGTCACAATTCGCCATTTTCTATGCATCCAGGTATCTGGTGGCATAATATGAAGCGGGATGTGGCGGATTATGTCAGT
AGATGTTTGGTTTGCCATCAGGTGAAGCACCAAGACAAAGGCCAACAGGTCTTTTGCAACCCTTCGACGCTCCGCAGTGGAAGTGGGACGAAATTTCTGTGGACTTCATA
GTGGGACTACCCAAGACCTTGAAAGGTTTCAAAGTAGTCTGGGTTATTGTCGACAGGCTGACTAAATCAGCAAACTTTCTTCCAGGGAAGCCCACTTATTCCACGGATAG
ATGGGCTCAGTTGTACATGGAGAAGATAGTCAGGTTAAACGAAGTTCCTGCTAAAATTGTTTCAGATAGAGATCCTCGTTTCACTTCTAAGTTATGGAGGAGTTTGCAGA
AAGCGCTAGGCACTCAGTTGAATTTTAGTACCTCGTTCCATCCTCAAACATACGGTCAGACGGAACGCCTTAACCAGGTGTTGGAGGACATGCTACGAGCCTGAGCTCTC
GACTTCCCAGGCAGTTAGGATGCCCATCTGCATTTGATGGAATTCGCTTATAATAACAGTTACCAAGGAACCATAGGTATGGCGCCGTTTGAGGCATTGTACGGAAGGAG
GTGCAGAACTCCCGTTTACTGAGATGAGGTTGGTGAACGCCAGTTATGGCCCCGAGTTAGTCCAAGTTACAAATGAGGCGATACAAAAGATTCGAGCGAGAATGCAAACT
GCTCAAAGTAGGCAGAAGAGCTATGTTGATTCAAGAAGGAAGGACCTGAAGTTCGAGGTGGGCAACCATGTGTTCTTTAAAGTGGCACCAATGAAGGGGGTTGTGAGGGT
TGGTCGTAAAGAAAAGTTGAGTCCATGCTTTATAGGACCCTTCGAGATTTTGAAGCGGATTAGCCCAGTGGCTTATCGGTTGGCATTGGCGCCGTCCTTATTTTCAGTTC
ATAATGCCTTCCATGTTTCCATGTTGAGGAATTACGTGCCTAACCCAACACACATGGTTGATTTTGAACCCTTACGATTGGACGAGGACTTGAGTTATGAGGAACGACCA
GTGCAGATCCTCGCCAGGGACCAAAAGGTTCTCCGTAATCGAACTATCGGTCTGGTCAAGGTTTTATGGCGGAACCAGCAAGCAGTGGAAGCTACTTGGCAACGAGAGGA
AGAAATGCGAGCCAATAACCCACAGCTGTTCCAGGATTGGGAACTTTCGGGGACGAAAGTTCCTTAGGAGGGAAGATTGTAAGGCCCATGATGTAAAGTGCGTGATTTTG
GAAGTTTGTTTTTTTAAAAAATGGCATGGTTTTGATTTAGTCTTTGGGTTGGTTATGATTTTAAAATGATTGAGGGAAATTCTGGTAGATAGGGACTATAAGGATTTGGA
GCAGAAGGTATTTATTTGTCCTTTTGTGCACGCTTCTCGAGGTAGCCCATCCACATCTCTCTCTCACTTTCTTATTCACGAAACTACAACCCCTTTTTCCCTAGTCGCCA
CATGTCGCACGAGTCAAGTCGTCCGCTACCGTCGCACGCAACCGGGAGTCAGCCGCCGCCTGAAGTCGACTGCTCGAGGGTTCGAGTCGTTCGTTCCTTGGCCGCCATCG
CTGTCGGGATCGCGTCCGTGAAGTTTGTCGTCACGCTCCACCGTTAGTAGGCTCAGACGTTCGCTGCAACGTCGTCGCCGCTGCTGCACTGTCGTGGTCGTCGCCGATTG
TAGTTACGTGAGTGTACCAGAAGCATTTTGGCTTCGTCGTACCCTCACAAATTTTCGAGATCAAAGGGTTGGTTCAGGTTACGTCGAAGCATTTA
Protein sequenceShow/hide protein sequence
MRLVNASYGPELVQVTNEAIQKIRARMQTAQSRQKSYVDSRRKDLKFEVGNHVFFKVAPMKGVVRVGRKEKLSPCFIGPFEILKRISPVAYRLALAPSLFSVHNAFHVSM
LRNYVPNPTHMVDFEPLRLDEDLSYEERPVQILARDQKVLRNRTIGLVKVLWRNQQAVEATWQREEEMRANNPQLFQDWELSGTKVP