; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005799 (gene) of Snake gourd v1 genome

Gene IDTan0005799
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG02:63020702..63022926
RNA-Seq ExpressionTan0005799
SyntenyTan0005799
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0045342.1 gag/pol protein [Cucumis melo var. makuwa]7.4e-4451.74Show/hide
Query:  RGSTSGTKYVVSSSPKGKRKKMKKGKADRA---ASRKGKKTRKLQRKVSVSTAMGRTTRSVTVSSSLPTGRKKGISSWQQLREAEITLRVGIEEVVSAAA
        R    G    V++  KGK K   KGK       A  K    + L +K     A         V SSL     +  SS++QL E+E+TL+VG  +V+SA A
Subjt:  RGSTSGTKYVVSSSPKGKRKKMKKGKADRA---ASRKGKKTRKLQRKVSVSTAMGRTTRSVTVSSSLPTGRKKGISSWQQLREAEITLRVGIEEVVSAAA

Query:  IDMVKLYGYIYLMHRKSETLEKFKEYKTEVENLLAPCMPQQNGVSERRNRTLLDMVRSMMSYARLPKSFWGYVVETAVHILNNVPSNSVCEIPFELWNGR
        +   K YGY+YLM  KSE LEKFKEYK EVENLL P +PQQN VSERRNR LLD+VRSMM+YA+LP  FWGY +ETAVHILNNVPS SV E P ELW GR
Subjt:  IDMVKLYGYIYLMHRKSETLEKFKEYKTEVENLLAPCMPQQNGVSERRNRTLLDMVRSMMSYARLPKSFWGYVVETAVHILNNVPSNSVCEIPFELWNGR

Query:  KGSLHHFRIWVGDLILDTLCGCDRFILDTN
        K SL HFRIW          GC   +L TN
Subjt:  KGSLHHFRIWVGDLILDTLCGCDRFILDTN

KAA0050621.1 gag/pol protein [Cucumis melo var. makuwa]1.1e-4454.42Show/hide
Query:  GSTSGTKYVVSSSPKGKRKKMKKGKADRAASRKGKKTRKLQRKVSVSTAMGRTTRSVTVSSSLPTGRKKGISSWQQLREAEITLRVGIEEVVSAAAIDMV
        G   G+   V    KGK K + KGK       +  KT   +  V      G T     V SSL     +  SS++QL E+E+TL+VG   V+SA A+   
Subjt:  GSTSGTKYVVSSSPKGKRKKMKKGKADRAASRKGKKTRKLQRKVSVSTAMGRTTRSVTVSSSLPTGRKKGISSWQQLREAEITLRVGIEEVVSAAAIDMV

Query:  KLYGYIYLMHRKSETLEKFKEYKTEVENLLAPCMPQQNGVSERRNRTLLDMVRSMMSYARLPKSFWGYVVETAVHILNNVPSNSVCEIPFELWNGRKGSL
        K YGY+YLM  KSE LEKFKEYKTEVENLL    PQQNGVSE RN+TLLDMVRSMMSYA+LP SFWGY VETAVHILNNVPS SV E  FELW GRK SL
Subjt:  KLYGYIYLMHRKSETLEKFKEYKTEVENLLAPCMPQQNGVSERRNRTLLDMVRSMMSYARLPKSFWGYVVETAVHILNNVPSNSVCEIPFELWNGRKGSL

Query:  HHFRIWVGDLILDTLCGCDRFILDTN
         HFRIW          GC   +L TN
Subjt:  HHFRIWVGDLILDTLCGCDRFILDTN

KAA0058278.1 gag/pol protein [Cucumis melo var. makuwa]2.8e-4367.59Show/hide
Query:  ITLRVGIEEVVSAAAIDMVKLYGYIYLMHRKSETLEKFKEYKTEVENLLAPCMPQQNGVSERRNRTLLDMVRSMMSYARLPKSFWGYVVETAVHILNNVP
        +  R G E  +S   ID    YGY+YLM  KSE LEKFKEYKTEVENLL P  PQQNGVSERRNRTLLDMVRSMM YA+LP SFWGY VETAVHILNNVP
Subjt:  ITLRVGIEEVVSAAAIDMVKLYGYIYLMHRKSETLEKFKEYKTEVENLLAPCMPQQNGVSERRNRTLLDMVRSMMSYARLPKSFWGYVVETAVHILNNVP

Query:  SNSVCEIPFELWNGRKGSLHHFRIWVGDLILDTLCGCDRFILDTN
        S SV EIPFELW  RK SL HFRIW          GC   +L TN
Subjt:  SNSVCEIPFELWNGRKGSLHHFRIWVGDLILDTLCGCDRFILDTN

KAA0062925.1 gag/pol protein [Cucumis melo var. makuwa]8.1e-4351.07Show/hide
Query:  KYVVSSSPKGKRKKMKKGKA-DRAASRKGKKTRKLQRKVSVSTAMGRTTRSVTVSSSLPTGRKK------------GISSWQQLREAEITLRVGIEEVVS
        ++V SSS   K +K K+GK      + +GK+  K+  K             +     L   ++K              SS +QL E+E+TL+VG  +V+S
Subjt:  KYVVSSSPKGKRKKMKKGKA-DRAASRKGKKTRKLQRKVSVSTAMGRTTRSVTVSSSLPTGRKK------------GISSWQQLREAEITLRVGIEEVVS

Query:  AAAIDMVKLYGYIYLMHRKSETLEKFKEYKTEVENLLAPCMPQQNGVSERRNRTLLDMVRSMMSYARLPKSFWGYVVETAVHILNNVPSNSVCEIPFELW
        A A+   K YGY+YLM  KSE LEKFKEYK EVENLL P  PQQNGVSERRNR LLDMVRSMMSYA+LP SFWGY VETAVHILNNVPS SV E  F+LW
Subjt:  AAAIDMVKLYGYIYLMHRKSETLEKFKEYKTEVENLLAPCMPQQNGVSERRNRTLLDMVRSMMSYARLPKSFWGYVVETAVHILNNVPSNSVCEIPFELW

Query:  NGRKGSLHHFRIWVGDLILDTLCGCDRFILDTN
         GRK SL HFRIW          GC   +L TN
Subjt:  NGRKGSLHHFRIWVGDLILDTLCGCDRFILDTN

KAA0064189.1 gag/pol protein [Cucumis melo var. makuwa]8.1e-4349Show/hide
Query:  PEVEAKVAFRSFHRGSTSGTKYVVSSSPKGKRKKMKKGKADR---AASRKGKKTRKLQRKVSVSTAMGRTTRSVTVSSSLPTGRKKGISSWQQLREAEIT
        P ++ K      H    S  +++ SSS   K +K K+GK      A   KGK    ++RK      +    ++      +    K+  SS++QL E+E+T
Subjt:  PEVEAKVAFRSFHRGSTSGTKYVVSSSPKGKRKKMKKGKADR---AASRKGKKTRKLQRKVSVSTAMGRTTRSVTVSSSLPTGRKKGISSWQQLREAEIT

Query:  LRVGIEEVVSAAAIDMVKLYGYIYLMHRKSETLEKFKEYKTE---VEN-----LLAPCMPQQNGVSERRNRTLLDMVRSMMSYARLPKSFWGYVVETAVH
        L+VG  +V+SA A+   K YGY+YLM  KSE LEKFKEYKTE   +E+     L AP  PQQNGVSERRNRTLLDMVRSMMSYA+LP SFWGY VET VH
Subjt:  LRVGIEEVVSAAAIDMVKLYGYIYLMHRKSETLEKFKEYKTE---VEN-----LLAPCMPQQNGVSERRNRTLLDMVRSMMSYARLPKSFWGYVVETAVH

Query:  ILNNVPSNSVCEIPFELWNGRKGSLHHFRIWVGDLILDTLCGCDRFILDTN
        ILNNVPS SV   PFELW GRK SL HFRIW          GC   +L TN
Subjt:  ILNNVPSNSVCEIPFELWNGRKGSLHHFRIWVGDLILDTLCGCDRFILDTN

TrEMBL top hitse value%identityAlignment
A0A5A7TPP4 Gag/pol protein3.6e-4451.74Show/hide
Query:  RGSTSGTKYVVSSSPKGKRKKMKKGKADRA---ASRKGKKTRKLQRKVSVSTAMGRTTRSVTVSSSLPTGRKKGISSWQQLREAEITLRVGIEEVVSAAA
        R    G    V++  KGK K   KGK       A  K    + L +K     A         V SSL     +  SS++QL E+E+TL+VG  +V+SA A
Subjt:  RGSTSGTKYVVSSSPKGKRKKMKKGKADRA---ASRKGKKTRKLQRKVSVSTAMGRTTRSVTVSSSLPTGRKKGISSWQQLREAEITLRVGIEEVVSAAA

Query:  IDMVKLYGYIYLMHRKSETLEKFKEYKTEVENLLAPCMPQQNGVSERRNRTLLDMVRSMMSYARLPKSFWGYVVETAVHILNNVPSNSVCEIPFELWNGR
        +   K YGY+YLM  KSE LEKFKEYK EVENLL P +PQQN VSERRNR LLD+VRSMM+YA+LP  FWGY +ETAVHILNNVPS SV E P ELW GR
Subjt:  IDMVKLYGYIYLMHRKSETLEKFKEYKTEVENLLAPCMPQQNGVSERRNRTLLDMVRSMMSYARLPKSFWGYVVETAVHILNNVPSNSVCEIPFELWNGR

Query:  KGSLHHFRIWVGDLILDTLCGCDRFILDTN
        K SL HFRIW          GC   +L TN
Subjt:  KGSLHHFRIWVGDLILDTLCGCDRFILDTN

A0A5A7UAP2 Gag/pol protein5.5e-4554.42Show/hide
Query:  GSTSGTKYVVSSSPKGKRKKMKKGKADRAASRKGKKTRKLQRKVSVSTAMGRTTRSVTVSSSLPTGRKKGISSWQQLREAEITLRVGIEEVVSAAAIDMV
        G   G+   V    KGK K + KGK       +  KT   +  V      G T     V SSL     +  SS++QL E+E+TL+VG   V+SA A+   
Subjt:  GSTSGTKYVVSSSPKGKRKKMKKGKADRAASRKGKKTRKLQRKVSVSTAMGRTTRSVTVSSSLPTGRKKGISSWQQLREAEITLRVGIEEVVSAAAIDMV

Query:  KLYGYIYLMHRKSETLEKFKEYKTEVENLLAPCMPQQNGVSERRNRTLLDMVRSMMSYARLPKSFWGYVVETAVHILNNVPSNSVCEIPFELWNGRKGSL
        K YGY+YLM  KSE LEKFKEYKTEVENLL    PQQNGVSE RN+TLLDMVRSMMSYA+LP SFWGY VETAVHILNNVPS SV E  FELW GRK SL
Subjt:  KLYGYIYLMHRKSETLEKFKEYKTEVENLLAPCMPQQNGVSERRNRTLLDMVRSMMSYARLPKSFWGYVVETAVHILNNVPSNSVCEIPFELWNGRKGSL

Query:  HHFRIWVGDLILDTLCGCDRFILDTN
         HFRIW          GC   +L TN
Subjt:  HHFRIWVGDLILDTLCGCDRFILDTN

A0A5A7USZ2 Gag/pol protein1.4e-4367.59Show/hide
Query:  ITLRVGIEEVVSAAAIDMVKLYGYIYLMHRKSETLEKFKEYKTEVENLLAPCMPQQNGVSERRNRTLLDMVRSMMSYARLPKSFWGYVVETAVHILNNVP
        +  R G E  +S   ID    YGY+YLM  KSE LEKFKEYKTEVENLL P  PQQNGVSERRNRTLLDMVRSMM YA+LP SFWGY VETAVHILNNVP
Subjt:  ITLRVGIEEVVSAAAIDMVKLYGYIYLMHRKSETLEKFKEYKTEVENLLAPCMPQQNGVSERRNRTLLDMVRSMMSYARLPKSFWGYVVETAVHILNNVP

Query:  SNSVCEIPFELWNGRKGSLHHFRIWVGDLILDTLCGCDRFILDTN
        S SV EIPFELW  RK SL HFRIW          GC   +L TN
Subjt:  SNSVCEIPFELWNGRKGSLHHFRIWVGDLILDTLCGCDRFILDTN

A0A5A7VAK2 Gag/pol protein3.9e-4351.07Show/hide
Query:  KYVVSSSPKGKRKKMKKGKA-DRAASRKGKKTRKLQRKVSVSTAMGRTTRSVTVSSSLPTGRKK------------GISSWQQLREAEITLRVGIEEVVS
        ++V SSS   K +K K+GK      + +GK+  K+  K             +     L   ++K              SS +QL E+E+TL+VG  +V+S
Subjt:  KYVVSSSPKGKRKKMKKGKA-DRAASRKGKKTRKLQRKVSVSTAMGRTTRSVTVSSSLPTGRKK------------GISSWQQLREAEITLRVGIEEVVS

Query:  AAAIDMVKLYGYIYLMHRKSETLEKFKEYKTEVENLLAPCMPQQNGVSERRNRTLLDMVRSMMSYARLPKSFWGYVVETAVHILNNVPSNSVCEIPFELW
        A A+   K YGY+YLM  KSE LEKFKEYK EVENLL P  PQQNGVSERRNR LLDMVRSMMSYA+LP SFWGY VETAVHILNNVPS SV E  F+LW
Subjt:  AAAIDMVKLYGYIYLMHRKSETLEKFKEYKTEVENLLAPCMPQQNGVSERRNRTLLDMVRSMMSYARLPKSFWGYVVETAVHILNNVPSNSVCEIPFELW

Query:  NGRKGSLHHFRIWVGDLILDTLCGCDRFILDTN
         GRK SL HFRIW          GC   +L TN
Subjt:  NGRKGSLHHFRIWVGDLILDTLCGCDRFILDTN

A0A5A7VF82 Gag/pol protein3.9e-4349Show/hide
Query:  PEVEAKVAFRSFHRGSTSGTKYVVSSSPKGKRKKMKKGKADR---AASRKGKKTRKLQRKVSVSTAMGRTTRSVTVSSSLPTGRKKGISSWQQLREAEIT
        P ++ K      H    S  +++ SSS   K +K K+GK      A   KGK    ++RK      +    ++      +    K+  SS++QL E+E+T
Subjt:  PEVEAKVAFRSFHRGSTSGTKYVVSSSPKGKRKKMKKGKADR---AASRKGKKTRKLQRKVSVSTAMGRTTRSVTVSSSLPTGRKKGISSWQQLREAEIT

Query:  LRVGIEEVVSAAAIDMVKLYGYIYLMHRKSETLEKFKEYKTE---VEN-----LLAPCMPQQNGVSERRNRTLLDMVRSMMSYARLPKSFWGYVVETAVH
        L+VG  +V+SA A+   K YGY+YLM  KSE LEKFKEYKTE   +E+     L AP  PQQNGVSERRNRTLLDMVRSMMSYA+LP SFWGY VET VH
Subjt:  LRVGIEEVVSAAAIDMVKLYGYIYLMHRKSETLEKFKEYKTE---VEN-----LLAPCMPQQNGVSERRNRTLLDMVRSMMSYARLPKSFWGYVVETAVH

Query:  ILNNVPSNSVCEIPFELWNGRKGSLHHFRIWVGDLILDTLCGCDRFILDTN
        ILNNVPS SV   PFELW GRK SL HFRIW          GC   +L TN
Subjt:  ILNNVPSNSVCEIPFELWNGRKGSLHHFRIWVGDLILDTLCGCDRFILDTN

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.4e-1337.04Show/hide
Query:  YIYLMHRK---SETLEKFKEYKTEVENLLAPCMPQQNGVSERRNRTLLDMVRSMMSYARLPKSFWGYVVETAVHILNNVPSNSVCE---IPFELWNGRKG
        Y+Y+ + +   S  + +F   K    +L  P  PQ NGVSER  RT+ +  R+M+S A+L KSFWG  V TA +++N +PS ++ +    P+E+W+ +K 
Subjt:  YIYLMHRK---SETLEKFKEYKTEVENLLAPCMPQQNGVSERRNRTLLDMVRSMMSYARLPKSFWGYVVETAVHILNNVPSNSVCE---IPFELWNGRKG

Query:  SLHHFRIW
         L H R++
Subjt:  SLHHFRIW

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.5e-1531.65Show/hide
Query:  TLEKFKEYKTE---VENLLAPCMPQQNGVSERRNRTLLDMVRSMMSYARLPKSFWGYVVETAVHILNNVPSNSVC-EIPFELWNGRKGSLHHFRIWVGDL
        T  +F+EY +          P  PQ NGV+ER NRT+++ VRSM+  A+LPKSFWG  V+TA +++N  PS  +  EIP  +W  ++ S  H +++    
Subjt:  TLEKFKEYKTE---VENLLAPCMPQQNGVSERRNRTLLDMVRSMMSYARLPKSFWGYVVETAVHILNNVPSNSVC-EIPFELWNGRKGSLHHFRIWVGDL

Query:  ILDTLCGCDRFILDTNEGQNQV------------GSWELNYARWNSLVRAFSETRGVV
              GC  F     E + ++            G  E  Y  W+ + +    +R VV
Subjt:  ILDTLCGCDRFILDTNEGQNQV------------GSWELNYARWNSLVRAFSETRGVV

P92512 Uncharacterized mitochondrial protein AtMg007102.0e-0444Show/hide
Query:  NRTLLDMVRSMMSYARLPKSFWGYVVETAVHILNNVPSNSV-CEIPFELW
        NRT+++ VRSM+    LPK+F      TAVHI+N  PS ++   +P E+W
Subjt:  NRTLLDMVRSMMSYARLPKSFWGYVVETAVHILNNVPSNSV-CEIPFELW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.2e-0623.08Show/hide
Query:  IDMVKLYGYIYLMHRKSETLEKFKEYKTEVENLL--------------------------------APCMPQQNGVSERRNRTLLDMVRSMMSYARLPKS
        +D    Y ++Y + +KS+  E F  +K  +EN                                   P  P+ NG+SER++R +++   +++S+A +PK+
Subjt:  IDMVKLYGYIYLMHRKSETLEKFKEYKTEVENLL--------------------------------APCMPQQNGVSERRNRTLLDMVRSMMSYARLPKS

Query:  FWGYVVETAVHILNNVPSNSV-CEIPFELWNGRKGSLHHFRIW
        +W Y    AV+++N +P+  +  E PF+   G   +    R++
Subjt:  FWGYVVETAVHILNNVPSNSV-CEIPFELWNGRKGSLHHFRIW

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.7e-0622.38Show/hide
Query:  IDMVKLYGYIYLMHRKSETLEKFKEYKTEVENLL--------------------------------APCMPQQNGVSERRNRTLLDMVRSMMSYARLPKS
        +D    Y ++Y + +KS+  + F  +K+ VEN                                   P  P+ NG+SER++R +++M  +++S+A +PK+
Subjt:  IDMVKLYGYIYLMHRKSETLEKFKEYKTEVENLL--------------------------------APCMPQQNGVSERRNRTLLDMVRSMMSYARLPKS

Query:  FWGYVVETAVHILNNVPSNSV-CEIPFELWNGRKGSLHHFRIW
        +W Y    AV+++N +P+  +  + PF+   G+  +    +++
Subjt:  FWGYVVETAVHILNNVPSNSV-CEIPFELWNGRKGSLHHFRIW

Arabidopsis top hitse value%identityAlignment
ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.4e-0544Show/hide
Query:  NRTLLDMVRSMMSYARLPKSFWGYVVETAVHILNNVPSNSV-CEIPFELW
        NRT+++ VRSM+    LPK+F      TAVHI+N  PS ++   +P E+W
Subjt:  NRTLLDMVRSMMSYARLPKSFWGYVVETAVHILNNVPSNSV-CEIPFELW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGATCAGAACACCAGAAGTTGAGGCCAAAGTTGCCTTCAGGTCCTTTCACAGGGGTTCAACCTCTGGGACTAAATATGTAGTCTCTTCAAGCCCAAAAGGGAAGAG
GAAAAAGATGAAGAAGGGCAAAGCTGACCGTGCTGCCTCCCGAAAGGGCAAAAAGACAAGGAAGTTGCAGAGAAAGGTAAGTGTTTCCACTGCAATGGGGAGGACCACTA
GAAGTGTAACTGTCTCAAGTTCCTTGCCGACAGGAAGAAAGAAGGGAATTAGCTCCTGGCAACAGCTGCGAGAGGCTGAGATAACACTACGGGTTGGAATCGAGGAGGTT
GTCTCTGCTGCAGCGATTGACATGGTGAAGCTGTATGGGTATATTTACCTAATGCATAGGAAGTCTGAAACTCTTGAAAAGTTCAAGGAGTACAAGACTGAGGTTGAAAA
CCTCTTAGCACCTTGCATGCCACAGCAGAATGGTGTATCAGAGAGGAGAAACAGAACCCTGTTGGACATGGTTCGATCAATGATGAGCTATGCTCGTCTCCCTAAATCTT
TTTGGGGTTATGTAGTGGAGACTGCAGTTCATATTTTGAACAACGTTCCATCGAATAGCGTTTGTGAAATACCTTTCGAACTCTGGAATGGACGTAAAGGCAGTTTACAC
CATTTCAGAATTTGGGTTGGAGACCTAATCTTGGATACACTTTGTGGATGCGATCGCTTTATATTAGATACAAACGAGGGACAGAACCAAGTAGGGAGCTGGGAACTTAA
CTACGCGAGATGGAATTCACTCGTTCGCGCCTTTAGTGAAACTAGAGGGGTTGTTCCCTTAAGTGTTGTCTCCAGGGCTTGGACAACGGGCGCCCACCTTCTCATTGGCC
CGAAAAGGGTGTTGTGTATGGTTGGACCATCACAATATGTTGTTCATTAG
mRNA sequenceShow/hide mRNA sequence
ATGTGGATCAGAACACCAGAAGTTGAGGCCAAAGTTGCCTTCAGGTCCTTTCACAGGGGTTCAACCTCTGGGACTAAATATGTAGTCTCTTCAAGCCCAAAAGGGAAGAG
GAAAAAGATGAAGAAGGGCAAAGCTGACCGTGCTGCCTCCCGAAAGGGCAAAAAGACAAGGAAGTTGCAGAGAAAGGTAAGTGTTTCCACTGCAATGGGGAGGACCACTA
GAAGTGTAACTGTCTCAAGTTCCTTGCCGACAGGAAGAAAGAAGGGAATTAGCTCCTGGCAACAGCTGCGAGAGGCTGAGATAACACTACGGGTTGGAATCGAGGAGGTT
GTCTCTGCTGCAGCGATTGACATGGTGAAGCTGTATGGGTATATTTACCTAATGCATAGGAAGTCTGAAACTCTTGAAAAGTTCAAGGAGTACAAGACTGAGGTTGAAAA
CCTCTTAGCACCTTGCATGCCACAGCAGAATGGTGTATCAGAGAGGAGAAACAGAACCCTGTTGGACATGGTTCGATCAATGATGAGCTATGCTCGTCTCCCTAAATCTT
TTTGGGGTTATGTAGTGGAGACTGCAGTTCATATTTTGAACAACGTTCCATCGAATAGCGTTTGTGAAATACCTTTCGAACTCTGGAATGGACGTAAAGGCAGTTTACAC
CATTTCAGAATTTGGGTTGGAGACCTAATCTTGGATACACTTTGTGGATGCGATCGCTTTATATTAGATACAAACGAGGGACAGAACCAAGTAGGGAGCTGGGAACTTAA
CTACGCGAGATGGAATTCACTCGTTCGCGCCTTTAGTGAAACTAGAGGGGTTGTTCCCTTAAGTGTTGTCTCCAGGGCTTGGACAACGGGCGCCCACCTTCTCATTGGCC
CGAAAAGGGTGTTGTGTATGGTTGGACCATCACAATATGTTGTTCATTAG
Protein sequenceShow/hide protein sequence
MWIRTPEVEAKVAFRSFHRGSTSGTKYVVSSSPKGKRKKMKKGKADRAASRKGKKTRKLQRKVSVSTAMGRTTRSVTVSSSLPTGRKKGISSWQQLREAEITLRVGIEEV
VSAAAIDMVKLYGYIYLMHRKSETLEKFKEYKTEVENLLAPCMPQQNGVSERRNRTLLDMVRSMMSYARLPKSFWGYVVETAVHILNNVPSNSVCEIPFELWNGRKGSLH
HFRIWVGDLILDTLCGCDRFILDTNEGQNQVGSWELNYARWNSLVRAFSETRGVVPLSVVSRAWTTGAHLLIGPKRVLCMVGPSQYVVH