; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027983 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027983
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr8:9320215..9321639
RNA-Seq ExpressionLag0027983
SyntenyLag0027983
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BAT72801.1 hypothetical protein VIGAN_01023800 [Vigna angularis var. angularis]3.2e-4643.58Show/hide
Query:  KLAEEEKTTEPEELTGGVEEGTTSNEAEKLNPEPSIPSPTVLVPKSKKKKNSKVKFDKFLDAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEKQLE
        ++ EE    E E      EEG  + E+EK   E     PT+  P+  KK+    +F +FLD F  LH+NIPF++ALEQMP Y KFMK+ L+KK+K +  E
Subjt:  KLAEEEKTTEPEELTGGVEEGTTSNEAEKLNPEPSIPSPTVLVPKSKKKKNSKVKFDKFLDAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEKQLE

Query:  TVYLASTCSARVQQGVPEKLSDPGSFTIPC-----------------------------NFGEIKSTSVRLQLADQSLVSPYGIVENILIKVGRFFLPVD
        T+ L   CSA +QQ +P KL DPGSF IPC                               GE+K T + LQLAD+S+  PYGIVE++L+KV +F  P D
Subjt:  TVYLASTCSARVQQGVPEKLSDPGSFTIPC-----------------------------NFGEIKSTSVRLQLADQSLVSPYGIVENILIKVGRFFLPVD

Query:  FFVLDIKENPAMPIILGRPFLATGRVIIDIERRELIIRVQQEKEVLKAFEDPKNTSD
        F VLD++E+  +PIILGRPFLATGR +ID+E+ +L++RV  EK      E  K+  D
Subjt:  FFVLDIKENPAMPIILGRPFLATGRVIIDIERRELIIRVQQEKEVLKAFEDPKNTSD

XP_014506502.2 uncharacterized protein LOC106766276 [Vigna radiata var. radiata]6.0e-4545.66Show/hide
Query:  PTVLVPKSKKKKNSKVKFDKFLDAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPC--------
        PT+  P+  KK+    +F +FLD F  LH+NIPF++ALEQMP Y KFMK+ L+KK+K +  ET+ L   CSA +QQ +P KL DPGSF IPC        
Subjt:  PTVLVPKSKKKKNSKVKFDKFLDAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPC--------

Query:  ---------------------NFGEIKSTSVRLQLADQSLVSPYGIVENILIKVGRFFLPVDFFVLDIKENPAMPIILGRPFLATGRVIIDIERRELIIR
                               G++K T + LQLAD+S+  PYGIVE++L+KV +F  P DF VLD++E+  +PIILGRPFLATGR +ID+E+ +L++R
Subjt:  ---------------------NFGEIKSTSVRLQLADQSLVSPYGIVENILIKVGRFFLPVDFFVLDIKENPAMPIILGRPFLATGRVIIDIERRELIIR

Query:  VQQEKEVLKAFEDPKNTSD
        V  EK      E  K+  D
Subjt:  VQQEKEVLKAFEDPKNTSD

XP_017441920.1 PREDICTED: uncharacterized protein LOC108347264 [Vigna angularis]1.6e-4546.12Show/hide
Query:  PTVLVPKSKKKKNSKVKFDKFLDAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPC--------
        PT+  P+  KK+    +F +FLD F  LH+NIPF++ALEQMP Y KFMK+ L+KK+K +  ET+ L   CSA +QQ +P KL DPGSF IPC        
Subjt:  PTVLVPKSKKKKNSKVKFDKFLDAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPC--------

Query:  ---------------------NFGEIKSTSVRLQLADQSLVSPYGIVENILIKVGRFFLPVDFFVLDIKENPAMPIILGRPFLATGRVIIDIERRELIIR
                               GE+K T + LQLAD+S+  PYGIVE++L+KV +F  P DF VLD++E+  +PIILGRPFLATGR +ID+E+ +L++R
Subjt:  ---------------------NFGEIKSTSVRLQLADQSLVSPYGIVENILIKVGRFFLPVDFFVLDIKENPAMPIILGRPFLATGRVIIDIERRELIIR

Query:  VQQEKEVLKAFEDPKNTSD
        V  EK      E  K+  D
Subjt:  VQQEKEVLKAFEDPKNTSD

XP_020202387.1 uncharacterized protein LOC109788136 [Cajanus cajan]1.3e-4447.32Show/hide
Query:  PTVLVPKSKKKKNSKVKFDKFLDAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPC--------
        PTV  P+  KKK  + +F +FLD F  LH+NI F++ALEQMP Y KFMK+ L++K+K ++ ET+ L   CSA +QQ +P KL DPGSF IPC        
Subjt:  PTVLVPKSKKKKNSKVKFDKFLDAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPC--------

Query:  ---------------------NFGEIKSTSVRLQLADQSLVSPYGIVENILIKVGRFFLPVDFFVLDIKENPAMPIILGRPFLATGRVIIDIERRELIIR
                               GE+K T + LQLAD+S+  PYG+VE++L+KV +F  P DF VLD++E+  +PIILGRPFLATGR +ID+++ EL++R
Subjt:  ---------------------NFGEIKSTSVRLQLADQSLVSPYGIVENILIKVGRFFLPVDFFVLDIKENPAMPIILGRPFLATGRVIIDIERRELIIR

Query:  VQQEK
        V  EK
Subjt:  VQQEK

XP_039144038.1 uncharacterized protein LOC120281228 [Dioscorea cayenensis subsp. rotundata]1.3e-4438.91Show/hide
Query:  LWKGRKGKAPVEQE----KSSLEFCKVVSVHYEEEIKLAEEEKTTEPEELTGGVEEGTTSNEAEKLNPEPSIPSP-------TVLVPKSKKKKNSKVKFD
        L + ++G  P   E    KS  E C  +++   +E+K+        PE+ +  V++    +  E    E  I +P        +  P+  KK     +F 
Subjt:  LWKGRKGKAPVEQE----KSSLEFCKVVSVHYEEEIKLAEEEKTTEPEELTGGVEEGTTSNEAEKLNPEPSIPSP-------TVLVPKSKKKKNSKVKFD

Query:  KFLDAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPC---------------------------
        KFLD F  LH+NIPF++ALEQMP Y KFMKE L+ K+K K  ETV L   CSA +Q+ +P KL DPGSFTIPC                           
Subjt:  KFLDAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPC---------------------------

Query:  --NFGEIKSTSVRLQLADQSLVSPYGIVENILIKVGRFFLPVDFFVLDIKENPAMPIILGRPFLATGRVIIDIERRELIIRVQQEKEVLKAFE
          N GE + T+V LQLAD+SL  P G++E++L+K+ +F  P DF VLD++E+  +PI+LGRPFLATGR +ID+++ EL +RVQ+E+     F+
Subjt:  --NFGEIKSTSVRLQLADQSLVSPYGIVENILIKVGRFFLPVDFFVLDIKENPAMPIILGRPFLATGRVIIDIERRELIIRVQQEKEVLKAFE

TrEMBL top hitse value%identityAlignment
A0A0S3QWS7 Uncharacterized protein1.5e-4643.58Show/hide
Query:  KLAEEEKTTEPEELTGGVEEGTTSNEAEKLNPEPSIPSPTVLVPKSKKKKNSKVKFDKFLDAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEKQLE
        ++ EE    E E      EEG  + E+EK   E     PT+  P+  KK+    +F +FLD F  LH+NIPF++ALEQMP Y KFMK+ L+KK+K +  E
Subjt:  KLAEEEKTTEPEELTGGVEEGTTSNEAEKLNPEPSIPSPTVLVPKSKKKKNSKVKFDKFLDAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEKQLE

Query:  TVYLASTCSARVQQGVPEKLSDPGSFTIPC-----------------------------NFGEIKSTSVRLQLADQSLVSPYGIVENILIKVGRFFLPVD
        T+ L   CSA +QQ +P KL DPGSF IPC                               GE+K T + LQLAD+S+  PYGIVE++L+KV +F  P D
Subjt:  TVYLASTCSARVQQGVPEKLSDPGSFTIPC-----------------------------NFGEIKSTSVRLQLADQSLVSPYGIVENILIKVGRFFLPVD

Query:  FFVLDIKENPAMPIILGRPFLATGRVIIDIERRELIIRVQQEKEVLKAFEDPKNTSD
        F VLD++E+  +PIILGRPFLATGR +ID+E+ +L++RV  EK      E  K+  D
Subjt:  FFVLDIKENPAMPIILGRPFLATGRVIIDIERRELIIRVQQEKEVLKAFEDPKNTSD

A0A1S3UKE1 uncharacterized protein LOC1067662762.9e-4545.66Show/hide
Query:  PTVLVPKSKKKKNSKVKFDKFLDAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPC--------
        PT+  P+  KK+    +F +FLD F  LH+NIPF++ALEQMP Y KFMK+ L+KK+K +  ET+ L   CSA +QQ +P KL DPGSF IPC        
Subjt:  PTVLVPKSKKKKNSKVKFDKFLDAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPC--------

Query:  ---------------------NFGEIKSTSVRLQLADQSLVSPYGIVENILIKVGRFFLPVDFFVLDIKENPAMPIILGRPFLATGRVIIDIERRELIIR
                               G++K T + LQLAD+S+  PYGIVE++L+KV +F  P DF VLD++E+  +PIILGRPFLATGR +ID+E+ +L++R
Subjt:  ---------------------NFGEIKSTSVRLQLADQSLVSPYGIVENILIKVGRFFLPVDFFVLDIKENPAMPIILGRPFLATGRVIIDIERRELIIR

Query:  VQQEKEVLKAFEDPKNTSD
        V  EK      E  K+  D
Subjt:  VQQEKEVLKAFEDPKNTSD

A0A1S3V057 uncharacterized protein LOC1067704195.5e-4444.91Show/hide
Query:  PTVLVPKSKKKKNSKVKFDKFLDAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPC--------
        PT+  P+  KK+    +F +FLD F  LH+NIPF++ALEQMP Y KFMK+ L+KK+K ++ ET+ L   CSA +QQ +P KL DPGSF +PC        
Subjt:  PTVLVPKSKKKKNSKVKFDKFLDAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPC--------

Query:  ---------------------NFGEIKSTSVRLQLADQSLVSPYGIVENILIKVGRFFLPVDFFVLDIKENPAMPIILGRPFLATGRVIIDIERRELIIR
                               GE+K   + LQLAD+S+  PYGIVE++L+KV +F  P DF VLD++E+  +PIILGRPFLA GR +ID+E+ +L++R
Subjt:  ---------------------NFGEIKSTSVRLQLADQSLVSPYGIVENILIKVGRFFLPVDFFVLDIKENPAMPIILGRPFLATGRVIIDIERRELIIR

Query:  VQQEKEVLKAFEDPKN
        V  EK      E  K+
Subjt:  VQQEKEVLKAFEDPKN

A0A6P4BCZ7 uncharacterized protein LOC1074658175.5e-4439.45Show/hide
Query:  PVEQEKS-SLEFCKVVSVHYEEEIK---LAEEEKTTE--PEELTGGVEEGTTSNEAEKLNPEPSIPS--PTVLVPKSKKKKNSKVKFDKFLDAFMGLHVN
        P + EK+   E CK +++   ++++   + +EE   E   EE+    +E  T  +++KL  + ++ +  P +  P+  K++N + ++ KFL+ F  LH+N
Subjt:  PVEQEKS-SLEFCKVVSVHYEEEIK---LAEEEKTTE--PEELTGGVEEGTTSNEAEKLNPEPSIPS--PTVLVPKSKKKKNSKVKFDKFLDAFMGLHVN

Query:  IPFSDALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPCNFG-----------------------------EIKSTSV
        IPF +ALEQMP Y KFMKE L KK+  K+ +TV +   CSA +Q+ +P+K+ DPGSF IPC  G                             E+K T +
Subjt:  IPFSDALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPCNFG-----------------------------EIKSTSV

Query:  RLQLADQSLVSPYGIVENILIKVGRFFLPVDFFVLDIKENPAMPIILGRPFLATGRVIIDIERRELIIRVQQEKEVLKAFEDPKNTSDT
         LQ+AD+S+    G+VEN+L+KV +FFLPVDF +LDIKE+   PIILGRPFLAT R +ID+E+ EL++RV  E  V   F   KN  D+
Subjt:  RLQLADQSLVSPYGIVENILIKVGRFFLPVDFFVLDIKENPAMPIILGRPFLATGRVIIDIERRELIIRVQQEKEVLKAFEDPKNTSDT

A0A6P4CAM9 uncharacterized protein LOC1074725111.6e-4339.43Show/hide
Query:  EFCKVVSVHYEEEIK---LAEEEKTTE--PEELTGGVEEGTTSNEAEKLNPEPSIPS--PTVLVPKSKKKKNSKVKFDKFLDAFMGLHVNIPFSDALEQM
        E CK +++   ++++   + +EE   E   EE+    +E  T  +++KL  + ++ +  P +  P+  K +N + ++ KFL+ F  LH+NIPF +ALEQM
Subjt:  EFCKVVSVHYEEEIK---LAEEEKTTE--PEELTGGVEEGTTSNEAEKLNPEPSIPS--PTVLVPKSKKKKNSKVKFDKFLDAFMGLHVNIPFSDALEQM

Query:  PHYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPCNFG-----------------------------EIKSTSVRLQLADQSLV
        P Y KFMKE L KK+  K+ +TV +   CSA +Q+ +P+K+ DPGSF IPC  G                             E+K T + LQ+AD+S+ 
Subjt:  PHYRKFMKEWLNKKKKEKQLETVYLASTCSARVQQGVPEKLSDPGSFTIPCNFG-----------------------------EIKSTSVRLQLADQSLV

Query:  SPYGIVENILIKVGRFFLPVDFFVLDIKENPAMPIILGRPFLATGRVIIDIERRELIIRVQQEKEVLKAFEDPKNTSDT
           G+VEN+L+KV +FFLPVDF +LDI+E+   PIILGRPFLAT R +ID+E+ EL++RV +E  V   F   KN  D+
Subjt:  SPYGIVENILIKVGRFFLPVDFFVLDIKENPAMPIILGRPFLATGRVIIDIERRELIIRVQQEKEVLKAFEDPKNTSDT

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.3e-1545.35Show/hide
Query:  QFQDYLIEHGITSQLSAPATPQQNGVSERRNRTLLDMVRSMMSYAQLPDSFWGYAVDTAVYILNMVPSKSV---SETPYELWKGRK
        + + + ++ GI+  L+ P TPQ NGVSER  RT+ +  R+M+S A+L  SFWG AV TA Y++N +PS+++   S+TPYE+W  +K
Subjt:  QFQDYLIEHGITSQLSAPATPQQNGVSERRNRTLLDMVRSMMSYAQLPDSFWGYAVDTAVYILNMVPSKSV---SETPYELWKGRK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.7e-1646.43Show/hide
Query:  QFQDYLIEHGITSQLSAPATPQQNGVSERRNRTLLDMVRSMMSYAQLPDSFWGYAVDTAVYILNMVPSKSVS-ETPYELWKGRK
        +F++Y   HGI  + + P TPQ NGV+ER NRT+++ VRSM+  A+LP SFWG AV TA Y++N  PS  ++ E P  +W  ++
Subjt:  QFQDYLIEHGITSQLSAPATPQQNGVSERRNRTLLDMVRSMMSYAQLPDSFWGYAVDTAVYILNMVPSKSVS-ETPYELWKGRK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.5e-1036Show/hide
Query:  DYLIEHGITSQLSAPATPQQNGVSERRNRTLLDMVRSMMSYAQLPDSFWGYAVDTAVYILNMVPSKSVS-ETPYE
        +Y  +HGI+   S P TP+ NG+SER++R +++   +++S+A +P ++W YA   AVY++N +P+  +  E+P++
Subjt:  DYLIEHGITSQLSAPATPQQNGVSERRNRTLLDMVRSMMSYAQLPDSFWGYAVDTAVYILNMVPSKSVS-ETPYE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.6e-1135.05Show/hide
Query:  QDYLIEHGITSQLSAPATPQQNGVSERRNRTLLDMVRSMMSYAQLPDSFWGYAVDTAVYILNMVPSKSVS-ETPYELWKGRKGKAPVEQEKSSLEFC
        +DYL +HGI+   S P TP+ NG+SER++R +++M  +++S+A +P ++W YA   AVY++N +P+  +  ++P++   G+    P   EK  +  C
Subjt:  QDYLIEHGITSQLSAPATPQQNGVSERRNRTLLDMVRSMMSYAQLPDSFWGYAVDTAVYILNMVPSKSVS-ETPYELWKGRKGKAPVEQEKSSLEFC

Arabidopsis top hitse value%identityAlignment
ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.5e-0442Show/hide
Query:  NRTLLDMVRSMMSYAQLPDSFWGYAVDTAVYILNMVPSKSVS-ETPYELW
        NRT+++ VRSM+    LP +F   A +TAV+I+N  PS +++   P E+W
Subjt:  NRTLLDMVRSMMSYAQLPDSFWGYAVDTAVYILNMVPSKSVS-ETPYELW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTTACAATTCCAAGACTATTTGATAGAACATGGAATTACGTCTCAACTCTCAGCCCCTGCTACACCACAACAAAATGGTGTATCAGAAAGGAGAAACCGAACTCT
GTTAGACATGGTTCGATCAATGATGAGCTATGCTCAGTTGCCTGATTCGTTCTGGGGATATGCAGTTGACACTGCCGTATACATTTTGAACATGGTTCCCTCTAAAAGTG
TCTCTGAAACACCTTATGAGCTATGGAAAGGACGTAAAGGTAAAGCCCCAGTTGAACAGGAGAAATCTTCTTTGGAGTTCTGCAAGGTCGTATCTGTGCATTATGAGGAG
GAGATTAAATTAGCTGAAGAGGAAAAAACAACTGAACCAGAGGAACTCACAGGAGGAGTTGAAGAAGGCACCACCTCAAACGAAGCTGAAAAGCTTAATCCTGAGCCTTC
TATCCCTTCTCCTACTGTTTTAGTTCCTAAATCAAAGAAAAAGAAAAATTCTAAGGTTAAGTTTGACAAATTTTTAGATGCTTTTATGGGTTTGCATGTTAATATTCCTT
TTTCAGATGCCTTGGAGCAGATGCCTCACTACAGAAAATTTATGAAGGAATGGCTCAACAAGAAGAAAAAGGAAAAGCAGTTGGAGACTGTATATCTTGCATCGACGTGC
AGTGCTCGTGTCCAACAGGGAGTACCAGAGAAATTGTCTGACCCAGGGAGTTTTACTATTCCTTGTAATTTTGGAGAAATAAAATCTACCTCTGTTAGACTTCAGTTGGC
TGACCAGTCTCTGGTTAGTCCATATGGGATTGTTGAGAATATTCTGATTAAAGTAGGTAGATTTTTCCTTCCTGTTGATTTCTTTGTGCTAGATATTAAAGAGAATCCTG
CTATGCCTATCATATTAGGGAGACCATTCCTTGCTACAGGGAGGGTTATAATTGATATTGAACGTAGGGAGCTAATCATAAGAGTCCAACAGGAGAAGGAAGTTTTAAAA
GCTTTTGAAGACCCCAAGAACACATCAGATACAATGATGGTGGGCTACAGGAGAGGTGCTAGGAAGAGCACCAGTGATGGAAAATCTGACAGGAGACCACCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACTTACAATTCCAAGACTATTTGATAGAACATGGAATTACGTCTCAACTCTCAGCCCCTGCTACACCACAACAAAATGGTGTATCAGAAAGGAGAAACCGAACTCT
GTTAGACATGGTTCGATCAATGATGAGCTATGCTCAGTTGCCTGATTCGTTCTGGGGATATGCAGTTGACACTGCCGTATACATTTTGAACATGGTTCCCTCTAAAAGTG
TCTCTGAAACACCTTATGAGCTATGGAAAGGACGTAAAGGTAAAGCCCCAGTTGAACAGGAGAAATCTTCTTTGGAGTTCTGCAAGGTCGTATCTGTGCATTATGAGGAG
GAGATTAAATTAGCTGAAGAGGAAAAAACAACTGAACCAGAGGAACTCACAGGAGGAGTTGAAGAAGGCACCACCTCAAACGAAGCTGAAAAGCTTAATCCTGAGCCTTC
TATCCCTTCTCCTACTGTTTTAGTTCCTAAATCAAAGAAAAAGAAAAATTCTAAGGTTAAGTTTGACAAATTTTTAGATGCTTTTATGGGTTTGCATGTTAATATTCCTT
TTTCAGATGCCTTGGAGCAGATGCCTCACTACAGAAAATTTATGAAGGAATGGCTCAACAAGAAGAAAAAGGAAAAGCAGTTGGAGACTGTATATCTTGCATCGACGTGC
AGTGCTCGTGTCCAACAGGGAGTACCAGAGAAATTGTCTGACCCAGGGAGTTTTACTATTCCTTGTAATTTTGGAGAAATAAAATCTACCTCTGTTAGACTTCAGTTGGC
TGACCAGTCTCTGGTTAGTCCATATGGGATTGTTGAGAATATTCTGATTAAAGTAGGTAGATTTTTCCTTCCTGTTGATTTCTTTGTGCTAGATATTAAAGAGAATCCTG
CTATGCCTATCATATTAGGGAGACCATTCCTTGCTACAGGGAGGGTTATAATTGATATTGAACGTAGGGAGCTAATCATAAGAGTCCAACAGGAGAAGGAAGTTTTAAAA
GCTTTTGAAGACCCCAAGAACACATCAGATACAATGATGGTGGGCTACAGGAGAGGTGCTAGGAAGAGCACCAGTGATGGAAAATCTGACAGGAGACCACCCTGA
Protein sequenceShow/hide protein sequence
MDLQFQDYLIEHGITSQLSAPATPQQNGVSERRNRTLLDMVRSMMSYAQLPDSFWGYAVDTAVYILNMVPSKSVSETPYELWKGRKGKAPVEQEKSSLEFCKVVSVHYEE
EIKLAEEEKTTEPEELTGGVEEGTTSNEAEKLNPEPSIPSPTVLVPKSKKKKNSKVKFDKFLDAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEKQLETVYLASTC
SARVQQGVPEKLSDPGSFTIPCNFGEIKSTSVRLQLADQSLVSPYGIVENILIKVGRFFLPVDFFVLDIKENPAMPIILGRPFLATGRVIIDIERRELIIRVQQEKEVLK
AFEDPKNTSDTMMVGYRRGARKSTSDGKSDRRPP