; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0010544 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0010544
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr1:964299..974912
RNA-Seq ExpressionLag0010544
SyntenyLag0010544
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIM97577.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]1.9e-6656.62Show/hide
Query:  SELPREAQVEKRLDIRESFADEQILAVKAIEIPWFSDYVNYLVNGLKPPEATTQQLKKFLKDVKEYYSDEPYLYKLGPDQILRRCVTEEEVPFILEACHS
        S L   A+ ++   I ++F DEQ+LA+ A ++PW++D VNYL  G+ P + + QQ KKFL D + Y+ D+P+L+K GPD ILRRCV E E+  ILE CH+
Subjt:  SELPREAQVEKRLDIRESFADEQILAVKAIEIPWFSDYVNYLVNGLKPPEATTQQLKKFLKDVKEYYSDEPYLYKLGPDQILRRCVTEEEVPFILEACHS

Query:  IPFGEHFAGLKTATKVLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIFDVWGIDFMGPFPLSFGNIYILLAVIDYVSKWIEAITT
         P+G HF G +TA K+LQ G+FWP+LF+DAH F  N ++CQ+TGN++ R EMPLN ILEVE+FDVWGIDFMGPF  SFGN+YIL+AV DYVSKW+EA   
Subjt:  IPFGEHFAGLKTATKVLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIFDVWGIDFMGPFPLSFGNIYILLAVIDYVSKWIEAITT

Query:  SSNDAKVVKKFLMNTIFTR
         +ND+KVV  F+   IFTR
Subjt:  SSNDAKVVKKFLMNTIFTR

PIN03721.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]1.0e-6455.71Show/hide
Query:  SELPREAQVEKRLDIRESFADEQILAVKAIEIPWFSDYVNYLVNGLKPPEATTQQLKKFLKDVKEYYSDEPYLYKLGPDQILRRCVTEEEVPFILEACHS
        S L   A++++   I ++F+DEQ+LA+ A ++PW++D VNYL  G+ P +  TQQ KKFL D + Y+ D+ +L+K GPD ILRRCV E E+  +LE CH+
Subjt:  SELPREAQVEKRLDIRESFADEQILAVKAIEIPWFSDYVNYLVNGLKPPEATTQQLKKFLKDVKEYYSDEPYLYKLGPDQILRRCVTEEEVPFILEACHS

Query:  IPFGEHFAGLKTATKVLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIFDVWGIDFMGPFPLSFGNIYILLAVIDYVSKWIEAITT
         P+G HF   +TATK+LQ G+F P+LF+DAHLF  N ++CQ+TGN++ R EM LN ILEVE+FDVWGIDFMGPF  SFGN+YIL AV DYVSKW+EA+  
Subjt:  IPFGEHFAGLKTATKVLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIFDVWGIDFMGPFPLSFGNIYILLAVIDYVSKWIEAITT

Query:  SSNDAKVVKKFLMNTIFTR
         +ND+KVV  F+   IFTR
Subjt:  SSNDAKVVKKFLMNTIFTR

PIN22487.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]4.6e-6557.28Show/hide
Query:  SELPREAQVEKRLDIRESFADEQILAVKAIEIPWFSDYVNYLVNGLKPPEATTQQLKKFLKDVKEYYSDEPYLYKLGPDQILRRCVTEEEVPFILEACHS
        S L   A+ ++   I ++F DEQ+LA+ A E+PW++D VNYL  G+ P + +TQQ KKFL D + Y+ D+P+L+K GPD ILRRCV E E+  ILE CH+
Subjt:  SELPREAQVEKRLDIRESFADEQILAVKAIEIPWFSDYVNYLVNGLKPPEATTQQLKKFLKDVKEYYSDEPYLYKLGPDQILRRCVTEEEVPFILEACHS

Query:  IPFGEHFAGLKTATKVLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIFDVWGIDFMGPFPLSFGNIYILLAVIDYVSKWIEAITT
         P+G HF G +TA K+LQ G+FWP+LF+DAH F  N ++CQ+TGN++ R EMPLN ILEVE+FDVWGIDFMGPF  SFGN+YIL+AV DYVSKW+EA   
Subjt:  IPFGEHFAGLKTATKVLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIFDVWGIDFMGPFPLSFGNIYILLAVIDYVSKWIEAITT

Query:  SSNDAKVVKKFLM
         +ND+KVV+   M
Subjt:  SSNDAKVVKKFLM

PIN22518.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]9.3e-6657.28Show/hide
Query:  SELPREAQVEKRLDIRESFADEQILAVKAIEIPWFSDYVNYLVNGLKPPEATTQQLKKFLKDVKEYYSDEPYLYKLGPDQILRRCVTEEEVPFILEACHS
        S L   A+ ++   I ++F DEQ+LA+ A E+PW++D VNYL  G+ P + +TQQ KKFL D + Y+ D+P+L+K GPD ILRRCV E E+  ILE CH+
Subjt:  SELPREAQVEKRLDIRESFADEQILAVKAIEIPWFSDYVNYLVNGLKPPEATTQQLKKFLKDVKEYYSDEPYLYKLGPDQILRRCVTEEEVPFILEACHS

Query:  IPFGEHFAGLKTATKVLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIFDVWGIDFMGPFPLSFGNIYILLAVIDYVSKWIEAITT
         P+G HF G +TA K+LQ G+FWP+LF+DAH F  N ++CQ+TGN++ R EMPLN ILEVE+FDVWGIDFMGPF  SFGN+YIL+AV DYVSKW+EA   
Subjt:  IPFGEHFAGLKTATKVLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIFDVWGIDFMGPFPLSFGNIYILLAVIDYVSKWIEAITT

Query:  SSNDAKVVKKFLM
         SND+K+V+  +M
Subjt:  SSNDAKVVKKFLM

PIN26668.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]4.6e-6554.46Show/hide
Query:  KEFLSSELPREAQVEKRLDIRESFADEQILAVKAIEIPWFSDYVNYLVNGLKPPEATTQQLKKFLKDVKEYYSDEPYLYKLGPDQILRRCVTEEEVPFIL
        K+     L   A+ ++   I ++F DEQ+LA+ A ++PW+SD VNYL  G+ P + + QQ KKFL D + Y+ D+ +L+K GPD ILRRCV E E+  IL
Subjt:  KEFLSSELPREAQVEKRLDIRESFADEQILAVKAIEIPWFSDYVNYLVNGLKPPEATTQQLKKFLKDVKEYYSDEPYLYKLGPDQILRRCVTEEEVPFIL

Query:  EACHSIPFGEHFAGLKTATKVLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIFDVWGIDFMGPFPLSFGNIYILLAVIDYVSKWI
        E CH+ P+G HF G +TA K+LQ G+FWP+LF+DAH F  N ++CQ+TGN++ R EMPLN IL+VE+FDVWGIDF+GPF  SFGN+YIL+AV DYVSKW+
Subjt:  EACHSIPFGEHFAGLKTATKVLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIFDVWGIDFMGPFPLSFGNIYILLAVIDYVSKWI

Query:  EAITTSSNDAKVVKKFLMNTIFTR
        EA+   +ND+KVV  F+   IFTR
Subjt:  EAITTSSNDAKVVKKFLMNTIFTR

TrEMBL top hitse value%identityAlignment
A0A2G9FWY3 Reverse transcriptase9.1e-6756.62Show/hide
Query:  SELPREAQVEKRLDIRESFADEQILAVKAIEIPWFSDYVNYLVNGLKPPEATTQQLKKFLKDVKEYYSDEPYLYKLGPDQILRRCVTEEEVPFILEACHS
        S L   A+ ++   I ++F DEQ+LA+ A ++PW++D VNYL  G+ P + + QQ KKFL D + Y+ D+P+L+K GPD ILRRCV E E+  ILE CH+
Subjt:  SELPREAQVEKRLDIRESFADEQILAVKAIEIPWFSDYVNYLVNGLKPPEATTQQLKKFLKDVKEYYSDEPYLYKLGPDQILRRCVTEEEVPFILEACHS

Query:  IPFGEHFAGLKTATKVLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIFDVWGIDFMGPFPLSFGNIYILLAVIDYVSKWIEAITT
         P+G HF G +TA K+LQ G+FWP+LF+DAH F  N ++CQ+TGN++ R EMPLN ILEVE+FDVWGIDFMGPF  SFGN+YIL+AV DYVSKW+EA   
Subjt:  IPFGEHFAGLKTATKVLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIFDVWGIDFMGPFPLSFGNIYILLAVIDYVSKWIEAITT

Query:  SSNDAKVVKKFLMNTIFTR
         +ND+KVV  F+   IFTR
Subjt:  SSNDAKVVKKFLMNTIFTR

A0A2G9HYA0 Reverse transcriptase2.2e-6557.28Show/hide
Query:  SELPREAQVEKRLDIRESFADEQILAVKAIEIPWFSDYVNYLVNGLKPPEATTQQLKKFLKDVKEYYSDEPYLYKLGPDQILRRCVTEEEVPFILEACHS
        S L   A+ ++   I ++F DEQ+LA+ A E+PW++D VNYL  G+ P + +TQQ KKFL D + Y+ D+P+L+K GPD ILRRCV E E+  ILE CH+
Subjt:  SELPREAQVEKRLDIRESFADEQILAVKAIEIPWFSDYVNYLVNGLKPPEATTQQLKKFLKDVKEYYSDEPYLYKLGPDQILRRCVTEEEVPFILEACHS

Query:  IPFGEHFAGLKTATKVLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIFDVWGIDFMGPFPLSFGNIYILLAVIDYVSKWIEAITT
         P+G HF G +TA K+LQ G+FWP+LF+DAH F  N ++CQ+TGN++ R EMPLN ILEVE+FDVWGIDFMGPF  SFGN+YIL+AV DYVSKW+EA   
Subjt:  IPFGEHFAGLKTATKVLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIFDVWGIDFMGPFPLSFGNIYILLAVIDYVSKWIEAITT

Query:  SSNDAKVVKKFLM
         +ND+KVV+   M
Subjt:  SSNDAKVVKKFLM

A0A2G9HYD8 Reverse transcriptase4.5e-6657.28Show/hide
Query:  SELPREAQVEKRLDIRESFADEQILAVKAIEIPWFSDYVNYLVNGLKPPEATTQQLKKFLKDVKEYYSDEPYLYKLGPDQILRRCVTEEEVPFILEACHS
        S L   A+ ++   I ++F DEQ+LA+ A E+PW++D VNYL  G+ P + +TQQ KKFL D + Y+ D+P+L+K GPD ILRRCV E E+  ILE CH+
Subjt:  SELPREAQVEKRLDIRESFADEQILAVKAIEIPWFSDYVNYLVNGLKPPEATTQQLKKFLKDVKEYYSDEPYLYKLGPDQILRRCVTEEEVPFILEACHS

Query:  IPFGEHFAGLKTATKVLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIFDVWGIDFMGPFPLSFGNIYILLAVIDYVSKWIEAITT
         P+G HF G +TA K+LQ G+FWP+LF+DAH F  N ++CQ+TGN++ R EMPLN ILEVE+FDVWGIDFMGPF  SFGN+YIL+AV DYVSKW+EA   
Subjt:  IPFGEHFAGLKTATKVLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIFDVWGIDFMGPFPLSFGNIYILLAVIDYVSKWIEAITT

Query:  SSNDAKVVKKFLM
         SND+K+V+  +M
Subjt:  SSNDAKVVKKFLM

A0A2G9IA86 DNA-directed DNA polymerase2.2e-6554.46Show/hide
Query:  KEFLSSELPREAQVEKRLDIRESFADEQILAVKAIEIPWFSDYVNYLVNGLKPPEATTQQLKKFLKDVKEYYSDEPYLYKLGPDQILRRCVTEEEVPFIL
        K+     L   A+ ++   I ++F DEQ+LA+ A ++PW+SD VNYL  G+ P + + QQ KKFL D + Y+ D+ +L+K GPD ILRRCV E E+  IL
Subjt:  KEFLSSELPREAQVEKRLDIRESFADEQILAVKAIEIPWFSDYVNYLVNGLKPPEATTQQLKKFLKDVKEYYSDEPYLYKLGPDQILRRCVTEEEVPFIL

Query:  EACHSIPFGEHFAGLKTATKVLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIFDVWGIDFMGPFPLSFGNIYILLAVIDYVSKWI
        E CH+ P+G HF G +TA K+LQ G+FWP+LF+DAH F  N ++CQ+TGN++ R EMPLN IL+VE+FDVWGIDF+GPF  SFGN+YIL+AV DYVSKW+
Subjt:  EACHSIPFGEHFAGLKTATKVLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIFDVWGIDFMGPFPLSFGNIYILLAVIDYVSKWI

Query:  EAITTSSNDAKVVKKFLMNTIFTR
        EA+   +ND+KVV  F+   IFTR
Subjt:  EAITTSSNDAKVVKKFLMNTIFTR

A0A803R2M6 Uncharacterized protein1.1e-6758.14Show/hide
Query:  EAQVEKRLDIRESFADEQILAVK-AIEIPWFSDYVNYLVNGLKPPEATTQQLKKFLKDVKEYYSDEPYLYKLGPDQILRRCVTEEEVPFILEACHSIPFG
        E+Q  K + I E F DEQ+ +V+ ++ +PW++DYVN+L   + PPE + QQLKKF  +VK YY +EP LYK   DQI+RRCV EEE+  IL  CH++P G
Subjt:  EAQVEKRLDIRESFADEQILAVK-AIEIPWFSDYVNYLVNGLKPPEATTQQLKKFLKDVKEYYSDEPYLYKLGPDQILRRCVTEEEVPFILEACHSIPFG

Query:  EHFAGLKTATKVLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIFDVWGIDFMGPFPLSFGNIYILLAVIDYVSKWIEAITTSSND
         HF+G +TA KVLQ G+FWP+LF+DA  F +  ++CQ+TGN++ R+EMPL  ILEVE+FDVWGIDFMGPFP SF N+YILLAV DYVSKW+EA  T +ND
Subjt:  EHFAGLKTATKVLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIFDVWGIDFMGPFPLSFGNIYILLAVIDYVSKWIEAITTSSND

Query:  AKVVKKFLMNTIFTR
         K V +FL   IFTR
Subjt:  AKVVKKFLMNTIFTR

SwissProt top hitse value%identityAlignment
O93209 Pro-Pol polyprotein7.3e-0525.71Show/hide
Query:  AGLKTATKVLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIFDVWGIDFMGPFPLSFGNIYILLAVIDYVSKWIEAITTSSNDAKV
        AG +     +Q  Y+WP + +D   F    N C+    L ++   P   +   + FD + +D++GP P S G +++L+ V+D  + +     T +  +K 
Subjt:  AGLKTATKVLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIFDVWGIDFMGPFPLSFGNIYILLAVIDYVSKWIEAITTSSNDAKV

Query:  VKKFL
          K L
Subjt:  VKKFL

P23074 Pro-Pol polyprotein3.6e-0427.5Show/hide
Query:  ILEACHSIPF-GEHFAGLKTATKVLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIFDVWGIDFMGPFPLSFGNIYILLAVIDYVS
        I+   H+I   G     LK ++K     Y+WP+L +D     R   QC  T    + S   L  +  ++ FD + ID++GP P S G +++L+ V+D ++
Subjt:  ILEACHSIPF-GEHFAGLKTATKVLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIFDVWGIDFMGPFPLSFGNIYILLAVIDYVS

Query:  KWIEAITTSSNDAKVVKKFL
         ++    T +       K L
Subjt:  KWIEAITTSSNDAKVVKKFL

P27401 Pro-Pol polyprotein1.2e-0426.85Show/hide
Query:  YLYKLGPDQIL-------RRCVTEEEVPFILEACHSIPF-GEHFAGLKTATKVLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIF
        Y Y+L   Q++       R    + + P I+   H+I   G     LK ++K     Y+WP+L +D     R   QC  T   T+ +   L     V+ F
Subjt:  YLYKLGPDQIL-------RRCVTEEEVPFILEACHSIPF-GEHFAGLKTATKVLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIF

Query:  DVWGIDFMGPFPLSFGNIYILLAVIDYVSKWIEAITTSSNDAKVVKKFL
        D + ID++GP P S G +++L+ V+D ++ ++    T +       K L
Subjt:  DVWGIDFMGPFPLSFGNIYILLAVIDYVSKWIEAITTSSNDAKVVKKFL

P92516 Uncharacterized mitochondrial protein AtMg007502.9e-1460.71Show/hide
Query:  VLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIFDVWGIDFM
        VLQ G++WP+ F+DAH F  + + CQ+ GN T R+EMP +FILEVE+FDVWGI FM
Subjt:  VLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIFDVWGIDFM

Arabidopsis top hitse value%identityAlignment
ATMG00750.1 GAG/POL/ENV polyprotein2.1e-1560.71Show/hide
Query:  VLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIFDVWGIDFM
        VLQ G++WP+ F+DAH F  + + CQ+ GN T R+EMP +FILEVE+FDVWGI FM
Subjt:  VLQLGYFWPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIFDVWGIDFM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGGGCGGGATCCATTTGTTCAAGCCCCGGAGTCAGCACTTAAGGGAACAACCTCTCTACTATCCCTTATTCGGGGCACCAATCCCAACAATTACATCGTTGGAGC
TTCTGATCTGCAGAAGAAATTTCCAGCGAAGAAAAACAAAGGCGGCTGCTGCGTTTTCGTTCGTGGAGCGTCGTTGGCGAAGAACGGTCAAGTCTTCAACGAAGGCAAGG
AGAATAGTAGAGAAGACACTAAAGGTTGTCCAAGAGAAGTTGGAGAAGGAAAACACCACAAGGAGTTCCTCTCAAGTGAGCTACCAAGAGAGGCGCAAGTGGAGAAGAGG
CTGGACATTCGAGAGTCGTTTGCAGATGAACAGATATTGGCGGTGAAAGCAATTGAGATTCCATGGTTTTCAGACTATGTGAACTACCTAGTCAATGGACTAAAACCTCC
GGAAGCCACAACTCAACAACTGAAGAAGTTTTTAAAAGATGTGAAGGAGTATTATTCGGATGAACCGTATTTATACAAGCTTGGACCAGATCAGATACTCAGACGATGTG
TAACAGAGGAGGAAGTACCATTCATTTTGGAAGCTTGTCATTCAATTCCATTTGGAGAGCACTTCGCAGGTCTAAAAACTGCAACAAAAGTCCTCCAATTAGGATACTTT
TGGCCAAGCCTTTTCAGAGATGCACATTTGTTTGCAAGGAACGGTAACCAATGCCAAAAAACAGGGAATTTGACAGTTAGGAGTGAAATGCCACTGAACTTCATACTTGA
AGTAGAAATCTTTGATGTATGGGGTATAGATTTCATGGGCCCATTCCCTCTGTCCTTTGGAAATATCTATATTCTGCTTGCAGTTATTGATTATGTATCGAAATGGATCG
AAGCCATAACCACATCATCCAATGATGCGAAGGTAGTAAAGAAATTCTTAATGAATACGATCTTTACAAGGACAACCGCTAGAGTCTTCCCCGCTATTATCCGGGCCAAG
AATGACTTGTGGGAGTCTATAGAGATAGGGATTGGAAAGGGAATTCTTGAAGTGTCAAGCTCTAAAGAGTTTCTTTTGTGTGAGTGTATCTATCTACGTGACGATTACAT
CTCCTCGTTGCACGATCTCCCTGATGAGATGATATTTGCACTCAATGTGCTTTCCACGCTTGTGACTTCGAGGTTCCTTAGAGTTTGCAACTGCTCCATTGTTATCACAA
TAGAGGGTGAGAGGGAATTGCATATCTGGAACGACTTCCAACTCTGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGAGGGCGGGATCCATTTGTTCAAGCCCCGGAGTCAGCACTTAAGGGAACAACCTCTCTACTATCCCTTATTCGGGGCACCAATCCCAACAATTACATCGTTGGAGC
TTCTGATCTGCAGAAGAAATTTCCAGCGAAGAAAAACAAAGGCGGCTGCTGCGTTTTCGTTCGTGGAGCGTCGTTGGCGAAGAACGGTCAAGTCTTCAACGAAGGCAAGG
AGAATAGTAGAGAAGACACTAAAGGTTGTCCAAGAGAAGTTGGAGAAGGAAAACACCACAAGGAGTTCCTCTCAAGTGAGCTACCAAGAGAGGCGCAAGTGGAGAAGAGG
CTGGACATTCGAGAGTCGTTTGCAGATGAACAGATATTGGCGGTGAAAGCAATTGAGATTCCATGGTTTTCAGACTATGTGAACTACCTAGTCAATGGACTAAAACCTCC
GGAAGCCACAACTCAACAACTGAAGAAGTTTTTAAAAGATGTGAAGGAGTATTATTCGGATGAACCGTATTTATACAAGCTTGGACCAGATCAGATACTCAGACGATGTG
TAACAGAGGAGGAAGTACCATTCATTTTGGAAGCTTGTCATTCAATTCCATTTGGAGAGCACTTCGCAGGTCTAAAAACTGCAACAAAAGTCCTCCAATTAGGATACTTT
TGGCCAAGCCTTTTCAGAGATGCACATTTGTTTGCAAGGAACGGTAACCAATGCCAAAAAACAGGGAATTTGACAGTTAGGAGTGAAATGCCACTGAACTTCATACTTGA
AGTAGAAATCTTTGATGTATGGGGTATAGATTTCATGGGCCCATTCCCTCTGTCCTTTGGAAATATCTATATTCTGCTTGCAGTTATTGATTATGTATCGAAATGGATCG
AAGCCATAACCACATCATCCAATGATGCGAAGGTAGTAAAGAAATTCTTAATGAATACGATCTTTACAAGGACAACCGCTAGAGTCTTCCCCGCTATTATCCGGGCCAAG
AATGACTTGTGGGAGTCTATAGAGATAGGGATTGGAAAGGGAATTCTTGAAGTGTCAAGCTCTAAAGAGTTTCTTTTGTGTGAGTGTATCTATCTACGTGACGATTACAT
CTCCTCGTTGCACGATCTCCCTGATGAGATGATATTTGCACTCAATGTGCTTTCCACGCTTGTGACTTCGAGGTTCCTTAGAGTTTGCAACTGCTCCATTGTTATCACAA
TAGAGGGTGAGAGGGAATTGCATATCTGGAACGACTTCCAACTCTGTTAG
Protein sequenceShow/hide protein sequence
MRGRDPFVQAPESALKGTTSLLSLIRGTNPNNYIVGASDLQKKFPAKKNKGGCCVFVRGASLAKNGQVFNEGKENSREDTKGCPREVGEGKHHKEFLSSELPREAQVEKR
LDIRESFADEQILAVKAIEIPWFSDYVNYLVNGLKPPEATTQQLKKFLKDVKEYYSDEPYLYKLGPDQILRRCVTEEEVPFILEACHSIPFGEHFAGLKTATKVLQLGYF
WPSLFRDAHLFARNGNQCQKTGNLTVRSEMPLNFILEVEIFDVWGIDFMGPFPLSFGNIYILLAVIDYVSKWIEAITTSSNDAKVVKKFLMNTIFTRTTARVFPAIIRAK
NDLWESIEIGIGKGILEVSSSKEFLLCECIYLRDDYISSLHDLPDEMIFALNVLSTLVTSRFLRVCNCSIVITIEGERELHIWNDFQLC