; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0006223 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0006223
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr6:39558500..39565134
RNA-Seq ExpressionLag0006223
SyntenyLag0006223
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031937.1 retroelement pol polyprotein-like [Cucumis melo var. makuwa]3.3e-5850Show/hide
Query:  DKSSLKMIGRAEVCKGLYLFSTDSLLADLNHDITKSVNVVF-HCDTWGPYQTPTHSCYRYFLTVVDDYSRYTWTFLMKTKSEALLLVPRFFAYIDTQFGI
        DK + K IG A++ +GLY+    +    +    T   ++     D WGP+ T T+S Y YFLT+VDD +RYTW F++K KS+ + ++P+FF  I+TQ+G 
Subjt:  DKSSLKMIGRAEVCKGLYLFSTDSLLADLNHDITKSVNVVF-HCDTWGPYQTPTHSCYRYFLTVVDDYSRYTWTFLMKTKSEALLLVPRFFAYIDTQFGI

Query:  PIKSFRSDNAPELSFTDFFAHRGVLHQFSCVQCPEQNSVVERKHQHLLNVARALMFQSKIPIQFWGECILTTSFIINRTPSSVVEWQTPYFRLHKKEADT
         IK  RSDNAPEL FT FF  +GV+HQ+SCVQCP+QNSVVERKHQH+LN ARAL FQS++P+ FWG+CILT  ++INRTPS +++W++P+ +L+    D 
Subjt:  PIKSFRSDNAPELSFTDFFAHRGVLHQFSCVQCPEQNSVVERKHQHLLNVARALMFQSKIPIQFWGECILTTSFIINRTPSSVVEWQTPYFRLHKKEADT

Query:  HS---YGSLVLCVLLP
        +S   +GSL     LP
Subjt:  HS---YGSLVLCVLLP

KAA0068025.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]1.0e-5150.54Show/hide
Query:  NHDITKSVNVVFHCDTWGPYQTPTHSCYRYFLTVVDDYSRYTWTFLMKTKSEALLLVPRFFAYIDTQFGIPIKSFRSDNAPELSFTDFFAHRGVLHQFSC
        N+ ++ +   + H D WGP+ T T+S Y YFLT+VDD +RYTW F++K KS+ + ++P FF  I+TQ+G  IK  RSDNA +L FT FF  +GV+HQ+SC
Subjt:  NHDITKSVNVVFHCDTWGPYQTPTHSCYRYFLTVVDDYSRYTWTFLMKTKSEALLLVPRFFAYIDTQFGIPIKSFRSDNAPELSFTDFFAHRGVLHQFSC

Query:  VQCPEQNSVVERKHQHLLNVARALMFQSKIPIQFWGECILTTSFIINRTPSSVVEWQTPYFRLHKKEADTHS---YGSLVLCVLLP
        VQ P+QNSVVE+KHQH+LN ARAL FQS++P+ FWG+CI+T  ++I+RTPS +++W+ P+ +L+    D +S   +GSL     LP
Subjt:  VQCPEQNSVVERKHQHLLNVARALMFQSKIPIQFWGECILTTSFIINRTPSSVVEWQTPYFRLHKKEADTHS---YGSLVLCVLLP

TYK16758.1 Copia protein [Cucumis melo var. makuwa]2.8e-5749.54Show/hide
Query:  DKSSLKMIGRAEVCKGLYLFSTDSLLADLNHDITKSVNVVF-HCDTWGPYQTPTHSCYRYFLTVVDDYSRYTWTFLMKTKSEALLLVPRFFAYIDTQFGI
        DK + K IG A++ +GLY+    +    +    T   ++     D WGP+ T T+S Y YFLT+VDD +RYTW F++K KS+ + ++P+FF  I+TQ+G 
Subjt:  DKSSLKMIGRAEVCKGLYLFSTDSLLADLNHDITKSVNVVF-HCDTWGPYQTPTHSCYRYFLTVVDDYSRYTWTFLMKTKSEALLLVPRFFAYIDTQFGI

Query:  PIKSFRSDNAPELSFTDFFAHRGVLHQFSCVQCPEQNSVVERKHQHLLNVARALMFQSKIPIQFWGECILTTSFIINRTPSSVVEWQTPYFRLHKKEADT
         IK  RSDNAPEL FT FF  +GV+HQ+SCVQCP+QNSVVERKHQH+LN ARAL FQS++P+ FWG+CILT  ++INRTPS +++W++ + +L+    D 
Subjt:  PIKSFRSDNAPELSFTDFFAHRGVLHQFSCVQCPEQNSVVERKHQHLLNVARALMFQSKIPIQFWGECILTTSFIINRTPSSVVEWQTPYFRLHKKEADT

Query:  HS---YGSLVLCVLLP
        +S   +GSL     LP
Subjt:  HS---YGSLVLCVLLP

TYK18103.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]1.0e-5150.54Show/hide
Query:  NHDITKSVNVVFHCDTWGPYQTPTHSCYRYFLTVVDDYSRYTWTFLMKTKSEALLLVPRFFAYIDTQFGIPIKSFRSDNAPELSFTDFFAHRGVLHQFSC
        N+ ++ +   + H D WGP+ T T+S Y YFLT+VDD +RYTW F++K KS+ + ++P FF  I+TQ+G  IK  RSDNA +L FT FF  +GV+HQ+SC
Subjt:  NHDITKSVNVVFHCDTWGPYQTPTHSCYRYFLTVVDDYSRYTWTFLMKTKSEALLLVPRFFAYIDTQFGIPIKSFRSDNAPELSFTDFFAHRGVLHQFSC

Query:  VQCPEQNSVVERKHQHLLNVARALMFQSKIPIQFWGECILTTSFIINRTPSSVVEWQTPYFRLHKKEADTHS---YGSLVLCVLLP
        VQ P+QNSVVE+KHQH+LN ARAL FQS++P+ FWG+CI+T  ++I+RTPS +++W+ P+ +L+    D +S   +GSL     LP
Subjt:  VQCPEQNSVVERKHQHLLNVARALMFQSKIPIQFWGECILTTSFIINRTPSSVVEWQTPYFRLHKKEADTHS---YGSLVLCVLLP

XP_038895765.1 uncharacterized protein LOC120083929 [Benincasa hispida]1.7e-5456.73Show/hide
Query:  DLNHDITKSVNVVFHCDTWGPYQTPTHSCYRYFLTVVDDYSRYTWTFLMKTKSEALLLVPRFFAYIDTQFGIPIKSFRSDNAPELSFTDFFAHRGVLHQF
        +L+++ + +V  + H + WGP+ TPTH  +++FLT+VDD+SR+TW F++K+KS  L ++P+FF+Y++TQ+   IK FRSDNAPELSF +FF  RGVLHQ+
Subjt:  DLNHDITKSVNVVFHCDTWGPYQTPTHSCYRYFLTVVDDYSRYTWTFLMKTKSEALLLVPRFFAYIDTQFGIPIKSFRSDNAPELSFTDFFAHRGVLHQF

Query:  SCVQCPEQNSVVERKHQHLLNVARALMFQSKIPIQFWGECILTTSFIINRTPSSVVEWQTPYFRLHKKEAD
        SCV  PEQNSVVERKHQHLLNV+RAL FQS+ P++FW EC+LT  ++INRT S ++ W+TPY  L    AD
Subjt:  SCVQCPEQNSVVERKHQHLLNVARALMFQSKIPIQFWGECILTTSFIINRTPSSVVEWQTPYFRLHKKEAD

TrEMBL top hitse value%identityAlignment
A0A151TRU9 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-5154.05Show/hide
Query:  NHDITKSVNVVFHCDTWGPYQTPTHSCYRYFLTVVDDYSRYTWTFLMKTKSEALLLVPRFFAYIDTQFGIPIKSFRSDNAPELSFTDFFAHRGVLHQFSC
        NH  TK  +++ HCD WGPY+ PT++  +YFLT+VDD+SRYTW  L++TK+EA   +  FF+ I TQFG+ IK FRSDNA EL+ T+F   +G +HQ SC
Subjt:  NHDITKSVNVVFHCDTWGPYQTPTHSCYRYFLTVVDDYSRYTWTFLMKTKSEALLLVPRFFAYIDTQFGIPIKSFRSDNAPELSFTDFFAHRGVLHQFSC

Query:  VQCPEQNSVVERKHQHLLNVARALMFQSKIPIQFWGECILTTSFIINRTPSSVVEWQTPYFRLHKKEAD---THSYGSLVLCVLL
        V+ P+QN VVERKHQHLL+VARAL +QSKIPI+FWG+CI T +F+INR PS   +  +PY  L+ K+ D     S+G L     L
Subjt:  VQCPEQNSVVERKHQHLLNVARALMFQSKIPIQFWGECILTTSFIINRTPSSVVEWQTPYFRLHKKEAD---THSYGSLVLCVLL

A0A5A7SRC2 Retroelement pol polyprotein-like1.6e-5850Show/hide
Query:  DKSSLKMIGRAEVCKGLYLFSTDSLLADLNHDITKSVNVVF-HCDTWGPYQTPTHSCYRYFLTVVDDYSRYTWTFLMKTKSEALLLVPRFFAYIDTQFGI
        DK + K IG A++ +GLY+    +    +    T   ++     D WGP+ T T+S Y YFLT+VDD +RYTW F++K KS+ + ++P+FF  I+TQ+G 
Subjt:  DKSSLKMIGRAEVCKGLYLFSTDSLLADLNHDITKSVNVVF-HCDTWGPYQTPTHSCYRYFLTVVDDYSRYTWTFLMKTKSEALLLVPRFFAYIDTQFGI

Query:  PIKSFRSDNAPELSFTDFFAHRGVLHQFSCVQCPEQNSVVERKHQHLLNVARALMFQSKIPIQFWGECILTTSFIINRTPSSVVEWQTPYFRLHKKEADT
         IK  RSDNAPEL FT FF  +GV+HQ+SCVQCP+QNSVVERKHQH+LN ARAL FQS++P+ FWG+CILT  ++INRTPS +++W++P+ +L+    D 
Subjt:  PIKSFRSDNAPELSFTDFFAHRGVLHQFSCVQCPEQNSVVERKHQHLLNVARALMFQSKIPIQFWGECILTTSFIINRTPSSVVEWQTPYFRLHKKEADT

Query:  HS---YGSLVLCVLLP
        +S   +GSL     LP
Subjt:  HS---YGSLVLCVLLP

A0A5A7VQN7 Cysteine-rich RLK (Receptor-like protein kinase) 85.0e-5250.54Show/hide
Query:  NHDITKSVNVVFHCDTWGPYQTPTHSCYRYFLTVVDDYSRYTWTFLMKTKSEALLLVPRFFAYIDTQFGIPIKSFRSDNAPELSFTDFFAHRGVLHQFSC
        N+ ++ +   + H D WGP+ T T+S Y YFLT+VDD +RYTW F++K KS+ + ++P FF  I+TQ+G  IK  RSDNA +L FT FF  +GV+HQ+SC
Subjt:  NHDITKSVNVVFHCDTWGPYQTPTHSCYRYFLTVVDDYSRYTWTFLMKTKSEALLLVPRFFAYIDTQFGIPIKSFRSDNAPELSFTDFFAHRGVLHQFSC

Query:  VQCPEQNSVVERKHQHLLNVARALMFQSKIPIQFWGECILTTSFIINRTPSSVVEWQTPYFRLHKKEADTHS---YGSLVLCVLLP
        VQ P+QNSVVE+KHQH+LN ARAL FQS++P+ FWG+CI+T  ++I+RTPS +++W+ P+ +L+    D +S   +GSL     LP
Subjt:  VQCPEQNSVVERKHQHLLNVARALMFQSKIPIQFWGECILTTSFIINRTPSSVVEWQTPYFRLHKKEADTHS---YGSLVLCVLLP

A0A5D3CZP1 Copia protein1.4e-5749.54Show/hide
Query:  DKSSLKMIGRAEVCKGLYLFSTDSLLADLNHDITKSVNVVF-HCDTWGPYQTPTHSCYRYFLTVVDDYSRYTWTFLMKTKSEALLLVPRFFAYIDTQFGI
        DK + K IG A++ +GLY+    +    +    T   ++     D WGP+ T T+S Y YFLT+VDD +RYTW F++K KS+ + ++P+FF  I+TQ+G 
Subjt:  DKSSLKMIGRAEVCKGLYLFSTDSLLADLNHDITKSVNVVF-HCDTWGPYQTPTHSCYRYFLTVVDDYSRYTWTFLMKTKSEALLLVPRFFAYIDTQFGI

Query:  PIKSFRSDNAPELSFTDFFAHRGVLHQFSCVQCPEQNSVVERKHQHLLNVARALMFQSKIPIQFWGECILTTSFIINRTPSSVVEWQTPYFRLHKKEADT
         IK  RSDNAPEL FT FF  +GV+HQ+SCVQCP+QNSVVERKHQH+LN ARAL FQS++P+ FWG+CILT  ++INRTPS +++W++ + +L+    D 
Subjt:  PIKSFRSDNAPELSFTDFFAHRGVLHQFSCVQCPEQNSVVERKHQHLLNVARALMFQSKIPIQFWGECILTTSFIINRTPSSVVEWQTPYFRLHKKEADT

Query:  HS---YGSLVLCVLLP
        +S   +GSL     LP
Subjt:  HS---YGSLVLCVLLP

A0A5D3D1N3 Cysteine-rich RLK (Receptor-like protein kinase) 85.0e-5250.54Show/hide
Query:  NHDITKSVNVVFHCDTWGPYQTPTHSCYRYFLTVVDDYSRYTWTFLMKTKSEALLLVPRFFAYIDTQFGIPIKSFRSDNAPELSFTDFFAHRGVLHQFSC
        N+ ++ +   + H D WGP+ T T+S Y YFLT+VDD +RYTW F++K KS+ + ++P FF  I+TQ+G  IK  RSDNA +L FT FF  +GV+HQ+SC
Subjt:  NHDITKSVNVVFHCDTWGPYQTPTHSCYRYFLTVVDDYSRYTWTFLMKTKSEALLLVPRFFAYIDTQFGIPIKSFRSDNAPELSFTDFFAHRGVLHQFSC

Query:  VQCPEQNSVVERKHQHLLNVARALMFQSKIPIQFWGECILTTSFIINRTPSSVVEWQTPYFRLHKKEADTHS---YGSLVLCVLLP
        VQ P+QNSVVE+KHQH+LN ARAL FQS++P+ FWG+CI+T  ++I+RTPS +++W+ P+ +L+    D +S   +GSL     LP
Subjt:  VQCPEQNSVVERKHQHLLNVARALMFQSKIPIQFWGECILTTSFIINRTPSSVVEWQTPYFRLHKKEADTHS---YGSLVLCVLLP

SwissProt top hitse value%identityAlignment
P04146 Copia protein9.5e-1630.25Show/hide
Query:  VFHCDTWGPYQTPTHSCYRYFLTVVDDYSRYTWTFLMKTKSEALLLVPRFFAYIDTQFGIPIKSFRSDNAPEL---SFTDFFAHRGVLHQFSCVQCPEQN
        V H D  GP    T     YF+  VD ++ Y  T+L+K KS+   +   F A  +  F + +     DN  E        F   +G+ +  +    P+ N
Subjt:  VFHCDTWGPYQTPTHSCYRYFLTVVDDYSRYTWTFLMKTKSEALLLVPRFFAYIDTQFGIPIKSFRSDNAPEL---SFTDFFAHRGVLHQFSCVQCPEQN

Query:  SVVERKHQHLLNVARALMFQSKIPIQFWGECILTTSFIINRTPSS--VVEWQTPYFRLHKKE
         V ER  + +   AR ++  +K+   FWGE +LT +++INR PS   V   +TPY   H K+
Subjt:  SVVERKHQHLLNVARALMFQSKIPIQFWGECILTTSFIINRTPSS--VVEWQTPYFRLHKKE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.5e-2130Show/hide
Query:  HCDTWGPYQTPTHSCYRYFLTVVDDYSRYTWTFLMKTKSEALLLVPRFFAYIDTQFGIPIKSFRSDNAPELS---FTDFFAHRGVLHQFSCVQCPEQNSV
        + D  GP +  +    +YF+T +DD SR  W +++KTK +   +  +F A ++ + G  +K  RSDN  E +   F ++ +  G+ H+ +    P+ N V
Subjt:  HCDTWGPYQTPTHSCYRYFLTVVDDYSRYTWTFLMKTKSEALLLVPRFFAYIDTQFGIPIKSFRSDNAPELS---FTDFFAHRGVLHQFSCVQCPEQNSV

Query:  VERKHQHLLNVARALMFQSKIPIQFWGECILTTSFIINRTPSSVVEWQTP
         ER ++ ++   R+++  +K+P  FWGE + T  ++INR+PS  + ++ P
Subjt:  VERKHQHLLNVARALMFQSKIPIQFWGECILTTSFIINRTPSSVVEWQTP

P47024 Transposon Ty4-J Gag-Pol polyprotein1.9e-0825.66Show/hide
Query:  NHDITKSVNVVFHCDTWGPYQTPTHSCYRYFLTVVDDYSRY--TWTFLMKTKSEALLLVPRFFAYIDTQFGIPIKSFRSDNAPELS---FTDFFAHRGVL
        NH         +  D +GP  +      RY L +VD+ +RY  T T   K     L  V +   Y++TQF   ++   SD   E +     ++F  +G+ 
Subjt:  NHDITKSVNVVFHCDTWGPYQTPTHSCYRYFLTVVDDYSRY--TWTFLMKTKSEALLLVPRFFAYIDTQFGIPIKSFRSDNAPELS---FTDFFAHRGVL

Query:  HQFSCVQCPEQNSVVERKHQHLLNVARALMFQSKIPIQFWGECILTTSFIIN
        H  +  Q    N   ER  + ++  A  L+ QS + ++FW   + + + I N
Subjt:  HQFSCVQCPEQNSVVERKHQHLLNVARALMFQSKIPIQFWGECILTTSFIIN

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.5e-1830.5Show/hide
Query:  THSCYRYFLTVVDDYSRYTWTFLMKTKSEALLLVPRFFAYIDTQFGIPIKSFRSDNAPE-LSFTDFFAHRGVLHQFSCVQCPEQNSVVERKHQHLLNVAR
        +H  YRY++  VD ++RYTW + +K KS+       F   ++ +F   I +F SDN  E ++  ++F+  G+ H  S    PE N + ERKH+H++    
Subjt:  THSCYRYFLTVVDDYSRYTWTFLMKTKSEALLLVPRFFAYIDTQFGIPIKSFRSDNAPE-LSFTDFFAHRGVLHQFSCVQCPEQNSVVERKHQHLLNVAR

Query:  ALMFQSKIPIQFWGECILTTSFIINRTPSSVVEWQTPYFRL
         L+  + IP  +W        ++INR P+ +++ ++P+ +L
Subjt:  ALMFQSKIPIQFWGECILTTSFIINRTPSSVVEWQTPYFRL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE28.6e-1729.34Show/hide
Query:  NHDITKSVNVVF-HCDTWGPYQTPTHSC--YRYFLTVVDDYSRYTWTFLMKTKSEALLLVPRFFAYIDTQFGIPIKSFRSDNAPE-LSFTDFFAHRGVLH
        N  IT S  + + + D W    +P  S   YRY++  VD ++RYTW + +K KS+       F + ++ +F   I +  SDN  E +   D+ +  G+ H
Subjt:  NHDITKSVNVVF-HCDTWGPYQTPTHSC--YRYFLTVVDDYSRYTWTFLMKTKSEALLLVPRFFAYIDTQFGIPIKSFRSDNAPE-LSFTDFFAHRGVLH

Query:  QFSCVQCPEQNSVVERKHQHLLNVARALMFQSKIPIQFWGECILTTSFIINRTPSSVVEWQTPYFRL
          S    PE N + ERKH+H++ +   L+  + +P  +W        ++INR P+ +++ Q+P+ +L
Subjt:  QFSCVQCPEQNSVVERKHQHLLNVARALMFQSKIPIQFWGECILTTSFIINRTPSSVVEWQTPYFRL

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTGGAGTTCGCGGTGACGACGGATGTCAGATCGGGATTGGATTAGTGGCGAAGTTCGTCGGAGTGGCGGCGACGATGTGGAAAAGAGGGGAAAAAGAGCGGCGGCT
CGCGCGAGACGAGACCAACGGCAAGCGAAAACCACGGCGCGAGGAGGAAGACGACGGCGACTGGGCGGCGGACTGCGGCAGCTCTGTGCGTGGAGGCTGCGGCGGCGGCG
CGGTTGCTGAGTGGAGAGGAGAAGAAAAAAAAAATGCAGAGATGGGGATGGGGGCTGATGCGCGTGGCCGAGAGAAAGGGAGAGAGAGAAAGAGAGAGGTGGCTGGTGCA
AAGAAACGGGGGGGGGACAAGTCTTCTTTGAAGATGATTGGCAGGGCTGAGGTTTGCAAAGGCTTATATTTGTTTTCTACTGATTCCTTGTTGGCTGACTTGAATCATGA
TATTACTAAATCTGTCAATGTTGTTTTCCACTGTGACACTTGGGGTCCCTATCAAACACCAACTCACTCATGCTATCGTTATTTTCTCACTGTCGTGGATGATTACAGTC
GTTATACCTGGACCTTTCTAATGAAAACAAAATCTGAAGCCCTCCTCTTGGTCCCTCGTTTTTTTGCCTACATTGACACACAGTTTGGTATTCCCATTAAAAGCTTCCGT
TCAGATAATGCCCCCGAGCTTTCCTTTACTGATTTTTTTGCTCATAGAGGGGTTCTTCACCAATTCTCCTGTGTGCAATGCCCTGAGCAAAATTCCGTGGTAGAAAGGAA
GCACCAACACCTTTTAAATGTTGCTAGAGCTCTCATGTTTCAATCTAAAATACCAATCCAATTCTGGGGAGAGTGTATATTAACAACTTCATTTATCATCAACCGAACTC
CTTCTTCTGTTGTTGAGTGGCAGACTCCTTATTTTCGGTTACATAAGAAAGAAGCTGATACTCACAGCTACGGGTCTTTGGTTCTTTGTGTTTTGCTTCCAGCTTGCCCT
CCCAAAGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTGGAGTTCGCGGTGACGACGGATGTCAGATCGGGATTGGATTAGTGGCGAAGTTCGTCGGAGTGGCGGCGACGATGTGGAAAAGAGGGGAAAAAGAGCGGCGGCT
CGCGCGAGACGAGACCAACGGCAAGCGAAAACCACGGCGCGAGGAGGAAGACGACGGCGACTGGGCGGCGGACTGCGGCAGCTCTGTGCGTGGAGGCTGCGGCGGCGGCG
CGGTTGCTGAGTGGAGAGGAGAAGAAAAAAAAAATGCAGAGATGGGGATGGGGGCTGATGCGCGTGGCCGAGAGAAAGGGAGAGAGAGAAAGAGAGAGGTGGCTGGTGCA
AAGAAACGGGGGGGGGACAAGTCTTCTTTGAAGATGATTGGCAGGGCTGAGGTTTGCAAAGGCTTATATTTGTTTTCTACTGATTCCTTGTTGGCTGACTTGAATCATGA
TATTACTAAATCTGTCAATGTTGTTTTCCACTGTGACACTTGGGGTCCCTATCAAACACCAACTCACTCATGCTATCGTTATTTTCTCACTGTCGTGGATGATTACAGTC
GTTATACCTGGACCTTTCTAATGAAAACAAAATCTGAAGCCCTCCTCTTGGTCCCTCGTTTTTTTGCCTACATTGACACACAGTTTGGTATTCCCATTAAAAGCTTCCGT
TCAGATAATGCCCCCGAGCTTTCCTTTACTGATTTTTTTGCTCATAGAGGGGTTCTTCACCAATTCTCCTGTGTGCAATGCCCTGAGCAAAATTCCGTGGTAGAAAGGAA
GCACCAACACCTTTTAAATGTTGCTAGAGCTCTCATGTTTCAATCTAAAATACCAATCCAATTCTGGGGAGAGTGTATATTAACAACTTCATTTATCATCAACCGAACTC
CTTCTTCTGTTGTTGAGTGGCAGACTCCTTATTTTCGGTTACATAAGAAAGAAGCTGATACTCACAGCTACGGGTCTTTGGTTCTTTGTGTTTTGCTTCCAGCTTGCCCT
CCCAAAGGTTGA
Protein sequenceShow/hide protein sequence
MVGVRGDDGCQIGIGLVAKFVGVAATMWKRGEKERRLARDETNGKRKPRREEEDDGDWAADCGSSVRGGCGGGAVAEWRGEEKKNAEMGMGADARGREKGRERKREVAGA
KKRGGDKSSLKMIGRAEVCKGLYLFSTDSLLADLNHDITKSVNVVFHCDTWGPYQTPTHSCYRYFLTVVDDYSRYTWTFLMKTKSEALLLVPRFFAYIDTQFGIPIKSFR
SDNAPELSFTDFFAHRGVLHQFSCVQCPEQNSVVERKHQHLLNVARALMFQSKIPIQFWGECILTTSFIINRTPSSVVEWQTPYFRLHKKEADTHSYGSLVLCVLLPACP
PKG