; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025178 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025178
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr10:9475173..9483198
RNA-Seq ExpressionLag0025178
SyntenyLag0025178
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]9.1e-8681.63Show/hide
Query:  SGSAGIW---LGHINCNRIERLVKSGLLNQLEDDSLPPCESCLEGKMTKRPFSEKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYLYL
        S +A +W   LGHIN NRIERLVKSG+LNQLED+SLPPCESCLEGKMTKR F+ KG RAK PLELVHSDLCGPMNVKARGGYEYF+SFIDD SRY ++YL
Subjt:  SGSAGIW---LGHINCNRIERLVKSGLLNQLEDDSLPPCESCLEGKMTKRPFSEKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYLYL

Query:  MHHKSEALEKFKEYKTEVENQLGKTIKTLRSDQGGEYMDLQFQDYLIEHGITSQLSAPATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFWDMQL
        +HHKSE+ EKFKEYK EVEN++GKTIKTLRSD+GGEYMD +FQDYLIE GI SQLSAP+TPQQNGVSERR RTLLDMVRSMMSYAQLPDSFW   L
Subjt:  MHHKSEALEKFKEYKTEVENQLGKTIKTLRSDQGGEYMDLQFQDYLIEHGITSQLSAPATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFWDMQL

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]1.3e-8485.71Show/hide
Query:  LGHINCNRIERLVKSGLLNQLEDDSLPPCESCLEGKMTKRPFSEKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYLYLMHHKSEALEK
        LGHIN +RI RLVK+GLLN+L+D SLPPCESCLEGKMTKRPF+ KGYRAKEPLEL+HSDLCGPMNVKARGG+EYF+SFIDD SRY YLYLM HKSEALEK
Subjt:  LGHINCNRIERLVKSGLLNQLEDDSLPPCESCLEGKMTKRPFSEKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYLYLMHHKSEALEK

Query:  FKEYKTEVENQLGKTIKTLRSDQGGEYMDLQFQDYLIEHGITSQLSAPATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFW
        FKEYKTEVEN L K IK LRSD+GGEYMDL+FQDY+IEHGI SQLSAP TPQQNGVSERR RTLLDMVRSMMSYAQLP SFW
Subjt:  FKEYKTEVENQLGKTIKTLRSDQGGEYMDLQFQDYLIEHGITSQLSAPATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFW

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]3.5e-8571.79Show/hide
Query:  IPKEILGGLTGHTEARVRNLQTGRFKAVGRSGSRGLAGLGPIWSGSA--GIWLGHINCNRIERLVKSGLLNQLEDDSLPPCESCLEGKMTKRPFSEKGYR
        + K+   G T H  + ++  +T  FK +  S      G G + S  A     LGHIN +RI RLVK+GLLN+L+D SLPPCESCLEGKMTKRPF+ KGYR
Subjt:  IPKEILGGLTGHTEARVRNLQTGRFKAVGRSGSRGLAGLGPIWSGSA--GIWLGHINCNRIERLVKSGLLNQLEDDSLPPCESCLEGKMTKRPFSEKGYR

Query:  AKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYLYLMHHKSEALEKFKEYKTEVENQLGKTIKTLRSDQGGEYMDLQFQDYLIEHGITSQLSAP
        AKEPLEL+HSDLCGPMNVKARGG+EYF+SFIDD SRY YLYLM HKSEALEKFKEYKTEVEN L K IK LRSD+GGEYMDL+FQDY+IEHGI SQLSAP
Subjt:  AKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYLYLMHHKSEALEKFKEYKTEVENQLGKTIKTLRSDQGGEYMDLQFQDYLIEHGITSQLSAP

Query:  ATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFW
         TPQQNGVSERR RTLLDMVRSMMSYAQLP SFW
Subjt:  ATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFW

TYK02840.1 gag/pol protein [Cucumis melo var. makuwa]3.5e-8571.79Show/hide
Query:  IPKEILGGLTGHTEARVRNLQTGRFKAVGRSGSRGLAGLGPIWSGSA--GIWLGHINCNRIERLVKSGLLNQLEDDSLPPCESCLEGKMTKRPFSEKGYR
        + K+   G T H  + ++  +T  FK +  S      G G + S  A     LGHIN +RI RLVK+GLLN+L+D SLPPCESCLEGKMTKRPF+ KGYR
Subjt:  IPKEILGGLTGHTEARVRNLQTGRFKAVGRSGSRGLAGLGPIWSGSA--GIWLGHINCNRIERLVKSGLLNQLEDDSLPPCESCLEGKMTKRPFSEKGYR

Query:  AKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYLYLMHHKSEALEKFKEYKTEVENQLGKTIKTLRSDQGGEYMDLQFQDYLIEHGITSQLSAP
        AKEPLEL+HSDLCGPMNVKARGG+EYF+SFIDD SRY YLYLM HKSEALEKFKEYKTEVEN L K IK LRSD+GGEYMDL+FQDY+IEHGI SQLSAP
Subjt:  AKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYLYLMHHKSEALEKFKEYKTEVENQLGKTIKTLRSDQGGEYMDLQFQDYLIEHGITSQLSAP

Query:  ATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFW
         TPQQNGVSERR RTLLDMVRSMMSYAQLP SFW
Subjt:  ATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFW

TYK04171.1 gag/pol protein [Cucumis melo var. makuwa]3.5e-8571.79Show/hide
Query:  IPKEILGGLTGHTEARVRNLQTGRFKAVGRSGSRGLAGLGPIWSGSA--GIWLGHINCNRIERLVKSGLLNQLEDDSLPPCESCLEGKMTKRPFSEKGYR
        + K+   G T H  + ++  +T  FK +  S      G G + S  A     LGHIN +RI RLVK+GLLN+L+D SLPPCESCLEGKMTKRPF+ KGYR
Subjt:  IPKEILGGLTGHTEARVRNLQTGRFKAVGRSGSRGLAGLGPIWSGSA--GIWLGHINCNRIERLVKSGLLNQLEDDSLPPCESCLEGKMTKRPFSEKGYR

Query:  AKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYLYLMHHKSEALEKFKEYKTEVENQLGKTIKTLRSDQGGEYMDLQFQDYLIEHGITSQLSAP
        AKEPLEL+HSDLCGPMNVKARGG+EYF+SFIDD SRY YLYLM HKSEALEKFKEYKTEVEN L K IK LRSD+GGEYMDL+FQDY+IEHGI SQLSAP
Subjt:  AKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYLYLMHHKSEALEKFKEYKTEVENQLGKTIKTLRSDQGGEYMDLQFQDYLIEHGITSQLSAP

Query:  ATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFW
         TPQQNGVSERR RTLLDMVRSMMSYAQLP SFW
Subjt:  ATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFW

TrEMBL top hitse value%identityAlignment
A0A5A7TZD0 Gag/pol protein6.4e-8585.71Show/hide
Query:  LGHINCNRIERLVKSGLLNQLEDDSLPPCESCLEGKMTKRPFSEKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYLYLMHHKSEALEK
        LGHIN +RI RLVK+GLLN+L+D SLPPCESCLEGKMTKRPF+ KGYRAKEPLEL+HSDLCGPMNVKARGG+EYF+SFIDD SRY YLYLM HKSEALEK
Subjt:  LGHINCNRIERLVKSGLLNQLEDDSLPPCESCLEGKMTKRPFSEKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYLYLMHHKSEALEK

Query:  FKEYKTEVENQLGKTIKTLRSDQGGEYMDLQFQDYLIEHGITSQLSAPATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFW
        FKEYKTEVEN L K IK LRSD+GGEYMDL+FQDY+IEHGI SQLSAP TPQQNGVSERR RTLLDMVRSMMSYAQLP SFW
Subjt:  FKEYKTEVENQLGKTIKTLRSDQGGEYMDLQFQDYLIEHGITSQLSAPATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFW

A0A5A7UYE8 Gag/pol protein1.7e-8571.79Show/hide
Query:  IPKEILGGLTGHTEARVRNLQTGRFKAVGRSGSRGLAGLGPIWSGSA--GIWLGHINCNRIERLVKSGLLNQLEDDSLPPCESCLEGKMTKRPFSEKGYR
        + K+   G T H  + ++  +T  FK +  S      G G + S  A     LGHIN +RI RLVK+GLLN+L+D SLPPCESCLEGKMTKRPF+ KGYR
Subjt:  IPKEILGGLTGHTEARVRNLQTGRFKAVGRSGSRGLAGLGPIWSGSA--GIWLGHINCNRIERLVKSGLLNQLEDDSLPPCESCLEGKMTKRPFSEKGYR

Query:  AKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYLYLMHHKSEALEKFKEYKTEVENQLGKTIKTLRSDQGGEYMDLQFQDYLIEHGITSQLSAP
        AKEPLEL+HSDLCGPMNVKARGG+EYF+SFIDD SRY YLYLM HKSEALEKFKEYKTEVEN L K IK LRSD+GGEYMDL+FQDY+IEHGI SQLSAP
Subjt:  AKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYLYLMHHKSEALEKFKEYKTEVENQLGKTIKTLRSDQGGEYMDLQFQDYLIEHGITSQLSAP

Query:  ATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFW
         TPQQNGVSERR RTLLDMVRSMMSYAQLP SFW
Subjt:  ATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFW

A0A5D3BUN8 Gag/pol protein1.7e-8571.79Show/hide
Query:  IPKEILGGLTGHTEARVRNLQTGRFKAVGRSGSRGLAGLGPIWSGSA--GIWLGHINCNRIERLVKSGLLNQLEDDSLPPCESCLEGKMTKRPFSEKGYR
        + K+   G T H  + ++  +T  FK +  S      G G + S  A     LGHIN +RI RLVK+GLLN+L+D SLPPCESCLEGKMTKRPF+ KGYR
Subjt:  IPKEILGGLTGHTEARVRNLQTGRFKAVGRSGSRGLAGLGPIWSGSA--GIWLGHINCNRIERLVKSGLLNQLEDDSLPPCESCLEGKMTKRPFSEKGYR

Query:  AKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYLYLMHHKSEALEKFKEYKTEVENQLGKTIKTLRSDQGGEYMDLQFQDYLIEHGITSQLSAP
        AKEPLEL+HSDLCGPMNVKARGG+EYF+SFIDD SRY YLYLM HKSEALEKFKEYKTEVEN L K IK LRSD+GGEYMDL+FQDY+IEHGI SQLSAP
Subjt:  AKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYLYLMHHKSEALEKFKEYKTEVENQLGKTIKTLRSDQGGEYMDLQFQDYLIEHGITSQLSAP

Query:  ATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFW
         TPQQNGVSERR RTLLDMVRSMMSYAQLP SFW
Subjt:  ATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFW

A0A5D3BWT8 Gag/pol protein1.7e-8571.79Show/hide
Query:  IPKEILGGLTGHTEARVRNLQTGRFKAVGRSGSRGLAGLGPIWSGSA--GIWLGHINCNRIERLVKSGLLNQLEDDSLPPCESCLEGKMTKRPFSEKGYR
        + K+   G T H  + ++  +T  FK +  S      G G + S  A     LGHIN +RI RLVK+GLLN+L+D SLPPCESCLEGKMTKRPF+ KGYR
Subjt:  IPKEILGGLTGHTEARVRNLQTGRFKAVGRSGSRGLAGLGPIWSGSA--GIWLGHINCNRIERLVKSGLLNQLEDDSLPPCESCLEGKMTKRPFSEKGYR

Query:  AKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYLYLMHHKSEALEKFKEYKTEVENQLGKTIKTLRSDQGGEYMDLQFQDYLIEHGITSQLSAP
        AKEPLEL+HSDLCGPMNVKARGG+EYF+SFIDD SRY YLYLM HKSEALEKFKEYKTEVEN L K IK LRSD+GGEYMDL+FQDY+IEHGI SQLSAP
Subjt:  AKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYLYLMHHKSEALEKFKEYKTEVENQLGKTIKTLRSDQGGEYMDLQFQDYLIEHGITSQLSAP

Query:  ATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFW
         TPQQNGVSERR RTLLDMVRSMMSYAQLP SFW
Subjt:  ATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFW

E2GK51 Gag/pol protein (Fragment)4.4e-8681.63Show/hide
Query:  SGSAGIW---LGHINCNRIERLVKSGLLNQLEDDSLPPCESCLEGKMTKRPFSEKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYLYL
        S +A +W   LGHIN NRIERLVKSG+LNQLED+SLPPCESCLEGKMTKR F+ KG RAK PLELVHSDLCGPMNVKARGGYEYF+SFIDD SRY ++YL
Subjt:  SGSAGIW---LGHINCNRIERLVKSGLLNQLEDDSLPPCESCLEGKMTKRPFSEKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYLYL

Query:  MHHKSEALEKFKEYKTEVENQLGKTIKTLRSDQGGEYMDLQFQDYLIEHGITSQLSAPATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFWDMQL
        +HHKSE+ EKFKEYK EVEN++GKTIKTLRSD+GGEYMD +FQDYLIE GI SQLSAP+TPQQNGVSERR RTLLDMVRSMMSYAQLPDSFW   L
Subjt:  MHHKSEALEKFKEYKTEVENQLGKTIKTLRSDQGGEYMDLQFQDYLIEHGITSQLSAPATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFWDMQL

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.9e-2635.45Show/hide
Query:  GHINCNRIERLVK------SGLLNQLEDDSLPPCESCLEGKMTKRPFSEKGYRA--KEPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYLYLMHH
        GHI+  ++  + +        LLN LE  S   CE CL GK  + PF +   +   K PL +VHSD+CGP+         YFV F+D  + Y   YL+ +
Subjt:  GHINCNRIERLVK------SGLLNQLEDDSLPPCESCLEGKMTKRPFSEKGYRA--KEPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYLYLMHH

Query:  KSEALEKFKEYKTEVENQLGKTIKTLRSDQGGEYMDLQFQDYLIEHGITSQLSAPATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFW
        KS+    F+++  + E      +  L  D G EY+  + + + ++ GI+  L+ P TPQ NGVSER  RT+ +  R+M+S A+L  SFW
Subjt:  KSEALEKFKEYKTEVENQLGKTIKTLRSDQGGEYMDLQFQDYLIEHGITSQLSAPATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFW

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-3940Show/hide
Query:  SAGIW---LGHINCNRIERLVKSGLLNQLEDDSLPPCESCLEGKMTKRPFSEKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYLYLMH
        S  +W   +GH++   ++ L K  L++  +  ++ PC+ CL GK  +  F     R    L+LV+SD+CGPM +++ GG +YFV+FIDD SR  ++Y++ 
Subjt:  SAGIW---LGHINCNRIERLVKSGLLNQLEDDSLPPCESCLEGKMTKRPFSEKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYLYLMH

Query:  HKSEALEKFKEYKTEVENQLGKTIKTLRSDQGGEYMDLQFQDYLIEHGITSQLSAPATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFW
         K +  + F+++   VE + G+ +K LRSD GGEY   +F++Y   HGI  + + P TPQ NGV+ER  RT+++ VRSM+  A+LP SFW
Subjt:  HKSEALEKFKEYKTEVENQLGKTIKTLRSDQGGEYMDLQFQDYLIEHGITSQLSAPATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFW

Q07791 Transposon Ty2-DR3 Gag-Pol polyprotein6.9e-2029.08Show/hide
Query:  LGHINCNRIERLVKSGLLNQLEDDSLP-------PCESCLEGKMTKRPFSEKGYRAK-----EPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYL
        LGH N   I++ +K   +  L++  +         C  CL GK TK     KG R K     EP + +H+D+ GP++   +    YF+SF D+ +R+ ++
Subjt:  LGHINCNRIERLVKSGLLNQLEDDSLP-------PCESCLEGKMTKRPFSEKGYRAK-----EPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYL

Query:  YLMHHKSE--ALEKFKEYKTEVENQLGKTIKTLRSDQGGEYMDLQFQDYLIEHGITSQLSAPATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFW
        Y +H + E   L  F      ++NQ    +  ++ D+G EY +     +    GIT+  +  A  + +GV+ER  RTLL+  R+++  + LP+  W
Subjt:  YLMHHKSE--ALEKFKEYKTEVENQLGKTIKTLRSDQGGEYMDLQFQDYLIEHGITSQLSAPATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.6e-2435.37Show/hide
Query:  CESCLEGKMTKRPFSEKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYLYLMHHKSEALEKFKEYKTEVENQLGKTIKTLRSDQGGEYM
        C  CL  K  K PFS+    +  PLE ++SD+     + +   Y Y+V F+D  +RY +LY +  KS+  E F  +K  +EN+    I T  SD GGE++
Subjt:  CESCLEGKMTKRPFSEKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYLYLMHHKSEALEKFKEYKTEVENQLGKTIKTLRSDQGGEYM

Query:  DLQFQDYLIEHGITSQLSAPATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFWDMQLRLPYTF
         L   +Y  +HGI+   S P TP+ NG+SER++R +++   +++S+A +P ++W      PY F
Subjt:  DLQFQDYLIEHGITSQLSAPATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFWDMQLRLPYTF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.4e-2534.72Show/hide
Query:  LGHINCNRIERLVKSGLLNQLE-DDSLPPCESCLEGKMTKRPFSEKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYLYLMHHKSEALE
        LGH +   +  ++ +  L  L     L  C  C   K  K PFS     + +PLE ++SD+     + +   Y Y+V F+D  +RY +LY +  KS+  +
Subjt:  LGHINCNRIERLVKSGLLNQLE-DDSLPPCESCLEGKMTKRPFSEKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYLYLMHHKSEALE

Query:  KFKEYKTEVENQLGKTIKTLRSDQGGEYMDLQFQDYLIEHGITSQLSAPATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFWDMQLRLPYTF
         F  +K+ VEN+    I TL SD GGE++ L  +DYL +HGI+   S P TP+ NG+SER++R +++M  +++S+A +P ++W      PY F
Subjt:  KFKEYKTEVENQLGKTIKTLRSDQGGEYMDLQFQDYLIEHGITSQLSAPATPQQNGVSERRYRTLLDMVRSMMSYAQLPDSFWDMQLRLPYTF

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein4.0e-0740.91Show/hide
Query:  LGHINCNRIERLVKSGLLNQLEDDSLPPCESCLEGKMTKRPFSEKGYRAKEPLELVHSDLCGPMNV
        L H++   +E LVK G L+  +  SL  CE C+ GK  +  FS   +  K PL+ VHSDL G  +V
Subjt:  LGHINCNRIERLVKSGLLNQLEDDSLPPCESCLEGKMTKRPFSEKGYRAKEPLELVHSDLCGPMNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATTTGAGGGAGAAATTGAGCTTCGATTACCCTCAGGCTCATCCTGGGGCTTTTGTTTGGAGTGGATTATGGGAGTGTGATGAAGATAAAGAAGAAGAAGAATTTGG
GGATTCCAGTAGGGCACACGAGGTGATTGCTAGAATTTGGGCTCCTATTAAGGTGTCTTTTAATGGGTCTTCGTCTTTTCAGCCGTTTGCTGTGAAATTTTGTAGCCCTG
CCTCAAGAGATAGAAGAGGAGGGAGTCTTGCACCGGAGAGGAGAGGTCCATTAGGTCCCATCGGTAGCTCATTTAGGGCGTTGAGGGACCCATACCCAAGGAAATCCTTG
GGTGGCCTAACTGGTCACACTGAAGGGACCATACCCAAGGAAATCCTTGGTGGCCTAACTGGTCACACTGAAGCACGCGTTAGAAATTTGCAAACAGGACGGTTCAAGGC
TGTTGGGCGGTCCGGTTCACGCGGTCTGGCTGGGCTAGGACCGATTTGGTCCGGTTCAGCTGGAATTTGGCTCGGCCACATAAATTGCAATAGAATTGAAAGGTTGGTAA
AGAGTGGACTACTAAATCAGTTAGAAGATGACTCGTTACCACCATGTGAGTCTTGCCTCGAGGGAAAAATGACAAAACGACCTTTTTCAGAAAAAGGTTATAGGGCCAAA
GAACCCTTAGAACTCGTGCACTCGGATCTCTGTGGTCCTATGAATGTCAAAGCACGAGGAGGGTATGAATATTTCGTCAGCTTCATTGATGATTGTTCAAGGTATGACTA
TCTTTACCTAATGCATCATAAGTCTGAAGCCCTTGAAAAGTTCAAAGAGTACAAGACCGAGGTTGAGAACCAATTAGGTAAAACGATTAAAACACTACGATCAGATCAAG
GTGGAGAATATATGGACTTACAATTCCAAGACTATTTGATAGAACATGGAATTACGTCTCAACTCTCAGCCCCTGCTACACCACAACAAAATGGTGTATCAGAGAGGAGA
TATCGAACTCTGTTAGACATGGTTCGATCAATGATGAGCTATGCTCAGTTGCCTGATTCGTTCTGGGATATGCAGTTGAGACTGCCGTATACATTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTATTTGAGGGAGAAATTGAGCTTCGATTACCCTCAGGCTCATCCTGGGGCTTTTGTTTGGAGTGGATTATGGGAGTGTGATGAAGATAAAGAAGAAGAAGAATTTGG
GGATTCCAGTAGGGCACACGAGGTGATTGCTAGAATTTGGGCTCCTATTAAGGTGTCTTTTAATGGGTCTTCGTCTTTTCAGCCGTTTGCTGTGAAATTTTGTAGCCCTG
CCTCAAGAGATAGAAGAGGAGGGAGTCTTGCACCGGAGAGGAGAGGTCCATTAGGTCCCATCGGTAGCTCATTTAGGGCGTTGAGGGACCCATACCCAAGGAAATCCTTG
GGTGGCCTAACTGGTCACACTGAAGGGACCATACCCAAGGAAATCCTTGGTGGCCTAACTGGTCACACTGAAGCACGCGTTAGAAATTTGCAAACAGGACGGTTCAAGGC
TGTTGGGCGGTCCGGTTCACGCGGTCTGGCTGGGCTAGGACCGATTTGGTCCGGTTCAGCTGGAATTTGGCTCGGCCACATAAATTGCAATAGAATTGAAAGGTTGGTAA
AGAGTGGACTACTAAATCAGTTAGAAGATGACTCGTTACCACCATGTGAGTCTTGCCTCGAGGGAAAAATGACAAAACGACCTTTTTCAGAAAAAGGTTATAGGGCCAAA
GAACCCTTAGAACTCGTGCACTCGGATCTCTGTGGTCCTATGAATGTCAAAGCACGAGGAGGGTATGAATATTTCGTCAGCTTCATTGATGATTGTTCAAGGTATGACTA
TCTTTACCTAATGCATCATAAGTCTGAAGCCCTTGAAAAGTTCAAAGAGTACAAGACCGAGGTTGAGAACCAATTAGGTAAAACGATTAAAACACTACGATCAGATCAAG
GTGGAGAATATATGGACTTACAATTCCAAGACTATTTGATAGAACATGGAATTACGTCTCAACTCTCAGCCCCTGCTACACCACAACAAAATGGTGTATCAGAGAGGAGA
TATCGAACTCTGTTAGACATGGTTCGATCAATGATGAGCTATGCTCAGTTGCCTGATTCGTTCTGGGATATGCAGTTGAGACTGCCGTATACATTTTGA
Protein sequenceShow/hide protein sequence
MYLREKLSFDYPQAHPGAFVWSGLWECDEDKEEEEFGDSSRAHEVIARIWAPIKVSFNGSSSFQPFAVKFCSPASRDRRGGSLAPERRGPLGPIGSSFRALRDPYPRKSL
GGLTGHTEGTIPKEILGGLTGHTEARVRNLQTGRFKAVGRSGSRGLAGLGPIWSGSAGIWLGHINCNRIERLVKSGLLNQLEDDSLPPCESCLEGKMTKRPFSEKGYRAK
EPLELVHSDLCGPMNVKARGGYEYFVSFIDDCSRYDYLYLMHHKSEALEKFKEYKTEVENQLGKTIKTLRSDQGGEYMDLQFQDYLIEHGITSQLSAPATPQQNGVSERR
YRTLLDMVRSMMSYAQLPDSFWDMQLRLPYTF