; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000444 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000444
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr4:7455825..7459292
RNA-Seq ExpressionLag0000444
SyntenyLag0000444
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]1.5e-8383.85Show/hide
Query:  MLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKVNKR
        MLKSIRILLSIAT+YDYEIWQMDVKTAFLNGNLEE+IFMSQPEGFI QGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS+GFDQNVDEPCVYKK+NK 
Subjt:  MLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKVNKR

Query:  KVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQAT---SFVFPSLFENKPKPLAPLPPSQHIS
        KVAFL+LYVDDILLIGNDVGYLTD+K WLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQAT     +     +N  K L P     H+S
Subjt:  KVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQAT---SFVFPSLFENKPKPLAPLPPSQHIS

KAA0033121.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-8181.77Show/hide
Query:  MLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKVNKR
        MLKSIRILLSIAT+YDYEIW+MDV TAFLNGNLEE+IFMSQPEGFI QGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS+GF+QNVDEPCVYKK+NK 
Subjt:  MLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKVNKR

Query:  KVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQAT---SFVFPSLFENKPKPLAPLPPSQHIS
        KV FL+LYVDDILLIGNDVGYLTD+K WLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQAT     +     +N  K L P     H+S
Subjt:  KVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQAT---SFVFPSLFENKPKPLAPLPPSQHIS

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]2.1e-8282.81Show/hide
Query:  MLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKVNKR
        MLKSIRILLSIA +YDYEIWQMDVKTAFLNGNLEE+IFMSQPEGFI QGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS+GFDQNVDEPCVYKK+NK 
Subjt:  MLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKVNKR

Query:  KVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQAT---SFVFPSLFENKPKPLAPLPPSQHIS
        KVAFL+LYVDDILLIGNDVGYLTD+K WLAAQFQMKDLGE QYVLGIQIIRDRKNKTLALSQAT     +     +N  K L P     H+S
Subjt:  KVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQAT---SFVFPSLFENKPKPLAPLPPSQHIS

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]1.5e-8383.85Show/hide
Query:  MLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKVNKR
        MLKSIRILLSIAT+YDYEIWQMDVKTAFLNGNLEE+IFMSQPEGFI QGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS+GFDQNVDEPCVYKK+NK 
Subjt:  MLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKVNKR

Query:  KVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQAT---SFVFPSLFENKPKPLAPLPPSQHIS
        KVAFL+LYVDDILLIGNDVGYLTD+K WLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQAT     +     +N  K L P     H+S
Subjt:  KVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQAT---SFVFPSLFENKPKPLAPLPPSQHIS

TYK15984.1 gag/pol protein [Cucumis melo var. makuwa]2.3e-8484.38Show/hide
Query:  MLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKVNKR
        MLKSIRILLSIAT+YDYEIWQMDVKTAFLNGNLEE+IFMSQPEGFI QGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS+GFDQNVDEPCVYKK+NK 
Subjt:  MLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKVNKR

Query:  KVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQAT---SFVFPSLFENKPKPLAPLPPSQHIS
        KVAFL+LYVDDILLIGNDVGYLTD+K WLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQAT     +   L +N  K L P     H+S
Subjt:  KVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQAT---SFVFPSLFENKPKPLAPLPPSQHIS

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein1.0e-8282.81Show/hide
Query:  MLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKVNKR
        MLKSIRILLSIA +YDYEIWQMDVKTAFLNGNLEE+IFMSQPEGFI QGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS+GFDQNVDEPCVYKK+NK 
Subjt:  MLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKVNKR

Query:  KVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQAT---SFVFPSLFENKPKPLAPLPPSQHIS
        KVAFL+LYVDDILLIGNDVGYLTD+K WLAAQFQMKDLGE QYVLGIQIIRDRKNKTLALSQAT     +     +N  K L P     H+S
Subjt:  KVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQAT---SFVFPSLFENKPKPLAPLPPSQHIS

A0A5A7TZD0 Gag/pol protein7.2e-8483.85Show/hide
Query:  MLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKVNKR
        MLKSIRILLSIAT+YDYEIWQMDVKTAFLNGNLEE+IFMSQPEGFI QGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS+GFDQNVDEPCVYKK+NK 
Subjt:  MLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKVNKR

Query:  KVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQAT---SFVFPSLFENKPKPLAPLPPSQHIS
        KVAFL+LYVDDILLIGNDVGYLTD+K WLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQAT     +     +N  K L P     H+S
Subjt:  KVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQAT---SFVFPSLFENKPKPLAPLPPSQHIS

A0A5A7UYE8 Gag/pol protein7.2e-8483.85Show/hide
Query:  MLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKVNKR
        MLKSIRILLSIAT+YDYEIWQMDVKTAFLNGNLEE+IFMSQPEGFI QGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS+GFDQNVDEPCVYKK+NK 
Subjt:  MLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKVNKR

Query:  KVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQAT---SFVFPSLFENKPKPLAPLPPSQHIS
        KVAFL+LYVDDILLIGNDVGYLTD+K WLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQAT     +     +N  K L P     H+S
Subjt:  KVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQAT---SFVFPSLFENKPKPLAPLPPSQHIS

A0A5D3CYF4 Gag/pol protein1.1e-8484.38Show/hide
Query:  MLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKVNKR
        MLKSIRILLSIAT+YDYEIWQMDVKTAFLNGNLEE+IFMSQPEGFI QGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS+GFDQNVDEPCVYKK+NK 
Subjt:  MLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKVNKR

Query:  KVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQAT---SFVFPSLFENKPKPLAPLPPSQHIS
        KVAFL+LYVDDILLIGNDVGYLTD+K WLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQAT     +   L +N  K L P     H+S
Subjt:  KVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQAT---SFVFPSLFENKPKPLAPLPPSQHIS

A0A5D3CZY3 Gag/pol protein6.7e-8281.77Show/hide
Query:  MLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKVNKR
        MLKSIRILLSIAT+YDYEIW+MDV TAFLNGNLEE+IFMSQPEGFI QGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS+GF+QNVDEPCVYKK+NK 
Subjt:  MLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKVNKR

Query:  KVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQAT---SFVFPSLFENKPKPLAPLPPSQHIS
        KV FL+LYVDDILLIGNDVGYLTD+K WLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQAT     +     +N  K L P     H+S
Subjt:  KVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQAT---SFVFPSLFENKPKPLAPLPPSQHIS

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.9e-2638.41Show/hide
Query:  LKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVY--KKVNK
        + S R +LS+   Y+ ++ QMDVKTAFLNG L+E I+M  P+G         VCKLN++IYGLKQA+R W   F+ A+K   F  +  + C+Y   K N 
Subjt:  LKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVY--KKVNK

Query:  RKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQA
         +  +++LYVDD+++   D+  + + K++L  +F+M DL E ++ +GI+I  + +   + LSQ+
Subjt:  RKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQA

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.7e-4049.38Show/hide
Query:  LKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVY-KKVNKR
        + SIR +LS+A   D E+ Q+DVKTAFL+G+LEE I+M QPEGF   G++  VCKLN+S+YGLKQA R W ++FD+ +KS  + +   +PCVY K+ ++ 
Subjt:  LKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVY-KKVNKR

Query:  KVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQ
            L+LYVDD+L++G D G +  +K  L+  F MKDLG AQ +LG++I+R+R ++ L LSQ
Subjt:  KVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQ

P25600 Putative transposon Ty5-1 protein YCL074W8.0e-1634.38Show/hide
Query:  MDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKVNKRKVAFLILYVDDILLIGNDVGY
        MDV TAFLN  ++E I++ QP GF+ +     V +L   +YGLKQA   WN   +  +K  GF ++  E  +Y +       ++ +YVDD+L+       
Subjt:  MDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKVNKRKVAFLILYVDDILLIGNDVGY

Query:  LTDIKKWLAAQFQMKDLGEAQYVLGIQI
           +K+ L   + MKDLG+    LG+ I
Subjt:  LTDIKKWLAAQFQMKDLGEAQYVLGIQI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.6e-2538.51Show/hide
Query:  SIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKVNKRKVA
        SIRI+L +A    + I Q+DV  AFL G L ++++MSQP GFI + +   VCKL +++YGLKQA R+W +     + + GF  +V +  ++     + + 
Subjt:  SIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKVNKRKVA

Query:  FLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIR
        ++++YVDDIL+ GND   L +    L+ +F +KD  E  Y LGI+  R
Subjt:  FLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.6e-2437.16Show/hide
Query:  SIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKVNKRKVA
        SIRI+L +A    + I Q+DV  AFL G L + ++MSQP GF+ + +   VC+L ++IYGLKQA R+W +   T + + GF  ++ +  ++     R + 
Subjt:  SIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKVNKRKVA

Query:  FLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIR
        ++++YVDDIL+ GND   L      L+ +F +K+  +  Y LGI+  R
Subjt:  FLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIR

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.1e-2737.66Show/hide
Query:  LKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIA-QGQE---QKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKV
        L S++++L+I+  Y++ + Q+D+  AFLNG+L+E I+M  P G+ A QG       VC L +SIYGLKQASR W ++F   +  FGF Q+  +   + K+
Subjt:  LKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIA-QGQE---QKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKV

Query:  NKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIR
               +++YVDDI++  N+   + ++K  L + F+++DLG  +Y LG++I R
Subjt:  NKRKVAFLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIR

ATMG00810.1 DNA/RNA polymerases superfamily protein6.5e-0552.17Show/hide
Query:  FLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQI
        +L+LYVDDILL G+    L  +   L++ F MKDLG   Y LGIQI
Subjt:  FLILYVDDILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAAAGTCCATAAGAATTCTCTTATCCATAGCCACATATTATGACTATGAAATATGGCAAATGGACGTCAAGACTGCCTTTCTGAATGGTAATCTTGAAGAGAATAT
CTTCATGTCTCAACCCGAAGGGTTCATAGCCCAAGGTCAGGAACAAAAGGTTTGTAAGCTCAATCGATCCATTTATGGATTGAAACAAGCATCCAGATCATGGAACATTA
GATTTGATACTGCAATCAAATCTTTTGGTTTTGACCAAAACGTTGATGAGCCTTGTGTGTACAAGAAAGTCAACAAAAGGAAAGTAGCTTTTCTAATACTTTATGTAGAC
GATATCCTACTCATTGGGAATGATGTAGGATACCTCACTGACATTAAGAAATGGCTAGCAGCCCAATTCCAAATGAAAGATCTGGGAGAGGCTCAATACGTTCTGGGAAT
CCAAATAATTAGGGATCGTAAGAACAAAACGCTAGCTCTGTCTCAAGCAACCAGCTTCGTTTTTCCCTCTCTCTTCGAAAACAAACCCAAACCCTTGGCGCCGCTTCCTC
CCTCGCAGCACATTTCCTCCCTCATCGACGGCGCTTCCTCCCTCATCGACGGCAACCCTCGACGACGAGCTCACCGACGGCGTCCGAAGATTTCTCCCTCACCGACGTCG
ACTGTCTTCTCGCATTCCGATTCCGACGAAGTCTTTGCTGACGACGACGCGTGGCCAGCGAACGGCGCGGTCAGGACTTTGGCTGCTGAAGGCGATCTTGTTGAAGGTTT
TGCAACTTCTAAATCCTCATCAAGCGTGAGAGAGCCTGTCGCGATTATTTACTCCGACAACAAGTCGCATCCGGATTCCACTGCTTCGGTTTGGCAGGGCAACGATGGCA
TCCAAGGTGCACAATCTCTTGGAGTAATAGAGCGTAAGTCTACTCGAGTATTGTCCACGACTGATGAGGAAGGTCGTTCTAAAAGTGAGAATCCCATCAAGCAGGTTACG
GACTCCATTGAGCGGGCAAATATCATTCCTATGATGAAATGGATATTCTCGAAAAAGGAGTTCAAAGAGGAACTTGGGAAAGAGCGTTTTAAGGAGTTGTGTACTTATTT
TGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTAAAGTCCATAAGAATTCTCTTATCCATAGCCACATATTATGACTATGAAATATGGCAAATGGACGTCAAGACTGCCTTTCTGAATGGTAATCTTGAAGAGAATAT
CTTCATGTCTCAACCCGAAGGGTTCATAGCCCAAGGTCAGGAACAAAAGGTTTGTAAGCTCAATCGATCCATTTATGGATTGAAACAAGCATCCAGATCATGGAACATTA
GATTTGATACTGCAATCAAATCTTTTGGTTTTGACCAAAACGTTGATGAGCCTTGTGTGTACAAGAAAGTCAACAAAAGGAAAGTAGCTTTTCTAATACTTTATGTAGAC
GATATCCTACTCATTGGGAATGATGTAGGATACCTCACTGACATTAAGAAATGGCTAGCAGCCCAATTCCAAATGAAAGATCTGGGAGAGGCTCAATACGTTCTGGGAAT
CCAAATAATTAGGGATCGTAAGAACAAAACGCTAGCTCTGTCTCAAGCAACCAGCTTCGTTTTTCCCTCTCTCTTCGAAAACAAACCCAAACCCTTGGCGCCGCTTCCTC
CCTCGCAGCACATTTCCTCCCTCATCGACGGCGCTTCCTCCCTCATCGACGGCAACCCTCGACGACGAGCTCACCGACGGCGTCCGAAGATTTCTCCCTCACCGACGTCG
ACTGTCTTCTCGCATTCCGATTCCGACGAAGTCTTTGCTGACGACGACGCGTGGCCAGCGAACGGCGCGGTCAGGACTTTGGCTGCTGAAGGCGATCTTGTTGAAGGTTT
TGCAACTTCTAAATCCTCATCAAGCGTGAGAGAGCCTGTCGCGATTATTTACTCCGACAACAAGTCGCATCCGGATTCCACTGCTTCGGTTTGGCAGGGCAACGATGGCA
TCCAAGGTGCACAATCTCTTGGAGTAATAGAGCGTAAGTCTACTCGAGTATTGTCCACGACTGATGAGGAAGGTCGTTCTAAAAGTGAGAATCCCATCAAGCAGGTTACG
GACTCCATTGAGCGGGCAAATATCATTCCTATGATGAAATGGATATTCTCGAAAAAGGAGTTCAAAGAGGAACTTGGGAAAGAGCGTTTTAAGGAGTTGTGTACTTATTT
TGTTTAG
Protein sequenceShow/hide protein sequence
MLKSIRILLSIATYYDYEIWQMDVKTAFLNGNLEENIFMSQPEGFIAQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKVNKRKVAFLILYVD
DILLIGNDVGYLTDIKKWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATSFVFPSLFENKPKPLAPLPPSQHISSLIDGASSLIDGNPRRRAHRRRPKISPSPTS
TVFSHSDSDEVFADDDAWPANGAVRTLAAEGDLVEGFATSKSSSSVREPVAIIYSDNKSHPDSTASVWQGNDGIQGAQSLGVIERKSTRVLSTTDEEGRSKSENPIKQVT
DSIERANIIPMMKWIFSKKEFKEELGKERFKELCTYFV