CuGenDBv2

Lag0008772 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0008772
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Transposable element protein
Genome location: chr9:29625847..29630752
RNA-Seq Expression: Lag0008772
Synteny: Lag0008772
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
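
The genome location field above uses a `chromosome:start..end` notation. As a minimal sketch, the string can be split into its parts and the span length computed as follows. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end, span_length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1

chrom, start, end, span = parse_location("chr9:29625847..29630752")
print(chrom, start, end, span)  # chr9 29625847 29630752 4906
```

Under that assumption the gene spans 4906 bp on chr9.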


Homology
GenBank top hits (e value, % identity, alignment)
KAA3473721.1 retroelement pol polyprotein-like [Gossypium australe] (e value: 4.7e-69; % identity: 66.67)
Query:  FARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKVVHPSRKDWSFRLDEALWAFRTAYKTPLGMSPYRLVYGK
        F RFGTPRA++SDEGTHFVN  L  LL K+G+KH++AT YHPQ NGQAE++N+EIK IL+KVV  +++DWS RLD+ALWA++  YKTPLGMSPYRLV+GK
Subjt:  FARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKVVHPSRKDWSFRLDEALWAFRTAYKTPLGMSPYRLVYGK

Query:  ACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYKNAKIYKEKTKLWHDKKIKSKEFDEKDGRVFKVNGQRVKHYWGE
        ACHLPLELEH+ +WAL++LN DL  A   RMLQLNELEEFR FSY+NAK+ KE+ K WHDK I+ +EF+   G  F+VN QR+KHY+GE
Subjt:  ACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYKNAKIYKEKTKLWHDKKIKSKEFDEKDGRVFKVNGQRVKHYWGE

XP_017221472.1 PREDICTED: uncharacterized protein LOC108198219 [Daucus carota subsp. sativus] (e value: 1.0e-71; % identity: 59.83)
Query:  FARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKVVHPSRKDWSFRLDEALWAFRTAYKTPLGMSPYRLVYGK
        F RFGTPRA++SDEGTHFVN +L   LAKY ++H++AT YHPQ NGQAE+SNREIK IL+KVV+P+RKDWS RLDEALWA+RTAYKTPLGMSPYRLV+GK
Subjt:  FARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKVVHPSRKDWSFRLDEALWAFRTAYKTPLGMSPYRLVYGK

Query:  ACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYKNAKIYKEKTKLWHDKKIKSKEFD-------------------------------
        ACHLP+ELEHK +WALK LNFD+  AG  R LQL+EL+E R FSY+NAK+YKEKTK WHDK I+S+ F+                               
Subjt:  ACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYKNAKIYKEKTKLWHDKKIKSKEFD-------------------------------

Query:  ----------EKDGRVFKVNGQRVKHYWG
                  +  G  FKVNGQR+KHYWG
Subjt:  ----------EKDGRVFKVNGQRVKHYWG

XP_023521407.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785222 [Cucurbita pepo subsp. pepo] (e value: 1.2e-69; % identity: 71.43)
Query:  FARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKVVHPSRKDWSFRLDEALWAFRTAYKTPLGMSPYRLVYGK
        F RFG PRAL+SDEGTHFVN ++  LL +Y +KHR+ATPYHPQ NGQAE+ N EIKSIL+KVV P+RKDWS +LD+A+WA+RTA+KTPLGMSPY+LV+GK
Subjt:  FARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKVVHPSRKDWSFRLDEALWAFRTAYKTPLGMSPYRLVYGK

Query:  ACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYKNAKIYKEKTKLWHDKKIKSKEF--DEKDG
        ACHLPLELEHK FWALKKLNFD +  G +R +QLNELEEFR  +Y+NAK+YKEKTK+WHD KI+ KEF  D K G
Subjt:  ACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYKNAKIYKEKTKLWHDKKIKSKEF--DEKDG

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber] (e value: 8.0e-69; % identity: 55.36)
Query:  FARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKVVHPSRKDWSFRLDEALWAFRTAYKTPLGMSPYRLVYGK
        F RFGTPRA++SDEGTHF N +   LL+KYG+KH+IA  YHPQ NGQAEISNREIK+IL+K V+ +RKDW+ +LD+ALWA+RTA+KTP+GMSPYRLV+GK
Subjt:  FARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKVVHPSRKDWSFRLDEALWAFRTAYKTPLGMSPYRLVYGK

Query:  ACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYKNAKIYKEKTKLWHDKKIKSKEF--------------------------------
        ACHLP+ELEHK +WA+KK N DL  AG  R+LQLNE++EFR  +Y+NAKIYKE+TK WHDK+I  +EF                                
Subjt:  ACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYKNAKIYKEKTKLWHDKKIKSKEF--------------------------------

Query:  ---------DEKDGRVFKVNGQRVKHYWGEEFQ
                  +K G +F+VNGQR+KHY+GE+ +
Subjt:  ---------DEKDGRVFKVNGQRVKHYWGEEFQ

XP_030478317.1 uncharacterized protein LOC115695391 [Cannabis sativa] (e value: 5.9e-72; % identity: 60.78)
Query:  FARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKVVHPSRKDWSFRLDEALWAFRTAYKTPLGMSPYRLVYGK
        F RFGTPRAL+SDEGTHFVN +L  LLAKY +KH+IAT YHPQ NGQAEISNREIK IL+KVV+P+RKDWS RLD+ALWA+RTA+KTPLGMSPYRLVYGK
Subjt:  FARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKVVHPSRKDWSFRLDEALWAFRTAYKTPLGMSPYRLVYGK

Query:  ACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYKNAKIYKEKTKLWHDKKIKSKEFD-------------------------------
        ACHLP+ELEHK +WA KKLN DL  AG  R LQLNELEE R FSY+NAK+YKE TK WHDK+I+ + F+                               
Subjt:  ACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYKNAKIYKEKTKLWHDKKIKSKEFD-------------------------------

Query:  -----------EKDGRVFKVNGQRVKHYWGEE
                   E+  + FKVNGQR+KHY G E
Subjt:  -----------EKDGRVFKVNGQRVKHYWGEE

TrEMBL top hits (e value, % identity, alignment)
A0A2G9FWY3 Reverse transcriptase (e value: 7.3e-68; % identity: 45.34)
Query:  DRVLLAFAVLRSMSIDVGKIISTEIADCWRKKAF-PVFPD----------DLFNPWIPPPPVEREEENDDEEQVAR------FARFGTPRALVSDEGTHF
        DR      + R   + +  I+  E+ D W      P  P           D  + W+    V     N+D + V        F RFGTPRA++SD GTHF
Subjt:  DRVLLAFAVLRSMSIDVGKIISTEIADCWRKKAF-PVFPD----------DLFNPWIPPPPVEREEENDDEEQVAR------FARFGTPRALVSDEGTHF

Query:  VNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKVVHPSRKDWSFRLDEALWAFRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKK
         N     LL+KYG+KH+I+TPYHPQ +GQ E+SNREIK IL+K V  +RKDWS RLDEALWA+RTAYKTP+GMSPYRLV+GKACHLP+ELEH  +WA++K
Subjt:  VNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKVVHPSRKDWSFRLDEALWAFRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKK

Query:  LNFDLSRAGAIRMLQLNELEEFRQFSYKNAKIYKEKTKLWHDKKIKSKEF-----------------------------------------DEKDGR-VF
        LNFD+  AG  R+LQLNEL+EFR  +Y+NAKIYKEK K WH+KKI  + F                                         + K+ R  F
Subjt:  LNFDLSRAGAIRMLQLNELEEFRQFSYKNAKIYKEKTKLWHDKKIKSKEF-----------------------------------------DEKDGR-VF

Query:  KVNGQRVKHYWGEEFQSKYPSL
        KVN QR+KHYWGE    ++ S+
Subjt:  KVNGQRVKHYWGEEFQSKYPSL

A0A2G9HBV9 DNA-directed DNA polymerase (e value: 8.1e-67; % identity: 44.1)
Query:  DRVLLAFAVLRSMSIDVGKIISTEIADCWRKKAFPVFPD-----------DLFNPWIPPPPVEREEENDDEEQVAR------FARFGTPRALVSDEGTHF
        DR      + R   + +  I+  E+ D W      +F             D  + W+    V     N+D + V        F RFGTPRA++S+ GTHF
Subjt:  DRVLLAFAVLRSMSIDVGKIISTEIADCWRKKAFPVFPD-----------DLFNPWIPPPPVEREEENDDEEQVAR------FARFGTPRALVSDEGTHF

Query:  VNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKVVHPSRKDWSFRLDEALWAFRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKK
         N     LL+KYG+KH+I+TPYHPQ +GQ E+SNREIK IL+K V  +RKDWS RLDEALWA+RTA+KTP+GMSPY+LV+GKACHLP+ELEH  +WA++K
Subjt:  VNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKVVHPSRKDWSFRLDEALWAFRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKK

Query:  LNFDLSRAGAIRMLQLNELEEFRQFSYKNAKIYKEKTKLWHDKKIKSKEFD------------------------------------------EKDGRVF
        LNFD+  AG  R+LQLNEL+EFR  +Y+NAKIYKEKTK WHDKKI  + F+                                          E     F
Subjt:  LNFDLSRAGAIRMLQLNELEEFRQFSYKNAKIYKEKTKLWHDKKIKSKEFD------------------------------------------EKDGRVF

Query:  KVNGQRVKHYWGEEFQSKYPSL
        KVN QR+KHYWG      + S+
Subjt:  KVNGQRVKHYWGEEFQSKYPSL

A0A2K3NJZ5 Integrase catalytic domain-containing protein (Fragment) (e value: 2.1e-67; % identity: 44.72)
Query:  DQLSAAVREDRVLLAFAVLRSMSIDVGKIISTEIADCWRKKAFPVFPD-----------DLFNPWIPPPPVEREEENDDEEQVA-----RFARFGTPRAL
        D  +   R DR      + +   +    ++  EI D W       FP            D  + W+        + ND +  V+      F+RFG PRAL
Subjt:  DQLSAAVREDRVLLAFAVLRSMSIDVGKIISTEIADCWRKKAFPVFPD-----------DLFNPWIPPPPVEREEENDDEEQVA-----RFARFGTPRAL

Query:  VSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKVVHPSRKDWSFRLDEALWAFRTAYKTPLGMSPYRLVYGKACHLPLELEH
        +SDEGTHF+N  +  LL KY + HRIATPYHPQ +GQ E+SNR+IK IL+K V+ SRKDWS +LD+ALWA+RTA+KTP+GMSP+++VYGK+CHLPLELEH
Subjt:  VSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKVVHPSRKDWSFRLDEALWAFRTAYKTPLGMSPYRLVYGKACHLPLELEH

Query:  KTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYKNAKIYKEKTKLWHDKKIKSKEF------------------------------------------
        K  WA K LNFDLS+AG  R+LQL+EL+EFR F+Y+NAKI+KEKTK WHDKKI+++EF                                          
Subjt:  KTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYKNAKIYKEKTKLWHDKKIKSKEF------------------------------------------

Query:  DEKDGRVFKVNGQRVKHYWGEE
        D    + FKVNGQR+K Y+G+E
Subjt:  DEKDGRVFKVNGQRVKHYWGEE

A0A4Y1R644 Transposable element protein (e value: 4.7e-67; % identity: 44)
Query:  DQLSAAVREDRVLLAFAVLRSMSIDVGKIISTEIADCWRKKAFPVFPD-----------DLFNPWIPPPPVEREEENDDEEQVAR------FARFGTPRA
        D  +  V+ DR      + R   + +  I+  E+ D W       FP            D  + W+         + +D + V +      F RFGTPRA
Subjt:  DQLSAAVREDRVLLAFAVLRSMSIDVGKIISTEIADCWRKKAFPVFPD-----------DLFNPWIPPPPVEREEENDDEEQVAR------FARFGTPRA

Query:  LVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKVVHPSRKDWSFRLDEALWAFRTAYKTPLGMSPYRLVYGKACHLPLELE
        ++SD G+HF N +   L+ KY I HR++TPYHPQ +GQ EISNREIK IL+KVV+ +RKDW+ +L++ALWA+RTAYKTP+GMSPYRLV+GKACHLP+ELE
Subjt:  LVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKVVHPSRKDWSFRLDEALWAFRTAYKTPLGMSPYRLVYGKACHLPLELE

Query:  HKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYKNAKIYKEKTKLWHDKKIKSKEF---------------------------DEKDGRVFKVNGQR
        H  FWA+KKLNFDL +AG +R  QLNELEE R  SY+NAK+YKE+TK +HD+ I+ KEF                           + KDG  FKVNGQR
Subjt:  HKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYKNAKIYKEKTKLWHDKKIKSKEF---------------------------DEKDGRVFKVNGQR

Query:  VKHYWGEEFQSKYPSLRNSVRFPNF
        +K ++ E  Q ++  +   + FP++
Subjt:  VKHYWGEEFQSKYPSLRNSVRFPNF

A0A5B6VWJ0 Retroelement pol polyprotein-like (e value: 2.3e-69; % identity: 66.67)
Query:  FARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKVVHPSRKDWSFRLDEALWAFRTAYKTPLGMSPYRLVYGK
        F RFGTPRA++SDEGTHFVN  L  LL K+G+KH++AT YHPQ NGQAE++N+EIK IL+KVV  +++DWS RLD+ALWA++  YKTPLGMSPYRLV+GK
Subjt:  FARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKVVHPSRKDWSFRLDEALWAFRTAYKTPLGMSPYRLVYGK

Query:  ACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYKNAKIYKEKTKLWHDKKIKSKEFDEKDGRVFKVNGQRVKHYWGE
        ACHLPLELEH+ +WAL++LN DL  A   RMLQLNELEEFR FSY+NAK+ KE+ K WHDK I+ +EF+   G  F+VN QR+KHY+GE
Subjt:  ACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYKNAKIYKEKTKLWHDKKIKSKEFDEKDGRVFKVNGQRVKHYWGE

SwissProt top hits (e value, % identity, alignment)
P03359 Gag-Pol polyprotein (e value: 8.7e-10; % identity: 33.85)
Query:  DLFNPWIPPPPVEREEENDDEEQVAR--FARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKV-VHPSRKDWS
        D F+ W+   P + E      +++      RFG P+ L SD G  FV  +   L  + GI  ++   Y PQ++GQ E  NR IK  L K+ +    KDW 
Subjt:  DLFNPWIPPPPVEREEENDDEEQVAR--FARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKV-VHPSRKDWS

Query:  FRLDEALWAFRTAYKTP--LGMSPYRLVYG
          L  AL   R    TP   G++PY ++YG
Subjt:  FRLDEALWAFRTAYKTP--LGMSPYRLVYG

P10272 Gag-Pol polyprotein (e value: 1.0e-10; % identity: 32.81)
Query:  DLFNPWIPPPPVEREEENDDEEQVAR--FARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKV-VHPSRKDWS
        D F+ W+   P  +E  +   +++    F RFG P+ + SD G  FV+ +   L    GI  ++   Y PQ++GQ E  NR IK  L K+ +    KDW 
Subjt:  DLFNPWIPPPPVEREEENDDEEQVAR--FARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKV-VHPSRKDWS

Query:  FRLDEALWAFRTAYKTPLGMSPYRLVYG
          L  AL   R       G++PY ++YG
Subjt:  FRLDEALWAFRTAYKTPLGMSPYRLVYG

P21414 Gag-Pol polyprotein (e value: 1.9e-09; % identity: 33.85)
Query:  DLFNPWIPPPPVEREEENDDEEQVAR--FARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKV-VHPSRKDWS
        D F+ W+   P + E      +++      RFG P+ L SD G  FV  +   L  + GI  ++   Y PQ++GQ E  NR IK  L K+ +    KDW 
Subjt:  DLFNPWIPPPPVEREEENDDEEQVAR--FARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKV-VHPSRKDWS

Query:  FRLDEALWAFRTAYKTP--LGMSPYRLVYG
          L  AL   R    TP   G++PY ++YG
Subjt:  FRLDEALWAFRTAYKTP--LGMSPYRLVYG

P31792 Pol polyprotein (Fragment) (e value: 4.6e-11; % identity: 32.81)
Query:  DLFNPWIPPPPVEREEENDDEEQVAR--FARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKV-VHPSRKDWS
        D F+ W+   P  +E  +   +++    F RFG P+ + SD G  FV+ +   L    GI  ++   Y PQ++GQ E  NR IK  L K+ +    KDW 
Subjt:  DLFNPWIPPPPVEREEENDDEEQVAR--FARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKV-VHPSRKDWS

Query:  FRLDEALWAFRTAYKTPLGMSPYRLVYG
          L  AL   R       G++PY ++YG
Subjt:  FRLDEALWAFRTAYKTPLGMSPYRLVYG

Q9TTC1 Gag-Pol polyprotein (e value: 5.6e-09; % identity: 33.08)
Query:  DLFNPWIPPPPVEREEENDDEEQVAR--FARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKV-VHPSRKDWS
        D F+ W+   P + E      +++      RFG P+ L SD G  FV  +   L  + GI  ++   Y PQ++GQ E  NR IK  L K+ +    KDW 
Subjt:  DLFNPWIPPPPVEREEENDDEEQVAR--FARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKV-VHPSRKDWS

Query:  FRLDEALWAFRTAYKTP--LGMSPYRLVYG
          L  AL   R    TP   G++PY +++G
Subjt:  FRLDEALWAFRTAYKTP--LGMSPYRLVYG

Arabidopsis top hits (e value, % identity, alignment)
No hits found
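
In the alignments above, the middle line between each Query and Subjt row follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, identities and positives for a single alignment block can be counted from that middle line alone. The function name `midline_stats` is hypothetical, and the result is per block, so it will not necessarily reproduce the % identity reported for the whole hit.

```python
from typing import Tuple

def midline_stats(midline: str) -> Tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one midline string.

    Letters count as identities; '+' counts as a positive but not an
    identity; every column, including spaces, counts toward the length.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)

# First ten columns of the first GenBank alignment on this page:
# query "FARFGTPRAL" against midline "F RFGTPRA+".
identities, positives, length = midline_stats("F RFGTPRA+")
print(identities, positives, length)  # 8 9 10
```

The midline has to be passed with its leading and internal spaces intact, since each space is one alignment column.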

Sequences
CDS sequence
ATGGCAAAAACGCGAGCGAGAAAAGAGAGAAATAATGAGGAAGAGGAGGTGCCGGTTACGCCGGAAGTGCAAAGAGGGAAAACTAAGAAAAAGAGAACGCCAGAAGAAAA
AGAGGCTAAAAGAATAAGGGGGCAGCAGAGGGCTGAGGAGCAAGAAGCCATTCGAGAAGAAACAGTGGATGACGTGGATACAGAGAGAGCTAAAAATCCTGAGGAAGAAT
CGAGAATTTCTGATACGGTTCAAGAAAAGATTGCTGAGAAAAATCAACAAACAGAAGTTGAGGAGCAGGCCGCAGTCGTAATGCCTGAACCACCAAAGCGTCGCCGCATC
AAACGGAAGGCGGGTCGCGTGAGGGTGATTCAGAACACTCCATCGCCTCCGACGTTGGACTCTGAGGAAGAAAAGAGGGCAGCTGAAAATAAGGCAAAGGAAGAAGAGGC
AAGGAAGGCAGAAGAAGAGATTTTGCGCGAACAAAGAGAAGACAAGGGCAAAGGAATTGCCGAAGCATCGGGTGAGATTGAGGAACCAAGGGTACCGTTCATTCGCTTTG
TCAACGAGCTTGCGAGAGCAAAATACCAAGAGGTGCTGAAGCATGATTTCTTGTTCGAGCGGGGATTTGGCAGTGATTTGCCAAGGTTCTTAGAGTCTGGAATAACGAAC
CTTGGGTGGAGGCAGTTTTGTGCTAAGCCTGAACCTGTCAATGCCAACATTGTTCGAGAATTCTACGCTAATCTTGACGTTAAGGATGACTTTGAAGTTATAGTGCGAGG
AGTGCTTGTACAATTGAGCCCAGAAGCCATTAATAGTTTGTTTGATCTCCAGGATTTTCCGCATGCAATTTTTAATGAGATGATGGTTGCGCCATCGAGCGACCAATTAA
GTGCGGCGGTCCGAGAGGACAGAGTATTGCTTGCCTTTGCCGTCCTTCGCTCAATGAGTATAGATGTTGGAAAAATAATTTCTACTGAGATTGCTGACTGTTGGCGCAAA
AAGGCCTTCCCAGTGTTTCCCGATGACTTATTTAATCCTTGGATTCCGCCCCCACCTGTTGAAAGAGAAGAAGAGAATGATGATGAAGAGCAGGTTGCAAGGTTTGCGCG
GTTTGGGACACCTAGGGCTCTAGTGAGTGATGAGGGTACACATTTTGTTAATAATATCTTAACTAAGCTGTTAGCTAAGTATGGGATTAAGCATAGGATAGCTACCCCTT
ATCACCCACAAGCAAATGGTCAAGCTGAAATTAGTAATAGGGAAATAAAATCGATTTTAAAGAAAGTAGTTCATCCATCTAGAAAGGATTGGTCTTTTAGGTTGGATGAG
GCTCTTTGGGCTTTTAGGACAGCCTATAAGACTCCTCTAGGTATGTCTCCCTATAGGTTAGTATATGGGAAAGCTTGCCATTTACCATTGGAGCTAGAGCATAAAACATT
TTGGGCTTTGAAAAAGTTAAATTTTGATCTAAGCCGTGCAGGAGCAATAAGAATGCTGCAGCTTAATGAATTAGAGGAATTTCGCCAATTTTCTTATAAGAATGCGAAAA
TATATAAAGAAAAAACTAAGCTGTGGCATGACAAGAAAATAAAATCTAAAGAGTTTGATGAAAAAGATGGGAGAGTGTTCAAGGTGAATGGACAACGTGTGAAGCATTAT
TGGGGTGAGGAGTTTCAGTCGAAATATCCTTCCCTAAGGAATTCAGTACGTTTCCCTAACTTCATCTTCTTGCTTCGATCTTCTACTTTCTTTTTCGTTTTCATTCTCTG
TAAAACCCTTGAATCCTTCATGGCAAAAACGAGAGCTAGAAAACAGAGGGAGAGTGAGGAGGAAGAGATACCCGTGACCCCCGAAGTTCAGAAAGTAAAAGCCAAGAAGA
AGAGGACGCTGGAGGAGAAGGAAGCTAAGCGAAGACGACGACAGCAGAGGGCTACAGAGCAAGATGCTATCCAAGAAGAAGCGATGAATGACCCAGATACGGACGAAGTT
CAAAATCCTGCGGTAGAACCGATAGTCACAGATACGGTTCAAGAGGAAAATGCTGAGGAGAATCAAGAACAACAGGTTGATATGGTGAGAGACGAGCAGGCAGACGTAGT
GCCTGAAAGAGGAACCGAGCAGGAGCAAGAAGCTCGTGTTGAGGTAATCATGCCCGAACCACCAAAACGTCGCCGCATTAAGCGAAAGGTCGGCCATATTCAGGACAAAG
TGCGGGAAGAAGCAGAGAAGAAGACTGAGGAAGAGCAGTTGCTCAAGCGCAGGGCAGAGAAGGGCAAAAATGTTGCTGAAGCATCAGAGGAACACAATGCAATAGAAGAA
CAGCAGTTACCATTTGATCTCTTCGTCAACAATTTTGCCAGAGCAAAATACGCTGAGCTTCTGAGAAGGGATTTCTTATTTGAGCGAGGATTCAACGGTGATCTCCCACA
ATTTCTGAGGACAGGTATTGCAGACCACGGCTGGGAGCTATTTTGTGCGAAGCCTGAATCTGTAAACGCATAG
mRNA sequence
ATGGCAAAAACGCGAGCGAGAAAAGAGAGAAATAATGAGGAAGAGGAGGTGCCGGTTACGCCGGAAGTGCAAAGAGGGAAAACTAAGAAAAAGAGAACGCCAGAAGAAAA
AGAGGCTAAAAGAATAAGGGGGCAGCAGAGGGCTGAGGAGCAAGAAGCCATTCGAGAAGAAACAGTGGATGACGTGGATACAGAGAGAGCTAAAAATCCTGAGGAAGAAT
CGAGAATTTCTGATACGGTTCAAGAAAAGATTGCTGAGAAAAATCAACAAACAGAAGTTGAGGAGCAGGCCGCAGTCGTAATGCCTGAACCACCAAAGCGTCGCCGCATC
AAACGGAAGGCGGGTCGCGTGAGGGTGATTCAGAACACTCCATCGCCTCCGACGTTGGACTCTGAGGAAGAAAAGAGGGCAGCTGAAAATAAGGCAAAGGAAGAAGAGGC
AAGGAAGGCAGAAGAAGAGATTTTGCGCGAACAAAGAGAAGACAAGGGCAAAGGAATTGCCGAAGCATCGGGTGAGATTGAGGAACCAAGGGTACCGTTCATTCGCTTTG
TCAACGAGCTTGCGAGAGCAAAATACCAAGAGGTGCTGAAGCATGATTTCTTGTTCGAGCGGGGATTTGGCAGTGATTTGCCAAGGTTCTTAGAGTCTGGAATAACGAAC
CTTGGGTGGAGGCAGTTTTGTGCTAAGCCTGAACCTGTCAATGCCAACATTGTTCGAGAATTCTACGCTAATCTTGACGTTAAGGATGACTTTGAAGTTATAGTGCGAGG
AGTGCTTGTACAATTGAGCCCAGAAGCCATTAATAGTTTGTTTGATCTCCAGGATTTTCCGCATGCAATTTTTAATGAGATGATGGTTGCGCCATCGAGCGACCAATTAA
GTGCGGCGGTCCGAGAGGACAGAGTATTGCTTGCCTTTGCCGTCCTTCGCTCAATGAGTATAGATGTTGGAAAAATAATTTCTACTGAGATTGCTGACTGTTGGCGCAAA
AAGGCCTTCCCAGTGTTTCCCGATGACTTATTTAATCCTTGGATTCCGCCCCCACCTGTTGAAAGAGAAGAAGAGAATGATGATGAAGAGCAGGTTGCAAGGTTTGCGCG
GTTTGGGACACCTAGGGCTCTAGTGAGTGATGAGGGTACACATTTTGTTAATAATATCTTAACTAAGCTGTTAGCTAAGTATGGGATTAAGCATAGGATAGCTACCCCTT
ATCACCCACAAGCAAATGGTCAAGCTGAAATTAGTAATAGGGAAATAAAATCGATTTTAAAGAAAGTAGTTCATCCATCTAGAAAGGATTGGTCTTTTAGGTTGGATGAG
GCTCTTTGGGCTTTTAGGACAGCCTATAAGACTCCTCTAGGTATGTCTCCCTATAGGTTAGTATATGGGAAAGCTTGCCATTTACCATTGGAGCTAGAGCATAAAACATT
TTGGGCTTTGAAAAAGTTAAATTTTGATCTAAGCCGTGCAGGAGCAATAAGAATGCTGCAGCTTAATGAATTAGAGGAATTTCGCCAATTTTCTTATAAGAATGCGAAAA
TATATAAAGAAAAAACTAAGCTGTGGCATGACAAGAAAATAAAATCTAAAGAGTTTGATGAAAAAGATGGGAGAGTGTTCAAGGTGAATGGACAACGTGTGAAGCATTAT
TGGGGTGAGGAGTTTCAGTCGAAATATCCTTCCCTAAGGAATTCAGTACGTTTCCCTAACTTCATCTTCTTGCTTCGATCTTCTACTTTCTTTTTCGTTTTCATTCTCTG
TAAAACCCTTGAATCCTTCATGGCAAAAACGAGAGCTAGAAAACAGAGGGAGAGTGAGGAGGAAGAGATACCCGTGACCCCCGAAGTTCAGAAAGTAAAAGCCAAGAAGA
AGAGGACGCTGGAGGAGAAGGAAGCTAAGCGAAGACGACGACAGCAGAGGGCTACAGAGCAAGATGCTATCCAAGAAGAAGCGATGAATGACCCAGATACGGACGAAGTT
CAAAATCCTGCGGTAGAACCGATAGTCACAGATACGGTTCAAGAGGAAAATGCTGAGGAGAATCAAGAACAACAGGTTGATATGGTGAGAGACGAGCAGGCAGACGTAGT
GCCTGAAAGAGGAACCGAGCAGGAGCAAGAAGCTCGTGTTGAGGTAATCATGCCCGAACCACCAAAACGTCGCCGCATTAAGCGAAAGGTCGGCCATATTCAGGACAAAG
TGCGGGAAGAAGCAGAGAAGAAGACTGAGGAAGAGCAGTTGCTCAAGCGCAGGGCAGAGAAGGGCAAAAATGTTGCTGAAGCATCAGAGGAACACAATGCAATAGAAGAA
CAGCAGTTACCATTTGATCTCTTCGTCAACAATTTTGCCAGAGCAAAATACGCTGAGCTTCTGAGAAGGGATTTCTTATTTGAGCGAGGATTCAACGGTGATCTCCCACA
ATTTCTGAGGACAGGTATTGCAGACCACGGCTGGGAGCTATTTTGTGCGAAGCCTGAATCTGTAAACGCATAG
Protein sequence
MAKTRARKERNNEEEEVPVTPEVQRGKTKKKRTPEEKEAKRIRGQQRAEEQEAIREETVDDVDTERAKNPEEESRISDTVQEKIAEKNQQTEVEEQAAVVMPEPPKRRRI
KRKAGRVRVIQNTPSPPTLDSEEEKRAAENKAKEEEARKAEEEILREQREDKGKGIAEASGEIEEPRVPFIRFVNELARAKYQEVLKHDFLFERGFGSDLPRFLESGITN
LGWRQFCAKPEPVNANIVREFYANLDVKDDFEVIVRGVLVQLSPEAINSLFDLQDFPHAIFNEMMVAPSSDQLSAAVREDRVLLAFAVLRSMSIDVGKIISTEIADCWRK
KAFPVFPDDLFNPWIPPPPVEREEENDDEEQVARFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKSILKKVVHPSRKDWSFRLDE
ALWAFRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYKNAKIYKEKTKLWHDKKIKSKEFDEKDGRVFKVNGQRVKHY
WGEEFQSKYPSLRNSVRFPNFIFLLRSSTFFFVFILCKTLESFMAKTRARKQRESEEEEIPVTPEVQKVKAKKKRTLEEKEAKRRRRQQRATEQDAIQEEAMNDPDTDEV
QNPAVEPIVTDTVQEENAEENQEQQVDMVRDEQADVVPERGTEQEQEARVEVIMPEPPKRRRIKRKVGHIQDKVREEAEKKTEEEQLLKRRAEKGKNVAEASEEHNAIEE
QQLPFDLFVNNFARAKYAELLRRDFLFERGFNGDLPQFLRTGIADHGWELFCAKPESVNA
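
The CDS and protein sequences listed above should be related by codon-by-codon translation. A minimal sketch for checking that is given below; it assumes the standard nuclear genetic code (NCBI translation table 1), which this page does not state explicitly, and the function name `translate` is hypothetical.

```python
# Standard genetic code (NCBI table 1), with codons ordered by
# first, second, then third base over the alphabet T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a sequence pasted
    as wrapped lines can be passed in directly. Codons containing
    characters other than T, C, A, G are translated as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 30 nucleotides of the CDS listed on this page.
print(translate("ATGGCAAAAACGCGAGCGAGAAAAGAGAGA"))  # MAKTRARKER
```

Passing the full CDS block (line breaks included) and comparing the result with the protein block joined into one string would confirm whether the two listings agree.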