; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031502 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031502
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionzf-RVT domain-containing protein
Genome locationchr11:9302222..9303263
RNA-Seq ExpressionLag0031502
SyntenyLag0031502
Gene Ontology termsNA
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_017250619.1 PREDICTED: uncharacterized protein LOC108221234 [Daucus carota subsp. sativus]1.8e-2928.16Show/hide
Query:  MGNGQSIYMFKDPWIPRPITFKVVSHFDPSMVDVKVANFITPTVQWDEAKLQQYVVWEDVEVIKRLPIS-MSTPDKWIWHYESRGDYFVKSGYKHSMMNR
        +GNGQS   FKDPW+ RP +F  ++    S  +VKV  +IT    W+   ++Q  +  D+++I  +P+S     D W WHY S+G+Y VKSGYK      
Subjt:  MGNGQSIYMFKDPWIPRPITFKVVSHFDPSMVDVKVANFITPTVQWDEAKLQQYVVWEDVEVIKRLPIS-MSTPDKWIWHYESRGDYFVKSGYKHSMMNR

Query:  QEASLSESGKEASWWKTVW-----------KLRNHH-------------VPVNMGCLVCHKEMETTDHALFRCPRAQEVWEVLVPTIMMDHWDQLDIKDR
        ++ S S       WWK  W             R +H             + ++  C +C    ++  HA+F CP AQEVWE++    ++   +++  KD 
Subjt:  QEASLSESGKEASWWKTVW-----------KLRNHH-------------VPVNMGCLVCHKEMETTDHALFRCPRAQEVWEVLVPTIMMDHWDQLDIKDR

Query:  WLSLVDNL-KQTLERISVGAWAIWNDRNSLHHNRQIPSPSVRGEWVQEYLAD----YCSRNSSGSSFVQSEEDVHKIISGGEDIIMHIDASFMDEKSNCG
         L + + L K  ++ + +  W IW +RN L H ++  +PS    W+  Y  +    Y S N + S          + +S G  + +    S   E+   G
Subjt:  WLSLVDNL-KQTLERISVGAWAIWNDRNSLHHNRQIPSPSVRGEWVQEYLAD----YCSRNSSGSSFVQSEEDVHKIISGGEDIIMHIDASFMDEKSNCG

Query:  IGVVMCNNQ
          ++  NN+
Subjt:  IGVVMCNNQ

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]1.3e-3531.94Show/hide
Query:  MGNGQSIYMFKDPWIPRPITFKVVSHFDPSMVDVKVANFITPTVQWDEAKLQQYVVWEDVEVIKRLPI-SMSTPDKWIWHYESRGDYFVKSGYKHSMMNR
        +GNG +I  F DPW+PRP TFK +  F+   +D  VA+FIT    WD   +      ED ++I  +PI S +  D W+WHY+ RG+Y V+SGYK  M  +
Subjt:  MGNGQSIYMFKDPWIPRPITFKVVSHFDPSMVDVKVANFITPTVQWDEAKLQQYVVWEDVEVIKRLPI-SMSTPDKWIWHYESRGDYFVKSGYKHSMMNR

Query:  QEASLSESGKEASWWKTVWKL-------------RNHHVPVNM-----------GCLVCHKEMETTDHALFRCPRAQEVWEVLVPTI-MMDHWDQLDIKD
          A+ + +    + W ++WKL              + H+P               C +C    E+  HA F C RA+++W  L P +  +   D +   +
Subjt:  QEASLSESGKEASWWKTVWKL-------------RNHHVPVNM-----------GCLVCHKEMETTDHALFRCPRAQEVWEVLVPTI-MMDHWDQLDIKD

Query:  RWLSLVDNLK-QTLERISVGAWAIWNDRNSLHHNRQIPSPSVRGEWVQEYLADYCSRNSSGSS
         W SL + L+ + L   ++  W IWNDRNSL H +Q+     + EW+  +L  +     S  S
Subjt:  RWLSLVDNLK-QTLERISVGAWAIWNDRNSLHHNRQIPSPSVRGEWVQEYLADYCSRNSSGSS

XP_030479133.1 uncharacterized protein LOC115696372 [Cannabis sativa]8.6e-2729.21Show/hide
Query:  MGNGQSIYMFKDPWIPRPITFKVVSHFDPSMVDVKVANFITPTVQWDEAKLQQYVVWEDVEVIKRLPIS-MSTPDKWIWHYESRGDYFVKSGYKHSMMNR
        +G G +I    D WIP    FK   +         VA++IT T +W+   LQ      DV+ I ++P+S +   D+WIWHYE  GDY V SGY  +    
Subjt:  MGNGQSIYMFKDPWIPRPITFKVVSHFDPSMVDVKVANFITPTVQWDEAKLQQYVVWEDVEVIKRLPIS-MSTPDKWIWHYESRGDYFVKSGYKHSMMNR

Query:  QEASLSESGKEASWWKTVWKLR-------------NHHVPV-----------NMGCLVCHKEMETTDHALFRCPRAQEVWEVLVPTIMMDHWDQLDIKDR
        +E   S S  + +WWK+ WKL                 +PV           +  C +C    E+  HALF C  A+EVW+    +I   + D+L   D 
Subjt:  QEASLSESGKEASWWKTVWKLR-------------NHHVPV-----------NMGCLVCHKEMETTDHALFRCPRAQEVWEVLVPTIMMDHWDQLDIKDR

Query:  WLSLVDNL-KQTLERISVGAWAIWNDRNSLHHNRQIPSPSVRGEWVQEYLADYCSRNSSGSSFVQSEEDVHKIIS----GGEDIIMHIDASFMDEKSNCG
         + L     K   E I    W IW+DRN+  H +++ +P         Y+  Y S  S+ +    +      + S          +++DA+    +S  G
Subjt:  WLSLVDNL-KQTLERISVGAWAIWNDRNSLHHNRQIPSPSVRGEWVQEYLADYCSRNSSGSSFVQSEEDVHKIIS----GGEDIIMHIDASFMDEKSNCG

Query:  IGVVMCNNQGHLKAA
        IGV++ N+ G +KAA
Subjt:  IGVVMCNNQGHLKAA

XP_030497600.1 uncharacterized protein LOC115713257 [Cannabis sativa]2.5e-2629.15Show/hide
Query:  MGNGQSIYMFKDPWIPRPITFKVVSHFDPSMVDVKVANFITPTVQWDEAKLQQYVVWEDVEVIKRLPISM-STPDKWIWHYESRGDYFVKSGYKHSMMNR
        +G+G  +    D WIP    FK +  F  S  ++ VA++IT T +WD   L       D++ I  +P+S  ST D+W WHY+S GDY VKSGY  +    
Subjt:  MGNGQSIYMFKDPWIPRPITFKVVSHFDPSMVDVKVANFITPTVQWDEAKLQQYVVWEDVEVIKRLPISM-STPDKWIWHYESRGDYFVKSGYKHSMMNR

Query:  QEASLSESGKEASWWKTVWKLR-------------NHHVPV-----------NMGCLVCHKEMETTDHALFRCPRAQEVWEVLVPTIMMDHWDQLDIKD-
         +   S S  + +WW+  W L              N  +PV           +  C +C +  E+  HALF C  A+ VW+    +  +D      +KD 
Subjt:  QEASLSESGKEASWWKTVWKLR-------------NHHVPV-----------NMGCLVCHKEMETTDHALFRCPRAQEVWEVLVPTIMMDHWDQLDIKD-

Query:  RWLSLVDNL--KQTLERISVGAWAIWNDRNSLHHNRQIPSPSVRGEWVQEYLADYCSRNSSGS---SFVQSEEDVHKIISGGE-DIIMHIDASFMDEKSN
         +L  +  +  K  LE++    W IW+DRN+  H +Q+  P       + YLA++ S  S+ +   S V ++    K +   E ++ M++DA+    ++ 
Subjt:  RWLSLVDNL--KQTLERISVGAWAIWNDRNSLHHNRQIPSPSVRGEWVQEYLADYCSRNSSGS---SFVQSEEDVHKIISGGE-DIIMHIDASFMDEKSN

Query:  CGIGVVMCNNQGHLKAAQT
         GIGV++ ++ G + AA +
Subjt:  CGIGVVMCNNQGHLKAAQT

XP_030502555.1 uncharacterized protein LOC115717715 [Cannabis sativa]8.0e-2528.21Show/hide
Query:  MGNGQSIYMFKDPWIPRPITFKVVSHFDPSMVDVKVANFITPTVQWDEAKLQQYVVWEDVEVIKRLPISM-STPDKWIWHYESRGDYFVKSGYKHSMMNR
        +G G ++    D WIP    FK      PS  +  VA++IT   +W+   L +     DVE I  +P+S  S  D WIWHYE  G+Y VKS Y  +    
Subjt:  MGNGQSIYMFKDPWIPRPITFKVVSHFDPSMVDVKVANFITPTVQWDEAKLQQYVVWEDVEVIKRLPISM-STPDKWIWHYESRGDYFVKSGYKHSMMNR

Query:  QEASLSESGKEASWWKTV-------------WKLRNHHVPV-----------NMGCLVCHKEMETTDHALFRCPRAQEVWEVLVPTIMMDHWDQLDIKD-
             S SG + +WWK               WK+ N  +PV           +  C +C +  E+  HALF C  A+ VW+       +D      +KD 
Subjt:  QEASLSESGKEASWWKTV-------------WKLRNHHVPV-----------NMGCLVCHKEMETTDHALFRCPRAQEVWEVLVPTIMMDHWDQLDIKD-

Query:  RWLSLVDNL--KQTLERISVGAWAIWNDRNSLHHNRQIPSPSVRGEWVQEYLADYCS-RNSSGSSFVQSEEDVHKI---ISGGEDIIMHIDASFMDEKSN
         +L  +  +     LER+    W IW+DRN+  H +Q+  P       + YLA++ S +  +  +  +   DV+++         + M++DA+    +S 
Subjt:  RWLSLVDNL--KQTLERISVGAWAIWNDRNSLHHNRQIPSPSVRGEWVQEYLADYCS-RNSSGSSFVQSEEDVHKI---ISGGEDIIMHIDASFMDEKSN

Query:  CGIGVVMCNNQGHLKAAQT
         G+GV++ ++ G + AA +
Subjt:  CGIGVVMCNNQGHLKAAQT

TrEMBL top hitse value%identityAlignment
A0A6J1DX30 uncharacterized protein LOC1110248746.4e-3631.94Show/hide
Query:  MGNGQSIYMFKDPWIPRPITFKVVSHFDPSMVDVKVANFITPTVQWDEAKLQQYVVWEDVEVIKRLPI-SMSTPDKWIWHYESRGDYFVKSGYKHSMMNR
        +GNG +I  F DPW+PRP TFK +  F+   +D  VA+FIT    WD   +      ED ++I  +PI S +  D W+WHY+ RG+Y V+SGYK  M  +
Subjt:  MGNGQSIYMFKDPWIPRPITFKVVSHFDPSMVDVKVANFITPTVQWDEAKLQQYVVWEDVEVIKRLPI-SMSTPDKWIWHYESRGDYFVKSGYKHSMMNR

Query:  QEASLSESGKEASWWKTVWKL-------------RNHHVPVNM-----------GCLVCHKEMETTDHALFRCPRAQEVWEVLVPTI-MMDHWDQLDIKD
          A+ + +    + W ++WKL              + H+P               C +C    E+  HA F C RA+++W  L P +  +   D +   +
Subjt:  QEASLSESGKEASWWKTVWKL-------------RNHHVPVNM-----------GCLVCHKEMETTDHALFRCPRAQEVWEVLVPTI-MMDHWDQLDIKD

Query:  RWLSLVDNLK-QTLERISVGAWAIWNDRNSLHHNRQIPSPSVRGEWVQEYLADYCSRNSSGSS
         W SL + L+ + L   ++  W IWNDRNSL H +Q+     + EW+  +L  +     S  S
Subjt:  RWLSLVDNLK-QTLERISVGAWAIWNDRNSLHHNRQIPSPSVRGEWVQEYLADYCSRNSSGSS

A0A803PKJ2 Uncharacterized protein5.4e-2729.32Show/hide
Query:  MGNGQSIYMFKDPWIPRPITFKVVSHFDPSMVDVKVANFITPTVQWDEAKLQQYVVWEDVEVIKRLPISMST-PDKWIWHYESRGDYFVKSGYKHSMMNR
        +G G ++    D WIP    FK      PS  +  VA +IT   +W+   L +     DVE I  +P+S S+  D WIWHY+  G+Y VKSGY  +    
Subjt:  MGNGQSIYMFKDPWIPRPITFKVVSHFDPSMVDVKVANFITPTVQWDEAKLQQYVVWEDVEVIKRLPISMST-PDKWIWHYESRGDYFVKSGYKHSMMNR

Query:  QEASLSESGKEASWWKTV-------------WKLRNHHVPV-----------NMGCLVCHKEMETTDHALFRCPRAQEVWEVLVPTIMMDHWDQLDIKD-
             S SG + +WWK               WK+ N  +PV           +  C +C +  E+  HALF C  A+ VW+    +  +D      +KD 
Subjt:  QEASLSESGKEASWWKTV-------------WKLRNHHVPV-----------NMGCLVCHKEMETTDHALFRCPRAQEVWEVLVPTIMMDHWDQLDIKD-

Query:  RWLSLVDNL--KQTLERISVGAWAIWNDRNSLHHNRQIPSPSVRGEWVQEYLADYCS---RNSSGSSFVQSEEDVHKIISGGE-DIIMHIDASFMDEKSN
         +L  +  +  K  LER+    W IW+DRN+  H +Q+  P       + YLA++ S     +  +  V ++ +  K I   E  + M++DA+    +S 
Subjt:  RWLSLVDNL--KQTLERISVGAWAIWNDRNSLHHNRQIPSPSVRGEWVQEYLADYCS---RNSSGSSFVQSEEDVHKIISGGE-DIIMHIDASFMDEKSN

Query:  CGIGVVMCNNQGHLKAAQTLYTNG
         GIGV++ ++ G +  A +  T G
Subjt:  CGIGVVMCNNQGHLKAAQTLYTNG

A0A803Q185 Uncharacterized protein1.9e-2727.39Show/hide
Query:  MGNGQSIYMFKDPWIPRPITFKVVSHFDPSMVDVKVANFITPTVQWDEAKLQQYVVWEDVEVIKRLPIS-MSTPDKWIWHYESRGDYFVKSGYKHSMMNR
        +G+G +I + +DPW+PRP+TFK+     P    + V +       WD+  +      +D E+I  LP S     DK +WHY   G+Y VKSGY+ +   +
Subjt:  MGNGQSIYMFKDPWIPRPITFKVVSHFDPSMVDVKVANFITPTVQWDEAKLQQYVVWEDVEVIKRLPIS-MSTPDKWIWHYESRGDYFVKSGYKHSMMNR

Query:  QEASLSESGKEASWWKTVWKLR------------------------NHHVPVNMGCLVCH-KEMETTDHALFRCPRAQEVWEVLVPTIMMDHWDQLDIKD
         E   S+      WW+ +W+L+                          HV    GC  C     ET  HAL+ C +++  W++    + +    Q D   
Subjt:  QEASLSESGKEASWWKTVWKLR------------------------NHHVPVNMGCLVCH-KEMETTDHALFRCPRAQEVWEVLVPTIMMDHWDQLDIKD

Query:  RWLSLVDNL-KQTLERISVGAWAIWNDRNSLHHNRQIPSPSVRGEWVQEYLADYCSRNSSGSSFVQSEEDVHKIISGGEDIIMHIDASFMDEKSNCGIGV
          + L   + K   E   V  W++WN RN+  H+  +P P+   EW  + L D+    S     ++ EE V K+   GE + +++DAS        G+G 
Subjt:  RWLSLVDNL-KQTLERISVGAWAIWNDRNSLHHNRQIPSPSVRGEWVQEYLADYCSRNSSGSSFVQSEEDVHKIISGGEDIIMHIDASFMDEKSNCGIGV

Query:  VMCNNQGHLKAAQT
        V+   QG +  A +
Subjt:  VMCNNQGHLKAAQT

A0A803Q8J4 Uncharacterized protein1.5e-2929.57Show/hide
Query:  MGNGQSIYMFKDPWIPRPITFKVVSHF-DPSMVDVKVANFITPTVQWDEAKLQQYVVWEDVEVIKRLPISMST-PDKWIWHYESRGDYFVKSGYKHSMMN
        +GNG SI    DPWIP  + F  + +  +P+ V   VA++ITP  +W+ +KL       DV  I  LP+S +  PD WIWH  + G+Y VKSGY  +  +
Subjt:  MGNGQSIYMFKDPWIPRPITFKVVSHF-DPSMVDVKVANFITPTVQWDEAKLQQYVVWEDVEVIKRLPISMST-PDKWIWHYESRGDYFVKSGYKHSMMN

Query:  RQEASLSESGKEASWWKTVWKLR-------------NHHVPV-----------NMGCLVCHKEMETTDHALFRCPRAQEVWEVLVPTIMMDHWDQLDIKD
          + + S S    +WWK+ W+L+             ++ +PV           +  C +C    E+  HALF C  A+ VW+V   T        ++I+D
Subjt:  RQEASLSESGKEASWWKTVWKLR-------------NHHVPV-----------NMGCLVCHKEMETTDHALFRCPRAQEVWEVLVPTIMMDHWDQLDIKD

Query:  RWLSLVDN-LKQTLERISVGAWAIWNDRNSLHHNRQIPSPSVRGEWVQEYLADYCSRNSSGSSFVQSEEDVHKIISGGED-----IIMHIDASFMDEKSN
            + +N  K  LE I    W+IW+DRN++ H +    P+V     Q +L +Y S          S      I           + +++DA+F +  + 
Subjt:  RWLSLVDN-LKQTLERISVGAWAIWNDRNSLHHNRQIPSPSVRGEWVQEYLADYCSRNSSGSSFVQSEEDVHKIISGGED-----IIMHIDASFMDEKSN

Query:  CGIGVVMCNNQGHLKAAQTLYTNGWPLP
         G G ++ ++ G++KAA +   NG  LP
Subjt:  CGIGVVMCNNQGHLKAAQTLYTNGWPLP

A0A803QGT2 Uncharacterized protein4.1e-2729.21Show/hide
Query:  MGNGQSIYMFKDPWIPRPITFKVVSHFDPSMVDVKVANFITPTVQWDEAKLQQYVVWEDVEVIKRLPIS-MSTPDKWIWHYESRGDYFVKSGYKHSMMNR
        +G G +I    D WIP    FK   +         VA++IT T +W+   LQ      DV+ I ++P+S +   D+WIWHYE  GDY V SGY  +    
Subjt:  MGNGQSIYMFKDPWIPRPITFKVVSHFDPSMVDVKVANFITPTVQWDEAKLQQYVVWEDVEVIKRLPIS-MSTPDKWIWHYESRGDYFVKSGYKHSMMNR

Query:  QEASLSESGKEASWWKTVWKLR-------------NHHVPV-----------NMGCLVCHKEMETTDHALFRCPRAQEVWEVLVPTIMMDHWDQLDIKDR
        +E   S S  + +WWK+ WKL                 +PV           +  C +C    E+  HALF C  A+EVW+    +I   + D+L   D 
Subjt:  QEASLSESGKEASWWKTVWKLR-------------NHHVPV-----------NMGCLVCHKEMETTDHALFRCPRAQEVWEVLVPTIMMDHWDQLDIKDR

Query:  WLSLVDNL-KQTLERISVGAWAIWNDRNSLHHNRQIPSPSVRGEWVQEYLADYCSRNSSGSSFVQSEEDVHKIIS----GGEDIIMHIDASFMDEKSNCG
         + L     K   E I    W IW+DRN+  H +++ +P         Y+  Y S  S+ +    +      + S          +++DA+    +S  G
Subjt:  WLSLVDNL-KQTLERISVGAWAIWNDRNSLHHNRQIPSPSVRGEWVQEYLADYCSRNSSGSSFVQSEEDVHKIIS----GGEDIIMHIDASFMDEKSNCG

Query:  IGVVMCNNQGHLKAA
        IGV++ N+ G +KAA
Subjt:  IGVVMCNNQGHLKAA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein3.4e-1326.73Show/hide
Query:  WDEAKLQQYVVWEDVEVIKRLPISMS-TPDKWIWHYESRGDYFVKSGY---KHSMMNRQEASLSESG----KEASW------------WK-------TVW
        WD++K+ Q+V   D   I R+ ++ S  PDK IW+Y + G+Y V+SGY    H       A     G    K   W            W+       T  
Subjt:  WDEAKLQQYVVWEDVEVIKRLPISMS-TPDKWIWHYESRGDYFVKSGY---KHSMMNRQEASLSESG----KEASW------------WK-------TVW

Query:  KLRNHHVPVNMGCLVCHKEMETTDHALFRCPRAQEVWEVLVPTIMMDHWDQLDIKDRWLSLVDNLKQTL-----ERISVG-AWAIWNDRNSLHHNRQIPS
        +L    + ++  C  CH+E E+ +HALF CP A   W +   +++ +     D ++   ++++ ++ T      + + V   W IW  RN++  N+   S
Subjt:  KLRNHHVPVNMGCLVCHKEMETTDHALFRCPRAQEVWEVLVPTIMMDHWDQLDIKDRWLSLVDNLKQTL-----ERISVG-AWAIWNDRNSLHHNRQIPS

Query:  PS
        PS
Subjt:  PS

AT4G29090.1 Ribonuclease H-like superfamily protein1.4e-1122.85Show/hide
Query:  MGNGQSIYMFKDPWI-PRPITFKVVSHFDPSMVDVKVANFITPTVQWDEA--KLQQYVVWEDVEVIKRLPISMSTP------DKWIWHYESRGDYFVKSG
        +GNG+ I +++  W+  +P +  +     P      V++ +  +   DE+  + ++ V+      ++R  I    P      D + W Y S GDY VKSG
Subjt:  MGNGQSIYMFKDPWI-PRPITFKVVSHFDPSMVDVKVANFITPTVQWDEA--KLQQYVVWEDVEVIKRLPISMSTP------DKWIWHYESRGDYFVKSG

Query:  Y---------KHSMMNRQEASLSESGKEASWWKT---------VWKLRNHHVPV-----------NMGCLVCHKEMETTDHALFRCPRAQEVWEV-LVPT
        Y         + S     E SL+   ++   WK+         +WK  ++ +PV              C+ C    ET +H LF+C  A+  W +  +P 
Subjt:  Y---------KHSMMNRQEASLSESGKEASWWKT---------VWKLRNHHVPV-----------NMGCLVCHKEMETTDHALFRCPRAQEVWEV-LVPT

Query:  IMMDHW-DQLDIKDRWLSLVDNLKQTLERISVGA----WAIWNDRNSL-HHNRQIPSPSV-------RGEWVQEYLADYCSRNS--SGSSFVQSEEDVHK
         +   W D + +   W+  + N     E+ S       W +W +RN L    R+  +  V         EW     A+ C      + SS  +     H+
Subjt:  IMMDHW-DQLDIKDRWLSLVDNLKQTLERISVGA----WAIWNDRNSL-HHNRQIPSPSV-------RGEWVQEYLADYCSRNS--SGSSFVQSEEDVHK

Query:  IISGGEDIIMHIDASFMDEKSNCGIGVVMCNNQGHLK
         +        + DA++  +   CGIG V+ N +G +K
Subjt:  IISGGEDIIMHIDASFMDEKSNCGIGVVMCNNQGHLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAACGGTCAGTCAATTTATATGTTTAAAGATCCATGGATTCCTCGACCCATTACTTTCAAGGTTGTTTCTCATTTTGATCCAAGTATGGTGGATGTGAAAGTGGC
GAATTTTATTACTCCAACTGTGCAATGGGATGAGGCGAAGCTTCAACAATATGTGGTGTGGGAGGATGTGGAAGTGATTAAACGCTTGCCTATCAGTATGTCTACCCCAG
ACAAATGGATTTGGCACTATGAAAGCAGAGGTGATTACTTTGTTAAGAGTGGATACAAACACAGTATGATGAATCGGCAAGAAGCATCATTGTCTGAGAGTGGAAAGGAA
GCTAGTTGGTGGAAAACAGTTTGGAAGTTGAGGAATCATCATGTACCAGTCAACATGGGTTGCCTAGTTTGTCACAAGGAGATGGAAACCACAGATCACGCCCTTTTCCG
GTGCCCCAGAGCTCAGGAGGTTTGGGAGGTTCTAGTGCCAACAATTATGATGGATCATTGGGATCAGTTAGACATCAAAGATCGTTGGTTGAGCTTAGTTGACAATCTGA
AACAAACATTAGAGCGCATTAGTGTAGGGGCCTGGGCAATTTGGAATGACAGAAATAGCCTGCATCATAATCGTCAAATTCCAAGTCCATCAGTTAGAGGTGAATGGGTC
CAGGAATATCTAGCAGATTACTGCTCGAGAAACTCGTCAGGTAGCTCCTTTGTCCAGTCGGAGGAGGATGTTCATAAAATCATCTCAGGAGGTGAAGATATCATTATGCA
CATTGATGCATCATTTATGGATGAAAAGTCTAATTGCGGTATTGGGGTAGTAATGTGTAATAATCAGGGTCATCTCAAGGCGGCACAGACTCTATATACAAATGGTTGGC
CACTCCCCTTTGGGAGCGGAAACTATAGCTGTTCTGGAAGGGATGCGATTAGCAAGGAATTTGGATGTGCGGAGATTAACAGTTATTTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAACGGTCAGTCAATTTATATGTTTAAAGATCCATGGATTCCTCGACCCATTACTTTCAAGGTTGTTTCTCATTTTGATCCAAGTATGGTGGATGTGAAAGTGGC
GAATTTTATTACTCCAACTGTGCAATGGGATGAGGCGAAGCTTCAACAATATGTGGTGTGGGAGGATGTGGAAGTGATTAAACGCTTGCCTATCAGTATGTCTACCCCAG
ACAAATGGATTTGGCACTATGAAAGCAGAGGTGATTACTTTGTTAAGAGTGGATACAAACACAGTATGATGAATCGGCAAGAAGCATCATTGTCTGAGAGTGGAAAGGAA
GCTAGTTGGTGGAAAACAGTTTGGAAGTTGAGGAATCATCATGTACCAGTCAACATGGGTTGCCTAGTTTGTCACAAGGAGATGGAAACCACAGATCACGCCCTTTTCCG
GTGCCCCAGAGCTCAGGAGGTTTGGGAGGTTCTAGTGCCAACAATTATGATGGATCATTGGGATCAGTTAGACATCAAAGATCGTTGGTTGAGCTTAGTTGACAATCTGA
AACAAACATTAGAGCGCATTAGTGTAGGGGCCTGGGCAATTTGGAATGACAGAAATAGCCTGCATCATAATCGTCAAATTCCAAGTCCATCAGTTAGAGGTGAATGGGTC
CAGGAATATCTAGCAGATTACTGCTCGAGAAACTCGTCAGGTAGCTCCTTTGTCCAGTCGGAGGAGGATGTTCATAAAATCATCTCAGGAGGTGAAGATATCATTATGCA
CATTGATGCATCATTTATGGATGAAAAGTCTAATTGCGGTATTGGGGTAGTAATGTGTAATAATCAGGGTCATCTCAAGGCGGCACAGACTCTATATACAAATGGTTGGC
CACTCCCCTTTGGGAGCGGAAACTATAGCTGTTCTGGAAGGGATGCGATTAGCAAGGAATTTGGATGTGCGGAGATTAACAGTTATTTCTAA
Protein sequenceShow/hide protein sequence
MGNGQSIYMFKDPWIPRPITFKVVSHFDPSMVDVKVANFITPTVQWDEAKLQQYVVWEDVEVIKRLPISMSTPDKWIWHYESRGDYFVKSGYKHSMMNRQEASLSESGKE
ASWWKTVWKLRNHHVPVNMGCLVCHKEMETTDHALFRCPRAQEVWEVLVPTIMMDHWDQLDIKDRWLSLVDNLKQTLERISVGAWAIWNDRNSLHHNRQIPSPSVRGEWV
QEYLADYCSRNSSGSSFVQSEEDVHKIISGGEDIIMHIDASFMDEKSNCGIGVVMCNNQGHLKAAQTLYTNGWPLPFGSGNYSCSGRDAISKEFGCAEINSYF