; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0006847 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0006847
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr6:46407866..46410721
RNA-Seq ExpressionLag0006847
SyntenyLag0006847
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4263564.1 unnamed protein product [Prunus armeniaca]1.3e-1526.5Show/hide
Query:  GLIKEIHRTMAKFWWNGSEDTRRIHWMSWESLCLPSAWVGWGFVTWNFLTSLLASSVG----VSSGSVFSFVFGLRGRYFPQSGFLEAGLGR--------
        GL +EI   +AKFWW+   D R IHW +W  +C   +  G GF         L    G        S+ + +F  + RYFP S FL A  G         
Subjt:  GLIKEIHRTMAKFWWNGSEDTRRIHWMSWESLCLPSAWVGWGFVTWNFLTSLLASSVG----VSSGSVFSFVFGLRGRYFPQSGFLEAGLGR--------

Query:  -----------------DRRLSG----------------AVPSLPATSTVSDLFAASGGWNEAVLRAHFDESDCEAILRIPL------------------
                         D RL                  ++P+LP TS V DLF ASGGW+   + A F   + EAIL IPL                  
Subjt:  -----------------DRRLSG----------------AVPSLPATSTVSDLFAASGGWNEAVLRAHFDESDCEAILRIPL------------------

Query:  -------------------RHGPTFPSNPDRMRAWWSGLWRLMCLDR-RHLFWNALWLRVCGCAPSLPSPSVLFPFRFEE-VIWAMKDNLPGPDFELVVI
                             G   PS+       W  LW+L    +  HL W             LPS  VLF  R  +  +W   D  P  +F L  +
Subjt:  -------------------RHGPTFPSNPDRMRAWWSGLWRLMCLDR-RHLFWNALWLRVCGCAPSLPSPSVLFPFRFEE-VIWAMKDNLPGPDFELVVI

Query:  ----FWCPSMWWEALPNGD---AYGQQGTMRTC--VWRPPAYWELKLNVDASVRPDSGELGVVVY-------CVG-LRVRGPACSSAGVCD---------
             W  ++W    P+     A+    +   C   WRPP     KLNVD +   ++G  G            VG L +R P+  S    +         
Subjt:  ----FWCPSMWWEALPNGD---AYGQQGTMRTC--VWRPPAYWELKLNVDASVRPDSGELGVVVY-------CVG-LRVRGPACSSAGVCD---------

Query:  FVVEPT---LEIGQNPSWWCAGCVGS--------GRLMDDVRMVLHPWDNSKVLFSPRQGNKVAHALA
        F ++ +   LEI ++ S      V S        G L+D VR +L    ++ V   PRQ NK AH +A
Subjt:  FVVEPT---LEIGQNPSWWCAGCVGS--------GRLMDDVRMVLHPWDNSKVLFSPRQGNKVAHALA

XP_010682933.1 PREDICTED: uncharacterized protein LOC104897695 [Beta vulgaris subsp. vulgaris]2.9e-1825.74Show/hide
Query:  KLDMSKAYDRVEW-----------------------LSSGDYA-------TIGVIPSRGLGRGIPFLRTCFCYVLRDCPAYCVVLSGDLRLRAFGWGALA
        KLDMSKAYDRVEW                       LSS  Y+          +IPSRGL +G P     F        A     +GD  +        A
Subjt:  KLDMSKAYDRVEW-----------------------LSSGDYA-------TIGVIPSRGLGRGIPFLRTCFCYVLRDCPAYCVVLSGDLRLRAFGWGALA

Query:  SDLTPLLCGRQSPFF-RANGSEASVIRDLLIWYEKASGQTVNYEKG------------------------------------------------------
          ++ L     S  F RA   E SV+ D+L  YE+ASGQ +N++K                                                       
Subjt:  SDLTPLLCGRQSPFF-RANGSEASVIRDLLIWYEKASGQTVNYEKG------------------------------------------------------

Query:  ----------------------------------------LIKEIHRTMAKFWWNGSEDTRRIHWMSWESLCLPSAWVGWGFVTWNFLTSLLASSVG---
                                                ++ EI+   A+FWW      RR+HW+SWE +CLP A+ G GF         L +  G   
Subjt:  ----------------------------------------LIKEIHRTMAKFWWNGSEDTRRIHWMSWESLCLPSAWVGWGFVTWNFLTSLLASSVG---

Query:  -VSSGSVFSFVFGLRGRYFPQSGFLEAGLGRD-----RRLSGA---------------------------------VPS----LPATSTVSDLFAASGGW
           +GS+   VF    RY+P+S FL A  G D     R + GA                                 VP+     PA   VSDL  ASG W
Subjt:  -VSSGSVFSFVFGLRGRYFPQSGFLEAGLGRD-----RRLSGA---------------------------------VPS----LPATSTVSDLFAASGGW

Query:  NEAVLRAHFDESDCEAILRIPLRHGPTFPSNPDRMRAWW
        +E VLR HF E D   I  IPL         P  ++ WW
Subjt:  NEAVLRAHFDESDCEAILRIPLRHGPTFPSNPDRMRAWW

XP_019157375.1 PREDICTED: uncharacterized protein LOC109153940 [Ipomoea nil]5.1e-1531.84Show/hide
Query:  GCVKLDMSKAYDRVEWLSSGDYATIGVIPSRGLG-RGIPFLRTCFCYVLRDCPAYCVVLSGD-----LRLRAFGWGALASDLTPLLCGR------QSPFF
        G +KLD++KAYDR+EW         G++ + G   R +  +  C   V     +Y  +L+G      +  R    G   S    ++C        Q    
Subjt:  GCVKLDMSKAYDRVEWLSSGDYATIGVIPSRGLG-RGIPFLRTCFCYVLRDCPAYCVVLSGD-----LRLRAFGWGALASDLTPLLCGR------QSPFF

Query:  RANGSEASVIRDLLIWYEKASGQTVNYEKGLI-KEIHRTMAKFWWNGSEDTRRIHWMSWESLCLPSAWVGWGFVTWN-FLTSLLASSVG---VSSGSVFS
         AN  EA+ +++ L  YE  SGQ VNY K  I   I RTM ++WW GS   R IHW +W+ LC+P  + G GF     F  ++L         ++ S+ S
Subjt:  RANGSEASVIRDLLIWYEKASGQTVNYEKGLI-KEIHRTMAKFWWNGSEDTRRIHWMSWESLCLPSAWVGWGFVTWN-FLTSLLASSVG---VSSGSVFS

Query:  FVFGLRGRYFPQSGFLEAGLGRD
         V+  + RY+P+S F EA +G +
Subjt:  FVFGLRGRYFPQSGFLEAGLGRD

XP_021714646.1 uncharacterized protein LOC110682599 [Chenopodium quinoa]4.9e-1825.6Show/hide
Query:  VKLDMSKAYDRVEW-----------------------LSSGDYATI-------GVIPSRGLGRGIPFLRTCFCYVLRDCPAYCVVLSGDLRLRAFGWGAL
        +KLDMSKAYDRVEW                       +S+  Y+ I        +IPSRGL +G P     F  V         + S    + +      
Subjt:  VKLDMSKAYDRVEW-----------------------LSSGDYATI-------GVIPSRGLGRGIPFLRTCFCYVLRDCPAYCVVLSGDLRLRAFGWGAL

Query:  ASDLTPLLCGRQSPFF-RANGSEASVIRDLLIWYEKASGQTVNYEK------------------------------------------------------
          D+T L     S  F RA   E SVI DLL  YE  SGQ +N EK                                                      
Subjt:  ASDLTPLLCGRQSPFF-RANGSEASVIRDLLIWYEKASGQTVNYEK------------------------------------------------------

Query:  ----------------------------------------GLIKEIHRTMAKFWWNGSEDTRRIHWMSWESLCLPSAWVGWGFVTWN-FLTSLLASSV--
                                                GLI+EIH  MA+FWW  +E  R+IHW SW++LC P    G GF     F  +LL   +  
Subjt:  ----------------------------------------GLIKEIHRTMAKFWWNGSEDTRRIHWMSWESLCLPSAWVGWGFVTWN-FLTSLLASSV--

Query:  -GVSSGSVFSFVFGLRGRYFPQSGFLEAGLG-----RDRRL--SGAVPSLPATSTVSDLFAASG-GWNEAVLRAHFDESDCEAILRIPLRHGPTFPSNPD
           + GS+ S V  L+ +Y+P +   EA LG       +RL  S A+ +     TV++L       W+ A L   F+E D +AIL IP+      P++  
Subjt:  -GVSSGSVFSFVFGLRGRYFPQSGFLEAGLG-----RDRRL--SGAVPSLPATSTVSDLFAASG-GWNEAVLRAHFDESDCEAILRIPLRHGPTFPSNPD

Query:  RMRAWWSGLWRLMCLDRRHLFWNALWLRVCGCAPSLPSPSVL-FPFRFEEVIWAMKDNLPGPDFELVVI------FWCPSMWWEAL--PNGDAYGQ
             W  +W+     +   F   LW   C C+ SLP+ S+L    + E+ +    ++       +V++       W  S  +EAL     D++G+
Subjt:  RMRAWWSGLWRLMCLDRRHLFWNALWLRVCGCAPSLPSPSVL-FPFRFEEVIWAMKDNLPGPDFELVVI------FWCPSMWWEAL--PNGDAYGQ

XP_024164458.1 uncharacterized protein LOC112171517 [Rosa chinensis]3.0e-1529.66Show/hide
Query:  VKLDMSKAYDRVEWLSSGDYATIGVIPSRGLGR-GIPFLRTCFC-----YVLRDCPAYCVVLSGDLRLRAFGWGALASDLTPLLCGRQSPFFRANGS--E
        +KLD+SKAYDR+E L         V+   G  R  I F+  C C     +++R  P   V  S  LR      G   S    LL         A  S   
Subjt:  VKLDMSKAYDRVEWLSSGDYATIGVIPSRGLGR-GIPFLRTCFC-----YVLRDCPAYCVVLSGDLRLRAFGWGALASDLTPLLCGRQSPFFRANGS--E

Query:  ASVIRDLLIWYEKASGQTVNYEKGLI--------------------------KEIHRTMAKFWWNGSEDTRRIHWMSWESLCLPSAWVGWGFVTW-NFLT
           I+D++  Y +ASGQ VN++K  +                           ++ +  A+FWW  +ED R+IHW +W SLC P+     GF +   F  
Subjt:  ASVIRDLLIWYEKASGQTVNYEKGLI--------------------------KEIHRTMAKFWWNGSEDTRRIHWMSWESLCLPSAWVGWGFVTW-NFLT

Query:  SLLASSVGVSSGSVFSFVFGL-RGRYFPQSGF-LEAGLGRDRRLSG-AVPSLPATSTVSDLFAASGGWNEAVLRAHFDESDCEAILRIPL
        ++LA      +    S +  + + RYFP S F L   LG    +    +P +    T+++L    G W+E  +RA F +   EAIL IPL
Subjt:  SLLASSVGVSSGSVFSFVFGL-RGRYFPQSGF-LEAGLGRDRRLSG-AVPSLPATSTVSDLFAASGGWNEAVLRAHFDESDCEAILRIPL

TrEMBL top hitse value%identityAlignment
A0A2N9FZG8 Reverse transcriptase domain-containing protein1.6e-1726.49Show/hide
Query:  VKLDMSKAYDRVEW-----------------------LSSGDYATI-------GVIPSRGLGRGIPFLRTCFCYVLRDCPAYCVVLSGDLRLRAFGWGAL
        +KLDMSKAYDRVEW                       + +  +A +        + PSRG+ +G P     F        A       D RLR       
Subjt:  VKLDMSKAYDRVEW-----------------------LSSGDYATI-------GVIPSRGLGRGIPFLRTCFCYVLRDCPAYCVVLSGDLRLRAFGWGAL

Query:  ASDLTPLLCGRQSPFF-RANGSEASVIRDLLIWYEKASGQTVNYE-----------------------------------KGLIKEIHRTMAKFWWNGSE
           L+ LL    S  F  A   E+  + ++L  YE++S Q   YE                                   KGL  +I+   AKFWW  S 
Subjt:  ASDLTPLLCGRQSPFF-RANGSEASVIRDLLIWYEKASGQTVNYE-----------------------------------KGLIKEIHRTMAKFWWNGSE

Query:  DTRRIHWMSWESLCLPSAWVGWGFVTWN-FLTSLLASSVGVSSGSVFSFVFGL-RGRYFPQSGFLEAGLGR--------DRRLSGAVPSLPATSTVSDLF
          R++HW +W  LC      G GF  +N F  +LLA        +  S VF + + RYFP+  F++A +G+           +S   P+      V DL 
Subjt:  DTRRIHWMSWESLCLPSAWVGWGFVTWN-FLTSLLASSVGVSSGSVFSFVFGL-RGRYFPQSGFLEAGLGR--------DRRLSGAVPSLPATSTVSDLF

Query:  AAS-GGWNEAVLRAHFDESDCE-----AILRIP-LRHGPTFPSNPDRMRAWWSGLWRLMCLDR-RHLFWNALWLRVCGCAPSLPS
              WNEA++ + FD +  E     A  +   +R      S+P+    +W  LWRL    + +H  W A       C  SLP+
Subjt:  AAS-GGWNEAVLRAHFDESDCE-----AILRIP-LRHGPTFPSNPDRMRAWWSGLWRLMCLDR-RHLFWNALWLRVCGCAPSLPS

A0A4D8ZIP5 Reverse transcriptase domain-containing protein5.7e-1223.56Show/hide
Query:  FFRANGSEASVIRDLLIWYEKASGQTVNYEK----------------------GLIKEIHRTMAKFWWNGSEDTRRIHWMSWESLCLPSAWVGWGFVTWN
        FFRAN  E +V++  +I YE+A GQ +NY K                       +  ++ + +  +WW  +E+ ++IH  +W+ LC+P    G GF    
Subjt:  FFRANGSEASVIRDLLIWYEKASGQTVNYEK----------------------GLIKEIHRTMAKFWWNGSEDTRRIHWMSWESLCLPSAWVGWGFVTWN

Query:  FLTSLLASSVG-------VSSGSVFSFVFGLRGRYFPQSGFLEAGL----------------------------GRDRRLSG----AVPSLPATSTVSD-
         L     S++G        +  S+   V  L+ R FP+ G L+AG                             GR+  + G    +V  L   S   D 
Subjt:  FLTSLLASSVG-------VSSGSVFSFVFGLRGRYFPQSGFLEAGL----------------------------GRDRRLSG----AVPSLPATSTVSD-

Query:  ---------LFAASGGWNEAVLRAHFDESDCEAILRIPLRHGPTFPSNPDRMRAWWSGLWRLMCLDRRHLFWNALWLRVCGCAPSLPSPSVLFPFRFEEV
                  F  +  WNEAV+RA F+E + E +L I L       S+ +    W+         +R+ L   A  +  C          V  P    + 
Subjt:  ---------LFAASGGWNEAVLRAHFDESDCEAILRIPLRHGPTFPSNPDRMRAWWSGLWRLMCLDRRHLFWNALWLRVCGCAPSLPSPSVLFPFRFEEV

Query:  IWAMKDNLPGPDFELVV------IFWCPSMWWEALPNGDAYGQQGTMR-TCVWRPPAYWELKLNVDASVRPDSGEL--GVVVYCVGLRVRGPACSSAG-V
         W  +D        L++         C + W EA    D   + G +     W  P    LK N+DA+V     ++  G VV      V    C   G V
Subjt:  IWAMKDNLPGPDFELVV------IFWCPSMWWEALPNGDAYGQQGTMR-TCVWRPPAYWELKLNVDASVRPDSGEL--GVVVYCVGLRVRGPACSSAG-V

Query:  CDFVVEPTLEIGQNPS
         D  +   L + + PS
Subjt:  CDFVVEPTLEIGQNPS

A0A5B6UZH6 Reverse transcriptase8.0e-1430.83Show/hide
Query:  VKLDMSKAYDRVEW-----------------------LSSGDYATI-------GVIPSRGLGRGIPFLRTCFCYVLRDCPAYCVVLSGDLRLRAFGWGAL
        VKLDMSKAYDRVEW                       ++S  YA            PSRGL +G P     F        A       +  +R       
Subjt:  VKLDMSKAYDRVEW-----------------------LSSGDYATI-------GVIPSRGLGRGIPFLRTCFCYVLRDCPAYCVVLSGDLRLRAFGWGAL

Query:  ASDLTPLLCGRQSPFF-RANGSEASVIRDLLIWYEKASGQTVNYEKGLI------KEIHRTMAKFWWNGSEDTRRIHWMSWESLCLPSAWVGWGFVTW-N
          +++ LL       F  A    A +++DLL  YE+ SGQ VNY+K LI       EI    A+FWW      R +HW  W  LC      G GF     
Subjt:  ASDLTPLLCGRQSPFF-RANGSEASVIRDLLIWYEKASGQTVNYEKGLI------KEIHRTMAKFWWNGSEDTRRIHWMSWESLCLPSAWVGWGFVTW-N

Query:  FLTSLLASSVGVSSGSVFSFVFGL-RGRYFPQSGFLEAGL
        F  SLLA          +S V  + + +YFP++ FL++ L
Subjt:  FLTSLLASSVGVSSGSVFSFVFGL-RGRYFPQSGFLEAGL

A0A6J5TIF9 Reverse transcriptase domain-containing protein6.5e-1626.5Show/hide
Query:  GLIKEIHRTMAKFWWNGSEDTRRIHWMSWESLCLPSAWVGWGFVTWNFLTSLLASSVG----VSSGSVFSFVFGLRGRYFPQSGFLEAGLGR--------
        GL +EI   +AKFWW+   D R IHW +W  +C   +  G GF         L    G        S+ + +F  + RYFP S FL A  G         
Subjt:  GLIKEIHRTMAKFWWNGSEDTRRIHWMSWESLCLPSAWVGWGFVTWNFLTSLLASSVG----VSSGSVFSFVFGLRGRYFPQSGFLEAGLGR--------

Query:  -----------------DRRLSG----------------AVPSLPATSTVSDLFAASGGWNEAVLRAHFDESDCEAILRIPL------------------
                         D RL                  ++P+LP TS V DLF ASGGW+   + A F   + EAIL IPL                  
Subjt:  -----------------DRRLSG----------------AVPSLPATSTVSDLFAASGGWNEAVLRAHFDESDCEAILRIPL------------------

Query:  -------------------RHGPTFPSNPDRMRAWWSGLWRLMCLDR-RHLFWNALWLRVCGCAPSLPSPSVLFPFRFEE-VIWAMKDNLPGPDFELVVI
                             G   PS+       W  LW+L    +  HL W             LPS  VLF  R  +  +W   D  P  +F L  +
Subjt:  -------------------RHGPTFPSNPDRMRAWWSGLWRLMCLDR-RHLFWNALWLRVCGCAPSLPSPSVLFPFRFEE-VIWAMKDNLPGPDFELVVI

Query:  ----FWCPSMWWEALPNGD---AYGQQGTMRTC--VWRPPAYWELKLNVDASVRPDSGELGVVVY-------CVG-LRVRGPACSSAGVCD---------
             W  ++W    P+     A+    +   C   WRPP     KLNVD +   ++G  G            VG L +R P+  S    +         
Subjt:  ----FWCPSMWWEALPNGD---AYGQQGTMRTC--VWRPPAYWELKLNVDASVRPDSGELGVVVY-------CVG-LRVRGPACSSAGVCD---------

Query:  FVVEPT---LEIGQNPSWWCAGCVGS--------GRLMDDVRMVLHPWDNSKVLFSPRQGNKVAHALA
        F ++ +   LEI ++ S      V S        G L+D VR +L    ++ V   PRQ NK AH +A
Subjt:  FVVEPT---LEIGQNPSWWCAGCVGS--------GRLMDDVRMVLHPWDNSKVLFSPRQGNKVAHALA

A0A6J5XES2 Uncharacterized protein1.0e-1335.86Show/hide
Query:  KGLIKEIHRTMAKFWWNGSEDTRRIHWMSWESLCLPSAWVGWGFVTWNFLTSLLASSVGVSSGSVFSFVFGLRGRYFPQSGFLEAGLGRDRRLS------
        KGL KE+H  MA+FWW  ++D R IHW+ WE LC+                            S+ + +F  R RY+P   FLEA +  D+ L       
Subjt:  KGLIKEIHRTMAKFWWNGSEDTRRIHWMSWESLCLPSAWVGWGFVTWNFLTSLLASSVGVSSGSVFSFVFGLRGRYFPQSGFLEAGLGRDRRLS------

Query:  -GAVPSLPATSTVSDLFAASGGWNEAVLRAHFDESDCEAILRIPL
          + P LP ++ V DLF +SG WN  +L+  F + + +AILRIPL
Subjt:  -GAVPSLPATSTVSDLFAASGGWNEAVLRAHFDESDCEAILRIPL

SwissProt top hitse value%identityAlignment
P93295 Uncharacterized mitochondrial protein AtMg003105.5e-0429.67Show/hide
Query:  KGLIKEIHRTMAKFWWNGSEDTRRIHWMSWESLCL---PSAWVGWGFVTWNFLTSLLASSVGVSSGSVFSFVFGLRGRYFPQSGFLEAGLG
        K L K++   M +FWW+  E+ R+I W++W+ LC        +G+  + W     L   S  +           LR RYFP S  +E  +G
Subjt:  KGLIKEIHRTMAKFWWNGSEDTRRIHWMSWESLCL---PSAWVGWGFVTWNFLTSLLASSVGVSSGSVFSFVFGLRGRYFPQSGFLEAGLG

Arabidopsis top hitse value%identityAlignment
ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.9e-0529.67Show/hide
Query:  KGLIKEIHRTMAKFWWNGSEDTRRIHWMSWESLCL---PSAWVGWGFVTWNFLTSLLASSVGVSSGSVFSFVFGLRGRYFPQSGFLEAGLG
        K L K++   M +FWW+  E+ R+I W++W+ LC        +G+  + W     L   S  +           LR RYFP S  +E  +G
Subjt:  KGLIKEIHRTMAKFWWNGSEDTRRIHWMSWESLCL---PSAWVGWGFVTWNFLTSLLASSVGVSSGSVFSFVFGLRGRYFPQSGFLEAGLG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCTGCGTGAAGCTTGATATGAGCAAGGCATATGATAGGGTGGAATGGCTTTCCTCCGGAGATTATGCTACGATTGGGGTGATACCGTCTCGAGGTCTCGGCAGGGG
GATCCCCTTTCTCCGTACTTGTTTCTGTTATGTGCTGAGGGACTGTCCAGCCTATTGCGTGGTGCTGAGCGGAGATCTCAGATTACGGGCTTTCGGGTGGGGCGCTCTAG
CCAGCGATCTCACACCTCTTCTTTGCGGACGACAGTCTCCTTTTTTCAGGGCCAATGGGAGTGAAGCGTCGGTTATTCGGGATCTGTTGATATGGTATGAGAAAGCTTCA
GGACAGACTGTCAACTATGAGAAAGGTCTGATCAAAGAGATCCACAGGACTATGGCTAAATTTTGGTGGAATGGGTCCGAGGATACAAGGCGAATTCATTGGATGAGTTG
GGAGTCGTTGTGCCTCCCAAGTGCATGGGTGGGTTGGGGTTTCGTGACATGGAACTTTTTAACAAGCCTGTTGGCAAGCAGTGTTGGCGTGTCTTCAGGATCCGTCTTCT
CTTTTGTGTTCGGTCTGAGGGGCCGTTATTTTCCCCAGTCAGGTTTCTTGGAGGCAGGTCTTGGTCGCGACCGTCGTTTGTCTGGCGCAGTCCCGTCGCTTCCTGCTACT
AGTACGGTTAGTGATCTATTTGCTGCATCTGGGGGATGGAACGAGGCTGTGCTCAGAGCCCATTTTGATGAGTCGGACTGTGAGGCCATCTTGAGAATCCCATTACGGCA
TGGACCGACCTTCCCTTCGAATCCTGACAGGATGCGTGCGTGGTGGTCCGGCCTTTGGAGGCTAATGTGCCTAGACCGTCGGCATCTGTTCTGGAATGCCCTGTGGTTGA
GAGTATGTGGTTGTGCTCCAAGTTTGCCCTCTCCATCGGTCCTTTTCCCATTCCGGTTCGAGGAAGTCATTTGGGCGATGAAGGATAATCTTCCAGGGCCGGATTTCGAG
CTTGTGGTCATTTTCTGGTGTCCTTCCATGTGGTGGGAGGCGTTGCCTAACGGGGATGCTTACGGCCAGCAAGGGACAATGAGAACGTGTGTGTGGAGGCCGCCAGCCTA
CTGGGAGCTGAAGCTTAATGTTGATGCCTCTGTCAGGCCTGATTCAGGGGAGCTAGGGGTGGTTGTGTACTGCGTGGGGCTGAGGGTGAGGGGTCCAGCTTGCTCGTCAG
CTGGGGTTTGTGATTTTGTAGTGGAACCGACTCTTGAGATTGGTCAAAATCCTTCATGGTGGTGTGCAGGATGTGTCGGAAGTGGGCGGTTGATGGATGACGTCCGAATG
GTCCTCCATCCTTGGGACAACAGCAAGGTTTTGTTTTCGCCACGCCAGGGAAACAAGGTGGCACATGCTCTGGCTAGCTTGGCCTTTTCTTATGTTGACCGTTGTGTTGT
TCCTTTGCCTGGGATTGAGGTGTTAGCTGGTGGCATATTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCTGCGTGAAGCTTGATATGAGCAAGGCATATGATAGGGTGGAATGGCTTTCCTCCGGAGATTATGCTACGATTGGGGTGATACCGTCTCGAGGTCTCGGCAGGGG
GATCCCCTTTCTCCGTACTTGTTTCTGTTATGTGCTGAGGGACTGTCCAGCCTATTGCGTGGTGCTGAGCGGAGATCTCAGATTACGGGCTTTCGGGTGGGGCGCTCTAG
CCAGCGATCTCACACCTCTTCTTTGCGGACGACAGTCTCCTTTTTTCAGGGCCAATGGGAGTGAAGCGTCGGTTATTCGGGATCTGTTGATATGGTATGAGAAAGCTTCA
GGACAGACTGTCAACTATGAGAAAGGTCTGATCAAAGAGATCCACAGGACTATGGCTAAATTTTGGTGGAATGGGTCCGAGGATACAAGGCGAATTCATTGGATGAGTTG
GGAGTCGTTGTGCCTCCCAAGTGCATGGGTGGGTTGGGGTTTCGTGACATGGAACTTTTTAACAAGCCTGTTGGCAAGCAGTGTTGGCGTGTCTTCAGGATCCGTCTTCT
CTTTTGTGTTCGGTCTGAGGGGCCGTTATTTTCCCCAGTCAGGTTTCTTGGAGGCAGGTCTTGGTCGCGACCGTCGTTTGTCTGGCGCAGTCCCGTCGCTTCCTGCTACT
AGTACGGTTAGTGATCTATTTGCTGCATCTGGGGGATGGAACGAGGCTGTGCTCAGAGCCCATTTTGATGAGTCGGACTGTGAGGCCATCTTGAGAATCCCATTACGGCA
TGGACCGACCTTCCCTTCGAATCCTGACAGGATGCGTGCGTGGTGGTCCGGCCTTTGGAGGCTAATGTGCCTAGACCGTCGGCATCTGTTCTGGAATGCCCTGTGGTTGA
GAGTATGTGGTTGTGCTCCAAGTTTGCCCTCTCCATCGGTCCTTTTCCCATTCCGGTTCGAGGAAGTCATTTGGGCGATGAAGGATAATCTTCCAGGGCCGGATTTCGAG
CTTGTGGTCATTTTCTGGTGTCCTTCCATGTGGTGGGAGGCGTTGCCTAACGGGGATGCTTACGGCCAGCAAGGGACAATGAGAACGTGTGTGTGGAGGCCGCCAGCCTA
CTGGGAGCTGAAGCTTAATGTTGATGCCTCTGTCAGGCCTGATTCAGGGGAGCTAGGGGTGGTTGTGTACTGCGTGGGGCTGAGGGTGAGGGGTCCAGCTTGCTCGTCAG
CTGGGGTTTGTGATTTTGTAGTGGAACCGACTCTTGAGATTGGTCAAAATCCTTCATGGTGGTGTGCAGGATGTGTCGGAAGTGGGCGGTTGATGGATGACGTCCGAATG
GTCCTCCATCCTTGGGACAACAGCAAGGTTTTGTTTTCGCCACGCCAGGGAAACAAGGTGGCACATGCTCTGGCTAGCTTGGCCTTTTCTTATGTTGACCGTTGTGTTGT
TCCTTTGCCTGGGATTGAGGTGTTAGCTGGTGGCATATTCTGA
Protein sequenceShow/hide protein sequence
MGCVKLDMSKAYDRVEWLSSGDYATIGVIPSRGLGRGIPFLRTCFCYVLRDCPAYCVVLSGDLRLRAFGWGALASDLTPLLCGRQSPFFRANGSEASVIRDLLIWYEKAS
GQTVNYEKGLIKEIHRTMAKFWWNGSEDTRRIHWMSWESLCLPSAWVGWGFVTWNFLTSLLASSVGVSSGSVFSFVFGLRGRYFPQSGFLEAGLGRDRRLSGAVPSLPAT
STVSDLFAASGGWNEAVLRAHFDESDCEAILRIPLRHGPTFPSNPDRMRAWWSGLWRLMCLDRRHLFWNALWLRVCGCAPSLPSPSVLFPFRFEEVIWAMKDNLPGPDFE
LVVIFWCPSMWWEALPNGDAYGQQGTMRTCVWRPPAYWELKLNVDASVRPDSGELGVVVYCVGLRVRGPACSSAGVCDFVVEPTLEIGQNPSWWCAGCVGSGRLMDDVRM
VLHPWDNSKVLFSPRQGNKVAHALASLAFSYVDRCVVPLPGIEVLAGGIF