; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027686 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027686
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr8:3492469..3493881
RNA-Seq ExpressionLag0027686
SyntenyLag0027686
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022139684.1 uncharacterized protein LOC111010533 [Momordica charantia]3.5e-2635.94Show/hide
Query:  VGWYDFEEIVVLFWALWSRRNCELFQGQNAV-SDVGLWAAEYIRVFRDTRNRGCHARPVLS----SRSQIR-------WIPPVGLVFKVNIDAAFSTSSL
        + W DFEE+VV  W+LW+RRN  +F  +     D+  W + YI  F+ T      A   +S      SQI        W P    VFK+  DA+FS+   
Subjt:  VGWYDFEEIVVLFWALWSRRNCELFQGQNAV-SDVGLWAAEYIRVFRDTRNRGCHARPVLS----SRSQIR-------WIPPVGLVFKVNIDAAFSTSSL

Query:  RAGAG-IIVRDCAGRVLFFATRFYPWCYSSDLAEGFTTRDGFRLSRDLGFLPLQLEVDSKRVFQILSEEVDDLSELGLLLQEVRGFLSMVPANSASFTLR
         AG G II+RD  G+VL  AT++     S D AE     +G R++ + G  P+ LE DS R++ + + + + LS+ G +++ V+  L+     S SFT R
Subjt:  RAGAG-IIVRDCAGRVLFFATRFYPWCYSSDLAEGFTTRDGFRLSRDLGFLPLQLEVDSKRVFQILSEEVDDLSELGLLLQEVRGFLSMVPANSASFTLR

Query:  EGNQVAHLLAGYAYSRQ
         GN +AHLLA  A   Q
Subjt:  EGNQVAHLLAGYAYSRQ

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]5.1e-3029.93Show/hide
Query:  RGIDVPNFCHRCKRCYEDVFHVFWACKHTRLVWQSSRFWRLYSQWSSGSFLDLVHFLQMEVGWYDFEEIVVLFWALWSRRNCELF-QGQNAVSDVGL---
        RG+++ N C+ C R  ED  H+FW CK    +W +S+F +L        FL ++      +   DFEE+ V+ W LW++RN   F      V  +G+   
Subjt:  RGIDVPNFCHRCKRCYEDVFHVFWACKHTRLVWQSSRFWRLYSQWSSGSFLDLVHFLQMEVGWYDFEEIVVLFWALWSRRNCELF-QGQNAVSDVGL---

Query:  -WAAEYIRVFRDTRNRGCHARPVLSSRSQIRWIPPVGLVFKVNIDAAFSTSSLRAGAGIIVRDCAGRVLFFATRFYPWCYSSDLAEGFTTRDGFRLSRDL
         WA +Y   FR+ ++     R  +++ ++I W PP   ++K+N DA+F  S   AG GII+ +  G+V+  AT++     S D+AE     +G +L+ ++
Subjt:  -WAAEYIRVFRDTRNRGCHARPVLSSRSQIRWIPPVGLVFKVNIDAAFSTSSLRAGAGIIVRDCAGRVLFFATRFYPWCYSSDLAEGFTTRDGFRLSRDL

Query:  GFLPLQLEVDSKRVFQILSEEVDDLSELGLLLQEVRGFLSMVPANSASFTLREGNQVAHLLAGYAYSRQYDEEWMEAGHDFISSFVQFEFYEDV
        G  P                 ++DLSE G ++ + + F +     S +F  REGN+ AH+LA  A        WME     + S ++ E  E++
Subjt:  GFLPLQLEVDSKRVFQILSEEVDDLSELGLLLQEVRGFLSMVPANSASFTLREGNQVAHLLAGYAYSRQYDEEWMEAGHDFISSFVQFEFYEDV

XP_022150944.1 uncharacterized protein LOC111018973 [Momordica charantia]3.6e-3136.43Show/hide
Query:  RLVWQSSRFWR-LYSQW-SSGSFLDLVHFLQM---EVGWYDFEEIVVLFWALWSRRN--CELFQGQNAVSDVGLWAAEYIRVFRDTRNRGCHARPVLSSR
        R +WQ S+F   L+  W   GS  D+ HF+++    V W     IVVL WA+W+ RN   + F    ++SD+  W+  Y++V++  +     A  V    
Subjt:  RLVWQSSRFWR-LYSQW-SSGSFLDLVHFLQM---EVGWYDFEEIVVLFWALWSRRN--CELFQGQNAVSDVGLWAAEYIRVFRDTRNRGCHARPVLSSR

Query:  SQIRWIPPVGLVFKVNIDAAFSTSSLRAGAGIIVRDCAGRVLFFATRFYPWCYSSDLAEGFTTRDGFRLSRDLGFLPLQLEVDSKRVFQILSEEVDDLSE
            W PP   + KVN+DAAF   S  AG G+I+RD  G V   A R        D  EGF   +G  L+ + GF+  Q+E DS R+F +L+ +  D SE
Subjt:  SQIRWIPPVGLVFKVNIDAAFSTSSLRAGAGIIVRDCAGRVLFFATRFYPWCYSSDLAEGFTTRDGFRLSRDLGFLPLQLEVDSKRVFQILSEEVDDLSE

Query:  LGLLLQEVRGFLSM-VPANSASFTLREGNQVAHLLAGYAYSRQYDEEWMEAGHDFISS
        +G+L   ++ FLS      S SFT R GN  AHLLA  A +  + + W+E   D ISS
Subjt:  LGLLLQEVRGFLSM-VPANSASFTLREGNQVAHLLAGYAYSRQYDEEWMEAGHDFISS

XP_024190234.1 uncharacterized protein LOC112194221 [Rosa chinensis]8.5e-2528.98Show/hide
Query:  RGIDVPNFCHRCKRCYEDVFHVFWACKHTRLVWQSSRFWRLYSQWSSGSFLDLVHFLQMEVGWYDFEEIVVLFWALWSRRNCELFQGQNAVSDVGLWAA-
        R +   + C RC R  E   H  W+C  ++ VW+ S    +Y QW   SF+DL   +       + E   V+ W LW  RN    +G    S   +W+A 
Subjt:  RGIDVPNFCHRCKRCYEDVFHVFWACKHTRLVWQSSRFWRLYSQWSSGSFLDLVHFLQMEVGWYDFEEIVVLFWALWSRRNCELFQGQNAVSDVGLWAA-

Query:  EYIRVFRDT-RNRGCHARPVLSSRSQIRWIPPVGLVFKVNIDAAFSTSSLRAGAGIIVRDCAGRVLFFATRFYPWCYSSDLAEGFTTRDGFRLSRDLGFL
        E++  F+D  R+     +     R +++W PP     K+N DAA      +A  G++VRD  G++     +  P   S    E      G  L R+ GF 
Subjt:  EYIRVFRDT-RNRGCHARPVLSSRSQIRWIPPVGLVFKVNIDAAFSTSSLRAGAGIIVRDCAGRVLFFATRFYPWCYSSDLAEGFTTRDGFRLSRDLGFL

Query:  PLQLEVDSKRVFQILSEEVDDLSELGLLLQEVRGFLSMVPANSASFTLREGNQVAHLLAGYAYSRQYDEEWMEAGHDFISSFV
         L +E DS  V   L++   DLS  G +L +++   S   +       REGN  AH +A +A     D  W EAG  +++  +
Subjt:  PLQLEVDSKRVFQILSEEVDDLSELGLLLQEVRGFLSMVPANSASFTLREGNQVAHLLAGYAYSRQYDEEWMEAGHDFISSFV

XP_030509082.1 uncharacterized protein LOC115723746 [Cannabis sativa]2.9e-2527.76Show/hide
Query:  ETMVYRGIDVPNFCHRCKRCYEDVFHVFWACKHTRLVWQSSRFWRLYSQWSSGSFLDLVHFLQMEVGWYDFEEIVVLFWALWSRRNCELFQGQNA-VSDV
        E + +R       C RC  CYE V H  + C+  +  W  + F          + LD++ ++Q+      F   + + W  W+ RN  +F+ Q++    +
Subjt:  ETMVYRGIDVPNFCHRCKRCYEDVFHVFWACKHTRLVWQSSRFWRLYSQWSSGSFLDLVHFLQMEVGWYDFEEIVVLFWALWSRRNCELFQGQNA-VSDV

Query:  GLWAAEYIRVFRDTRNRGCHARPVLSSRSQIRWIPPVGLVFKVNIDAAFSTSSLRAGAGIIVRDCAGRVLFFATRFYPWCYSSDLAEGFTTRDGFRLSRD
           A +Y+  ++  +++        +  + + W PP     K+N DAA S    R G G++VRD  G+VL            S +AEG+   +G + SRD
Subjt:  GLWAAEYIRVFRDTRNRGCHARPVLSSRSQIRWIPPVGLVFKVNIDAAFSTSSLRAGAGIIVRDCAGRVLFFATRFYPWCYSSDLAEGFTTRDGFRLSRD

Query:  LGFLPLQLEVDSKRVFQILSEEVDDLSELGLLLQEVRGFLSMVPANSASFTLREGNQVAHLLA
         GF    +EVD K +   L    ++LS  G ++  +R  LS +P  +   T R+GN  AH LA
Subjt:  LGFLPLQLEVDSKRVFQILSEEVDDLSELGLLLQEVRGFLSMVPANSASFTLREGNQVAHLLA

TrEMBL top hitse value%identityAlignment
A0A2N9FJZ7 Uncharacterized protein2.2e-2629.14Show/hide
Query:  ETMVYRGIDVPNFCHRCKRCYEDVFHVFWACKHTRLVWQSSRFWRLYSQWSSGSFLDLVHFLQMEVGWYDFEEIVVLFWALWSRRN-CELFQGQNAVSDV
        + +  R + V   C  C  C ED  H  W+C   +LVW+   + +         F DL+  +       + E  + LFWALW RRN   L Q  + ++ V
Subjt:  ETMVYRGIDVPNFCHRCKRCYEDVFHVFWACKHTRLVWQSSRFWRLYSQWSSGSFLDLVHFLQMEVGWYDFEEIVVLFWALWSRRN-CELFQGQNAVSDV

Query:  GLWAAEYIRVFRDTRNRGCHARPVLSSRSQIRWIPPVGLVFKVNIDAAFSTSSLRAGAGIIVRDCAGRVLFFATRFYPWCYSSDLAEGFTTRDGFRLSRD
        G  A  Y+  +    N     +P   S    RW+PP  L +KVN D A    +  AG GIIVRD +GRV+    +   + +S    E +  +   + + +
Subjt:  GLWAAEYIRVFRDTRNRGCHARPVLSSRSQIRWIPPVGLVFKVNIDAAFSTSSLRAGAGIIVRDCAGRVLFFATRFYPWCYSSDLAEGFTTRDGFRLSRD

Query:  LGFLPLQLEVDSKRVFQILSEEVDDLSELGLLLQEVRGFLSMVPANSASFTLREGNQVAHLLAGYAYSRQYDEEWMEA
        +G    + E DS+ +   L +    L+  GLL+ + +     +   S     R+GN +AH LA  A      E WMEA
Subjt:  LGFLPLQLEVDSKRVFQILSEEVDDLSELGLLLQEVRGFLSMVPANSASFTLREGNQVAHLLAGYAYSRQYDEEWMEA

A0A2N9J5F1 Uncharacterized protein4.1e-2528.57Show/hide
Query:  RGIDVPNFCHRCKRCYEDVFHVFWACKHTRLVWQSSRFWRLYSQWSSGSFLDLVHFLQMEVGWYDFEEIVVLFWALWSRRN-CELFQGQNAVSDVGLWAA
        R I V   C  C    EDV H  W+C H   VW    + +       G F DL+  +  +    + ++ +++ WALW RRN   L Q  +++  VG  A 
Subjt:  RGIDVPNFCHRCKRCYEDVFHVFWACKHTRLVWQSSRFWRLYSQWSSGSFLDLVHFLQMEVGWYDFEEIVVLFWALWSRRN-CELFQGQNAVSDVGLWAA

Query:  EYIRVFRDTRNRGCHARPVLSSRSQIRWIPPVGLVFKVNIDAAFSTSSLRAGAGIIVRDCAGRVLFFATRFYPWCYSSDLAEGFTTRDGFRLSRDLGFLP
         Y+  +    +   H +P      ++RW+PP    +K+N D A    +  AG G+IVRD  G V+    +   + +S    E +T +   + + ++G   
Subjt:  EYIRVFRDTRNRGCHARPVLSSRSQIRWIPPVGLVFKVNIDAAFSTSSLRAGAGIIVRDCAGRVLFFATRFYPWCYSSDLAEGFTTRDGFRLSRDLGFLP

Query:  LQLEVDSKRVFQILSEEVDDLSELGLLLQEVRGFLSMVPANSASFTLREGNQVAHLLAGYAYSRQYDEEWMEA
         + E DS+ V   L++    L+  GLL+ + +     +   S S   R+GN++AH LA  A      E WMEA
Subjt:  LQLEVDSKRVFQILSEEVDDLSELGLLLQEVRGFLSMVPANSASFTLREGNQVAHLLAGYAYSRQYDEEWMEA

A0A6J1CDQ4 uncharacterized protein LOC1110105331.7e-2635.94Show/hide
Query:  VGWYDFEEIVVLFWALWSRRNCELFQGQNAV-SDVGLWAAEYIRVFRDTRNRGCHARPVLS----SRSQIR-------WIPPVGLVFKVNIDAAFSTSSL
        + W DFEE+VV  W+LW+RRN  +F  +     D+  W + YI  F+ T      A   +S      SQI        W P    VFK+  DA+FS+   
Subjt:  VGWYDFEEIVVLFWALWSRRNCELFQGQNAV-SDVGLWAAEYIRVFRDTRNRGCHARPVLS----SRSQIR-------WIPPVGLVFKVNIDAAFSTSSL

Query:  RAGAG-IIVRDCAGRVLFFATRFYPWCYSSDLAEGFTTRDGFRLSRDLGFLPLQLEVDSKRVFQILSEEVDDLSELGLLLQEVRGFLSMVPANSASFTLR
         AG G II+RD  G+VL  AT++     S D AE     +G R++ + G  P+ LE DS R++ + + + + LS+ G +++ V+  L+     S SFT R
Subjt:  RAGAG-IIVRDCAGRVLFFATRFYPWCYSSDLAEGFTTRDGFRLSRDLGFLPLQLEVDSKRVFQILSEEVDDLSELGLLLQEVRGFLSMVPANSASFTLR

Query:  EGNQVAHLLAGYAYSRQ
         GN +AHLLA  A   Q
Subjt:  EGNQVAHLLAGYAYSRQ

A0A6J1DAR4 uncharacterized protein LOC1110189542.5e-3029.93Show/hide
Query:  RGIDVPNFCHRCKRCYEDVFHVFWACKHTRLVWQSSRFWRLYSQWSSGSFLDLVHFLQMEVGWYDFEEIVVLFWALWSRRNCELF-QGQNAVSDVGL---
        RG+++ N C+ C R  ED  H+FW CK    +W +S+F +L        FL ++      +   DFEE+ V+ W LW++RN   F      V  +G+   
Subjt:  RGIDVPNFCHRCKRCYEDVFHVFWACKHTRLVWQSSRFWRLYSQWSSGSFLDLVHFLQMEVGWYDFEEIVVLFWALWSRRNCELF-QGQNAVSDVGL---

Query:  -WAAEYIRVFRDTRNRGCHARPVLSSRSQIRWIPPVGLVFKVNIDAAFSTSSLRAGAGIIVRDCAGRVLFFATRFYPWCYSSDLAEGFTTRDGFRLSRDL
         WA +Y   FR+ ++     R  +++ ++I W PP   ++K+N DA+F  S   AG GII+ +  G+V+  AT++     S D+AE     +G +L+ ++
Subjt:  -WAAEYIRVFRDTRNRGCHARPVLSSRSQIRWIPPVGLVFKVNIDAAFSTSSLRAGAGIIVRDCAGRVLFFATRFYPWCYSSDLAEGFTTRDGFRLSRDL

Query:  GFLPLQLEVDSKRVFQILSEEVDDLSELGLLLQEVRGFLSMVPANSASFTLREGNQVAHLLAGYAYSRQYDEEWMEAGHDFISSFVQFEFYEDV
        G  P                 ++DLSE G ++ + + F +     S +F  REGN+ AH+LA  A        WME     + S ++ E  E++
Subjt:  GFLPLQLEVDSKRVFQILSEEVDDLSELGLLLQEVRGFLSMVPANSASFTLREGNQVAHLLAGYAYSRQYDEEWMEAGHDFISSFVQFEFYEDV

A0A6J1DBJ7 uncharacterized protein LOC1110189731.7e-3136.43Show/hide
Query:  RLVWQSSRFWR-LYSQW-SSGSFLDLVHFLQM---EVGWYDFEEIVVLFWALWSRRN--CELFQGQNAVSDVGLWAAEYIRVFRDTRNRGCHARPVLSSR
        R +WQ S+F   L+  W   GS  D+ HF+++    V W     IVVL WA+W+ RN   + F    ++SD+  W+  Y++V++  +     A  V    
Subjt:  RLVWQSSRFWR-LYSQW-SSGSFLDLVHFLQM---EVGWYDFEEIVVLFWALWSRRN--CELFQGQNAVSDVGLWAAEYIRVFRDTRNRGCHARPVLSSR

Query:  SQIRWIPPVGLVFKVNIDAAFSTSSLRAGAGIIVRDCAGRVLFFATRFYPWCYSSDLAEGFTTRDGFRLSRDLGFLPLQLEVDSKRVFQILSEEVDDLSE
            W PP   + KVN+DAAF   S  AG G+I+RD  G V   A R        D  EGF   +G  L+ + GF+  Q+E DS R+F +L+ +  D SE
Subjt:  SQIRWIPPVGLVFKVNIDAAFSTSSLRAGAGIIVRDCAGRVLFFATRFYPWCYSSDLAEGFTTRDGFRLSRDLGFLPLQLEVDSKRVFQILSEEVDDLSE

Query:  LGLLLQEVRGFLSM-VPANSASFTLREGNQVAHLLAGYAYSRQYDEEWMEAGHDFISS
        +G+L   ++ FLS      S SFT R GN  AHLLA  A +  + + W+E   D ISS
Subjt:  LGLLLQEVRGFLSM-VPANSASFTLREGNQVAHLLAGYAYSRQYDEEWMEAGHDFISS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G02650.1 Ribonuclease H-like superfamily protein7.0e-1726.71Show/hide
Query:  RGIDVPNFCHRCKRCYEDVFHVFWACKHTRLVWQSSRFWRLYSQWS-SGSFLD-LVHFLQME----VGWYDFEEIVVLFWALWSRRNCELFQGQNAVSDV
        R ID    C RC    E + H+ + C +T+ VW+S+    + +QW    SF D L   +Q+         D      + W LW  RN  LFQ +    D 
Subjt:  RGIDVPNFCHRCKRCYEDVFHVFWACKHTRLVWQSSRFWRLYSQWS-SGSFLD-LVHFLQME----VGWYDFEEIVVLFWALWSRRNCELFQGQNAVSDV

Query:  ----GLW-AAEYIRVFRDTRNRGCH--ARPVLSS-RSQIRWIPPVGLVFKVNIDAAFSTSSLRAGAGIIVRDCAGRVLFFATRFYPWCYSSDLAEGFTTR
            G+  A E++     T N   H    P+ +S R   +W PP     K N D+ ++  S    +G  +R+C G ++            S  AE     
Subjt:  ----GLW-AAEYIRVFRDTRNRGCH--ARPVLSS-RSQIRWIPPVGLVFKVNIDAAFSTSSLRAGAGIIVRDCAGRVLFFATRFYPWCYSSDLAEGFTTR

Query:  DGFRLSRDLGFLPLQLEVDSKRVFQILSEEVDDLSELGLLLQEVRGFLSMVPANSASFTLREGNQVAHLLAGYAYSR
           ++    G   +  E DSK +  +++   +D S LG L+ ++R ++  +P  S  F  RE N  A  LA + ++R
Subjt:  DGFRLSRDLGFLPLQLEVDSKRVFQILSEEVDDLSELGLLLQEVRGFLSMVPANSASFTLREGNQVAHLLAGYAYSR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACGTTTGGGATATAGACCTTCTTTCATATGGCGCAACTTACTATGGGGGCGAGATTTTCTCTGCCAGGGTATTCGTTGGCGCATTGGGGATGGCAGCTCTGTTCCA
ATTTATAATTCTAATTGGCTTCCCCGAAACTATGGTTTATAGAGGTATTGATGTGCCCAATTTTTGTCATCGGTGCAAGAGATGTTATGAGGACGTGTTCCATGTTTTTT
GGGCTTGCAAGCATACGAGGTTGGTGTGGCAAAGTTCCAGATTTTGGCGTTTGTATAGCCAGTGGTCTTCCGGCAGTTTCCTTGATTTGGTCCATTTTCTCCAGATGGAG
GTTGGCTGGTATGACTTTGAGGAAATTGTGGTTTTGTTTTGGGCTTTGTGGAGTCGACGGAATTGTGAGTTATTTCAGGGGCAGAATGCTGTGAGTGATGTGGGGCTTTG
GGCAGCGGAGTATATCAGAGTTTTCAGGGACACGCGTAATCGGGGCTGCCATGCCCGACCTGTCTTATCTTCTCGCTCTCAGATTCGATGGATTCCACCGGTGGGTTTGG
TTTTCAAGGTCAATATTGATGCTGCTTTTTCGACGTCTTCATTACGAGCGGGTGCCGGTATTATTGTTCGAGATTGTGCTGGTCGTGTTCTGTTTTTCGCTACTCGATTT
TATCCTTGGTGCTACTCCTCTGATTTGGCCGAGGGTTTCACGACGAGAGATGGTTTTCGTTTGTCCCGTGATCTTGGGTTTCTTCCGTTGCAACTTGAGGTAGATTCTAA
GCGGGTGTTTCAGATTTTGTCCGAGGAAGTTGATGATCTTTCTGAGTTGGGTCTTCTTCTTCAAGAGGTTCGTGGCTTTTTGTCTATGGTGCCTGCTAACTCTGCTAGTT
TTACCTTAAGGGAAGGGAATCAGGTGGCTCATCTCCTTGCGGGCTATGCTTACTCTCGACAGTATGATGAAGAATGGATGGAAGCTGGACATGATTTTATTTCTTCGTTT
GTTCAATTTGAATTCTATGAGGATGTTTCCTCTTCTGTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCACGTTTGGGATATAGACCTTCTTTCATATGGCGCAACTTACTATGGGGGCGAGATTTTCTCTGCCAGGGTATTCGTTGGCGCATTGGGGATGGCAGCTCTGTTCCA
ATTTATAATTCTAATTGGCTTCCCCGAAACTATGGTTTATAGAGGTATTGATGTGCCCAATTTTTGTCATCGGTGCAAGAGATGTTATGAGGACGTGTTCCATGTTTTTT
GGGCTTGCAAGCATACGAGGTTGGTGTGGCAAAGTTCCAGATTTTGGCGTTTGTATAGCCAGTGGTCTTCCGGCAGTTTCCTTGATTTGGTCCATTTTCTCCAGATGGAG
GTTGGCTGGTATGACTTTGAGGAAATTGTGGTTTTGTTTTGGGCTTTGTGGAGTCGACGGAATTGTGAGTTATTTCAGGGGCAGAATGCTGTGAGTGATGTGGGGCTTTG
GGCAGCGGAGTATATCAGAGTTTTCAGGGACACGCGTAATCGGGGCTGCCATGCCCGACCTGTCTTATCTTCTCGCTCTCAGATTCGATGGATTCCACCGGTGGGTTTGG
TTTTCAAGGTCAATATTGATGCTGCTTTTTCGACGTCTTCATTACGAGCGGGTGCCGGTATTATTGTTCGAGATTGTGCTGGTCGTGTTCTGTTTTTCGCTACTCGATTT
TATCCTTGGTGCTACTCCTCTGATTTGGCCGAGGGTTTCACGACGAGAGATGGTTTTCGTTTGTCCCGTGATCTTGGGTTTCTTCCGTTGCAACTTGAGGTAGATTCTAA
GCGGGTGTTTCAGATTTTGTCCGAGGAAGTTGATGATCTTTCTGAGTTGGGTCTTCTTCTTCAAGAGGTTCGTGGCTTTTTGTCTATGGTGCCTGCTAACTCTGCTAGTT
TTACCTTAAGGGAAGGGAATCAGGTGGCTCATCTCCTTGCGGGCTATGCTTACTCTCGACAGTATGATGAAGAATGGATGGAAGCTGGACATGATTTTATTTCTTCGTTT
GTTCAATTTGAATTCTATGAGGATGTTTCCTCTTCTGTGTAG
Protein sequenceShow/hide protein sequence
MHVWDIDLLSYGATYYGGEIFSARVFVGALGMAALFQFIILIGFPETMVYRGIDVPNFCHRCKRCYEDVFHVFWACKHTRLVWQSSRFWRLYSQWSSGSFLDLVHFLQME
VGWYDFEEIVVLFWALWSRRNCELFQGQNAVSDVGLWAAEYIRVFRDTRNRGCHARPVLSSRSQIRWIPPVGLVFKVNIDAAFSTSSLRAGAGIIVRDCAGRVLFFATRF
YPWCYSSDLAEGFTTRDGFRLSRDLGFLPLQLEVDSKRVFQILSEEVDDLSELGLLLQEVRGFLSMVPANSASFTLREGNQVAHLLAGYAYSRQYDEEWMEAGHDFISSF
VQFEFYEDVSSSV