; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g20200 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g20200
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionzf-RVT domain-containing protein
Genome locationchr3:13646360..13651256
RNA-Seq ExpressionMoc03g20200
SyntenyMoc03g20200
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF7844993.1 ribonuclease H [Senna tora]1.1e-2831.47Show/hide
Query:  VRKKLDQLCRNFLLGHTSSRARIHLISWDKITRPKREGGVGLHIFQDFNIALIGKIAWGLLTRPNDLWAQFLINKYKGGWEGDWILKVNEGDSWLWVSIM
        V  +++++ RNFL G T  +  IH +SWD++  PK  GG+G++     N A I K+AW LL    +LW + + NKY    +    L     DS LW  I+
Subjt:  VRKKLDQLCRNFLLGHTSSRARIHLISWDKITRPKREGGVGLHIFQDFNIALIGKIAWGLLTRPNDLWAQFLINKYKGGWEGDWILKVNEGDSWLWVSIM

Query:  KVWNEIKHWLGWGIRNGCRVKFWTYQW-VNGKNLLDNIRDGEISEGDRCRPIRDYISSEGEWKWEAFAGKINHSLLLTIASIRPPHISLNSPDFPIWIGS
        K+W E    + W I NG  + FW  +W VN ++L   + D   S   R   + +Y+   G+WK +    K+   ++  I SI+PP   L   D P W   
Subjt:  KVWNEIKHWLGWGIRNGCRVKFWTYQW-VNGKNLLDNIRDGEISEGDRCRPIRDYISSEGEWKWEAFAGKINHSLLLTIASIRPPHISLNSPDFPIWIGS

Query:  RNEEYSVLATCANLLALSSPSQQSK-IWKAIW
        +N  +++ +     LA+    ++++ IW +IW
Subjt:  RNEEYSVLATCANLLALSSPSQQSK-IWKAIW

PKI32035.1 hypothetical protein CRG98_047574 [Punica granatum]2.2e-3233.19Show/hide
Query:  LAGVVRKKLDQLCRNFLLGHTSSRARIHLISWDKITRPKREGGVGLHIFQDFNIALIGKIAWGLLTRPNDLWAQFLINKYKGGWEGDWILKVNEGDSWLW
        +AG+V      L +N + GH+  R  +HL+ W+ IT+PK  GG+GL   +++N AL+ K+  G LTRP  LW + L+ KY+   EG  + +V + DSWLW
Subjt:  LAGVVRKKLDQLCRNFLLGHTSSRARIHLISWDKITRPKREGGVGLHIFQDFNIALIGKIAWGLLTRPNDLWAQFLINKYKGGWEGDWILKVNEGDSWLW

Query:  VSIMKVWNEIKHWLGWGIRNGCRVKFWTYQWVNGKNLLDNIRDGEISEGDRCRPIRDYISSEGEWKWEAFAGKINHSLLLTIASIRPPHISLNSPDFPIW
         +I + WN++     W +  G  V+FW   W  G + L       I      R + ++    G W W AF+  +++  LL +ASIRPP I   +PD+   
Subjt:  VSIMKVWNEIKHWLGWGIRNGCRVKFWTYQWVNGKNLLDNIRDGEISEGDRCRPIRDYISSEGEWKWEAFAGKINHSLLLTIASIRPPHISLNSPDFPIW

Query:  IGSRNEEYSVLATCANLLALSSPSQQSKIWKAIWS
            +  +S ++T   LL   + +++  +W+ IWS
Subjt:  IGSRNEEYSVLATCANLLALSSPSQQSKIWKAIWS

PKI77361.1 hypothetical protein CRG98_002306 [Punica granatum]7.9e-3531.61Show/hide
Query:  IPTWGPSSSISQLRSSTEARDGRDYMEQDVAIGLEQADGAISVSMEEGGEFSKGKISK----GSFSGLFTVVYGSPQRGSRGWLNMECLPNCLANIQSEL
        + T GP  SIS    + +     +  E+ ++      D    VS ++    ++G  SK    G + G+  +V+G  ++            N +A I+S L
Subjt:  IPTWGPSSSISQLRSSTEARDGRDYMEQDVAIGLEQADGAISVSMEEGGEFSKGKISK----GSFSGLFTVVYGSPQRGSRGWLNMECLPNCLANIQSEL

Query:  VKSNKEVFRSSAGRKRSILARLAGVVRKKLDQLCRNFLLGHTSSRARIHLISWDKITRPKREGGVGLHIFQDFNIALIGKIAWGLLTRPNDLWAQFLINK
                   +    S+  +L   + K +++L R+F+ GH+S + +IH ++W+ ITRPK  GG+GL    DFN  L+GKI WGL+TRP++    F   K
Subjt:  VKSNKEVFRSSAGRKRSILARLAGVVRKKLDQLCRNFLLGHTSSRARIHLISWDKITRPKREGGVGLHIFQDFNIALIGKIAWGLLTRPNDLWAQFLINK

Query:  YKGGWEGDWI-LKVNEGDSWLWVSIMKVWNEIKHWLGWGIRNGCRVKFWTYQWVNGKNLLDNIRDGEISEGDRCRPIRDYISSEGEWKWEAFAGKINHSL
              GD I L     DS LW +I   WN++ H + W + +G R +FW  Q V G + + N   G+I E  R +P+R +++S+G W W +    +N+S 
Subjt:  YKGGWEGDWI-LKVNEGDSWLWVSIMKVWNEIKHWLGWGIRNGCRVKFWTYQWVNGKNLLDNIRDGEISEGDRCRPIRDYISSEGEWKWEAFAGKINHSL

Query:  LLTIASIRPP
        LL IA  RPP
Subjt:  LLTIASIRPP

XP_015389227.1 uncharacterized protein LOC107178483 [Citrus sinensis]3.2e-2832.05Show/hide
Query:  AGVVRKKLDQLCRNFLLGHTSSRARIHLISWDKITRPKREGGVGLHIFQDFNIALIGKIAWGLLTRPNDLWAQFLINKYKGGWEGDWILKV-NEGDSWLW
        AGV+  K++Q+C+ F+   ++   R+ LI WDK+ +PK  GG+GL      N AL+ K+AWGL+  P  LW + L  KY G  + D+ L +     S+LW
Subjt:  AGVVRKKLDQLCRNFLLGHTSSRARIHLISWDKITRPKREGGVGLHIFQDFNIALIGKIAWGLLTRPNDLWAQFLINKYKGGWEGDWILKV-NEGDSWLW

Query:  VSIMKVWNEIKHWLGWGIRNGCRVKFWTYQWVNGKNLLDNIRDGEISEGDRCRPIRDYISSEGEWKWEAFAGKINHSLLLTIASIRPPHISLNSPDFPIW
         SI KVW++    L W I NG RV+FW   WV     L +     I        + D++ + G W W++F   + + ++L IA+++PP        F  W
Subjt:  VSIMKVWNEIKHWLGWGIRNGCRVKFWTYQWVNGKNLLDNIRDGEISEGDRCRPIRDYISSEGEWKWEAFAGKINHSLLLTIASIRPPHISLNSPDFPIW

Query:  IGSRNEEYSVLATCANLLALSSPSQQSKIWKAIW
          S+  ++SV +    +    +  + S+ W   W
Subjt:  IGSRNEEYSVLATCANLLALSSPSQQSKIWKAIW

XP_024448418.1 uncharacterized protein LOC112325720 [Populus trichocarpa]2.5e-2829.08Show/hide
Query:  KLDQLCRNFLLGHTSSRARIHLISWDKITRPKREGGVGLHIFQDFNIALIGKIAWGLLTRPNDLWAQFLINKYKGGWEGDWILKVNEGDSWLWVSIMKVW
        +++++CR F+ G    R +IHL++W+K+ RPK EGG+GL   +  N A + K+AWG++ + N LW + L +KY      +  L     DS LW +I K W
Subjt:  KLDQLCRNFLLGHTSSRARIHLISWDKITRPKREGGVGLHIFQDFNIALIGKIAWGLLTRPNDLWAQFLINKYKGGWEGDWILKVNEGDSWLWVSIMKVW

Query:  NEIKHWLGWGIRNGCRVKFWTYQWVNGKNLLDNIRDGEISEGDRCRPIRDYISSEGEWKWEAFAGKINHSLLLTIASIRPPHISLNSPDFPIWIGSRNEE
        + +     W + NG R+ FW   W+    L+ N  + E+        + D +   G WKW+ FA  I   ++++I    PP +     D   W  S NE+
Subjt:  NEIKHWLGWGIRNGCRVKFWTYQWVNGKNLLDNIRDGEISEGDRCRPIRDYISSEGEWKWEAFAGKINHSLLLTIASIRPPHISLNSPDFPIWIGSRNEE

Query:  YSVLATCANLLALSSPSQQSKIWKAI--WSDGFSIKKATSSLGGDQVDSSTSELLKSQYEWIGSWEPRNGWSRRHCQECGWE
         +V  T   +L          IW+ I  W     IK    ++  + +   T++L   +Y        R+  + R C  CG E
Subjt:  YSVLATCANLLALSSPSQQSKIWKAI--WSDGFSIKKATSSLGGDQVDSSTSELLKSQYEWIGSWEPRNGWSRRHCQECGWE

TrEMBL top hitse value%identityAlignment
A0A151TJZ2 Putative ribonuclease H protein At1g65750 family3.5e-2829.07Show/hide
Query:  KLDQLCRNFLLGHTSSRARIHLISWDKITRPKREGGVGLHIFQDFNIALIGKIAWGLLTRPNDLWAQFLINKYKGGWEGDWILKVNEGDSWLWVSIMKVW
        ++D+ CR+FL GHTS + +IH ++W KI + K EGG+GL   +  N +  G +A  + ++P+ LW Q L +KYK G E   ++K     S +W  I   W
Subjt:  KLDQLCRNFLLGHTSSRARIHLISWDKITRPKREGGVGLHIFQDFNIALIGKIAWGLLTRPNDLWAQFLINKYKGGWEGDWILKVNEGDSWLWVSIMKVW

Query:  NEIKHWLGWGIRNGCRVKFWTYQWVNGKNLLDNIRDGEISEGDRCRPIRDYISSEGEWKWEAFAGKINHSLLLTIASIRPPHISLNSPDFPIWIGSRNEE
        + ++  + W + +G +V+FW   W+  +  L  +    + E ++ + +  YI+ EG W        +  S+ L +    PP + +  PD  +W GS    
Subjt:  NEIKHWLGWGIRNGCRVKFWTYQWVNGKNLLDNIRDGEISEGDRCRPIRDYISSEGEWKWEAFAGKINHSLLLTIASIRPPHISLNSPDFPIWIGSRNEE

Query:  YSVLATCANLLALSSPSQQSKIWKAIW
        YS+ +    +  +S+ S  + ++  IW
Subjt:  YSVLATCANLLALSSPSQQSKIWKAIW

A0A2I0HK16 Reverse transcriptase domain-containing protein1.0e-3233.19Show/hide
Query:  LAGVVRKKLDQLCRNFLLGHTSSRARIHLISWDKITRPKREGGVGLHIFQDFNIALIGKIAWGLLTRPNDLWAQFLINKYKGGWEGDWILKVNEGDSWLW
        +AG+V      L +N + GH+  R  +HL+ W+ IT+PK  GG+GL   +++N AL+ K+  G LTRP  LW + L+ KY+   EG  + +V + DSWLW
Subjt:  LAGVVRKKLDQLCRNFLLGHTSSRARIHLISWDKITRPKREGGVGLHIFQDFNIALIGKIAWGLLTRPNDLWAQFLINKYKGGWEGDWILKVNEGDSWLW

Query:  VSIMKVWNEIKHWLGWGIRNGCRVKFWTYQWVNGKNLLDNIRDGEISEGDRCRPIRDYISSEGEWKWEAFAGKINHSLLLTIASIRPPHISLNSPDFPIW
         +I + WN++     W +  G  V+FW   W  G + L       I      R + ++    G W W AF+  +++  LL +ASIRPP I   +PD+   
Subjt:  VSIMKVWNEIKHWLGWGIRNGCRVKFWTYQWVNGKNLLDNIRDGEISEGDRCRPIRDYISSEGEWKWEAFAGKINHSLLLTIASIRPPHISLNSPDFPIW

Query:  IGSRNEEYSVLATCANLLALSSPSQQSKIWKAIWS
            +  +S ++T   LL   + +++  +W+ IWS
Subjt:  IGSRNEEYSVLATCANLLALSSPSQQSKIWKAIWS

A0A2I0L9K6 Reverse transcriptase domain-containing protein3.8e-3531.61Show/hide
Query:  IPTWGPSSSISQLRSSTEARDGRDYMEQDVAIGLEQADGAISVSMEEGGEFSKGKISK----GSFSGLFTVVYGSPQRGSRGWLNMECLPNCLANIQSEL
        + T GP  SIS    + +     +  E+ ++      D    VS ++    ++G  SK    G + G+  +V+G  ++            N +A I+S L
Subjt:  IPTWGPSSSISQLRSSTEARDGRDYMEQDVAIGLEQADGAISVSMEEGGEFSKGKISK----GSFSGLFTVVYGSPQRGSRGWLNMECLPNCLANIQSEL

Query:  VKSNKEVFRSSAGRKRSILARLAGVVRKKLDQLCRNFLLGHTSSRARIHLISWDKITRPKREGGVGLHIFQDFNIALIGKIAWGLLTRPNDLWAQFLINK
                   +    S+  +L   + K +++L R+F+ GH+S + +IH ++W+ ITRPK  GG+GL    DFN  L+GKI WGL+TRP++    F   K
Subjt:  VKSNKEVFRSSAGRKRSILARLAGVVRKKLDQLCRNFLLGHTSSRARIHLISWDKITRPKREGGVGLHIFQDFNIALIGKIAWGLLTRPNDLWAQFLINK

Query:  YKGGWEGDWI-LKVNEGDSWLWVSIMKVWNEIKHWLGWGIRNGCRVKFWTYQWVNGKNLLDNIRDGEISEGDRCRPIRDYISSEGEWKWEAFAGKINHSL
              GD I L     DS LW +I   WN++ H + W + +G R +FW  Q V G + + N   G+I E  R +P+R +++S+G W W +    +N+S 
Subjt:  YKGGWEGDWI-LKVNEGDSWLWVSIMKVWNEIKHWLGWGIRNGCRVKFWTYQWVNGKNLLDNIRDGEISEGDRCRPIRDYISSEGEWKWEAFAGKINHSL

Query:  LLTIASIRPP
        LL IA  RPP
Subjt:  LLTIASIRPP

A0A2Z6N4W3 RNase H domain-containing protein3.5e-2833.67Show/hide
Query:  VRKKLDQLCRNFLLGHTSSRARIHLISWDKITRPKREGGVGLHIFQDFNIALIGKIAWGLLTRPNDLWAQFLINKYKGGWEGDWILKVNEGDSWLWVSIM
        V ++++Q+CRNF+ G T+   ++HLI+WDK+  PK EGG+     +  N A + K++W +LT+P+ LW + L  KY  G       K     S LW +I+
Subjt:  VRKKLDQLCRNFLLGHTSSRARIHLISWDKITRPKREGGVGLHIFQDFNIALIGKIAWGLLTRPNDLWAQFLINKYKGGWEGDWILKVNEGDSWLWVSIM

Query:  KVWNEIKHWLGWGIRNGCRVKFWTYQWVNGKNLLDNIRDGEISEGDRCRPIRDYISSEGEWKWEAFAGKINHSLLLTIASIRPPHISLNSPDFPIW
          W+ +K  + W IR+G   +FW   W+     L++     I + +   P+ +Y +S+G WKW+     + + +   IA I+PP  S  +PDFP W
Subjt:  KVWNEIKHWLGWGIRNGCRVKFWTYQWVNGKNLLDNIRDGEISEGDRCRPIRDYISSEGEWKWEAFAGKINHSLLLTIASIRPPHISLNSPDFPIW

A0A6N2M5M2 zf-RVT domain-containing protein8.3e-3031.58Show/hide
Query:  KKLDQLCRNFLLGHTSSRARIHLISWDKITRPKREGGVGLHIFQDFNIALIGKIAWGLLTRPNDLWAQFLINKYKGGWEGDWILKVNEGDSWLWVSIMKV
        K+++++CR F+ G T  R +IHL++W  I +PK EGG+GL      N A I K+AWG++ +   LW +FL +KY     G+     +  DS LW +I + 
Subjt:  KKLDQLCRNFLLGHTSSRARIHLISWDKITRPKREGGVGLHIFQDFNIALIGKIAWGLLTRPNDLWAQFLINKYKGGWEGDWILKVNEGDSWLWVSIMKV

Query:  WNEIKHWLGWGIRNGCRVKFWTYQWVNGKNLLDNIRDGEISEGDRCRPIRDYISSEGEWKWEAFAGKINHSLLLTIASIRPPHISLNSPDFPIWIGSRNE
        W+ ++  + W + NG  + FW  +WV     L N+ + E +       + D + S G WKWE FA  +    +++IA   PP +     D   W  S N 
Subjt:  WNEIKHWLGWGIRNGCRVKFWTYQWVNGKNLLDNIRDGEISEGDRCRPIRDYISSEGEWKWEAFAGKINHSLLLTIASIRPPHISLNSPDFPIWIGSRNE

Query:  EYSVLATCANLLALSSPSQQSKIWKAIW
          SV  T   +        +  +W+ IW
Subjt:  EYSVLATCANLLALSSPSQQSKIWKAIW

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657504.7e-2237.33Show/hide
Query:  VRKKLDQLCRNFLLGHTSSRARIHLISWDKITRPKREGGVGLHIFQDFNIALIGKIAWGLLTRPNDLWAQFLINKYKGGWEGD--WILKVNEGDSWLWVS
        +  +LDQL R FL G T+ + + HL+ W K+  PK+EGG+G+   +  N ALI K+ W LL   N LW   L  KY  G   D  W++      S  W S
Subjt:  VRKKLDQLCRNFLLGHTSSRARIHLISWDKITRPKREGGVGLHIFQDFNIALIGKIAWGLLTRPNDLWAQFLINKYKGGWEGD--WILKVNEGDSWLWVS

Query:  I-MKVWNEIKHWLGWGIRNGCRVKFWTYQWVNGKNLLDNIRDGEISEGDR
        I + + + + H +GW   +G +++FWT +WV+GK LL      E+  G+R
Subjt:  I-MKVWNEIKHWLGWGIRNGCRVKFWTYQWVNGKNLLDNIRDGEISEGDR

Arabidopsis top hitse value%identityAlignment
AT3G25720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.0e-0830.91Show/hide
Query:  PKREGGVGLHIFQDFNIALIGKIAWGLLTRPNDLWAQFLINKYKGGWEGD------WILKVNEGDSWLWVSIMKVWNEIKHWLGWGIRNGCRVKFWTYQW
        PK EGG+GL  F ++N  L  K+ W L +    LW  +    +  G  GD      W  +    DSW W  ++++    + +L   I NG   +FWT  W
Subjt:  PKREGGVGLHIFQDFNIALIGKIAWGLLTRPNDLWAQFLINKYKGGWEGD------WILKVNEGDSWLWVSIMKVWNEIKHWLGWGIRNGCRVKFWTYQW

Query:  VNGKNLLDNI
             LL +I
Subjt:  VNGKNLLDNI

AT4G29090.1 Ribonuclease H-like superfamily protein2.1e-0923.7Show/hide
Query:  LAGVVRKKLDQLCRNFLLGHTSSRARIHLISWDKITRPKREGGVGLHIFQDFNIALIGKIAWGLLTRPNDLWAQFLINKYKGGWEGDWI-LKVNEGDSWL
        L   V K++  +  +F   +      +H  +WD ++  K EGG+G    + FN+AL+GK  W +L+RP  L A+   ++Y    + D +   +    S++
Subjt:  LAGVVRKKLDQLCRNFLLGHTSSRARIHLISWDKITRPKREGGVGLHIFQDFNIALIGKIAWGLLTRPNDLWAQFLINKYKGGWEGDWI-LKVNEGDSWL

Query:  WVSIMKVWNEIKHWLGWGIRNGCRVKFWTYQWVNGKNL-----LDNIRDGEISEGDRCRPIRDYISSEG-EWK
        W SI      ++      + NG  +  W ++W++ K       +  +   E +       + D I   G EW+
Subjt:  WVSIMKVWNEIKHWLGWGIRNGCRVKFWTYQWVNGKNL-----LDNIRDGEISEGDRCRPIRDYISSEG-EWK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGGTGAACTAGATCCAGGTAAGTCTAGCACGAATCAGGAGGAGTCCCAGGGGAATCCTCCTTGGATGGATGGTGAACTAGATCCAGATAAGTCTAGCTCGAATAA
GGAGGAGTCCCAGGCCAAAGTGTTTGAAAATCTTTTTCCTCGTAGGACACGTGTCATCTGTTTTGACTCCATTATCCCAACATGGGGGCCGAGCTCTTCGATTTCTCAAT
TGAGGAGTTCTACTGAAGCCAGGGATGGTAGGGATTATATGGAACAGGATGTAGCAATTGGTCTGGAGCAGGCGGATGGAGCTATCAGCGTTAGTATGGAGGAAGGGGGA
GAGTTCTCCAAAGGGAAAATTTCTAAGGGATCCTTCTCTGGGTTATTTACCGTTGTTTATGGTAGCCCTCAGCGGGGGTCTAGAGGTTGGTTGAACATGGAGTGTCTCCC
AAACTGCCTAGCAAACATTCAGAGTGAGTTGGTCAAATCGAATAAGGAGGTTTTCCGAAGTAGTGCAGGGAGGAAGAGATCTATTTTAGCTCGGCTTGCAGGGGTTGTCC
GTAAGAAACTGGACCAATTATGTAGAAATTTTCTTTTGGGTCATACTAGTTCTAGAGCTCGGATTCACCTAATTAGTTGGGATAAGATTACAAGGCCTAAGAGGGAAGGG
GGTGTTGGTTTACATATATTTCAAGACTTTAATATCGCCTTGATTGGGAAAATTGCATGGGGTCTACTAACACGACCAAATGATTTATGGGCTCAATTTCTTATTAATAA
GTATAAGGGCGGCTGGGAAGGGGACTGGATTTTGAAGGTCAATGAAGGAGACTCCTGGTTATGGGTTTCAATAATGAAGGTTTGGAATGAAATCAAACATTGGTTGGGGT
GGGGTATTAGGAACGGTTGTAGAGTAAAATTCTGGACATATCAATGGGTTAATGGGAAGAATTTATTAGACAATATTCGAGATGGTGAAATTTCAGAGGGGGATAGGTGT
CGCCCAATTAGGGACTATATTTCAAGTGAAGGAGAATGGAAGTGGGAAGCTTTTGCGGGTAAGATCAATCATTCTTTGTTATTAACCATTGCTAGCATTAGACCGCCGCA
CATATCTTTAAATTCCCCAGATTTCCCTATTTGGATTGGTTCTAGGAATGAGGAGTATTCTGTTTTGGCTACTTGTGCTAATCTCCTTGCTTTAAGCTCTCCCTCTCAAC
AGTCTAAGATTTGGAAAGCTATTTGGAGTGATGGCTTCTCTATCAAGAAAGCCACTTCGTCGTTGGGTGGAGATCAGGTGGATTCCTCCACTAGTGAATTACTGAAATCT
CAATACGAATGGATCGGCTCATGGGAACCCAGGAATGGCTGGAGTAGGAGGCATTGTCAGGAATGCGGATGGGAATCGGATAAAAGGCTTTCAATAGAATCTGGGCTGCT
CGACCACAATCGTATTAGACAGGGAGTGGACGAGCCTCCTAGAGGCCTTCTGCGTTTGTTAATGGCGGATATTGTAGGAGTCTCCATGTCCCGTTTCATTACCTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATGGTGAACTAGATCCAGGTAAGTCTAGCACGAATCAGGAGGAGTCCCAGGGGAATCCTCCTTGGATGGATGGTGAACTAGATCCAGATAAGTCTAGCTCGAATAA
GGAGGAGTCCCAGGCCAAAGTGTTTGAAAATCTTTTTCCTCGTAGGACACGTGTCATCTGTTTTGACTCCATTATCCCAACATGGGGGCCGAGCTCTTCGATTTCTCAAT
TGAGGAGTTCTACTGAAGCCAGGGATGGTAGGGATTATATGGAACAGGATGTAGCAATTGGTCTGGAGCAGGCGGATGGAGCTATCAGCGTTAGTATGGAGGAAGGGGGA
GAGTTCTCCAAAGGGAAAATTTCTAAGGGATCCTTCTCTGGGTTATTTACCGTTGTTTATGGTAGCCCTCAGCGGGGGTCTAGAGGTTGGTTGAACATGGAGTGTCTCCC
AAACTGCCTAGCAAACATTCAGAGTGAGTTGGTCAAATCGAATAAGGAGGTTTTCCGAAGTAGTGCAGGGAGGAAGAGATCTATTTTAGCTCGGCTTGCAGGGGTTGTCC
GTAAGAAACTGGACCAATTATGTAGAAATTTTCTTTTGGGTCATACTAGTTCTAGAGCTCGGATTCACCTAATTAGTTGGGATAAGATTACAAGGCCTAAGAGGGAAGGG
GGTGTTGGTTTACATATATTTCAAGACTTTAATATCGCCTTGATTGGGAAAATTGCATGGGGTCTACTAACACGACCAAATGATTTATGGGCTCAATTTCTTATTAATAA
GTATAAGGGCGGCTGGGAAGGGGACTGGATTTTGAAGGTCAATGAAGGAGACTCCTGGTTATGGGTTTCAATAATGAAGGTTTGGAATGAAATCAAACATTGGTTGGGGT
GGGGTATTAGGAACGGTTGTAGAGTAAAATTCTGGACATATCAATGGGTTAATGGGAAGAATTTATTAGACAATATTCGAGATGGTGAAATTTCAGAGGGGGATAGGTGT
CGCCCAATTAGGGACTATATTTCAAGTGAAGGAGAATGGAAGTGGGAAGCTTTTGCGGGTAAGATCAATCATTCTTTGTTATTAACCATTGCTAGCATTAGACCGCCGCA
CATATCTTTAAATTCCCCAGATTTCCCTATTTGGATTGGTTCTAGGAATGAGGAGTATTCTGTTTTGGCTACTTGTGCTAATCTCCTTGCTTTAAGCTCTCCCTCTCAAC
AGTCTAAGATTTGGAAAGCTATTTGGAGTGATGGCTTCTCTATCAAGAAAGCCACTTCGTCGTTGGGTGGAGATCAGGTGGATTCCTCCACTAGTGAATTACTGAAATCT
CAATACGAATGGATCGGCTCATGGGAACCCAGGAATGGCTGGAGTAGGAGGCATTGTCAGGAATGCGGATGGGAATCGGATAAAAGGCTTTCAATAGAATCTGGGCTGCT
CGACCACAATCGTATTAGACAGGGAGTGGACGAGCCTCCTAGAGGCCTTCTGCGTTTGTTAATGGCGGATATTGTAGGAGTCTCCATGTCCCGTTTCATTACCTTTTAG
Protein sequenceShow/hide protein sequence
MDGELDPGKSSTNQEESQGNPPWMDGELDPDKSSSNKEESQAKVFENLFPRRTRVICFDSIIPTWGPSSSISQLRSSTEARDGRDYMEQDVAIGLEQADGAISVSMEEGG
EFSKGKISKGSFSGLFTVVYGSPQRGSRGWLNMECLPNCLANIQSELVKSNKEVFRSSAGRKRSILARLAGVVRKKLDQLCRNFLLGHTSSRARIHLISWDKITRPKREG
GVGLHIFQDFNIALIGKIAWGLLTRPNDLWAQFLINKYKGGWEGDWILKVNEGDSWLWVSIMKVWNEIKHWLGWGIRNGCRVKFWTYQWVNGKNLLDNIRDGEISEGDRC
RPIRDYISSEGEWKWEAFAGKINHSLLLTIASIRPPHISLNSPDFPIWIGSRNEEYSVLATCANLLALSSPSQQSKIWKAIWSDGFSIKKATSSLGGDQVDSSTSELLKS
QYEWIGSWEPRNGWSRRHCQECGWESDKRLSIESGLLDHNRIRQGVDEPPRGLLRLLMADIVGVSMSRFITF