CuGenDBv2

Lag0039545 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0039545
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: tRNA-specific adenosine deaminase TAD3
Genome location: chr2:46025362..46030922
RNA-Seq Expression: Lag0039545
Synteny: Lag0039545
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR002125 - Cytidine and deoxycytidylate deaminase domain
IPR002156 - Ribonuclease H domain
IPR016193 - Cytidine deaminase-like
IPR026960 - Reverse transcriptase zinc-binding domain
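The genome location field in this record follows a `seqid:start..end` pattern. A minimal Python sketch of how such a string might be parsed is given here. It assumes the coordinates are 1-based and inclusive, which the record itself does not state, and the names `Location`, `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval as written in the record (hypothetical helper type)."""
    seqid: str
    start: int
    end: int


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string such as 'chr2:46025362..46030922'."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(seqid, start, end)


def span_length(loc: Location) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("chr2:46025362..46030922")
print(loc.seqid, span_length(loc))
```

Under the inclusive-coordinate assumption, the locus spans 5561 bp on chr2. If the coordinates were 0-based half-open instead, the `+ 1` would have to be dropped.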


Homology
GenBank top hits (e-value, % identity, alignment)
CAN83063.1 hypothetical protein VITISV_010308 [Vitis vinifera] (e-value: 4.4e-46; identity: 39.69%)
Query:  GGLGIGAYNQKNNALLLKWMWRFSQLEDALWRKVVVSIYGIDQRGLSSLPPKGKARGRPWFDIMKTGIWFNNFIRFKVNKGDKVLFWTDRWIGHDTLASS
        GGLG G  + +N+ALL KW+WRF +    LW KV+ SIYG    G  +      +   PW  I +    F+ F+R  V  G+++ FW D W G+ TL S 
Subjt:  GGLGIGAYNQKNNALLLKWMWRFSQLEDALWRKVVVSIYGIDQRGLSSLPPKGKARGRPWFDIMKTGIWFNNFIRFKVNKGDKVLFWTDRWIGHDTLASS

Query:  FSDLFNISMKKNATVEQ-CWNSDSMDWDLGFRRGLFDWEITSWIGLIERIDVVRLG-DGSDKSLWCLEKSGIFSSKSAFLKLAEVSPTLKAPLIKYIWKN
        F+DL+ +   KN TV     NS  + W+  FRR L D EI     L+  ++ V L    SD   W L  SG FS K  F  L++VS  L     K++W +
Subjt:  FSDLFNISMKKNATVEQ-CWNSDSMDWDLGFRRGLFDWEITSWIGLIERIDVVRLG-DGSDKSLWCLEKSGIFSSKSAFLKLAEVSPTLKAPLIKYIWKN

Query:  CAPRKVKVFLWALAHRSLNTLDLLQKKGNNRVLSPLVCSLCWGNAESLDHIFLHCPFACAAW
          P KVK   W +AH  +NT D LQ +   + L P  C LC GN ES+DH FLHCP     W
Subjt:  CAPRKVKVFLWALAHRSLNTLDLLQKKGNNRVLSPLVCSLCWGNAESLDHIFLHCPFACAAW

RVW31800.1 putative ribonuclease H protein [Vitis vinifera] (e-value: 2.4e-47; identity: 28.62%)
Query:  GGLGIGAYNQKNNALLLKWMWRFSQLEDALWRKVVVSIYGIDQRGLSSLPPKGKARGRPWFDIMKTGIWFNNFIRFKVNKGDKVLFWTDRWIGHDTLASS
        GGLG+      N ALL KW+WR++   + LW++V+ + YG ++ G  S  P G      W +IMK   W  + +  +V KG+K+ FWTD W     L+ S
Subjt:  GGLGIGAYNQKNNALLLKWMWRFSQLEDALWRKVVVSIYGIDQRGLSSLPPKGKARGRPWFDIMKTGIWFNNFIRFKVNKGDKVLFWTDRWIGHDTLASS

Query:  FSDLFNISMKKNATVEQCW--NSDSMDWDLGFRRGLFDWEITSWIGLIERIDVVRLGDGSDKSLWCLEKSGIFSSKSAFLKLA-----------------
        F  LF +++ +NATVE+ W  NSD   W+L F R   DWE+     L+ ++  +R     D   W   KSG +  K A+  L                  
Subjt:  FSDLFNISMKKNATVEQCW--NSDSMDWDLGFRRGLFDWEITSWIGLIERIDVVRLGDGSDKSLWCLEKSGIFSSKSAFLKLA-----------------

Query:  ---------------------------EVSPTLKAPLIKYIWKNCAPRKVKVF----------------------LWALAH----------------RSL
                                    V+P L    I  I +    R ++                        LWAL +                  +
Subjt:  ---------------------------EVSPTLKAPLIKYIWKNCAPRKVKVF----------------------LWALAH----------------RSL

Query:  NTLDLLQKKGNNRVLS----------PLVCSLC-----WGNAESLDHIFLHC------------------------------------PFACA-AWRWC-
         +   L K  + +V++           ++ S C     W        +   C                                      +C   WRW  
Subjt:  NTLDLLQKKGNNRVLS----------PLVCSLC-----WGNAESLDHIFLHC------------------------------------PFACA-AWRWC-

Query:  --------WGNLTSMFLFL--------RRLISG---------WMKYFWAGILEAKSKE-----------EKVGTDAEGIYSKSGRPYLCTGYDIYLVWEP
                W  L    +          RRL  G          + +  +  + +++K            EK+     G +S + RPYLCTGYDIYLVWEP
Subjt:  --------WGNLTSMFLFL--------RRLISG---------WMKYFWAGILEAKSKE-----------EKVGTDAEGIYSKSGRPYLCTGYDIYLVWEP

Query:  CIMCAMALVHQRVRRVFYAFPNPNDGALGSVHRLQSEKSLNHHYAV--FRVL
        C MCAMALVHQR+RR+FYAFPNPN GALGSVHRLQ EKSLNHHYAV  FR L
Subjt:  CIMCAMALVHQRVRRVFYAFPNPNDGALGSVHRLQSEKSLNHHYAV--FRVL

RVW35046.1 putative ribonuclease H protein [Vitis vinifera] (e-value: 2.4e-47; identity: 40.08%)
Query:  GGLGIGAYNQKNNALLLKWMWRFSQLEDALWRKVVVSIYGIDQRGLSSLPPKGKARGRPWFDIMKTGIWFNNFIRFKVNKGDKVLFWTDRWIGHDTLASS
        GGLG G  + +N+ALL KW+WRF +    LW KV+ SIYG    G  +      +   PW  I +    F+ F+R  V  G+++ FW D W G+ TL S 
Subjt:  GGLGIGAYNQKNNALLLKWMWRFSQLEDALWRKVVVSIYGIDQRGLSSLPPKGKARGRPWFDIMKTGIWFNNFIRFKVNKGDKVLFWTDRWIGHDTLASS

Query:  FSDLFNISMKKNATVEQ-CWNSDSMDWDLGFRRGLFDWEITSWIGLIERIDVVRLG-DGSDKSLWCLEKSGIFSSKSAFLKLAEVSPTLKAPLIKYIWKN
        F+DL+ +   KN TV     NS  + W+  FRR L D EI     L+  ++ V L    SD  +W L  SG FS KS F  L++VS  L     K++W +
Subjt:  FSDLFNISMKKNATVEQ-CWNSDSMDWDLGFRRGLFDWEITSWIGLIERIDVVRLG-DGSDKSLWCLEKSGIFSSKSAFLKLAEVSPTLKAPLIKYIWKN

Query:  CAPRKVKVFLWALAHRSLNTLDLLQKKGNNRVLSPLVCSLCWGNAESLDHIFLHCPFACAAW
          P KVK   W +AH  +NT D LQ +   + L P  C LC GN ES+DH+FLHCP     W
Subjt:  CAPRKVKVFLWALAHRSLNTLDLLQKKGNNRVLSPLVCSLCWGNAESLDHIFLHCPFACAAW

RVX04562.1 putative ribonuclease H protein [Vitis vinifera] (e-value: 3.4e-46; identity: 40.08%)
Query:  GGLGIGAYNQKNNALLLKWMWRFSQLEDALWRKVVVSIYGIDQRGLSSLPPKGKARGRPWFDIMKTGIWFNNFIRFKVNKGDKVLFWTDRWIGHDTLASS
        GGLG G  + +N+ALL KW+WRF +    LW KV+ SIYG    G  +      +   PW  I +    F+ F+R  V  G+++ FW D W G+ TL S 
Subjt:  GGLGIGAYNQKNNALLLKWMWRFSQLEDALWRKVVVSIYGIDQRGLSSLPPKGKARGRPWFDIMKTGIWFNNFIRFKVNKGDKVLFWTDRWIGHDTLASS

Query:  FSDLFNISMKKNATVEQ-CWNSDSMDWDLGFRRGLFDWEITSWIGLIERIDVVRLG-DGSDKSLWCLEKSGIFSSKSAFLKLAEVSPTLKAPLIKYIWKN
        F+DL+ +   KN TV     NS  + W+L FRR L D EI     L+  +  V L    SD   W L  SG FS KS F  L++ S  L     K++W +
Subjt:  FSDLFNISMKKNATVEQ-CWNSDSMDWDLGFRRGLFDWEITSWIGLIERIDVVRLG-DGSDKSLWCLEKSGIFSSKSAFLKLAEVSPTLKAPLIKYIWKN

Query:  CAPRKVKVFLWALAHRSLNTLDLLQKKGNNRVLSPLVCSLCWGNAESLDHIFLHCPFACAAW
          P KVK   W +AH  +NT D LQ +   + L P  C LC GN ES+DH+FLHCP     W
Subjt:  CAPRKVKVFLWALAHRSLNTLDLLQKKGNNRVLSPLVCSLCWGNAESLDHIFLHCPFACAAW

RVX06748.1 putative ribonuclease H protein [Vitis vinifera] (e-value: 2.6e-46; identity: 39.31%)
Query:  GGLGIGAYNQKNNALLLKWMWRFSQLEDALWRKVVVSIYGIDQRGLSSLPPKGKARGRPWFDIMKTGIWFNNFIRFKVNKGDKVLFWTDRWIGHDTLASS
        GGLG G  + +N+ LL KW+WRF +    LW KV+ SIYG    G  +      +   PW  I +    F+ F+R  V  G+++ FW D W G+ TL S 
Subjt:  GGLGIGAYNQKNNALLLKWMWRFSQLEDALWRKVVVSIYGIDQRGLSSLPPKGKARGRPWFDIMKTGIWFNNFIRFKVNKGDKVLFWTDRWIGHDTLASS

Query:  FSDLFNISMKKNATVEQ-CWNSDSMDWDLGFRRGLFDWEITSWIGLIERIDVVRLG-DGSDKSLWCLEKSGIFSSKSAFLKLAEVSPTLKAPLIKYIWKN
        F+DL+ +   KN TV     NS  + W+  FRR L D EI     L+  ++ V L    SD  +W L  S  FS KS    L++VS  L     K++W +
Subjt:  FSDLFNISMKKNATVEQ-CWNSDSMDWDLGFRRGLFDWEITSWIGLIERIDVVRLG-DGSDKSLWCLEKSGIFSSKSAFLKLAEVSPTLKAPLIKYIWKN

Query:  CAPRKVKVFLWALAHRSLNTLDLLQKKGNNRVLSPLVCSLCWGNAESLDHIFLHCPFACAAW
         AP KVK   W +AH  +NT D LQ +   + L P  C LC GN ES+DH+FLHCP     W
Subjt:  CAPRKVKVFLWALAHRSLNTLDLLQKKGNNRVLSPLVCSLCWGNAESLDHIFLHCPFACAAW

TrEMBL top hits (e-value, % identity, alignment)
A0A438DI60 Putative ribonuclease H protein (e-value: 1.1e-47; identity: 40.08%)
Query:  GGLGIGAYNQKNNALLLKWMWRFSQLEDALWRKVVVSIYGIDQRGLSSLPPKGKARGRPWFDIMKTGIWFNNFIRFKVNKGDKVLFWTDRWIGHDTLASS
        GGLG G  + +N+ALL KW+WRF +    LW KV+ SIYG    G  +      +   PW  I +    F+ F+R  V  G+++ FW D W G+ TL S 
Subjt:  GGLGIGAYNQKNNALLLKWMWRFSQLEDALWRKVVVSIYGIDQRGLSSLPPKGKARGRPWFDIMKTGIWFNNFIRFKVNKGDKVLFWTDRWIGHDTLASS

Query:  FSDLFNISMKKNATVEQ-CWNSDSMDWDLGFRRGLFDWEITSWIGLIERIDVVRLG-DGSDKSLWCLEKSGIFSSKSAFLKLAEVSPTLKAPLIKYIWKN
        F+DL+ +   KN TV     NS  + W+  FRR L D EI     L+  ++ V L    SD  +W L  SG FS KS F  L++VS  L     K++W +
Subjt:  FSDLFNISMKKNATVEQ-CWNSDSMDWDLGFRRGLFDWEITSWIGLIERIDVVRLG-DGSDKSLWCLEKSGIFSSKSAFLKLAEVSPTLKAPLIKYIWKN

Query:  CAPRKVKVFLWALAHRSLNTLDLLQKKGNNRVLSPLVCSLCWGNAESLDHIFLHCPFACAAW
          P KVK   W +AH  +NT D LQ +   + L P  C LC GN ES+DH+FLHCP     W
Subjt:  CAPRKVKVFLWALAHRSLNTLDLLQKKGNNRVLSPLVCSLCWGNAESLDHIFLHCPFACAAW

A0A438J6H5 Putative ribonuclease H protein (e-value: 1.6e-46; identity: 40.08%)
Query:  GGLGIGAYNQKNNALLLKWMWRFSQLEDALWRKVVVSIYGIDQRGLSSLPPKGKARGRPWFDIMKTGIWFNNFIRFKVNKGDKVLFWTDRWIGHDTLASS
        GGLG G  + +N+ALL KW+WRF +    LW KV+ SIYG    G  +      +   PW  I +    F+ F+R  V  G+++ FW D W G+ TL S 
Subjt:  GGLGIGAYNQKNNALLLKWMWRFSQLEDALWRKVVVSIYGIDQRGLSSLPPKGKARGRPWFDIMKTGIWFNNFIRFKVNKGDKVLFWTDRWIGHDTLASS

Query:  FSDLFNISMKKNATVEQ-CWNSDSMDWDLGFRRGLFDWEITSWIGLIERIDVVRLG-DGSDKSLWCLEKSGIFSSKSAFLKLAEVSPTLKAPLIKYIWKN
        F+DL+ +   KN TV     NS  + W+L FRR L D EI     L+  +  V L    SD   W L  SG FS KS F  L++ S  L     K++W +
Subjt:  FSDLFNISMKKNATVEQ-CWNSDSMDWDLGFRRGLFDWEITSWIGLIERIDVVRLG-DGSDKSLWCLEKSGIFSSKSAFLKLAEVSPTLKAPLIKYIWKN

Query:  CAPRKVKVFLWALAHRSLNTLDLLQKKGNNRVLSPLVCSLCWGNAESLDHIFLHCPFACAAW
          P KVK   W +AH  +NT D LQ +   + L P  C LC GN ES+DH+FLHCP     W
Subjt:  CAPRKVKVFLWALAHRSLNTLDLLQKKGNNRVLSPLVCSLCWGNAESLDHIFLHCPFACAAW

A0A438JCR4 Putative ribonuclease H protein (e-value: 1.3e-46; identity: 39.31%)
Query:  GGLGIGAYNQKNNALLLKWMWRFSQLEDALWRKVVVSIYGIDQRGLSSLPPKGKARGRPWFDIMKTGIWFNNFIRFKVNKGDKVLFWTDRWIGHDTLASS
        GGLG G  + +N+ LL KW+WRF +    LW KV+ SIYG    G  +      +   PW  I +    F+ F+R  V  G+++ FW D W G+ TL S 
Subjt:  GGLGIGAYNQKNNALLLKWMWRFSQLEDALWRKVVVSIYGIDQRGLSSLPPKGKARGRPWFDIMKTGIWFNNFIRFKVNKGDKVLFWTDRWIGHDTLASS

Query:  FSDLFNISMKKNATVEQ-CWNSDSMDWDLGFRRGLFDWEITSWIGLIERIDVVRLG-DGSDKSLWCLEKSGIFSSKSAFLKLAEVSPTLKAPLIKYIWKN
        F+DL+ +   KN TV     NS  + W+  FRR L D EI     L+  ++ V L    SD  +W L  S  FS KS    L++VS  L     K++W +
Subjt:  FSDLFNISMKKNATVEQ-CWNSDSMDWDLGFRRGLFDWEITSWIGLIERIDVVRLG-DGSDKSLWCLEKSGIFSSKSAFLKLAEVSPTLKAPLIKYIWKN

Query:  CAPRKVKVFLWALAHRSLNTLDLLQKKGNNRVLSPLVCSLCWGNAESLDHIFLHCPFACAAW
         AP KVK   W +AH  +NT D LQ +   + L P  C LC GN ES+DH+FLHCP     W
Subjt:  CAPRKVKVFLWALAHRSLNTLDLLQKKGNNRVLSPLVCSLCWGNAESLDHIFLHCPFACAAW

A0A438JRF4 LINE-1 retrotransposable element ORF2 protein (e-value: 2.8e-46; identity: 39.31%)
Query:  GGLGIGAYNQKNNALLLKWMWRFSQLEDALWRKVVVSIYGIDQRGLSSLPPKGKARGRPWFDIMKTGIWFNNFIRFKVNKGDKVLFWTDRWIGHDTLASS
        GGLG G  + +N ALL KW+WRF +    LW KV+ SIYG    G  +      +   PW  I +    F+ F+R  V  G+++ FW D W G+ +L S 
Subjt:  GGLGIGAYNQKNNALLLKWMWRFSQLEDALWRKVVVSIYGIDQRGLSSLPPKGKARGRPWFDIMKTGIWFNNFIRFKVNKGDKVLFWTDRWIGHDTLASS

Query:  FSDLFNISMKKNATVEQ-CWNSDSMDWDLGFRRGLFDWEITSWIGLIERIDVVRLGDG-SDKSLWCLEKSGIFSSKSAFLKLAEVSPTLKAPLIKYIWKN
        F+DL+ +   KN TV     NS  + W+L FRR L D EI     L+  +  VR     +D   W L  SG+F+ KS FL L++VS  +     K++W +
Subjt:  FSDLFNISMKKNATVEQ-CWNSDSMDWDLGFRRGLFDWEITSWIGLIERIDVVRLGDG-SDKSLWCLEKSGIFSSKSAFLKLAEVSPTLKAPLIKYIWKN

Query:  CAPRKVKVFLWALAHRSLNTLDLLQKKGNNRVLSPLVCSLCWGNAESLDHIFLHCPFACAAW
          P KVK   W +AH  +NT D LQ +   + L P  C LC GN ES+DH+FLHCP     W
Subjt:  CAPRKVKVFLWALAHRSLNTLDLLQKKGNNRVLSPLVCSLCWGNAESLDHIFLHCPFACAAW

A5BBD6 zf-RVT domain-containing protein (e-value: 2.1e-46; identity: 39.69%)
Query:  GGLGIGAYNQKNNALLLKWMWRFSQLEDALWRKVVVSIYGIDQRGLSSLPPKGKARGRPWFDIMKTGIWFNNFIRFKVNKGDKVLFWTDRWIGHDTLASS
        GGLG G  + +N+ALL KW+WRF +    LW KV+ SIYG    G  +      +   PW  I +    F+ F+R  V  G+++ FW D W G+ TL S 
Subjt:  GGLGIGAYNQKNNALLLKWMWRFSQLEDALWRKVVVSIYGIDQRGLSSLPPKGKARGRPWFDIMKTGIWFNNFIRFKVNKGDKVLFWTDRWIGHDTLASS

Query:  FSDLFNISMKKNATVEQ-CWNSDSMDWDLGFRRGLFDWEITSWIGLIERIDVVRLG-DGSDKSLWCLEKSGIFSSKSAFLKLAEVSPTLKAPLIKYIWKN
        F+DL+ +   KN TV     NS  + W+  FRR L D EI     L+  ++ V L    SD   W L  SG FS K  F  L++VS  L     K++W +
Subjt:  FSDLFNISMKKNATVEQ-CWNSDSMDWDLGFRRGLFDWEITSWIGLIERIDVVRLG-DGSDKSLWCLEKSGIFSSKSAFLKLAEVSPTLKAPLIKYIWKN

Query:  CAPRKVKVFLWALAHRSLNTLDLLQKKGNNRVLSPLVCSLCWGNAESLDHIFLHCPFACAAW
          P KVK   W +AH  +NT D LQ +   + L P  C LC GN ES+DH FLHCP     W
Subjt:  CAPRKVKVFLWALAHRSLNTLDLLQKKGNNRVLSPLVCSLCWGNAESLDHIFLHCPFACAAW

SwissProt top hits (e-value, % identity, alignment)
F4KH86 tRNA-specific adenosine deaminase TAD3 (e-value: 7.4e-28; identity: 80%)
Query:  RPYLCTGYDIYLVWEPCIMCAMALVHQRVRRVFYAFPNPNDGALGSVHRLQSEKSLNHHYAVFRVLLHED
        RPYLCTGYDI+L+ EPC MCAMALVHQR++R+FYAFPN   G LGSVHRLQ EKSLNHHYAVFRVLL +D
Subjt:  RPYLCTGYDIYLVWEPCIMCAMALVHQRVRRVFYAFPNPNDGALGSVHRLQSEKSLNHHYAVFRVLLHED

Q561R2 Probable inactive tRNA-specific adenosine deaminase-like protein 3 (e-value: 5.9e-17; identity: 57.35%)
Query:  PYLCTGYDIYLVWEPCIMCAMALVHQRVRRVFYAFPNPNDGALGSVHRLQSEKSLNHHYAVFRVLLHE
        PY+CTGYD+Y+  EPC+MCAMALVH R++RVFY  P+P DGALG+  R+ +   LNH + VFR +L +
Subjt:  PYLCTGYDIYLVWEPCIMCAMALVHQRVRRVFYAFPNPNDGALGSVHRLQSEKSLNHHYAVFRVLLHE

Q6PAT0 Probable inactive tRNA-specific adenosine deaminase-like protein 3 (e-value: 2.0e-17; identity: 57.35%)
Query:  PYLCTGYDIYLVWEPCIMCAMALVHQRVRRVFYAFPNPNDGALGSVHRLQSEKSLNHHYAVFRVLLHE
        PY+CTGYD+Y+  EPC+MCAMALVH R++RVFY  P+P DGALG++ R+ +   LNH + VFR +L +
Subjt:  PYLCTGYDIYLVWEPCIMCAMALVHQRVRRVFYAFPNPNDGALGSVHRLQSEKSLNHHYAVFRVLLHE

Q8JFW4 Probable inactive tRNA-specific adenosine deaminase-like protein 3 (e-value: 8.5e-16; identity: 53.73%)
Query:  KSGRPYLCTGYDIYLVWEPCIMCAMALVHQRVRRVFYAFPNPNDGALGSVHRLQSEKSLNHHYAVFR
        ++G PY+CTGYD+Y+  EPC+MCAMALVH R+ RVFY   +  DGA GS +++  +K LNH + VF+
Subjt:  KSGRPYLCTGYDIYLVWEPCIMCAMALVHQRVRRVFYAFPNPNDGALGSVHRLQSEKSLNHHYAVFR

Q96EY9 Probable inactive tRNA-specific adenosine deaminase-like protein 3 (e-value: 1.5e-17; identity: 58.33%)
Query:  KSGRPYLCTGYDIYLVWEPCIMCAMALVHQRVRRVFYAFPNPNDGALGSVHRLQSEKSLNHHYAVFRVLLHE
        + G PYLCTGYD+Y+  EPC MCAMALVH R+ RVFY  P+P DGALG+  R+ +   LNH + VFR +L E
Subjt:  KSGRPYLCTGYDIYLVWEPCIMCAMALVHQRVRRVFYAFPNPNDGALGSVHRLQSEKSLNHHYAVFRVLLHE

Arabidopsis top hits (e-value, % identity, alignment)
AT5G24670.1 Cytidine/deoxycytidylate deaminase family protein (e-value: 5.2e-29; identity: 80%)
Query:  RPYLCTGYDIYLVWEPCIMCAMALVHQRVRRVFYAFPNPNDGALGSVHRLQSEKSLNHHYAVFRVLLHED
        RPYLCTGYDI+L+ EPC MCAMALVHQR++R+FYAFPN   G LGSVHRLQ EKSLNHHYAVFRVLL +D
Subjt:  RPYLCTGYDIYLVWEPCIMCAMALVHQRVRRVFYAFPNPNDGALGSVHRLQSEKSLNHHYAVFRVLLHED

AT5G24670.2 Cytidine/deoxycytidylate deaminase family protein (e-value: 5.2e-29; identity: 80%)
Query:  RPYLCTGYDIYLVWEPCIMCAMALVHQRVRRVFYAFPNPNDGALGSVHRLQSEKSLNHHYAVFRVLLHED
        RPYLCTGYDI+L+ EPC MCAMALVHQR++R+FYAFPN   G LGSVHRLQ EKSLNHHYAVFRVLL +D
Subjt:  RPYLCTGYDIYLVWEPCIMCAMALVHQRVRRVFYAFPNPNDGALGSVHRLQSEKSLNHHYAVFRVLLHED
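Each hit above reports a % identity next to a three-row alignment. As a rough illustration of how such a figure can be derived from the rows, the Python sketch below counts identical columns in the middle line and divides by the number of aligned columns. It assumes the usual BLAST-style midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap). The function name `percent_identity` is hypothetical, and the exact formula used by the database is not stated in this record.

```python
def percent_identity(query_row: str, midline: str) -> float:
    """Percent of aligned columns whose midline character is a letter.

    ASSUMPTION: BLAST-style midline, where a letter means the query and
    subject residues are identical in that column. The denominator is the
    alignment length, i.e. the query row including any gap characters.
    """
    if not query_row:
        raise ValueError("empty alignment")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(query_row), 2)


# Query row and midline copied from the Q561R2 (SwissProt) hit above.
query = "PYLCTGYDIYLVWEPCIMCAMALVHQRVRRVFYAFPNPNDGALGSVHRLQSEKSLNHHYAVFRVLLHE"
midline = "PY+CTGYD+Y+  EPC+MCAMALVH R++RVFY  P+P DGALG+  R+ +   LNH + VFR +L +"

print(percent_identity(query, midline))
```

For this hit the midline contains 39 identical positions over 68 aligned columns, which agrees with the 57.35 listed for Q561R2. For alignments split over several blocks, the rows of all blocks would need to be concatenated before calling the function.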


Sequences
CDS sequence
ATGAATTTTGGAGGCCTCGGTATTGGAGCTTACAATCAGAAAAATAACGCCTTATTGTTAAAATGGATGTGGCGTTTCAGTCAGCTTGAAGATGCTCTTTGGAGGAAAGT
CGTGGTGAGTATCTATGGCATAGATCAGAGAGGGTTGAGCTCCCTTCCCCCGAAAGGTAAAGCTAGAGGCAGACCCTGGTTTGATATCATGAAAACGGGCATATGGTTCA
ATAATTTCATCCGTTTTAAGGTTAATAAGGGTGATAAGGTTCTTTTTTGGACTGATCGATGGATTGGGCATGACACGCTAGCCTCTTCCTTCTCAGATTTGTTTAACATC
TCTATGAAGAAAAATGCCACGGTGGAACAATGTTGGAACTCAGACTCGATGGACTGGGACTTGGGATTCAGAAGAGGCCTTTTTGATTGGGAGATCACCAGCTGGATTGG
GCTGATTGAGAGGATTGATGTTGTTCGCCTAGGGGATGGTAGTGATAAATCTCTATGGTGCCTTGAAAAATCCGGAATTTTTTCTTCAAAATCTGCTTTCTTAAAGCTGG
CAGAAGTGAGTCCTACCCTCAAAGCCCCCTTGATCAAGTACATTTGGAAGAATTGTGCTCCTAGGAAAGTCAAGGTCTTCCTTTGGGCCCTTGCCCACAGAAGCCTCAAT
ACCCTTGACCTTCTTCAGAAGAAAGGCAACAATCGAGTGTTATCCCCGTTGGTCTGTTCTTTGTGTTGGGGGAATGCTGAGTCTCTAGACCATATCTTTCTCCACTGCCC
TTTTGCTTGTGCGGCTTGGAGGTGGTGTTGGGGGAATTTAACATCCATGTTTCTTTTCCTGAGAAGATTGATCAGTGGATGGATGAAGTATTTTTGGGCTGGAATTTTAG
AGGCCAAAAGCAAGGAGGAGAAGGTAGGTACTGATGCTGAAGGCATATATTCTAAATCAGGCAGACCTTATCTGTGCACTGGTTATGACATTTATCTTGTGTGGGAGCCA
TGTATTATGTGTGCAATGGCGCTCGTTCACCAACGAGTTCGGCGTGTGTTTTATGCGTTCCCGAATCCTAACGACGGGGCACTGGGCAGTGTTCACAGGCTACAGAGTGA
AAAGAGCTTAAACCATCACTATGCTGTTTTTAGAGTTTTGTTACATGAAGATATCTGGACTCATAGGAACATGGTGGTTCAAAACAAAAACAATCTGGATATGCAGATGT
TGTCCACCAAAATTCAACAATATATGGCAGAATTCCTACATCAAGAAGTGTTCGGGGATGAATCGGGTACCTGGGCAACTGGAAATGCGTTCAAACCGAGTTGGCAAGAA
GAGAAGATGCGCAACAGGAGCACGCGTCTGTGTGGATTCCCCCACCGATTGGACTCTTCAAACTCAACTGTGATGCAACTTGGAGCATGCGTCTTCGCTGAGGAGGGGTT
GCTTGGATCCTTAGAGACTCGAGCGGGCGCCCTTTGCTTGGAGGGCCTGCGAGATATCCCCTCTGATTTTCCTAGTTTCCAGCTTGAGTCGGATGCTCTTCAGGTGGTGA
AATTGCTGAATAACGAGTGGAGGGATGACACTGAATTAGGGGGATTCATAATGGAAGCACAAGCACTAATAAGTTCCCTTAAAGTTGACTCTATAAGGCATGCAACAAGA
CTTCATAATGGGATGGCCCATCGTATGGCCCATAAGACATGTGAGCTAAATGTTTCTAGTAATTGGGCTGCTGATTTTCCTACATGGTTGTTAGATTTTAATGCTTGTGA
TATTATGAGAGATTATCATATTTGTGGGGGTCTCTGTCCCACAGGTGATTTTGGACCACTCTGGAATTCGAGGACTAAGCAGGACAATGCGACACAACCCGGAAGCATGA
CCAGGAAACGGATCCTAGAGGAGACTCGACCTCAAGCCGAGGCCGACTATTCGGCCTACTCGTGCAGGAAAAAGTATTTAAACCCTTCTTTGTCACTGAAGAAGGGATCC
CGAACTCTGTTATCTGATTCTCCTCTTACTCTTGCTCTTTGGCTTCCCATCGTTCTGTTTGCTGACTTAAGCATCGGAGGCGGTGTGGCAAGCACCACACCGGTGTGTAG
GTTTTACTGTCTTGCAGTTGGCACCATCTGTGGGGAAGAAAGCTTGCTAGTTAAATCTGTGTTTCGGTCATTCAATGAGAATAGGATGGAAAAGGGAAACCACAATCCAA
ACACAGAAACTTTAGAAAATAATCGTCAGGCACAGAGGTCGCGAGAAGATGGTAACACACAGAGGTCATTGAGACAAACAGGCCAGGAAGCCGAGGTCGAGGAGAAGAAG
CTGACGCCAAAATTTCCGCCCTTGAGGATAAGGATCGAAGGGAAAGAGACCAACAACGCGACCAACAAGGTCGGGGGGCTGAAGCATGCCGAACGCACAGTTCTGAGGAG
TCTAGAATCAATTACCAGCCGTAGAACAGACTTAAGGAACCTGATTGAAGAGAAGCGCAAATGGCCAAATTGCCTAGGCCAAGGCCAAGGCTGCGAGGCTGAGGCCAGAG
CTGTCGAGGCCGAGGCTAGGGCAGCCGAGACCGAGACCGAGGCCAAGAAGGACAATCTCCCTTGA
mRNA sequence
ATGAATTTTGGAGGCCTCGGTATTGGAGCTTACAATCAGAAAAATAACGCCTTATTGTTAAAATGGATGTGGCGTTTCAGTCAGCTTGAAGATGCTCTTTGGAGGAAAGT
CGTGGTGAGTATCTATGGCATAGATCAGAGAGGGTTGAGCTCCCTTCCCCCGAAAGGTAAAGCTAGAGGCAGACCCTGGTTTGATATCATGAAAACGGGCATATGGTTCA
ATAATTTCATCCGTTTTAAGGTTAATAAGGGTGATAAGGTTCTTTTTTGGACTGATCGATGGATTGGGCATGACACGCTAGCCTCTTCCTTCTCAGATTTGTTTAACATC
TCTATGAAGAAAAATGCCACGGTGGAACAATGTTGGAACTCAGACTCGATGGACTGGGACTTGGGATTCAGAAGAGGCCTTTTTGATTGGGAGATCACCAGCTGGATTGG
GCTGATTGAGAGGATTGATGTTGTTCGCCTAGGGGATGGTAGTGATAAATCTCTATGGTGCCTTGAAAAATCCGGAATTTTTTCTTCAAAATCTGCTTTCTTAAAGCTGG
CAGAAGTGAGTCCTACCCTCAAAGCCCCCTTGATCAAGTACATTTGGAAGAATTGTGCTCCTAGGAAAGTCAAGGTCTTCCTTTGGGCCCTTGCCCACAGAAGCCTCAAT
ACCCTTGACCTTCTTCAGAAGAAAGGCAACAATCGAGTGTTATCCCCGTTGGTCTGTTCTTTGTGTTGGGGGAATGCTGAGTCTCTAGACCATATCTTTCTCCACTGCCC
TTTTGCTTGTGCGGCTTGGAGGTGGTGTTGGGGGAATTTAACATCCATGTTTCTTTTCCTGAGAAGATTGATCAGTGGATGGATGAAGTATTTTTGGGCTGGAATTTTAG
AGGCCAAAAGCAAGGAGGAGAAGGTAGGTACTGATGCTGAAGGCATATATTCTAAATCAGGCAGACCTTATCTGTGCACTGGTTATGACATTTATCTTGTGTGGGAGCCA
TGTATTATGTGTGCAATGGCGCTCGTTCACCAACGAGTTCGGCGTGTGTTTTATGCGTTCCCGAATCCTAACGACGGGGCACTGGGCAGTGTTCACAGGCTACAGAGTGA
AAAGAGCTTAAACCATCACTATGCTGTTTTTAGAGTTTTGTTACATGAAGATATCTGGACTCATAGGAACATGGTGGTTCAAAACAAAAACAATCTGGATATGCAGATGT
TGTCCACCAAAATTCAACAATATATGGCAGAATTCCTACATCAAGAAGTGTTCGGGGATGAATCGGGTACCTGGGCAACTGGAAATGCGTTCAAACCGAGTTGGCAAGAA
GAGAAGATGCGCAACAGGAGCACGCGTCTGTGTGGATTCCCCCACCGATTGGACTCTTCAAACTCAACTGTGATGCAACTTGGAGCATGCGTCTTCGCTGAGGAGGGGTT
GCTTGGATCCTTAGAGACTCGAGCGGGCGCCCTTTGCTTGGAGGGCCTGCGAGATATCCCCTCTGATTTTCCTAGTTTCCAGCTTGAGTCGGATGCTCTTCAGGTGGTGA
AATTGCTGAATAACGAGTGGAGGGATGACACTGAATTAGGGGGATTCATAATGGAAGCACAAGCACTAATAAGTTCCCTTAAAGTTGACTCTATAAGGCATGCAACAAGA
CTTCATAATGGGATGGCCCATCGTATGGCCCATAAGACATGTGAGCTAAATGTTTCTAGTAATTGGGCTGCTGATTTTCCTACATGGTTGTTAGATTTTAATGCTTGTGA
TATTATGAGAGATTATCATATTTGTGGGGGTCTCTGTCCCACAGGTGATTTTGGACCACTCTGGAATTCGAGGACTAAGCAGGACAATGCGACACAACCCGGAAGCATGA
CCAGGAAACGGATCCTAGAGGAGACTCGACCTCAAGCCGAGGCCGACTATTCGGCCTACTCGTGCAGGAAAAAGTATTTAAACCCTTCTTTGTCACTGAAGAAGGGATCC
CGAACTCTGTTATCTGATTCTCCTCTTACTCTTGCTCTTTGGCTTCCCATCGTTCTGTTTGCTGACTTAAGCATCGGAGGCGGTGTGGCAAGCACCACACCGGTGTGTAG
GTTTTACTGTCTTGCAGTTGGCACCATCTGTGGGGAAGAAAGCTTGCTAGTTAAATCTGTGTTTCGGTCATTCAATGAGAATAGGATGGAAAAGGGAAACCACAATCCAA
ACACAGAAACTTTAGAAAATAATCGTCAGGCACAGAGGTCGCGAGAAGATGGTAACACACAGAGGTCATTGAGACAAACAGGCCAGGAAGCCGAGGTCGAGGAGAAGAAG
CTGACGCCAAAATTTCCGCCCTTGAGGATAAGGATCGAAGGGAAAGAGACCAACAACGCGACCAACAAGGTCGGGGGGCTGAAGCATGCCGAACGCACAGTTCTGAGGAG
TCTAGAATCAATTACCAGCCGTAGAACAGACTTAAGGAACCTGATTGAAGAGAAGCGCAAATGGCCAAATTGCCTAGGCCAAGGCCAAGGCTGCGAGGCTGAGGCCAGAG
CTGTCGAGGCCGAGGCTAGGGCAGCCGAGACCGAGACCGAGGCCAAGAAGGACAATCTCCCTTGA
Protein sequence
MNFGGLGIGAYNQKNNALLLKWMWRFSQLEDALWRKVVVSIYGIDQRGLSSLPPKGKARGRPWFDIMKTGIWFNNFIRFKVNKGDKVLFWTDRWIGHDTLASSFSDLFNI
SMKKNATVEQCWNSDSMDWDLGFRRGLFDWEITSWIGLIERIDVVRLGDGSDKSLWCLEKSGIFSSKSAFLKLAEVSPTLKAPLIKYIWKNCAPRKVKVFLWALAHRSLN
TLDLLQKKGNNRVLSPLVCSLCWGNAESLDHIFLHCPFACAAWRWCWGNLTSMFLFLRRLISGWMKYFWAGILEAKSKEEKVGTDAEGIYSKSGRPYLCTGYDIYLVWEP
CIMCAMALVHQRVRRVFYAFPNPNDGALGSVHRLQSEKSLNHHYAVFRVLLHEDIWTHRNMVVQNKNNLDMQMLSTKIQQYMAEFLHQEVFGDESGTWATGNAFKPSWQE
EKMRNRSTRLCGFPHRLDSSNSTVMQLGACVFAEEGLLGSLETRAGALCLEGLRDIPSDFPSFQLESDALQVVKLLNNEWRDDTELGGFIMEAQALISSLKVDSIRHATR
LHNGMAHRMAHKTCELNVSSNWAADFPTWLLDFNACDIMRDYHICGGLCPTGDFGPLWNSRTKQDNATQPGSMTRKRILEETRPQAEADYSAYSCRKKYLNPSLSLKKGS
RTLLSDSPLTLALWLPIVLFADLSIGGGVASTTPVCRFYCLAVGTICGEESLLVKSVFRSFNENRMEKGNHNPNTETLENNRQAQRSREDGNTQRSLRQTGQEAEVEEKK
LTPKFPPLRIRIEGKETNNATNKVGGLKHAERTVLRSLESITSRRTDLRNLIEEKRKWPNCLGQGQGCEAEARAVEAEARAAETETEAKKDNLP
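The protein sequence listed here corresponds to a codon-by-codon translation of the CDS. The following Python sketch shows one way to check that relationship on a short fragment. It assumes the standard nuclear genetic code (NCBI translation table 1), and `CODON_TABLE` and `translate` are hypothetical names introduced for illustration, not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), laid out in the conventional TCAG order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence in frame 0.

    ASSUMPTION: standard nuclear code. Unknown codons (e.g. containing N)
    become 'X'. With to_stop=True, translation ends at the first stop codon.
    """
    seq = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 27 nt and last 12 nt of the CDS sequence listed above.
print(translate("ATGAATTTTGGAGGCCTCGGTATTGGA"))
print(translate("AATCTCCCTTGA"))
```

The two fragments translate to `MNFGGLGIG` and `NLP`, which match the start and end of the protein sequence above. Passing the full CDS to `translate` would extend the same check to the whole record.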