; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005385 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005385
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRNase H domain-containing protein
Genome locationchr6:15957177..15964925
RNA-Seq ExpressionLag0005385
SyntenyLag0005385
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]9.3e-1222.33Show/hide
Query:  IPKASIF----PNGLNHLS-VADFITPSLHWDVGKLDQFLGRKDVDEILTLPI-SGTTPDRWILHYDKRGEYTVKSGYKLG-------------------
        IP+ + F    P  L H + VAD I     W V +L+Q   ++D++ IL + + SG   D  + H+DK+GEY+VKSGY+L                    
Subjt:  IPKASIF----PNGLNHLS-VADFITPSLHWDVGKLDQFLGRKDVDEILTLPI-SGTTPDRWILHYDKRGEYTVKSGYKLG-------------------

Query:  ---------------------PLGPTGSSIGALRGQNSP-------------------KSLRNFGSHKPLNTYPKEN------TGVTPWVSKNDGPRVVL
                              + PT  ++   R    P                   K+ R      PL   P ++      + +    S++      L
Subjt:  ---------------------PLGPTGSSIGALRGQNSP-------------------KSLRNFGSHKPLNTYPKEN------TGVTPWVSKNDGPRVVL

Query:  MFTRPTYMGGAWAIWNDRNNFVHNRPIADVAIRCTWIQDYLIDYGKANVPCGPRVCTRLEEGILFQ-----SRGSLIMNVDAAFDTK-------------
        M          W IW+ RN F+     +D           L  Y + + P    V    + GI  Q     S+  L +NVDAA  TK             
Subjt:  MFTRPTYMGGAWAIWNDRNNFVHNRPIADVAIRCTWIQDYLIDYGKANVPCGPRVCTRLEEGILFQ-----SRGSLIMNVDAAFDTK-------------

Query:  ----------------------AIALLEGMALAINLNKSRMTTFFDSLSFVRTLQGEMSCDSSISTVKGDIDRLKMSFQRITFSHVRRQLNTIAHELAQL
                              A A+  G+ +A  ++ S +    D    V  L       + I  +  D+ R    F+++ FS + R  NT AH LA+ 
Subjt:  ----------------------AIALLEGMALAINLNKSRMTTFFDSLSFVRTLQGEMSCDSSISTVKGDIDRLKMSFQRITFSHVRRQLNTIAHELAQL

Query:  GVK-SRTHVWTSQFPEWISSL
         ++ S T VW   FP  + ++
Subjt:  GVK-SRTHVWTSQFPEWISSL

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]2.4e-2024.88Show/hide
Query:  FPNGLNHLSVADFITPSLHWDVGKLDQFLGRKDVDEILTLPISG-TTPDRWILHYDKRGEYTVKSGYKL----------GPLGPTGSSIGALRGQNSPKS
        F NG    +VA FIT   +WDV  +      +D D IL++PIS     D W+ HYDKRG Y+V+SGYKL                G+   ++     P  
Subjt:  FPNGLNHLSVADFITPSLHWDVGKLDQFLGRKDVDEILTLPISG-TTPDRWILHYDKRGEYTVKSGYKL----------GPLGPTGSSIGALRGQNSPKS

Query:  LRNF---GSHKPLNTYPK---ENTGVTP-------------------------W---------VSKNDGPRVVLMFT------RPTYMGGA----WAIWN
        ++ F    +H+ + T         G  P                         W         +S  D    + +++       P  +  A    W IWN
Subjt:  LRNF---GSHKPLNTYPK---ENTGVTP-------------------------W---------VSKNDGPRVVLMFT------RPTYMGGA----WAIWN

Query:  DRNNFVHNRPIADVAIRCTWIQDYLIDYGKANVP-CGPRVCTRLEEGILF---QSRGSLIMNVDAA----------------------------FDTKAI
        DRN+ +H + ++ V  +C W+  +L  + +A +    PR  +     + +    S  SL +N DAA                            F    +
Subjt:  DRNNFVHNRPIADVAIRCTWIQDYLIDYGKANVP-CGPRVCTRLEEGILF---QSRGSLIMNVDAA----------------------------FDTKAI

Query:  -----ALLEGMALAINLNKSRMTTFFDSLSFVRTLQGEMSCDSSISTVKGDIDRLKMSFQRITFSHVRRQLNTIAHELAQLGV--KSRTHVWTSQFPEWI
              +LEG+  A   N + +    DSL  ++ ++ E+           +I  L   F  I+FSH  RQ N  AH LA+ G+   S T+ W   FP W+
Subjt:  -----ALLEGMALAINLNKSRMTTFFDSLSFVRTLQGEMSCDSSISTVKGDIDRLKMSFQRITFSHVRRQLNTIAHELAQLGV--KSRTHVWTSQFPEWI

Query:  SSLAQPLSPS
          L Q   PS
Subjt:  SSLAQPLSPS

XP_030497600.1 uncharacterized protein LOC115713257 [Cannabis sativa]5.6e-1725.38Show/hide
Query:  VADFITPSLHWDVGKLDQFLGRKDVDEILTLPIS-GTTPDRWILHYDKRGEYTVKSGYKLG----PLGPTGSSIG------ALRGQNSPKSLRNFG----
        VAD+IT +  WD+  L       D+D ILT+P+S  +T DRW  HYD  G+YTVKSGY L         + SS           G N P  +R FG    
Subjt:  VADFITPSLHWDVGKLDQFLGRKDVDEILTLPIS-GTTPDRWILHYDKRGEYTVKSGYKLG----PLGPTGSSIG------ALRGQNSPKSLRNFG----

Query:  ----------SHKPLNTYPKENTGVTPWVSKNDG-------------PRVVLMFTRPTYMGGA----------------------WAIWNDRNNFVHNRP
                   H+ + T    +     W S                     L FT+ ++M                         W IW+DRNN++H + 
Subjt:  ----------SHKPLNTYPKENTGVTPWVSKNDG-------------PRVVLMFTRPTYMGGA----------------------WAIWNDRNNFVHNRP

Query:  IADVAIRCTWIQDYLIDYGKANVPCGPRV-CTRLEEGILF---QSRGSLIMNVDAAFDT-----------------------------------KAIALL
        +       +  + YL ++        P V C   +   +     +  +L MNVDAA D+                                   +A A+ 
Subjt:  IADVAIRCTWIQDYLIDYGKANVPCGPRV-CTRLEEGILF---QSRGSLIMNVDAAFDT-----------------------------------KAIALL

Query:  EGMALAINLNKSRMTTFFDSLSFVRTLQGEMSCDSSISTVKGDIDRLKMSFQRITFSHVRRQLNTIAHELAQLGVK-SRTHVWTSQFPEWISSL
         G+  A  L         D L  V  LQG+ S  SS   +  DI     SF     SHVRR  N  AH LA+  ++     +W  + P  I S+
Subjt:  EGMALAINLNKSRMTTFFDSLSFVRTLQGEMSCDSSISTVKGDIDRLKMSFQRITFSHVRRQLNTIAHELAQLGVK-SRTHVWTSQFPEWISSL

XP_030503504.1 uncharacterized protein LOC115718825 [Cannabis sativa]8.4e-1324.33Show/hide
Query:  VADFITPSLHWDVGKLDQFLGRKDVDEILTLPIS-GTTPDRWILHYDKRGEYTVKSGYKLG-PLGPTGSSIGALRGQNSPKSLRNFGSHKPLNTYPKENT
        VAD+ITP+  WDV +L       DVD IL +P+S     D +I HY   G YTV+SGY L   L     + G+    +  K  + F S +  +  P    
Subjt:  VADFITPSLHWDVGKLDQFLGRKDVDEILTLPIS-GTTPDRWILHYDKRGEYTVKSGYKLG-PLGPTGSSIGALRGQNSPKSLRNFGSHKPLNTYPKENT

Query:  GVTPWVSKNDGPRVVLMFTRPTYMGGAWAIWNDRNNFVHNRPIADVAIRCTWIQDYLIDYGKAN------------------VPCGPRVC----------
                     VV +  R   +   +++ N  N  VH++P    A        YL  + +A+                  +P  P +C          
Subjt:  GVTPWVSKNDGPRVVLMFTRPTYMGGAWAIWNDRNNFVHNRPIADVAIRCTWIQDYLIDYGKAN------------------VPCGPRVC----------

Query:  --TRLEEGILFQSRGSLIMNV----------DAAFDTKAIALLEGMALAINLNKSRMTTFFDSLSFVRTLQGEMSCDSSISTVKGDIDRLKMSFQRITFS
           +L  G + +    +++                + KA+      +L +NL    + T  DSL  V+ +    SC  +   +  D+  L  +F  +  S
Subjt:  --TRLEEGILFQSRGSLIMNV----------DAAFDTKAIALLEGMALAINLNKSRMTTFFDSLSFVRTLQGEMSCDSSISTVKGDIDRLKMSFQRITFS

Query:  HVRRQLNTIAHELA-QLGVKSRTHVWTSQFPEWISSL
        HVRR  N  AH LA Q        +W  + P  I S+
Subjt:  HVRRQLNTIAHELA-QLGVKSRTHVWTSQFPEWISSL

XP_040367476.1 uncharacterized protein LOC112178299 [Rosa chinensis]4.2e-1224.92Show/hide
Query:  IPKASIFPNGLNHLS----VADFITPSLHWDVGKLDQFLGRKDVDEILTLPIS-GTTPDRWILHYDKRGEYTVKSGYKLGPLGPTGSSIGALRGQNSPKS
        IP+ S F   + H      V+D + P   W++  ++Q+    DVD IL++P+S    PDR   HYDK+G ++ KS Y L        + G +    S + 
Subjt:  IPKASIFPNGLNHLS----VADFITPSLHWDVGKLDQFLGRKDVDEILTLPIS-GTTPDRWILHYDKRGEYTVKSGYKLGPLGPTGSSIGALRGQNSPKS

Query:  LRNFGSHKPLNTYPKENTGVTPW-VSKNDGPRVVLMFTRPTYMGGAWAIWNDRNNFVHNRPIADVAIRCTWIQDYLIDY----GKANVPCGPRVCTRLEE
        L +F  H      P +   V  W V  +  P   L+ T+   +       ND     H+  I  +   C ++QD L  +    G   + C       +  
Subjt:  LRNFGSHKPLNTYPKENTGVTPW-VSKNDGPRVVLMFTRPTYMGGAWAIWNDRNNFVHNRPIADVAIRCTWIQDYLIDY----GKANVPCGPRVCTRLEE

Query:  GILFQSRGSLIMNVDAAFD---TKAIALLEGMALAINLNKSRMTTFFDSLSFVRTLQGEMSCDSSISTVKGDIDRLKMSFQRITFSHVRRQLNTIAHELA
         ++    G + + V++       +A+A     +LA+  + S +    D  S V T++ E    SS+  +  DI+         +  HV R+ N  AH LA
Subjt:  GILFQSRGSLIMNVDAAFD---TKAIALLEGMALAINLNKSRMTTFFDSLSFVRTLQGEMSCDSSISTVKGDIDRLKMSFQRITFSHVRRQLNTIAHELA

Query:  QLGVKSRTHV-WTSQFP
        +L + S  ++ W+   P
Subjt:  QLGVKSRTHV-WTSQFP

TrEMBL top hitse value%identityAlignment
A0A6J1DX30 uncharacterized protein LOC1110248741.2e-2024.88Show/hide
Query:  FPNGLNHLSVADFITPSLHWDVGKLDQFLGRKDVDEILTLPISG-TTPDRWILHYDKRGEYTVKSGYKL----------GPLGPTGSSIGALRGQNSPKS
        F NG    +VA FIT   +WDV  +      +D D IL++PIS     D W+ HYDKRG Y+V+SGYKL                G+   ++     P  
Subjt:  FPNGLNHLSVADFITPSLHWDVGKLDQFLGRKDVDEILTLPISG-TTPDRWILHYDKRGEYTVKSGYKL----------GPLGPTGSSIGALRGQNSPKS

Query:  LRNF---GSHKPLNTYPK---ENTGVTP-------------------------W---------VSKNDGPRVVLMFT------RPTYMGGA----WAIWN
        ++ F    +H+ + T         G  P                         W         +S  D    + +++       P  +  A    W IWN
Subjt:  LRNF---GSHKPLNTYPK---ENTGVTP-------------------------W---------VSKNDGPRVVLMFT------RPTYMGGA----WAIWN

Query:  DRNNFVHNRPIADVAIRCTWIQDYLIDYGKANVP-CGPRVCTRLEEGILF---QSRGSLIMNVDAA----------------------------FDTKAI
        DRN+ +H + ++ V  +C W+  +L  + +A +    PR  +     + +    S  SL +N DAA                            F    +
Subjt:  DRNNFVHNRPIADVAIRCTWIQDYLIDYGKANVP-CGPRVCTRLEEGILF---QSRGSLIMNVDAA----------------------------FDTKAI

Query:  -----ALLEGMALAINLNKSRMTTFFDSLSFVRTLQGEMSCDSSISTVKGDIDRLKMSFQRITFSHVRRQLNTIAHELAQLGV--KSRTHVWTSQFPEWI
              +LEG+  A   N + +    DSL  ++ ++ E+           +I  L   F  I+FSH  RQ N  AH LA+ G+   S T+ W   FP W+
Subjt:  -----ALLEGMALAINLNKSRMTTFFDSLSFVRTLQGEMSCDSSISTVKGDIDRLKMSFQRITFSHVRRQLNTIAHELAQLGV--KSRTHVWTSQFPEWI

Query:  SSLAQPLSPS
          L Q   PS
Subjt:  SSLAQPLSPS

A0A803NG99 Uncharacterized protein4.1e-1324.33Show/hide
Query:  VADFITPSLHWDVGKLDQFLGRKDVDEILTLPIS-GTTPDRWILHYDKRGEYTVKSGYKLG-PLGPTGSSIGALRGQNSPKSLRNFGSHKPLNTYPKENT
        VAD+ITP+  WDV +L       DVD IL +P+S     D +I HY   G YTV+SGY L   L     + G+    +  K  + F S +  +  P    
Subjt:  VADFITPSLHWDVGKLDQFLGRKDVDEILTLPIS-GTTPDRWILHYDKRGEYTVKSGYKLG-PLGPTGSSIGALRGQNSPKSLRNFGSHKPLNTYPKENT

Query:  GVTPWVSKNDGPRVVLMFTRPTYMGGAWAIWNDRNNFVHNRPIADVAIRCTWIQDYLIDYGKAN------------------VPCGPRVC----------
                     VV +  R   +   +++ N  N  VH++P    A        YL  + +A+                  +P  P +C          
Subjt:  GVTPWVSKNDGPRVVLMFTRPTYMGGAWAIWNDRNNFVHNRPIADVAIRCTWIQDYLIDYGKAN------------------VPCGPRVC----------

Query:  --TRLEEGILFQSRGSLIMNV----------DAAFDTKAIALLEGMALAINLNKSRMTTFFDSLSFVRTLQGEMSCDSSISTVKGDIDRLKMSFQRITFS
           +L  G + +    +++                + KA+      +L +NL    + T  DSL  V+ +    SC  +   +  D+  L  +F  +  S
Subjt:  --TRLEEGILFQSRGSLIMNV----------DAAFDTKAIALLEGMALAINLNKSRMTTFFDSLSFVRTLQGEMSCDSSISTVKGDIDRLKMSFQRITFS

Query:  HVRRQLNTIAHELA-QLGVKSRTHVWTSQFPEWISSL
        HVRR  N  AH LA Q        +W  + P  I S+
Subjt:  HVRRQLNTIAHELA-QLGVKSRTHVWTSQFPEWISSL

A0A803PIB6 Uncharacterized protein2.7e-1725.38Show/hide
Query:  VADFITPSLHWDVGKLDQFLGRKDVDEILTLPIS-GTTPDRWILHYDKRGEYTVKSGYKLG----PLGPTGSSIG------ALRGQNSPKSLRNFG----
        VAD+IT +  WD+  L       D+D ILT+P+S  +T DRW  HYD  G+YTVKSGY L         + SS           G N P  +R FG    
Subjt:  VADFITPSLHWDVGKLDQFLGRKDVDEILTLPIS-GTTPDRWILHYDKRGEYTVKSGYKLG----PLGPTGSSIG------ALRGQNSPKSLRNFG----

Query:  ----------SHKPLNTYPKENTGVTPWVSKNDG-------------PRVVLMFTRPTYMGGA----------------------WAIWNDRNNFVHNRP
                   H+ + T    +     W S                     L FT+ ++M                         W IW+DRNN++H + 
Subjt:  ----------SHKPLNTYPKENTGVTPWVSKNDG-------------PRVVLMFTRPTYMGGA----------------------WAIWNDRNNFVHNRP

Query:  IADVAIRCTWIQDYLIDYGKANVPCGPRV-CTRLEEGILF---QSRGSLIMNVDAAFDT-----------------------------------KAIALL
        +       +  + YL ++        P V C   +   +     +  +L MNVDAA D+                                   +A A+ 
Subjt:  IADVAIRCTWIQDYLIDYGKANVPCGPRV-CTRLEEGILF---QSRGSLIMNVDAAFDT-----------------------------------KAIALL

Query:  EGMALAINLNKSRMTTFFDSLSFVRTLQGEMSCDSSISTVKGDIDRLKMSFQRITFSHVRRQLNTIAHELAQLGVK-SRTHVWTSQFPEWISSL
         G+  A  L         D L  V  LQG+ S  SS   +  DI     SF     SHVRR  N  AH LA+  ++     +W  + P  I S+
Subjt:  EGMALAINLNKSRMTTFFDSLSFVRTLQGEMSCDSSISTVKGDIDRLKMSFQRITFSHVRRQLNTIAHELAQLGVK-SRTHVWTSQFPEWISSL

A0A803PKJ2 Uncharacterized protein1.5e-1224.87Show/hide
Query:  VADFITPSLHWDVGKLDQFLGRKDVDEILTLPIS-GTTPDRWILHYDKRGEYTVKSGYKLG---PLGPTGSSIGALR-------GQNSPKSLRNFG----
        VA +IT +  W+   L +     DV++ILT+P+S  +  D WI HYD  GEYTVKSGY L          SS G+         G   P  +R FG    
Subjt:  VADFITPSLHWDVGKLDQFLGRKDVDEILTLPIS-GTTPDRWILHYDKRGEYTVKSGYKLG---PLGPTGSSIGALR-------GQNSPKSLRNFG----

Query:  -SHKPLNT---YPKENTGVT------PWVSKN-------------DGPRVVLMFTRPTYMGGA----------------------WAIWNDRNNFVHNRP
         S  P+ T   + K  T  T       W S                     L F + +YM                         W IW+DRNNF+H + 
Subjt:  -SHKPLNT---YPKENTGVT------PWVSKN-------------DGPRVVLMFTRPTYMGGA----------------------WAIWNDRNNFVHNRP

Query:  IADVAIRCTWIQDYLIDYGKANVPCGP-RVCTRLEEGILF---QSRGSLIMNVDAAFDT-----------------------------------KAIALL
        +       +  + YL ++    +   P   C   +   +         L MNVDAA D+                                   +A A+ 
Subjt:  IADVAIRCTWIQDYLIDYGKANVPCGP-RVCTRLEEGILF---QSRGSLIMNVDAAFDT-----------------------------------KAIALL

Query:  EGMALAINLNKSRMTTFFDSLSFVRTLQGEMSCDSSISTVKGDIDRLKMSFQRITFSHVRRQLNTIAHELAQLGVK-SRTHVWTSQFPEWISSL
         G+  A +L         D +  V  + G  S  SS   +  DI     S      SHVRR  N  AH+LA+  ++     +W  + P  I S+
Subjt:  EGMALAINLNKSRMTTFFDSLSFVRTLQGEMSCDSSISTVKGDIDRLKMSFQRITFSHVRRQLNTIAHELAQLGVK-SRTHVWTSQFPEWISSL

A0A803QGT2 Uncharacterized protein5.9e-1222.86Show/hide
Query:  NHLSVADFITPSLHWDVGKLDQFLGRKDVDEILTLPISG-TTPDRWILHYDKRGEYTVKSGYKL-GPLGPTGSSIGALRGQ---------NSPKSLRNFG
        NH  VAD+IT +  W++  L       DVD IL +P+S     DRWI HY+  G+Y+V SGY L   LG    S  +   +         N P  ++ FG
Subjt:  NHLSVADFITPSLHWDVGKLDQFLGRKDVDEILTLPISG-TTPDRWILHYDKRGEYTVKSGYKL-GPLGPTGSSIGALRGQ---------NSPKSLRNFG

Query:  --------------SHKPLNTYPKENTGVTPWVS---------------KNDGPRV------------VLMFTRPTYMGGA--------WAIWNDRNNFV
                       H+ + T    +   + W S               K  G  +             LM     Y   A        W IW+DRNNF+
Subjt:  --------------SHKPLNTYPKENTGVTPWVS---------------KNDGPRV------------VLMFTRPTYMGGA--------WAIWNDRNNFV

Query:  HNRPIADVAIRCTWIQDYLIDYGKANVPCGPRVCTRLEEGILFQ----SRGSLIMNVDAAFDTKAIAL-----------------------------LEG
        H + +       T    Y+  Y        P    R  +  +         +  +NVDAA D+    +                             +E 
Subjt:  HNRPIADVAIRCTWIQDYLIDYGKANVPCGPRVCTRLEEGILFQ----SRGSLIMNVDAAFDTKAIAL-----------------------------LEG

Query:  MALAINLNKSRMTTF------FDSLSFVRTLQGEMSCDSSISTVKGDIDRLKMSFQRITFSHVRRQLNTIAHELAQLGVK-SRTHVWTSQFPEWISSL
         A+++ L+ ++           D L  V  L G++S  S    +  D+     +F     SH+RR  N  AH LA+  ++     +W    P  I S+
Subjt:  MALAINLNKSRMTTF------FDSLSFVRTLQGEMSCDSSISTVKGDIDRLKMSFQRITFSHVRRQLNTIAHELAQLGVK-SRTHVWTSQFPEWISSL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTCCCAAAGCTTCCATTTTTCCAAATGGGTTAAATCATTTATCAGTCGCTGACTTCATTACTCCTTCTCTCCATTGGGATGTCGGAAAGCTAGACCAATTCCTGGG
AAGGAAGGATGTGGATGAAATTTTGACATTACCTATTAGTGGAACGACGCCTGATCGTTGGATTTTGCACTATGATAAGAGGGGAGAGTACACGGTCAAGAGCGGTTATA
AGTTGGGTCCATTAGGTCCCACCGGTAGCTCCATTGGGGCGTTGAGAGGCCAAAATTCTCCAAAATCCCTTAGAAATTTTGGTTCCCACAAGCCGCTCAACACTTATCCT
AAGGAGAATACCGGTGTAACTCCGTGGGTGTCCAAAAACGACGGACCACGTGTTGTTCTTATGTTCACGAGACCAACGTATATGGGAGGTGCTTGGGCCATCTGGAATGA
CCGCAACAATTTCGTCCATAATCGCCCTATTGCGGACGTTGCTATCAGATGTACTTGGATCCAAGATTACTTGATTGATTATGGTAAGGCCAATGTTCCATGTGGTCCTA
GAGTTTGTACACGATTGGAGGAAGGCATCTTGTTCCAATCCAGGGGAAGTCTTATTATGAACGTCGATGCAGCGTTCGACACAAAGGCTATTGCATTATTGGAAGGTATG
GCGTTGGCCATAAACTTGAATAAGTCAAGAATGACGACCTTTTTCGACTCATTATCTTTTGTTCGTACTCTTCAAGGCGAAATGTCGTGTGATTCTAGTATATCTACTGT
AAAAGGGGATATCGACCGTCTTAAGATGTCCTTTCAGAGGATCACTTTCTCCCATGTCAGACGGCAGCTGAATACAATAGCGCATGAATTAGCTCAACTAGGAGTTAAGT
CTAGGACACATGTGTGGACCTCTCAGTTTCCTGAGTGGATTTCATCTTTAGCTCAGCCCCTCTCTCCCTCTGTAACAGACCGTCGCCTTCCTTCGCACCTCCACTGTCGT
TTCCCCGCGCCGCCTTCGTTAACCTCCCACGCCGCCGCCGGTGAGTCTCTCTCTCTCTCTCCCTCTTATTTTATTCTCCTTCGGTCTCACGCTCTCTCTCTCTCTCTCTT
TTCCGTAGATCCCGCTATTGCTCGCCTTCGCCCCCTCTGCCTTCCACCGTCACCCGGAACGACACGTCGCCGTCGAGCCCAAGTTGCCGCCATCGATAGTTTTCATTCGA
CCTCTCTTAGATCTGTGCGCTGCCACAACCCCTCCGTTCATGTTATAGTCGGTGTTGTGAGGCGTTGTTGTCAGCCCCGTGCATCTCTCGCTAGTGTTAACGCAGCCGCT
GCCCAAAACTCTAGTGTAGAGAAGGATTGGGATTTTTGGTGGCTAGGAAAACTTGTGAGCCAAGCTAACTGTTGTGATACTGAATTATTAAGAGTCGGGCTCACTATTGA
TCAACTAGAACAGGAGTTCTACTCCAACATTGATGAAAATGAAGGATTCTTAGTTATTGTTCGTAGAGTTGCTCTCGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGATTCCCAAAGCTTCCATTTTTCCAAATGGGTTAAATCATTTATCAGTCGCTGACTTCATTACTCCTTCTCTCCATTGGGATGTCGGAAAGCTAGACCAATTCCTGGG
AAGGAAGGATGTGGATGAAATTTTGACATTACCTATTAGTGGAACGACGCCTGATCGTTGGATTTTGCACTATGATAAGAGGGGAGAGTACACGGTCAAGAGCGGTTATA
AGTTGGGTCCATTAGGTCCCACCGGTAGCTCCATTGGGGCGTTGAGAGGCCAAAATTCTCCAAAATCCCTTAGAAATTTTGGTTCCCACAAGCCGCTCAACACTTATCCT
AAGGAGAATACCGGTGTAACTCCGTGGGTGTCCAAAAACGACGGACCACGTGTTGTTCTTATGTTCACGAGACCAACGTATATGGGAGGTGCTTGGGCCATCTGGAATGA
CCGCAACAATTTCGTCCATAATCGCCCTATTGCGGACGTTGCTATCAGATGTACTTGGATCCAAGATTACTTGATTGATTATGGTAAGGCCAATGTTCCATGTGGTCCTA
GAGTTTGTACACGATTGGAGGAAGGCATCTTGTTCCAATCCAGGGGAAGTCTTATTATGAACGTCGATGCAGCGTTCGACACAAAGGCTATTGCATTATTGGAAGGTATG
GCGTTGGCCATAAACTTGAATAAGTCAAGAATGACGACCTTTTTCGACTCATTATCTTTTGTTCGTACTCTTCAAGGCGAAATGTCGTGTGATTCTAGTATATCTACTGT
AAAAGGGGATATCGACCGTCTTAAGATGTCCTTTCAGAGGATCACTTTCTCCCATGTCAGACGGCAGCTGAATACAATAGCGCATGAATTAGCTCAACTAGGAGTTAAGT
CTAGGACACATGTGTGGACCTCTCAGTTTCCTGAGTGGATTTCATCTTTAGCTCAGCCCCTCTCTCCCTCTGTAACAGACCGTCGCCTTCCTTCGCACCTCCACTGTCGT
TTCCCCGCGCCGCCTTCGTTAACCTCCCACGCCGCCGCCGGTGAGTCTCTCTCTCTCTCTCCCTCTTATTTTATTCTCCTTCGGTCTCACGCTCTCTCTCTCTCTCTCTT
TTCCGTAGATCCCGCTATTGCTCGCCTTCGCCCCCTCTGCCTTCCACCGTCACCCGGAACGACACGTCGCCGTCGAGCCCAAGTTGCCGCCATCGATAGTTTTCATTCGA
CCTCTCTTAGATCTGTGCGCTGCCACAACCCCTCCGTTCATGTTATAGTCGGTGTTGTGAGGCGTTGTTGTCAGCCCCGTGCATCTCTCGCTAGTGTTAACGCAGCCGCT
GCCCAAAACTCTAGTGTAGAGAAGGATTGGGATTTTTGGTGGCTAGGAAAACTTGTGAGCCAAGCTAACTGTTGTGATACTGAATTATTAAGAGTCGGGCTCACTATTGA
TCAACTAGAACAGGAGTTCTACTCCAACATTGATGAAAATGAAGGATTCTTAGTTATTGTTCGTAGAGTTGCTCTCGACTGA
Protein sequenceShow/hide protein sequence
MIPKASIFPNGLNHLSVADFITPSLHWDVGKLDQFLGRKDVDEILTLPISGTTPDRWILHYDKRGEYTVKSGYKLGPLGPTGSSIGALRGQNSPKSLRNFGSHKPLNTYP
KENTGVTPWVSKNDGPRVVLMFTRPTYMGGAWAIWNDRNNFVHNRPIADVAIRCTWIQDYLIDYGKANVPCGPRVCTRLEEGILFQSRGSLIMNVDAAFDTKAIALLEGM
ALAINLNKSRMTTFFDSLSFVRTLQGEMSCDSSISTVKGDIDRLKMSFQRITFSHVRRQLNTIAHELAQLGVKSRTHVWTSQFPEWISSLAQPLSPSVTDRRLPSHLHCR
FPAPPSLTSHAAAGESLSLSPSYFILLRSHALSLSLFSVDPAIARLRPLCLPPSPGTTRRRRAQVAAIDSFHSTSLRSVRCHNPSVHVIVGVVRRCCQPRASLASVNAAA
AQNSSVEKDWDFWWLGKLVSQANCCDTELLRVGLTIDQLEQEFYSNIDENEGFLVIVRRVALD