; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0042156 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0042156
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRibonuclease H-like superfamily protein
Genome locationchr13:37548193..37549150
RNA-Seq ExpressionLag0042156
SyntenyLag0042156
Gene Ontology termsNA
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]1.5e-2929.55Show/hide
Query:  SASNSHMGRIWKNVWKLKIPSKIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQEVWNLTFQNV-FLEVDFCGSFVDRWIK
        SAS ++ G  W ++WKL +P+KIK+F W++    +P   NL  RG+     C IC  + E+  H  F C RA+++W   F  +  L  +   SF++ W  
Subjt:  SASNSHMGRIWKNVWKLKIPSKIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQEVWNLTFQNV-FLEVDFCGSFVDRWIK

Query:  LDSPLTLEELELVVVACWAIWSDQNKLVHGDSISPASAKSNWIRSYLDSFRKANPRRKTNQLAHRIRASDENRPSRWLPPPSDYWKLNVDAAC-------
        L   L  ++L L  +  W IW+D+N L+HG  +SP   K  W+  +LDS  +A    + +  + R +++       W P  S   KLN DAAC       
Subjt:  LDSPLTLEELELVVVACWAIWSDQNKLVHGDSISPASAKSNWIRSYLDSFRKANPRRKTNQLAHRIRASDENRPSRWLPPPSDYWKLNVDAAC-------

Query:  ---LQYASLSRLRVESI--------------------------------VELNSLLAINFINKNYVVWEDMEADVARVWELTSQFLDIDFSFIPRYCNEL
           ++ +S S +   SI                                VE +SLLAI  I        D +  V  +  LT  F  I FS   R CN  
Subjt:  ---LQYASLSRLRVESI--------------------------------VELNSLLAINFINKNYVVWEDMEADVARVWELTSQFLDIDFSFIPRYCNEL

Query:  ADLGGVWG
        A     WG
Subjt:  ADLGGVWG

XP_030502765.1 uncharacterized protein LOC115717936 [Cannabis sativa]2.6e-2434.85Show/hide
Query:  SASNSHMGRIWKNVWKLKIPSKIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQEVWNLTFQNVFLEVDFCGSFVDRWIKL
        +AS+S M   W   WK+KIP K+++F WK    +LP    L  R +  S  C IC+S  E+  H LF+C RA+ VW+L+  N+        +  D  + L
Subjt:  SASNSHMGRIWKNVWKLKIPSKIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQEVWNLTFQNVFLEVDFCGSFVDRWIKL

Query:  DSPLTLEELELVVVACWAIWSDQNKLVHGDSISPASAKSNWIRSYLDSFRKANPRRKTNQLAHRIRASDENRPS-------RWLPPPSDYWKLNVDAA
         + LT  E E  +V CW  W ++N + HG+S+    A + +  SYL  F+ A  R K +Q      A+ + RPS       +W  PP    KLN +AA
Subjt:  DSPLTLEELELVVVACWAIWSDQNKLVHGDSISPASAKSNWIRSYLDSFRKANPRRKTNQLAHRIRASDENRPS-------RWLPPPSDYWKLNVDAA

XP_030508852.1 uncharacterized protein LOC115723496 [Cannabis sativa]1.2e-2636.32Show/hide
Query:  DGISASNSHMGRIWKNVWKLKIPSKIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQEVWNLTFQNVFLEVDFCGSFVDRW
        D  + SNS +   W N WKLK+P K+++F WK    SLP    L  R +  S  C IC+S  E  +H LF+C RA+ VW L+  ++  +     S  D  
Subjt:  DGISASNSHMGRIWKNVWKLKIPSKIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQEVWNLTFQNVFLEVDFCGSFVDRW

Query:  IKLDSPLTLEELELVVVACWAIWSDQNKLVHGDSISPASAKSNWIRSYLDSFRKANPRRKTNQLAHRIRASDENRPS-------RWLPPPSDYWKLNVDA
        + L + L+  ELEL +V CW+IW ++N + HG+S+   +A + +  SYL  F++A  R K  +      A+  +RPS       +W  PP    KLN DA
Subjt:  IKLDSPLTLEELELVVVACWAIWSDQNKLVHGDSISPASAKSNWIRSYLDSFRKANPRRKTNQLAHRIRASDENRPS-------RWLPPPSDYWKLNVDA

Query:  A
        A
Subjt:  A

XP_030509188.1 uncharacterized protein LOC115723863 [Cannabis sativa]1.5e-2433.84Show/hide
Query:  SASNSHMGRIWKNVWKLKIPSKIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQEVWNLTFQNVFLEVDFCGSFVDRWIKL
        +AS+  M   W   WK+K+P K+++F WK    +LP    L  R +  S  C IC+S  E+ NH LF C RA+ VW L+  ++        +  D  + L
Subjt:  SASNSHMGRIWKNVWKLKIPSKIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQEVWNLTFQNVFLEVDFCGSFVDRWIKL

Query:  DSPLTLEELELVVVACWAIWSDQNKLVHGDSISPASAKSNWIRSYLDSFRKANPRRKTNQLAHRIRASDENRPS-------RWLPPPSDYWKLNVDAA
         + L+  E EL +V CW  W ++N + HG+++  + A +++  SYL  F+  N R K +Q      A+ + RPS       +W  PP    KLN DAA
Subjt:  DSPLTLEELELVVVACWAIWSDQNKLVHGDSISPASAKSNWIRSYLDSFRKANPRRKTNQLAHRIRASDENRPS-------RWLPPPSDYWKLNVDAA

XP_030969753.1 uncharacterized protein LOC115990032 [Quercus lobata]3.3e-2427.82Show/hide
Query:  SASNSHMGRIWKNVWKLKIPSKIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQEVWNLTFQNVFLEVDFCGSFVDR--WI
        S++++ M   WK +W+L +P+KI+ F W+A +  LP  +NL  R +   N   +C S  E T H L+ C+ A+EVW     N    +D C  F+D   + 
Subjt:  SASNSHMGRIWKNVWKLKIPSKIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQEVWNLTFQNVFLEVDFCGSFVDR--WI

Query:  KLDSPLTLEELELVVVACWAIWSDQNKLVHGDSISPASAKSNWIRSYLDSFRKANPRRKTNQLAHRIRASDENRPSRWLPPPSDYWKLNVDAACLQYASL
        +      +E++ L+V   W IW+++N++ HG S  PAS    W +  L++F  AN         +  R   ++  +RW PP   ++K N+D A   +   
Subjt:  KLDSPLTLEELELVVVACWAIWSDQNKLVHGDSISPASAKSNWIRSYLDSFRKANPRRKTNQLAHRIRASDENRPSRWLPPPSDYWKLNVDAACLQYASL

Query:  SRLRVESIVELNSLLAINFINKNY--------VVWEDMEADVARVWEL
            +E ++  +    +  ++K          +V + ME  V   WE+
Subjt:  SRLRVESIVELNSLLAINFINKNY--------VVWEDMEADVARVWEL

TrEMBL top hitse value%identityAlignment
A0A6J1DX30 uncharacterized protein LOC1110248747.5e-3029.55Show/hide
Query:  SASNSHMGRIWKNVWKLKIPSKIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQEVWNLTFQNV-FLEVDFCGSFVDRWIK
        SAS ++ G  W ++WKL +P+KIK+F W++    +P   NL  RG+     C IC  + E+  H  F C RA+++W   F  +  L  +   SF++ W  
Subjt:  SASNSHMGRIWKNVWKLKIPSKIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQEVWNLTFQNV-FLEVDFCGSFVDRWIK

Query:  LDSPLTLEELELVVVACWAIWSDQNKLVHGDSISPASAKSNWIRSYLDSFRKANPRRKTNQLAHRIRASDENRPSRWLPPPSDYWKLNVDAAC-------
        L   L  ++L L  +  W IW+D+N L+HG  +SP   K  W+  +LDS  +A    + +  + R +++       W P  S   KLN DAAC       
Subjt:  LDSPLTLEELELVVVACWAIWSDQNKLVHGDSISPASAKSNWIRSYLDSFRKANPRRKTNQLAHRIRASDENRPSRWLPPPSDYWKLNVDAAC-------

Query:  ---LQYASLSRLRVESI--------------------------------VELNSLLAINFINKNYVVWEDMEADVARVWELTSQFLDIDFSFIPRYCNEL
           ++ +S S +   SI                                VE +SLLAI  I        D +  V  +  LT  F  I FS   R CN  
Subjt:  ---LQYASLSRLRVESI--------------------------------VELNSLLAINFINKNYVVWEDMEADVARVWELTSQFLDIDFSFIPRYCNEL

Query:  ADLGGVWG
        A     WG
Subjt:  ADLGGVWG

A0A803P8W5 Uncharacterized protein2.1e-2432.02Show/hide
Query:  SASNSHMGRIWKNVWKLKIPSKIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQEVW-NLTFQNVFLEVDFCGSFVDRWIK
        S+++S  G  WK +W LK+P K+K F W+ +  +LP  +NL +R V  SN+C +C+   E  +H LF C RA  VW  L F      +     F + +  
Subjt:  SASNSHMGRIWKNVWKLKIPSKIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQEVW-NLTFQNVFLEVDFCGSFVDRWIK

Query:  LDSPLTLEELELVVVACWAIWSDQNKLVHGDSISPASAKSNWIRSYLDSFRKAN-----------PRRKTNQLAHRIRASDENRPSRWLPPPSDYWKLNV
        L +  +  ELE ++   W+IW+++NK +HG    PAS   ++  SY+  F  ++           P+         +R + E+   +W PP + ++KLNV
Subjt:  LDSPLTLEELELVVVACWAIWSDQNKLVHGDSISPASAKSNWIRSYLDSFRKAN-----------PRRKTNQLAHRIRASDENRPSRWLPPPSDYWKLNV

Query:  DAA
        DAA
Subjt:  DAA

A0A803Q2K8 Uncharacterized protein7.2e-2533.84Show/hide
Query:  SASNSHMGRIWKNVWKLKIPSKIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQEVWNLTFQNVFLEVDFCGSFVDRWIKL
        +AS+  M   W   WK+K+P K+++F WK    +LP    L  R +  S  C IC+S  E+ NH LF C RA+ VW L+  ++        +  D  + L
Subjt:  SASNSHMGRIWKNVWKLKIPSKIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQEVWNLTFQNVFLEVDFCGSFVDRWIKL

Query:  DSPLTLEELELVVVACWAIWSDQNKLVHGDSISPASAKSNWIRSYLDSFRKANPRRKTNQLAHRIRASDENRPS-------RWLPPPSDYWKLNVDAA
         + L+  E EL +V CW  W ++N + HG+++  + A +++  SYL  F+  N R K +Q      A+ + RPS       +W  PP    KLN DAA
Subjt:  DSPLTLEELELVVVACWAIWSDQNKLVHGDSISPASAKSNWIRSYLDSFRKANPRRKTNQLAHRIRASDENRPS-------RWLPPPSDYWKLNVDAA

A0A803Q6Z2 Uncharacterized protein4.0e-2332.72Show/hide
Query:  WKNVWKLKIPSKIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQEVWNLTFQNVFLEVDFCGSFVDRWIKLDSPLTLEELE
        W+  W LK+PSKI++F W+A   +LP    LQ R +  S  CP+C   +E  NH  F CNRA++VW    +++   +    SF D  + + S L  E++E
Subjt:  WKNVWKLKIPSKIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQEVWNLTFQNVFLEVDFCGSFVDRWIKLDSPLTLEELE

Query:  LVVVACWAIWSDQNKLVHGDSISPASAKSNWIRSYLDSFRKANPRRKTN----------------------QLAHRIRASDENRPS--------------
        L +   W IW+ +N   H  S   A     +  SYL  FRKA  + KTN                      QL+H + ASD    +              
Subjt:  LVVVACWAIWSDQNKLVHGDSISPASAKSNWIRSYLDSFRKANPRRKTN----------------------QLAHRIRASDENRPS--------------

Query:  RWLPPPSDYWKLNVDAA
        +WL PPS   K+N DAA
Subjt:  RWLPPPSDYWKLNVDAA

A0A803QH76 Uncharacterized protein4.0e-2330.85Show/hide
Query:  ASNSHMGRIWKNVWKLKIPSKIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQEVWNLTFQNVFLEVDFCGSFVDRWIKLD
        +S+S   + W  +W L++P K+K+F W+ +  +LP  +NL +R +  S  C +C    E+  H LF C+RA+ VW+    NVF+         D +  L 
Subjt:  ASNSHMGRIWKNVWKLKIPSKIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQEVWNLTFQNVFLEVDFCGSFVDRWIKLD

Query:  SPLTLEELELVVVACWAIWSDQNKLVHGDSISPASAKSNWIRSYLDSFRKANPRRK--------TNQLAHRIRASDENRPSRWLPPPSDYWKLNVDAACL
        +     +LE++    W IWS++NK +HG    PA    ++  +YL  ++ A  ++K        +   +  + A ++ RP +W PP    +KLNVDAAC 
Subjt:  SPLTLEELELVVVACWAIWSDQNKLVHGDSISPASAKSNWIRSYLDSFRKANPRRK--------TNQLAHRIRASDENRPSRWLPPPSDYWKLNVDAACL

Query:  Q
        +
Subjt:  Q

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10000.1 Ribonuclease H-like superfamily protein1.4e-0723.76Show/hide
Query:  KIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQEVWNLT-----FQNVFLEVDFCGSFVDRWIKLDSPLTLEELELVVVAC
        KIKLF WKA   +LP    L  R +  +  C  C +  E + H LF C+ A +VWNL         +   +    + + + I L  P+ +    L    C
Subjt:  KIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQEVWNLT-----FQNVFLEVDFCGSFVDRWIKLDSPLTLEELELVVVAC

Query:  WAIWSDQNKLVHGDS-----------ISPASAKSNWIRSYLDSFRKANPR-RKTNQLAHRI--------RASDENRPSRWLPPPSDYWKLNV---DAACL
        W IW  +N+L+  +S           +  A A  +  +  L   R A PR   T  L H          +       S W+   + + +  +    A C 
Subjt:  WAIWSDQNKLVHGDS-----------ISPASAKSNWIRSYLDSFRKANPR-RKTNQLAHRI--------RASDENRPSRWLPPPSDYWKLNV---DAACL

Query:  QYAS----------------LSRLRVESIVELNSLLAINFINKNYVVWEDMEADVARVWELTSQFLDIDFSFIPRYCNELAD
        ++ S                L   R + +V  +S   ++ +N N V   ++   +  +  + ++F  I F FIPR  N +AD
Subjt:  QYAS----------------LSRLRVESIVELNSLLAINFINKNYVVWEDMEADVARVWELTSQFLDIDFSFIPRYCNELAD

AT2G02650.1 Ribonuclease H-like superfamily protein2.0e-1124.74Show/hide
Query:  VWKLKIPSKIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQEVW---NLTFQNVFLEVDFCGSFVDRWIKLDSPLTLEELE
        +WKL +  KIK F W+ +  +L  N  L++R +    +C  C  + E  +H +F C   Q VW   N+   N +         ++R I+L    T   L+
Subjt:  VWKLKIPSKIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQEVW---NLTFQNVFLEVDFCGSFVDRWIKLDSPLTLEELE

Query:  --LVVVACWAIWSDQNKLVHGDSI-SPASAKSNWIRSYLDSFRKANPRRKTN-QLAHRIRASDENRPSRWLPPPSDYWKLNVDAACLQYASLSR
          L     W +W  +N  +      SP       I+   +          TN  +A     +     S+W PPP  + K N D+   Q +  +R
Subjt:  --LVVVACWAIWSDQNKLVHGDSI-SPASAKSNWIRSYLDSFRKANPRRKTN-QLAHRIRASDENRPSRWLPPPSDYWKLNVDAACLQYASLSR

AT3G25270.1 Ribonuclease H-like superfamily protein7.7e-1127.72Show/hide
Query:  VWKLKIPSKIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQEVWNLT-FQNVFLEVDFCGSFVDRWIKLDSPLTLEELELV
        +WKLK   KIK F WK L  +L    NL+ R +R    C  C  + E + H  F C  AQ+VW  +   +  L            + L S L   + +L 
Subjt:  VWKLKIPSKIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQEVWNLT-FQNVFLEVDFCGSFVDRWIKLDSPLTLEELELV

Query:  VVA---CWAIWSDQNKLVHGDSISPASAKSNWIRSYLDSFRKANPR-RKTNQLAHRIRASDENRP-SRWLPPPSDYWKLNVDAA
         +A    W +W  +N+LV               R+ +  +   N   +  NQ  H  R        ++W  PPS + K N D A
Subjt:  VVA---CWAIWSDQNKLVHGDSISPASAKSNWIRSYLDSFRKANPR-RKTNQLAHRIRASDENRP-SRWLPPPSDYWKLNVDAA

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.6e-0640.98Show/hide
Query:  NVWKLKIPSKIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQ
        ++W LKI  KIKL  WKAL  +LP    L +R + +   C  C    E   H LF C  AQ
Subjt:  NVWKLKIPSKIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQ

AT4G29090.1 Ribonuclease H-like superfamily protein2.6e-1424.18Show/hide
Query:  SNSHMGRIWKNVWKLKIPSKIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQEVWNLTFQNVFLEVDFCGS-FVDRW----
        S   +  I++ +WK +   KI+ F WK L  SLP    L  R +   + C  C S  E  NH LF C  A+  W ++   + L  ++  S +V+ +    
Subjt:  SNSHMGRIWKNVWKLKIPSKIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQEVWNLTFQNVFLEVDFCGS-FVDRW----

Query:  IKLDSPLTLEELELVVVACWAIWSDQNKLVHGDSISPASAKSNWIRSYLDSFRKANPRRKTNQLAHRIRASDENRPSRWLPPPSDYWKLNVDAA------
        +   +P   +  +LV    W +W ++N+LV       A          L+ +R               R+S      RW PPP  + K N DA       
Subjt:  IKLDSPLTLEELELVVVACWAIWSDQNKLVHGDSISPASAKSNWIRSYLDSFRKANPRRKTNQLAHRIRASDENRPSRWLPPPSDYWKLNVDAA------

Query:  -----------------------------------CLQYA--SLSRLRVESIV-ELNSLLAINFINKNYVVWEDMEADVARVWELTSQFLDIDFSFIPRY
                                            +++A  SLSR +   ++ E +S + I  +N N  +W  ++  +  +  L SQF ++ F FIPR 
Subjt:  -----------------------------------CLQYA--SLSRLRVESIV-ELNSLLAINFINKNYVVWEDMEADVARVWELTSQFLDIDFSFIPRY

Query:  CNELAD
         N LA+
Subjt:  CNELAD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGGGATCTCTGCTAGCAACTCTCATATGGGTAGGATTTGGAAAAATGTGTGGAAACTTAAAATCCCCTCTAAGATCAAGCTATTTTGTTGGAAAGCCTTAAAAAG
ATCTCTCCCAAATAATCTTAATCTCCAAAATAGAGGTGTTCGACTATCAAATTTATGTCCTATTTGTGATTCCCAAATTGAAAATACAAATCACTGCCTTTTCACATGTA
ACAGAGCACAAGAAGTATGGAATCTAACCTTTCAAAATGTTTTTCTGGAGGTGGATTTTTGCGGGAGTTTTGTTGATCGATGGATTAAACTGGATTCACCTCTAACCTTG
GAAGAGCTCGAGTTGGTGGTTGTGGCTTGCTGGGCGATTTGGAGCGACCAAAATAAGCTAGTCCATGGCGATAGCATCTCCCCTGCATCTGCCAAAAGCAATTGGATCAG
AAGCTATCTCGATTCTTTCAGGAAAGCGAACCCTAGACGGAAGACTAACCAATTGGCCCATCGTATTCGCGCTTCTGATGAGAATCGACCTTCCCGCTGGTTGCCGCCTC
CTTCGGATTATTGGAAGCTTAATGTTGATGCTGCATGTCTTCAATACGCTTCTTTGTCGAGGCTGAGAGTGGAATCTATAGTGGAATTAAATAGTTTGCTGGCGATCAAT
TTCATTAATAAGAATTATGTGGTTTGGGAGGACATGGAAGCTGATGTGGCCAGAGTGTGGGAGTTGACTTCTCAATTTTTGGACATTGACTTCTCCTTTATTCCAAGATA
TTGTAACGAGTTAGCAGACTTAGGGGGTGTTTGGGGCACTGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGATGGGATCTCTGCTAGCAACTCTCATATGGGTAGGATTTGGAAAAATGTGTGGAAACTTAAAATCCCCTCTAAGATCAAGCTATTTTGTTGGAAAGCCTTAAAAAG
ATCTCTCCCAAATAATCTTAATCTCCAAAATAGAGGTGTTCGACTATCAAATTTATGTCCTATTTGTGATTCCCAAATTGAAAATACAAATCACTGCCTTTTCACATGTA
ACAGAGCACAAGAAGTATGGAATCTAACCTTTCAAAATGTTTTTCTGGAGGTGGATTTTTGCGGGAGTTTTGTTGATCGATGGATTAAACTGGATTCACCTCTAACCTTG
GAAGAGCTCGAGTTGGTGGTTGTGGCTTGCTGGGCGATTTGGAGCGACCAAAATAAGCTAGTCCATGGCGATAGCATCTCCCCTGCATCTGCCAAAAGCAATTGGATCAG
AAGCTATCTCGATTCTTTCAGGAAAGCGAACCCTAGACGGAAGACTAACCAATTGGCCCATCGTATTCGCGCTTCTGATGAGAATCGACCTTCCCGCTGGTTGCCGCCTC
CTTCGGATTATTGGAAGCTTAATGTTGATGCTGCATGTCTTCAATACGCTTCTTTGTCGAGGCTGAGAGTGGAATCTATAGTGGAATTAAATAGTTTGCTGGCGATCAAT
TTCATTAATAAGAATTATGTGGTTTGGGAGGACATGGAAGCTGATGTGGCCAGAGTGTGGGAGTTGACTTCTCAATTTTTGGACATTGACTTCTCCTTTATTCCAAGATA
TTGTAACGAGTTAGCAGACTTAGGGGGTGTTTGGGGCACTGAATAG
Protein sequenceShow/hide protein sequence
MDGISASNSHMGRIWKNVWKLKIPSKIKLFCWKALKRSLPNNLNLQNRGVRLSNLCPICDSQIENTNHCLFTCNRAQEVWNLTFQNVFLEVDFCGSFVDRWIKLDSPLTL
EELELVVVACWAIWSDQNKLVHGDSISPASAKSNWIRSYLDSFRKANPRRKTNQLAHRIRASDENRPSRWLPPPSDYWKLNVDAACLQYASLSRLRVESIVELNSLLAIN
FINKNYVVWEDMEADVARVWELTSQFLDIDFSFIPRYCNELADLGGVWGTE