; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0040920 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0040920
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRibonuclease H-like superfamily protein
Genome locationchr13:9787401..9788544
RNA-Seq ExpressionLag0040920
SyntenyLag0040920
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143317.1 uncharacterized protein LOC111013216 [Momordica charantia]1.9e-1830.41Show/hide
Query:  WSEKEYWSWMVNNLCGEDLAKGSIIMWSMWNARNKAAVENKLPDINLIRLLIEENIQEWQNSYLKRN--IPKPARSQTSH--------------TSWIKP
        W+ K+ W+W+VN L  E++A   +I W +W +RN++    +  D       ++ +I  + NS + +   I +  RSQ +                 W  P
Subjt:  WSEKEYWSWMVNNLCGEDLAKGSIIMWSMWNARNKAAVENKLPDINLIRLLIEENIQEWQNSYLKRN--IPKPARSQTSH--------------TSWIKP

Query:  KPNFWKLNADAAWFDNLGRGGVGWVVRDSSGSLITFGMRRVIRKWDMKSLEAKAIREGLKSIVDTCIQKKMSLEIESDSLAVIQALKREDVDLS
          N WKLN DA+W +    GG+GW++ D  G ++  G  ++  K ++ +LE   I  GL+ I    +Q +  + +ESDS+ VI+ +K+EDVDL+
Subjt:  KPNFWKLNADAAWFDNLGRGGVGWVVRDSSGSLITFGMRRVIRKWDMKSLEAKAIREGLKSIVDTCIQKKMSLEIESDSLAVIQALKREDVDLS

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia]1.9e-1832.54Show/hide
Query:  EDLAKGSIIMWSMWNARNKAAVENKLPDINLIRLLIEENI--QEWQNSYLK-RNIPKPAR-----SQTSHTSWIKPKPNFWKLNADAAWFDNLGRGGVGW
        E+  +  II W +W  RNK+  +   P+   I+L I+  I     +N+ LK ++  K           +   W  P  N WKLN +AAW  +   GG+GW
Subjt:  EDLAKGSIIMWSMWNARNKAAVENKLPDINLIRLLIEENI--QEWQNSYLK-RNIPKPAR-----SQTSHTSWIKPKPNFWKLNADAAWFDNLGRGGVGW

Query:  VVRDSSGSLITFGMRRVIRKWDMKSLEAKAIREGLKSIVDTCIQKKMSLEIESDSLAVIQALKREDVDLSEMKVFVDEIRSLAARLHAVNFSHCFREANS
        ++RD  G +I    R +  + ++  LE  AI EGL++I     +    + +ESDSL  I  L R+  D +E+   ++EI  +   +  V+  H  REAN 
Subjt:  VVRDSSGSLITFGMRRVIRKWDMKSLEAKAIREGLKSIVDTCIQKKMSLEIESDSLAVIQALKREDVDLSEMKVFVDEIRSLAARLHAVNFSHCFREANS

Query:  VAHWVAREA
        VAH +AR A
Subjt:  VAHWVAREA

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia]3.5e-1731.94Show/hide
Query:  EDLAKGSIIMWSMWNARNKAAVENKLPDINLIRLLIEENIQEWQNSYLKRNIPKPARSQTSH----------TSWIKPKPNFWKLNADAAWFDNLGRGGV
        E+  +  II W +W  RNK+  +    +   I+L+I+  I    ++    N+   + ++  H            W  P  N WKLN DAAW  +   GG+
Subjt:  EDLAKGSIIMWSMWNARNKAAVENKLPDINLIRLLIEENIQEWQNSYLKRNIPKPARSQTSH----------TSWIKPKPNFWKLNADAAWFDNLGRGGV

Query:  GWVVRDSSGSLITFGMRRVIRKWDMKSLEAKAIREGLKSI-VDTC--IQKK--MSLEIESDSLAVIQALKREDVDLSEMKVFVDEIRSLAARLHAVNFSH
        GW++RD  G +I    R +  + ++  LE  AI EGL++I  + C  IQ++    + +ESDSL  I  L R+  D +E+   ++EI  +   +  V+  H
Subjt:  GWVVRDSSGSLITFGMRRVIRKWDMKSLEAKAIREGLKSI-VDTC--IQKK--MSLEIESDSLAVIQALKREDVDLSEMKVFVDEIRSLAARLHAVNFSH

Query:  CFREANSVAHWVAREA
          REAN VAH +AR A
Subjt:  CFREANSVAHWVAREA

XP_031107832.1 uncharacterized protein LOC116012436 [Ipomoea triloba]1.3e-1629.91Show/hide
Query:  VDWGAAKDPKGWSEKEYWSWMVNNLCGEDLAKGSIIMWSMWNARNKAAVENKLPDINLIRLLIEENIQEWQNSYLKRNIPKPARSQTSHTSWIKPKPNFW
        VD+ ++     W E  +     N     DL     I W +W ARN+     K+ + + I L     + EW  + L  N    +R   + T W  P    +
Subjt:  VDWGAAKDPKGWSEKEYWSWMVNNLCGEDLAKGSIIMWSMWNARNKAAVENKLPDINLIRLLIEENIQEWQNSYLKRNIPKPARSQTSHTSWIKPKPNFW

Query:  KLNADAAWFDNLGRGGVGWVVRDSSGSLITFGMRRVIRKWDMKSLEAKAIREGLKSIVDTCIQKKMSLEIESDSLAVIQALKREDVDLSEMKVFVDEIRS
        KLN DAA   N  R G G+ +RDS+G+L+        R +     EA  ++E LK +    +    +L++ESD L VI  +   D+ +S   + ++++R 
Subjt:  KLNADAAWFDNLGRGGVGWVVRDSSGSLITFGMRRVIRKWDMKSLEAKAIREGLKSIVDTCIQKKMSLEIESDSLAVIQALKREDVDLSEMKVFVDEIRS

Query:  LAARLHAVNFSHCFREANSVAHWVAREATSFEFC
        LA+    + F    R AN VAH +AREA S   C
Subjt:  LAARLHAVNFSHCFREANSVAHWVAREATSFEFC

XP_038715125.1 uncharacterized protein LOC120008837 [Tripterygium wilfordii]6.0e-1731.82Show/hide
Query:  IIMWSMWNARNKAAVENKLPDINLIRLLIEENIQEWQNSYLKRNIPKPARSQTSHTSWIKPKPNFWKLNADAAWFDNLGRGGVGWVVRDSSGSLITFGMR
        +++W +W  RN+    NK   ++ +  L    + +WQ++ +  N  +          W KP   + K N D A F    + G GWV+RD  G +   G  
Subjt:  IIMWSMWNARNKAAVENKLPDINLIRLLIEENIQEWQNSYLKRNIPKPARSQTSHTSWIKPKPNFWKLNADAAWFDNLGRGGVGWVVRDSSGSLITFGMR

Query:  RVIRKWDMKSLEAKAIREGLKSIVDTCIQKKMSLEIESDSLAVIQALKREDVDLSEMKVFVDEIRSLAARLHAVNFSHCF--REANSVAHWVAREATS
         +    D    EA + RE L+ + D  I    ++ IESD+L ++QA+K   +D S + V +DE +SL   ++  N+  CF  R ANSVAH +AR A+S
Subjt:  RVIRKWDMKSLEAKAIREGLKSIVDTCIQKKMSLEIESDSLAVIQALKREDVDLSEMKVFVDEIRSLAARLHAVNFSHCF--REANSVAHWVAREATS

TrEMBL top hitse value%identityAlignment
A0A2N9F4S2 Reverse transcriptase domain-containing protein2.0e-1831.37Show/hide
Query:  EDLAKGSIIMWSMWNARNKAAVENKLPDINLIRLLIEENIQEWQNSYLKRNIPKPARSQTSHTSWIKPKPNFWKLNADAAWFDNLGRGGVGWVVRDSSGS
        + +A  + ++W +WN RNKA   N++  ++ I  L      E+ ++   + +P+   +  S T W  P  + +K+N DAA F +    GVG ++RD  G 
Subjt:  EDLAKGSIIMWSMWNARNKAAVENKLPDINLIRLLIEENIQEWQNSYLKRNIPKPARSQTSHTSWIKPKPNFWKLNADAAWFDNLGRGGVGWVVRDSSGS

Query:  LITFGMRRVIRKWDMKSLEAKAIREGLKSIVDTCIQKKMSLEIESDSLAVIQALKREDVDLSEMKVFVDEIRSLAARLHAVNFSHCFREANSVAHWVARE
         +    +R    + +   EA A RE L+  V+  I   + +E E DSL +  ALK +D   +     +DE R +A   H V+FSH  RE N  AH +AR 
Subjt:  LITFGMRRVIRKWDMKSLEAKAIREGLKSIVDTCIQKKMSLEIESDSLAVIQALKREDVDLSEMKVFVDEIRSLAARLHAVNFSHCFREANSVAHWVARE

Query:  ATSF
        A  +
Subjt:  ATSF

A0A6J1CP26 uncharacterized protein LOC1110134129.0e-1932.54Show/hide
Query:  EDLAKGSIIMWSMWNARNKAAVENKLPDINLIRLLIEENI--QEWQNSYLK-RNIPKPAR-----SQTSHTSWIKPKPNFWKLNADAAWFDNLGRGGVGW
        E+  +  II W +W  RNK+  +   P+   I+L I+  I     +N+ LK ++  K           +   W  P  N WKLN +AAW  +   GG+GW
Subjt:  EDLAKGSIIMWSMWNARNKAAVENKLPDINLIRLLIEENI--QEWQNSYLK-RNIPKPAR-----SQTSHTSWIKPKPNFWKLNADAAWFDNLGRGGVGW

Query:  VVRDSSGSLITFGMRRVIRKWDMKSLEAKAIREGLKSIVDTCIQKKMSLEIESDSLAVIQALKREDVDLSEMKVFVDEIRSLAARLHAVNFSHCFREANS
        ++RD  G +I    R +  + ++  LE  AI EGL++I     +    + +ESDSL  I  L R+  D +E+   ++EI  +   +  V+  H  REAN 
Subjt:  VVRDSSGSLITFGMRRVIRKWDMKSLEAKAIREGLKSIVDTCIQKKMSLEIESDSLAVIQALKREDVDLSEMKVFVDEIRSLAARLHAVNFSHCFREANS

Query:  VAHWVAREA
        VAH +AR A
Subjt:  VAHWVAREA

A0A6J1CQG0 uncharacterized protein LOC1110132169.0e-1930.41Show/hide
Query:  WSEKEYWSWMVNNLCGEDLAKGSIIMWSMWNARNKAAVENKLPDINLIRLLIEENIQEWQNSYLKRN--IPKPARSQTSH--------------TSWIKP
        W+ K+ W+W+VN L  E++A   +I W +W +RN++    +  D       ++ +I  + NS + +   I +  RSQ +                 W  P
Subjt:  WSEKEYWSWMVNNLCGEDLAKGSIIMWSMWNARNKAAVENKLPDINLIRLLIEENIQEWQNSYLKRN--IPKPARSQTSH--------------TSWIKP

Query:  KPNFWKLNADAAWFDNLGRGGVGWVVRDSSGSLITFGMRRVIRKWDMKSLEAKAIREGLKSIVDTCIQKKMSLEIESDSLAVIQALKREDVDLS
          N WKLN DA+W +    GG+GW++ D  G ++  G  ++  K ++ +LE   I  GL+ I    +Q +  + +ESDS+ VI+ +K+EDVDL+
Subjt:  KPNFWKLNADAAWFDNLGRGGVGWVVRDSSGSLITFGMRRVIRKWDMKSLEAKAIREGLKSIVDTCIQKKMSLEIESDSLAVIQALKREDVDLS

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X11.4e-1632.42Show/hide
Query:  DPKGWSEKEYWSWMVNNLCGEDLAKGSIIMWSMWNARNKAAVENKLPDINLIRLLIEENI--QEWQNSYLKRNI----PKPARSQTSHTSWIKPKPNFWK
        D   W+ KEYW W+++    E+  +  II   +W  RNK+  +    +   I+L I+  I     Q++ LKR      P       +   W  P  N WK
Subjt:  DPKGWSEKEYWSWMVNNLCGEDLAKGSIIMWSMWNARNKAAVENKLPDINLIRLLIEENI--QEWQNSYLKRNI----PKPARSQTSHTSWIKPKPNFWK

Query:  LNADAAWFDNLGRGGVGWVVRDSSGSLITFGMRRVIRKWDMKSLEAKAIREGLKSIVDTCIQKKMSLEIESDSLAVIQALKR
        LN DAAW  +    G+GW++RD  G +I  G R +  + ++  LE  AI EGL++I     +    + +ESDSL  I  L R
Subjt:  LNADAAWFDNLGRGGVGWVVRDSSGSLITFGMRRVIRKWDMKSLEAKAIREGLKSIVDTCIQKKMSLEIESDSLAVIQALKR

A0A6J1DSV1 uncharacterized protein LOC1110236081.7e-1731.94Show/hide
Query:  EDLAKGSIIMWSMWNARNKAAVENKLPDINLIRLLIEENIQEWQNSYLKRNIPKPARSQTSH----------TSWIKPKPNFWKLNADAAWFDNLGRGGV
        E+  +  II W +W  RNK+  +    +   I+L+I+  I    ++    N+   + ++  H            W  P  N WKLN DAAW  +   GG+
Subjt:  EDLAKGSIIMWSMWNARNKAAVENKLPDINLIRLLIEENIQEWQNSYLKRNIPKPARSQTSH----------TSWIKPKPNFWKLNADAAWFDNLGRGGV

Query:  GWVVRDSSGSLITFGMRRVIRKWDMKSLEAKAIREGLKSI-VDTC--IQKK--MSLEIESDSLAVIQALKREDVDLSEMKVFVDEIRSLAARLHAVNFSH
        GW++RD  G +I    R +  + ++  LE  AI EGL++I  + C  IQ++    + +ESDSL  I  L R+  D +E+   ++EI  +   +  V+  H
Subjt:  GWVVRDSSGSLITFGMRRVIRKWDMKSLEAKAIREGLKSI-VDTC--IQKK--MSLEIESDSLAVIQALKREDVDLSEMKVFVDEIRSLAARLHAVNFSH

Query:  CFREANSVAHWVAREA
          REAN VAH +AR A
Subjt:  CFREANSVAHWVAREA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10000.1 Ribonuclease H-like superfamily protein1.3e-0626.73Show/hide
Query:  IMWSMWNARNKAAVENK-LPDINLIRLLIEENIQEWQNSYLKRNIPKPARSQTSHTSWIKPKPNFWKLNADAAWFDNLGRGGVGWVVRDSSGS---LITF
        I W +W ARN+   +N     I  +   +++ +  WQ++ L   +PK  R  T   +      + +    DAAW       G GWV + +S S   + TF
Subjt:  IMWSMWNARNKAAVENK-LPDINLIRLLIEENIQEWQNSYLKRNIPKPARSQTSHTSWIKPKPNFWKLNADAAWFDNLGRGGVGWVVRDSSGS---LITF

Query:  --GMRR-----VIRKWDMKSLEAKAIREGLKSIVDTCIQKKMSLEIESDSLAVIQALKREDVDLSEMKVFVDEIRSLAARLHAVNFSHCFREANSVAHWV
          G RR         W +KS    A++            ++  L + SDS +++ AL   +V L+E+   + EIRS+  R  +++F    R  NS+A   
Subjt:  --GMRR-----VIRKWDMKSLEAKAIREGLKSIVDTCIQKKMSLEIESDSLAVIQALKREDVDLSEMKVFVDEIRSLAARLHAVNFSHCFREANSVAHWV

Query:  AR
        A+
Subjt:  AR

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein7.3e-1325.38Show/hide
Query:  IMWSMWNARNKAAVENKLPDINLIRLLIEENIQEWQ-NSYLKRNIPKPARSQTSHTSWIKPKPNFWKLNADAAWFDNLGRGGVGWVVRDSSGSLITFGMR
        ++W +W +RN+   + K  D   +     E+ +EW     L+     P   +     W  P   + K N DA W     R G+GW++R+ SG ++  G R
Subjt:  IMWSMWNARNKAAVENKLPDINLIRLLIEENIQEWQ-NSYLKRNIPKPARSQTSHTSWIKPKPNFWKLNADAAWFDNLGRGGVGWVVRDSSGSLITFGMR

Query:  RVIRKWDMKSLEAKAIREGLKSIVDTCIQKKMSLEIESDSLAVIQALKREDVDLSEMKVFVDEIRSLAARLHAVNFSHCFREANSVAHWVAREATSF
         + R  ++   E +A+R  + ++     ++   +  ESD+ A++  L  +D     ++  +++I+ L      V F    R  N VA  +ARE+ SF
Subjt:  RVIRKWDMKSLEAKAIREGLKSIVDTCIQKKMSLEIESDSLAVIQALKREDVDLSEMKVFVDEIRSLAARLHAVNFSHCFREANSVAHWVAREATSF

AT3G09510.1 Ribonuclease H-like superfamily protein1.3e-0620.64Show/hide
Query:  IMWSMWNARNKAAVENKLPDINLIRLLIEENIQEWQN-SYLKRNIPKPARS-QTSHTSWIKPKPNFWKLNADAAWFDNLGRGGVGWVVRDSSGSLITFGM
        ++W +W ARN           +   L  +    +W N +   +  P P R    +   W  P   + K N DA +         GW++R+  G+ I++G 
Subjt:  IMWSMWNARNKAAVENKLPDINLIRLLIEENIQEWQN-SYLKRNIPKPARS-QTSHTSWIKPKPNFWKLNADAAWFDNLGRGGVGWVVRDSSGSLITFGM

Query:  RRVIRKWDMKSLEAKAIREGLKSIVDTCIQKKMSLEIESDSLAVIQALKREDVDLSEMKVFVDEIRSLAARLHAVNFSHCFREANSVAHWVAREATSFEF
         ++    +    E KA+   L ++  T I+    + +E D   +I  +       S +   +++I   A +  ++ F    R+ N +AH +A+      +
Subjt:  RRVIRKWDMKSLEAKAIREGLKSIVDTCIQKKMSLEIESDSLAVIQALKREDVDLSEMKVFVDEIRSLAARLHAVNFSHCFREANSVAHWVAREATSFEF

Query:  CCNQETSLSKEKGQSFWV
         C   T  S       W+
Subjt:  CCNQETSLSKEKGQSFWV

AT4G29090.1 Ribonuclease H-like superfamily protein3.3e-1326.77Show/hide
Query:  IMWSMWNARNKAAVENKLPDINLIRLLIEENIQEWQ-NSYLKRNIPKPARSQTSHTSWIKPKPNFW-KLNADAAWFDNLGRGGVGWVVRDSSGSLITFGM
        ++W +W  RN+     +  +   +    E++++EW+  +  +    KP  +++S   W +P P+ W K N DA W  +  R G+GWV+R+  G +   G 
Subjt:  IMWSMWNARNKAAVENKLPDINLIRLLIEENIQEWQ-NSYLKRNIPKPARSQTSHTSWIKPKPNFW-KLNADAAWFDNLGRGGVGWVVRDSSGSLITFGM

Query:  RRVIRKWDMKSLEAKAIREGLKSIVDTCIQKKMSLEIESDSLAVIQALKREDVDLSEMKVFVDEIRSLAARLHAVNFSHCFREANSVAHWVAREATSF
        R + +   +   E +A+R  + S+  +  Q    +  ESDS  +I+ L  +++    +K  + +++ L ++   V F    RE N++A  VARE+ SF
Subjt:  RRVIRKWDMKSLEAKAIREGLKSIVDTCIQKKMSLEIESDSLAVIQALKREDVDLSEMKVFVDEIRSLAARLHAVNFSHCFREANSVAHWVAREATSF

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein6.0e-0720Show/hide
Query:  IMWSMWNARNKAAVENKLPDINLIRLLIEENIQEWQNSYLKRNIPKPARS--QTSHTSWIKPKPNFWKLNADAAWFDNLGRGGVGWVVRDSSGSLITFGM
        +MW +W + N     +          +   + +EW ++ +        R+   + +T W  P  +  K N DA+  +     G+GW++R+S G++I  GM
Subjt:  IMWSMWNARNKAAVENKLPDINLIRLLIEENIQEWQNSYLKRNIPKPARS--QTSHTSWIKPKPNFWKLNADAAWFDNLGRGGVGWVVRDSSGSLITFGM

Query:  RRVIRKWDMKSLEAKAIREGLKSIVDTCIQKKMSLEIESDSLAVIQALKREDVDLSEMKVFVDEIRSLAARLHAVNFSHCFREANSVAHWVAREA
         +   +   +  E   +   +++      +K +    E D+  + + +  +  +   ++ F+D I+S      ++ FS   RE N  A ++A++A
Subjt:  RRVIRKWDMKSLEAKAIREGLKSIVDTCIQKKMSLEIESDSLAVIQALKREDVDLSEMKVFVDEIRSLAARLHAVNFSHCFREANSVAHWVAREA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACATTTCGACAAAAAAGGTCATTTCTCGGTCAAAAGCGCTTACCGGCTAGCGAAAGATCTTTCCTCCTCCAAGGAAGCTTCTCAATCCAACAACAACAAAGTCTCCC
GAGAATGGAAAAGCATTTGGTCGACTGGGGTGCAGCCAAGGATCCAAAAGGATGGTCGGAAAAGGAATATTGGAGCTGGATGGTGAACAACCTATGCGGGGAAGACCTAG
CGAAAGGCTCAATTATAATGTGGAGCATGTGGAATGCCAGAAACAAGGCCGCAGTAGAAAACAAACTACCAGACATTAATCTCATCAGACTCTTGATTGAGGAAAATATT
CAGGAATGGCAGAACTCTTACCTTAAGAGGAACATCCCGAAGCCGGCGAGGAGCCAAACGAGTCACACATCGTGGATCAAACCGAAGCCTAACTTCTGGAAACTGAATGC
AGATGCTGCCTGGTTTGACAATTTGGGCAGAGGTGGAGTTGGCTGGGTCGTGCGTGACTCGTCTGGATCCTTGATCACCTTCGGTATGAGAAGAGTCATCAGAAAGTGGG
ACATGAAAAGTCTAGAAGCGAAAGCAATTAGGGAAGGGCTGAAATCGATTGTCGATACCTGCATTCAGAAGAAGATGTCCCTAGAGATTGAATCAGACTCTCTGGCCGTG
ATTCAAGCGCTGAAGAGGGAAGACGTCGATTTATCGGAGATGAAGGTATTCGTAGACGAAATTCGCTCCTTAGCCGCTCGCCTCCATGCCGTCAATTTCTCCCATTGCTT
CAGAGAGGCCAATTCAGTCGCCCACTGGGTTGCAAGGGAGGCAACATCCTTTGAATTTTGTTGTAATCAGGAGACATCCTTGTCTAAGGAAAAAGGGCAATCTTTTTGGG
TCCCTGATGTTCCTTCTTTTATTTGGCCCCTTATTTATAAGGGTTGTTCTTCTAGTTGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGACATTTCGACAAAAAAGGTCATTTCTCGGTCAAAAGCGCTTACCGGCTAGCGAAAGATCTTTCCTCCTCCAAGGAAGCTTCTCAATCCAACAACAACAAAGTCTCCC
GAGAATGGAAAAGCATTTGGTCGACTGGGGTGCAGCCAAGGATCCAAAAGGATGGTCGGAAAAGGAATATTGGAGCTGGATGGTGAACAACCTATGCGGGGAAGACCTAG
CGAAAGGCTCAATTATAATGTGGAGCATGTGGAATGCCAGAAACAAGGCCGCAGTAGAAAACAAACTACCAGACATTAATCTCATCAGACTCTTGATTGAGGAAAATATT
CAGGAATGGCAGAACTCTTACCTTAAGAGGAACATCCCGAAGCCGGCGAGGAGCCAAACGAGTCACACATCGTGGATCAAACCGAAGCCTAACTTCTGGAAACTGAATGC
AGATGCTGCCTGGTTTGACAATTTGGGCAGAGGTGGAGTTGGCTGGGTCGTGCGTGACTCGTCTGGATCCTTGATCACCTTCGGTATGAGAAGAGTCATCAGAAAGTGGG
ACATGAAAAGTCTAGAAGCGAAAGCAATTAGGGAAGGGCTGAAATCGATTGTCGATACCTGCATTCAGAAGAAGATGTCCCTAGAGATTGAATCAGACTCTCTGGCCGTG
ATTCAAGCGCTGAAGAGGGAAGACGTCGATTTATCGGAGATGAAGGTATTCGTAGACGAAATTCGCTCCTTAGCCGCTCGCCTCCATGCCGTCAATTTCTCCCATTGCTT
CAGAGAGGCCAATTCAGTCGCCCACTGGGTTGCAAGGGAGGCAACATCCTTTGAATTTTGTTGTAATCAGGAGACATCCTTGTCTAAGGAAAAAGGGCAATCTTTTTGGG
TCCCTGATGTTCCTTCTTTTATTTGGCCCCTTATTTATAAGGGTTGTTCTTCTAGTTGTTAG
Protein sequenceShow/hide protein sequence
MTFRQKRSFLGQKRLPASERSFLLQGSFSIQQQQSLPRMEKHLVDWGAAKDPKGWSEKEYWSWMVNNLCGEDLAKGSIIMWSMWNARNKAAVENKLPDINLIRLLIEENI
QEWQNSYLKRNIPKPARSQTSHTSWIKPKPNFWKLNADAAWFDNLGRGGVGWVVRDSSGSLITFGMRRVIRKWDMKSLEAKAIREGLKSIVDTCIQKKMSLEIESDSLAV
IQALKREDVDLSEMKVFVDEIRSLAARLHAVNFSHCFREANSVAHWVAREATSFEFCCNQETSLSKEKGQSFWVPDVPSFIWPLIYKGCSSSC