; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026968 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026968
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRNase H domain-containing protein
Genome locationchr10:43834678..43836794
RNA-Seq ExpressionLag0026968
SyntenyLag0026968
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143317.1 uncharacterized protein LOC111013216 [Momordica charantia]6.9e-2033.17Show/hide
Query:  RGNWNPSNYWTWMMENLKEEELEKAIMILCSLWNYRNKINQSGSRPDKSSIKRSIERNLELREDEQ---SSYPKSQQRG-----ARPELKTLTSQVRWRP
        R +W   + W W++  L +EE+  +++I   +W  RN+    G   D+  + RSI   +    D+    S   +SQQ        R  L  L   VRW  
Subjt:  RGNWNPSNYWTWMMENLKEEELEKAIMILCSLWNYRNKINQSGSRPDKSSIKRSIERNLELREDEQ---SSYPKSQQRG-----ARPELKTLTSQVRWRP

Query:  PPANAWKLNTDASWNEKEKTGGIGWILRDSSGSSICLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSVPIFVESDASNVVKILNGEDEDIS
        PP N WKLNTDASW+E+ + GGIGWIL D  G  +  G  +I +   I  +E+  I++GL+ I          +  PI++ESD+  V++++  ED D++
Subjt:  PPANAWKLNTDASWNEKEKTGGIGWILRDSSGSSICLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSVPIFVESDASNVVKILNGEDEDIS

XP_022154990.1 uncharacterized protein LOC111022134 isoform X1 [Momordica charantia]1.6e-2132.98Show/hide
Query:  SLFSLNRGNWNPSNYWTWMMENLKEEELEKAIMILCSLWNYRNKINQSGSRPDKSSIKRSIERNLELREDEQSSYPKSQQRGARPELKT-LTSQVRWRPP
        + F ++R NW    YW W+M+   EEE  ++++I C +W  RNK    G   +   I+ +I+R + +    Q +  K + +   P  +    ++ RW+PP
Subjt:  SLFSLNRGNWNPSNYWTWMMENLKEEELEKAIMILCSLWNYRNKINQSGSRPDKSSIKRSIERNLELREDEQSSYPKSQQRGARPELKT-LTSQVRWRPP

Query:  PANAWKLNTDASWNEKEKTGGIGWILRDSSGSSICLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSVPIFVESDASNVVKILN
         +N+WKLNTDA+W     T GIGWILRD  G  I  G + I    +I ++E+ AI +GL+ I             PI +ESD+   + +L+
Subjt:  PANAWKLNTDASWNEKEKTGGIGWILRDSSGSSICLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSVPIFVESDASNVVKILN

XP_022154991.1 uncharacterized protein LOC111022134 isoform X2 [Momordica charantia]1.4e-2033.51Show/hide
Query:  RGNWNPSNYWTWMMENLKEEELEKAIMILCSLWNYRNKINQSGSRPDKSSIKRSIERNLELREDEQSSYPKSQQRGARPELKT-LTSQVRWRPPPANAWK
        R NW    YW W+M+   EEE  ++++I C +W  RNK    G   +   I+ +I+R + +    Q +  K + +   P  +    ++ RW+PP +N+WK
Subjt:  RGNWNPSNYWTWMMENLKEEELEKAIMILCSLWNYRNKINQSGSRPDKSSIKRSIERNLELREDEQSSYPKSQQRGARPELKT-LTSQVRWRPPPANAWK

Query:  LNTDASWNEKEKTGGIGWILRDSSGSSICLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSVPIFVESDASNVVKILN
        LNTDA+W     T GIGWILRD  G  I  G + I    +I ++E+ AI +GL+ I             PI +ESD+   + +L+
Subjt:  LNTDASWNEKEKTGGIGWILRDSSGSSICLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSVPIFVESDASNVVKILN

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]1.4e-2028.7Show/hide
Query:  MMENLKEEELEKAIMILCSLWNYRNKINQSGSRPDKSSIKRSIERNLELREDEQSSYPKSQQRGARPELKTLTSQVRWRPPPANAWKLNTDASWNEKEKT
        M++   +E+L+  ++    +WN+RN +   G     S++ + + + +      +SSY    +       KTL ++++W PPP + W LN DASW++    
Subjt:  MMENLKEEELEKAIMILCSLWNYRNKINQSGSRPDKSSIKRSIERNLELREDEQSSYPKSQQRGARPELKTLTSQVRWRPPPANAWKLNTDASWNEKEKT

Query:  GGIGWILRDSSGSSICLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSVPIFVESDASNVVKILNGEDEDISEISFLTEEIRRLKNQFQEIHF
        GGIGWI+R   G  +  G + +    ++K +E  AIL+GL+N+ ++G+        P+ +E+D++ V  +LN + ED+++  ++ EEI  L++  + + F
Subjt:  GGIGWILRDSSGSSICLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSVPIFVESDASNVVKILNGEDEDISEISFLTEEIRRLKNQFQEIHF

Query:  DYCPREQNAVADRLTR
            RE N  A  L +
Subjt:  DYCPREQNAVADRLTR

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia]9.9e-1931.25Show/hide
Query:  MENLKEEELEKAIMILCSLWNYRNKINQSGSRPDKSSIKRSIERNLELREDEQSSYPKSQQRGARPELKTL-----TSQVRWRPPPANAWKLNTDASWNE
        M+   EEE  ++++I   +W  RNK    G   +   I+  I+R +       ++    + + A  +L  +      +  RW+PP +N+WKLNTDA+W  
Subjt:  MENLKEEELEKAIMILCSLWNYRNKINQSGSRPDKSSIKRSIERNLELREDEQSSYPKSQQRGARPELKTL-----TSQVRWRPPPANAWKLNTDASWNE

Query:  KEKTGGIGWILRDSSGSSICLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSV----PIFVESDASNVVKILNGEDEDISEISFLTEEIRRLK
           TGGIGWILRD  G  I    + I    +I ++E+ AI +GL+   +I  E+  P       PI +ESD+   + +L+ + +D +EI +L EEI ++ 
Subjt:  KEKTGGIGWILRDSSGSSICLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSV----PIFVESDASNVVKILNGEDEDISEISFLTEEIRRLK

Query:  NQFQEIHFDYCPREQNAVADRLTR
           + +   +  RE N VA  L R
Subjt:  NQFQEIHFDYCPREQNAVADRLTR

TrEMBL top hitse value%identityAlignment
A0A6J1CQG0 uncharacterized protein LOC1110132163.3e-2033.17Show/hide
Query:  RGNWNPSNYWTWMMENLKEEELEKAIMILCSLWNYRNKINQSGSRPDKSSIKRSIERNLELREDEQ---SSYPKSQQRG-----ARPELKTLTSQVRWRP
        R +W   + W W++  L +EE+  +++I   +W  RN+    G   D+  + RSI   +    D+    S   +SQQ        R  L  L   VRW  
Subjt:  RGNWNPSNYWTWMMENLKEEELEKAIMILCSLWNYRNKINQSGSRPDKSSIKRSIERNLELREDEQ---SSYPKSQQRG-----ARPELKTLTSQVRWRP

Query:  PPANAWKLNTDASWNEKEKTGGIGWILRDSSGSSICLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSVPIFVESDASNVVKILNGEDEDIS
        PP N WKLNTDASW+E+ + GGIGWIL D  G  +  G  +I +   I  +E+  I++GL+ I          +  PI++ESD+  V++++  ED D++
Subjt:  PPANAWKLNTDASWNEKEKTGGIGWILRDSSGSSICLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSVPIFVESDASNVVKILNGEDEDIS

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X17.9e-2232.98Show/hide
Query:  SLFSLNRGNWNPSNYWTWMMENLKEEELEKAIMILCSLWNYRNKINQSGSRPDKSSIKRSIERNLELREDEQSSYPKSQQRGARPELKT-LTSQVRWRPP
        + F ++R NW    YW W+M+   EEE  ++++I C +W  RNK    G   +   I+ +I+R + +    Q +  K + +   P  +    ++ RW+PP
Subjt:  SLFSLNRGNWNPSNYWTWMMENLKEEELEKAIMILCSLWNYRNKINQSGSRPDKSSIKRSIERNLELREDEQSSYPKSQQRGARPELKT-LTSQVRWRPP

Query:  PANAWKLNTDASWNEKEKTGGIGWILRDSSGSSICLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSVPIFVESDASNVVKILN
         +N+WKLNTDA+W     T GIGWILRD  G  I  G + I    +I ++E+ AI +GL+ I             PI +ESD+   + +L+
Subjt:  PANAWKLNTDASWNEKEKTGGIGWILRDSSGSSICLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSVPIFVESDASNVVKILN

A0A6J1DNV9 uncharacterized protein LOC1110224036.7e-2128.7Show/hide
Query:  MMENLKEEELEKAIMILCSLWNYRNKINQSGSRPDKSSIKRSIERNLELREDEQSSYPKSQQRGARPELKTLTSQVRWRPPPANAWKLNTDASWNEKEKT
        M++   +E+L+  ++    +WN+RN +   G     S++ + + + +      +SSY    +       KTL ++++W PPP + W LN DASW++    
Subjt:  MMENLKEEELEKAIMILCSLWNYRNKINQSGSRPDKSSIKRSIERNLELREDEQSSYPKSQQRGARPELKTLTSQVRWRPPPANAWKLNTDASWNEKEKT

Query:  GGIGWILRDSSGSSICLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSVPIFVESDASNVVKILNGEDEDISEISFLTEEIRRLKNQFQEIHF
        GGIGWI+R   G  +  G + +    ++K +E  AIL+GL+N+ ++G+        P+ +E+D++ V  +LN + ED+++  ++ EEI  L++  + + F
Subjt:  GGIGWILRDSSGSSICLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSVPIFVESDASNVVKILNGEDEDISEISFLTEEIRRLKNQFQEIHF

Query:  DYCPREQNAVADRLTR
            RE N  A  L +
Subjt:  DYCPREQNAVADRLTR

A0A6J1DQC9 uncharacterized protein LOC111022134 isoform X26.7e-2133.51Show/hide
Query:  RGNWNPSNYWTWMMENLKEEELEKAIMILCSLWNYRNKINQSGSRPDKSSIKRSIERNLELREDEQSSYPKSQQRGARPELKT-LTSQVRWRPPPANAWK
        R NW    YW W+M+   EEE  ++++I C +W  RNK    G   +   I+ +I+R + +    Q +  K + +   P  +    ++ RW+PP +N+WK
Subjt:  RGNWNPSNYWTWMMENLKEEELEKAIMILCSLWNYRNKINQSGSRPDKSSIKRSIERNLELREDEQSSYPKSQQRGARPELKT-LTSQVRWRPPPANAWK

Query:  LNTDASWNEKEKTGGIGWILRDSSGSSICLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSVPIFVESDASNVVKILN
        LNTDA+W     T GIGWILRD  G  I  G + I    +I ++E+ AI +GL+ I             PI +ESD+   + +L+
Subjt:  LNTDASWNEKEKTGGIGWILRDSSGSSICLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSVPIFVESDASNVVKILN

A0A6J1DSV1 uncharacterized protein LOC1110236084.8e-1931.25Show/hide
Query:  MENLKEEELEKAIMILCSLWNYRNKINQSGSRPDKSSIKRSIERNLELREDEQSSYPKSQQRGARPELKTL-----TSQVRWRPPPANAWKLNTDASWNE
        M+   EEE  ++++I   +W  RNK    G   +   I+  I+R +       ++    + + A  +L  +      +  RW+PP +N+WKLNTDA+W  
Subjt:  MENLKEEELEKAIMILCSLWNYRNKINQSGSRPDKSSIKRSIERNLELREDEQSSYPKSQQRGARPELKTL-----TSQVRWRPPPANAWKLNTDASWNE

Query:  KEKTGGIGWILRDSSGSSICLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSV----PIFVESDASNVVKILNGEDEDISEISFLTEEIRRLK
           TGGIGWILRD  G  I    + I    +I ++E+ AI +GL+   +I  E+  P       PI +ESD+   + +L+ + +D +EI +L EEI ++ 
Subjt:  KEKTGGIGWILRDSSGSSICLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSV----PIFVESDASNVVKILNGEDEDISEISFLTEEIRRLK

Query:  NQFQEIHFDYCPREQNAVADRLTR
           + +   +  RE N VA  L R
Subjt:  NQFQEIHFDYCPREQNAVADRLTR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10000.1 Ribonuclease H-like superfamily protein2.7e-0623.56Show/hide
Query:  ARPELKTLTSQVRWRPPPANAWKLNTDASWNEKEKTGGIGWILRDSSGS-----SICLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSVPIF
        A P+++  T +        + +    DA+W ++    G GW+ + +S S     +   G +R     + +   +K+ +     + ++ +E  +     + 
Subjt:  ARPELKTLTSQVRWRPPPANAWKLNTDASWNEKEKTGGIGWILRDSSGS-----SICLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSVPIF

Query:  VESDASNVVKILNGEDEDISEISFLTEEIRRLKNQFQEIHFDYCPREQNAVADRLTRLPVSSPMVSVLDSPSLV
        V SD+ ++V  LN  +  ++EI  L  EIR ++N+F+ I F + PR  N++AD   +L +       LD  +LV
Subjt:  VESDASNVVKILNGEDEDISEISFLTEEIRRLKNQFQEIHFDYCPREQNAVADRLTRLPVSSPMVSVLDSPSLV

AT1G52990.1 thioredoxin family protein7.1e-0725Show/hide
Query:  PANAWKLNTDASWNEKEKTGGIGWILRDSSGSSICLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSVPIFVESDASNVVKILNGEDEDISEI
        P+   K N DAS +E +   G+GW++R+S G+ +  G  +     + +  E  A++  ++   + G          +  E D SNV +++N +  D   +
Subjt:  PANAWKLNTDASWNEKEKTGGIGWILRDSSGSSICLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSVPIFVESDASNVVKILNGEDEDISEI

Query:  SFLTEEIRRLKNQFQEIHFDYCPREQNAVADRLTRLPVSS
            + I+     F    F +  REQN  AD L +  + S
Subjt:  SFLTEEIRRLKNQFQEIHFDYCPREQNAVADRLTRLPVSS

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein4.4e-1728.78Show/hide
Query:  ILCSLWNYRNKINQSGSRPDKSSIKRSIERNLELREDEQSSYPKSQQRGARPELKTLTSQVRWRPPPANAWKLNTDASWNEKEKTGGIGWILRDSSGSSI
        +L  LW  RN++   G   D   + R    + E    E S+  + + + + P+++   S V+W+ PP    K NTDA+W  +    GIGWILR+ SG  +
Subjt:  ILCSLWNYRNKINQSGSRPDKSSIKRSIERNLELREDEQSSYPKSQQRGARPELKTLTSQVRWRPPPANAWKLNTDASWNEKEKTGGIGWILRDSSGSSI

Query:  CLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSVPIFVESDASNVVKILNGEDEDISEISFLTEEIRRLKNQFQEIHFDYCPREQNAVADRLT
         +G + + +  ++   E++A+   +  +     +        I  ESDA  +V +LN  D+    +    E+I++L + F+E+ F++ PR  N VADR+ 
Subjt:  CLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSVPIFVESDASNVVKILNGEDEDISEISFLTEEIRRLKNQFQEIHFDYCPREQNAVADRLT

Query:  RLPVS
        R  +S
Subjt:  RLPVS

AT4G29090.1 Ribonuclease H-like superfamily protein5.2e-1829.18Show/hide
Query:  NRGNWNPSNYWTWMMENLKEEELEKAIMILCSLWNYRNKINQSGSRPDKSSIKRSIERNLELREDEQSSYPKSQQRGARPELKTLTSQVRWRPPPANAWK
        N GN NP     W      E+  +    +L  LW  RN++   G   +   + R  E +LE    E     +++  G +P++   +S  RWRPPP    K
Subjt:  NRGNWNPSNYWTWMMENLKEEELEKAIMILCSLWNYRNKINQSGSRPDKSSIKRSIERNLELREDEQSSYPKSQQRGARPELKTLTSQVRWRPPPANAWK

Query:  LNTDASWNEKEKTGGIGWILRDSSGSSICLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSVPIFVESDASNVVKILNGEDEDISEISFLTEE
         NTDA+WN   +  GIGW+LR+  G    +G + + K  S+   E++A+   + ++              +  ESD+  +++ILN  DE    +    ++
Subjt:  LNTDASWNEKEKTGGIGWILRDSSGSSICLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSVPIFVESDASNVVKILNGEDEDISEISFLTEE

Query:  IRRLKNQFQEIHFDYCPREQNAVADRLTRLPVS
        ++RL +QF E+ F + PRE N +A+R+ R  +S
Subjt:  IRRLKNQFQEIHFDYCPREQNAVADRLTRLPVS

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.9e-0821.29Show/hide
Query:  ILCSLWNYRNKINQSGSRPD-KSSIKRSIERNLELREDEQSSYPKSQQRGARPELKTLTSQVRWRPPPANAWKLNTDASWNEKEKTGGIGWILRDSSGSS
        ++  +W   N +  + +R   +++++ ++    E  ++  ++  ++  R A P   T     +W PP  +  K N DAS +E+    G+GWILR+S G+ 
Subjt:  ILCSLWNYRNKINQSGSRPD-KSSIKRSIERNLELREDEQSSYPKSQQRGARPELKTLTSQVRWRPPPANAWKLNTDASWNEKEKTGGIGWILRDSSGSS

Query:  ICLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSVPIFVESDASNVVKILNGEDEDISEISFLTEEIRRLKNQFQEIHFDYCPREQNAVADRL
        I  G  +     + +  E   ++  ++     G +        +  E D   + +++N +  +   +    + I+     F+ I F +  REQN  AD L
Subjt:  ICLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSVPIFVESDASNVVKILNGEDEDISEISFLTEEIRRLKNQFQEIHFDYCPREQNAVADRL

Query:  TR
         +
Subjt:  TR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTCTATTTTCCTTGAACAGAGGCAACTGGAACCCGTCAAATTACTGGACGTGGATGATGGAAAACTTAAAGGAAGAAGAACTCGAGAAAGCAATTATGATTCTTTG
CAGCTTATGGAATTACAGAAACAAAATCAATCAATCAGGCAGCAGACCAGACAAAAGTTCAATAAAGAGATCGATTGAACGAAACTTAGAATTGAGGGAAGATGAGCAAA
GTTCGTACCCGAAGTCCCAGCAGCGAGGCGCAAGGCCAGAGTTAAAGACCCTCACGAGTCAAGTGAGATGGAGACCACCACCGGCAAATGCATGGAAGCTTAATACAGAC
GCGTCCTGGAACGAAAAAGAGAAAACGGGTGGAATCGGTTGGATTCTACGTGACTCTTCAGGGTCTTCTATCTGCTTGGGATTCCAACGAATCAACAAAAATTGGTCGAT
AAAATTCATGGAAATGAAGGCGATTTTAAAAGGATTGAAGAACATACCTTCAATCGGCATTGAGAATGGCAATCCGACCTCTGTTCCCATCTTCGTAGAATCTGACGCCT
CAAATGTTGTTAAGATACTTAATGGCGAAGATGAGGATATCTCGGAGATCTCTTTCCTTACCGAAGAGATAAGGCGCTTGAAGAATCAATTCCAAGAGATTCATTTCGAC
TACTGCCCAAGAGAACAAAACGCAGTCGCAGATCGTTTGACGCGTTTACCGGTTTCTTCCCCCATGGTTTCTGTTTTGGATTCCCCTTCGCTGGTGGGAACGGGAATGGG
CTTCTGGCGGGGGTCTCCCCCTTTTTGGATTAGAAACCTCTTAAATGAGTCAAGCTCTTGGGATGATCCTTTGAAAGAGAAGAAAAACTCCTCGTATCTGGAAGGCGTGG
GACATCGTCGAGCAAATCTGAACGTCGTGAGCAGACATCGATCAGTTTGTGGTTCTTTGCACAAGGAGGGAGAACGAGAAGAAGGCGTGGAGGTTTTCACCGTGGAAGTT
GAGAGGAAGGAAGATCTGCTAGTCGGCGCGGAGGTCTTCACTGTGGGTCTACTTGGTGAGGGACAAGGGAAAGACAGAGGACTTTGTTCAGAGGTGGTTCTCAGTTTTTC
AGAGTGGGTCTGCTCGGTGGGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTCTATTTTCCTTGAACAGAGGCAACTGGAACCCGTCAAATTACTGGACGTGGATGATGGAAAACTTAAAGGAAGAAGAACTCGAGAAAGCAATTATGATTCTTTG
CAGCTTATGGAATTACAGAAACAAAATCAATCAATCAGGCAGCAGACCAGACAAAAGTTCAATAAAGAGATCGATTGAACGAAACTTAGAATTGAGGGAAGATGAGCAAA
GTTCGTACCCGAAGTCCCAGCAGCGAGGCGCAAGGCCAGAGTTAAAGACCCTCACGAGTCAAGTGAGATGGAGACCACCACCGGCAAATGCATGGAAGCTTAATACAGAC
GCGTCCTGGAACGAAAAAGAGAAAACGGGTGGAATCGGTTGGATTCTACGTGACTCTTCAGGGTCTTCTATCTGCTTGGGATTCCAACGAATCAACAAAAATTGGTCGAT
AAAATTCATGGAAATGAAGGCGATTTTAAAAGGATTGAAGAACATACCTTCAATCGGCATTGAGAATGGCAATCCGACCTCTGTTCCCATCTTCGTAGAATCTGACGCCT
CAAATGTTGTTAAGATACTTAATGGCGAAGATGAGGATATCTCGGAGATCTCTTTCCTTACCGAAGAGATAAGGCGCTTGAAGAATCAATTCCAAGAGATTCATTTCGAC
TACTGCCCAAGAGAACAAAACGCAGTCGCAGATCGTTTGACGCGTTTACCGGTTTCTTCCCCCATGGTTTCTGTTTTGGATTCCCCTTCGCTGGTGGGAACGGGAATGGG
CTTCTGGCGGGGGTCTCCCCCTTTTTGGATTAGAAACCTCTTAAATGAGTCAAGCTCTTGGGATGATCCTTTGAAAGAGAAGAAAAACTCCTCGTATCTGGAAGGCGTGG
GACATCGTCGAGCAAATCTGAACGTCGTGAGCAGACATCGATCAGTTTGTGGTTCTTTGCACAAGGAGGGAGAACGAGAAGAAGGCGTGGAGGTTTTCACCGTGGAAGTT
GAGAGGAAGGAAGATCTGCTAGTCGGCGCGGAGGTCTTCACTGTGGGTCTACTTGGTGAGGGACAAGGGAAAGACAGAGGACTTTGTTCAGAGGTGGTTCTCAGTTTTTC
AGAGTGGGTCTGCTCGGTGGGTTAG
Protein sequenceShow/hide protein sequence
MSLFSLNRGNWNPSNYWTWMMENLKEEELEKAIMILCSLWNYRNKINQSGSRPDKSSIKRSIERNLELREDEQSSYPKSQQRGARPELKTLTSQVRWRPPPANAWKLNTD
ASWNEKEKTGGIGWILRDSSGSSICLGFQRINKNWSIKFMEMKAILKGLKNIPSIGIENGNPTSVPIFVESDASNVVKILNGEDEDISEISFLTEEIRRLKNQFQEIHFD
YCPREQNAVADRLTRLPVSSPMVSVLDSPSLVGTGMGFWRGSPPFWIRNLLNESSSWDDPLKEKKNSSYLEGVGHRRANLNVVSRHRSVCGSLHKEGEREEGVEVFTVEV
ERKEDLLVGAEVFTVGLLGEGQGKDRGLCSEVVLSFSEWVCSVG