; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0017644 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0017644
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRibonuclease H-like superfamily protein
Genome locationchr5:6359134..6365331
RNA-Seq ExpressionLag0017644
SyntenyLag0017644
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_015383853.1 uncharacterized protein LOC107176237 [Citrus sinensis]2.6e-1527.39Show/hide
Query:  KARETSSHVLWNCKKVRFLW-----ETLFPKLTHFLCPCSDNEEFSVVWEAITEHLSKEEVERAAVILWSLWNGRNKTIVNSNKVDFQQIIRSTKRNLEE
        + +ET++H L  CK  + +W     ET FP           N++   +   + + L+K ++E    I W++W G+NK +    K++    I   +  +E 
Subjt:  KARETSSHVLWNCKKVRFLW-----ETLFPKLTHFLCPCSDNEEFSVVWEAITEHLSKEEVERAAVILWSLWNGRNKTIVNSNKVDFQQIIRSTKRNLEE

Query:  QRKRQNSHLELLNLESLRNQDVWSPPPPSCVKINSDASWNEEKQQGGVGWIARDSSGS-----------------------PIGFGKKNRSSHYPIIVEA
         R+ +   LEL    S +NQ VWSPPP +  K+N DA+ N E+Q+ G+G + RDS                            G      ++   I+VE+
Subjt:  QRKRQNSHLELLNLESLRNQDVWSPPPPSCVKINSDASWNEEKQQGGVGWIARDSSGS-----------------------PIGFGKKNRSSHYPIIVEA

Query:  DASEAIKALNHEATDLSESNLVLSEIENLA
        D  E +K +N++    +E   V+SEI++L+
Subjt:  DASEAIKALNHEATDLSESNLVLSEIENLA

XP_015385738.1 uncharacterized protein LOC107177034 [Citrus sinensis]1.3e-1426.25Show/hide
Query:  LARKARETSSHVLWNCKKVRFLWETLFPKLTHFLCPCSDNEEFSVVWEAITEHLSKEEVERAAVILWSLWNGRNKTIVNSNKVDFQQIIRSTKRNLEEQR
        + ++ +ET++H L  CK  + +W    P  T FL   + N++   +   + + L+K ++E    I W++W GRNK +    K++    I   +  +E  R
Subjt:  LARKARETSSHVLWNCKKVRFLWETLFPKLTHFLCPCSDNEEFSVVWEAITEHLSKEEVERAAVILWSLWNGRNKTIVNSNKVDFQQIIRSTKRNLEEQR

Query:  KRQNSHLELLNLESLRNQDVWSPPPPSCVKINSDASWNEEKQQGGVGWIARDSSGS-----------------------PIGFGKKNRSSHYPIIVEADA
        + +   L+     S +NQ VWSPPP +  K+N DA+ N ++Q+ G+G + RDSS                           G      ++   I+VE+D 
Subjt:  KRQNSHLELLNLESLRNQDVWSPPPPSCVKINSDASWNEEKQQGGVGWIARDSSGS-----------------------PIGFGKKNRSSHYPIIVEADA

Query:  SEAIKALNHEATDLSESNLVLSEIENLASSSNIVAFVKCP
         E +K +N+     +E   V+SEI++L+     +++   P
Subjt:  SEAIKALNHEATDLSESNLVLSEIENLASSSNIVAFVKCP

XP_018435800.1 PREDICTED: uncharacterized protein LOC108808101 [Raphanus sativus]4.2e-1329.49Show/hide
Query:  ETSSHVLWNCKKVRFLWETLFPKLTHFLCP----CSDNEEFSVVWEA-ITEHLSKEEVER--AAVILWSLWNGRNKTIVNSNKVDFQQIIRSTKRNLEEQ
        E+ +H+L+ C   R  W      + +   P     SD+   ++ W   + +   KEEV+     V+LW LW  RN+ +      D    +R    ++EE 
Subjt:  ETSSHVLWNCKKVRFLWETLFPKLTHFLCP----CSDNEEFSVVWEA-ITEHLSKEEVER--AAVILWSLWNGRNKTIVNSNKVDFQQIIRSTKRNLEEQ

Query:  RKRQNSHLELLNLESLRNQD-VWSPPPPSCVKINSDASWNEEKQQGGVGWIARDSSGSPIGFG-KKNRSSHYPIIVEADASEAIKALNHEATDLSESNLV
        R R++   E +   +    D  W+PPPP  VK N+D SW++E + GGVGW+ RD   + +  G +K       I VEA+A  A    +   T L   +  
Subjt:  RKRQNSHLELLNLESLRNQD-VWSPPPPSCVKINSDASWNEEKQQGGVGWIARDSSGSPIGFG-KKNRSSHYPIIVEADASEAIKALNHEATDLSESNLV

Query:  LSEIENLASSSNIVAFV
         S +++L  S NI A +
Subjt:  LSEIENLASSSNIVAFV

XP_022143317.1 uncharacterized protein LOC111013216 [Momordica charantia]1.6e-1232.6Show/hide
Query:  WEAITEHLSKEEVERAAVILWSLWNGRNKTIVNSNKVDFQQIIRS----TKRNLEE-------QRKRQNSHLELLNLESLRNQDV-WSPPPPSCVKINSD
        W  +   LS EEV  + VI W +W  RN++I     +D QQ+ RS       N+++       +R +QN        E+L    V WS PP +C K+N+D
Subjt:  WEAITEHLSKEEVERAAVILWSLWNGRNKTIVNSNKVDFQQIIRS----TKRNLEE-------QRKRQNSHLELLNLESLRNQDV-WSPPPPSCVKINSD

Query:  ASWNEEKQQGGVGWIARDSSGSPIGFG------KK-----------------NRSSHYPIIVEADASEAIKALNHEATDLS
        ASW+EE++ GG+GWI  D  G  +  G      KK                 N  S  PI +E+D+ E I+ +  E  DL+
Subjt:  ASWNEEKQQGGVGWIARDSSGSPIGFG------KK-----------------NRSSHYPIIVEADASEAIKALNHEATDLS

XP_022154990.1 uncharacterized protein LOC111022134 isoform X1 [Momordica charantia]4.7e-1231.14Show/hide
Query:  RKARETSSHVLWNCKKVRFLWETLFPKLTHFLCPCSDNEEFSVVWEAITEHLSKEEVERAAVILWSLWNGRNKTI---VNSNKVDFQ-----QIIRSTKR
        RK  ET+ H+LW CK ++ +W    P   +F      N      WE + +   +EE  R+ +I   +W  RNK+I   V+S   D Q      II S  +
Subjt:  RKARETSSHVLWNCKKVRFLWETLFPKLTHFLCPCSDNEEFSVVWEAITEHLSKEEVERAAVILWSLWNGRNKTI---VNSNKVDFQ-----QIIRSTKR

Query:  NLEEQRKRQNSHLELLNLESLRNQDVWSPPPPSCVKINSDASWNEEKQQGGVGWIARDSSGSPIGFG
        +   +RK ++ H      ++ R +  W PP  +  K+N+DA+W  +    G+GWI RD  G  I  G
Subjt:  NLEEQRKRQNSHLELLNLESLRNQDVWSPPPPSCVKINSDASWNEEKQQGGVGWIARDSSGSPIGFG

TrEMBL top hitse value%identityAlignment
A0A1J3C7A8 Putative ribonuclease H protein (Fragment)4.3e-1124.38Show/hide
Query:  DADYEVHHWITQIHDMCND-SMSLSFVHIRRGRNEMVDALARKARETSSHVLWNCKKVRFLWE-TLFPKLTHFLCPCSDNEEFSVVWEAITEH-LSKEEV
        +A+ ++HH++ +    C   +++L + HI R  +           E+ +H+L+ C   R +W  + FP     +   S       V+     H  +++E 
Subjt:  DADYEVHHWITQIHDMCND-SMSLSFVHIRRGRNEMVDALARKARETSSHVLWNCKKVRFLWE-TLFPKLTHFLCPCSDNEEFSVVWEAITEH-LSKEEV

Query:  ERAAVILWSLWNGRNKTIVNSNKVDFQQIIRSTKRNLEEQRKRQNSHLELLNLESLRNQ---DVWSPPPPSCVKINSDASWNEEKQQGGVGWIARDSSGS
             ++W +W  RN  I+   +      +   + ++EE  K      +     S  N    + W PPP   +K N D +W+EE  + G+GW+ RDS G 
Subjt:  ERAAVILWSLWNGRNKTIVNSNKVDFQQIIRSTKRNLEEQRKRQNSHLELLNLESLRNQ---DVWSPPPPSCVKINSDASWNEEKQQGGVGWIARDSSGS

Query:  PIGFG-KKNRSSHYPIIVEADASEAIKALNHEATDLSESNLV
         I  G +K ++   P+  E    EA++   H  T L  SN++
Subjt:  PIGFG-KKNRSSHYPIIVEADASEAIKALNHEATDLSESNLV

A0A6J0JJI5 uncharacterized protein LOC1088081012.0e-1329.49Show/hide
Query:  ETSSHVLWNCKKVRFLWETLFPKLTHFLCP----CSDNEEFSVVWEA-ITEHLSKEEVER--AAVILWSLWNGRNKTIVNSNKVDFQQIIRSTKRNLEEQ
        E+ +H+L+ C   R  W      + +   P     SD+   ++ W   + +   KEEV+     V+LW LW  RN+ +      D    +R    ++EE 
Subjt:  ETSSHVLWNCKKVRFLWETLFPKLTHFLCP----CSDNEEFSVVWEA-ITEHLSKEEVER--AAVILWSLWNGRNKTIVNSNKVDFQQIIRSTKRNLEEQ

Query:  RKRQNSHLELLNLESLRNQD-VWSPPPPSCVKINSDASWNEEKQQGGVGWIARDSSGSPIGFG-KKNRSSHYPIIVEADASEAIKALNHEATDLSESNLV
        R R++   E +   +    D  W+PPPP  VK N+D SW++E + GGVGW+ RD   + +  G +K       I VEA+A  A    +   T L   +  
Subjt:  RKRQNSHLELLNLESLRNQD-VWSPPPPSCVKINSDASWNEEKQQGGVGWIARDSSGSPIGFG-KKNRSSHYPIIVEADASEAIKALNHEATDLSESNLV

Query:  LSEIENLASSSNIVAFV
         S +++L  S NI A +
Subjt:  LSEIENLASSSNIVAFV

A0A6J1CQG0 uncharacterized protein LOC1110132167.8e-1332.6Show/hide
Query:  WEAITEHLSKEEVERAAVILWSLWNGRNKTIVNSNKVDFQQIIRS----TKRNLEE-------QRKRQNSHLELLNLESLRNQDV-WSPPPPSCVKINSD
        W  +   LS EEV  + VI W +W  RN++I     +D QQ+ RS       N+++       +R +QN        E+L    V WS PP +C K+N+D
Subjt:  WEAITEHLSKEEVERAAVILWSLWNGRNKTIVNSNKVDFQQIIRS----TKRNLEE-------QRKRQNSHLELLNLESLRNQDV-WSPPPPSCVKINSD

Query:  ASWNEEKQQGGVGWIARDSSGSPIGFG------KK-----------------NRSSHYPIIVEADASEAIKALNHEATDLS
        ASW+EE++ GG+GWI  D  G  +  G      KK                 N  S  PI +E+D+ E I+ +  E  DL+
Subjt:  ASWNEEKQQGGVGWIARDSSGSPIGFG------KK-----------------NRSSHYPIIVEADASEAIKALNHEATDLS

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X12.3e-1231.14Show/hide
Query:  RKARETSSHVLWNCKKVRFLWETLFPKLTHFLCPCSDNEEFSVVWEAITEHLSKEEVERAAVILWSLWNGRNKTI---VNSNKVDFQ-----QIIRSTKR
        RK  ET+ H+LW CK ++ +W    P   +F      N      WE + +   +EE  R+ +I   +W  RNK+I   V+S   D Q      II S  +
Subjt:  RKARETSSHVLWNCKKVRFLWETLFPKLTHFLCPCSDNEEFSVVWEAITEHLSKEEVERAAVILWSLWNGRNKTI---VNSNKVDFQ-----QIIRSTKR

Query:  NLEEQRKRQNSHLELLNLESLRNQDVWSPPPPSCVKINSDASWNEEKQQGGVGWIARDSSGSPIGFG
        +   +RK ++ H      ++ R +  W PP  +  K+N+DA+W  +    G+GWI RD  G  I  G
Subjt:  NLEEQRKRQNSHLELLNLESLRNQDVWSPPPPSCVKINSDASWNEEKQQGGVGWIARDSSGSPIGFG

A0A6J1DNV9 uncharacterized protein LOC1110224031.5e-1127.39Show/hide
Query:  SKEEVERAAVILWSLWNGRNKTIVNSNKVDFQQIIRSTKRNLEEQRKRQNSHLELLNLESLRNQDVWSPPPPSCVKINSDASWNEEKQQGGVGWIARDSS
        S E+++   +  W +WN RN  I       F  +I+   + + E   +  + L +L+ ++L N+  W PPP     +N+DASW++   +GG+GWI R   
Subjt:  SKEEVERAAVILWSLWNGRNKTIVNSNKVDFQQIIRSTKRNLEEQRKRQNSHLELLNLESLRNQDVWSPPPPSCVKINSDASWNEEKQQGGVGWIARDSS

Query:  GSPIGFGKK------------------------NRSSHYPIIVEADASEAIKALNHEATDLSESNLVLSEIENLASSSNIVAFVKCPPRAPPCAAGFSPT
        G  +  G +                        N     P+ +E D++E    LN +  DL+++  V+ EI NL  S  I+AF K       CA   +  
Subjt:  GSPIGFGKK------------------------NRSSHYPIIVEADASEAIKALNHEATDLSESNLVLSEIENLASSSNIVAFVKCPPRAPPCAAGFSPT

Query:  SSL---SAIEVDGIFSFWISSTLEEDFFFI
        +S+   S I VD  F  W+S   +   F I
Subjt:  SSL---SAIEVDGIFSFWISSTLEEDFFFI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.5e-1125.23Show/hide
Query:  ARETSSHVLWNCKKVRFLWETLFPKLTHFLCPCSDNEEFSVVWEAITEHLSKEEVERAAVI-------LWSLWNGRNKTIVNSNKVDFQQIIRSTKRNLE
        +RET +H+L+ C   R +W  + P   +     +D+   ++ W    E     E+ +   I       LW LW  RN+ +    + D  +++R    + E
Subjt:  ARETSSHVLWNCKKVRFLWETLFPKLTHFLCPCSDNEEFSVVWEAITEHLSKEEVERAAVI-------LWSLWNGRNKTIVNSNKVDFQQIIRSTKRNLE

Query:  EQRKRQNSHLELLNLESLRNQDV-WSPPPPSCVKINSDASWNEEKQQGGVGWIARDSSGSPIGFGKK-----------------------NRSSHYPIIV
        E   R+    +    +  RN  V W  PP   VK N+DA+W  E  + G+GWI R+ SG  +  G +                       +R ++  II 
Subjt:  EQRKRQNSHLELLNLESLRNQDV-WSPPPPSCVKINSDASWNEEKQQGGVGWIARDSSGSPIGFGKK-----------------------NRSSHYPIIV

Query:  EADASEAIKALNHE
        E+DA   +  LN +
Subjt:  EADASEAIKALNHE

AT3G09510.1 Ribonuclease H-like superfamily protein9.7e-0823.24Show/hide
Query:  KARETSSHVLWNCKKVRFLWETLFPKLTHFLCPCSDNEE-FSVVWEAITEHLSKEEVERAAV-ILWSLWNGRNKTIVNSNKVDFQQIIRSTKRNLEEQRK
        +  E+ +H L+ C      W      L       +D EE  S +   + +    +  +   V ++W +W  RN  + N  +    + + S K    +   
Subjt:  KARETSSHVLWNCKKVRFLWETLFPKLTHFLCPCSDNEE-FSVVWEAITEHLSKEEVERAAV-ILWSLWNGRNKTIVNSNKVDFQQIIRSTKRNLEEQRK

Query:  RQNSHLELLN--LESLRNQDVWSPPPPSCVKINSDASWNEEKQQGGVGWIARDSSGSPIGFGKKNRSSHYPIIVEADASEAIKAL
           SH +  +   +   N+  W  PP + VK N DA ++ +K +   GWI R+  G+PI +G   + +H    +EA+    + AL
Subjt:  RQNSHLELLN--LESLRNQDVWSPPPPSCVKINSDASWNEEKQQGGVGWIARDSSGSPIGFGKKNRSSHYPIIVEADASEAIKAL

AT4G29090.1 Ribonuclease H-like superfamily protein2.7e-1022.89Show/hide
Query:  RETSSHVLWNCKKVRFLWETLFPKLTHFLCPC----SDNEEFSVVWEAITEHLSKEEVERAAV----ILWSLWNGRNKTIVNSNKVDFQQIIRSTKRNLE
        +ET +H+L+ C   R  W      ++    P     +D+   ++ W          + E+A+     +LW LW  RN+ +    + + Q+++R  + +LE
Subjt:  RETSSHVLWNCKKVRFLWETLFPKLTHFLCPC----SDNEEFSVVWEAITEHLSKEEVERAAV----ILWSLWNGRNKTIVNSNKVDFQQIIRSTKRNLE

Query:  EQRKRQNSHLELLNLESLRNQDV---WSPPPPSCVKINSDASWNEEKQQGGVGWIARDSSGSPIGFGKK-----------------------NRSSHYPI
        E R R  +  E    +   N+     W PPP   VK N+DA+WN + ++ G+GW+ R+  G     G +                       +R  +  +
Subjt:  EQRKRQNSHLELLNLESLRNQDV---WSPPPPSCVKINSDASWNEEKQQGGVGWIARDSSGSPIGFGKK-----------------------NRSSHYPI

Query:  IVEADASEAIKALNHEATDLSESNLVLSEIENLASSSNIVAFVKCPPRAPPCAAGFSPTSSLSAIEVD----GIFSFWISSTLE
        I E+D+   I+ LN++          + +++ L S    V FV  P      A   +   SLS +  D     I   W  S+++
Subjt:  IVEADASEAIKALNHEATDLSESNLVLSEIENLASSSNIVAFVKCPPRAPPCAAGFSPTSSLSAIEVD----GIFSFWISSTLE

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein7.0e-0631.13Show/hide
Query:  ILWSLWNGRNKTIVNSNKVDFQQIIRSTKRNLEE------QRKRQNSHLELLNLESLRNQDVWSPPPPSCVKINSDASWNEEKQQGGVGWIARDSSGSPI
        ++W +W   N  + N  +  FQ  +     + +E        ++QN +    N +  RN   WSPP    +K N DAS +E     G+GWI R+S G+ I
Subjt:  ILWSLWNGRNKTIVNSNKVDFQQIIRSTKRNLEE------QRKRQNSHLELLNLESLRNQDVWSPPPPSCVKINSDASWNEEKQQGGVGWIARDSSGSPI

Query:  --GFGK
          G GK
Subjt:  --GFGK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACTTCAGGCCATTAAAGAATTTGAATGCAGTAGTTCAAGCTAAAATGAGGGTTGCAGATTTCATCACTCATTCTAATGGGTGGGATATGGAAAAGCTAAGACAAGT
GGTGGATAAGCCCATTATGGATTGCCCTGCTCGTTGTGAGTGGATCCAAAGTTATTGGAATGAACTCAGAGGTATTAGAGGTATCACGAAGTTATTACAGATGGAGTCAT
CCATGGATCATGATGTTTTGCCAGGCACAATAAAAGTATTCACGGATGCAGCTGTTAACATTCATAAAATTGGAGTGGGTCTTGGGGCCGTAATTGTTGGTTGTGATGAC
AACCTACAATGTGCAATGACGATGTTTGAACACAGGACTCTCTCTCCCTTAGCGGTAGAAGTGCAAGCAATCTTCCATGCAGTGAGATTACTCATTCGAATGCAGATTCA
TGAAGCTGTTTTATATTCAGATTCGTTAAATGACATTCGTATGATTAACAAGGACCAAGATGCTGATTATGAGGTTCACCATTGGATAACACAAATCCATGACATGTGCA
ACGATTCTATGTCGTTATCTTTTGTTCATATTCGACGAGGTCGAAACGAGATGGTCGATGCTTTGGCCAGGAAAGCTCGGGAGACCTCAAGTCACGTCCTATGGAATTGC
AAGAAAGTAAGATTTTTGTGGGAGACCCTATTCCCAAAACTTACCCACTTCCTTTGTCCGTGCAGTGATAACGAGGAGTTCTCCGTGGTGTGGGAAGCTATTACAGAGCA
CTTAAGCAAGGAGGAAGTCGAAAGAGCAGCCGTCATTCTATGGTCTTTATGGAATGGCAGGAACAAGACTATCGTCAACAGCAACAAAGTGGATTTCCAGCAGATAATTA
GATCAACCAAACGAAATTTAGAAGAACAAAGGAAAAGGCAGAATTCTCACCTGGAGTTACTCAACTTGGAGAGCCTTCGGAATCAAGATGTGTGGAGCCCCCCTCCGCCC
AGCTGTGTCAAGATTAATTCTGATGCCTCCTGGAATGAGGAGAAGCAGCAAGGCGGTGTAGGCTGGATTGCTCGTGATTCCTCAGGATCTCCTATAGGCTTTGGGAAGAA
GAATCGGAGCAGTCATTACCCGATTATTGTAGAGGCAGACGCGTCTGAGGCAATAAAGGCGCTTAACCATGAAGCTACGGACCTCTCGGAAAGCAATCTGGTGTTGAGCG
AAATCGAAAATCTGGCTTCTTCGAGCAATATAGTTGCCTTCGTCAAATGCCCACCTCGCGCACCACCTTGCGCGGCGGGTTTTTCGCCGACGAGCAGTCTGTCGGCCATT
GAAGTCGACGGGATATTTTCTTTTTGGATCTCTTCCACGCTGGAAGAAGACTTTTTTTTCATCTCTCTCTTCTACCTGAGCGCGCCACCATCGATTGAACAGCCAGCCAC
CATCGCGCCTCTCTCTGCCCAGTTGCCGCCGCGCGTCTCTCTCGGTAAGATCCTCTCTCCCTCACGATTTAATCTCTCTCTCTCTAATCCCATCGGTTTCTCCCTCAGTC
CTGATCTTCTGACAGCAAGCTCAATGGCAACGTGGAGAAGTCATTTAAGCGTCGTTCGGCTCGTGGGTCATCTTGGATCTGCGTGGGTCAGAGAAAGCACCAAGCTTTTC
TCCCAATTTCTAGCGTTTCGAGTGAACTCGGTTGTGAGTAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAACTTCAGGCCATTAAAGAATTTGAATGCAGTAGTTCAAGCTAAAATGAGGGTTGCAGATTTCATCACTCATTCTAATGGGTGGGATATGGAAAAGCTAAGACAAGT
GGTGGATAAGCCCATTATGGATTGCCCTGCTCGTTGTGAGTGGATCCAAAGTTATTGGAATGAACTCAGAGGTATTAGAGGTATCACGAAGTTATTACAGATGGAGTCAT
CCATGGATCATGATGTTTTGCCAGGCACAATAAAAGTATTCACGGATGCAGCTGTTAACATTCATAAAATTGGAGTGGGTCTTGGGGCCGTAATTGTTGGTTGTGATGAC
AACCTACAATGTGCAATGACGATGTTTGAACACAGGACTCTCTCTCCCTTAGCGGTAGAAGTGCAAGCAATCTTCCATGCAGTGAGATTACTCATTCGAATGCAGATTCA
TGAAGCTGTTTTATATTCAGATTCGTTAAATGACATTCGTATGATTAACAAGGACCAAGATGCTGATTATGAGGTTCACCATTGGATAACACAAATCCATGACATGTGCA
ACGATTCTATGTCGTTATCTTTTGTTCATATTCGACGAGGTCGAAACGAGATGGTCGATGCTTTGGCCAGGAAAGCTCGGGAGACCTCAAGTCACGTCCTATGGAATTGC
AAGAAAGTAAGATTTTTGTGGGAGACCCTATTCCCAAAACTTACCCACTTCCTTTGTCCGTGCAGTGATAACGAGGAGTTCTCCGTGGTGTGGGAAGCTATTACAGAGCA
CTTAAGCAAGGAGGAAGTCGAAAGAGCAGCCGTCATTCTATGGTCTTTATGGAATGGCAGGAACAAGACTATCGTCAACAGCAACAAAGTGGATTTCCAGCAGATAATTA
GATCAACCAAACGAAATTTAGAAGAACAAAGGAAAAGGCAGAATTCTCACCTGGAGTTACTCAACTTGGAGAGCCTTCGGAATCAAGATGTGTGGAGCCCCCCTCCGCCC
AGCTGTGTCAAGATTAATTCTGATGCCTCCTGGAATGAGGAGAAGCAGCAAGGCGGTGTAGGCTGGATTGCTCGTGATTCCTCAGGATCTCCTATAGGCTTTGGGAAGAA
GAATCGGAGCAGTCATTACCCGATTATTGTAGAGGCAGACGCGTCTGAGGCAATAAAGGCGCTTAACCATGAAGCTACGGACCTCTCGGAAAGCAATCTGGTGTTGAGCG
AAATCGAAAATCTGGCTTCTTCGAGCAATATAGTTGCCTTCGTCAAATGCCCACCTCGCGCACCACCTTGCGCGGCGGGTTTTTCGCCGACGAGCAGTCTGTCGGCCATT
GAAGTCGACGGGATATTTTCTTTTTGGATCTCTTCCACGCTGGAAGAAGACTTTTTTTTCATCTCTCTCTTCTACCTGAGCGCGCCACCATCGATTGAACAGCCAGCCAC
CATCGCGCCTCTCTCTGCCCAGTTGCCGCCGCGCGTCTCTCTCGGTAAGATCCTCTCTCCCTCACGATTTAATCTCTCTCTCTCTAATCCCATCGGTTTCTCCCTCAGTC
CTGATCTTCTGACAGCAAGCTCAATGGCAACGTGGAGAAGTCATTTAAGCGTCGTTCGGCTCGTGGGTCATCTTGGATCTGCGTGGGTCAGAGAAAGCACCAAGCTTTTC
TCCCAATTTCTAGCGTTTCGAGTGAACTCGGTTGTGAGTAAGTGA
Protein sequenceShow/hide protein sequence
MNFRPLKNLNAVVQAKMRVADFITHSNGWDMEKLRQVVDKPIMDCPARCEWIQSYWNELRGIRGITKLLQMESSMDHDVLPGTIKVFTDAAVNIHKIGVGLGAVIVGCDD
NLQCAMTMFEHRTLSPLAVEVQAIFHAVRLLIRMQIHEAVLYSDSLNDIRMINKDQDADYEVHHWITQIHDMCNDSMSLSFVHIRRGRNEMVDALARKARETSSHVLWNC
KKVRFLWETLFPKLTHFLCPCSDNEEFSVVWEAITEHLSKEEVERAAVILWSLWNGRNKTIVNSNKVDFQQIIRSTKRNLEEQRKRQNSHLELLNLESLRNQDVWSPPPP
SCVKINSDASWNEEKQQGGVGWIARDSSGSPIGFGKKNRSSHYPIIVEADASEAIKALNHEATDLSESNLVLSEIENLASSSNIVAFVKCPPRAPPCAAGFSPTSSLSAI
EVDGIFSFWISSTLEEDFFFISLFYLSAPPSIEQPATIAPLSAQLPPRVSLGKILSPSRFNLSLSNPIGFSLSPDLLTASSMATWRSHLSVVRLVGHLGSAWVRESTKLF
SQFLAFRVNSVVSK