CuGenDBv2

HG10003496 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10003496
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: Chr08:2306455..2311237
RNA-Seq Expression: HG10003496
Synteny: HG10003496
Gene Ontology terms:
    GO:0006281 - DNA repair (biological process)
    GO:0110165 - cellular anatomical structure (cellular component)
    GO:0003677 - DNA binding (molecular function)
    GO:0004519 - endonuclease activity (molecular function)
InterPro domains:
    IPR020847 - AP endonuclease 1, binding site
    IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
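
The genome location field in this record uses a `chromosome:start..end` string. The following is a minimal Python sketch for splitting such a string and computing the span it covers. The helper names `parse_location` and `span_length` are hypothetical and not part of CuGenDB, and the sketch assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location):
    """Split a 'Chr08:2306455..2311237' style string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1


chrom, start, end = parse_location("Chr08:2306455..2311237")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span for this gene comes out at 4783 bp.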


Homology

GenBank top hits

CAN68838.1 hypothetical protein VITISV_030956 [Vitis vinifera] (e-value: 2.5e-65; % identity: 39.07)
Query:  MKIISWNTKGLNEKSKQTALKHFIQSQHPDLVMIQETKCPEFSQQFIKAIWSSNGIGWTSVEAFGKSGGLLLMWDESKITPLEFLKGGYSLSIKFMTICK
        MKIISWNT+GL  K K+  +K F++S+ PD+VM QETK  E  ++F+ ++W++    W ++ A G SGG+L++WD  K++  E + G +S+SIKF     
Subjt:  MKIISWNTKGLNEKSKQTALKHFIQSQHPDLVMIQETKCPEFSQQFIKAIWSSNGIGWTSVEAFGKSGGLLLMWDESKITPLEFLKGGYSLSIKFMTICK

Query:  NTCWVTNVYGPTNYKERHHLWPKLTSLSHYCTEPWCLGGDFNSTRWIHERSPIGRVTRGMRKFNKFIEEVGLLEIPLSNGKFTWSRSGGSNSHSLIDRFL
         + W++ VYGP N   R  LW +L+ ++   +  WC+GGDFN  R   E+    R+T  M+ F+ FI +  L+++PL +  FTWS    +     +DRFL
Subjt:  NTCWVTNVYGPTNYKERHHLWPKLTSLSHYCTEPWCLGGDFNSTRWIHERSPIGRVTRGMRKFNKFIEEVGLLEIPLSNGKFTWSRSGGSNSHSLIDRFL

Query:  INKEWDDLFDNSRVSRKARIFLDHFPLLLEAGVINWGPSAFRFCNSWMTIIECSKAINQSLDVDQSSGWAGFKINLKLRNSKEILKAW-FANFEQERKRK
         + EW+  F  S      R   DH+P++LE     WGP+ FRF N W+      +   +     Q +GW G K   KL+  K  LK W  A+F +  KRK
Subjt:  INKEWDDLFDNSRVSRKARIFLDHFPLLLEAGVINWGPSAFRFCNSWMTIIECSKAINQSLDVDQSSGWAGFKINLKLRNSKEILKAW-FANFEQERKRK

Query:  EKDLLSELEFFDSKAEREALASFELDIRLAI-KGDLMNLCMLEERNLIQKCKLNWLKLGDENTTFF
        E D+LS L  FDS  E+E   S EL  + AI KG+L  L + EE +  QK ++ W+K GD N+ FF
Subjt:  EKDLLSELEFFDSKAEREALASFELDIRLAI-KGDLMNLCMLEERNLIQKCKLNWLKLGDENTTFF

RVW70235.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] (e-value: 2.5e-65; % identity: 39.07)
Query:  MKIISWNTKGLNEKSKQTALKHFIQSQHPDLVMIQETKCPEFSQQFIKAIWSSNGIGWTSVEAFGKSGGLLLMWDESKITPLEFLKGGYSLSIKFMTICK
        MKIISWNT+GL  K K+  +K F++S+ PD+VM QETK  E  ++F+ ++W++    W ++ A G SGG+L++WD  K++  E + G +S+SIKF     
Subjt:  MKIISWNTKGLNEKSKQTALKHFIQSQHPDLVMIQETKCPEFSQQFIKAIWSSNGIGWTSVEAFGKSGGLLLMWDESKITPLEFLKGGYSLSIKFMTICK

Query:  NTCWVTNVYGPTNYKERHHLWPKLTSLSHYCTEPWCLGGDFNSTRWIHERSPIGRVTRGMRKFNKFIEEVGLLEIPLSNGKFTWSRSGGSNSHSLIDRFL
         + W++ VYGP N   R  LW +L+ ++   +  WC+GGDFN  R   E+    R+T  M+ F+ FI +  L+++PL +  FTWS    +     +DRFL
Subjt:  NTCWVTNVYGPTNYKERHHLWPKLTSLSHYCTEPWCLGGDFNSTRWIHERSPIGRVTRGMRKFNKFIEEVGLLEIPLSNGKFTWSRSGGSNSHSLIDRFL

Query:  INKEWDDLFDNSRVSRKARIFLDHFPLLLEAGVINWGPSAFRFCNSWMTIIECSKAINQSLDVDQSSGWAGFKINLKLRNSKEILKAW-FANFEQERKRK
         + EW+  F  S      R   DH+P++LE     WGP+ FRF N W+      +   +     Q +GW G K   KL+  K  LK W  A+F +  KRK
Subjt:  INKEWDDLFDNSRVSRKARIFLDHFPLLLEAGVINWGPSAFRFCNSWMTIIECSKAINQSLDVDQSSGWAGFKINLKLRNSKEILKAW-FANFEQERKRK

Query:  EKDLLSELEFFDSKAEREALASFELDIRLAI-KGDLMNLCMLEERNLIQKCKLNWLKLGDENTTFF
        E D+LS L  FDS  E+E   S EL  + AI KG+L  L + EE +  QK ++ W+K GD N+ FF
Subjt:  EKDLLSELEFFDSKAEREALASFELDIRLAI-KGDLMNLCMLEERNLIQKCKLNWLKLGDENTTFF

RVX13544.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] (e-value: 2.5e-65; % identity: 39.07)
Query:  MKIISWNTKGLNEKSKQTALKHFIQSQHPDLVMIQETKCPEFSQQFIKAIWSSNGIGWTSVEAFGKSGGLLLMWDESKITPLEFLKGGYSLSIKFMTICK
        MKIISWNT+GL  K K+  +K F++S+ PD+VM QETK  E  ++F+ ++W++    W ++ A G SGG+L++WD  K++  E + G +S+SIKF     
Subjt:  MKIISWNTKGLNEKSKQTALKHFIQSQHPDLVMIQETKCPEFSQQFIKAIWSSNGIGWTSVEAFGKSGGLLLMWDESKITPLEFLKGGYSLSIKFMTICK

Query:  NTCWVTNVYGPTNYKERHHLWPKLTSLSHYCTEPWCLGGDFNSTRWIHERSPIGRVTRGMRKFNKFIEEVGLLEIPLSNGKFTWSRSGGSNSHSLIDRFL
         + W++ VYGP N   R  LW +L+ ++   +  WC+GGDFN  R   E+    R+T  M+ F+ FI +  L+++PL +  FTWS    +     +DRFL
Subjt:  NTCWVTNVYGPTNYKERHHLWPKLTSLSHYCTEPWCLGGDFNSTRWIHERSPIGRVTRGMRKFNKFIEEVGLLEIPLSNGKFTWSRSGGSNSHSLIDRFL

Query:  INKEWDDLFDNSRVSRKARIFLDHFPLLLEAGVINWGPSAFRFCNSWMTIIECSKAINQSLDVDQSSGWAGFKINLKLRNSKEILKAW-FANFEQERKRK
         + EW+  F  S      R   DH+P++LE     WGP+ FRF N W+      +   +     Q +GW G K   KL+  K  LK W  A+F +  KRK
Subjt:  INKEWDDLFDNSRVSRKARIFLDHFPLLLEAGVINWGPSAFRFCNSWMTIIECSKAINQSLDVDQSSGWAGFKINLKLRNSKEILKAW-FANFEQERKRK

Query:  EKDLLSELEFFDSKAEREALASFELDIRLAI-KGDLMNLCMLEERNLIQKCKLNWLKLGDENTTFF
        E D+LS L  FDS  E+E   S EL  + AI KG+L  L + EE +  QK ++ W+K GD N+ FF
Subjt:  EKDLLSELEFFDSKAEREALASFELDIRLAI-KGDLMNLCMLEERNLIQKCKLNWLKLGDENTTFF

TYJ98683.1 hypothetical protein E5676_scaffold429G00120 [Cucumis melo var. makuwa] (e-value: 2.0e-75; % identity: 54.29)
Query:  PD-LVMIQETKCPEFSQQFIKAIWSSNGIGWTSVEAFGKSGGLLLMWDESKITPLEFLKGGYSLSIKFMTICKNTCWVTNVYGPTNYKERHHLWPKLTSL
        PD LV+    +  E     IK++WSS  IGW  VE+FG+ GG+L MWD SKI  +E LKGGYSLSI  +T CK +CW+TNVYGP +Y+ER  +W  L SL
Subjt:  PD-LVMIQETKCPEFSQQFIKAIWSSNGIGWTSVEAFGKSGGLLLMWDESKITPLEFLKGGYSLSIKFMTICKNTCWVTNVYGPTNYKERHHLWPKLTSL

Query:  SHYCTEPWCLGGDFNSTRWIHERSPIGRVTRGMRKFNKFIEEVGLLEIPLSNGKFTWSRSGGSNSHSLIDRFLINKEWDDLFDNSRVSRKARIFLDHFPL
        S YCT  WC+GG  N TRW HE  P+ + TRGMR+FN  I+ + + E+PL NG+ TWSR G S S SL+D F I+KEWD++ +NSRV RKA    DHFPL
Subjt:  SHYCTEPWCLGGDFNSTRWIHERSPIGRVTRGMRKFNKFIEEVGLLEIPLSNGKFTWSRSGGSNSHSLIDRFLINKEWDDLFDNSRVSRKARIFLDHFPL

Query:  LLEAGVINWGPSAFRFCNSWMTIIECSKAINQSLDVDQSSGWAGF
        LLEAG I WGPS FRF NSW+   EC++ I +  ++   + WAGF
Subjt:  LLEAGVINWGPSAFRFCNSWMTIIECSKAINQSLDVDQSSGWAGF

XP_038904301.1 uncharacterized protein LOC120090656 [Benincasa hispida] (e-value: 6.1e-64; % identity: 51.84)
Query:  LWPKLTSLSHYCTEPWCLGGDFNSTRWIHERSPIGRVTRGMRKFNKFIEEVGLLEIPLSNGKFTWSRSGGSNSHSLIDRFLINKEWDDLFDNSRVSRKAR
        +W +L+SL+    +PWC+G +FNS R  HER P+GR TR M  FNKFI    LLE PLSNG+FTWSR G   S SL+D FL++  W+D+FDNSRV+R+AR
Subjt:  LWPKLTSLSHYCTEPWCLGGDFNSTRWIHERSPIGRVTRGMRKFNKFIEEVGLLEIPLSNGKFTWSRSGGSNSHSLIDRFLINKEWDDLFDNSRVSRKAR

Query:  IFLDHFPLLLEAGVINWGPSAFRFCNSWMTIIECSKAINQSLDVDQSSGWAGFKINLKLRNSKEILKAWFANFEQERKRKEKDLLSELEFFDSKAEREAL
           DHFPL LEAG   WGPS+FRFCNSW+   E  K I +SL   ++  WA   ++  LR +K  LK WF  F +E K KE+ LL+EL+  DS     + 
Subjt:  IFLDHFPLLLEAGVINWGPSAFRFCNSWMTIIECSKAINQSLDVDQSSGWAGFKINLKLRNSKEILKAWFANFEQERKRKEKDLLSELEFFDSKAEREAL

Query:  ASFELDIRLAIKGDLMNLCMLEERNLIQKCKLNWLKLGDENTTFF
             D   ++K DL+ L  LEE++LIQKCKL WLK GDENT+FF
Subjt:  ASFELDIRLAIKGDLMNLCMLEERNLIQKCKLNWLKLGDENTTFF

TrEMBL top hits

A0A438G038 Transposon TX1 uncharacterized 149 kDa protein (e-value: 1.9e-63; % identity: 36.81)
Query:  MKIISWNTKGLNEKSKQTALKHFIQSQHPDLVMIQETKCPEFSQQFIKAIWSSNGIGWTSVEAFGKSGGLLLMWDESKITPLEFLKGGYSLSIKFMTICK
        MKIISWNT+GL  K K+  +K+F+ S+ PD+VMIQETK  E  ++ + ++WS     W ++ A G SGG+L++WD  K+   E + G +S+SIKF     
Subjt:  MKIISWNTKGLNEKSKQTALKHFIQSQHPDLVMIQETKCPEFSQQFIKAIWSSNGIGWTSVEAFGKSGGLLLMWDESKITPLEFLKGGYSLSIKFMTICK

Query:  NTCWVTNVYGPTNYKERHHLWPKLTSLSHYCTEPWCLGGDFNSTRWIHERSPIGRVTRGMRKFNKFIEEVGLLEIPLSNGKFTWSRSGGSNSHSLIDRFL
         + W++ VYGP N   R   W +L+ ++      WC+GGDFN  R   E+    R+T  M+ F++FI +  L++ PL +  +TWS    +     +DRFL
Subjt:  NTCWVTNVYGPTNYKERHHLWPKLTSLSHYCTEPWCLGGDFNSTRWIHERSPIGRVTRGMRKFNKFIEEVGLLEIPLSNGKFTWSRSGGSNSHSLIDRFL

Query:  INKEWDDLFDNSRVSRKARIFLDHFPLLLEAGVINWGPSAFRFCNSWMTIIECSKAINQSLDVDQSSGWAGFKINLKLRNSKEILKAWFANFEQERKRKE
         + EW+ +F  S      R   DH+P++LE     WGP+ FRF N W+      +   +     Q +GW G K   KL+  K  LK W      E  +K+
Subjt:  INKEWDDLFDNSRVSRKARIFLDHFPLLLEAGVINWGPSAFRFCNSWMTIIECSKAINQSLDVDQSSGWAGFKINLKLRNSKEILKAWFANFEQERKRKE

Query:  KDLLSELEFFDSKAEREALASFELDIRLAIKGDLMNLCMLEERNLIQKCKLNWLKLGDENTTFF
        KD+L+ L  FDS  +   L+   L  R   KG+L  L + EE +  QK ++ W+K GD N+ FF
Subjt:  KDLLSELEFFDSKAEREALASFELDIRLAIKGDLMNLCMLEERNLIQKCKLNWLKLGDENTTFF

A0A438GDE7 LINE-1 retrotransposable element ORF2 protein (e-value: 1.2e-65; % identity: 39.07)
Query:  MKIISWNTKGLNEKSKQTALKHFIQSQHPDLVMIQETKCPEFSQQFIKAIWSSNGIGWTSVEAFGKSGGLLLMWDESKITPLEFLKGGYSLSIKFMTICK
        MKIISWNT+GL  K K+  +K F++S+ PD+VM QETK  E  ++F+ ++W++    W ++ A G SGG+L++WD  K++  E + G +S+SIKF     
Subjt:  MKIISWNTKGLNEKSKQTALKHFIQSQHPDLVMIQETKCPEFSQQFIKAIWSSNGIGWTSVEAFGKSGGLLLMWDESKITPLEFLKGGYSLSIKFMTICK

Query:  NTCWVTNVYGPTNYKERHHLWPKLTSLSHYCTEPWCLGGDFNSTRWIHERSPIGRVTRGMRKFNKFIEEVGLLEIPLSNGKFTWSRSGGSNSHSLIDRFL
         + W++ VYGP N   R  LW +L+ ++   +  WC+GGDFN  R   E+    R+T  M+ F+ FI +  L+++PL +  FTWS    +     +DRFL
Subjt:  NTCWVTNVYGPTNYKERHHLWPKLTSLSHYCTEPWCLGGDFNSTRWIHERSPIGRVTRGMRKFNKFIEEVGLLEIPLSNGKFTWSRSGGSNSHSLIDRFL

Query:  INKEWDDLFDNSRVSRKARIFLDHFPLLLEAGVINWGPSAFRFCNSWMTIIECSKAINQSLDVDQSSGWAGFKINLKLRNSKEILKAW-FANFEQERKRK
         + EW+  F  S      R   DH+P++LE     WGP+ FRF N W+      +   +     Q +GW G K   KL+  K  LK W  A+F +  KRK
Subjt:  INKEWDDLFDNSRVSRKARIFLDHFPLLLEAGVINWGPSAFRFCNSWMTIIECSKAINQSLDVDQSSGWAGFKINLKLRNSKEILKAW-FANFEQERKRK

Query:  EKDLLSELEFFDSKAEREALASFELDIRLAI-KGDLMNLCMLEERNLIQKCKLNWLKLGDENTTFF
        E D+LS L  FDS  E+E   S EL  + AI KG+L  L + EE +  QK ++ W+K GD N+ FF
Subjt:  EKDLLSELEFFDSKAEREALASFELDIRLAI-KGDLMNLCMLEERNLIQKCKLNWLKLGDENTTFF

A0A438JX47 LINE-1 retrotransposable element ORF2 protein (e-value: 1.2e-65; % identity: 39.07)
Query:  MKIISWNTKGLNEKSKQTALKHFIQSQHPDLVMIQETKCPEFSQQFIKAIWSSNGIGWTSVEAFGKSGGLLLMWDESKITPLEFLKGGYSLSIKFMTICK
        MKIISWNT+GL  K K+  +K F++S+ PD+VM QETK  E  ++F+ ++W++    W ++ A G SGG+L++WD  K++  E + G +S+SIKF     
Subjt:  MKIISWNTKGLNEKSKQTALKHFIQSQHPDLVMIQETKCPEFSQQFIKAIWSSNGIGWTSVEAFGKSGGLLLMWDESKITPLEFLKGGYSLSIKFMTICK

Query:  NTCWVTNVYGPTNYKERHHLWPKLTSLSHYCTEPWCLGGDFNSTRWIHERSPIGRVTRGMRKFNKFIEEVGLLEIPLSNGKFTWSRSGGSNSHSLIDRFL
         + W++ VYGP N   R  LW +L+ ++   +  WC+GGDFN  R   E+    R+T  M+ F+ FI +  L+++PL +  FTWS    +     +DRFL
Subjt:  NTCWVTNVYGPTNYKERHHLWPKLTSLSHYCTEPWCLGGDFNSTRWIHERSPIGRVTRGMRKFNKFIEEVGLLEIPLSNGKFTWSRSGGSNSHSLIDRFL

Query:  INKEWDDLFDNSRVSRKARIFLDHFPLLLEAGVINWGPSAFRFCNSWMTIIECSKAINQSLDVDQSSGWAGFKINLKLRNSKEILKAW-FANFEQERKRK
         + EW+  F  S      R   DH+P++LE     WGP+ FRF N W+      +   +     Q +GW G K   KL+  K  LK W  A+F +  KRK
Subjt:  INKEWDDLFDNSRVSRKARIFLDHFPLLLEAGVINWGPSAFRFCNSWMTIIECSKAINQSLDVDQSSGWAGFKINLKLRNSKEILKAW-FANFEQERKRK

Query:  EKDLLSELEFFDSKAEREALASFELDIRLAI-KGDLMNLCMLEERNLIQKCKLNWLKLGDENTTFF
        E D+LS L  FDS  E+E   S EL  + AI KG+L  L + EE +  QK ++ W+K GD N+ FF
Subjt:  EKDLLSELEFFDSKAEREALASFELDIRLAI-KGDLMNLCMLEERNLIQKCKLNWLKLGDENTTFF

A0A5D3BHE3 Uncharacterized protein (e-value: 9.8e-76; % identity: 54.29)
Query:  PD-LVMIQETKCPEFSQQFIKAIWSSNGIGWTSVEAFGKSGGLLLMWDESKITPLEFLKGGYSLSIKFMTICKNTCWVTNVYGPTNYKERHHLWPKLTSL
        PD LV+    +  E     IK++WSS  IGW  VE+FG+ GG+L MWD SKI  +E LKGGYSLSI  +T CK +CW+TNVYGP +Y+ER  +W  L SL
Subjt:  PD-LVMIQETKCPEFSQQFIKAIWSSNGIGWTSVEAFGKSGGLLLMWDESKITPLEFLKGGYSLSIKFMTICKNTCWVTNVYGPTNYKERHHLWPKLTSL

Query:  SHYCTEPWCLGGDFNSTRWIHERSPIGRVTRGMRKFNKFIEEVGLLEIPLSNGKFTWSRSGGSNSHSLIDRFLINKEWDDLFDNSRVSRKARIFLDHFPL
        S YCT  WC+GG  N TRW HE  P+ + TRGMR+FN  I+ + + E+PL NG+ TWSR G S S SL+D F I+KEWD++ +NSRV RKA    DHFPL
Subjt:  SHYCTEPWCLGGDFNSTRWIHERSPIGRVTRGMRKFNKFIEEVGLLEIPLSNGKFTWSRSGGSNSHSLIDRFLINKEWDDLFDNSRVSRKARIFLDHFPL

Query:  LLEAGVINWGPSAFRFCNSWMTIIECSKAINQSLDVDQSSGWAGF
        LLEAG I WGPS FRF NSW+   EC++ I +  ++   + WAGF
Subjt:  LLEAGVINWGPSAFRFCNSWMTIIECSKAINQSLDVDQSSGWAGF

A5CAA2 Reverse transcriptase domain-containing protein (e-value: 1.2e-65; % identity: 39.07)
Query:  MKIISWNTKGLNEKSKQTALKHFIQSQHPDLVMIQETKCPEFSQQFIKAIWSSNGIGWTSVEAFGKSGGLLLMWDESKITPLEFLKGGYSLSIKFMTICK
        MKIISWNT+GL  K K+  +K F++S+ PD+VM QETK  E  ++F+ ++W++    W ++ A G SGG+L++WD  K++  E + G +S+SIKF     
Subjt:  MKIISWNTKGLNEKSKQTALKHFIQSQHPDLVMIQETKCPEFSQQFIKAIWSSNGIGWTSVEAFGKSGGLLLMWDESKITPLEFLKGGYSLSIKFMTICK

Query:  NTCWVTNVYGPTNYKERHHLWPKLTSLSHYCTEPWCLGGDFNSTRWIHERSPIGRVTRGMRKFNKFIEEVGLLEIPLSNGKFTWSRSGGSNSHSLIDRFL
         + W++ VYGP N   R  LW +L+ ++   +  WC+GGDFN  R   E+    R+T  M+ F+ FI +  L+++PL +  FTWS    +     +DRFL
Subjt:  NTCWVTNVYGPTNYKERHHLWPKLTSLSHYCTEPWCLGGDFNSTRWIHERSPIGRVTRGMRKFNKFIEEVGLLEIPLSNGKFTWSRSGGSNSHSLIDRFL

Query:  INKEWDDLFDNSRVSRKARIFLDHFPLLLEAGVINWGPSAFRFCNSWMTIIECSKAINQSLDVDQSSGWAGFKINLKLRNSKEILKAW-FANFEQERKRK
         + EW+  F  S      R   DH+P++LE     WGP+ FRF N W+      +   +     Q +GW G K   KL+  K  LK W  A+F +  KRK
Subjt:  INKEWDDLFDNSRVSRKARIFLDHFPLLLEAGVINWGPSAFRFCNSWMTIIECSKAINQSLDVDQSSGWAGFKINLKLRNSKEILKAW-FANFEQERKRK

Query:  EKDLLSELEFFDSKAEREALASFELDIRLAI-KGDLMNLCMLEERNLIQKCKLNWLKLGDENTTFF
        E D+LS L  FDS  E+E   S EL  + AI KG+L  L + EE +  QK ++ W+K GD N+ FF
Subjt:  EKDLLSELEFFDSKAEREALASFELDIRLAI-KGDLMNLCMLEERNLIQKCKLNWLKLGDENTTFF

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences

CDS sequence
ATGCTCAAGGACCCAATACAAGCTTTCTACTCTGAAAAAATCAAAACCGACTGTGGTATTTCTCGGCTTGCTAAGTTTCGTGCTAATGAAGGTTGGTTTGTAGAATATGC
CTTTTGGTCTTCTTCGGGTGGTAGAAGAAACATTCATATTCCGGCAGGGGAAAATAAACAAGGTTGGCAATCTTTTCTCTCAATGCTCAAGGAAACTCTGATACTTAGTA
CAATTAATTCACAAGACAACAAAGACGGCATAGTTGGACCTACAAGAAACAGAGCTTCTACAAAATCATATGCAGACGTCGTAGGTTCAGAATTTGGGAGAATAAGTCGA
GATAAACCAGCGAATTTGAAGGAAAATAAGTTAGAATACTGGTCCAAAATGAACCATAGTCTGCCAGAAGTCACAAATTGCTATGAGGGATGGATTTACATAAAGAACCT
ACCATTGCCTTTCTGGAAAAATTCAGTTTTTGAAGCTATTGGTGATCATTTGGGAGGATTAATGGAAATTTCTTCAAAAACATTAAATTGGTTAGATTGTTCATCAGAAT
TAATCAAAGTTCAAAATAACATATGTGGTTTCATCCCGGCATCCTTATCCATTGAAGATTCAAACTTGGTGAAACTCTTGATTCATTTCGAAGGCAAAGATGCTTTGGTT
GATCAGAGGAAAAAAATTCCCAACAAGAGCAGTTTTATGGCAAGCGACTTTTCAAACTCCCTAGACATCATTCGAATTAAAGCCGTTATGATTGATGAAGATTTTTCAAC
GGAGATGCTTGAGAAAGGATCAGAAATTGGAAAAGAAGATGAAGAGTTGCATTTAGATACAAATGAGATTCTCCTAAACGAAATTACATTGGGAATCGGAATGGGTGGCC
TGGTATTAAAAAATACAGCAAGCTCCTTGGATAATGAAAAGGCCAAACTAATTCCTGCTGAAGCATTTAATCCTCCTATTTCTCATCTCTCTTCAGTTTTAATGAGGAAT
GAAGCACATACCTCATTGGATAAAGGAAAAGGGCCGATTCATACACCCTCTTCTTTTATTAATGAGTGCAATTTTCAAGAAGGAATTTCAAGGGTCACTTTAGTTCAGCA
ACCGTTTGAAGCAGACTCAGAAATTGAACAGCCACTCTCCCACGCATGCCCGACTCAAACTCCATCCCCAAACTCTCAATTACCAAACACCGAGCTGGAAGTGGAAGAGG
CCAATCAAGATTTTCAGGAAAATTTTAATGAGTTATTAAATATACAAAATAATATTGACTCAACCACAAATTTTGGACATCAGACAAGAAAATCCTCAGTTGAGTCATTG
TTAAACCACTCTACAAGCCCAGATTTTTTGGAAGAAATTTGTGTTCAATCTCTGGTTCCTCAAAGTACAAGTCCTGTAAAGAAAAGACACTCTTCAGAGGGAGGTGCCTC
AAGTATGAAGATCATTTCTTGGAACACCAAGGGCCTCAATGAAAAATCCAAGCAGACAGCCTTGAAACATTTTATCCAAAGTCAACATCCAGACTTGGTAATGATTCAAG
AAACCAAATGTCCAGAATTTAGCCAACAATTCATTAAGGCAATATGGAGTTCTAATGGAATCGGATGGACCTCGGTAGAAGCTTTTGGTAAATCTGGTGGCTTGCTCCTT
ATGTGGGATGAAAGCAAAATTACTCCTCTGGAATTCCTCAAAGGCGGTTATTCACTCTCAATTAAATTCATGACTATTTGTAAGAACACATGTTGGGTGACAAATGTCTA
TGGACCTACAAATTACAAAGAACGACATCATCTTTGGCCCAAACTCACATCCCTATCTCATTACTGCACTGAACCATGGTGTTTAGGGGGAGATTTCAACAGCACTAGAT
GGATTCACGAGAGGTCTCCTATTGGGAGAGTCACAAGAGGAATGAGGAAATTCAACAAGTTCATAGAGGAAGTAGGGCTGCTGGAAATTCCCCTTTCTAATGGTAAATTC
ACTTGGTCACGAAGTGGAGGCTCAAATTCCCACTCCCTTATTGATCGATTCTTAATTAATAAGGAATGGGATGATTTATTCGATAATTCCAGAGTTAGTAGAAAAGCAAG
AATATTCTTAGATCATTTTCCTTTGTTACTAGAAGCAGGAGTAATTAATTGGGGCCCTTCCGCATTTCGATTTTGCAACAGTTGGATGACTATTATAGAATGTTCAAAGG
CCATCAATCAGTCCTTGGATGTTGATCAGTCTAGTGGATGGGCAGGTTTCAAAATCAATTTAAAGCTGCGCAATTCGAAAGAAATTTTAAAGGCTTGGTTTGCTAATTTT
GAACAGGAAAGGAAGCGCAAGGAGAAGGATTTGCTTTCAGAGCTAGAATTCTTTGACTCTAAGGCTGAAAGAGAAGCTCTCGCATCGTTTGAACTAGATATTCGTCTTGC
CATTAAAGGGGATCTGATGAACTTGTGCATGTTGGAAGAAAGAAATTTAATCCAAAAATGTAAGTTGAATTGGCTTAAGTTGGGAGATGAAAATACTACTTTTTTTTCCA
CAGATTTCTTTCAGCAAAGAGGAGGAGAAATTTGA

mRNA sequence
ATGCTCAAGGACCCAATACAAGCTTTCTACTCTGAAAAAATCAAAACCGACTGTGGTATTTCTCGGCTTGCTAAGTTTCGTGCTAATGAAGGTTGGTTTGTAGAATATGC
CTTTTGGTCTTCTTCGGGTGGTAGAAGAAACATTCATATTCCGGCAGGGGAAAATAAACAAGGTTGGCAATCTTTTCTCTCAATGCTCAAGGAAACTCTGATACTTAGTA
CAATTAATTCACAAGACAACAAAGACGGCATAGTTGGACCTACAAGAAACAGAGCTTCTACAAAATCATATGCAGACGTCGTAGGTTCAGAATTTGGGAGAATAAGTCGA
GATAAACCAGCGAATTTGAAGGAAAATAAGTTAGAATACTGGTCCAAAATGAACCATAGTCTGCCAGAAGTCACAAATTGCTATGAGGGATGGATTTACATAAAGAACCT
ACCATTGCCTTTCTGGAAAAATTCAGTTTTTGAAGCTATTGGTGATCATTTGGGAGGATTAATGGAAATTTCTTCAAAAACATTAAATTGGTTAGATTGTTCATCAGAAT
TAATCAAAGTTCAAAATAACATATGTGGTTTCATCCCGGCATCCTTATCCATTGAAGATTCAAACTTGGTGAAACTCTTGATTCATTTCGAAGGCAAAGATGCTTTGGTT
GATCAGAGGAAAAAAATTCCCAACAAGAGCAGTTTTATGGCAAGCGACTTTTCAAACTCCCTAGACATCATTCGAATTAAAGCCGTTATGATTGATGAAGATTTTTCAAC
GGAGATGCTTGAGAAAGGATCAGAAATTGGAAAAGAAGATGAAGAGTTGCATTTAGATACAAATGAGATTCTCCTAAACGAAATTACATTGGGAATCGGAATGGGTGGCC
TGGTATTAAAAAATACAGCAAGCTCCTTGGATAATGAAAAGGCCAAACTAATTCCTGCTGAAGCATTTAATCCTCCTATTTCTCATCTCTCTTCAGTTTTAATGAGGAAT
GAAGCACATACCTCATTGGATAAAGGAAAAGGGCCGATTCATACACCCTCTTCTTTTATTAATGAGTGCAATTTTCAAGAAGGAATTTCAAGGGTCACTTTAGTTCAGCA
ACCGTTTGAAGCAGACTCAGAAATTGAACAGCCACTCTCCCACGCATGCCCGACTCAAACTCCATCCCCAAACTCTCAATTACCAAACACCGAGCTGGAAGTGGAAGAGG
CCAATCAAGATTTTCAGGAAAATTTTAATGAGTTATTAAATATACAAAATAATATTGACTCAACCACAAATTTTGGACATCAGACAAGAAAATCCTCAGTTGAGTCATTG
TTAAACCACTCTACAAGCCCAGATTTTTTGGAAGAAATTTGTGTTCAATCTCTGGTTCCTCAAAGTACAAGTCCTGTAAAGAAAAGACACTCTTCAGAGGGAGGTGCCTC
AAGTATGAAGATCATTTCTTGGAACACCAAGGGCCTCAATGAAAAATCCAAGCAGACAGCCTTGAAACATTTTATCCAAAGTCAACATCCAGACTTGGTAATGATTCAAG
AAACCAAATGTCCAGAATTTAGCCAACAATTCATTAAGGCAATATGGAGTTCTAATGGAATCGGATGGACCTCGGTAGAAGCTTTTGGTAAATCTGGTGGCTTGCTCCTT
ATGTGGGATGAAAGCAAAATTACTCCTCTGGAATTCCTCAAAGGCGGTTATTCACTCTCAATTAAATTCATGACTATTTGTAAGAACACATGTTGGGTGACAAATGTCTA
TGGACCTACAAATTACAAAGAACGACATCATCTTTGGCCCAAACTCACATCCCTATCTCATTACTGCACTGAACCATGGTGTTTAGGGGGAGATTTCAACAGCACTAGAT
GGATTCACGAGAGGTCTCCTATTGGGAGAGTCACAAGAGGAATGAGGAAATTCAACAAGTTCATAGAGGAAGTAGGGCTGCTGGAAATTCCCCTTTCTAATGGTAAATTC
ACTTGGTCACGAAGTGGAGGCTCAAATTCCCACTCCCTTATTGATCGATTCTTAATTAATAAGGAATGGGATGATTTATTCGATAATTCCAGAGTTAGTAGAAAAGCAAG
AATATTCTTAGATCATTTTCCTTTGTTACTAGAAGCAGGAGTAATTAATTGGGGCCCTTCCGCATTTCGATTTTGCAACAGTTGGATGACTATTATAGAATGTTCAAAGG
CCATCAATCAGTCCTTGGATGTTGATCAGTCTAGTGGATGGGCAGGTTTCAAAATCAATTTAAAGCTGCGCAATTCGAAAGAAATTTTAAAGGCTTGGTTTGCTAATTTT
GAACAGGAAAGGAAGCGCAAGGAGAAGGATTTGCTTTCAGAGCTAGAATTCTTTGACTCTAAGGCTGAAAGAGAAGCTCTCGCATCGTTTGAACTAGATATTCGTCTTGC
CATTAAAGGGGATCTGATGAACTTGTGCATGTTGGAAGAAAGAAATTTAATCCAAAAATGTAAGTTGAATTGGCTTAAGTTGGGAGATGAAAATACTACTTTTTTTTCCA
CAGATTTCTTTCAGCAAAGAGGAGGAGAAATTTGA

Protein sequence
MLKDPIQAFYSEKIKTDCGISRLAKFRANEGWFVEYAFWSSSGGRRNIHIPAGENKQGWQSFLSMLKETLILSTINSQDNKDGIVGPTRNRASTKSYADVVGSEFGRISR
DKPANLKENKLEYWSKMNHSLPEVTNCYEGWIYIKNLPLPFWKNSVFEAIGDHLGGLMEISSKTLNWLDCSSELIKVQNNICGFIPASLSIEDSNLVKLLIHFEGKDALV
DQRKKIPNKSSFMASDFSNSLDIIRIKAVMIDEDFSTEMLEKGSEIGKEDEELHLDTNEILLNEITLGIGMGGLVLKNTASSLDNEKAKLIPAEAFNPPISHLSSVLMRN
EAHTSLDKGKGPIHTPSSFINECNFQEGISRVTLVQQPFEADSEIEQPLSHACPTQTPSPNSQLPNTELEVEEANQDFQENFNELLNIQNNIDSTTNFGHQTRKSSVESL
LNHSTSPDFLEEICVQSLVPQSTSPVKKRHSSEGGASSMKIISWNTKGLNEKSKQTALKHFIQSQHPDLVMIQETKCPEFSQQFIKAIWSSNGIGWTSVEAFGKSGGLLL
MWDESKITPLEFLKGGYSLSIKFMTICKNTCWVTNVYGPTNYKERHHLWPKLTSLSHYCTEPWCLGGDFNSTRWIHERSPIGRVTRGMRKFNKFIEEVGLLEIPLSNGKF
TWSRSGGSNSHSLIDRFLINKEWDDLFDNSRVSRKARIFLDHFPLLLEAGVINWGPSAFRFCNSWMTIIECSKAINQSLDVDQSSGWAGFKINLKLRNSKEILKAWFANF
EQERKRKEKDLLSELEFFDSKAEREALASFELDIRLAIKGDLMNLCMLEERNLIQKCKLNWLKLGDENTTFFSTDFFQQRGGEI
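
The CDS and protein sequences listed in this record can be cross-checked against each other by translating the CDS. The following is a minimal Python sketch that assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly. `translate` is a hypothetical helper written for illustration, not a CuGenDB function. The example input is the first 24 nucleotides of the CDS.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nucleotides of the CDS listed in this record.
print(translate("ATGCTCAAGGACCCAATACAAGCT"))  # MLKDPIQA
```

The printed peptide matches the first eight residues of the protein sequence. Passing the full CDS, with its line breaks, to the same function allows the whole protein to be compared.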