; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc11g29510 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc11g29510
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr11:21529525..21541297
RNA-Seq ExpressionMoc11g29510
SyntenyMoc11g29510
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAM18736.1 putative reverse transcriptase [Oryza sativa Japonica Group]4.2e-4626.66Show/hide
Query:  NHPMKKVHGEKECDRILKLGPWSFDKALLILQKPPKMVKLSELIFNFVSFWTHFYDLPLECMTKAMVQRLGNVVGVCDEVDSDADGLCWGESLRVKVRYD
        NH +   H      R L+ GPW F+K L+++    +   + ++IF FV  W     LPL  MTK     +G  VG    +D + DG   G+ LR+K+R D
Subjt:  NHPMKKVHGEKECDRILKLGPWSFDKALLILQKPPKMVKLSELIFNFVSFWTHFYDLPLECMTKAMVQRLGNVVGVCDEVDSDADGLCWGESLRVKVRYD

Query:  ITQPIRRGLKINIEGPMGGCWIPMTYESLPDYCYHYGLIGHLVKECTDGLTSME-----RTKGYVAPYGAW----------LCFQSTMKAKINFRRKSST
        I +P+ RG+ + +       W P+ YE LPD+CY  G++GH  K C   L   E     ++  ++     W            FQ   K+  +  + S  
Subjt:  ITQPIRRGLKINIEGPMGGCWIPMTYESLPDYCYHYGLIGHLVKECTDGLTSME-----RTKGYVAPYGAW----------LCFQSTMKAKINFRRKSST

Query:  PSFSSDRSIPGKGHDEELKGQSQNSG----GREAERDEIPMASL-DGMEGAVFKGHLLGE-------------IDVLPDPKGFLNGGRDGNQDLFHKGNN
        PS+  + +   KG ++  +G+ + +     G  AER + P  SL    +GA  +    GE             ID +PD +G ++GG  G   +  K   
Subjt:  PSFSSDRSIPGKGHDEELKGQSQNSG----GREAERDEIPMASL-DGMEGAVFKGHLLGE-------------IDVLPDPKGFLNGGRDGNQDLFHKGNN

Query:  GKSLAFVPNDFSNFEVDGTGSLQIEVFSPKSVQQWKRKARA-----------QSVFRADPFLKSKCSSEAFNKLKARLHYFGCFVVEAWCMEGDLNEILS
         K+   +P +    +       + E   P+     KR+  A           ++     P L     SE  ++    +          W M GD NEIL 
Subjt:  GKSLAFVPNDFSNFEVDGTGSLQIEVFSPKSVQQWKRKARA-----------QSVFRADPFLKSKCSSEAFNKLKARLHYFGCFVVEAWCMEGDLNEILS

Query:  LEEKEDGSTKIQAEMDRFREAVDDCYLQDLGFSGNTFTWCNR-RKERDQIYERLDRFLGNEAFQSIWTELRVTHQDWAKSDHRPILLSLFETCDRPIQRR
          EK+ G  K Q+ MD FR A+ DC L DLGF G+ FTW N    +   I E LDR + N  +++++   RV + D   SDHRP+++ L E  ++ ++ R
Subjt:  LEEKEDGSTKIQAEMDRFREAVDDCYLQDLGFSGNTFTWCNR-RKERDQIYERLDRFLGNEAFQSIWTELRVTHQDWAKSDHRPILLSLFETCDRPIQRR

Query:  ---RVFKCEEVWIRDNTCGTIINEVGDWGDDQNNQRLEACLRKFTLALPLGILSLSTNGKKNSILCLKQRKFTEDKDLEKIGCEDMDLVEKRFISYYEDI
             F+ E  W+ +     ++ E  D      +  L+  L   +LA   G+ +  ++   N +  L++R     K+LE   C    +   + +   E++
Subjt:  ---RVFKCEEVWIRDNTCGTIINEVGDWGDDQNNQRLEACLRKFTLALPLGILSLSTNGKKNSILCLKQRKFTEDKDLEKIGCEDMDLVEKRFISYYEDI

Query:  FSFSRPTDDDMARILQNVPFSVTEEMNRKLLAPF
          + R    +   +L  V   V+  MN  L A F
Subjt:  FSFSRPTDDDMARILQNVPFSVTEEMNRKLLAPF

EEE50824.1 hypothetical protein OsJ_31232 [Oryza sativa Japonica Group]2.7e-4526.77Show/hide
Query:  RILKLGPWSFDKALLILQKPPKMVKLSELIFNFVSFWTHFYDLPLECMTKAMVQRLGNVVGVCDEVDSDADGLCWGESLRVKVRYDITQPIRRGLKINIE
        R L+ GPW F+K L+++    +   + ++IF FV  W     LPL  MTK     +G  VG    +D + DG   G+ LR+K+R DI +P+ RG+ + + 
Subjt:  RILKLGPWSFDKALLILQKPPKMVKLSELIFNFVSFWTHFYDLPLECMTKAMVQRLGNVVGVCDEVDSDADGLCWGESLRVKVRYDITQPIRRGLKINIE

Query:  GPMGGCWIPMTYESLPDYCYHYGLIGHLVKECTDGLTSME-----RTKGYVAPYGAW----------LCFQSTMKAKINFRRKSSTPSFSSDRSIPGKGH
              W P+ YE LPD+CY  G++GH  K C   L   E     ++  ++     W            FQ   K+  +  + S  PS+  + +   KG 
Subjt:  GPMGGCWIPMTYESLPDYCYHYGLIGHLVKECTDGLTSME-----RTKGYVAPYGAW----------LCFQSTMKAKINFRRKSSTPSFSSDRSIPGKGH

Query:  DEELKGQSQNSG----GREAERDEIPMASL-DGMEGAVFKGHLLGE-------------IDVLPDPKGFLNGGRDGNQDLFHKGNNGKSLAFVPNDFSNF
        ++  +G+ + +     G  AER + P  SL    +GA  +    GE             ID +PD +G ++GG  G   +  K    K+   +P +    
Subjt:  DEELKGQSQNSG----GREAERDEIPMASL-DGMEGAVFKGHLLGE-------------IDVLPDPKGFLNGGRDGNQDLFHKGNNGKSLAFVPNDFSNF

Query:  EVDGTGSLQIEVFSPKSVQQWKRKARA-----------QSVFRADPFLKSKCSSEAFNKLKARLHYFGCFVVEAWCMEGDLNEILSLEEKEDGSTKIQAE
        +       + E   P+     KR+  A           ++     P L     SE  ++    +          W M GD NEIL   EK+ G  K Q+ 
Subjt:  EVDGTGSLQIEVFSPKSVQQWKRKARA-----------QSVFRADPFLKSKCSSEAFNKLKARLHYFGCFVVEAWCMEGDLNEILSLEEKEDGSTKIQAE

Query:  MDRFREAVDDCYLQDLGFSGNTFTWCNR-RKERDQIYERLDRFLGNEAFQSIWTELRVTHQDWAKSDHRPILLSLFETCDRPIQRR---RVFKCEEVWIR
        MD FR A+ DC L DLGF G+ FTW N    +   I E LDR + N  +++++   RV + D   SDHRP+++ L E  ++ ++ R     F+ E  W+ 
Subjt:  MDRFREAVDDCYLQDLGFSGNTFTWCNR-RKERDQIYERLDRFLGNEAFQSIWTELRVTHQDWAKSDHRPILLSLFETCDRPIQRR---RVFKCEEVWIR

Query:  DNTCGTIINEVGDWGDDQNNQRLEACLRKFTLALPLGILSLSTNGKKNSILCLKQRKFTEDKDLEKIGCEDMDLVEKRFISYYEDIFSFSRPTDDDMARI
        +     ++ E  D      +  L+  L   +LA   G+ +  ++   N +  L++R     K+LE   C    +   + +   E++  + R    +   +
Subjt:  DNTCGTIINEVGDWGDDQNNQRLEACLRKFTLALPLGILSLSTNGKKNSILCLKQRKFTEDKDLEKIGCEDMDLVEKRFISYYEDIFSFSRPTDDDMARI

Query:  LQNVPFSVTEEMNRKLLAPF
        L  V   V+  MN  L A F
Subjt:  LQNVPFSVTEEMNRKLLAPF

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]5.1e-4441.78Show/hide
Query:  HGEKECD--RILKLGPWSFDKALLILQKPPKMVKLSELIFNFVSFWTHFYDLPLECMTKAMVQRLGNVVGVCDEVDSDADGLCWGESLRVKVRYDITQPI
        H  +ECD  R++K GPW FDKAL++LQKP     +SEL FN V+FW H +DLP+  + K M  RLGN +G   +VD +  G  WG SLR++V  DIT+P+
Subjt:  HGEKECD--RILKLGPWSFDKALLILQKPPKMVKLSELIFNFVSFWTHFYDLPLECMTKAMVQRLGNVVGVCDEVDSDADGLCWGESLRVKVRYDITQPI

Query:  RRGLKINIEGPMGGCWIPMTYESLPDYCYHYGLIGHLVKECTDGLTSMERTKGYVAPYGAWLCFQSTMKAKINFRRKSSTPSFSSDRSIPGKGHDEELKG
        RRG+KINI+GPMGGCWIP+ YE LPD+CY  G+IGH   +C     + +      + YG WL F  + KA     RK  +P+            ++    
Subjt:  RRGLKINIEGPMGGCWIPMTYESLPDYCYHYGLIGHLVKECTDGLTSMERTKGYVAPYGAWLCFQSTMKAKINFRRKSSTPSFSSDRSIPGKGHDEELKG

Query:  QSQNSGGREAERDEIPMASLDGMEG
         S NS  R  E  +  ++     +G
Subjt:  QSQNSGGREAERDEIPMASLDGMEG

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]7.6e-4831.97Show/hide
Query:  DRILKLGPWSFDKALLILQKPPKMVKLSELIFNFVSFWTHFYDLPLECMTKAMVQRLGNVVGVCDEVDSDADGLCWGESLRVKVRYDITQPIRRGLKINI
        ++I K GPW+FD+ L+++ KP  ++  SEL F  +  W  F+DLPL C+T+ M  RLGN +G  +E D D     WG +LRV+V  DI++P+RRG+K+N+
Subjt:  DRILKLGPWSFDKALLILQKPPKMVKLSELIFNFVSFWTHFYDLPLECMTKAMVQRLGNVVGVCDEVDSDADGLCWGESLRVKVRYDITQPIRRGLKINI

Query:  EGPMGGCWIPMTYESLPDYCYHYGLIGHLVKECTDGLTSMERTKGYVAPYGAWLCFQSTMKAKI--------NFRRKSSTPSFSSDRSIPGKGHDEELKG
        +GP+GG WIP+ YE LPD+CYH GL             S  R K     YG+WL +Q T+K  +        +   KS   SFSS  S  G G       
Subjt:  EGPMGGCWIPMTYESLPDYCYHYGLIGHLVKECTDGLTSMERTKGYVAPYGAWLCFQSTMKAKI--------NFRRKSSTPSFSSDRSIPGKGHDEELKG

Query:  QSQNSGGREAERDEIPMASLDGMEGAVFKGHLLGEIDVLPDPKGFLNGGRDGNQDLFHKGNNGKSLAFVPNDFSNFEVDGTGSLQIEVFSPKSVQQWKRK
        QS  + G  A    IPM                 E  V   PK    G     Q        GKS   +        V    +L   +   KS     + 
Subjt:  QSQNSGGREAERDEIPMASLDGMEGAVFKGHLLGEIDVLPDPKGFLNGGRDGNQDLFHKGNNGKSLAFVPNDFSNFEVDGTGSLQIEVFSPKSVQQWKRK

Query:  ARAQSVFRAD--PFLKSKCSSEAFNKLKARLHY-------FGCFVVEAWCMEGDLNEILSLEEKEDGSTKIQAEMDRFREAVDDCYLQDLGFSGNTFTWC
        + + S+ R D  P    + +    +    + H                W + GD+N IL   E    S+   ++++ FR  +D C L D+GF G  FTWC
Subjt:  ARAQSVFRAD--PFLKSKCSSEAFNKLKARLHY-------FGCFVVEAWCMEGDLNEILSLEEKEDGSTKIQAEMDRFREAVDDCYLQDLGFSGNTFTWC

Query:  NRRKERDQIYERLDRFLGNEAFQSIWTELRVTHQDWAKSDH
        N R   DQ+++RLDRFL N+ F  ++ +       W+ + H
Subjt:  NRRKERDQIYERLDRFLGNEAFQSIWTELRVTHQDWAKSDH

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]2.7e-0539.02Show/hide
Query:  RACHNAIPIMMNLQRRGMEVLPLCLVCLKKEESVDHALVSCKRARQIWDCLLPNM-LRDGRSTIDFIDRWMAWELLSETIDL
        R+ H  IP   NL  RG+  LP C +C  + ES+ HA   CKRARQIW  L P +        I F++ W +     E  DL
Subjt:  RACHNAIPIMMNLQRRGMEVLPLCLVCLKKEESVDHALVSCKRARQIWDCLLPNM-LRDGRSTIDFIDRWMAWELLSETIDL

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]4.2e-4626.66Show/hide
Query:  NHPMKKVHGEKECDRILKLGPWSFDKALLILQKPPKMVKLSELIFNFVSFWTHFYDLPLECMTKAMVQRLGNVVGVCDEVDSDADGLCWGESLRVKVRYD
        NH +   H      R L+ GPW F+K L+++    +   + ++IF FV  W     LPL  MTK     +G  VG    +D + DG   G+ LR+K+R D
Subjt:  NHPMKKVHGEKECDRILKLGPWSFDKALLILQKPPKMVKLSELIFNFVSFWTHFYDLPLECMTKAMVQRLGNVVGVCDEVDSDADGLCWGESLRVKVRYD

Query:  ITQPIRRGLKINIEGPMGGCWIPMTYESLPDYCYHYGLIGHLVKECTDGLTSME-----RTKGYVAPYGAW----------LCFQSTMKAKINFRRKSST
        I +P+ RG+ + +       W P+ YE LPD+CY  G++GH  K C   L   E     ++  ++     W            FQ   K+  +  + S  
Subjt:  ITQPIRRGLKINIEGPMGGCWIPMTYESLPDYCYHYGLIGHLVKECTDGLTSME-----RTKGYVAPYGAW----------LCFQSTMKAKINFRRKSST

Query:  PSFSSDRSIPGKGHDEELKGQSQNSG----GREAERDEIPMASL-DGMEGAVFKGHLLGE-------------IDVLPDPKGFLNGGRDGNQDLFHKGNN
        PS+  + +   KG ++  +G+ + +     G  AER + P  SL    +GA  +    GE             ID +PD +G ++GG  G   +  K   
Subjt:  PSFSSDRSIPGKGHDEELKGQSQNSG----GREAERDEIPMASL-DGMEGAVFKGHLLGE-------------IDVLPDPKGFLNGGRDGNQDLFHKGNN

Query:  GKSLAFVPNDFSNFEVDGTGSLQIEVFSPKSVQQWKRKARA-----------QSVFRADPFLKSKCSSEAFNKLKARLHYFGCFVVEAWCMEGDLNEILS
         K+   +P +    +       + E   P+     KR+  A           ++     P L     SE  ++    +          W M GD NEIL 
Subjt:  GKSLAFVPNDFSNFEVDGTGSLQIEVFSPKSVQQWKRKARA-----------QSVFRADPFLKSKCSSEAFNKLKARLHYFGCFVVEAWCMEGDLNEILS

Query:  LEEKEDGSTKIQAEMDRFREAVDDCYLQDLGFSGNTFTWCNR-RKERDQIYERLDRFLGNEAFQSIWTELRVTHQDWAKSDHRPILLSLFETCDRPIQRR
          EK+ G  K Q+ MD FR A+ DC L DLGF G+ FTW N    +   I E LDR + N  +++++   RV + D   SDHRP+++ L E  ++ ++ R
Subjt:  LEEKEDGSTKIQAEMDRFREAVDDCYLQDLGFSGNTFTWCNR-RKERDQIYERLDRFLGNEAFQSIWTELRVTHQDWAKSDHRPILLSLFETCDRPIQRR

Query:  ---RVFKCEEVWIRDNTCGTIINEVGDWGDDQNNQRLEACLRKFTLALPLGILSLSTNGKKNSILCLKQRKFTEDKDLEKIGCEDMDLVEKRFISYYEDI
             F+ E  W+ +     ++ E  D      +  L+  L   +LA   G+ +  ++   N +  L++R     K+LE   C    +   + +   E++
Subjt:  ---RVFKCEEVWIRDNTCGTIINEVGDWGDDQNNQRLEACLRKFTLALPLGILSLSTNGKKNSILCLKQRKFTEDKDLEKIGCEDMDLVEKRFISYYEDI

Query:  FSFSRPTDDDMARILQNVPFSVTEEMNRKLLAPF
          + R    +   +L  V   V+  MN  L A F
Subjt:  FSFSRPTDDDMARILQNVPFSVTEEMNRKLLAPF

TrEMBL top hitse value%identityAlignment
A0A2N9HFT1 Uncharacterized protein6.5e-4523.82Show/hide
Query:  HGEKECDRILKLGPWSFDKALLILQKPPKMVKLSELIFNFVSFWTHFYDLPLECMTKAMVQRLGNVVGVCDEVDSDADGLCWGESLRVKVRYDITQPIRR
        + E + +R+L   PW++DK +++ ++  +   +  ++F+ V  W   + LP+  +++ +   +G+++G      S+ D        R+KVR DITQP+ R
Subjt:  HGEKECDRILKLGPWSFDKALLILQKPPKMVKLSELIFNFVSFWTHFYDLPLECMTKAMVQRLGNVVGVCDEVDSDADGLCWGESLRVKVRYDITQPIRR

Query:  GLKINIEGPMGGCWIPMTYESLPDYCYHYGLIGHLVKECTDGLTSMERTKGYVAPYGAWLCFQ---------STMKAKIN-FRRKSSTPSFSSDRSIPGK
        G ++ +     G W+   YE LP++CY  GL+ H  K+C++ +          A YG WL             T++ + N F+R     + + D   P  
Subjt:  GLKINIEGPMGGCWIPMTYESLPDYCYHYGLIGHLVKECTDGLTSMERTKGYVAPYGAWLCFQ---------STMKAKIN-FRRKSSTPSFSSDRSIPGK

Query:  GHDEELKGQSQNSGGREAERDEIPMA--------------SLDGMEGAVFKGH------LLGEIDV---------LPDPKGFLNGGR--DGNQDLFHKGN
        G        +++S     E  +  +               +L  ++ A+   H      ++ E D+           DP+  L   +    N+ +  + N
Subjt:  GHDEELKGQSQNSGGREAERDEIPMA--------------SLDGMEGAVFKGH------LLGEIDV---------LPDPKGFLNGGR--DGNQDLFHKGN

Query:  NGKSLAFVPNDFSNFEVDGTGSLQIEVFSPKSVQQWKRKARAQSVFRADPFLKSKCSSEAFNKLKARLHYFGCFVVEAWCMEGDLNEILSLEEKEDGSTK
         G  LA       + ++  T      +    S   W+  A     F   P  K+     ++N L+    Y    +   WC  GD NEI+ LEEK+   +K
Subjt:  NGKSLAFVPNDFSNFEVDGTGSLQIEVFSPKSVQQWKRKARAQSVFRADPFLKSKCSSEAFNKLKARLHYFGCFVVEAWCMEGDLNEILSLEEKEDGSTK

Query:  IQAEMDRFREAVDDCYLQDLGFSGNTFTWCNRRKERDQIYERLDRFLGNEAFQSIWTELRVTHQDWAKSDHRPILLSLFETCDRPIQRRRVFKCEEVWIR
         +++M  FREA+DDC   DLG+ G  FTWCN R     ++E+LDR + + A+ +++ + RV H D+  SDH+P+ LS     +R +   + F+ EE+W+ 
Subjt:  IQAEMDRFREAVDDCYLQDLGFSGNTFTWCNRRKERDQIYERLDRFLGNEAFQSIWTELRVTHQDWAKSDHRPILLSLFETCDRPIQRRRVFKCEEVWIR

Query:  DNTCGTIINEVGDWGDDQNNQRLEACLRKFTLALPLGILSLSTNGKKNSILCLKQRKFTEDKDLEK-------------------------IGCEDM---
        D  C   I     W   Q  +R E  + +           +++   + S+L  K+ +    +   +                         +G  D    
Subjt:  DNTCGTIINEVGDWGDDQNNQRLEACLRKFTLALPLGILSLSTNGKKNSILCLKQRKFTEDKDLEK-------------------------IGCEDM---

Query:  -----DLVEKRFISYYEDIFSFSRPTDDDMARILQNVPFSVTEEMNRKLLAPFGVLKLQ
             D V+   ISY+++IF  S P+  D   +LQ +P  +T+ MN+ L  P+   +++
Subjt:  -----DLVEKRFISYYEDIFSFSRPTDDDMARILQNVPFSVTEEMNRKLLAPFGVLKLQ

A0A6J1DX30 uncharacterized protein LOC1110248743.7e-4831.97Show/hide
Query:  DRILKLGPWSFDKALLILQKPPKMVKLSELIFNFVSFWTHFYDLPLECMTKAMVQRLGNVVGVCDEVDSDADGLCWGESLRVKVRYDITQPIRRGLKINI
        ++I K GPW+FD+ L+++ KP  ++  SEL F  +  W  F+DLPL C+T+ M  RLGN +G  +E D D     WG +LRV+V  DI++P+RRG+K+N+
Subjt:  DRILKLGPWSFDKALLILQKPPKMVKLSELIFNFVSFWTHFYDLPLECMTKAMVQRLGNVVGVCDEVDSDADGLCWGESLRVKVRYDITQPIRRGLKINI

Query:  EGPMGGCWIPMTYESLPDYCYHYGLIGHLVKECTDGLTSMERTKGYVAPYGAWLCFQSTMKAKI--------NFRRKSSTPSFSSDRSIPGKGHDEELKG
        +GP+GG WIP+ YE LPD+CYH GL             S  R K     YG+WL +Q T+K  +        +   KS   SFSS  S  G G       
Subjt:  EGPMGGCWIPMTYESLPDYCYHYGLIGHLVKECTDGLTSMERTKGYVAPYGAWLCFQSTMKAKI--------NFRRKSSTPSFSSDRSIPGKGHDEELKG

Query:  QSQNSGGREAERDEIPMASLDGMEGAVFKGHLLGEIDVLPDPKGFLNGGRDGNQDLFHKGNNGKSLAFVPNDFSNFEVDGTGSLQIEVFSPKSVQQWKRK
        QS  + G  A    IPM                 E  V   PK    G     Q        GKS   +        V    +L   +   KS     + 
Subjt:  QSQNSGGREAERDEIPMASLDGMEGAVFKGHLLGEIDVLPDPKGFLNGGRDGNQDLFHKGNNGKSLAFVPNDFSNFEVDGTGSLQIEVFSPKSVQQWKRK

Query:  ARAQSVFRAD--PFLKSKCSSEAFNKLKARLHY-------FGCFVVEAWCMEGDLNEILSLEEKEDGSTKIQAEMDRFREAVDDCYLQDLGFSGNTFTWC
        + + S+ R D  P    + +    +    + H                W + GD+N IL   E    S+   ++++ FR  +D C L D+GF G  FTWC
Subjt:  ARAQSVFRAD--PFLKSKCSSEAFNKLKARLHY-------FGCFVVEAWCMEGDLNEILSLEEKEDGSTKIQAEMDRFREAVDDCYLQDLGFSGNTFTWC

Query:  NRRKERDQIYERLDRFLGNEAFQSIWTELRVTHQDWAKSDH
        N R   DQ+++RLDRFL N+ F  ++ +       W+ + H
Subjt:  NRRKERDQIYERLDRFLGNEAFQSIWTELRVTHQDWAKSDH

A0A6J1DX30 uncharacterized protein LOC1110248741.3e-0539.02Show/hide
Query:  RACHNAIPIMMNLQRRGMEVLPLCLVCLKKEESVDHALVSCKRARQIWDCLLPNM-LRDGRSTIDFIDRWMAWELLSETIDL
        R+ H  IP   NL  RG+  LP C +C  + ES+ HA   CKRARQIW  L P +        I F++ W +     E  DL
Subjt:  RACHNAIPIMMNLQRRGMEVLPLCLVCLKKEESVDHALVSCKRARQIWDCLLPNM-LRDGRSTIDFIDRWMAWELLSETIDL

A0A6J1DX30 uncharacterized protein LOC1110248742.0e-4626.66Show/hide
Query:  NHPMKKVHGEKECDRILKLGPWSFDKALLILQKPPKMVKLSELIFNFVSFWTHFYDLPLECMTKAMVQRLGNVVGVCDEVDSDADGLCWGESLRVKVRYD
        NH +   H      R L+ GPW F+K L+++    +   + ++IF FV  W     LPL  MTK     +G  VG    +D + DG   G+ LR+K+R D
Subjt:  NHPMKKVHGEKECDRILKLGPWSFDKALLILQKPPKMVKLSELIFNFVSFWTHFYDLPLECMTKAMVQRLGNVVGVCDEVDSDADGLCWGESLRVKVRYD

Query:  ITQPIRRGLKINIEGPMGGCWIPMTYESLPDYCYHYGLIGHLVKECTDGLTSME-----RTKGYVAPYGAW----------LCFQSTMKAKINFRRKSST
        I +P+ RG+ + +       W P+ YE LPD+CY  G++GH  K C   L   E     ++  ++     W            FQ   K+  +  + S  
Subjt:  ITQPIRRGLKINIEGPMGGCWIPMTYESLPDYCYHYGLIGHLVKECTDGLTSME-----RTKGYVAPYGAW----------LCFQSTMKAKINFRRKSST

Query:  PSFSSDRSIPGKGHDEELKGQSQNSG----GREAERDEIPMASL-DGMEGAVFKGHLLGE-------------IDVLPDPKGFLNGGRDGNQDLFHKGNN
        PS+  + +   KG ++  +G+ + +     G  AER + P  SL    +GA  +    GE             ID +PD +G ++GG  G   +  K   
Subjt:  PSFSSDRSIPGKGHDEELKGQSQNSG----GREAERDEIPMASL-DGMEGAVFKGHLLGE-------------IDVLPDPKGFLNGGRDGNQDLFHKGNN

Query:  GKSLAFVPNDFSNFEVDGTGSLQIEVFSPKSVQQWKRKARA-----------QSVFRADPFLKSKCSSEAFNKLKARLHYFGCFVVEAWCMEGDLNEILS
         K+   +P +    +       + E   P+     KR+  A           ++     P L     SE  ++    +          W M GD NEIL 
Subjt:  GKSLAFVPNDFSNFEVDGTGSLQIEVFSPKSVQQWKRKARA-----------QSVFRADPFLKSKCSSEAFNKLKARLHYFGCFVVEAWCMEGDLNEILS

Query:  LEEKEDGSTKIQAEMDRFREAVDDCYLQDLGFSGNTFTWCNR-RKERDQIYERLDRFLGNEAFQSIWTELRVTHQDWAKSDHRPILLSLFETCDRPIQRR
          EK+ G  K Q+ MD FR A+ DC L DLGF G+ FTW N    +   I E LDR + N  +++++   RV + D   SDHRP+++ L E  ++ ++ R
Subjt:  LEEKEDGSTKIQAEMDRFREAVDDCYLQDLGFSGNTFTWCNR-RKERDQIYERLDRFLGNEAFQSIWTELRVTHQDWAKSDHRPILLSLFETCDRPIQRR

Query:  ---RVFKCEEVWIRDNTCGTIINEVGDWGDDQNNQRLEACLRKFTLALPLGILSLSTNGKKNSILCLKQRKFTEDKDLEKIGCEDMDLVEKRFISYYEDI
             F+ E  W+ +     ++ E  D      +  L+  L   +LA   G+ +  ++   N +  L++R     K+LE   C    +   + +   E++
Subjt:  ---RVFKCEEVWIRDNTCGTIINEVGDWGDDQNNQRLEACLRKFTLALPLGILSLSTNGKKNSILCLKQRKFTEDKDLEKIGCEDMDLVEKRFISYYEDI

Query:  FSFSRPTDDDMARILQNVPFSVTEEMNRKLLAPF
          + R    +   +L  V   V+  MN  L A F
Subjt:  FSFSRPTDDDMARILQNVPFSVTEEMNRKLLAPF

B9G5C7 Reverse transcriptase domain-containing protein1.3e-4526.77Show/hide
Query:  RILKLGPWSFDKALLILQKPPKMVKLSELIFNFVSFWTHFYDLPLECMTKAMVQRLGNVVGVCDEVDSDADGLCWGESLRVKVRYDITQPIRRGLKINIE
        R L+ GPW F+K L+++    +   + ++IF FV  W     LPL  MTK     +G  VG    +D + DG   G+ LR+K+R DI +P+ RG+ + + 
Subjt:  RILKLGPWSFDKALLILQKPPKMVKLSELIFNFVSFWTHFYDLPLECMTKAMVQRLGNVVGVCDEVDSDADGLCWGESLRVKVRYDITQPIRRGLKINIE

Query:  GPMGGCWIPMTYESLPDYCYHYGLIGHLVKECTDGLTSME-----RTKGYVAPYGAW----------LCFQSTMKAKINFRRKSSTPSFSSDRSIPGKGH
              W P+ YE LPD+CY  G++GH  K C   L   E     ++  ++     W            FQ   K+  +  + S  PS+  + +   KG 
Subjt:  GPMGGCWIPMTYESLPDYCYHYGLIGHLVKECTDGLTSME-----RTKGYVAPYGAW----------LCFQSTMKAKINFRRKSSTPSFSSDRSIPGKGH

Query:  DEELKGQSQNSG----GREAERDEIPMASL-DGMEGAVFKGHLLGE-------------IDVLPDPKGFLNGGRDGNQDLFHKGNNGKSLAFVPNDFSNF
        ++  +G+ + +     G  AER + P  SL    +GA  +    GE             ID +PD +G ++GG  G   +  K    K+   +P +    
Subjt:  DEELKGQSQNSG----GREAERDEIPMASL-DGMEGAVFKGHLLGE-------------IDVLPDPKGFLNGGRDGNQDLFHKGNNGKSLAFVPNDFSNF

Query:  EVDGTGSLQIEVFSPKSVQQWKRKARA-----------QSVFRADPFLKSKCSSEAFNKLKARLHYFGCFVVEAWCMEGDLNEILSLEEKEDGSTKIQAE
        +       + E   P+     KR+  A           ++     P L     SE  ++    +          W M GD NEIL   EK+ G  K Q+ 
Subjt:  EVDGTGSLQIEVFSPKSVQQWKRKARA-----------QSVFRADPFLKSKCSSEAFNKLKARLHYFGCFVVEAWCMEGDLNEILSLEEKEDGSTKIQAE

Query:  MDRFREAVDDCYLQDLGFSGNTFTWCNR-RKERDQIYERLDRFLGNEAFQSIWTELRVTHQDWAKSDHRPILLSLFETCDRPIQRR---RVFKCEEVWIR
        MD FR A+ DC L DLGF G+ FTW N    +   I E LDR + N  +++++   RV + D   SDHRP+++ L E  ++ ++ R     F+ E  W+ 
Subjt:  MDRFREAVDDCYLQDLGFSGNTFTWCNR-RKERDQIYERLDRFLGNEAFQSIWTELRVTHQDWAKSDHRPILLSLFETCDRPIQRR---RVFKCEEVWIR

Query:  DNTCGTIINEVGDWGDDQNNQRLEACLRKFTLALPLGILSLSTNGKKNSILCLKQRKFTEDKDLEKIGCEDMDLVEKRFISYYEDIFSFSRPTDDDMARI
        +     ++ E  D      +  L+  L   +LA   G+ +  ++   N +  L++R     K+LE   C    +   + +   E++  + R    +   +
Subjt:  DNTCGTIINEVGDWGDDQNNQRLEACLRKFTLALPLGILSLSTNGKKNSILCLKQRKFTEDKDLEKIGCEDMDLVEKRFISYYEDIFSFSRPTDDDMARI

Query:  LQNVPFSVTEEMNRKLLAPF
        L  V   V+  MN  L A F
Subjt:  LQNVPFSVTEEMNRKLLAPF

Q7G3D9 Retrotransposon protein, putative, unclassified2.0e-4626.66Show/hide
Query:  NHPMKKVHGEKECDRILKLGPWSFDKALLILQKPPKMVKLSELIFNFVSFWTHFYDLPLECMTKAMVQRLGNVVGVCDEVDSDADGLCWGESLRVKVRYD
        NH +   H      R L+ GPW F+K L+++    +   + ++IF FV  W     LPL  MTK     +G  VG    +D + DG   G+ LR+K+R D
Subjt:  NHPMKKVHGEKECDRILKLGPWSFDKALLILQKPPKMVKLSELIFNFVSFWTHFYDLPLECMTKAMVQRLGNVVGVCDEVDSDADGLCWGESLRVKVRYD

Query:  ITQPIRRGLKINIEGPMGGCWIPMTYESLPDYCYHYGLIGHLVKECTDGLTSME-----RTKGYVAPYGAW----------LCFQSTMKAKINFRRKSST
        I +P+ RG+ + +       W P+ YE LPD+CY  G++GH  K C   L   E     ++  ++     W            FQ   K+  +  + S  
Subjt:  ITQPIRRGLKINIEGPMGGCWIPMTYESLPDYCYHYGLIGHLVKECTDGLTSME-----RTKGYVAPYGAW----------LCFQSTMKAKINFRRKSST

Query:  PSFSSDRSIPGKGHDEELKGQSQNSG----GREAERDEIPMASL-DGMEGAVFKGHLLGE-------------IDVLPDPKGFLNGGRDGNQDLFHKGNN
        PS+  + +   KG ++  +G+ + +     G  AER + P  SL    +GA  +    GE             ID +PD +G ++GG  G   +  K   
Subjt:  PSFSSDRSIPGKGHDEELKGQSQNSG----GREAERDEIPMASL-DGMEGAVFKGHLLGE-------------IDVLPDPKGFLNGGRDGNQDLFHKGNN

Query:  GKSLAFVPNDFSNFEVDGTGSLQIEVFSPKSVQQWKRKARA-----------QSVFRADPFLKSKCSSEAFNKLKARLHYFGCFVVEAWCMEGDLNEILS
         K+   +P +    +       + E   P+     KR+  A           ++     P L     SE  ++    +          W M GD NEIL 
Subjt:  GKSLAFVPNDFSNFEVDGTGSLQIEVFSPKSVQQWKRKARA-----------QSVFRADPFLKSKCSSEAFNKLKARLHYFGCFVVEAWCMEGDLNEILS

Query:  LEEKEDGSTKIQAEMDRFREAVDDCYLQDLGFSGNTFTWCNR-RKERDQIYERLDRFLGNEAFQSIWTELRVTHQDWAKSDHRPILLSLFETCDRPIQRR
          EK+ G  K Q+ MD FR A+ DC L DLGF G+ FTW N    +   I E LDR + N  +++++   RV + D   SDHRP+++ L E  ++ ++ R
Subjt:  LEEKEDGSTKIQAEMDRFREAVDDCYLQDLGFSGNTFTWCNR-RKERDQIYERLDRFLGNEAFQSIWTELRVTHQDWAKSDHRPILLSLFETCDRPIQRR

Query:  ---RVFKCEEVWIRDNTCGTIINEVGDWGDDQNNQRLEACLRKFTLALPLGILSLSTNGKKNSILCLKQRKFTEDKDLEKIGCEDMDLVEKRFISYYEDI
             F+ E  W+ +     ++ E  D      +  L+  L   +LA   G+ +  ++   N +  L++R     K+LE   C    +   + +   E++
Subjt:  ---RVFKCEEVWIRDNTCGTIINEVGDWGDDQNNQRLEACLRKFTLALPLGILSLSTNGKKNSILCLKQRKFTEDKDLEKIGCEDMDLVEKRFISYYEDI

Query:  FSFSRPTDDDMARILQNVPFSVTEEMNRKLLAPF
          + R    +   +L  V   V+  MN  L A F
Subjt:  FSFSRPTDDDMARILQNVPFSVTEEMNRKLLAPF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10000.1 Ribonuclease H-like superfamily protein1.5e-0438.82Show/hide
Query:  NPQLAEIAAICEGLRLAERLGLSRVLVESDSKNVIDSICEISVSRGEVANRIADIHSLANSFASISFLHVFRESNQTAHALAKES
        +P  AE  AI   +  A +L  S +LV SDSK+++D++   +VS  E+   + +I S+ N F SISF  + R  N  A A AK S
Subjt:  NPQLAEIAAICEGLRLAERLGLSRVLVESDSKNVIDSICEISVSRGEVANRIADIHSLANSFASISFLHVFRESNQTAHALAKES

AT3G31430.1 unknown protein6.1e-1123.36Show/hide
Query:  EKECDRILKLGPWSFDKALLILQKPPKMVKLSELIFNFVSFWTHFYDLPLECMTKAMVQRLGNVVGVCDEVDSDADGLCWGESLRVKVRYDITQPIRRGL
        E+  + +L+ GPW+F+  +++LQ+    + L    F F+ FW     +P + + + +V+ +G  +G   + D + + +   +  RV + +DIT P+R   
Subjt:  EKECDRILKLGPWSFDKALLILQKPPKMVKLSELIFNFVSFWTHFYDLPLECMTKAMVQRLGNVVGVCDEVDSDADGLCWGESLRVKVRYDITQPIRRGL

Query:  KINIEGPMGGCWIPMTYESLPDYCYHYGLIGHLVKEC
               +    +   YE L  +C   G++ H    C
Subjt:  KINIEGPMGGCWIPMTYESLPDYCYHYGLIGHLVKEC

AT3G42140.1 zinc ion binding;nucleic acid binding2.5e-0422.63Show/hide
Query:  EKECDRILKLGPWSFDKALLILQKPPKMVKLSELIFNFVSFWTHFYDLPLECMTKAMVQRLGNVVGVCDEVDSDADGLCWGESLRVKVRYDITQPIRRGL
        E+    IL+ GPWSF+  + ++Q+  K+   S+  F  + FW     +PL  +T  ++  +G  +G+  E +   D         V V            
Subjt:  EKECDRILKLGPWSFDKALLILQKPPKMVKLSELIFNFVSFWTHFYDLPLECMTKAMVQRLGNVVGVCDEVDSDADGLCWGESLRVKVRYDITQPIRRGL

Query:  KINIEGPMGGCWIPMTYESLPDYCYHYGLIGHLVKEC
                    +   YE L ++C   G++ H   EC
Subjt:  KINIEGPMGGCWIPMTYESLPDYCYHYGLIGHLVKEC

AT4G09490.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.5e-0436.9Show/hide
Query:  PQLAEIAAICEGLRLAERLGLSRVLVESDSKNVIDSICEISVSRGEVANRIADIHSLANSFASISFLHVFRESNQTAHALAKES
        P +AE  A+   L+ A+ +G++++ + SDS+ +I +I   S S  E    I DI +L+  FA +SF  V R  N+ A  LAK S
Subjt:  PQLAEIAAICEGLRLAERLGLSRVLVESDSKNVIDSICEISVSRGEVANRIADIHSLANSFASISFLHVFRESNQTAHALAKES


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAGTTCTTGACTGTGGTTTCCAAAACCTTGGGGTTTTGAGCGCTGTTGGAATTTCTGTCGTTGTCTTGTGTCTGGATAGGAATGGGGACAACGGATCTCTTGGAAG
ACTGGAAAAAGTTCAAGCTAACATCTGTAGAAGAGGAGATCCACCAAGGATTCAACAAGAGTTCACTCCTTATTCAAATTCCTACCACCAAGATTGGAGGGATCATTCGA
ATTTCTTGTGGGGAGAGCAACAACTTGAAAGTAGCTATCCATATGAACATGCATTCCCACAAGAATTTCCAATACAACCTCAGCAGGAGTACGATCAACCAAGGGATCTA
AAGCAAATACAATCGTGGGTAATGTCTTGCCTTGAAGAGACGATGATAGATCTTATGGCGAGAAATGAGGCAACAGTACGCAACAACATGATAGATTTTATGGCGCGAAA
TGATGCAGCAGTACGCAACCTGCATATCCGAATAGATCAATTGGTTGCTGAGCTGAGGAATAGTCCACCAACAGCTTGTCTTAATGACGCGGAGATCCCAAAGCAAGAAG
GGGAACAACAAAGTGAGGCAATGAGTGTATGGAGCAAGCTCGAACATGACGAAGTAGAACTTCTAGTGGACAATGATGACCCACCATCCCCTAATGTGATGAACGAGGTA
GGAAATTATAAGGAAGTTGAACAAGAGCAACCAGTAGAGGCAAACCCCACCCCCCTTCATTCTTTCTTCCATCTTTTCACCATGGGAACCTTACCATCTCACCCCGGAGC
TCCGATTCTTCATCGTCATCTTATCCCGTGGGTCGCCGATGTTCCCAATCATCCAATGAAGAAAGTCCATGGTGAAAAAGAATGTGATAGGATCCTTAAGTTGGGGCCAT
GGTCTTTTGATAAAGCATTGTTGATACTTCAAAAGCCTCCAAAGATGGTAAAATTGTCTGAATTGATTTTCAATTTTGTGTCGTTTTGGACCCACTTCTATGATCTTCCT
TTGGAATGTATGACCAAGGCTATGGTGCAGAGGTTGGGGAATGTTGTAGGCGTGTGCGACGAGGTCGATAGTGATGCAGATGGTTTGTGTTGGGGAGAGAGTCTTCGGGT
GAAGGTTAGATATGATATCACTCAACCTATTCGGAGGGGATTAAAAATCAATATCGAGGGGCCAATGGGAGGATGCTGGATACCAATGACTTACGAAAGTCTTCCAGATT
ACTGCTATCACTATGGATTGATCGGGCATCTAGTGAAGGAGTGCACGGATGGTTTGACAAGCATGGAGCGAACGAAAGGGTATGTGGCACCTTATGGTGCCTGGCTTTGT
TTCCAAAGTACAATGAAAGCGAAGATCAATTTTCGTCGAAAGAGCTCTACCCCTTCATTTTCGAGTGATAGGTCAATCCCTGGGAAGGGTCACGATGAAGAGCTGAAGGG
GCAAAGCCAAAATTCAGGTGGTAGAGAGGCTGAGCGGGATGAAATTCCAATGGCGAGTCTTGACGGAATGGAGGGTGCAGTTTTCAAGGGTCATTTACTTGGAGAAATTG
ATGTCTTGCCTGATCCTAAGGGATTTTTAAATGGGGGGAGAGATGGAAATCAGGATCTTTTCCATAAAGGTAATAATGGAAAATCTCTGGCCTTTGTTCCAAATGATTTT
TCTAATTTTGAGGTAGATGGTACGGGCTCATTACAAATTGAGGTGTTCAGCCCAAAGTCGGTTCAGCAATGGAAGAGGAAGGCTCGGGCCCAGTCTGTTTTTAGGGCTGA
CCCTTTTCTGAAGTCAAAATGTAGTTCGGAGGCTTTCAATAAATTGAAGGCTCGCTTGCATTATTTTGGGTGTTTTGTTGTTGAAGCTTGGTGTATGGAGGGTGACTTAA
ACGAAATTCTATCTCTGGAAGAAAAAGAAGATGGCAGTACTAAAATTCAAGCTGAAATGGATAGATTCAGAGAAGCTGTTGATGATTGTTATCTCCAAGACTTGGGCTTT
AGTGGCAATACTTTTACTTGGTGCAACCGGAGAAAGGAGAGGGACCAAATTTATGAGAGGCTTGATCGCTTTCTGGGCAATGAGGCTTTTCAAAGCATTTGGACAGAATT
AAGAGTGACTCATCAAGATTGGGCAAAGTCAGATCATAGGCCCATCTTGCTTTCTTTATTTGAAACATGTGACAGGCCTATTCAGCGGAGGAGGGTGTTCAAATGTGAGG
AAGTGTGGATAAGAGATAACACATGTGGAACCATTATTAATGAGGTGGGGGATTGGGGTGATGATCAGAATAATCAACGTTTAGAAGCTTGCCTTCGAAAGTTTACTCTG
GCTCTCCCCCTTGGGATTTTGAGCTTATCCACAAATGGGAAAAAGAACTCGATTCTTTGCTTGAAACAGAGGAAATTTACTGAAGACAAAGATCTTGAGAAAATTGGCTG
CGAGGACATGGATCTTGTGGAGAAGAGGTTCATATCTTATTATGAGGATATTTTCTCTTTTTCAAGACCCACTGATGATGATATGGCTCGAATTTTACAGAATGTTCCTT
TCTCAGTAACAGAAGAGATGAATAGGAAGCTTTTGGCTCCTTTTGGAGTGCTGAAATTACAGATGGGTGTGGCATTATTCAAAGAATGGGAACTATTCAGTAAGGAGCGG
GCTTGTCATAATGCCATTCCGATTATGATGAATTTACAAAGGAGAGGGATGGAAGTTTTACCTTTATGTCTGGTTTGTTTGAAGAAGGAAGAGTCTGTGGATCATGCGTT
GGTGAGTTGCAAAAGAGCCCGTCAAATTTGGGACTGTTTGTTGCCGAATATGTTGAGGGACGGTCGGTCTACCATTGATTTTATTGATCGTTGGATGGCTTGGGAGTTGT
TATCAGAGACGATCGATCTGGGGACGAATCCTCAACTCGCTGAAATTGCTGCAATTTGTGAAGGTTTGAGGCTTGCAGAAAGGCTAGGGCTTTCTCGAGTGTTGGTGGAG
TCTGATTCGAAGAATGTGATCGACTCAATTTGTGAAATTTCGGTCTCTAGGGGTGAGGTGGCAAATCGTATAGCTGATATTCATTCCCTGGCGAACTCTTTTGCTTCGAT
TTCTTTTCTCCATGTTTTTAGGGAATCAAACCAAACTGCTCATGCTCTAGCTAAGGAGAGTACCAGGATGGGCCATTCGTTTTTGTGGCTTTCCAATTTCCCTTTTTTTC
TCAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCAGTTCTTGACTGTGGTTTCCAAAACCTTGGGGTTTTGAGCGCTGTTGGAATTTCTGTCGTTGTCTTGTGTCTGGATAGGAATGGGGACAACGGATCTCTTGGAAG
ACTGGAAAAAGTTCAAGCTAACATCTGTAGAAGAGGAGATCCACCAAGGATTCAACAAGAGTTCACTCCTTATTCAAATTCCTACCACCAAGATTGGAGGGATCATTCGA
ATTTCTTGTGGGGAGAGCAACAACTTGAAAGTAGCTATCCATATGAACATGCATTCCCACAAGAATTTCCAATACAACCTCAGCAGGAGTACGATCAACCAAGGGATCTA
AAGCAAATACAATCGTGGGTAATGTCTTGCCTTGAAGAGACGATGATAGATCTTATGGCGAGAAATGAGGCAACAGTACGCAACAACATGATAGATTTTATGGCGCGAAA
TGATGCAGCAGTACGCAACCTGCATATCCGAATAGATCAATTGGTTGCTGAGCTGAGGAATAGTCCACCAACAGCTTGTCTTAATGACGCGGAGATCCCAAAGCAAGAAG
GGGAACAACAAAGTGAGGCAATGAGTGTATGGAGCAAGCTCGAACATGACGAAGTAGAACTTCTAGTGGACAATGATGACCCACCATCCCCTAATGTGATGAACGAGGTA
GGAAATTATAAGGAAGTTGAACAAGAGCAACCAGTAGAGGCAAACCCCACCCCCCTTCATTCTTTCTTCCATCTTTTCACCATGGGAACCTTACCATCTCACCCCGGAGC
TCCGATTCTTCATCGTCATCTTATCCCGTGGGTCGCCGATGTTCCCAATCATCCAATGAAGAAAGTCCATGGTGAAAAAGAATGTGATAGGATCCTTAAGTTGGGGCCAT
GGTCTTTTGATAAAGCATTGTTGATACTTCAAAAGCCTCCAAAGATGGTAAAATTGTCTGAATTGATTTTCAATTTTGTGTCGTTTTGGACCCACTTCTATGATCTTCCT
TTGGAATGTATGACCAAGGCTATGGTGCAGAGGTTGGGGAATGTTGTAGGCGTGTGCGACGAGGTCGATAGTGATGCAGATGGTTTGTGTTGGGGAGAGAGTCTTCGGGT
GAAGGTTAGATATGATATCACTCAACCTATTCGGAGGGGATTAAAAATCAATATCGAGGGGCCAATGGGAGGATGCTGGATACCAATGACTTACGAAAGTCTTCCAGATT
ACTGCTATCACTATGGATTGATCGGGCATCTAGTGAAGGAGTGCACGGATGGTTTGACAAGCATGGAGCGAACGAAAGGGTATGTGGCACCTTATGGTGCCTGGCTTTGT
TTCCAAAGTACAATGAAAGCGAAGATCAATTTTCGTCGAAAGAGCTCTACCCCTTCATTTTCGAGTGATAGGTCAATCCCTGGGAAGGGTCACGATGAAGAGCTGAAGGG
GCAAAGCCAAAATTCAGGTGGTAGAGAGGCTGAGCGGGATGAAATTCCAATGGCGAGTCTTGACGGAATGGAGGGTGCAGTTTTCAAGGGTCATTTACTTGGAGAAATTG
ATGTCTTGCCTGATCCTAAGGGATTTTTAAATGGGGGGAGAGATGGAAATCAGGATCTTTTCCATAAAGGTAATAATGGAAAATCTCTGGCCTTTGTTCCAAATGATTTT
TCTAATTTTGAGGTAGATGGTACGGGCTCATTACAAATTGAGGTGTTCAGCCCAAAGTCGGTTCAGCAATGGAAGAGGAAGGCTCGGGCCCAGTCTGTTTTTAGGGCTGA
CCCTTTTCTGAAGTCAAAATGTAGTTCGGAGGCTTTCAATAAATTGAAGGCTCGCTTGCATTATTTTGGGTGTTTTGTTGTTGAAGCTTGGTGTATGGAGGGTGACTTAA
ACGAAATTCTATCTCTGGAAGAAAAAGAAGATGGCAGTACTAAAATTCAAGCTGAAATGGATAGATTCAGAGAAGCTGTTGATGATTGTTATCTCCAAGACTTGGGCTTT
AGTGGCAATACTTTTACTTGGTGCAACCGGAGAAAGGAGAGGGACCAAATTTATGAGAGGCTTGATCGCTTTCTGGGCAATGAGGCTTTTCAAAGCATTTGGACAGAATT
AAGAGTGACTCATCAAGATTGGGCAAAGTCAGATCATAGGCCCATCTTGCTTTCTTTATTTGAAACATGTGACAGGCCTATTCAGCGGAGGAGGGTGTTCAAATGTGAGG
AAGTGTGGATAAGAGATAACACATGTGGAACCATTATTAATGAGGTGGGGGATTGGGGTGATGATCAGAATAATCAACGTTTAGAAGCTTGCCTTCGAAAGTTTACTCTG
GCTCTCCCCCTTGGGATTTTGAGCTTATCCACAAATGGGAAAAAGAACTCGATTCTTTGCTTGAAACAGAGGAAATTTACTGAAGACAAAGATCTTGAGAAAATTGGCTG
CGAGGACATGGATCTTGTGGAGAAGAGGTTCATATCTTATTATGAGGATATTTTCTCTTTTTCAAGACCCACTGATGATGATATGGCTCGAATTTTACAGAATGTTCCTT
TCTCAGTAACAGAAGAGATGAATAGGAAGCTTTTGGCTCCTTTTGGAGTGCTGAAATTACAGATGGGTGTGGCATTATTCAAAGAATGGGAACTATTCAGTAAGGAGCGG
GCTTGTCATAATGCCATTCCGATTATGATGAATTTACAAAGGAGAGGGATGGAAGTTTTACCTTTATGTCTGGTTTGTTTGAAGAAGGAAGAGTCTGTGGATCATGCGTT
GGTGAGTTGCAAAAGAGCCCGTCAAATTTGGGACTGTTTGTTGCCGAATATGTTGAGGGACGGTCGGTCTACCATTGATTTTATTGATCGTTGGATGGCTTGGGAGTTGT
TATCAGAGACGATCGATCTGGGGACGAATCCTCAACTCGCTGAAATTGCTGCAATTTGTGAAGGTTTGAGGCTTGCAGAAAGGCTAGGGCTTTCTCGAGTGTTGGTGGAG
TCTGATTCGAAGAATGTGATCGACTCAATTTGTGAAATTTCGGTCTCTAGGGGTGAGGTGGCAAATCGTATAGCTGATATTCATTCCCTGGCGAACTCTTTTGCTTCGAT
TTCTTTTCTCCATGTTTTTAGGGAATCAAACCAAACTGCTCATGCTCTAGCTAAGGAGAGTACCAGGATGGGCCATTCGTTTTTGTGGCTTTCCAATTTCCCTTTTTTTC
TCAATTAG
Protein sequenceShow/hide protein sequence
MSVLDCGFQNLGVLSAVGISVVVLCLDRNGDNGSLGRLEKVQANICRRGDPPRIQQEFTPYSNSYHQDWRDHSNFLWGEQQLESSYPYEHAFPQEFPIQPQQEYDQPRDL
KQIQSWVMSCLEETMIDLMARNEATVRNNMIDFMARNDAAVRNLHIRIDQLVAELRNSPPTACLNDAEIPKQEGEQQSEAMSVWSKLEHDEVELLVDNDDPPSPNVMNEV
GNYKEVEQEQPVEANPTPLHSFFHLFTMGTLPSHPGAPILHRHLIPWVADVPNHPMKKVHGEKECDRILKLGPWSFDKALLILQKPPKMVKLSELIFNFVSFWTHFYDLP
LECMTKAMVQRLGNVVGVCDEVDSDADGLCWGESLRVKVRYDITQPIRRGLKINIEGPMGGCWIPMTYESLPDYCYHYGLIGHLVKECTDGLTSMERTKGYVAPYGAWLC
FQSTMKAKINFRRKSSTPSFSSDRSIPGKGHDEELKGQSQNSGGREAERDEIPMASLDGMEGAVFKGHLLGEIDVLPDPKGFLNGGRDGNQDLFHKGNNGKSLAFVPNDF
SNFEVDGTGSLQIEVFSPKSVQQWKRKARAQSVFRADPFLKSKCSSEAFNKLKARLHYFGCFVVEAWCMEGDLNEILSLEEKEDGSTKIQAEMDRFREAVDDCYLQDLGF
SGNTFTWCNRRKERDQIYERLDRFLGNEAFQSIWTELRVTHQDWAKSDHRPILLSLFETCDRPIQRRRVFKCEEVWIRDNTCGTIINEVGDWGDDQNNQRLEACLRKFTL
ALPLGILSLSTNGKKNSILCLKQRKFTEDKDLEKIGCEDMDLVEKRFISYYEDIFSFSRPTDDDMARILQNVPFSVTEEMNRKLLAPFGVLKLQMGVALFKEWELFSKER
ACHNAIPIMMNLQRRGMEVLPLCLVCLKKEESVDHALVSCKRARQIWDCLLPNMLRDGRSTIDFIDRWMAWELLSETIDLGTNPQLAEIAAICEGLRLAERLGLSRVLVE
SDSKNVIDSICEISVSRGEVANRIADIHSLANSFASISFLHVFRESNQTAHALAKESTRMGHSFLWLSNFPFFLN