; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036604 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036604
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr3:49124984..49139428
RNA-Seq ExpressionLag0036604
SyntenyLag0036604
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU21788.1 hypothetical protein TSUD_329120, partial [Trifolium subterraneum]3.3e-5326.23Show/hide
Query:  LIGKLITNWFISKTAMKNSMEGAWKTRRDFKVDIIGTNIYLFIFESKKDRDWIATNDPW-------------------------------IINLPIGFRN
        L+GK+ T    +  A K +M  AW++R   ++  +  N++LF F ++++ D +  N PW                               + +LP+  R+
Subjt:  LIGKLITNWFISKTAMKNSMEGAWKTRRDFKVDIIGTNIYLFIFESKKDRDWIATNDPW-------------------------------IINLPIGFRN

Query:  ENIVKKIGNGLGGFLEQDNKRNYAQLGNSIRIRVQIDISKPLRRGFMVKIPGSCEECWVTIRYERIPEFCFFCGKIGHAIKDCLKPKDITEINKEDLE--
        E I KK+GN +G F E D K +  ++G  +R+RV +D+ KPL+RG  +   G  +E WV  +YER+P FCF CG+IGH ++DC   +D  E+   +LE  
Subjt:  ENIVKKIGNGLGGFLEQDNKRNYAQLGNSIRIRVQIDISKPLRRGFMVKIPGSCEECWVTIRYERIPEFCFFCGKIGHAIKDCLKPKDITEINKEDLE--

Query:  ---FGMWVKF-------------------------QGFNGRTKSSGSPKRNSDDPNLQDPTEEENYR---NEARNSESLDVDLNIISPVAEDPEETIRRE
           FG W++                             N R ++SG+ K  +++   Q  +++++     ++A  S+  D D      V ++ E  +   
Subjt:  ---FGMWVKF-------------------------QGFNGRTKSSGSPKRNSDDPNLQDPTEEENYR---NEARNSESLDVDLNIISPVAEDPEETIRRE

Query:  GKQVVFTEYE--ESRSHVNDEDNQDVLQLMEKNQVWIIPSGFRWKGMKIKWIRVHQRKEEVGKGENDRIILTALLKKSQNCAKERGLGN-------PRAF
        G  V+ T+ +  E   HV +           K + W+     R +  K K  +  +   + GK     +++T    +     +++  G+           
Subjt:  GKQVVFTEYE--ESRSHVNDEDNQDVLQLMEKNQVWIIPSGFRWKGMKIKWIRVHQRKEEVGKGENDRIILTALLKKSQNCAKERGLGN-------PRAF

Query:  RAIKNLVLSRNPQVFFICKTKCDERVTERIKNACKFDGCLSVRSNG----AKGGLCLLWKVNELVSVQSFSSNHID--CNISWNDISWRFTRVYGYPKHS
        RA+  L    NPQ+ F+ +T+      E I++   F  CL V  NG      GGL L+W     V++ S+S NHI   C+   +  SW  T +YGYP+  
Subjt:  RAIKNLVLSRNPQVFFICKTKCDERVTERIKNACKFDGCLSVRSNG----AKGGLCLLWKVNELVSVQSFSSNHID--CNISWNDISWRFTRVYGYPKHS

Query:  KKAFTWELIRNLREPSDTPWLLGGDWNESLSNEEKQGEPLRDINLIDGFRQCLGDCNLRDIR-------------PNGELFTWPIEIWMGNRGQRNNKSH
         K  TW+LIR+L   +   WL  GD+N+ L + EKQG  +R  + +   RQ + DC+  D+              P        + I +     R+ +  
Subjt:  KKAFTWELIRNLREPSDTPWLLGGDWNESLSNEEKQGEPLRDINLIDGFRQCLGDCNLRDIR-------------PNGELFTWPIEIWMGNRGQRNNKSH

Query:  DIQFKFEELWTKYEECADIIATNSDWTGKKSNEMLGITDSNRLWTEDH
           F+FEE WTK  +C ++I  N   +    ++ L   +S     EDH
Subjt:  DIQFKFEELWTKYEECADIIATNSDWTGKKSNEMLGITDSNRLWTEDH

KAA3459308.1 reverse transcriptase [Gossypium australe]2.6e-5026.37Show/hide
Query:  AMKNSMEGAWKTRRDFKVDIIGTNIYLFIFESKKDRDWIATNDPWIIN--------------------------LPIG-----FRNENIVKKIGNGLGGF
        + +  M+  WKT+R F++ ++G N++L IFES  D + +    PW                             + IG     F  ++++  IG   GG 
Subjt:  AMKNSMEGAWKTRRDFKVDIIGTNIYLFIFESKKDRDWIATNDPWIIN--------------------------LPIG-----FRNENIVKKIGNGLGGF

Query:  LEQDNKRNYAQLGNSIRIRVQIDISKPLRRGFMVKIPGSCEECWVTIRYERIPEFCFFCGKIGHAIKDCLKPK-DITEINKEDLEFGMWVKFQG--FNGR
        +  + K      G+  RIR+ +D+ KPLRRG  V   G  ++CW++ +YE++  FCF CG++GHA+  C+  K D+ E  +E  +F + +K +   F   
Subjt:  LEQDNKRNYAQLGNSIRIRVQIDISKPLRRGFMVKIPGSCEECWVTIRYERIPEFCFFCGKIGHAIKDCLKPK-DITEINKEDLEFGMWVKFQG--FNGR

Query:  TKSSGSPKRNSDDPNLQDPTEEENYRNEARNSESLDVDLNIISPVAEDPEETIRREGKQVVFTEYEESRSHVNDEDNQDVLQLMEKNQVWIIPSGFRWKG
             +        +      EEN + + ++    D+ L + + +AE  +  +++  ++ +    E+   ++  E     LQ + KN  W   +  R  G
Subjt:  TKSSGSPKRNSDDPNLQDPTEEENYRNEARNSESLDVDLNIISPVAEDPEETIRREGKQVVFTEYEESRSHVNDEDNQDVLQLMEKNQVWIIPSGFRWKG

Query:  MKIKWIRVHQRK-EEVGKGENDRIILTALLKKSQNCAKERGL---------GNPR------------AFRAIKNLVLSRNPQVFFICKTKCDERVTERIK
        +      V +RK  +   G ++R        K Q  A +  +         GN              A R ++ L+   NP + F+ ++K D +  ERI+
Subjt:  MKIKWIRVHQRK-EEVGKGENDRIILTALLKKSQNCAKERGL---------GNPR------------AFRAIKNLVLSRNPQVFFICKTKCDERVTERIK

Query:  NACKFDGCLSVRSNGAKGGLCLLWKVNELVSVQSFSSNHIDCNISWNDI--SWRFTRVYGYPKHSKKAFTWELIRNLREPSDTPWLLGGDWNESLSNEEK
            F   + V + G++GGLCL W+ +  VS+++FS NHID  ++ +++   WR+T  YG P  + +  +W L+R L +    PWL+ GD+NE L + EK
Subjt:  NACKFDGCLSVRSNGAKGGLCLLWKVNELVSVQSFSSNHIDCNISWNDI--SWRFTRVYGYPKHSKKAFTWELIRNLREPSDTPWLLGGDWNESLSNEEK

Query:  QGEPLRDINLIDGFRQCLGDCNLRDIRPNGELFTWP-------------IEIWMGNRG--QRNNKSHDIQFKFEELWTKYEECADIIATNSDWTGKKSNE
         G  LR+   +  FR+ L DC L D+   G  FTW                 W+   G    + K    +FKFE  WT  E   + I    + +    +E
Subjt:  QGEPLRDINLIDGFRQCLGDCNLRDIRPNGELFTWP-------------IEIWMGNRG--QRNNKSHDIQFKFEELWTKYEECADIIATNSDWTGKKSNE

Query:  MLG
         LG
Subjt:  MLG

MCH80348.1 hypothetical protein [Trifolium medium]3.8e-5727.58Show/hide
Query:  LIGKLITNWFISKTAMKNSMEGAWKTRRDFKVDIIGTNIYLFIFESKKDRDWIATNDPW-------------------------------IINLPIGFRN
        L+GK+ T    +  A K +M  AW++R   ++  +  N+YLF F +K++ D +  N PW                               + +LP+  R+
Subjt:  LIGKLITNWFISKTAMKNSMEGAWKTRRDFKVDIIGTNIYLFIFESKKDRDWIATNDPW-------------------------------IINLPIGFRN

Query:  ENIVKKIGNGLGGFLEQDNKRNYAQLGNSIRIRVQIDISKPLRRGFMVKIPGSCEECWVTIRYERIPEFCFFCGKIGHAIKDCLKPKDITEINKEDLE--
        E I KK+GN +G F E D K    ++G  +R+RV +D+ KPL+RG  +   G  +E WV  +YER+P FCF CG+IGH ++DC   +D  E+   +LE  
Subjt:  ENIVKKIGNGLGGFLEQDNKRNYAQLGNSIRIRVQIDISKPLRRGFMVKIPGSCEECWVTIRYERIPEFCFFCGKIGHAIKDCLKPKDITEINKEDLE--

Query:  ---FGMWVKFQGFNGRTKSSGSPKRNSDDPNLQDPTEEENYRNEARNS-ESLDVDLNIISPVAEDPEETIRREGKQVVFTEYE-ESRSHVNDEDNQDVL-
           FG W++        K S   K+ S   N           ++ +NS  + ++D  +      D + + ++  K  +  + E ++    N +   +V+ 
Subjt:  ---FGMWVKFQGFNGRTKSSGSPKRNSDDPNLQDPTEEENYRNEARNS-ESLDVDLNIISPVAEDPEETIRREGKQVVFTEYE-ESRSHVNDEDNQDVL-

Query:  ------------QLMEKNQVWIIPSGFRWKGMKIKWIRVHQRKEEVGK-GEN---DRIILTALLKKSQNCAKE-RG----------LGNPRAFRAIKNLV
                    ++ E  Q      G RW   K+   +  +  + VGK G+    D ++    ++  +   K+ RG           G+PRA RA+  L 
Subjt:  ------------QLMEKNQVWIIPSGFRWKGMKIKWIRVHQRKEEVGK-GEN---DRIILTALLKKSQNCAKE-RG----------LGNPRAFRAIKNLV

Query:  LSRNPQVFFICKTKCDERVTERIKNACKFDGCLSVRSNG----AKGGLCLLWKVNELVSVQSFSSNHID--CNISWNDISWRFTRVYGYPKHSKKAFTWE
           NPQ+ F+ +T+      E I++   F  CL+V  NG      GGL L+W  +  V++ SFS NHI   C    +  SW  T +YGYP+   K  TW 
Subjt:  LSRNPQVFFICKTKCDERVTERIKNACKFDGCLSVRSNG----AKGGLCLLWKVNELVSVQSFSSNHID--CNISWNDISWRFTRVYGYPKHSKKAFTWE

Query:  LIRNLREPSDTPWLLGGDWNESLSNEEKQGEPLRDINLIDGFRQCLGDCNLRDIRPNGELFTW----------------------------PIEIWMGNR
        LIR+L   +   WL  GD+N+ L + EKQG  +R        RQ + D +L D+   G  FTW                            PI++   N 
Subjt:  LIRNLREPSDTPWLLGGDWNESLSNEEKQGEPLRDINLIDGFRQCLGDCNLRDIRPNGELFTW----------------------------PIEIWMGNR

Query:  GQRNNKSHDI------------------QFKFEELWTKYEECADIIATN
         QR    H                     F+FEE WTK  +C ++I +N
Subjt:  GQRNNKSHDI------------------QFKFEELWTKYEECADIIATN

PNY15174.1 ribonuclease H, partial [Trifolium pratense]2.2e-4928.52Show/hide
Query:  LIGKLITNWFISKTAMKNSMEGAWKTRRDFKVDIIGTNIYLFIFESKKDRDWIATNDPW-------------------------------IINLPIGFRN
        L+GKL T    +    K  +  +W+ +   ++  +  N++LF F SK+D + +  N PW                               I +LP+  R+
Subjt:  LIGKLITNWFISKTAMKNSMEGAWKTRRDFKVDIIGTNIYLFIFESKKDRDWIATNDPW-------------------------------IINLPIGFRN

Query:  ENIVKKIGNGLGGFLEQDNKRNYAQLGNSIRIRVQIDISKPLRRGFMVKIPGSCEECWVTIRYERIPEFCFFCGKIGHAIKDCLKPKDITEINKEDLE--
        + +  ++GN +G F+E D+K  + +LG  +R++V ID+ KPL+RG ++   G  +   V  +YER+P FC+ CG+IGH +KDC + +   E   ED+E  
Subjt:  ENIVKKIGNGLGGFLEQDNKRNYAQLGNSIRIRVQIDISKPLRRGFMVKIPGSCEECWVTIRYERIPEFCFFCGKIGHAIKDCLKPKDITEINKEDLE--

Query:  ---FGMWVKFQGFNGRTKSSGSPKRNSDDPNLQDPTEEENYRNEARNSESLDVDLNIISPVAEDPEETIRREGKQVVFTEYEESRSHVNDEDNQDVLQLM
           +G W+K        K SG  K+     +       E   ++  + E L   L I         E +    K  V  E +E       +D ++V  + 
Subjt:  ---FGMWVKFQGFNGRTKSSGSPKRNSDDPNLQDPTEEENYRNEARNSESLDVDLNIISPVAEDPEETIRREGKQVVFTEYEESRSHVNDEDNQDVLQLM

Query:  E----------KNQVWIIPSGFRWKGMKIKWIRVHQRKEEVGKG-ENDRIILTALLKKSQNCAKERGLGNPRAF---RAIKNLVLSRNPQVFFICKTKCD
        E          K   + I S  + K   +K  R   RK+   K  E D+  L A L K Q        G+P A    RA+  L+   NP + F+ +T+  
Subjt:  E----------KNQVWIIPSGFRWKGMKIKWIRVHQRKEEVGKG-ENDRIILTALLKKSQNCAKERGLGNPRAF---RAIKNLVLSRNPQVFFICKTKCD

Query:  ERVTERIKNACKFDGCLSVRSNGA----KGGLCLLWKVNELVSVQSFSSNHIDCNI--SWNDISWRFTRVYGYPKHSKKAFTWELIRNLREPSDTPWLLG
            ERIK  C F  CLSV   G+     GG+ LLW+    +SV +FS NHI C+I       +W  + +YG+P    K  TW+LI+ +       W+  
Subjt:  ERVTERIKNACKFDGCLSVRSNGA----KGGLCLLWKVNELVSVQSFSSNHIDCNI--SWNDISWRFTRVYGYPKHSKKAFTWELIRNLREPSDTPWLLG

Query:  GDWNESLSNEEKQGEPLRDINLIDGFRQCLGDCNLRDIRPNGELFTW
        GD N+ LS ++K G   R    +   RQ +  C L+D+  +G  +TW
Subjt:  GDWNESLSNEEKQGEPLRDINLIDGFRQCLGDCNLRDIRPNGELFTW

XP_030509295.1 uncharacterized protein LOC115723978 [Cannabis sativa]7.6e-5027.99Show/hide
Query:  LIGKLITNWFISKTAMKNSMEGAWKTRRDFKVDIIGTNIYLFIFESKKDRDWIATNDPW--------IINLPIG-----------------------FRN
        L+G+ +TN  I    M+N +   W+      V  +  N +LF F  + D   +    PW        I  L +G                          
Subjt:  LIGKLITNWFISKTAMKNSMEGAWKTRRDFKVDIIGTNIYLFIFESKKDRDWIATNDPW--------IINLPIG-----------------------FRN

Query:  ENIVKKIGNGLGGFLEQDNKRNYAQLGNSIRIRVQIDISKPLRRGF-MVKIPGSCEECWVTIRYERIPEFCFFCGKIGHAIKDCLK--PKDITEINKEDL
        E  V+ +GN +G F+E D K       + +R+RV +DI+KPLRR   + K  GS    WVT +YER P FCF CG IG+  K C K   +   ++ K   
Subjt:  ENIVKKIGNGLGGFLEQDNKRNYAQLGNSIRIRVQIDISKPLRRGF-MVKIPGSCEECWVTIRYERIPEFCFFCGKIGHAIKDCLK--PKDITEINKEDL

Query:  EF-------------GMWVKFQGFNGRTKSSGSPKRNSDDPNLQDPTE----------------EENYRNEARNSESLDVDLNIISPVAEDPEETIRREG
        EF               W++ +G+       G P    +  N    +                 + +  N   N  S + D ++ S    + E   + +G
Subjt:  EF-------------GMWVKFQGFNGRTKSSGSPKRNSDDPNLQDPTE----------------EENYRNEARNSESLDVDLNIISPVAEDPEETIRREG

Query:  KQVVFTEYEESRSHVNDEDNQDVLQLMEKNQVWIIPSGFRWKGMKI---KWIRVHQRKEEVGKGENDRIILTALLKKSQNCAKERGLGNPRAFRAIKNLV
        K ++    ++    V+D+++  V+   +K +V     G   KG  +   + + V  +K + G G       T +   S NC   RGLG  R  + +K LV
Subjt:  KQVVFTEYEESRSHVNDEDNQDVLQLMEKNQVWIIPSGFRWKGMKI---KWIRVHQRKEEVGKGENDRIILTALLKKSQNCAKERGLGNPRAFRAIKNLV

Query:  LSRNPQVFFICKTKCDERVTERIKNACKFDGCLSVRSNGAKGGLCLLWKVNELVSVQSFSSNHIDCNISW-NDISWRFTRVYGYPKHSKKAFTWELIRNL
          + P V F+C+T  ++   + +  + +F+G  +V ++G  GG+ LLWK  + V + SFS NHID  +++    S+RFT +YG P  S +  TW LI NL
Subjt:  LSRNPQVFFICKTKCDERVTERIKNACKFDGCLSVRSNGAKGGLCLLWKVNELVSVQSFSSNHIDCNISW-NDISWRFTRVYGYPKHSKKAFTWELIRNL

Query:  REPSDTPWLLGGDWNESLSNEEKQGEPLRDINLIDGFRQCLGDCNLRDIRPNGELFTW----PIEIWMGNR------GQRNNKSHDIQFKFEELWTKYEE
           S  PW + GD+N  L   EK G       LIDGF+  +  C L D+   G  FTW        W+  R        R NK     F+FE  W  +++
Subjt:  REPSDTPWLLGGDWNESLSNEEKQGEPLRDINLIDGFRQCLGDCNLRDIRPNGELFTW----PIEIWMGNR------GQRNNKSHDIQFKFEELWTKYEE

Query:  CADIIATNSDW
        C D+I   S W
Subjt:  CADIIATNSDW

TrEMBL top hitse value%identityAlignment
A0A2Z6LV25 Uncharacterized protein (Fragment)1.6e-5326.23Show/hide
Query:  LIGKLITNWFISKTAMKNSMEGAWKTRRDFKVDIIGTNIYLFIFESKKDRDWIATNDPW-------------------------------IINLPIGFRN
        L+GK+ T    +  A K +M  AW++R   ++  +  N++LF F ++++ D +  N PW                               + +LP+  R+
Subjt:  LIGKLITNWFISKTAMKNSMEGAWKTRRDFKVDIIGTNIYLFIFESKKDRDWIATNDPW-------------------------------IINLPIGFRN

Query:  ENIVKKIGNGLGGFLEQDNKRNYAQLGNSIRIRVQIDISKPLRRGFMVKIPGSCEECWVTIRYERIPEFCFFCGKIGHAIKDCLKPKDITEINKEDLE--
        E I KK+GN +G F E D K +  ++G  +R+RV +D+ KPL+RG  +   G  +E WV  +YER+P FCF CG+IGH ++DC   +D  E+   +LE  
Subjt:  ENIVKKIGNGLGGFLEQDNKRNYAQLGNSIRIRVQIDISKPLRRGFMVKIPGSCEECWVTIRYERIPEFCFFCGKIGHAIKDCLKPKDITEINKEDLE--

Query:  ---FGMWVKF-------------------------QGFNGRTKSSGSPKRNSDDPNLQDPTEEENYR---NEARNSESLDVDLNIISPVAEDPEETIRRE
           FG W++                             N R ++SG+ K  +++   Q  +++++     ++A  S+  D D      V ++ E  +   
Subjt:  ---FGMWVKF-------------------------QGFNGRTKSSGSPKRNSDDPNLQDPTEEENYR---NEARNSESLDVDLNIISPVAEDPEETIRRE

Query:  GKQVVFTEYE--ESRSHVNDEDNQDVLQLMEKNQVWIIPSGFRWKGMKIKWIRVHQRKEEVGKGENDRIILTALLKKSQNCAKERGLGN-------PRAF
        G  V+ T+ +  E   HV +           K + W+     R +  K K  +  +   + GK     +++T    +     +++  G+           
Subjt:  GKQVVFTEYE--ESRSHVNDEDNQDVLQLMEKNQVWIIPSGFRWKGMKIKWIRVHQRKEEVGKGENDRIILTALLKKSQNCAKERGLGN-------PRAF

Query:  RAIKNLVLSRNPQVFFICKTKCDERVTERIKNACKFDGCLSVRSNG----AKGGLCLLWKVNELVSVQSFSSNHID--CNISWNDISWRFTRVYGYPKHS
        RA+  L    NPQ+ F+ +T+      E I++   F  CL V  NG      GGL L+W     V++ S+S NHI   C+   +  SW  T +YGYP+  
Subjt:  RAIKNLVLSRNPQVFFICKTKCDERVTERIKNACKFDGCLSVRSNG----AKGGLCLLWKVNELVSVQSFSSNHID--CNISWNDISWRFTRVYGYPKHS

Query:  KKAFTWELIRNLREPSDTPWLLGGDWNESLSNEEKQGEPLRDINLIDGFRQCLGDCNLRDIR-------------PNGELFTWPIEIWMGNRGQRNNKSH
         K  TW+LIR+L   +   WL  GD+N+ L + EKQG  +R  + +   RQ + DC+  D+              P        + I +     R+ +  
Subjt:  KKAFTWELIRNLREPSDTPWLLGGDWNESLSNEEKQGEPLRDINLIDGFRQCLGDCNLRDIR-------------PNGELFTWPIEIWMGNRGQRNNKSH

Query:  DIQFKFEELWTKYEECADIIATNSDWTGKKSNEMLGITDSNRLWTEDH
           F+FEE WTK  +C ++I  N   +    ++ L   +S     EDH
Subjt:  DIQFKFEELWTKYEECADIIATNSDWTGKKSNEMLGITDSNRLWTEDH

A0A392M033 CCHC-type domain-containing protein (Fragment)1.8e-5727.58Show/hide
Query:  LIGKLITNWFISKTAMKNSMEGAWKTRRDFKVDIIGTNIYLFIFESKKDRDWIATNDPW-------------------------------IINLPIGFRN
        L+GK+ T    +  A K +M  AW++R   ++  +  N+YLF F +K++ D +  N PW                               + +LP+  R+
Subjt:  LIGKLITNWFISKTAMKNSMEGAWKTRRDFKVDIIGTNIYLFIFESKKDRDWIATNDPW-------------------------------IINLPIGFRN

Query:  ENIVKKIGNGLGGFLEQDNKRNYAQLGNSIRIRVQIDISKPLRRGFMVKIPGSCEECWVTIRYERIPEFCFFCGKIGHAIKDCLKPKDITEINKEDLE--
        E I KK+GN +G F E D K    ++G  +R+RV +D+ KPL+RG  +   G  +E WV  +YER+P FCF CG+IGH ++DC   +D  E+   +LE  
Subjt:  ENIVKKIGNGLGGFLEQDNKRNYAQLGNSIRIRVQIDISKPLRRGFMVKIPGSCEECWVTIRYERIPEFCFFCGKIGHAIKDCLKPKDITEINKEDLE--

Query:  ---FGMWVKFQGFNGRTKSSGSPKRNSDDPNLQDPTEEENYRNEARNS-ESLDVDLNIISPVAEDPEETIRREGKQVVFTEYE-ESRSHVNDEDNQDVL-
           FG W++        K S   K+ S   N           ++ +NS  + ++D  +      D + + ++  K  +  + E ++    N +   +V+ 
Subjt:  ---FGMWVKFQGFNGRTKSSGSPKRNSDDPNLQDPTEEENYRNEARNS-ESLDVDLNIISPVAEDPEETIRREGKQVVFTEYE-ESRSHVNDEDNQDVL-

Query:  ------------QLMEKNQVWIIPSGFRWKGMKIKWIRVHQRKEEVGK-GEN---DRIILTALLKKSQNCAKE-RG----------LGNPRAFRAIKNLV
                    ++ E  Q      G RW   K+   +  +  + VGK G+    D ++    ++  +   K+ RG           G+PRA RA+  L 
Subjt:  ------------QLMEKNQVWIIPSGFRWKGMKIKWIRVHQRKEEVGK-GEN---DRIILTALLKKSQNCAKE-RG----------LGNPRAFRAIKNLV

Query:  LSRNPQVFFICKTKCDERVTERIKNACKFDGCLSVRSNG----AKGGLCLLWKVNELVSVQSFSSNHID--CNISWNDISWRFTRVYGYPKHSKKAFTWE
           NPQ+ F+ +T+      E I++   F  CL+V  NG      GGL L+W  +  V++ SFS NHI   C    +  SW  T +YGYP+   K  TW 
Subjt:  LSRNPQVFFICKTKCDERVTERIKNACKFDGCLSVRSNG----AKGGLCLLWKVNELVSVQSFSSNHID--CNISWNDISWRFTRVYGYPKHSKKAFTWE

Query:  LIRNLREPSDTPWLLGGDWNESLSNEEKQGEPLRDINLIDGFRQCLGDCNLRDIRPNGELFTW----------------------------PIEIWMGNR
        LIR+L   +   WL  GD+N+ L + EKQG  +R        RQ + D +L D+   G  FTW                            PI++   N 
Subjt:  LIRNLREPSDTPWLLGGDWNESLSNEEKQGEPLRDINLIDGFRQCLGDCNLRDIRPNGELFTW----------------------------PIEIWMGNR

Query:  GQRNNKSHDI------------------QFKFEELWTKYEECADIIATN
         QR    H                     F+FEE WTK  +C ++I +N
Subjt:  GQRNNKSHDI------------------QFKFEELWTKYEECADIIATN

A0A803PLV0 Uncharacterized protein2.0e-5629.33Show/hide
Query:  LIGKLITNWFISKTAMKNSMEGAWKTRRDFKVDIIGTNIYLFIFESKKDRDWIATNDPWIIN-------------------------------LPIGFRN
        L+G+ +++  +    M+N +   WK      V  +  + YLF F  + D   + T  PW  N                               L  G  +
Subjt:  LIGKLITNWFISKTAMKNSMEGAWKTRRDFKVDIIGTNIYLFIFESKKDRDWIATNDPWIIN-------------------------------LPIGFRN

Query:  ENIVKKIGNGLGGFLEQDNKRNYAQLGNSIRIRVQIDISKPLRRGFMVKIPGSCEECWVTIRYERIPEFCFFCGKIGHAIKDCLK--PKDITEINKEDLE
         ++++ + N +G F+E D K       +  R+RV IDI+KPLRR   +  P      W T +YER P FCF CG IGH+ K C+K   K + +I K   E
Subjt:  ENIVKKIGNGLGGFLEQDNKRNYAQLGNSIRIRVQIDISKPLRRGFMVKIPGSCEECWVTIRYERIPEFCFFCGKIGHAIKDCLK--PKDITEINKEDLE

Query:  F-------------GMWVKFQGFNG---RTKSSGSPK-RNSDDPNLQDPTEEENYRNEARNSESLDVDLNIISPVAEDPEETIRREGKQVVFTEYEESRS
        F               W++ +G+       +SSG  + R +DD N++         N   +   +  D+N               +G  V   +  ++  
Subjt:  F-------------GMWVKFQGFNG---RTKSSGSPK-RNSDDPNLQDPTEEENYRNEARNSESLDVDLNIISPVAEDPEETIRREGKQVVFTEYEESRS

Query:  H-------VNDEDNQDVLQLMEKNQVWIIPSGFRWKGMKIKWIRVHQRKEEVGKGENDRIILTALLKKSQNCAKERGLGNPRAFRAIKNLVLSRNPQVFF
        +       +N+E N+   ++ +K  V II +  R    +++       KE  G G            +  NC   RGLGNPRAF+ I  LV  + P V F
Subjt:  H-------VNDEDNQDVLQLMEKNQVWIIPSGFRWKGMKIKWIRVHQRKEEVGKGENDRIILTALLKKSQNCAKERGLGNPRAFRAIKNLVLSRNPQVFF

Query:  ICKTKCDERVTERIKNACKFDGCLSVRSNGAKGGLCLLWKVNELVSVQSFSSNHIDCNISW-NDISWRFTRVYGYPKHSKKAFTWELIRNLREPSDTPWL
        +C+T C +   ER+ +   F+GC  V  +G  GG+ +LWK  + VS+ SFS NHID  + W  +  +R T +YG P  + +A TW L+R L+E S  PW 
Subjt:  ICKTKCDERVTERIKNACKFDGCLSVRSNGAKGGLCLLWKVNELVSVQSFSSNHIDCNISW-NDISWRFTRVYGYPKHSKKAFTWELIRNLREPSDTPWL

Query:  LGGDWNESLSNEEKQGEPLRDINLIDGFRQCLGDCNLRDIRPNGELFTW
        L GD+N  +   EK+G       L++GFR  +GDC L+D++  G  FTW
Subjt:  LGGDWNESLSNEEKQGEPLRDINLIDGFRQCLGDCNLRDIRPNGELFTW

A0A803PLV0 Uncharacterized protein7.5e-0335.09Show/hide
Query:  ILKSLYYPDSNILEPELGRLPSYLWKSMLWGRELLLKGLRYKIGNGEETFMFRDPWI
        I ++ YYP+ + L  E+G  PS++W+S+L  ++L+  G+R +IG G    +  +PW+
Subjt:  ILKSLYYPDSNILEPELGRLPSYLWKSMLWGRELLLKGLRYKIGNGEETFMFRDPWI

A0A803PLV0 Uncharacterized protein7.2e-5427.96Show/hide
Query:  SDLREQNLCCSKAGSTTATLQLVSITATSQLEPNSGEVGQYLIGKLITNWFISKTAMKNSMEGAWKTRRDFKVDIIGTNIYLFIFESKKDRDWIATNDPW
        S++ E+      A     +L++  +     ++P        L+G+ ++   +    M+N++   W+      V ++  N+YLF F  + D   +    PW
Subjt:  SDLREQNLCCSKAGSTTATLQLVSITATSQLEPNSGEVGQYLIGKLITNWFISKTAMKNSMEGAWKTRRDFKVDIIGTNIYLFIFESKKDRDWIATNDPW

Query:  IIN-------------------------------LPIGFRNENIVKKIGNGLGGFLEQDNKRNYAQLGNSIRIRVQIDISKPLRRGFM-VKIPGSCEECW
          N                               +  G      V+ IGN LG F+E D K       + +R+RV  +I KPLRR     K  G+    W
Subjt:  IIN-------------------------------LPIGFRNENIVKKIGNGLGGFLEQDNKRNYAQLGNSIRIRVQIDISKPLRRGFM-VKIPGSCEECW

Query:  VTIRYERIPEFCFFCGKIGHAIKDC--LKPKDITEINKEDLEFGMWVKFQGFNG-------RTK---------SSGSPKRNSDDPNLQDPTEEENYRNEA
        V+ +YER+P FC+ CG IGH+ + C  L  + I  I K   E  M  KFQ  N        RTK           G  +     PN+ + ++  N  N +
Subjt:  VTIRYERIPEFCFFCGKIGHAIKDC--LKPKDITEINKEDLEFGMWVKFQGFNG-------RTK---------SSGSPKRNSDDPNLQDPTEEENYRNEA

Query:  RNSESLDVDLNIISPVAEDPEETIRREGKQVVFTEYEESRSHVNDEDNQDVLQLMEKNQVWIIPSGFRWKGMKIKWIRVHQRKEEVGKGENDRIILTALL
         N   + V L   +   +  +  +    KQ+V  + ++    +  ED+  ++   +K +        + +G   +W                 II+  L 
Subjt:  RNSESLDVDLNIISPVAEDPEETIRREGKQVVFTEYEESRSHVNDEDNQDVLQLMEKNQVWIIPSGFRWKGMKIKWIRVHQRKEEVGKGENDRIILTALL

Query:  KKSQNCAKERGLGNPRAFRAIKNLVLSRNPQVFFICKTKCDERVTERIKNACKFDGCLSVRSNGAKGGLCLLWKVNELVSVQSFSSNHIDCNISWNDI-S
          S NC   RGLG P   + +K+LV  + P + F+C+T CD++  E +     F+GC  V + G  GG+ LLW+ +  V + SFS NHIDC +S N + +
Subjt:  KKSQNCAKERGLGNPRAFRAIKNLVLSRNPQVFFICKTKCDERVTERIKNACKFDGCLSVRSNGAKGGLCLLWKVNELVSVQSFSSNHIDCNISWNDI-S

Query:  WRFTRVYGYPKHSKKAFTWELIRNLREPSDTPWLLGGDWNESLSNEEKQGEPLRDINLIDGFRQCLGDCNLRDIRPNGELFTW
        +R T +YG P  +++  TW+LIR L+E S  PW+L GD+N  LS  +K+G       LIDGF + L  CNL D+   G  +TW
Subjt:  WRFTRVYGYPKHSKKAFTWELIRNLREPSDTPWLLGGDWNESLSNEEKQGEPLRDINLIDGFRQCLGDCNLRDIRPNGELFTW

A0A803Q8J6 Uncharacterized protein1.0e-5228.46Show/hide
Query:  LIGKLITNWFISKTAMKNSMEGAWKTRRDFKVDIIGTNIYLFIFESKKDRDWIATNDPWIINLPIGFRNENIVKKIGNGLGGFLEQDNKRNYAQLGNSIR
        L+G+ +T+  I   AM++ M   W+  R   V  +G + +LF F  + D + +            GF++  +VK IGN +G F+E D         + +R
Subjt:  LIGKLITNWFISKTAMKNSMEGAWKTRRDFKVDIIGTNIYLFIFESKKDRDWIATNDPWIINLPIGFRNENIVKKIGNGLGGFLEQDNKRNYAQLGNSIR

Query:  IRVQIDISKPLRRGFMVKIPGSCEECWVTIRYERIPEFCFFCGKIGHAIKDCLKPKDITEINKEDLEFGMWVKFQGFNGRTKSSGSPKRNSDDPNLQDPT
        +RV + + KPL+R   +++ G    C+ T +YE +  FCF CG +GH+ + C K  DI    +  +     ++ +    R + +   K    +  +++ +
Subjt:  IRVQIDISKPLRRGFMVKIPGSCEECWVTIRYERIPEFCFFCGKIGHAIKDCLKPKDITEINKEDLEFGMWVKFQGFNGRTKSSGSPKRNSDDPNLQDPT

Query:  E-----EENYRNEARNSESLDVDLNIISPVAEDPEETIRREGKQVVFTEYEESRSHVNDEDNQDVLQLME-KNQVW---------IIPSGFRWKGMKIKW
        +     +    +E  N ES+ +++N    V E      R +GK ++    EE+    N+     V Q +E  N +W         +I  G  W   K + 
Subjt:  E-----EENYRNEARNSESLDVDLNIISPVAEDPEETIRREGKQVVFTEYEESRSHVNDEDNQDVLQLME-KNQVW---------IIPSGFRWKGMKIKW

Query:  IRVHQRKEEVGKGENDRIILTALLKKSQNCAKERGLGNPRAFRAIKNLVLSRNPQVFFICKTKCDERVTERIKNACKFDGCLSVRSNGAKGGLCLLWKVN
        +    +   +        I+  L   S NC   RGLGNP   + +K ++  + P   F+C+TKC +   + + ++  F+G   V ++G  GGL L WK  
Subjt:  IRVHQRKEEVGKGENDRIILTALLKKSQNCAKERGLGNPRAFRAIKNLVLSRNPQVFFICKTKCDERVTERIKNACKFDGCLSVRSNGAKGGLCLLWKVN

Query:  ELVSVQSFSSNHIDCNI-SWNDISWRFTRVYGYPKHSKKAFTWELIRNLREPSDTPWLLGGDWNESLSNEEKQGEPLRDINLIDGFRQCLGDCNLRDIRP
        E   +  +S NHID  + S N+  WR T +YG P+  ++  TW L+R+L   S  PW + GD N  ++ E+K+G       L++GF+Q L DC+LRD+  
Subjt:  ELVSVQSFSSNHIDCNI-SWNDISWRFTRVYGYPKHSKKAFTWELIRNLREPSDTPWLLGGDWNESLSNEEKQGEPLRDINLIDGFRQCLGDCNLRDIRP

Query:  NGELFTWPIEIWMGNRGQRN
         G  FTW       NRGQ N
Subjt:  NGELFTWPIEIWMGNRGQRN

A0A803Q8J6 Uncharacterized protein1.8e-0440.35Show/hide
Query:  ILKSLYYPDSNILEPELGRLPSYLWKSMLWGRELLLKGLRYKIGNGEETFMFRDPWI
        I ++ Y+ + N L  +LG  PSY+W+S+L  ++L+ KG+R+KIGNG +  +  +PW+
Subjt:  ILKSLYYPDSNILEPELGRLPSYLWKSMLWGRELLLKGLRYKIGNGEETFMFRDPWI

A0A803Q8J6 Uncharacterized protein1.8e-0125.76Show/hide
Query:  WTPPPDGFWNLNSDAACLPSSPTFGLGSICRNMRGDLMAASSCYFDFCMNPLRAELKAISEVMMLASSLGCSKLKVESDNQMAINFIMRKSDVWSDVEAE
        W  P      +N DAA       FG+G + R+ RG L+ A +  F+  + PL AE   + E +         ++ +E+D    +  +   S++ S   A 
Subjt:  WTPPPDGFWNLNSDAACLPSSPTFGLGSICRNMRGDLMAASSCYFDFCMNPLRAELKAISEVMMLASSLGCSKLKVESDNQMAINFIMRKSDVWSDVEAE

Query:  VEKIWDLTSSFENLDFSYISRSCNNVADSLAK
        +++   L    EN+   ++ RS N VA + A+
Subjt:  VEKIWDLTSSFENLDFSYISRSCNNVADSLAK

SwissProt top hitse value%identityAlignment
P93295 Uncharacterized mitochondrial protein AtMg003103.7e-0745.61Show/hide
Query:  ILKSLYYPDSNILEPELGRLPSYLWKSMLWGRELLLKGLRYKIGNGEETFMFRDPWI
        +L+S Y+P S+++E  +G  PSY W+S++ GRELL +GL   IG+G  T ++ D WI
Subjt:  ILKSLYYPDSNILEPELGRLPSYLWKSMLWGRELLLKGLRYKIGNGEETFMFRDPWI

Arabidopsis top hitse value%identityAlignment
AT4G09490.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.3e-0433.33Show/hide
Query:  SDAACLPSSPTFGLGSICRNMRGDLMAASSCYFDFCMN---PLRAELKAISEVMMLASSLGCSKLKVESDNQMAINFIMRKSDVWSDVEAEVEKIWDLTS
        +DAA    +   G G + RN   +L A    Y     N   PL AE  A+   +  A S+G +KL + SD+Q  I  I  +S   ++    +  I +L+ 
Subjt:  SDAACLPSSPTFGLGSICRNMRGDLMAASSCYFDFCMN---PLRAELKAISEVMMLASSLGCSKLKVESDNQMAINFIMRKSDVWSDVEAEVEKIWDLTS

Query:  SFENLDFSYISRSCNNVADSLAK
         F ++ FS++ RS N VAD LAK
Subjt:  SFENLDFSYISRSCNNVADSLAK

AT4G29090.1 Ribonuclease H-like superfamily protein1.5e-0822.56Show/hide
Query:  LPSYLWKSMLWGRELLLKGLRYKIGNGEETFMFRDPWIPKSRLRSNV---------------TWTPPPDGFWNLNSDAACLPSSPTFGLGSICRNMRGDL
        +P  LW+      EL+ +G  +   N +E     +  + + R+R+                  W PPP  +   N+DA     +   G+G + RN +G++
Subjt:  LPSYLWKSMLWGRELLLKGLRYKIGNGEETFMFRDPWIPKSRLRSNV---------------TWTPPPDGFWNLNSDAACLPSSPTFGLGSICRNMRGDL

Query:  MAASSCYFDFCMNPLRAELKAISEVMMLASSLGCSKLKVESDNQMAINFIMRKSDVWSDVEAEVEKIWDLTSSFENLDFSYISRSCNNVADSLAK
            +       + L AEL+A+   ++  S    + +  ESD+Q+ I  I+   ++W  ++  ++ +  L S F  + F +I R  N +A+ +A+
Subjt:  MAASSCYFDFCMNPLRAELKAISEVMMLASSLGCSKLKVESDNQMAINFIMRKSDVWSDVEAEVEKIWDLTSSFENLDFSYISRSCNNVADSLAK

AT4G29090.1 Ribonuclease H-like superfamily protein4.2e-0638.6Show/hide
Query:  ILKSLYYPDSNILEPELGRLPSYLWKSMLWGRELLLKGLRYKIGNGEETFMFRDPWI
        + KS Y+  S+ L   LG  PS++WKS+   +E+L +G R  +GNGE+  ++R  W+
Subjt:  ILKSLYYPDSNILEPELGRLPSYLWKSMLWGRELLLKGLRYKIGNGEETFMFRDPWI

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.6e-0845.61Show/hide
Query:  ILKSLYYPDSNILEPELGRLPSYLWKSMLWGRELLLKGLRYKIGNGEETFMFRDPWI
        +L+S Y+P S+++E  +G  PSY W+S++ GRELL +GL   IG+G  T ++ D WI
Subjt:  ILKSLYYPDSNILEPELGRLPSYLWKSMLWGRELLLKGLRYKIGNGEETFMFRDPWI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAGACGTGGCCTGCAAAACAAGGGAAACTGTGCACCGGTGTGGTGCTCGCCACACCGACTCCGATGCTTAACTACATAAGTTCTCCTGAGTTCAAATATTCCAA
GGCTGGCCTTAGATGCTTTATCATGAGGGTCTCCTTTTCATTTGGTTATGGTCGGCTTCAAACTTCGGATGATCAACTTCAACCCTCTGAGGGTTGTGGAGATGCGTCAA
ATCCTATGGAAAAATGTTGGGCCTTTGAGGGTGGAGCTCGGTCTTGGCCTCTGGATGGTCGACCTCGGCCTTGGCATGAGGCCGATCAGCTCCTCTGCGTTGTTTGCTTG
CTTGTTTTCCATCCCAAGGGTGTCTGCACTCAGTTAGGGACTTGGGGAGTGGTGTGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGTAGTTTCCGAGCCTCTCT
TCGAGCATCGTGGTCATCCCAGAGGGGTGCGTACACTCTTCTGGGGAAAACTTGGGGAGGCCTAGGAAGTTCCCGGGCCTCTCTTCAAGCATCGTGGTCATCCCAGGGGT
GCGTACACTCTTCTGGGGAAAACTTGGGGAGGCCTAGGAAGTTCTTGGGCCTCTCTTCAAGCATCTTGGTCATCCTAGGGGTGCGTACACTCTTCTGGGGAAAAGCTTGG
GGAGGTCTAGGAAGTTCTCGGGTCTCTCTTCAAGCATCGTGGTCATCCCAGGGGGGTGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGAAGTTCTCGGGCCT
CTCTTCAAGCATTGTGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGAAGTTCTCGTGCCTCTCTTCAAGCATCGTGGTCATCCCAGG
GGTGCGTACACTCTTCTAGGGAAAAGCTTGGGGTCTCATCCCGAGGGTGCGTACACTCAGTCGAGGTCTTGGAATCTCATCTCGAGGGTGCTACACTCAGTCGAGGTCTT
GGAATCTCATCTCGAGGGTGTGTACACTCAGTCGAGGTCTTGGAATTTCATCTCGACGGTGCGTACACTCGATCGAGGGCTTTGAAGGACCCAACCTCTGCAAGGGATGA
ATATTCGAAAGCAAACAAGATGGAAGCCAACGCTCTAATTCTTGACTCATCCGCCACCGTCAATCCACTTATTCAAGTAAATTTCGTGACACACATACACGATCGCCCAT
CAGTGAAGAATTCGAATGTGAGGGCATTGTGGTGCGATGGAGAGGGAATAGCCCTAGAGATTGATTCGGGCCCAATGGGCCTCCGTAGGCTTGTTTCTTCTTTGGGCTTT
TATCATTCTGACTTTCTTAGAGCTAGGTCCAAATTCACCCATAACAGAAAAAATATATTGCTGAGCGACTTGAGGGAGCAAAATCTGTGCTGCAGCAAAGCTGGTAGCAC
AACTGCCACGTTACAGCTCGTTAGCATAACTGCCACGTCACAGCTCGAGCCCAACAGTGGTGAAGTTGGCCAATACCTGATTGGCAAACTCATCACAAACTGGTTCATCT
CAAAGACTGCAATGAAGAATTCGATGGAAGGTGCCTGGAAAACAAGAAGAGATTTTAAGGTTGATATTATTGGAACCAATATTTACCTGTTCATATTTGAAAGCAAGAAG
GACAGAGACTGGATAGCAACCAACGACCCATGGATTATAAATCTTCCCATCGGTTTCCGGAATGAAAACATAGTAAAGAAGATTGGTAATGGCCTTGGAGGTTTTTTGGA
GCAGGACAATAAGAGAAATTACGCACAGTTGGGAAATAGTATAAGGATCAGAGTTCAAATAGATATCTCAAAGCCACTACGAAGGGGGTTTATGGTGAAAATACCCGGTT
CTTGCGAGGAATGTTGGGTTACAATACGATACGAAAGAATTCCAGAATTTTGTTTCTTCTGTGGCAAAATTGGTCATGCTATTAAAGATTGCCTAAAACCAAAAGATATC
ACAGAGATAAATAAAGAAGACTTGGAATTTGGCATGTGGGTGAAATTTCAAGGTTTTAATGGAAGAACAAAAAGCTCTGGCTCCCCAAAAAGGAATTCGGATGATCCTAA
CTTGCAAGACCCAACTGAAGAAGAAAATTACAGGAATGAAGCTAGAAATAGTGAAAGTCTAGATGTTGACCTTAATATAATTAGCCCTGTAGCCGAAGATCCTGAAGAAA
CCATAAGGAGGGAAGGGAAACAAGTTGTTTTTACAGAGTATGAGGAAAGCAGGAGTCATGTTAATGATGAAGACAACCAGGATGTTCTCCAATTAATGGAGAAGAATCAA
GTTTGGATAATACCATCAGGTTTTCGTTGGAAAGGGATGAAAATAAAATGGATACGGGTCCATCAAAGAAAAGAAGAGGTTGGAAAAGGAGAGAACGACAGGATTATTTT
GACAGCTCTCCTAAAGAAGTCCCAAAACTGCGCAAAAGAAAGGGGGTTGGGGAACCCAAGAGCATTCCGTGCGATTAAGAACCTTGTCTTGTCAAGGAATCCCCAAGTTT
TCTTTATCTGCAAGACAAAGTGTGATGAAAGAGTGACAGAGAGAATTAAAAATGCTTGTAAGTTCGATGGATGTCTCAGTGTTAGGAGCAATGGCGCTAAAGGAGGTCTT
TGTTTGTTATGGAAAGTGAATGAGTTAGTCTCAGTTCAATCTTTTTCTAGCAACCACATTGATTGCAACATCTCATGGAATGACATTAGTTGGAGATTTACCAGAGTATA
TGGGTACCCTAAACATAGTAAAAAAGCCTTCACTTGGGAATTAATCCGCAACCTTAGAGAGCCATCGGATACTCCTTGGCTGTTAGGGGGAGATTGGAACGAAAGCCTGA
GCAATGAAGAAAAACAGGGTGAGCCTTTAAGAGATATAAATCTGATTGATGGCTTCAGGCAATGTCTTGGCGATTGTAATCTTAGAGATATCAGACCAAATGGGGAACTG
TTCACATGGCCAATTGAAATTTGGATGGGAAACAGAGGACAGAGAAATAACAAAAGTCATGATATCCAATTTAAATTCGAGGAGCTTTGGACAAAATATGAGGAATGTGC
AGACATAATAGCAACCAATAGTGATTGGACAGGAAAAAAATCTAATGAGATGTTGGGTATAACTGATTCAAACAGGCTGTGGACTGAAGATCATCTAGAGATTGAACAAA
CTTTTACTGTCTATTTTGAAAATTTGTTTAGTACTTCAAATCCAGATCCTTCCTCCATAATTCTAGCATTAGATGGAATTATTCTTAAAAGCTTGTACTACCCTGATTCT
AATATTCTTGAGCCTGAGCTGGGTAGGCTTCCTTCCTACCTTTGGAAAAGTATGCTTTGGGGTAGAGAACTCTTACTCAAAGGGCTGAGATATAAAATAGGTAATGGGGA
GGAGACTTTCATGTTTAGGGATCCCTGGATTCCCAAGTCCAGATTGAGAAGTAATGTTACTTGGACACCTCCTCCCGATGGATTCTGGAACTTAAACTCAGATGCTGCAT
GTCTTCCCTCTTCCCCAACTTTTGGCTTGGGCTCGATTTGCAGGAATATGCGCGGTGATCTTATGGCTGCTTCTTCTTGCTATTTTGATTTTTGCATGAATCCCCTTCGG
GCCGAACTAAAAGCCATTTCTGAGGTAATGATGCTTGCCTCCTCTTTGGGATGTTCGAAGTTGAAGGTTGAATCAGATAATCAAATGGCCATAAATTTCATTATGAGAAA
ATCAGATGTGTGGAGTGATGTGGAAGCCGAGGTGGAAAAAATATGGGATTTAACCTCTTCTTTTGAAAACCTCGATTTTTCTTACATTTCAAGGAGCTGCAATAATGTTG
CTGACTCATTAGCTAAATTTGCTCGGTCCTTAAAGGTCAATATGTCCTGGGTTGGTCAATTCCCACCCTTGAGGTTAGCTCTACATGCACGTATGACAGAGATGACTCTA
CTACGCATTTCGAGATTACTCTATGCACAACTTGACAGAGATTGCTCTGCTGAGCATATTGAGGTTGCTCTTGAGTTTCAACCAAAGTTGATTGAACGGAGTAAGTTAGC
TAAGGTCACGTCTTCTTCAGCTTCTACAAATTCACTGTTGGTGTCACGTGAAGGTCAGGTGATTTTGGACCACACAGAGGGACAAGGAGCTGAGGAGGACAATCGGGCAG
AGGTAGGACCAAAAGCCCGACCCAGAGGAAGACCGGACCAAAGGGTTGGGCCAAAATGGCCCGACCCATATGGTCGGCCTCGGCAAAAGGCCGAGGCCGACCATTCGGCC
CGTTTGCGCGGGCCGAGCCCGGTGACCTCTCTTCGGTCCCTGATGCCCCGAATCGCCCCGGTTCCGCCTGCTTCTCCTCGGTTTTCTGACTTAGGCATCGGAGGCGGTGT
GGCCTACACCACGCCGGTGTGCAGCGATTTTTGCTGGTCTTGCAGTTGGCGTCGTCTGTGGGGAAGAGTGTTTGCCAGTTCAGACCACGCATCGGTGGAGTTCTATAGGA
TGAATCGGGTTCAGAAACTAGGACACCTCGGGTATGAGCAGAAAGCGCCGAATCAGACTGAGGTCGGCCTCGGTATGGAAAAGGCCGACCCTAACTCCGGAGGCTGTTGC
GAGCAGAAAGCGCCGAATCAGATTGAGGTCAGCCTCGGTATGGAAAAGGCCGACCCTAGCTCCGGAGGCTGTTGCGAGGTTCCGCATGTCAGCGATGCTGCTGGACTTGA
CCCAAGGGGACAACCTGCGGAGGAACTGGAATCTGTACCACTAACGTCTGAGGAAAGGAGAGTGAACATCGGCACCAAATTGGGGGCTGAGCAGAGGGGCAGACTGGTTG
GTTTCTTGAGGGCTAATGCAGATCTGTTTGCATGGTCACACGAGGATATGCCGGGAATTGACCCGGGAGTGATGGTTCACCAGTTGAACGTGGATAGGGATTTCAAGCCA
GTAAAGCAGAGGCGAAGAACATTTAATTCGGAGAGGAATGAGGTTGTTGCCAGTCAATGGATGAGCTACTTAAAGCAGGGTTCATCCGAGAAGTTCATTATCCCCAATGG
TTGTCCAATGTGGTACTGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAAGACGTGGCCTGCAAAACAAGGGAAACTGTGCACCGGTGTGGTGCTCGCCACACCGACTCCGATGCTTAACTACATAAGTTCTCCTGAGTTCAAATATTCCAA
GGCTGGCCTTAGATGCTTTATCATGAGGGTCTCCTTTTCATTTGGTTATGGTCGGCTTCAAACTTCGGATGATCAACTTCAACCCTCTGAGGGTTGTGGAGATGCGTCAA
ATCCTATGGAAAAATGTTGGGCCTTTGAGGGTGGAGCTCGGTCTTGGCCTCTGGATGGTCGACCTCGGCCTTGGCATGAGGCCGATCAGCTCCTCTGCGTTGTTTGCTTG
CTTGTTTTCCATCCCAAGGGTGTCTGCACTCAGTTAGGGACTTGGGGAGTGGTGTGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGTAGTTTCCGAGCCTCTCT
TCGAGCATCGTGGTCATCCCAGAGGGGTGCGTACACTCTTCTGGGGAAAACTTGGGGAGGCCTAGGAAGTTCCCGGGCCTCTCTTCAAGCATCGTGGTCATCCCAGGGGT
GCGTACACTCTTCTGGGGAAAACTTGGGGAGGCCTAGGAAGTTCTTGGGCCTCTCTTCAAGCATCTTGGTCATCCTAGGGGTGCGTACACTCTTCTGGGGAAAAGCTTGG
GGAGGTCTAGGAAGTTCTCGGGTCTCTCTTCAAGCATCGTGGTCATCCCAGGGGGGTGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGAAGTTCTCGGGCCT
CTCTTCAAGCATTGTGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGAAGTTCTCGTGCCTCTCTTCAAGCATCGTGGTCATCCCAGG
GGTGCGTACACTCTTCTAGGGAAAAGCTTGGGGTCTCATCCCGAGGGTGCGTACACTCAGTCGAGGTCTTGGAATCTCATCTCGAGGGTGCTACACTCAGTCGAGGTCTT
GGAATCTCATCTCGAGGGTGTGTACACTCAGTCGAGGTCTTGGAATTTCATCTCGACGGTGCGTACACTCGATCGAGGGCTTTGAAGGACCCAACCTCTGCAAGGGATGA
ATATTCGAAAGCAAACAAGATGGAAGCCAACGCTCTAATTCTTGACTCATCCGCCACCGTCAATCCACTTATTCAAGTAAATTTCGTGACACACATACACGATCGCCCAT
CAGTGAAGAATTCGAATGTGAGGGCATTGTGGTGCGATGGAGAGGGAATAGCCCTAGAGATTGATTCGGGCCCAATGGGCCTCCGTAGGCTTGTTTCTTCTTTGGGCTTT
TATCATTCTGACTTTCTTAGAGCTAGGTCCAAATTCACCCATAACAGAAAAAATATATTGCTGAGCGACTTGAGGGAGCAAAATCTGTGCTGCAGCAAAGCTGGTAGCAC
AACTGCCACGTTACAGCTCGTTAGCATAACTGCCACGTCACAGCTCGAGCCCAACAGTGGTGAAGTTGGCCAATACCTGATTGGCAAACTCATCACAAACTGGTTCATCT
CAAAGACTGCAATGAAGAATTCGATGGAAGGTGCCTGGAAAACAAGAAGAGATTTTAAGGTTGATATTATTGGAACCAATATTTACCTGTTCATATTTGAAAGCAAGAAG
GACAGAGACTGGATAGCAACCAACGACCCATGGATTATAAATCTTCCCATCGGTTTCCGGAATGAAAACATAGTAAAGAAGATTGGTAATGGCCTTGGAGGTTTTTTGGA
GCAGGACAATAAGAGAAATTACGCACAGTTGGGAAATAGTATAAGGATCAGAGTTCAAATAGATATCTCAAAGCCACTACGAAGGGGGTTTATGGTGAAAATACCCGGTT
CTTGCGAGGAATGTTGGGTTACAATACGATACGAAAGAATTCCAGAATTTTGTTTCTTCTGTGGCAAAATTGGTCATGCTATTAAAGATTGCCTAAAACCAAAAGATATC
ACAGAGATAAATAAAGAAGACTTGGAATTTGGCATGTGGGTGAAATTTCAAGGTTTTAATGGAAGAACAAAAAGCTCTGGCTCCCCAAAAAGGAATTCGGATGATCCTAA
CTTGCAAGACCCAACTGAAGAAGAAAATTACAGGAATGAAGCTAGAAATAGTGAAAGTCTAGATGTTGACCTTAATATAATTAGCCCTGTAGCCGAAGATCCTGAAGAAA
CCATAAGGAGGGAAGGGAAACAAGTTGTTTTTACAGAGTATGAGGAAAGCAGGAGTCATGTTAATGATGAAGACAACCAGGATGTTCTCCAATTAATGGAGAAGAATCAA
GTTTGGATAATACCATCAGGTTTTCGTTGGAAAGGGATGAAAATAAAATGGATACGGGTCCATCAAAGAAAAGAAGAGGTTGGAAAAGGAGAGAACGACAGGATTATTTT
GACAGCTCTCCTAAAGAAGTCCCAAAACTGCGCAAAAGAAAGGGGGTTGGGGAACCCAAGAGCATTCCGTGCGATTAAGAACCTTGTCTTGTCAAGGAATCCCCAAGTTT
TCTTTATCTGCAAGACAAAGTGTGATGAAAGAGTGACAGAGAGAATTAAAAATGCTTGTAAGTTCGATGGATGTCTCAGTGTTAGGAGCAATGGCGCTAAAGGAGGTCTT
TGTTTGTTATGGAAAGTGAATGAGTTAGTCTCAGTTCAATCTTTTTCTAGCAACCACATTGATTGCAACATCTCATGGAATGACATTAGTTGGAGATTTACCAGAGTATA
TGGGTACCCTAAACATAGTAAAAAAGCCTTCACTTGGGAATTAATCCGCAACCTTAGAGAGCCATCGGATACTCCTTGGCTGTTAGGGGGAGATTGGAACGAAAGCCTGA
GCAATGAAGAAAAACAGGGTGAGCCTTTAAGAGATATAAATCTGATTGATGGCTTCAGGCAATGTCTTGGCGATTGTAATCTTAGAGATATCAGACCAAATGGGGAACTG
TTCACATGGCCAATTGAAATTTGGATGGGAAACAGAGGACAGAGAAATAACAAAAGTCATGATATCCAATTTAAATTCGAGGAGCTTTGGACAAAATATGAGGAATGTGC
AGACATAATAGCAACCAATAGTGATTGGACAGGAAAAAAATCTAATGAGATGTTGGGTATAACTGATTCAAACAGGCTGTGGACTGAAGATCATCTAGAGATTGAACAAA
CTTTTACTGTCTATTTTGAAAATTTGTTTAGTACTTCAAATCCAGATCCTTCCTCCATAATTCTAGCATTAGATGGAATTATTCTTAAAAGCTTGTACTACCCTGATTCT
AATATTCTTGAGCCTGAGCTGGGTAGGCTTCCTTCCTACCTTTGGAAAAGTATGCTTTGGGGTAGAGAACTCTTACTCAAAGGGCTGAGATATAAAATAGGTAATGGGGA
GGAGACTTTCATGTTTAGGGATCCCTGGATTCCCAAGTCCAGATTGAGAAGTAATGTTACTTGGACACCTCCTCCCGATGGATTCTGGAACTTAAACTCAGATGCTGCAT
GTCTTCCCTCTTCCCCAACTTTTGGCTTGGGCTCGATTTGCAGGAATATGCGCGGTGATCTTATGGCTGCTTCTTCTTGCTATTTTGATTTTTGCATGAATCCCCTTCGG
GCCGAACTAAAAGCCATTTCTGAGGTAATGATGCTTGCCTCCTCTTTGGGATGTTCGAAGTTGAAGGTTGAATCAGATAATCAAATGGCCATAAATTTCATTATGAGAAA
ATCAGATGTGTGGAGTGATGTGGAAGCCGAGGTGGAAAAAATATGGGATTTAACCTCTTCTTTTGAAAACCTCGATTTTTCTTACATTTCAAGGAGCTGCAATAATGTTG
CTGACTCATTAGCTAAATTTGCTCGGTCCTTAAAGGTCAATATGTCCTGGGTTGGTCAATTCCCACCCTTGAGGTTAGCTCTACATGCACGTATGACAGAGATGACTCTA
CTACGCATTTCGAGATTACTCTATGCACAACTTGACAGAGATTGCTCTGCTGAGCATATTGAGGTTGCTCTTGAGTTTCAACCAAAGTTGATTGAACGGAGTAAGTTAGC
TAAGGTCACGTCTTCTTCAGCTTCTACAAATTCACTGTTGGTGTCACGTGAAGGTCAGGTGATTTTGGACCACACAGAGGGACAAGGAGCTGAGGAGGACAATCGGGCAG
AGGTAGGACCAAAAGCCCGACCCAGAGGAAGACCGGACCAAAGGGTTGGGCCAAAATGGCCCGACCCATATGGTCGGCCTCGGCAAAAGGCCGAGGCCGACCATTCGGCC
CGTTTGCGCGGGCCGAGCCCGGTGACCTCTCTTCGGTCCCTGATGCCCCGAATCGCCCCGGTTCCGCCTGCTTCTCCTCGGTTTTCTGACTTAGGCATCGGAGGCGGTGT
GGCCTACACCACGCCGGTGTGCAGCGATTTTTGCTGGTCTTGCAGTTGGCGTCGTCTGTGGGGAAGAGTGTTTGCCAGTTCAGACCACGCATCGGTGGAGTTCTATAGGA
TGAATCGGGTTCAGAAACTAGGACACCTCGGGTATGAGCAGAAAGCGCCGAATCAGACTGAGGTCGGCCTCGGTATGGAAAAGGCCGACCCTAACTCCGGAGGCTGTTGC
GAGCAGAAAGCGCCGAATCAGATTGAGGTCAGCCTCGGTATGGAAAAGGCCGACCCTAGCTCCGGAGGCTGTTGCGAGGTTCCGCATGTCAGCGATGCTGCTGGACTTGA
CCCAAGGGGACAACCTGCGGAGGAACTGGAATCTGTACCACTAACGTCTGAGGAAAGGAGAGTGAACATCGGCACCAAATTGGGGGCTGAGCAGAGGGGCAGACTGGTTG
GTTTCTTGAGGGCTAATGCAGATCTGTTTGCATGGTCACACGAGGATATGCCGGGAATTGACCCGGGAGTGATGGTTCACCAGTTGAACGTGGATAGGGATTTCAAGCCA
GTAAAGCAGAGGCGAAGAACATTTAATTCGGAGAGGAATGAGGTTGTTGCCAGTCAATGGATGAGCTACTTAAAGCAGGGTTCATCCGAGAAGTTCATTATCCCCAATGG
TTGTCCAATGTGGTACTGGTAA
Protein sequenceShow/hide protein sequence
MGKTWPAKQGKLCTGVVLATPTPMLNYISSPEFKYSKAGLRCFIMRVSFSFGYGRLQTSDDQLQPSEGCGDASNPMEKCWAFEGGARSWPLDGRPRPWHEADQLLCVVCL
LVFHPKGVCTQLGTWGVVCTLFWGKAWGGLGSFRASLRASWSSQRGAYTLLGKTWGGLGSSRASLQASWSSQGCVHSSGENLGRPRKFLGLSSSILVILGVRTLFWGKAW
GGLGSSRVSLQASWSSQGGAYTLLGKSLGRPRKFSGLSSSIVVIPGVRTLFWGKAWGGLGSSRASLQASWSSQGCVHSSREKLGVSSRGCVHSVEVLESHLEGATLSRGL
GISSRGCVHSVEVLEFHLDGAYTRSRALKDPTSARDEYSKANKMEANALILDSSATVNPLIQVNFVTHIHDRPSVKNSNVRALWCDGEGIALEIDSGPMGLRRLVSSLGF
YHSDFLRARSKFTHNRKNILLSDLREQNLCCSKAGSTTATLQLVSITATSQLEPNSGEVGQYLIGKLITNWFISKTAMKNSMEGAWKTRRDFKVDIIGTNIYLFIFESKK
DRDWIATNDPWIINLPIGFRNENIVKKIGNGLGGFLEQDNKRNYAQLGNSIRIRVQIDISKPLRRGFMVKIPGSCEECWVTIRYERIPEFCFFCGKIGHAIKDCLKPKDI
TEINKEDLEFGMWVKFQGFNGRTKSSGSPKRNSDDPNLQDPTEEENYRNEARNSESLDVDLNIISPVAEDPEETIRREGKQVVFTEYEESRSHVNDEDNQDVLQLMEKNQ
VWIIPSGFRWKGMKIKWIRVHQRKEEVGKGENDRIILTALLKKSQNCAKERGLGNPRAFRAIKNLVLSRNPQVFFICKTKCDERVTERIKNACKFDGCLSVRSNGAKGGL
CLLWKVNELVSVQSFSSNHIDCNISWNDISWRFTRVYGYPKHSKKAFTWELIRNLREPSDTPWLLGGDWNESLSNEEKQGEPLRDINLIDGFRQCLGDCNLRDIRPNGEL
FTWPIEIWMGNRGQRNNKSHDIQFKFEELWTKYEECADIIATNSDWTGKKSNEMLGITDSNRLWTEDHLEIEQTFTVYFENLFSTSNPDPSSIILALDGIILKSLYYPDS
NILEPELGRLPSYLWKSMLWGRELLLKGLRYKIGNGEETFMFRDPWIPKSRLRSNVTWTPPPDGFWNLNSDAACLPSSPTFGLGSICRNMRGDLMAASSCYFDFCMNPLR
AELKAISEVMMLASSLGCSKLKVESDNQMAINFIMRKSDVWSDVEAEVEKIWDLTSSFENLDFSYISRSCNNVADSLAKFARSLKVNMSWVGQFPPLRLALHARMTEMTL
LRISRLLYAQLDRDCSAEHIEVALEFQPKLIERSKLAKVTSSSASTNSLLVSREGQVILDHTEGQGAEEDNRAEVGPKARPRGRPDQRVGPKWPDPYGRPRQKAEADHSA
RLRGPSPVTSLRSLMPRIAPVPPASPRFSDLGIGGGVAYTTPVCSDFCWSCSWRRLWGRVFASSDHASVEFYRMNRVQKLGHLGYEQKAPNQTEVGLGMEKADPNSGGCC
EQKAPNQIEVSLGMEKADPSSGGCCEVPHVSDAAGLDPRGQPAEELESVPLTSEERRVNIGTKLGAEQRGRLVGFLRANADLFAWSHEDMPGIDPGVMVHQLNVDRDFKP
VKQRRRTFNSERNEVVASQWMSYLKQGSSEKFIIPNGCPMWYW