; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0030350 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0030350
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr8:46545628..46550821
RNA-Seq ExpressionLag0030350
SyntenyLag0030350
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0016491 - oxidoreductase activity (molecular function)
InterPro domainsIPR002937 - Amine oxidase
IPR012337 - Ribonuclease H-like superfamily
IPR036188 - FAD/NAD(P)-binding domain superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]8.1e-5932.09Show/hide
Query:  NVVSSSFGHPLSTVLTVKLDENNYLLWRGMVLAILRGQKRG--LEGPGGSLRRYQQSSHKSTKRNSS---------------------------------
        +   S+  + L + ++VKLD NNY LW+ +VL ++RG K    + G  G    +  SS  S  +NS+                                 
Subjt:  NVVSSSFGHPLSTVLTVKLDENNYLLWRGMVLAILRGQKRG--LEGPGGSLRRYQQSSHKSTKRNSS---------------------------------

Query:  ---------------------------KHQEGATKMVEYLTVMKQASDNLQLAGNPICLGDLISYVLAGLDPEYIPIVCSINDKDVKTWQELSSILITFE
                                     ++G  KM +YL  MK   D L+LAGNP+   DLI   L GLD EY P+V  ++D+   +W +L + L+TFE
Subjt:  ---------------------------KHQEGATKMVEYLTVMKQASDNLQLAGNPICLGDLISYVLAGLDPEYIPIVCSINDKDVKTWQELSSILITFE

Query:  GTLSRYSSNATNTVLPDLSVQLALNRPSRYVEQQKPYSPSRNSGSNQGNSQSSYSRG-SGNRGRGRGGRNNYQRNNRPTCQLCGKFGHSAPACYFRFEED
          + +  +N TN         L LN  +    +    S  R   SN  N + S SRG  G RGRG+ G+N         CQ+CG   H A  C+ RF++ 
Subjt:  GTLSRYSSNATNTVLPDLSVQLALNRPSRYVEQQKPYSPSRNSGSNQGNSQSSYSRG-SGNRGRGRGGRNNYQRNNRPTCQLCGKFGHSAPACYFRFEED

Query:  FNNPHASGNNSNQGGSSSNNPTSAYIATPEIVNDPRWLADSGATTHVTAN-----------------AGNLGVKPSIMVR-KSHCWGWN------QASIK
        ++  + S  +  QG        +A++A+   V D  W  DSGA+ HVT                    GN G K +I+    S     N        +I 
Subjt:  FNNPHASGNNSNQGGSSSNNPTSAYIATPEIVNDPRWLADSGATTHVTAN-----------------AGNLGVKPSIMVR-KSHCWGWN------QASIK

Query:  RNLISIARLTADNNVYVEFHSNGCFVKDKASRRVMLHGTLKNGLYQLELPSIQKSTTEVSSSTSHVA---AVSKIVNSVIKSCNLNASVNENSMFCDACQ
        +NL+S+++L ADNN+ VEF  N CFVKDK + +V+L G LK+GLYQL       S       + H       +K+++ V++SC +    ++N  FC+ACQ
Subjt:  RNLISIARLTADNNVYVEFHSNGCFVKDKASRRVMLHGTLKNGLYQLELPSIQKSTTEVSSSTSHVA---AVSKIVNSVIKSCNLNASVNENSMFCDACQ

Query:  LGKSHRLPFTRSVTKSTHPLALVHCDLWGPSPVVSTSGYRFYISFVDDYTRLTYIFPLKLK
         GK H LPF  S + +  PL LVH D+WGP+P++++SG+++Y+ FVDD++R T+I+PLK K
Subjt:  LGKSHRLPFTRSVTKSTHPLALVHCDLWGPSPVVSTSGYRFYISFVDDYTRLTYIFPLKLK

KAF8393253.1 hypothetical protein HHK36_021494 [Tetracentron sinense]1.2e-5735.29Show/hide
Query:  QEGATKMVEYLTVMKQASDNLQLAGNPICLGDLISYVLAGLDPEYIPIVCSINDKDVK-TWQELSSILITFEGTLSRYSSNATNTVLPDLSVQLALNRPS
        ++ +  M++Y+  +K  SD+L   G P+   D I  +L GL  +Y  +V SI+ +D K +   + S+L++FE  L + +S+   + +     Q       
Subjt:  QEGATKMVEYLTVMKQASDNLQLAGNPICLGDLISYVLAGLDPEYIPIVCSINDKDVK-TWQELSSILITFEGTLSRYSSNATNTVLPDLSVQLALNRPS

Query:  RYVEQQKPYSPSRNSGSNQGNSQSSYSRGSGNRGR-GRGGRNNYQRNNRPTCQLCGKFGHSAPACYFRFEEDFNNPHASGNNSNQGGSSSNNPTSAYIAT
         Y +  + Y  S    SNQ N+Q+ + R  G  GR G+ GRNN   NNRP CQLCGKFGH+  +CY RF+  F    ++    NQG        SA +AT
Subjt:  RYVEQQKPYSPSRNSGSNQGNSQSSYSRGSGNRGR-GRGGRNNYQRNNRPTCQLCGKFGHSAPACYFRFEEDFNNPHASGNNSNQGGSSSNNPTSAYIAT

Query:  PEIVNDPRWLADSGATTHVTANAGNL----------------GVKPSIMVRKSHCWGWNQAS-----------IKRNLISIARLTADNNVYVEFHSNGCF
        P  V D  W  DSGAT H+T ++ NL                G + SI    S     N  S           +  NLIS+A+  ADNN  +EFH N  F
Subjt:  PEIVNDPRWLADSGATTHVTANAGNL----------------GVKPSIMVRKSHCWGWNQAS-----------IKRNLISIARLTADNNVYVEFHSNGCF

Query:  VKDKASRRVMLHGTLKNGLYQLELPSIQK---------STTEVSSSTSHVAAV-----------SKIVNSVIKSCNLNASVNENSMFCDACQLGKSHRLP
        VKD+ ++RV+  G L+NGLY+  + S +          + T VS+  SHV  +           S+IV+ ++K CN++   N     C +CQ  KSHRLP
Subjt:  VKDKASRRVMLHGTLKNGLYQLELPSIQK---------STTEVSSSTSHVAAV-----------SKIVNSVIKSCNLNASVNENSMFCDACQLGKSHRLP

Query:  FTRSVTKSTHPLALVHCDLWGPSPVVSTSGYRFYISFVDDYT
        +  S +++  PL LV+ D+WGP+P++STSG R++I FVDD T
Subjt:  FTRSVTKSTHPLALVHCDLWGPSPVVSTSGYRFYISFVDDYT

KYP50444.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]1.4e-6336.16Show/hide
Query:  KRNSSKHQEGATKMVEYLTVMKQASDNLQLAGNPICLGDLISYVLAGLDPEYIPIVCSINDKDVKTWQELSSILITFEGTLSRYSSNATNTVLPDLSVQL
        K    + ++G  KM EYLT MK+ +D+L LAG+ +   DL++  LAGLD EY PIV  ++DK+  TW E+ + L+T+E  L + ++ +  T+ P  ++  
Subjt:  KRNSSKHQEGATKMVEYLTVMKQASDNLQLAGNPICLGDLISYVLAGLDPEYIPIVCSINDKDVKTWQELSSILITFEGTLSRYSSNATNTVLPDLSVQL

Query:  ALNRPSRYVEQQKPYSPSRNSGSNQGNSQSSYSRGS-GNRGRGRGGRNNYQRNNRPTCQLCGKFGHSAPACYFRFEEDFNNPHASGNNSNQGGSSSNNPT
         L            Y+    S +  G      +RG+ G RGRGR  +      +R  CQ+C K GH+A  CY RF +++   ++    S +    + N  
Subjt:  ALNRPSRYVEQQKPYSPSRNSGSNQGNSQSSYSRGS-GNRGRGRGGRNNYQRNNRPTCQLCGKFGHSAPACYFRFEEDFNNPHASGNNSNQGGSSSNNPT

Query:  SAYIATPEIVNDPRWLADSGATTHVTANAGNL----------------GVKPSIMVRKSHCWGWNQAS-----------IKRNLISIARLTADNNVYVEF
        +AY+A+P  V D  W  DSGA+ HVT +   +                G    I+          Q S           I +NL+SI++LT DN++YVEF
Subjt:  SAYIATPEIVNDPRWLADSGATTHVTANAGNL----------------GVKPSIMVRKSHCWGWNQAS-----------IKRNLISIARLTADNNVYVEF

Query:  HSNGCFVKDKASRRVMLHGTLKNGLYQLELPSIQKSTTEVSSSTSHVAAV---------SKIVNSVIKSCNLNASVNENSMFCDACQLGKSHRLPFTRSV
        H   CFVKDK + R++L G +K+GLYQ  LP    ST +       +            SK++N V+K CN+ AS  EN  FC+ACQ GK+H LPF  SV
Subjt:  HSNGCFVKDKASRRVMLHGTLKNGLYQLELPSIQKSTTEVSSSTSHVAAV---------SKIVNSVIKSCNLNASVNENSMFCDACQLGKSHRLPFTRSV

Query:  TKSTHPLALVHCDLWGPSPVVSTSGYRFYISFVDDYTRLTYIFPLKLK
        + +  PL LVH D+WGP+P+ S SG+++Y+ F+DD++R T+I+PLK K
Subjt:  TKSTHPLALVHCDLWGPSPVVSTSGYRFYISFVDDYTRLTYIFPLKLK

PNX76291.1 gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium pratense]3.3e-6031.97Show/hide
Query:  TNVVSSSFGHPLSTVLTVKLDENNYLLWRGMVLAILRGQKRGLEG--------------PGGSLRRYQ---------------------------QSSHK
        ++  +S+  + L + ++VKLD +NY LW+ MVL I+RG +  L+G                 S +++                            Q  H 
Subjt:  TNVVSSSFGHPLSTVLTVKLDENNYLLWRGMVLAILRGQKRGLEG--------------PGGSLRRYQ---------------------------QSSHK

Query:  ST-----------------------KRNSSKHQEGATKMVEYLTVMKQASDNLQLAGNPICLGDLISYVLAGLDPEYIPIVCSINDKDVKTWQELSSILI
         T                       K      ++G  KM +YL  MK  +D L+LAGNPI   DLI   L GLD EY P+V  ++D+   +W +L + L+
Subjt:  ST-----------------------KRNSSKHQEGATKMVEYLTVMKQASDNLQLAGNPICLGDLISYVLAGLDPEYIPIVCSINDKDVKTWQELSSILI

Query:  TFEGTLSRYSSNATNTVLPDLSVQLALNRPSRYVEQQKPYSPSRNSGSNQGNSQSSYSRGSGNRGRGRGGRNNYQRNNRPTCQLCGKFGHSAPACYFRFE
        TFE  + + +S      L +L++    N     V ++  +  +R + +N     ++  RGS  RG  RGGR    R+ + TCQ+CG   H A  C++RF+
Subjt:  TFEGTLSRYSSNATNTVLPDLSVQLALNRPSRYVEQQKPYSPSRNSGSNQGNSQSSYSRGSGNRGRGRGGRNNYQRNNRPTCQLCGKFGHSAPACYFRFE

Query:  EDFNNPHASGNNSNQGGSSSNNPTSAYIATPEIVNDPRWLADSGATTHVT------ANAGNLGVKPSIMV---RKSHCWGWNQASIK-------------
        + ++  + S NN  QG        +A++A+   + D  W  DSGA+ HVT       N      K S++V    K        + +K             
Subjt:  EDFNNPHASGNNSNQGGSSSNNPTSAYIATPEIVNDPRWLADSGATTHVT------ANAGNLGVKPSIMV---RKSHCWGWNQASIK-------------

Query:  -RNLISIARLTADNNVYVEFHSNGCFVKDKASRRVMLHGTLKNGLYQLELPSIQKSTTEVSSSTSHVAAV----SKIVNSVIKSCNLNASVNENSMFCDA
         +NL+S+++L ADNN+ VEF  N CFVKDK + + +L G LK+GLYQL   S + S+  VS   S    +    +K+++ V+KSCN+  S ++   FC+A
Subjt:  -RNLISIARLTADNNVYVEFHSNGCFVKDKASRRVMLHGTLKNGLYQLELPSIQKSTTEVSSSTSHVAAV----SKIVNSVIKSCNLNASVNENSMFCDA

Query:  CQLGKSHRLPFTRSVTKSTHPLALVHCDLWGPSPVVSTSGYRFYISFVDDYTRLTYIFPLKLK
        CQ GK H LPF  S + +   L LVH D+WGP+P++S+SG+++Y+ F+DD+TR T+I+PLK K
Subjt:  CQLGKSHRLPFTRSVTKSTHPLALVHCDLWGPSPVVSTSGYRFYISFVDDYTRLTYIFPLKLK

PNX94503.1 putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense]5.3e-5834.82Show/hide
Query:  KSTKRNSSKHQEGATKMVEYLTVMKQASDNLQLAGNPICLGDLISYVLAGLDPEYIPIVCSINDKDVKTWQELSSILITFEGTLSRYSSNATNTVLPDLS
        KS   N+ K +    KM +YL  MK  +D L+LAG+PI   DL+   L GLD EY P+V  ++D+   +W +  + L+ FE  L +  +N  N       
Subjt:  KSTKRNSSKHQEGATKMVEYLTVMKQASDNLQLAGNPICLGDLISYVLAGLDPEYIPIVCSINDKDVKTWQELSSILITFEGTLSRYSSNATNTVLPDLS

Query:  VQLALNRPSRYVEQQKPYSPSRNSGSNQGNSQSSYSRGSGNRG-RGRGGRNNYQRNNRPTCQLCGKFGHSAPACYFRFEEDF--NNPHASGNNSNQGGSS
          + LN  + +  + +       SG N+  S+  + RGS +RG RG  GR    +  RP CQ+CGKFGH+A  CY+RF++ +   N +A G  S+     
Subjt:  VQLALNRPSRYVEQQKPYSPSRNSGSNQGNSQSSYSRGSGNRG-RGRGGRNNYQRNNRPTCQLCGKFGHSAPACYFRFEEDF--NNPHASGNNSNQGGSS

Query:  SNNPTSAYIATPEIVNDPRWLADSGATTHVTANAGNL--------------GVKPSIMVRKSHCWGWNQASIK---------RNLISIARLTADNNVYVE
             SA++A+P    D  W  DSGA+ HVT  +G L              G    + +  S     N  +++         +NL+S+++LT DNN  VE
Subjt:  SNNPTSAYIATPEIVNDPRWLADSGATTHVTANAGNL--------------GVKPSIMVRKSHCWGWNQASIK---------RNLISIARLTADNNVYVE

Query:  FHSNGCFVKDKASRRVMLHGTLKNGLYQL----ELPSIQKSTTEVSSSTSHVAAV----SKIVNSVIKSCNLNASVNENSMFCDACQLGKSHRLPFTRSV
        F  N C+VKDK + + +L G LK+GLYQL    E P+ +     +S        +    +K++  V+K  N+  S ++   FC+ACQ GK H LPF  S 
Subjt:  FHSNGCFVKDKASRRVMLHGTLKNGLYQL----ELPSIQKSTTEVSSSTSHVAAV----SKIVNSVIKSCNLNASVNENSMFCDACQLGKSHRLPFTRSV

Query:  TKSTHPLALVHCDLWGPSPVVSTSGYRFYISFVDDYTRLTYIFPLKLK
        + +  PL L+H D+WGP+P++S S +++Y+ F+DD++R T+IFPLK K
Subjt:  TKSTHPLALVHCDLWGPSPVVSTSGYRFYISFVDDYTRLTYIFPLKLK

TrEMBL top hitse value%identityAlignment
A0A151S6M8 Retrovirus-related Pol polyprotein from transposon TNT 1-946.9e-6436.16Show/hide
Query:  KRNSSKHQEGATKMVEYLTVMKQASDNLQLAGNPICLGDLISYVLAGLDPEYIPIVCSINDKDVKTWQELSSILITFEGTLSRYSSNATNTVLPDLSVQL
        K    + ++G  KM EYLT MK+ +D+L LAG+ +   DL++  LAGLD EY PIV  ++DK+  TW E+ + L+T+E  L + ++ +  T+ P  ++  
Subjt:  KRNSSKHQEGATKMVEYLTVMKQASDNLQLAGNPICLGDLISYVLAGLDPEYIPIVCSINDKDVKTWQELSSILITFEGTLSRYSSNATNTVLPDLSVQL

Query:  ALNRPSRYVEQQKPYSPSRNSGSNQGNSQSSYSRGS-GNRGRGRGGRNNYQRNNRPTCQLCGKFGHSAPACYFRFEEDFNNPHASGNNSNQGGSSSNNPT
         L            Y+    S +  G      +RG+ G RGRGR  +      +R  CQ+C K GH+A  CY RF +++   ++    S +    + N  
Subjt:  ALNRPSRYVEQQKPYSPSRNSGSNQGNSQSSYSRGS-GNRGRGRGGRNNYQRNNRPTCQLCGKFGHSAPACYFRFEEDFNNPHASGNNSNQGGSSSNNPT

Query:  SAYIATPEIVNDPRWLADSGATTHVTANAGNL----------------GVKPSIMVRKSHCWGWNQAS-----------IKRNLISIARLTADNNVYVEF
        +AY+A+P  V D  W  DSGA+ HVT +   +                G    I+          Q S           I +NL+SI++LT DN++YVEF
Subjt:  SAYIATPEIVNDPRWLADSGATTHVTANAGNL----------------GVKPSIMVRKSHCWGWNQAS-----------IKRNLISIARLTADNNVYVEF

Query:  HSNGCFVKDKASRRVMLHGTLKNGLYQLELPSIQKSTTEVSSSTSHVAAV---------SKIVNSVIKSCNLNASVNENSMFCDACQLGKSHRLPFTRSV
        H   CFVKDK + R++L G +K+GLYQ  LP    ST +       +            SK++N V+K CN+ AS  EN  FC+ACQ GK+H LPF  SV
Subjt:  HSNGCFVKDKASRRVMLHGTLKNGLYQLELPSIQKSTTEVSSSTSHVAAV---------SKIVNSVIKSCNLNASVNENSMFCDACQLGKSHRLPFTRSV

Query:  TKSTHPLALVHCDLWGPSPVVSTSGYRFYISFVDDYTRLTYIFPLKLK
        + +  PL LVH D+WGP+P+ S SG+++Y+ F+DD++R T+I+PLK K
Subjt:  TKSTHPLALVHCDLWGPSPVVSTSGYRFYISFVDDYTRLTYIFPLKLK

A0A2K3LCM1 Gag/pol polyprotein-maize retrotransposon Hopscotch (Fragment)1.6e-6031.97Show/hide
Query:  TNVVSSSFGHPLSTVLTVKLDENNYLLWRGMVLAILRGQKRGLEG--------------PGGSLRRYQ---------------------------QSSHK
        ++  +S+  + L + ++VKLD +NY LW+ MVL I+RG +  L+G                 S +++                            Q  H 
Subjt:  TNVVSSSFGHPLSTVLTVKLDENNYLLWRGMVLAILRGQKRGLEG--------------PGGSLRRYQ---------------------------QSSHK

Query:  ST-----------------------KRNSSKHQEGATKMVEYLTVMKQASDNLQLAGNPICLGDLISYVLAGLDPEYIPIVCSINDKDVKTWQELSSILI
         T                       K      ++G  KM +YL  MK  +D L+LAGNPI   DLI   L GLD EY P+V  ++D+   +W +L + L+
Subjt:  ST-----------------------KRNSSKHQEGATKMVEYLTVMKQASDNLQLAGNPICLGDLISYVLAGLDPEYIPIVCSINDKDVKTWQELSSILI

Query:  TFEGTLSRYSSNATNTVLPDLSVQLALNRPSRYVEQQKPYSPSRNSGSNQGNSQSSYSRGSGNRGRGRGGRNNYQRNNRPTCQLCGKFGHSAPACYFRFE
        TFE  + + +S      L +L++    N     V ++  +  +R + +N     ++  RGS  RG  RGGR    R+ + TCQ+CG   H A  C++RF+
Subjt:  TFEGTLSRYSSNATNTVLPDLSVQLALNRPSRYVEQQKPYSPSRNSGSNQGNSQSSYSRGSGNRGRGRGGRNNYQRNNRPTCQLCGKFGHSAPACYFRFE

Query:  EDFNNPHASGNNSNQGGSSSNNPTSAYIATPEIVNDPRWLADSGATTHVT------ANAGNLGVKPSIMV---RKSHCWGWNQASIK-------------
        + ++  + S NN  QG        +A++A+   + D  W  DSGA+ HVT       N      K S++V    K        + +K             
Subjt:  EDFNNPHASGNNSNQGGSSSNNPTSAYIATPEIVNDPRWLADSGATTHVT------ANAGNLGVKPSIMV---RKSHCWGWNQASIK-------------

Query:  -RNLISIARLTADNNVYVEFHSNGCFVKDKASRRVMLHGTLKNGLYQLELPSIQKSTTEVSSSTSHVAAV----SKIVNSVIKSCNLNASVNENSMFCDA
         +NL+S+++L ADNN+ VEF  N CFVKDK + + +L G LK+GLYQL   S + S+  VS   S    +    +K+++ V+KSCN+  S ++   FC+A
Subjt:  -RNLISIARLTADNNVYVEFHSNGCFVKDKASRRVMLHGTLKNGLYQLELPSIQKSTTEVSSSTSHVAAV----SKIVNSVIKSCNLNASVNENSMFCDA

Query:  CQLGKSHRLPFTRSVTKSTHPLALVHCDLWGPSPVVSTSGYRFYISFVDDYTRLTYIFPLKLK
        CQ GK H LPF  S + +   L LVH D+WGP+P++S+SG+++Y+ F+DD+TR T+I+PLK K
Subjt:  CQLGKSHRLPFTRSVTKSTHPLALVHCDLWGPSPVVSTSGYRFYISFVDDYTRLTYIFPLKLK

A0A2Z6MBG6 Integrase catalytic domain-containing protein3.9e-5932.09Show/hide
Query:  NVVSSSFGHPLSTVLTVKLDENNYLLWRGMVLAILRGQKRG--LEGPGGSLRRYQQSSHKSTKRNSS---------------------------------
        +   S+  + L + ++VKLD NNY LW+ +VL ++RG K    + G  G    +  SS  S  +NS+                                 
Subjt:  NVVSSSFGHPLSTVLTVKLDENNYLLWRGMVLAILRGQKRG--LEGPGGSLRRYQQSSHKSTKRNSS---------------------------------

Query:  ---------------------------KHQEGATKMVEYLTVMKQASDNLQLAGNPICLGDLISYVLAGLDPEYIPIVCSINDKDVKTWQELSSILITFE
                                     ++G  KM +YL  MK   D L+LAGNP+   DLI   L GLD EY P+V  ++D+   +W +L + L+TFE
Subjt:  ---------------------------KHQEGATKMVEYLTVMKQASDNLQLAGNPICLGDLISYVLAGLDPEYIPIVCSINDKDVKTWQELSSILITFE

Query:  GTLSRYSSNATNTVLPDLSVQLALNRPSRYVEQQKPYSPSRNSGSNQGNSQSSYSRG-SGNRGRGRGGRNNYQRNNRPTCQLCGKFGHSAPACYFRFEED
          + +  +N TN         L LN  +    +    S  R   SN  N + S SRG  G RGRG+ G+N         CQ+CG   H A  C+ RF++ 
Subjt:  GTLSRYSSNATNTVLPDLSVQLALNRPSRYVEQQKPYSPSRNSGSNQGNSQSSYSRG-SGNRGRGRGGRNNYQRNNRPTCQLCGKFGHSAPACYFRFEED

Query:  FNNPHASGNNSNQGGSSSNNPTSAYIATPEIVNDPRWLADSGATTHVTAN-----------------AGNLGVKPSIMVR-KSHCWGWN------QASIK
        ++  + S  +  QG        +A++A+   V D  W  DSGA+ HVT                    GN G K +I+    S     N        +I 
Subjt:  FNNPHASGNNSNQGGSSSNNPTSAYIATPEIVNDPRWLADSGATTHVTAN-----------------AGNLGVKPSIMVR-KSHCWGWN------QASIK

Query:  RNLISIARLTADNNVYVEFHSNGCFVKDKASRRVMLHGTLKNGLYQLELPSIQKSTTEVSSSTSHVA---AVSKIVNSVIKSCNLNASVNENSMFCDACQ
        +NL+S+++L ADNN+ VEF  N CFVKDK + +V+L G LK+GLYQL       S       + H       +K+++ V++SC +    ++N  FC+ACQ
Subjt:  RNLISIARLTADNNVYVEFHSNGCFVKDKASRRVMLHGTLKNGLYQLELPSIQKSTTEVSSSTSHVA---AVSKIVNSVIKSCNLNASVNENSMFCDACQ

Query:  LGKSHRLPFTRSVTKSTHPLALVHCDLWGPSPVVSTSGYRFYISFVDDYTRLTYIFPLKLK
         GK H LPF  S + +  PL LVH D+WGP+P++++SG+++Y+ FVDD++R T+I+PLK K
Subjt:  LGKSHRLPFTRSVTKSTHPLALVHCDLWGPSPVVSTSGYRFYISFVDDYTRLTYIFPLKLK

A0A803PEH4 Uncharacterized protein2.5e-6632.21Show/hide
Query:  SSTAAAVVLSSAITPSTNVVSSSFGHP-LSTVLTVKLDENNYLLWRGMVLAILRGQK-------------------------------------------
        ++++ A   SS  T   + + ++F  P L+   ++KLD NNY LW+ MV  I+RG +                                           
Subjt:  SSTAAAVVLSSAITPSTNVVSSSFGHP-LSTVLTVKLDENNYLLWRGMVLAILRGQK-------------------------------------------

Query:  ---------------------RGLEGPGGSLRRYQQSSHKSTKRNSSKHQEGATKMVEYLTVMKQASDNLQLAGNPICLGDLISYVLAGLDPEYIPIVCS
                             R LE   G+   Y +S    T+      ++G+T M EYL   K  S+ L LAG+P     L++ VL GLD EY+ IV  
Subjt:  ---------------------RGLEGPGGSLRRYQQSSHKSTKRNSSKHQEGATKMVEYLTVMKQASDNLQLAGNPICLGDLISYVLAGLDPEYIPIVCS

Query:  INDKDVKTWQELSSILITFEGTLSR-----YSSNATNTVLPDLSVQLALNRPSRYVEQQKPYSPSRNSGSNQGNSQSSYSRGSGNRGRGRGGRNNYQRNN
        I  +   TWQEL  +L++F+  + R      +SN   +  P  ++    N   R    Q     S+N+ +N G   S+ SRG+ NR RGRG        +
Subjt:  INDKDVKTWQELSSILITFEGTLSR-----YSSNATNTVLPDLSVQLALNRPSRYVEQQKPYSPSRNSGSNQGNSQSSYSRGSGNRGRGRGGRNNYQRNN

Query:  RPTCQLCGKFGHSAPACYFRFEEDF-----NNPHASGNNSNQGGSSSNNPTSAYIATPEIVNDPRWLADSGATTHVTANAGNLGVKPSIMVRKSHCWGWN
        RPTCQ+ GK+GH+A  CY RF+E +     NNPH    N N+ G ++NN  SA++ATPE++    W ADSGA+ H+T++  NL  K     ++S   G  
Subjt:  RPTCQLCGKFGHSAPACYFRFEEDF-----NNPHASGNNSNQGGSSSNNPTSAYIATPEIVNDPRWLADSGATTHVTANAGNLGVKPSIMVRKSHCWGWN

Query:  Q----------------------------ASIKRNLISIARLTADNNVYVEFHSNGCFVKDKASRRVMLHGTLKNGLYQLELPSIQKS------------
                                       I +NL+S+++L  DNNV +EF+SN C VKDK +++V+LHG LK+ LYQL+ P  + S            
Subjt:  Q----------------------------ASIKRNLISIARLTADNNVYVEFHSNGCFVKDKASRRVMLHGTLKNGLYQLELPSIQKS------------

Query:  TTEVSSSTSHVAAVS------------------KIVNSVIKSCNLNASVNENSMFCDACQLGKSHRLPFTRSVTKSTHPLALVHCDLWGPSPVVSTSGYR
        T  V S+ +     S                  K++N V++S N++ S N     CDACQ GK+H LPF  S T++   L L+H DLWGP+P+ S   + 
Subjt:  TTEVSSSTSHVAAVS------------------KIVNSVIKSCNLNASVNENSMFCDACQLGKSHRLPFTRSVTKSTHPLALVHCDLWGPSPVVSTSGYR

Query:  FYISFVDDYTRLTYIFPLKLK
        +YI FVDDY+R T+++PLKLK
Subjt:  FYISFVDDYTRLTYIFPLKLK

A0A803QCY3 Uncharacterized protein1.4e-6135.23Show/hide
Query:  YQQSSHKSTKRNSSKHQEGATKMVEYLTVMKQASDNLQLAGNPICLGDLISYVLAGLDPEYIPIVCSINDKDVKTWQELSSILITFEGTLSRYSSNATNT
        + +S    T+      ++G T M+EYL   K  +D+L LAG P     L + VL+ LD  Y+ +V  I  +   +WQEL  +L++FE  + R        
Subjt:  YQQSSHKSTKRNSSKHQEGATKMVEYLTVMKQASDNLQLAGNPICLGDLISYVLAGLDPEYIPIVCSINDKDVKTWQELSSILITFEGTLSRYSSNATNT

Query:  VLPDLSVQLALNRPSRYVEQQKPYSPSRNSGSNQGNSQSSYSRGSGNRGRGRGGRNNYQRNNRPTCQLCGKFGHSAPACYFRFEEDF--NNPHASGNNSN
                                   R + +++G     ++RG+G R RGRG  N    N++PTCQ+CGK+ HSA  CY  F++ +  ++PH+S  N N
Subjt:  VLPDLSVQLALNRPSRYVEQQKPYSPSRNSGSNQGNSQSSYSRGSGNRGRGRGGRNNYQRNNRPTCQLCGKFGHSAPACYFRFEEDF--NNPHASGNNSN

Query:  QGGSSSNNPTSAYIATPEIVNDPRWLADSGATTHVTANAGNLGVKPSIMVRKSHCWGWNQAS-----------------------------------IKR
        + G ++NNP SA+IATPE ++   W ADSGA+ ++TA+       PS++ +K    G  + +                                   I +
Subjt:  QGGSSSNNPTSAYIATPEIVNDPRWLADSGATTHVTANAGNLGVKPSIMVRKSHCWGWNQAS-----------------------------------IKR

Query:  NLISIARLTADNNVYVEFHSNGCFVKDKASRRVMLHGTLKNGLYQLELPSIQKSTTEVSSSTSHVAAVSKIVNSVIKSCNLNASVNENSMFCDACQLGKS
        N +S+++LT DN+V +EFHSN CFVKD A+RRV+L G LK+GLYQL+ P  + +    S+S             ++  CN   +V+    FCDACQ GKS
Subjt:  NLISIARLTADNNVYVEFHSNGCFVKDKASRRVMLHGTLKNGLYQLELPSIQKSTTEVSSSTSHVAAVSKIVNSVIKSCNLNASVNENSMFCDACQLGKS

Query:  HRLPFTRSVTKSTHPLALVHCDLWGPSPVVSTSGYRFYISFVDDYTRLTYIFPLKLK
        H LPF  S +K+   L LVH DLWGPSP+ S   +++Y+ FVDD TR T+I+PLK K
Subjt:  HRLPFTRSVTKSTHPLALVHCDLWGPSPVVSTSGYRFYISFVDDYTRLTYIFPLKLK

SwissProt top hitse value%identityAlignment
O24164 Protoporphyrinogen oxidase, mitochondrial8.5e-1975.81Show/hide
Query:  ASRKKVAVVGAGVSGLAAAYKLKSHGFNVTVLEADARAGGKLRSVSYNGLIWDEGANTMASA
        +S K+VAV+GAGVSGLAAAYKLK HG NVTV EA+ +AGGKLRSVS +GLIWDEGANTM  +
Subjt:  ASRKKVAVVGAGVSGLAAAYKLKSHGFNVTVLEADARAGGKLRSVSYNGLIWDEGANTMASA

Q8S9J1 Protoporphyrinogen oxidase 2, chloroplastic/mitochondrial2.9e-1981.97Show/hide
Query:  SRKKVAVVGAGVSGLAAAYKLKSHGFNVTVLEADARAGGKLRSVSYNGLIWDEGANTMASA
        S K+VAVVGAGVSGLAAAYKLKS G NVTV EAD R GGKLRSV  NGLIWDEGANTM  A
Subjt:  SRKKVAVVGAGVSGLAAAYKLKSHGFNVTVLEADARAGGKLRSVSYNGLIWDEGANTMASA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.2e-2627.02Show/hide
Query:  HKSTKRNSSKHQEGATKMV-EYLTVMKQASDNLQLAGNPICLGDLISYVLAGLDPEYIPIVCSINDKDV-KTWQELSSILITFEGTLSRYSSNATNTVLP
        H +  R   K     TK + +Y+  +    D L L G P+   + +  VL  L  EY P++  I  KD   T  E+   L+  E  +   SS    TV+P
Subjt:  HKSTKRNSSKHQEGATKMV-EYLTVMKQASDNLQLAGNPICLGDLISYVLAGLDPEYIPIVCSINDKDV-KTWQELSSILITFEGTLSRYSSNATNTVLP

Query:  DLSVQLALNRPSRYVEQQKPYSPSRNSGSNQGNSQSSYSRGSGNRGRGRGGRNNYQRNNRP-----TCQLCGKFGHSAPACYFRFEEDFNNPHASGNNSN
                   +  V  +   + + N+  N+ N   + +  + ++   +   N +  NN+       CQ+CG  GHSA  C         +  +S N+  
Subjt:  DLSVQLALNRPSRYVEQQKPYSPSRNSGSNQGNSQSSYSRGSGNRGRGRGGRNNYQRNNRP-----TCQLCGKFGHSAPACYFRFEEDFNNPHASGNNSN

Query:  QGGSSSNNPTSAYIATPEIVNDPRWLADSGATTHVTANAGNLGVK------PSIMVRK------SHCWGWNQA---------------SIKRNLISIARL
             +     A +A     +   WL DSGAT H+T++  NL +         +MV        SH    + +               +I +NLIS+ RL
Subjt:  QGGSSSNNPTSAYIATPEIVNDPRWLADSGATTHVTANAGNLGVK------PSIMVRK------SHCWGWNQA---------------SIKRNLISIARL

Query:  TADNNVYVEFHSNGCFVKDKASRRVMLHGTLKNGLYQLELPSIQKST--TEVSSSTSHVA-------AVSKIVNSVIKSCNLNA-SVNENSMFCDACQLG
           N V VEF      VKD  +   +L G  K+ LY+  + S Q  +     SS  +H +           I+NSVI + +L+  + +   + C  C + 
Subjt:  TADNNVYVEFHSNGCFVKDKASRRVMLHGTLKNGLYQLELPSIQKST--TEVSSSTSHVA-------AVSKIVNSVIKSCNLNA-SVNENSMFCDACQLG

Query:  KSHRLPFTRSVTKSTHPLALVHCDLWGPSPVVSTSGYRFYISFVDDYTRLTYIFPLKLK
        KS+++PF++S   ST PL  ++ D+W  SP++S   YR+Y+ FVD +TR T+++PLK K
Subjt:  KSHRLPFTRSVTKSTHPLALVHCDLWGPSPVVSTSGYRFYISFVDDYTRLTYIFPLKLK

Q94IG7 Protoporphyrinogen oxidase 24.2e-1874.6Show/hide
Query:  SRKKVAVVGAGVSGLAAAYKLKSHGFNVTVLEADARAGGKLRSVSYNGLIWDEGANTMASAYE
        S K+VAVVGAGVSGLAAAYKLKS+G NVT+ EAD+RAGGKL++V  +GLIWDEGANTM  + E
Subjt:  SRKKVAVVGAGVSGLAAAYKLKSHGFNVTVLEADARAGGKLRSVSYNGLIWDEGANTMASAYE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.1e-2529.04Show/hide
Query:  DNLQLAGNPICLGDLISYVLAGLDPEYIPIVCSINDKDV-KTWQELSSILITFEGTLSRYSSNATNTVLPDLSVQLALNRPSRYVEQQKPYSPSRNSGSN
        D L L G P+   + +  VL  L  +Y P++  I  KD   +  E+   LI  E  L   +S     V+P ++  +  +R +     Q     +RN  +N
Subjt:  DNLQLAGNPICLGDLISYVLAGLDPEYIPIVCSINDKDV-KTWQELSSILITFEGTLSRYSSNATNTVLPDLSVQLALNRPSRYVEQQKPYSPSRNSGSN

Query:  QGNSQSSYSRGSGNRGRGRGGRNNYQRNNRPTCQLCGKFGHSAPACYFRFEEDFNNPHASGNNSNQGGSSSNN---PTSAYIATPEIVNDPRWLADSGAT
           S S     SG+R   R  +    R     CQ+C   GHSA  C           H   + +NQ  S+S        A +A     N   WL DSGAT
Subjt:  QGNSQSSYSRGSGNRGRGRGGRNNYQRNNRPTCQLCGKFGHSAPACYFRFEEDFNNPHASGNNSNQGGSSSNN---PTSAYIATPEIVNDPRWLADSGAT

Query:  THVTANAGNLGVK------PSIMV-----------------RKSHCWGWNQA----SIKRNLISIARLTADNNVYVEFHSNGCFVKDKASRRVMLHGTLK
         H+T++  NL           +M+                   S     N+     +I +NLIS+ RL   N V VEF      VKD  +   +L G  K
Subjt:  THVTANAGNLGVK------PSIMV-----------------RKSHCWGWNQA----SIKRNLISIARLTADNNVYVEFHSNGCFVKDKASRRVMLHGTLK

Query:  NGLYQLELPSIQKSTTEVS--SSTSHVAAVSK-------IVNSVIKSCNLNA-SVNENSMFCDACQLGKSHRLPFTRSVTKSTHPLALVHCDLWGPSPVV
        + LY+  + S Q  +   S  S  +H +  S+       I+NSVI + +L   + +   + C  C + KSH++PF+ S   S+ PL  ++ D+W  SP++
Subjt:  NGLYQLELPSIQKSTTEVS--SSTSHVAAVSK-------IVNSVIKSCNLNA-SVNENSMFCDACQLGKSHRLPFTRSVTKSTHPLALVHCDLWGPSPVV

Query:  STSGYRFYISFVDDYTRLTYIFPLKLK
        S   YR+Y+ FVD +TR T+++PLK K
Subjt:  STSGYRFYISFVDDYTRLTYIFPLKLK

Arabidopsis top hitse value%identityAlignment
AT3G10390.1 Flavin containing amine oxidoreductase family protein1.2e-0454.76Show/hide
Query:  ASRKKVAVVGAGVSGLAAAYKLKSHGFNVTVLEADARAGGKL
        +S+  V +VGAG+SGLAAA +L   GF VTVLE   R GG++
Subjt:  ASRKKVAVVGAGVSGLAAAYKLKSHGFNVTVLEADARAGGKL

AT4G16310.1 LSD1-like 35.5e-0553.49Show/hide
Query:  IASRKKVAVVGAGVSGLAAAYKLKSHGFNVTVLEADARAGGKL
        +   KKV V+GAG +GL AA  L+  GF+VTVLEA +R GG++
Subjt:  IASRKKVAVVGAGVSGLAAAYKLKSHGFNVTVLEADARAGGKL

AT5G14220.1 Flavin containing amine oxidoreductase family2.1e-2081.97Show/hide
Query:  SRKKVAVVGAGVSGLAAAYKLKSHGFNVTVLEADARAGGKLRSVSYNGLIWDEGANTMASA
        S K+VAVVGAGVSGLAAAYKLKS G NVTV EAD R GGKLRSV  NGLIWDEGANTM  A
Subjt:  SRKKVAVVGAGVSGLAAAYKLKSHGFNVTVLEADARAGGKLRSVSYNGLIWDEGANTMASA

AT5G14220.2 Flavin containing amine oxidoreductase family2.1e-2081.97Show/hide
Query:  SRKKVAVVGAGVSGLAAAYKLKSHGFNVTVLEADARAGGKLRSVSYNGLIWDEGANTMASA
        S K+VAVVGAGVSGLAAAYKLKS G NVTV EAD R GGKLRSV  NGLIWDEGANTM  A
Subjt:  SRKKVAVVGAGVSGLAAAYKLKSHGFNVTVLEADARAGGKLRSVSYNGLIWDEGANTMASA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTTCCTGTTTTGAGCTAAATTCTGCTGGTGATAGCTTAATTCTTTGCATTTTCTTCGATTTTATAGCATCCAGAAAGAAGGTAGCTGTTGTTGGCGCTGGCGTTAG
TGGGCTTGCTGCAGCCTACAAATTGAAATCGCATGGTTTCAATGTTACGGTGCTTGAAGCGGATGCAAGAGCTGGAGGAAAGCTAAGAAGCGTCTCATACAATGGACTTA
TCTGGGATGAAGGAGCCAATACAATGGCAAGTGCCTACGAATCCCATTGCATTGATCAAGAGCAACTTTCTCTCTGCCAAGTCAAAGTTTGTGCAAACGGCAAAGTAAGC
ATGGAGGATAGTGATGATAAACCCTGTGAGAACAGAGATCTATACGATGTCGTCTGCCTGTACCTAGCCTTCCAGCCTAAACGATGTCGTTCTAATTGCCTTTGTATTGT
TAGTATTCTGTTATTCTTAGGGGCAAACGCCATGGCTGATGAAGTCTCATCATCAACCGCTGCTGCTGTAGTATTATCATCTGCCATCACTCCTTCAACCAATGTTGTAA
GTTCTTCGTTTGGCCATCCCCTGAGCACAGTCTTAACTGTGAAATTAGATGAAAACAACTACCTCCTCTGGAGGGGAATGGTTCTAGCCATTCTTAGAGGCCAAAAGAGA
GGTCTGGAAGGCCCTGGAGGAAGTTTACGGCGCTACCAGCAAAGCTCGCATAAATCAACTAAGAGGAATTCTTCAAAACACCAAGAAGGAGCTACAAAAATGGTCGAGTA
CTTGACTGTGATGAAGCAAGCTTCGGACAACCTTCAACTTGCTGGTAATCCTATATGTCTTGGTGACTTAATTTCATATGTTCTTGCGGGATTGGATCCTGAGTACATTC
CTATTGTTTGCTCGATTAATGACAAAGATGTTAAGACCTGGCAGGAGCTCAGTTCTATTTTGATCACTTTTGAGGGAACCTTGTCACGTTACAGCAGCAATGCTACTAAC
ACTGTTTTACCTGATTTATCGGTGCAACTTGCTCTTAATCGACCGAGTAGATATGTTGAACAACAAAAACCCTATAGTCCCAGTCGCAACTCTGGCAGTAATCAAGGAAA
TTCGCAATCTAGTTACTCTAGAGGCAGTGGCAACAGAGGAAGAGGTCGTGGTGGTAGGAACAACTATCAACGAAACAATAGACCAACCTGTCAGCTGTGTGGAAAATTCG
GACACTCAGCACCTGCTTGCTATTTTCGCTTTGAAGAGGACTTCAATAACCCTCATGCCTCAGGTAATAACTCGAACCAGGGTGGCAGTTCATCTAACAATCCTACCTCT
GCTTATATCGCAACACCTGAAATTGTAAATGATCCTCGTTGGCTTGCTGATAGTGGTGCCACCACACATGTGACTGCCAATGCTGGTAATCTGGGAGTGAAACCGAGTAT
CATGGTAAGGAAATCTCATTGTTGGGGATGGAACCAAGCTAGCATCAAACGAAATCTCATTAGCATTGCTCGATTGACTGCGGATAACAATGTCTATGTTGAATTTCACT
CTAATGGCTGTTTTGTGAAGGACAAGGCTTCAAGGAGGGTGATGCTTCACGGAACACTTAAAAATGGCTTGTACCAGCTCGAGCTTCCTTCAATTCAAAAGTCCACAACT
GAAGTCAGTTCTTCTACCTCTCATGTTGCTGCTGTTTCAAAAATAGTTAATAGTGTCATTAAGTCCTGTAATTTGAATGCTTCAGTGAATGAAAACTCCATGTTCTGTGA
TGCTTGTCAATTAGGCAAGTCTCACCGTTTACCTTTTACTCGCTCAGTCACTAAGTCTACTCATCCTCTTGCACTTGTTCATTGTGACTTATGGGGGCCCTCACCTGTTG
TGTCTACTTCTGGTTATCGTTTCTATATTAGCTTTGTAGATGACTATACCAGACTCACCTATATATTTCCCCTTAAACTTAAAATTGTTCCTTCTCCCATTATCACAGCC
TCATCTTCAAATCCTGAGCAGTCCACTGATTCTGTCCCATTGCCAGTGTTCCGTGATGCATCACTATCCTCGCCCTCATCCTCTACCGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGCTTTCCTGTTTTGAGCTAAATTCTGCTGGTGATAGCTTAATTCTTTGCATTTTCTTCGATTTTATAGCATCCAGAAAGAAGGTAGCTGTTGTTGGCGCTGGCGTTAG
TGGGCTTGCTGCAGCCTACAAATTGAAATCGCATGGTTTCAATGTTACGGTGCTTGAAGCGGATGCAAGAGCTGGAGGAAAGCTAAGAAGCGTCTCATACAATGGACTTA
TCTGGGATGAAGGAGCCAATACAATGGCAAGTGCCTACGAATCCCATTGCATTGATCAAGAGCAACTTTCTCTCTGCCAAGTCAAAGTTTGTGCAAACGGCAAAGTAAGC
ATGGAGGATAGTGATGATAAACCCTGTGAGAACAGAGATCTATACGATGTCGTCTGCCTGTACCTAGCCTTCCAGCCTAAACGATGTCGTTCTAATTGCCTTTGTATTGT
TAGTATTCTGTTATTCTTAGGGGCAAACGCCATGGCTGATGAAGTCTCATCATCAACCGCTGCTGCTGTAGTATTATCATCTGCCATCACTCCTTCAACCAATGTTGTAA
GTTCTTCGTTTGGCCATCCCCTGAGCACAGTCTTAACTGTGAAATTAGATGAAAACAACTACCTCCTCTGGAGGGGAATGGTTCTAGCCATTCTTAGAGGCCAAAAGAGA
GGTCTGGAAGGCCCTGGAGGAAGTTTACGGCGCTACCAGCAAAGCTCGCATAAATCAACTAAGAGGAATTCTTCAAAACACCAAGAAGGAGCTACAAAAATGGTCGAGTA
CTTGACTGTGATGAAGCAAGCTTCGGACAACCTTCAACTTGCTGGTAATCCTATATGTCTTGGTGACTTAATTTCATATGTTCTTGCGGGATTGGATCCTGAGTACATTC
CTATTGTTTGCTCGATTAATGACAAAGATGTTAAGACCTGGCAGGAGCTCAGTTCTATTTTGATCACTTTTGAGGGAACCTTGTCACGTTACAGCAGCAATGCTACTAAC
ACTGTTTTACCTGATTTATCGGTGCAACTTGCTCTTAATCGACCGAGTAGATATGTTGAACAACAAAAACCCTATAGTCCCAGTCGCAACTCTGGCAGTAATCAAGGAAA
TTCGCAATCTAGTTACTCTAGAGGCAGTGGCAACAGAGGAAGAGGTCGTGGTGGTAGGAACAACTATCAACGAAACAATAGACCAACCTGTCAGCTGTGTGGAAAATTCG
GACACTCAGCACCTGCTTGCTATTTTCGCTTTGAAGAGGACTTCAATAACCCTCATGCCTCAGGTAATAACTCGAACCAGGGTGGCAGTTCATCTAACAATCCTACCTCT
GCTTATATCGCAACACCTGAAATTGTAAATGATCCTCGTTGGCTTGCTGATAGTGGTGCCACCACACATGTGACTGCCAATGCTGGTAATCTGGGAGTGAAACCGAGTAT
CATGGTAAGGAAATCTCATTGTTGGGGATGGAACCAAGCTAGCATCAAACGAAATCTCATTAGCATTGCTCGATTGACTGCGGATAACAATGTCTATGTTGAATTTCACT
CTAATGGCTGTTTTGTGAAGGACAAGGCTTCAAGGAGGGTGATGCTTCACGGAACACTTAAAAATGGCTTGTACCAGCTCGAGCTTCCTTCAATTCAAAAGTCCACAACT
GAAGTCAGTTCTTCTACCTCTCATGTTGCTGCTGTTTCAAAAATAGTTAATAGTGTCATTAAGTCCTGTAATTTGAATGCTTCAGTGAATGAAAACTCCATGTTCTGTGA
TGCTTGTCAATTAGGCAAGTCTCACCGTTTACCTTTTACTCGCTCAGTCACTAAGTCTACTCATCCTCTTGCACTTGTTCATTGTGACTTATGGGGGCCCTCACCTGTTG
TGTCTACTTCTGGTTATCGTTTCTATATTAGCTTTGTAGATGACTATACCAGACTCACCTATATATTTCCCCTTAAACTTAAAATTGTTCCTTCTCCCATTATCACAGCC
TCATCTTCAAATCCTGAGCAGTCCACTGATTCTGTCCCATTGCCAGTGTTCCGTGATGCATCACTATCCTCGCCCTCATCCTCTACCGGATAG
Protein sequenceShow/hide protein sequence
MLSCFELNSAGDSLILCIFFDFIASRKKVAVVGAGVSGLAAAYKLKSHGFNVTVLEADARAGGKLRSVSYNGLIWDEGANTMASAYESHCIDQEQLSLCQVKVCANGKVS
MEDSDDKPCENRDLYDVVCLYLAFQPKRCRSNCLCIVSILLFLGANAMADEVSSSTAAAVVLSSAITPSTNVVSSSFGHPLSTVLTVKLDENNYLLWRGMVLAILRGQKR
GLEGPGGSLRRYQQSSHKSTKRNSSKHQEGATKMVEYLTVMKQASDNLQLAGNPICLGDLISYVLAGLDPEYIPIVCSINDKDVKTWQELSSILITFEGTLSRYSSNATN
TVLPDLSVQLALNRPSRYVEQQKPYSPSRNSGSNQGNSQSSYSRGSGNRGRGRGGRNNYQRNNRPTCQLCGKFGHSAPACYFRFEEDFNNPHASGNNSNQGGSSSNNPTS
AYIATPEIVNDPRWLADSGATTHVTANAGNLGVKPSIMVRKSHCWGWNQASIKRNLISIARLTADNNVYVEFHSNGCFVKDKASRRVMLHGTLKNGLYQLELPSIQKSTT
EVSSSTSHVAAVSKIVNSVIKSCNLNASVNENSMFCDACQLGKSHRLPFTRSVTKSTHPLALVHCDLWGPSPVVSTSGYRFYISFVDDYTRLTYIFPLKLKIVPSPIITA
SSSNPEQSTDSVPLPVFRDASLSSPSSSTG