CuGenDBv2

Moc10g04580 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc10g04580
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Integrase catalytic domain-containing protein
Genome location: chr10:3177259..3181127
RNA-Seq Expression: Moc10g04580
Synteny: Moc10g04580
Gene Ontology terms: NA
InterPro domains: IPR025724 - GAG-pre-integrase domain
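
The genome location listed above uses a `chromosome:start..end` form. The following is a minimal sketch of how such a string could be parsed and the locus span computed. It assumes the coordinates are 1-based and inclusive, which this page does not state; the helper names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr10:3177259..3181127")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the locus spans 3,869 bp on chr10.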


Homology
GenBank top hits | e value | % identity | Alignment
KAA8524269.1 hypothetical protein F0562_010692 [Nyssa sinensis] | 3.7e-51 | 33.5
Query:  EDLVIYAFNGLPSDFNTFRTSMRTRPQTVSFEELHVLLVAEETALDKQLKKDELFAQPTALLA-----NQLHNQNLNPNTFRGKGYASGK----------
        ED++IY  NGLP ++N F+TS+RT+ + ++ EE++ +L  EE  ++   K++     P A++A     N   N+  +P+ F G+G   G+          
Subjt:  EDLVIYAFNGLPSDFNTFRTSMRTRPQTVSFEELHVLLVAEETALDKQLKKDELFAQPTALLA-----NQLHNQNLNPNTFRGKGYASGK----------

Query:  -GKF--PASGNGNAGNRGRSLSSGNSGNRLAYLRLCPL--------VYSVHQLILLNKG-PPSTRSTSCHGSCSNLAVVNGSSTQWLADSGCNTHVTSDL
         G+F  P  G  N     +     N  +  ++  +C +        +   H++    +G PPS + T+   + +  +  + S   W  D+G   H+T+DL
Subjt:  -GKF--PASGNGNAGNRGRSLSSGNSGNRLAYLRLCPL--------VYSVHQLILLNKG-PPSTRSTSCHGSCSNLAVVNGSSTQWLADSGCNTHVTSDL

Query:  GQLTGLSEYRGDDQITVGNGHSLPITHSGCGSISTSYSNLRLQNLLHVPNISTNLLSVHRLCLDNNCLVVFDANQFFIQDKSSGQILFHGPSINGFYPIP
          L    EY+GDD IT+ NG +L I+HSG  SI  +    RL N+L VP+++TNLLSVH+ C DN+C  +FD+  F IQDK++ Q+LF GPS +G YP+P
Subjt:  GQLTGLSEYRGDDQITVGNGHSLPITHSGCGSISTSYSNLRLQNLLHVPNISTNLLSVHRLCLDNNCLVVFDANQFFIQDKSSGQILFHGPSINGFYPIP

Query:  STVFPPSTVHTSP--LPSL-------------------------VAHVGAQSSSSVWHDRLGHPGDNVLRTVLRSLNLSMCSSLNHVCQHCLHGKMHKLS
        ++     T H++P   P L                          A++G Q S+ +WHDRLGHP    L+++L S +++       +CQHCL GKM KL 
Subjt:  STVFPPSTVHTSP--LPSL-------------------------VAHVGAQSSSSVWHDRLGHPGDNVLRTVLRSLNLSMCSSLNHVCQHCLHGKMHKLS

Query:  FPVSST
        FP+S+T
Subjt:  FPVSST

PKU66571.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Dendrobium catenatum] | 1.3e-40 | 33.33
Query:  EDLVIYAFNGLPSDFNTFRTSMRTRPQTVSFEELHVLLVAEETALDKQLKKDELFAQPTALLANQLHNQNLNPNTFRGKGYASGKGKFPASGNGNAGNRG
        ED+++Y  NGLP+ +N+F++S+RT    +S E L+ LL +EE  +  +L+KD                    P       + S +G+   +  G    RG
Subjt:  EDLVIYAFNGLPSDFNTFRTSMRTRPQTVSFEELHVLLVAEETALDKQLKKDELFAQPTALLANQLHNQNLNPNTFRGKGYASGKGKFPASGNGNAGNRG

Query:  RSLSSGNSGNRLAYLRLCPLVYSVHQLILLNKGPPST-----RSTSCHGSCSNLAVVNGSST-QWLADSGCNTHVTSDLGQLTGLSEYRGDDQITVGNGH
        R ++S    N        P +  V Q+   N     T      S+S   S + L+    S+T +W+ DSG   H+T+D   +   S Y G D +++ NG 
Subjt:  RSLSSGNSGNRLAYLRLCPLVYSVHQLILLNKGPPST-----RSTSCHGSCSNLAVVNGSST-QWLADSGCNTHVTSDLGQLTGLSEYRGDDQITVGNGH

Query:  SLPITHSGCG--SISTSYSNLRLQNLLHVPNISTNLLSVHRLCLDNNCLVVFDANQFFIQDKSSGQILFHGPSINGFYPIPSTVFPPSTVHTSPLPSLVA
         +PI H G G   +  +   L LQ +LH+P +S NLLS+H+L  DNNC V FDA+ F IQD+S  QIL  G   NG YPI       ST  T+   +   
Subjt:  SLPITHSGCG--SISTSYSNLRLQNLLHVPNISTNLLSVHRLCLDNNCLVVFDANQFFIQDKSSGQILFHGPSINGFYPIPSTVFPPSTVHTSPLPSLVA

Query:  HVGAQSSSSVWHDRLGHPGDNVLRTVLRSLNLSMCSSLNHVCQHCLHGKMHKLSFPVSST
            + SS  WH RLGHP   +L+ + + +     S+++++C+ C  GK HKL F  SST
Subjt:  HVGAQSSSSVWHDRLGHPGDNVLRTVLRSLNLSMCSSLNHVCQHCLHGKMHKLSFPVSST

PKU67431.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Dendrobium catenatum] | 2.9e-40 | 35.54
Query:  EDLVIYAFNGLPSDFNTFRTSMRTRPQTVSFEELHVLLVAEETALDKQLKKDELFAQPTALLANQLHNQNLNPNTFRGKG-YASGKGKFPASGNGNAGNR
        ED+++Y  NGLP+ +N F++S+RT    +S E L+ LL +EE  L  +L++D   AQP++       NQ       RG+G +++GKG+    G G+    
Subjt:  EDLVIYAFNGLPSDFNTFRTSMRTRPQTVSFEELHVLLVAEETALDKQLKKDELFAQPTALLANQLHNQNLNPNTFRGKG-YASGKGKFPASGNGNAGNR

Query:  GRSLSSGNSGNRLAYLRLCPL-VYSVHQLILLNKGPPSTRSTSCHGSCSNLAVVNGSSTQWLADSGCNTHVTSDLGQLTGLSEYRGDDQITVGNGHSLPI
            S   + N     ++C    ++          PP+  S +     +     N S+ +W+ DSG  +H+T+D   L   + Y G D I++ NG++LPI
Subjt:  GRSLSSGNSGNRLAYLRLCPL-VYSVHQLILLNKGPPSTRSTSCHGSCSNLAVVNGSSTQWLADSGCNTHVTSDLGQLTGLSEYRGDDQITVGNGHSLPI

Query:  THSGCG--SISTSYSNLRLQNLLHVPNISTNLLSVHRLCLDNNCLVVFDANQFFIQDKSSGQILFHGPSINGFYPIPSTVFPPSTVHTSPLPSLVAHVGA
         H G G   +  +   L LQN+LHVP +S NLLS+H+L  DNNC + FD + + IQD +  QIL  G + NG Y    ++F P  + TSP          
Subjt:  THSGCG--SISTSYSNLRLQNLLHVPNISTNLLSVHRLCLDNNCLVVFDANQFFIQDKSSGQILFHGPSINGFYPIPSTVFPPSTVHTSPLPSLVAHVGA

Query:  QSSSSVWHDRLGHPGDNVLRTVLRSLNLSMC-SSLNHVCQHCLHGKMHKLSFPVSSTLPINSP
        +S SS WH RL HP  +VL+ VL  L  S+C  S + +C+ C  GK HKL F VSST    +P
Subjt:  QSSSSVWHDRLGHPGDNVLRTVLRSLNLSMC-SSLNHVCQHCLHGKMHKLSFPVSSTLPINSP

PKU70484.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Dendrobium catenatum] | 4.5e-41 | 33.24
Query:  EDLVIYAFNGLPSDFNTFRTSMRTRPQTVSFEELHVLLVAEETALDKQLKKDELFAQPTALLANQLHNQNLNPNTFRGKGYASGKGKFPASGNGNAGNRG
        ED+++Y  NGLPS + +F+T++RT  Q +S ++L+ LL +EE  L ++  KD             L    +  N+     +A+ +G+     N   GNRG
Subjt:  EDLVIYAFNGLPSDFNTFRTSMRTRPQTVSFEELHVLLVAEETALDKQLKKDELFAQPTALLANQLHNQNLNPNTFRGKGYASGKGKFPASGNGNAGNRG

Query:  RSLSSGNS-GNRLAYLRLCPLVYSV--HQLILLNKGPPSTRSTSCHGSCSNLAVVNGSSTQWLADSGCNTHVTSDLGQLTGLSEYRGDDQITVGNGHSLP
        R  S+    G++  ++     + S   H  I          S + + S    A  N  ST W  DSG ++H+T+D  QL+    Y G+ Q+ +GNG SLP
Subjt:  RSLSSGNS-GNRLAYLRLCPLVYSV--HQLILLNKGPPSTRSTSCHGSCSNLAVVNGSSTQWLADSGCNTHVTSDLGQLTGLSEYRGDDQITVGNGHSLP

Query:  ITHSGCGSISTSYSNLRLQNLLHVPNISTNLLSVHRLCLDNNCLVVFDANQFFIQDKSSGQILFHGPSINGFYPIPSTVFPPSTVHTSPLPSLVAHVGAQ
        I ++G G + T    L L  + HVPN+S NLLSVH+L  DNNC+V F ++ + I+D  + ++L  GP  NG Y +PS V      H       +A + A+
Subjt:  ITHSGCGSISTSYSNLRLQNLLHVPNISTNLLSVHRLCLDNNCLVVFDANQFFIQDKSSGQILFHGPSINGFYPIPSTVFPPSTVHTSPLPSLVAHVGAQ

Query:  SSSSVWHDRLGHPGDNVLRTVLRSLNLSMCSSLNHVCQHCLHGKMHKLSFPVSST
            +WH RLGHP   VL+ + R+ N ++C+S ++ C  C   K  +L F  S++
Subjt:  SSSSVWHDRLGHPGDNVLRTVLRSLNLSMCSSLNHVCQHCLHGKMHKLSFPVSST

PKU73010.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Dendrobium catenatum] | 1.3e-40 | 33.7
Query:  EDLVIYAFNGLPSDFNTFRTSMRTRPQTVSFEELHVLLVAEETAL----DKQLKKDELFAQPTALLANQLHNQNLNPNTFRGKGYASGKGKFPASGNGNA
        ED+++Y  NGLP+ +  F+TS+RT  Q ++ ++L+ LL +EE  +     K+L+   L   PTAL A +           RG+   S KG+F  S   N 
Subjt:  EDLVIYAFNGLPSDFNTFRTSMRTRPQTVSFEELHVLLVAEETAL----DKQLKKDELFAQPTALLANQLHNQNLNPNTFRGKGYASGKGKFPASGNGNA

Query:  GNRGRSLSSGNSGNRLAYLRLCPLVYSVHQLILLNKGP---PSTRSTSCHGSCSNLAVVNGSSTQWLADSGCNTHVTSDLGQLTGLSEYRGDDQITVGNG
         N   +     S  R    ++C               P   PS++ T+   S       N + + W  DSG +TH+T+D  Q      Y G+ QIT+GNG
Subjt:  GNRGRSLSSGNSGNRLAYLRLCPLVYSVHQLILLNKGP---PSTRSTSCHGSCSNLAVVNGSSTQWLADSGCNTHVTSDLGQLTGLSEYRGDDQITVGNG

Query:  HSLPITHSGCGSISTSYSNLRLQNLLHVPNISTNLLSVHRLCLDNNCLVVFDANQFFIQDKSSGQILFHGPSINGFYPIPSTVFPPSTVHTSPLPSLVAH
          LPI ++G G + T   +L+L  L  VPN+S NL+SV +L  DNNC++ FD+N + I+D  + ++L  GP  NG YPI      P T         +A 
Subjt:  HSLPITHSGCGSISTSYSNLRLQNLLHVPNISTNLLSVHRLCLDNNCLVVFDANQFFIQDKSSGQILFHGPSINGFYPIPSTVFPPSTVHTSPLPSLVAH

Query:  VGAQSSSSVWHDRLGHPGDNVLRTVLRSLNLSMCSSLNHVCQHCLHGKMHKLSFPVSST
        +  Q+   +WH RLGHPG NVL+ + +       S    VC  C   K  +LSFP+S++
Subjt:  VGAQSSSSVWHDRLGHPGDNVLRTVLRSLNLSMCSSLNHVCQHCLHGKMHKLSFPVSST

TrEMBL top hits | e value | % identity | Alignment
A0A2N9FUJ5 Uncharacterized protein | 1.5e-50 | 36.72
Query:  EDLVIYAFNGLPSDFNTFRTSMRTRPQTVSFEELHVLLVAEETAL------DKQLKKDELFAQPTA------------LLANQLHNQNLNPNTFRGKGYA
        E+++     GLP ++  F +++RTR + +SFEE+ VLL  EE +L       K L    +FA  T             + +NQ   +  N N     G  
Subjt:  EDLVIYAFNGLPSDFNTFRTSMRTRPQTVSFEELHVLLVAEETAL------DKQLKKDELFAQPTA------------LLANQLHNQNLNPNTFRGKGYA

Query:  SGKGKFPASG---NGNA---GNRGRSLSSGNS-------GNRLAYLRLCPLVYSV--------HQLILLNKGPPSTRSTSCHGSCSNLAVVNGSSTQWLA
        S   ++P S    +GNA    N+    ++GN+       G       +C +   V        H++    +G       +   S SN +      T WL 
Subjt:  SGKGKFPASG---NGNA---GNRGRSLSSGNS-------GNRLAYLRLCPLVYSV--------HQLILLNKGPPSTRSTSCHGSCSNLAVVNGSSTQWLA

Query:  DSGCNTHVTSDLGQLTGLSEYRGDDQITVGNGHSLPITHSGCGSISTSYSNLRLQNLLHVPNISTNLLSVHRLCLDNNCLVVFDANQFFIQDKSSGQILF
        D+G   H+TS+LG LTG + Y+G DQ+ VGNG S+PI + G G +ST + N RL NLLH P IS+NLLSVH+LC  NNC   FD+N+F IQD  SG++L+
Subjt:  DSGCNTHVTSDLGQLTGLSEYRGDDQITVGNGHSLPITHSGCGSISTSYSNLRLQNLLHVPNISTNLLSVHRLCLDNNCLVVFDANQFFIQDKSSGQILF

Query:  HGPSINGFYPIPSTVFPPSTVHTS-PLPSLVAHVGAQSSSSVWHDRLGHPGDNVLRTVLRSLNLSMCSSLN-----HVCQHCLHGKMHKLSFPVS---ST
         G S NG YPI +    PS   +S    S+ A + +++   +WH RLGHP D VL + + S  LS C S+N     H C+HCL GKMHKL F  S   ST
Subjt:  HGPSINGFYPIPSTVFPPSTVHTS-PLPSLVAHVGAQSSSSVWHDRLGHPGDNVLRTVLRSLNLSMCSSLN-----HVCQHCLHGKMHKLSFPVS---ST

Query:  LPI
         P+
Subjt:  LPI

A0A2N9HGS3 Uncharacterized protein | 3.0e-51 | 37
Query:  EDLVIYAFNGLPSDFNTFRTSMRTRPQTVSFEELHVLLVAEETAL------DKQLKKDELFAQPTALLANQLHNQNLNPNTFRGKGYAS---GKGKFPAS
        EDL+     GLP +F  F +S+RTR   VSFEE+ VLL  EE +L       K L    +FA       N   +  ++    RG+G  +   G+G+F ++
Subjt:  EDLVIYAFNGLPSDFNTFRTSMRTRPQTVSFEELHVLLVAEETAL------DKQLKKDELFAQPTALLANQLHNQNLNPNTFRGKGYAS---GKGKFPAS

Query:  GNGNAGNRGRSLSS----------------------GNSGNRLAYLRLC-----PLVYSVHQLILLNKGPPSTRSTSCHGSCSNLAVVNGSSTQWLADSG
         + NA  +G +  S                      G S N     ++C       +   H++    +G       +   S SN + +    T WL D+G
Subjt:  GNGNAGNRGRSLSS----------------------GNSGNRLAYLRLC-----PLVYSVHQLILLNKGPPSTRSTSCHGSCSNLAVVNGSSTQWLADSG

Query:  CNTHVTSDLGQLTGLSEYRGDDQITVGNGHSLPITHSGCGSISTSYSNLRLQNLLHVPNISTNLLSVHRLCLDNNCLVVFDANQFFIQDKSSGQILFHGP
           H+TS+L  LTG + Y+G DQ+ VGNG ++PI + G G +ST   N +L NLLH P IS+NLLSVH+LC  NNC   FD+N+F IQ   SG++L+ G 
Subjt:  CNTHVTSDLGQLTGLSEYRGDDQITVGNGHSLPITHSGCGSISTSYSNLRLQNLLHVPNISTNLLSVHRLCLDNNCLVVFDANQFFIQDKSSGQILFHGP

Query:  SINGFYPIPSTVFPPSTVHTSPLPSLVAHVGAQSSSSVWHDRLGHPGDNVLRTVLRSLNLSMCSSLN--HV---CQHCLHGKMHKLSFPVSSTLPINSPM
        S NG YPI +    P +V  SP  S+ A + +++   +WH RLGHP D VL + + S  LS C S+N  HV   C+HCL GKMHKL F VSS      P+
Subjt:  SINGFYPIPSTVFPPSTVHTSPLPSLVAHVGAQSSSSVWHDRLGHPGDNVLRTVLRSLNLSMCSSLN--HV---CQHCLHGKMHKLSFPVSSTLPINSPM

A0A2N9HZ49 Uncharacterized protein | 4.0e-51 | 37.83
Query:  EDLVIYAFNGLPSDFNTFRTSMRTRPQTVSFEELHVLLVAEETAL------DKQLKKDELFAQPTALLANQLHNQNLNPNTFRGKGYAS-GKGKFPASGN
        E+++     GLP ++  F +++RTR + +SFEE+ VLL  EE +L       K L    +FA  T    N     +   ++ + +G A     ++P S  
Subjt:  EDLVIYAFNGLPSDFNTFRTSMRTRPQTVSFEELHVLLVAEETAL------DKQLKKDELFAQPTALLANQLHNQNLNPNTFRGKGYAS-GKGKFPASGN

Query:  GNAGNRGRSLSSGNSGNRLAYLRLCPLV-------YSVHQLILLNKGPPSTRSTSCHGSCSNLAVVNGSSTQWLADSGCNTHVTSDLGQLTGLSEYRGDD
        GNA  +      G S       ++C  V       Y         + PP+    +   S SN +      T WL D+G   H+TS+LG LTG + Y+G D
Subjt:  GNAGNRGRSLSSGNSGNRLAYLRLCPLV-------YSVHQLILLNKGPPSTRSTSCHGSCSNLAVVNGSSTQWLADSGCNTHVTSDLGQLTGLSEYRGDD

Query:  QITVGNGHSLPITHSGCGSISTSYSNLRLQNLLHVPNISTNLLSVHRLCLDNNCLVVFDANQFFIQDKSSGQILFHGPSINGFYPIPSTVFPPS-TVHTS
        Q+ VGNG S+PI + G G +ST + N RL NLLH P IS+NLLSVH+LC  NN    FD+N+F IQD  SG++L+ G S NG YPI +    PS +  +S
Subjt:  QITVGNGHSLPITHSGCGSISTSYSNLRLQNLLHVPNISTNLLSVHRLCLDNNCLVVFDANQFFIQDKSSGQILFHGPSINGFYPIPSTVFPPS-TVHTS

Query:  PLPSLVAHVGAQSSSSVWHDRLGHPGDNVLRTVLRSLNLSMCSSLN-----HVCQHCLHGKMHKLSFPVS---STLPI
           S+ A + +++   +WH RLGHP D VL + + S  LS C S+N     H C+HCL GKMHKL F  S   ST P+
Subjt:  PLPSLVAHVGAQSSSSVWHDRLGHPGDNVLRTVLRSLNLSMCSSLN-----HVCQHCLHGKMHKLSFPVS---STLPI

A0A2N9J043 Integrase catalytic domain-containing protein | 7.5e-50 | 35.42
Query:  EDLVIYAFNGLPSDFNTFRTSMRTRPQTVSFEELHVLLVAEETALDKQLKKDELFAQPTALLANQLHNQNLNPNTFRGKGYASGKGKFPASGNGNAGNRG
        E+L+     GLP +F  F +++RTR  T+SFE+L VLL  EE ++ +        A    +  NQ  +   N      +GY  G+G      + N G  G
Subjt:  EDLVIYAFNGLPSDFNTFRTSMRTRPQTVSFEELHVLLVAEETALDKQLKKDELFAQPTALLANQLHNQNLNPNTFRGKGYASGKGKFPASGNGNAGNRG

Query:  RSLSSGNSGNRLAYLR---------------------LCPLVYSV--------HQLILLNKGPPSTRSTSCHGSCSNLAVVNGSSTQWLADSGCNTHVTS
        RS  S ++ +++ +L                       C + +          H++    +G       +   S SNL     S T WL D+G + H+T+
Subjt:  RSLSSGNSGNRLAYLR---------------------LCPLVYSV--------HQLILLNKGPPSTRSTSCHGSCSNLAVVNGSSTQWLADSGCNTHVTS

Query:  DLGQLTGLSEYRGDDQITVGNGHSLPITHSGCGSISTSYSNLRLQNLLHVPNISTNLLSVHRLCLDNNCLVVFDANQFFIQDKSSGQILFHGPSINGFYP
            L   + Y+G DQ+TVGNG SLPI   G   + T Y   +L+N+LH+P I++NLLSVH+LCLDNNC  +FDANQ  IQD  +G++L+ G S NG YP
Subjt:  DLGQLTGLSEYRGDDQITVGNGHSLPITHSGCGSISTSYSNLRLQNLLHVPNISTNLLSVHRLCLDNNCLVVFDANQFFIQDKSSGQILFHGPSINGFYP

Query:  IPSTVFPPSTVHTSPLPSLVAHVGAQSSSSVWHDRLGHPGDNVLRTVLRSLNLSMCSSL--NHV---CQHCLHGKMHKLSFPVS
        I S+ F      ++ +        +     +WH RLGHP   VL T+L S  LS C+SL  N V   C+HC+ GKMH+L FPVS
Subjt:  IPSTVFPPSTVHTSPLPSLVAHVGAQSSSSVWHDRLGHPGDNVLRTVLRSLNLSMCSSL--NHV---CQHCLHGKMHKLSFPVS

A0A5J5A1U7 Integrase catalytic domain-containing protein | 1.8e-51 | 33.5
Query:  EDLVIYAFNGLPSDFNTFRTSMRTRPQTVSFEELHVLLVAEETALDKQLKKDELFAQPTALLA-----NQLHNQNLNPNTFRGKGYASGK----------
        ED++IY  NGLP ++N F+TS+RT+ + ++ EE++ +L  EE  ++   K++     P A++A     N   N+  +P+ F G+G   G+          
Subjt:  EDLVIYAFNGLPSDFNTFRTSMRTRPQTVSFEELHVLLVAEETALDKQLKKDELFAQPTALLA-----NQLHNQNLNPNTFRGKGYASGK----------

Query:  -GKF--PASGNGNAGNRGRSLSSGNSGNRLAYLRLCPL--------VYSVHQLILLNKG-PPSTRSTSCHGSCSNLAVVNGSSTQWLADSGCNTHVTSDL
         G+F  P  G  N     +     N  +  ++  +C +        +   H++    +G PPS + T+   + +  +  + S   W  D+G   H+T+DL
Subjt:  -GKF--PASGNGNAGNRGRSLSSGNSGNRLAYLRLCPL--------VYSVHQLILLNKG-PPSTRSTSCHGSCSNLAVVNGSSTQWLADSGCNTHVTSDL

Query:  GQLTGLSEYRGDDQITVGNGHSLPITHSGCGSISTSYSNLRLQNLLHVPNISTNLLSVHRLCLDNNCLVVFDANQFFIQDKSSGQILFHGPSINGFYPIP
          L    EY+GDD IT+ NG +L I+HSG  SI  +    RL N+L VP+++TNLLSVH+ C DN+C  +FD+  F IQDK++ Q+LF GPS +G YP+P
Subjt:  GQLTGLSEYRGDDQITVGNGHSLPITHSGCGSISTSYSNLRLQNLLHVPNISTNLLSVHRLCLDNNCLVVFDANQFFIQDKSSGQILFHGPSINGFYPIP

Query:  STVFPPSTVHTSP--LPSL-------------------------VAHVGAQSSSSVWHDRLGHPGDNVLRTVLRSLNLSMCSSLNHVCQHCLHGKMHKLS
        ++     T H++P   P L                          A++G Q S+ +WHDRLGHP    L+++L S +++       +CQHCL GKM KL 
Subjt:  STVFPPSTVHTSP--LPSL-------------------------VAHVGAQSSSSVWHDRLGHPGDNVLRTVLRSLNLSMCSSLNHVCQHCLHGKMHKLS

Query:  FPVSST
        FP+S+T
Subjt:  FPVSST

SwissProt top hits | e value | % identity | Alignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.2e-04 | 22.79
Query:  EDLVIYAFNGLPSDFNTFRTSMRTRPQTVSFEELHVLLVAEETALDKQLKKDELFAQPTALLANQLHNQNLNPNTFRGKGYASGKGKFPASG-NGNAGNR
        ED  I   N LPS ++   T++     T+  +++   L+  E    K  KK E   Q  AL+              RG+ Y      +  SG  G + NR
Subjt:  EDLVIYAFNGLPSDFNTFRTSMRTRPQTVSFEELHVLLVAEETALDKQLKKDELFAQPTALLANQLHNQNLNPNTFRGKGYASGKGKFPASG-NGNAGNR

Query:  GRS--LSSGNSGNRLAYLRLCPLVYSVHQLILLNKGPPSTRSTSCHGSCSNLAV--------VNGSSTQWLADSGCNTHVT--SDLGQLTGLSEYRGDDQ
         +S   +  N      + R CP            K   +T +   +     L +        ++G  ++W+ D+  + H T   DL        Y   D 
Subjt:  GRS--LSSGNSGNRLAYLRLCPLVYSVHQLILLNKGPPSTRSTSCHGSCSNLAV--------VNGSSTQWLADSGCNTHVT--SDLGQLTGLSEYRGDDQ

Query:  ITVGNGHSLPITHSGCGSISTSYS---NLRLQNLLHVPNISTNLLSVHRLCLDNNCLVVFDANQFFIQDKSSGQILFHGPSINGFYPIPSTVFPPSTVHT
         TV  G++     +G G I    +    L L+++ HVP++  NL+S   + LD +    + ANQ +   K S  ++  G +    Y   + +        
Subjt:  ITVGNGHSLPITHSGCGSISTSYS---NLRLQNLLHVPNISTNLLSVHRLCLDNNCLVVFDANQFFIQDKSSGQILFHGPSINGFYPIPSTVFPPSTVHT

Query:  SPLPSLVAHVGAQSSSSVWHDRLGHPGDNVLRTVLRSLNLSMCSSLN-HVCQHCLHGKMHKLSFPVSSTLPIN
              +     + S  +WH R+GH  +  L+ + +   +S         C +CL GK H++SF  SS   +N
Subjt:  SPLPSLVAHVGAQSSSSVWHDRLGHPGDNVLRTVLRSLNLSMCSSLN-HVCQHCLHGKMHKLSFPVSSTLPIN

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 1.6e-25 | 31.23
Query:  NQLHNQNLNPNTFRGKGYASGKGKFPASGNGNAGNRGRSL---SSGNSGNRLAYLRLCPLVYSVHQLILLNKGPPSTRSTSCHGSCSNLAVVNG-SSTQW
        N+  N+N N N+   K +      F  + N +    G+       G+S  R + L+        H L  +N   P +  T      +NLA+ +  SS  W
Subjt:  NQLHNQNLNPNTFRGKGYASGKGKFPASGNGNAGNRGRSL---SSGNSGNRLAYLRLCPLVYSVHQLILLNKGPPSTRSTSCHGSCSNLAVVNG-SSTQW

Query:  LADSGCNTHVTSDLGQLTGLSEYRGDDQITVGNGHSLPITHSGCGSISTSYSNLRLQNLLHVPNISTNLLSVHRLCLDNNCLVVFDANQFFIQDKSSGQI
        L DSG   H+TSD   L+    Y G D + V +G ++PI+H+G  S+ST    L L N+L+VPNI  NL+SV+RLC  N   V F    F ++D ++G  
Subjt:  LADSGCNTHVTSDLGQLTGLSEYRGDDQITVGNGHSLPITHSGCGSISTSYSNLRLQNLLHVPNISTNLLSVHRLCLDNNCLVVFDANQFFIQDKSSGQI

Query:  LFHGPSINGFYPIPSTVFPPSTVHTSPLPSLVAHVGAQSSSSVWHDRLGHPGDNVLRTVLRSLNLSMCSSLNHV--CQHCLHGKMHKLSF---PVSSTLP
        L  G + +  Y  P     P ++  SP         ++++ S WH RLGHP  ++L +V+ + +LS+ +  +    C  CL  K +K+ F    ++ST P
Subjt:  LFHGPSINGFYPIPSTVFPPSTVHTSPLPSLVAHVGAQSSSSVWHDRLGHPGDNVLRTVLRSLNLSMCSSLNHV--CQHCLHGKMHKLSF---PVSSTLP

Query:  I
        +
Subjt:  I

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 6.6e-27 | 28.32
Query:  EDLVIYAFNGLPSDFNTFRTSMRTRPQTVSFEELHVLLVAEETALDKQLKKDELFAQPTALLANQLHNQNLNPNTFRGKGYASGKGKFPASGNGNAGNRG
        ++ V      LP D+      +  +    S  E+H  L+  E+   K L  +   A+   + AN + ++N N N  R +        +  + N N  N  
Subjt:  EDLVIYAFNGLPSDFNTFRTSMRTRPQTVSFEELHVLLVAEETALDKQLKKDELFAQPTALLANQLHNQNLNPNTFRGKGYASGKGKFPASGNGNAGNRG

Query:  RSLSSGN-SGNR--LAYLRLCPLVY----------SVHQLILLNKGPPSTRSTSCHGSCSNLAVVNG-SSTQWLADSGCNTHVTSDLGQLTGLSEYRGDD
        +  SSG+ S NR    YL  C +             +HQ         ST   +     +NLAV +  ++  WL DSG   H+TSD   L+    Y G D
Subjt:  RSLSSGN-SGNR--LAYLRLCPLVY----------SVHQLILLNKGPPSTRSTSCHGSCSNLAVVNG-SSTQWLADSGCNTHVTSDLGQLTGLSEYRGDD

Query:  QITVGNGHSLPITHSGCGSISTSYSNLRLQNLLHVPNISTNLLSVHRLCLDNNCLVVFDANQFFIQDKSSGQILFHGPSINGFYPIPSTVFPPSTVHTSP
         + + +G ++PITH+G  S+ TS  +L L  +L+VPNI  NL+SV+RLC  N   V F    F ++D ++G  L  G + +  Y  P        + +S 
Subjt:  QITVGNGHSLPITHSGCGSISTSYSNLRLQNLLHVPNISTNLLSVHRLCLDNNCLVVFDANQFFIQDKSSGQILFHGPSINGFYPIPSTVFPPSTVHTSP

Query:  LPSLVAHVGAQSSSSVWHDRLGHPGDNVLRTVLRSLNLSMCSSLNHV--CQHCLHGKMHKLSFPVSSTLPINSPMVTTCVNDTPTDILDLDN
          S+ A   ++++ S WH RLGHP   +L +V+ + +L + +  + +  C  C   K HK+ F  +ST+  + P+     +   + IL +DN
Subjt:  LPSLVAHVGAQSSSSVWHDRLGHPGDNVLRTVLRSLNLSMCSSLNHV--CQHCLHGKMHKLSFPVSSTLPINSPMVTTCVNDTPTDILDLDN

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGGAAGATCTCGTCATATATGCTTTCAATGGTCTTCCTTCTGATTTCAACACGTTTCGCACTTCAATGCGCACTCGCCCTCAGACAGTTTCTTTCGAAGAGCTTCATGT
ATTGCTTGTTGCTGAGGAAACCGCTCTTGATAAACAACTCAAGAAGGATGAATTATTTGCTCAACCTACGGCTTTACTTGCCAATCAGTTACACAATCAGAATCTGAATC
CAAATACCTTTCGAGGGAAAGGCTATGCTAGTGGTAAAGGGAAATTTCCGGCCTCTGGAAATGGTAACGCTGGAAATCGAGGAAGATCTCTTTCCTCTGGTAACTCTGGA
AACCGACTCGCTTACCTGCGGCTTTGCCCTTTGGTGTACAGCGTTCACCAACTAATTCTTCTCAACAAAGGGCCGCCATCCACCCGCTCAACTAGCTGCCATGGTAGCTG
CTCAAATCTTGCAGTGGTAAATGGTTCTTCAACTCAGTGGCTTGCGGATTCTGGCTGTAACACACATGTTACATCTGATTTGGGTCAACTTACTGGACTTTCTGAGTATC
GTGGAGATGATCAGATCACTGTAGGGAATGGACATTCACTTCCCATTACTCATTCAGGTTGTGGTTCTATCTCTACTTCTTATTCTAATCTAAGACTTCAAAATCTGTTA
CATGTTCCAAATATTTCTACCAACCTTTTATCAGTTCATCGTCTTTGCTTAGATAACAATTGTCTTGTTGTCTTTGATGCTAATCAATTCTTTATTCAGGACAAATCTTC
GGGCCAGATACTTTTTCACGGTCCTAGTATAAATGGCTTTTATCCCATTCCTTCTACTGTCTTCCCTCCATCCACTGTCCATACGTCTCCCTTGCCATCGCTCGTTGCTC
ATGTTGGTGCACAATCATCTTCTTCAGTCTGGCATGATAGACTAGGACACCCAGGTGACAATGTTCTTCGCACTGTTCTTCGTTCTCTGAATTTGTCAATGTGTAGTTCT
CTAAATCATGTTTGCCAACATTGTCTTCACGGCAAGATGCACAAATTATCTTTTCCTGTTTCTAGTACTTTACCAATCAACTCTCCAATGGTTACTACTTGTGTAAATGA
TACTCCTACTGATATTTTGGATTTGGATAACTCCACTGCTGTTCATAACATTCCAGAAGCCATTCCACATTCTGTCCAGAATATTCATCCTATGTGCACCCGTAGCAAGT
CTGTTATTGAGTGA
mRNA sequence
ATGGAAGATCTCGTCATATATGCTTTCAATGGTCTTCCTTCTGATTTCAACACGTTTCGCACTTCAATGCGCACTCGCCCTCAGACAGTTTCTTTCGAAGAGCTTCATGT
ATTGCTTGTTGCTGAGGAAACCGCTCTTGATAAACAACTCAAGAAGGATGAATTATTTGCTCAACCTACGGCTTTACTTGCCAATCAGTTACACAATCAGAATCTGAATC
CAAATACCTTTCGAGGGAAAGGCTATGCTAGTGGTAAAGGGAAATTTCCGGCCTCTGGAAATGGTAACGCTGGAAATCGAGGAAGATCTCTTTCCTCTGGTAACTCTGGA
AACCGACTCGCTTACCTGCGGCTTTGCCCTTTGGTGTACAGCGTTCACCAACTAATTCTTCTCAACAAAGGGCCGCCATCCACCCGCTCAACTAGCTGCCATGGTAGCTG
CTCAAATCTTGCAGTGGTAAATGGTTCTTCAACTCAGTGGCTTGCGGATTCTGGCTGTAACACACATGTTACATCTGATTTGGGTCAACTTACTGGACTTTCTGAGTATC
GTGGAGATGATCAGATCACTGTAGGGAATGGACATTCACTTCCCATTACTCATTCAGGTTGTGGTTCTATCTCTACTTCTTATTCTAATCTAAGACTTCAAAATCTGTTA
CATGTTCCAAATATTTCTACCAACCTTTTATCAGTTCATCGTCTTTGCTTAGATAACAATTGTCTTGTTGTCTTTGATGCTAATCAATTCTTTATTCAGGACAAATCTTC
GGGCCAGATACTTTTTCACGGTCCTAGTATAAATGGCTTTTATCCCATTCCTTCTACTGTCTTCCCTCCATCCACTGTCCATACGTCTCCCTTGCCATCGCTCGTTGCTC
ATGTTGGTGCACAATCATCTTCTTCAGTCTGGCATGATAGACTAGGACACCCAGGTGACAATGTTCTTCGCACTGTTCTTCGTTCTCTGAATTTGTCAATGTGTAGTTCT
CTAAATCATGTTTGCCAACATTGTCTTCACGGCAAGATGCACAAATTATCTTTTCCTGTTTCTAGTACTTTACCAATCAACTCTCCAATGGTTACTACTTGTGTAAATGA
TACTCCTACTGATATTTTGGATTTGGATAACTCCACTGCTGTTCATAACATTCCAGAAGCCATTCCACATTCTGTCCAGAATATTCATCCTATGTGCACCCGTAGCAAGT
CTGTTATTGAGTGA
Protein sequence
MEDLVIYAFNGLPSDFNTFRTSMRTRPQTVSFEELHVLLVAEETALDKQLKKDELFAQPTALLANQLHNQNLNPNTFRGKGYASGKGKFPASGNGNAGNRGRSLSSGNSG
NRLAYLRLCPLVYSVHQLILLNKGPPSTRSTSCHGSCSNLAVVNGSSTQWLADSGCNTHVTSDLGQLTGLSEYRGDDQITVGNGHSLPITHSGCGSISTSYSNLRLQNLL
HVPNISTNLLSVHRLCLDNNCLVVFDANQFFIQDKSSGQILFHGPSINGFYPIPSTVFPPSTVHTSPLPSLVAHVGAQSSSSVWHDRLGHPGDNVLRTVLRSLNLSMCSS
LNHVCQHCLHGKMHKLSFPVSSTLPINSPMVTTCVNDTPTDILDLDNSTAVHNIPEAIPHSVQNIHPMCTRSKSVIE
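
The CDS and protein sequences above can be cross-checked by translating one into the other. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the helper name `translate` is hypothetical, and only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 11 codons and last 4 codons of the CDS shown above.
print(translate("ATGGAAGATCTCGTCATATATGCTTTCAATGGT"))  # MEDLVIYAFNG
print(translate("GTTATTGAGTGA"))  # VIE
```

Both fragments agree with the start (`MEDLVIYAFNG`) and end (`VIE`) of the listed protein sequence; passing the full CDS to `translate` would allow the same comparison over the whole record.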