; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g01180 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g01180
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr2:900295..907172
RNA-Seq ExpressionMoc02g01180
SyntenyMoc02g01180
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK05754.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.0e-8349.74Show/hide
Query:  VATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLHVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISW
        +A Q+MG+ NA DLW A Q+LFGVQS+AEED+LRQ+FQ TRK      D+L +MK+++D LGQAGSPVP R+ ISQ LLGLDE YNPV+A IQGK  ISW
Subjt:  VATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLHVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISW

Query:  PEIQAELLVFKKRLELQNSHKNTVSFNNSVSVNMA---NSSRSVSGGNQRQNQNSRPPFNNNRG--GGRNRGRGRWNNNNSRQICQVCGKPGHSALTYYH
         ++Q+ELL F+KRLE Q++ KNT +   +V VN+A   NSS      N + + N+R   NN++G  GG N GRGR     ++  CQVC K GHSAL  Y+
Subjt:  PEIQAELLVFKKRLELQNSHKNTVSFNNSVSVNMA---NSSRSVSGGNQRQNQNSRPPFNNNRG--GGRNRGRGRWNNNNSRQICQVCGKPGHSALTYYH

Query:  RFDKEYRN----NTQSHGKNFNGDSNQGVNNNSGQGTSYAFTATQNNNPFLANPETVIDPNWYVDSGASNHVTADYNSMVQPTEYGGMERVTVGNGDKLK
        RF+KE+ +    +  +   NF+  SN  V               Q+ N F A  +TVI+ NWY+DSGA+NH+T +Y+++  P+EY G+E++ VGNGD L 
Subjt:  RFDKEYRN----NTQSHGKNFNGDSNQGVNNNSGQGTSYAFTATQNNNPFLANPETVIDPNWYVDSGASNHVTADYNSMVQPTEYGGMERVTVGNGDKLK

Query:  ISHVGKSCLVSDGGLVMLENVLCVSNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDIRLGKVVLKGALKDGLYRLNTV
        IS++G + L      + L+NVLCV +I KNLVSVSKLA+DNNVY+EFH   C +KD   G+ +L   +KDGLY L+T+
Subjt:  ISHVGKSCLVSDGGLVMLENVLCVSNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDIRLGKVVLKGALKDGLYRLNTV

XP_016902197.1 PREDICTED: uncharacterized protein LOC107991581 isoform X1 [Cucumis melo]8.9e-7945.92Show/hide
Query:  SSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLHVMKSHADNLGQAG
        +SS  T   VNPL+E WVTTD LLLGWLYNSMTP+VA Q+MG+ N  DLW A Q+ FGVQS+AEED+LRQ+ Q TRK                       
Subjt:  SSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLHVMKSHADNLGQAG

Query:  SPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEIQAELLVFKKRLELQNSH-KNTVSFNNSVSVNMANSSRSVSGGNQRQNQNSRPPFNNNRGGGR
                      GLDE YN V+  IQGK  ISW ++Q++LL+F+KRL+ QN+  KNT +   S ++NMA         N ++NQ+++  +  N    R
Subjt:  SPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEIQAELLVFKKRLELQNSH-KNTVSFNNSVSVNMANSSRSVSGGNQRQNQNSRPPFNNNRGGGR

Query:  NRGRGRWNNNNSRQICQVCGKPGHSALTYYHRFDKEYRN----NTQSHGKNFNGDSNQGVNNNSGQGTSYAFTATQNNNPFLANPETVIDPNWYVDSGAS
            G+  N N+   CQ+CGK GHSAL  Y+RF+KE+ +    N   H  N +   N  V           F +TQN  PF A P+TV+DPNWY+DSGA+
Subjt:  NRGRGRWNNNNSRQICQVCGKPGHSALTYYHRFDKEYRN----NTQSHGKNFNGDSNQGVNNNSGQGTSYAFTATQNNNPFLANPETVIDPNWYVDSGAS

Query:  NHVTADYNSMVQPTEYGGMERVTVGNGDKLKISHVGKSCLVSDGGLVMLENVLCVSNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDIRLGK
        NHVT + ++M  PTEY G+E+VTVGNG++L IS+VG +CL      ++L+N+LCV +IAKNL+SVSKLA+DN++Y+EFH   C +KD   GK
Subjt:  NHVTADYNSMVQPTEYGGMERVTVGNGDKLKISHVGKSCLVSDGGLVMLENVLCVSNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDIRLGK

XP_022148963.1 uncharacterized protein LOC111017501 [Momordica charantia]1.2e-9183.71Show/hide
Query:  MFVQQSIGNMETSQTNISAPSSSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGS
        MFVQQSIGNMETSQTNISAPSSSSIATEAA+NPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGS
Subjt:  MFVQQSIGNMETSQTNISAPSSSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGS

Query:  LKMTDFLHVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEIQAELLVFKKRLELQNSHKNTVSFNNSVSVNMANSSRSVSGG
        LKMTDFL VMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPE+QAE                                RSVSGG
Subjt:  LKMTDFLHVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEIQAELLVFKKRLELQNSHKNTVSFNNSVSVNMANSSRSVSGG

Query:  NQRQNQNSRPPFNNNRGGGRN
        NQRQNQNS+PPFNNNRGGGRN
Subjt:  NQRQNQNSRPPFNNNRGGGRN

XP_038905161.1 uncharacterized protein LOC120091275 isoform X1 [Benincasa hispida]1.5e-7848.21Show/hide
Query:  MFVQQSIGN-----------METSQTNISAPSSSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYL
        MF+Q +IG               S    S   +SS  T   VNP YESW+  DQLLLGWLYNSMTPEVA QVMG E A DLW +I +LFGVQS+ EEDYL
Subjt:  MFVQQSIGN-----------METSQTNISAPSSSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYL

Query:  RQVFQQTRKGSLKMTDFLHVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEIQAELLVFKKRLELQNSHKNTVSFN--NSVS
        R VFQ TRKG+LKM ++L  MK + DNL QAGSP+P R+L+SQVLLGLDEEYN +VA IQG+  +SW ++Q+ELL++++RLE Q++ K TV FN  ++ S
Subjt:  RQVFQQTRKGSLKMTDFLHVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEIQAELLVFKKRLELQNSHKNTVSFN--NSVS

Query:  VNMANSSRSVSGGNQRQNQNSRPPFNNNRGGGRNRGRGRWNNNNSRQICQVCGKPGHSALTYYHRFDKEY-RNNTQSHGKNFNGDSNQGVNNNSGQGTSY
        VNM N +R V+  N+  + N         GGG  RGRGR   NN + +CQVCGK GH A   ++R+ +++  N+ Q+  + F   +NQ  N    Q    
Subjt:  VNMANSSRSVSGGNQRQNQNSRPPFNNNRGGGRNRGRGRWNNNNSRQICQVCGKPGHSALTYYHRFDKEY-RNNTQSHGKNFNGDSNQGVNNNSGQGTSY

Query:  AFTATQNNNPFLANPETVIDPNWYVDSGASNHVTADYNSMVQPTEYGGMERVTVGNGDKLKISHVGKSCLVSDGGLVMLENVLCVSNIAK
        A      +NPFL   E + D NWY DSGASNHVT+D+N++  P EY        G G+ L ISHVG  CL SD   + L ++LC  +  K
Subjt:  AFTATQNNNPFLANPETVIDPNWYVDSGASNHVTADYNSMVQPTEYGGMERVTVGNGDKLKISHVGKSCLVSDGGLVMLENVLCVSNIAK

XP_038905164.1 uncharacterized protein LOC120091275 isoform X4 [Benincasa hispida]3.9e-7449.44Show/hide
Query:  MFVQQSIGN-----------METSQTNISAPSSSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYL
        MF+Q +IG               S    S   +SS  T   VNP YESW+  DQLLLGWLYNSMTPEVA QVMG E A DLW +I +LFGVQS+ EEDYL
Subjt:  MFVQQSIGN-----------METSQTNISAPSSSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYL

Query:  RQVFQQTRKGSLKMTDFLHVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEIQAELLVFKKRLELQNSHKNTVSFN--NSVS
        R VFQ TRKG+LKM ++L  MK + DNL QAGSP+P R+L+SQVLLGLDEEYN +VA IQG+  +SW ++Q+ELL++++RLE Q++ K TV FN  ++ S
Subjt:  RQVFQQTRKGSLKMTDFLHVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEIQAELLVFKKRLELQNSHKNTVSFN--NSVS

Query:  VNMANSSRSVSGGNQRQNQNSRPPFNNNRGGGRNRGRGRWNNNNSRQICQVCGKPGHSALTYYHRFDKEY-RNNTQSHGKNFNGDSNQGVNNNSGQGTSY
        VNM N +R V+  N+  + N         GGG  RGRGR   NN + +CQVCGK GH A   ++R+ +++  N+ Q+  + F   +NQ  N    Q    
Subjt:  VNMANSSRSVSGGNQRQNQNSRPPFNNNRGGGRNRGRGRWNNNNSRQICQVCGKPGHSALTYYHRFDKEY-RNNTQSHGKNFNGDSNQGVNNNSGQGTSY

Query:  AFTATQNNNPFLANPETVIDPNWYVDSGASNHVTADYNSMVQPTEYGGMERVTVGN
        A      +NPFL   E + D NWY DSGASNHVT+D+N++  P EY G    T GN
Subjt:  AFTATQNNNPFLANPETVIDPNWYVDSGASNHVTADYNSMVQPTEYGGMERVTVGN

TrEMBL top hitse value%identityAlignment
A0A1S4E1U6 uncharacterized protein LOC107991581 isoform X14.3e-7945.92Show/hide
Query:  SSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLHVMKSHADNLGQAG
        +SS  T   VNPL+E WVTTD LLLGWLYNSMTP+VA Q+MG+ N  DLW A Q+ FGVQS+AEED+LRQ+ Q TRK                       
Subjt:  SSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLHVMKSHADNLGQAG

Query:  SPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEIQAELLVFKKRLELQNSH-KNTVSFNNSVSVNMANSSRSVSGGNQRQNQNSRPPFNNNRGGGR
                      GLDE YN V+  IQGK  ISW ++Q++LL+F+KRL+ QN+  KNT +   S ++NMA         N ++NQ+++  +  N    R
Subjt:  SPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEIQAELLVFKKRLELQNSH-KNTVSFNNSVSVNMANSSRSVSGGNQRQNQNSRPPFNNNRGGGR

Query:  NRGRGRWNNNNSRQICQVCGKPGHSALTYYHRFDKEYRN----NTQSHGKNFNGDSNQGVNNNSGQGTSYAFTATQNNNPFLANPETVIDPNWYVDSGAS
            G+  N N+   CQ+CGK GHSAL  Y+RF+KE+ +    N   H  N +   N  V           F +TQN  PF A P+TV+DPNWY+DSGA+
Subjt:  NRGRGRWNNNNSRQICQVCGKPGHSALTYYHRFDKEYRN----NTQSHGKNFNGDSNQGVNNNSGQGTSYAFTATQNNNPFLANPETVIDPNWYVDSGAS

Query:  NHVTADYNSMVQPTEYGGMERVTVGNGDKLKISHVGKSCLVSDGGLVMLENVLCVSNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDIRLGK
        NHVT + ++M  PTEY G+E+VTVGNG++L IS+VG +CL      ++L+N+LCV +IAKNL+SVSKLA+DN++Y+EFH   C +KD   GK
Subjt:  NHVTADYNSMVQPTEYGGMERVTVGNGDKLKISHVGKSCLVSDGGLVMLENVLCVSNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDIRLGK

A0A5A7SIT7 Uncharacterized protein2.8e-7050.16Show/hide
Query:  SSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLHVMKSHADNLGQAG
        +SS  T   VN L+E WVTTD LLLGWLYNSMTP+VA Q+MG+ N  DLW A Q+ FGVQS+AEED+LRQ+ Q TRKG+ KM ++L VMK++ DNLGQ G
Subjt:  SSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLHVMKSHADNLGQAG

Query:  SPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEIQAELLVFKKRLELQNSHKNTVSFNNSVSVNMANSSRSVSGGNQRQNQNSRPPFNNNRGGGRN
        SPVP R+LISQVLLGLDE YN V+  IQGK  ISW ++Q++LL+F+K L+ QN+ K      N       N ++  +   QR + N +       G  R 
Subjt:  SPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEIQAELLVFKKRLELQNSHKNTVSFNNSVSVNMANSSRSVSGGNQRQNQNSRPPFNNNRGGGRN

Query:  RGRGRWNNNNSRQICQVCGKPGHSALTYYHRFDKEYRNNTQSHGKNFNGDSNQGVNNNSGQGTSYAFTATQNNNPFLANPETVIDPNWYVDSGASNHVTA
           G+  N N+   CQ+CGK GHSAL  Y+RF+KE+ +           D N+  +N S       F +TQN  PF A P+TV+DPNWY+DSGA+NHVT 
Subjt:  RGRGRWNNNNSRQICQVCGKPGHSALTYYHRFDKEYRNNTQSHGKNFNGDSNQGVNNNSGQGTSYAFTATQNNNPFLANPETVIDPNWYVDSGASNHVTA

Query:  DYNSMVQPTEYGG
        + ++M  PTEY G
Subjt:  DYNSMVQPTEYGG

A0A5D3C373 Retrovirus-related Pol polyprotein from transposon TNT 1-949.9e-8449.74Show/hide
Query:  VATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLHVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISW
        +A Q+MG+ NA DLW A Q+LFGVQS+AEED+LRQ+FQ TRK      D+L +MK+++D LGQAGSPVP R+ ISQ LLGLDE YNPV+A IQGK  ISW
Subjt:  VATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLHVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISW

Query:  PEIQAELLVFKKRLELQNSHKNTVSFNNSVSVNMA---NSSRSVSGGNQRQNQNSRPPFNNNRG--GGRNRGRGRWNNNNSRQICQVCGKPGHSALTYYH
         ++Q+ELL F+KRLE Q++ KNT +   +V VN+A   NSS      N + + N+R   NN++G  GG N GRGR     ++  CQVC K GHSAL  Y+
Subjt:  PEIQAELLVFKKRLELQNSHKNTVSFNNSVSVNMA---NSSRSVSGGNQRQNQNSRPPFNNNRG--GGRNRGRGRWNNNNSRQICQVCGKPGHSALTYYH

Query:  RFDKEYRN----NTQSHGKNFNGDSNQGVNNNSGQGTSYAFTATQNNNPFLANPETVIDPNWYVDSGASNHVTADYNSMVQPTEYGGMERVTVGNGDKLK
        RF+KE+ +    +  +   NF+  SN  V               Q+ N F A  +TVI+ NWY+DSGA+NH+T +Y+++  P+EY G+E++ VGNGD L 
Subjt:  RFDKEYRN----NTQSHGKNFNGDSNQGVNNNSGQGTSYAFTATQNNNPFLANPETVIDPNWYVDSGASNHVTADYNSMVQPTEYGGMERVTVGNGDKLK

Query:  ISHVGKSCLVSDGGLVMLENVLCVSNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDIRLGKVVLKGALKDGLYRLNTV
        IS++G + L      + L+NVLCV +I KNLVSVSKLA+DNNVY+EFH   C +KD   G+ +L   +KDGLY L+T+
Subjt:  ISHVGKSCLVSDGGLVMLENVLCVSNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDIRLGKVVLKGALKDGLYRLNTV

A0A6J1D5J0 uncharacterized protein LOC1110175015.8e-9283.71Show/hide
Query:  MFVQQSIGNMETSQTNISAPSSSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGS
        MFVQQSIGNMETSQTNISAPSSSSIATEAA+NPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGS
Subjt:  MFVQQSIGNMETSQTNISAPSSSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGS

Query:  LKMTDFLHVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEIQAELLVFKKRLELQNSHKNTVSFNNSVSVNMANSSRSVSGG
        LKMTDFL VMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPE+QAE                                RSVSGG
Subjt:  LKMTDFLHVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEIQAELLVFKKRLELQNSHKNTVSFNNSVSVNMANSSRSVSGG

Query:  NQRQNQNSRPPFNNNRGGGRN
        NQRQNQNS+PPFNNNRGGGRN
Subjt:  NQRQNQNSRPPFNNNRGGGRN

A0A6J1DCW4 uncharacterized protein LOC1110195982.5e-7142.08Show/hide
Query:  TNISAPSSSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLHVMKSHA
        TNI   +SS   +   +NP YE+W+  D+LLLGWLYNSM  +VA QVMG+  + +LW A+QELFGVQS+AE DYL+QVFQQT KGSL+M ++L +MKSHA
Subjt:  TNISAPSSSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLHVMKSHA

Query:  DNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEIQAELLVFKKRLELQNSHKNTVSFN--NSVSVNMANSSRSVSGGNQRQNQNSRPPF
        DNL  AGS V  R L+SQVL GLDEEYNP+V  +QGK  +SW E+ AELL ++KRLE QNS K+ +  N   + SVN  +  RS     +  N N+    
Subjt:  DNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEIQAELLVFKKRLELQNSHKNTVSFN--NSVSVNMANSSRSVSGGNQRQNQNSRPPF

Query:  NNNRGGGRNRGRGRWNNNNSRQICQVCGKPGHSALTYYHRFDKEYRNNTQSHGKNFNGDSNQGVNNNSGQGTSYAFTATQNNNPFLANPETVIDPNWYVD
        N +RGGG  RG     N                            R    +  KNF   SN G N  +   TS   T           PETVIDP+WY D
Subjt:  NNNRGGGRNRGRGRWNNNNSRQICQVCGKPGHSALTYYHRFDKEYRNNTQSHGKNFNGDSNQGVNNNSGQGTSYAFTATQNNNPFLANPETVIDPNWYVD

Query:  SGASNHVTADYNSMVQPTEYGGMERVTVGNGDKLKISHVGKSCLVSDGGLVMLENVLCVSNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDIRLGKVVLK
        SGA++HVTA+ N++ Q  +Y G E V V NG+KL ISH+G + + + GG + L++VL V +IAKNL   S                        G+ +LK
Subjt:  SGASNHVTADYNSMVQPTEYGGMERVTVGNGDKLKISHVGKSCLVSDGGLVMLENVLCVSNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDIRLGKVVLK

Query:  GALKDGLYRLNTVGVVIGSTST---PVDCGLELAANKTICSVSLPKSS----SSINVVVET
        G LKD LYRL+       +T T   P+     ++ +    S   P  S      INVVV T
Subjt:  GALKDGLYRLNTVGVVIGSTST---PVDCGLELAANKTICSVSLPKSS----SSINVVVET

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.7e-3028.89Show/hide
Query:  SSIATEAA--VNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLHVMKSHADNLGQA
        ++I T+AA  VNP Y  W   D+L+   +  +++  V   V     A  +W  +++++   S      LR   +Q  KG+  + D++  + +  D L   
Subjt:  SSIATEAA--VNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLHVMKSHADNLGQA

Query:  GSPVPTRSLISQVLLGLDEEYNPVVATIQGK-RGISWPEIQAELLVFKKRLELQNSHKNTVSFNNSVSVNMANSSRSVSGGNQRQNQNSRPPFNNNRGGG
        G P+     + +VL  L EEY PV+  I  K    +  EI   LL  + ++   +S        N+VS     ++ + + GN+    ++R   NN++   
Subjt:  GSPVPTRSLISQVLLGLDEEYNPVVATIQGK-RGISWPEIQAELLVFKKRLELQNSHKNTVSFNNSVSVNMANSSRSVSGGNQRQNQNSRPPFNNNRGGG

Query:  RNRGRGRWNNNNSRQI---CQVCGKPGHSALTYYHRFDKEYRNNTQSHGKNFNGDSNQGVNNNSGQGTSYAFTATQ-NNNPFLANPETVIDPNWYVDSGA
        ++      NNN S+     CQ+CG  GHSA                S  ++F       +++ + Q     FT  Q   N  L +P +    NW +DSGA
Subjt:  RNRGRGRWNNNNSRQI---CQVCGKPGHSALTYYHRFDKEYRNNTQSHGKNFNGDSNQGVNNNSGQGTSYAFTATQ-NNNPFLANPETVIDPNWYVDSGA

Query:  SNHVTADYNSMVQPTEYGGMERVTVGNGDKLKISHVGKSCLVSDGGLVMLENVLCVSNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDIRLGKVVLKGAL
        ++H+T+D+N++     Y G + V V +G  + ISH G + L +    + L N+L V NI KNL+SV +L   N V +EF   S  VKD+  G  +L+G  
Subjt:  SNHVTADYNSMVQPTEYGGMERVTVGNGDKLKISHVGKSCLVSDGGLVMLENVLCVSNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDIRLGKVVLKGAL

Query:  KDGLY
        KD LY
Subjt:  KDGLY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.2e-2327.18Show/hide
Query:  SSIATEAA--VNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLHVMKSHADNLGQA
        ++I T+A   VNP Y  W   D+L+   +  +++  V   V     A  +W  +++++   S      LR +                   +  D L   
Subjt:  SSIATEAA--VNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTDFLHVMKSHADNLGQA

Query:  GSPVPTRSLISQVLLGLDEEYNPVVATIQGK-RGISWPEIQAELLVFKKRLELQNSHKNTVSFNNSVSVNMANSSRSVSGGNQRQNQNSRPPFNNNRGGG
        G P+     + +VL  L ++Y PV+  I  K    S  EI   L+  + +L   NS +      N V+    N++R+ +     +N N+    NNNR   
Subjt:  GSPVPTRSLISQVLLGLDEEYNPVVATIQGK-RGISWPEIQAELLVFKKRLELQNSHKNTVSFNNSVSVNMANSSRSVSGGNQRQNQNSRPPFNNNRGGG

Query:  RNRGRGRWNNNNSRQ-----ICQVCGKPGHSALTYYHRFDKEYRNNTQSHGKNFNGDSNQGVNNNSGQGTSYAFTATQN------NNPFLANPETVIDPN
                 ++N +       CQ+C   GHSA           +   Q H   F   +NQ       Q ++  FT  Q       N+P+ AN       N
Subjt:  RNRGRGRWNNNNSRQ-----ICQVCGKPGHSALTYYHRFDKEYRNNTQSHGKNFNGDSNQGVNNNSGQGTSYAFTATQN------NNPFLANPETVIDPN

Query:  WYVDSGASNHVTADYNSMVQPTEYGGMERVTVGNGDKLKISHVGKSCLVSDGGLVMLENVLCVSNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDIRLGK
        W +DSGA++H+T+D+N++     Y G + V + +G  + I+H G + L +    + L  VL V NI KNL+SV +L   N V +EF   S  VKD+  G 
Subjt:  WYVDSGASNHVTADYNSMVQPTEYGGMERVTVGNGDKLKISHVGKSCLVSDGGLVMLENVLCVSNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDIRLGK

Query:  VVLKGALKDGLY
         +L+G  KD LY
Subjt:  VVLKGALKDGLY

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTGTGCAACAGTCGATTGGTAATATGGAAACAAGCCAAACGAACATATCTGCACCATCGAGTTCCTCTATAGCAACAGAAGCAGCCGTCAATCCACTATAT
GAGTCATGGGTAACTACCGACCAGCTACTTCTTGGTTGGTTGTACAACTCTATGACTCCAGAAGTTGCAACACAGGTGATGGGGTACGAAAATGCTTGTGATTTA
TGGGCTGCCATACAAGAACTCTTTGGAGTACAGTCTCAGGCGGAAGAAGATTATCTCCGTCAGGTATTTCAACAAACTCGAAAAGGTTCTCTTAAAATGACTGAT
TTTTTGCATGTTATGAAGTCTCATGCAGACAATTTGGGTCAAGCTGGAAGCCCCGTACCCACTCGATCTTTGATTTCTCAAGTTTTGCTGGGATTAGATGAAGAG
TATAATCCTGTGGTAGCAACGATCCAAGGAAAACGAGGCATTTCGTGGCCTGAAATACAAGCCGAATTGTTGGTATTTAAGAAGAGGTTAGAACTTCAGAATTCT
CATAAAAATACAGTATCTTTTAACAACTCTGTTTCTGTGAATATGGCTAATAGTAGCAGAAGTGTAAGTGGTGGAAACCAACGTCAAAATCAAAACTCTCGGCCA
CCATTCAACAACAATCGGGGGGGTGGTCGAAATCGAGGTAGAGGACGGTGGAACAACAACAATAGTCGGCAAATTTGTCAGGTGTGTGGTAAACCTGGACATTCA
GCACTAACGTACTACCATCGATTTGATAAGGAGTACAGGAACAATACACAAAGCCATGGTAAAAACTTCAATGGCGACTCTAACCAGGGGGTTAACAACAACTCT
GGACAAGGTACATCTTATGCCTTCACAGCAACCCAAAATAACAATCCTTTTTTGGCCAATCCAGAAACAGTGATAGACCCGAATTGGTATGTGGATAGTGGTGCT
TCAAATCATGTCACCGCCGACTACAATAGTATGGTTCAACCTACTGAATATGGAGGTATGGAAAGAGTTACAGTAGGTAATGGCGATAAATTAAAAATATCTCAT
GTTGGCAAATCCTGTTTAGTTTCTGACGGTGGGTTGGTCATGCTTGAAAATGTGTTGTGCGTATCTAACATAGCTAAAAATCTAGTTAGCGTGTCTAAACTCGCT
AAAGACAATAACGTATACCTTGAATTTCATGCTGATTCTTGTCTTGTAAAGGATATACGTTTGGGCAAGGTGGTGCTGAAAGGGGCTCTTAAGGATGGACTTTAC
CGCCTCAATACTGTTGGAGTAGTCATTGGGAGTACTTCGACTCCAGTTGACTGTGGCTTGGAGTTGGCTGCTAATAAAACTATTTGTTCTGTGTCTCTTCCCAAA
TCATCCAGTAGTATAAATGTTGTGGTTGAGACGGCCCAGCCATATCACAATTCATCCAACCAACGGAGAAGAGATGAGGAAAAGGGTTTAGCAGTTTGTTTTGGT
ACAGATCCAGAAAGAGACATGGGAAGCTTCTGGTTTCATATTCACCCAGAACCAAAATCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTTGTGCAACAGTCGATTGGTAATATGGAAACAAGCCAAACGAACATATCTGCACCATCGAGTTCCTCTATAGCAACAGAAGCAGCCGTCAATCCACTATAT
GAGTCATGGGTAACTACCGACCAGCTACTTCTTGGTTGGTTGTACAACTCTATGACTCCAGAAGTTGCAACACAGGTGATGGGGTACGAAAATGCTTGTGATTTA
TGGGCTGCCATACAAGAACTCTTTGGAGTACAGTCTCAGGCGGAAGAAGATTATCTCCGTCAGGTATTTCAACAAACTCGAAAAGGTTCTCTTAAAATGACTGAT
TTTTTGCATGTTATGAAGTCTCATGCAGACAATTTGGGTCAAGCTGGAAGCCCCGTACCCACTCGATCTTTGATTTCTCAAGTTTTGCTGGGATTAGATGAAGAG
TATAATCCTGTGGTAGCAACGATCCAAGGAAAACGAGGCATTTCGTGGCCTGAAATACAAGCCGAATTGTTGGTATTTAAGAAGAGGTTAGAACTTCAGAATTCT
CATAAAAATACAGTATCTTTTAACAACTCTGTTTCTGTGAATATGGCTAATAGTAGCAGAAGTGTAAGTGGTGGAAACCAACGTCAAAATCAAAACTCTCGGCCA
CCATTCAACAACAATCGGGGGGGTGGTCGAAATCGAGGTAGAGGACGGTGGAACAACAACAATAGTCGGCAAATTTGTCAGGTGTGTGGTAAACCTGGACATTCA
GCACTAACGTACTACCATCGATTTGATAAGGAGTACAGGAACAATACACAAAGCCATGGTAAAAACTTCAATGGCGACTCTAACCAGGGGGTTAACAACAACTCT
GGACAAGGTACATCTTATGCCTTCACAGCAACCCAAAATAACAATCCTTTTTTGGCCAATCCAGAAACAGTGATAGACCCGAATTGGTATGTGGATAGTGGTGCT
TCAAATCATGTCACCGCCGACTACAATAGTATGGTTCAACCTACTGAATATGGAGGTATGGAAAGAGTTACAGTAGGTAATGGCGATAAATTAAAAATATCTCAT
GTTGGCAAATCCTGTTTAGTTTCTGACGGTGGGTTGGTCATGCTTGAAAATGTGTTGTGCGTATCTAACATAGCTAAAAATCTAGTTAGCGTGTCTAAACTCGCT
AAAGACAATAACGTATACCTTGAATTTCATGCTGATTCTTGTCTTGTAAAGGATATACGTTTGGGCAAGGTGGTGCTGAAAGGGGCTCTTAAGGATGGACTTTAC
CGCCTCAATACTGTTGGAGTAGTCATTGGGAGTACTTCGACTCCAGTTGACTGTGGCTTGGAGTTGGCTGCTAATAAAACTATTTGTTCTGTGTCTCTTCCCAAA
TCATCCAGTAGTATAAATGTTGTGGTTGAGACGGCCCAGCCATATCACAATTCATCCAACCAACGGAGAAGAGATGAGGAAAAGGGTTTAGCAGTTTGTTTTGGT
ACAGATCCAGAAAGAGACATGGGAAGCTTCTGGTTTCATATTCACCCAGAACCAAAATCCTAG
Protein sequenceShow/hide protein sequence
MFVQQSIGNMETSQTNISAPSSSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKGSLKMTD
FLHVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNPVVATIQGKRGISWPEIQAELLVFKKRLELQNSHKNTVSFNNSVSVNMANSSRSVSGGNQRQNQNSRP
PFNNNRGGGRNRGRGRWNNNNSRQICQVCGKPGHSALTYYHRFDKEYRNNTQSHGKNFNGDSNQGVNNNSGQGTSYAFTATQNNNPFLANPETVIDPNWYVDSGA
SNHVTADYNSMVQPTEYGGMERVTVGNGDKLKISHVGKSCLVSDGGLVMLENVLCVSNIAKNLVSVSKLAKDNNVYLEFHADSCLVKDIRLGKVVLKGALKDGLY
RLNTVGVVIGSTSTPVDCGLELAANKTICSVSLPKSSSSINVVVETAQPYHNSSNQRRRDEEKGLAVCFGTDPERDMGSFWFHIHPEPKS