CuGenDBv2

Moc01g13070 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g13070
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrotran_gag_3 domain-containing protein
Genome location: chr1:7953834..7957618
RNA-Seq Expression: Moc01g13070
Synteny: Moc01g13070
Gene Ontology terms: GO:0097159 - organic cyclic compound binding (molecular function)
                     GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus] (e value: 1.4e-76; % identity: 54.8)
Query:  TLSPS----VCSSITSQEVWNVMEKHYSSSGRTNVVNLKTDLQSISKKSSESISDYIIRSKEIKDKLANVSITINEEDLLIYALNGLPIEYNTFRTSMRT
        TLSP     V  S +S++VW+V+ K YSS  R+NVVNLK+DLQ+I KK  ESI  YI R KEIKDKLANVS  INEEDLLIYALNGLP EYNTFRTSMRT
Subjt:  TLSPS----VCSSITSQEVWNVMEKHYSSSGRTNVVNLKTDLQSISKKSSESISDYIIRSKEIKDKLANVSITINEEDLLIYALNGLPIEYNTFRTSMRT

Query:  RSQPVTFEELYVLLISEESAIEKQGKRDESSPSPTAMIATSQAHHMQRNFSPQFSRGKFGGGTGRGRSNHDRGRVYSPTQSRGCTSSGLFSSNTP-SNGR
        RSQPVTFEEL+VLL +EESA+ KQ K D+S   PT ++++SQ+     + +P F+   F  G G G+ N+  GR     Q+RG    GL     P  +  
Subjt:  RSQPVTFEELYVLLISEESAIEKQGKRDESSPSPTAMIATSQAHHMQRNFSPQFSRGKFGGGTGRGRSNHDRGRVYSPTQSRGCTSSGLFSSNTP-SNGR

Query:  IHCQICNRPGHTVIDCFNRMNYNFQDRHLPMQLAVMVAVQNQQFQSNFSLPSAPAPWITDSGCNAHVTSNLSNLSIASEYNGDDQVSVGSGQSLPISHSG
          CQIC+R GHT +DCFNRMNYNFQ RH P QLA MVA QN  F S  +  S     +TDSGCN H+TS+++ +S+A EYNG++QV VG+GQ+ PISHSG
Subjt:  IHCQICNRPGHTVIDCFNRMNYNFQDRHLPMQLAVMVAVQNQQFQSNFSLPSAPAPWITDSGCNAHVTSNLSNLSIASEYNGDDQVSVGSGQSLPISHSG

Query:  YGVLKTSSTSLHLSNLFCVPHIA
        +  L+ +S S  ++  FC  + A
Subjt:  YGVLKTSSTSLHLSNLFCVPHIA

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo] (e value: 1.3e-74; % identity: 53.42)
Query:  TLSPS----VCSSITSQEVWNVMEKHYSSSGRTNVVNLKTDLQSISKKSSESISDYIIRSKEIKDKLANVSITINEEDLLIYALNGLPIEYNTFRTSMRT
        TLSP     V  S +S++VW+V+ K YSS  R+NVVNLK+DLQ+I KK  ESI  YI R KEIKDKLANVS  INEEDLLIYALNGLP EYNTFRTSMRT
Subjt:  TLSPS----VCSSITSQEVWNVMEKHYSSSGRTNVVNLKTDLQSISKKSSESISDYIIRSKEIKDKLANVSITINEEDLLIYALNGLPIEYNTFRTSMRT

Query:  RSQPVTFEELYVLLISEESAIEKQGKRDESSPSPTAMIATSQAHHMQRNFSPQFSRGKFGGGTGRGRSNHDRGRVYSPTQSRGCTSSGLFSSNTPSNGRI
        RSQPVTFEEL+VLL +EESA+ KQ K D+S   PT ++++SQ+     + +P F    F  G G G+ ++  GR     Q+RG  SS      +  +   
Subjt:  RSQPVTFEELYVLLISEESAIEKQGKRDESSPSPTAMIATSQAHHMQRNFSPQFSRGKFGGGTGRGRSNHDRGRVYSPTQSRGCTSSGLFSSNTPSNGRI

Query:  HCQICNRPGHTVIDCFNRMNYNFQDRHLPMQLAVMVAVQNQQFQSNFSLPSAPAPWITDSGCNAHVTSNLSNLSIASEYNGDDQVSVGSGQSLPISHSGY
         CQIC+R GHT +DCFNRMNYNFQ RH P QLA MVA QN  F S  +  S     +TDSGCN  +TS+++ +S+A EYNG++QV +G+GQ+ P+SHSG+
Subjt:  HCQICNRPGHTVIDCFNRMNYNFQDRHLPMQLAVMVAVQNQQFQSNFSLPSAPAPWITDSGCNAHVTSNLSNLSIASEYNGDDQVSVGSGQSLPISHSGY

Query:  GVLKTSSTSLHLSNLFCVPHIA
           + +S S  ++ LFC  + A
Subjt:  GVLKTSSTSLHLSNLFCVPHIA

XP_011658579.1 uncharacterized protein LOC105436058 [Cucumis sativus] (e value: 3.7e-74; % identity: 57)
Query:  TLSPS----VCSSITSQEVWNVMEKHYSSSGRTNVVNLKTDLQSISKKSSESISDYIIRSKEIKDKLANVSITINEEDLLIYALNGLPIEYNTFRTSMRT
        TLSP     V  S +S++VW+V+ K YSS  R+NVVNLK+DLQ+I KK  ESI  YI R KEIKDKLANVS  INEEDLLIYALNGLP EYNTFRTSMRT
Subjt:  TLSPS----VCSSITSQEVWNVMEKHYSSSGRTNVVNLKTDLQSISKKSSESISDYIIRSKEIKDKLANVSITINEEDLLIYALNGLPIEYNTFRTSMRT

Query:  RSQPVTFEELYVLLISEESAIEKQGKRDESSPSPTAMIATSQAHHMQRNFSPQFSRGKFGGGTGRGRSNHDRGRVYSPTQSRGCTSSGLFSSNTP-SNGR
        RSQPVTFEEL+VLL +EESA+ KQ K D+S   PT ++++SQ+     + +P F+   F  G G G+ N+  GR     Q+RG    GL     P  +  
Subjt:  RSQPVTFEELYVLLISEESAIEKQGKRDESSPSPTAMIATSQAHHMQRNFSPQFSRGKFGGGTGRGRSNHDRGRVYSPTQSRGCTSSGLFSSNTP-SNGR

Query:  IHCQICNRPGHTVIDCFNRMNYNFQDRHLPMQLAVMVAVQNQQFQSNFSLPSAPAPWITDSGCNAHVTSNLSNLSIASEYNGDDQVSVGSGQSLPISHSG
          CQIC+R GHT +DCFNRMNYNFQ RH P QLA MVA QN  F S  +  S     +TDSGCN H+TS+++ +S+A EYNG++QV VG+GQ+ PISHSG
Subjt:  IHCQICNRPGHTVIDCFNRMNYNFQDRHLPMQLAVMVAVQNQQFQSNFSLPSAPAPWITDSGCNAHVTSNLSNLSIASEYNGDDQVSVGSGQSLPISHSG

XP_016900446.1 PREDICTED: uncharacterized protein LOC103490319 isoform X1 [Cucumis melo] (e value: 1.3e-74; % identity: 53.42)
Query:  TLSPS----VCSSITSQEVWNVMEKHYSSSGRTNVVNLKTDLQSISKKSSESISDYIIRSKEIKDKLANVSITINEEDLLIYALNGLPIEYNTFRTSMRT
        TLSP     V  S +S++VW+V+ K YSS  R+NVVNLK+DLQ+I KK  ESI  YI R KEIKDKLANVS  INEEDLLIYALNGLP EYNTFRTSMRT
Subjt:  TLSPS----VCSSITSQEVWNVMEKHYSSSGRTNVVNLKTDLQSISKKSSESISDYIIRSKEIKDKLANVSITINEEDLLIYALNGLPIEYNTFRTSMRT

Query:  RSQPVTFEELYVLLISEESAIEKQGKRDESSPSPTAMIATSQAHHMQRNFSPQFSRGKFGGGTGRGRSNHDRGRVYSPTQSRGCTSSGLFSSNTPSNGRI
        RSQPVTFEEL+VLL +EESA+ KQ K D+S   PT ++++SQ+     + +P F    F  G G G+ ++  GR     Q+RG  SS      +  +   
Subjt:  RSQPVTFEELYVLLISEESAIEKQGKRDESSPSPTAMIATSQAHHMQRNFSPQFSRGKFGGGTGRGRSNHDRGRVYSPTQSRGCTSSGLFSSNTPSNGRI

Query:  HCQICNRPGHTVIDCFNRMNYNFQDRHLPMQLAVMVAVQNQQFQSNFSLPSAPAPWITDSGCNAHVTSNLSNLSIASEYNGDDQVSVGSGQSLPISHSGY
         CQIC+R GHT +DCFNRMNYNFQ RH P QLA MVA QN  F S  +  S     +TDSGCN  +TS+++ +S+A EYNG++QV +G+GQ+ P+SHSG+
Subjt:  HCQICNRPGHTVIDCFNRMNYNFQDRHLPMQLAVMVAVQNQQFQSNFSLPSAPAPWITDSGCNAHVTSNLSNLSIASEYNGDDQVSVGSGQSLPISHSGY

Query:  GVLKTSSTSLHLSNLFCVPHIA
           + +S S  ++ LFC  + A
Subjt:  GVLKTSSTSLHLSNLFCVPHIA

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia] (e value: 4.1e-73; % identity: 56.15)
Query:  VCSSITSQEVWNVMEKHYSSSGRTNVVNLKTDLQSISKKSSESISDYIIRSKEIKDKLANVSITINEEDLLIYALNGLPIEYNTFRTSMRTRSQPVTFEE
        V  S TS++VW V+EKHYSS+ RTNVVNLK+DLQSI KK+ ESI  Y+ R KEIKDK ANVSITIN+E LLIYALNGL  EYNT  TSMRTR+Q V+FEE
Subjt:  VCSSITSQEVWNVMEKHYSSSGRTNVVNLKTDLQSISKKSSESISDYIIRSKEIKDKLANVSITINEEDLLIYALNGLPIEYNTFRTSMRTRSQPVTFEE

Query:  LYVLLISEESAIEKQGKRDESSPSPTAMIATS-QAHHMQRNFSPQFS--RGKFGGGTGRGRSNHDRGRVYSPT---QSRGCTSSGLFSSNTPSNGRIHCQ
        L+V + SEESAIEKQ KR++    P A+ A+S Q+ +    F P  S  RG+ G   GRG++N      ++PT   Q RG  SSG F ++  ++ R  CQ
Subjt:  LYVLLISEESAIEKQGKRDESSPSPTAMIATS-QAHHMQRNFSPQFS--RGKFGGGTGRGRSNHDRGRVYSPT---QSRGCTSSGLFSSNTPSNGRIHCQ

Query:  ICNRPGHTVIDCFNRMNYNFQDRHLPMQLAVMVAVQNQQFQSNFSLPSAPAPWITDSGCNAHVTSNLSNL---SIASEYNGDDQVSVGSGQSLPISHSGY
        IC + GHT +DC+NRMN++FQ RH P QLA MVAVQN  + +  +  S+P  W+ DS CN H+T++LSNL   SIAS+YNG++ +SVGSGQS PI+H G 
Subjt:  ICNRPGHTVIDCFNRMNYNFQDRHLPMQLAVMVAVQNQQFQSNFSLPSAPAPWITDSGCNAHVTSNLSNL---SIASEYNGDDQVSVGSGQSLPISHSGY

Query:  G
        G
Subjt:  G

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X2 (e value: 6.2e-75; % identity: 53.42)
Query:  TLSPS----VCSSITSQEVWNVMEKHYSSSGRTNVVNLKTDLQSISKKSSESISDYIIRSKEIKDKLANVSITINEEDLLIYALNGLPIEYNTFRTSMRT
        TLSP     V  S +S++VW+V+ K YSS  R+NVVNLK+DLQ+I KK  ESI  YI R KEIKDKLANVS  INEEDLLIYALNGLP EYNTFRTSMRT
Subjt:  TLSPS----VCSSITSQEVWNVMEKHYSSSGRTNVVNLKTDLQSISKKSSESISDYIIRSKEIKDKLANVSITINEEDLLIYALNGLPIEYNTFRTSMRT

Query:  RSQPVTFEELYVLLISEESAIEKQGKRDESSPSPTAMIATSQAHHMQRNFSPQFSRGKFGGGTGRGRSNHDRGRVYSPTQSRGCTSSGLFSSNTPSNGRI
        RSQPVTFEEL+VLL +EESA+ KQ K D+S   PT ++++SQ+     + +P F    F  G G G+ ++  GR     Q+RG  SS      +  +   
Subjt:  RSQPVTFEELYVLLISEESAIEKQGKRDESSPSPTAMIATSQAHHMQRNFSPQFSRGKFGGGTGRGRSNHDRGRVYSPTQSRGCTSSGLFSSNTPSNGRI

Query:  HCQICNRPGHTVIDCFNRMNYNFQDRHLPMQLAVMVAVQNQQFQSNFSLPSAPAPWITDSGCNAHVTSNLSNLSIASEYNGDDQVSVGSGQSLPISHSGY
         CQIC+R GHT +DCFNRMNYNFQ RH P QLA MVA QN  F S  +  S     +TDSGCN  +TS+++ +S+A EYNG++QV +G+GQ+ P+SHSG+
Subjt:  HCQICNRPGHTVIDCFNRMNYNFQDRHLPMQLAVMVAVQNQQFQSNFSLPSAPAPWITDSGCNAHVTSNLSNLSIASEYNGDDQVSVGSGQSLPISHSGY

Query:  GVLKTSSTSLHLSNLFCVPHIA
           + +S S  ++ LFC  + A
Subjt:  GVLKTSSTSLHLSNLFCVPHIA

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X3 (e value: 2.9e-72; % identity: 55.52)
Query:  TLSPS----VCSSITSQEVWNVMEKHYSSSGRTNVVNLKTDLQSISKKSSESISDYIIRSKEIKDKLANVSITINEEDLLIYALNGLPIEYNTFRTSMRT
        TLSP     V  S +S++VW+V+ K YSS  R+NVVNLK+DLQ+I KK  ESI  YI R KEIKDKLANVS  INEEDLLIYALNGLP EYNTFRTSMRT
Subjt:  TLSPS----VCSSITSQEVWNVMEKHYSSSGRTNVVNLKTDLQSISKKSSESISDYIIRSKEIKDKLANVSITINEEDLLIYALNGLPIEYNTFRTSMRT

Query:  RSQPVTFEELYVLLISEESAIEKQGKRDESSPSPTAMIATSQAHHMQRNFSPQFSRGKFGGGTGRGRSNHDRGRVYSPTQSRGCTSSGLFSSNTPSNGRI
        RSQPVTFEEL+VLL +EESA+ KQ K D+S   PT ++++SQ+     + +P F    F  G G G+ ++  GR     Q+RG  SS      +  +   
Subjt:  RSQPVTFEELYVLLISEESAIEKQGKRDESSPSPTAMIATSQAHHMQRNFSPQFSRGKFGGGTGRGRSNHDRGRVYSPTQSRGCTSSGLFSSNTPSNGRI

Query:  HCQICNRPGHTVIDCFNRMNYNFQDRHLPMQLAVMVAVQNQQFQSNFSLPSAPAPWITDSGCNAHVTSNLSNLSIASEYNGDDQVSVGSGQSLPISHSG
         CQIC+R GHT +DCFNRMNYNFQ RH P QLA MVA QN  F S  +  S     +TDSGCN  +TS+++ +S+A EYNG++QV +G+GQ+ P+SHSG
Subjt:  HCQICNRPGHTVIDCFNRMNYNFQDRHLPMQLAVMVAVQNQQFQSNFSLPSAPAPWITDSGCNAHVTSNLSNLSIASEYNGDDQVSVGSGQSLPISHSG

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X1 (e value: 6.2e-75; % identity: 53.42)
Query:  TLSPS----VCSSITSQEVWNVMEKHYSSSGRTNVVNLKTDLQSISKKSSESISDYIIRSKEIKDKLANVSITINEEDLLIYALNGLPIEYNTFRTSMRT
        TLSP     V  S +S++VW+V+ K YSS  R+NVVNLK+DLQ+I KK  ESI  YI R KEIKDKLANVS  INEEDLLIYALNGLP EYNTFRTSMRT
Subjt:  TLSPS----VCSSITSQEVWNVMEKHYSSSGRTNVVNLKTDLQSISKKSSESISDYIIRSKEIKDKLANVSITINEEDLLIYALNGLPIEYNTFRTSMRT

Query:  RSQPVTFEELYVLLISEESAIEKQGKRDESSPSPTAMIATSQAHHMQRNFSPQFSRGKFGGGTGRGRSNHDRGRVYSPTQSRGCTSSGLFSSNTPSNGRI
        RSQPVTFEEL+VLL +EESA+ KQ K D+S   PT ++++SQ+     + +P F    F  G G G+ ++  GR     Q+RG  SS      +  +   
Subjt:  RSQPVTFEELYVLLISEESAIEKQGKRDESSPSPTAMIATSQAHHMQRNFSPQFSRGKFGGGTGRGRSNHDRGRVYSPTQSRGCTSSGLFSSNTPSNGRI

Query:  HCQICNRPGHTVIDCFNRMNYNFQDRHLPMQLAVMVAVQNQQFQSNFSLPSAPAPWITDSGCNAHVTSNLSNLSIASEYNGDDQVSVGSGQSLPISHSGY
         CQIC+R GHT +DCFNRMNYNFQ RH P QLA MVA QN  F S  +  S     +TDSGCN  +TS+++ +S+A EYNG++QV +G+GQ+ P+SHSG+
Subjt:  HCQICNRPGHTVIDCFNRMNYNFQDRHLPMQLAVMVAVQNQQFQSNFSLPSAPAPWITDSGCNAHVTSNLSNLSIASEYNGDDQVSVGSGQSLPISHSGY

Query:  GVLKTSSTSLHLSNLFCVPHIA
           + +S S  ++ LFC  + A
Subjt:  GVLKTSSTSLHLSNLFCVPHIA

A0A5D3CLI6 T4.5 (e value: 1.3e-72; % identity: 53.11)
Query:  TLSPS----VCSSITSQEVWNVMEKHYSSSGRTNVVNLKTDLQSISKKSSESISDYIIRSKEIKDKLANVSITINEEDLLIYALNGLPIEYNTFRTSMRT
        TLSP     V  S +S++VW+V+ K YSS  R+NVVNLK+DLQ+I KK  ESI  YI R KEIKDKLANVS  INEEDLLIYALNGLP EYNTFRTSMRT
Subjt:  TLSPS----VCSSITSQEVWNVMEKHYSSSGRTNVVNLKTDLQSISKKSSESISDYIIRSKEIKDKLANVSITINEEDLLIYALNGLPIEYNTFRTSMRT

Query:  RSQPVTFEELYVLLISEESAIEKQGKRDESSPSPTAMIATSQAHHMQRNFSPQFSRGKFGGGTGRGRSNHDRGRVYSPTQSRGCTSSGLFSSNTPSNGRI
        RSQPVTFEEL+VLL +EESA+ KQ K D+S   PT ++++SQ+     + +P F    F  G G G+ ++  GR     Q+RG  SS      +  +   
Subjt:  RSQPVTFEELYVLLISEESAIEKQGKRDESSPSPTAMIATSQAHHMQRNFSPQFSRGKFGGGTGRGRSNHDRGRVYSPTQSRGCTSSGLFSSNTPSNGRI

Query:  HCQICNRPGHTVIDCFNRMNYNFQDRHLPMQLAVMVAVQNQQFQSNFSLPSAPAPWITDSGCNAHVTSNLSNLSIASEYNGDDQVSVGSGQSLPISHSGY
         CQIC+R GHT +DCFNRMNYNFQ RH P QLA MVA QN  F S  +  S     +TDSGCN  +TS+++ +S+A EYNG++QV +G+GQ+ P+SHS Y
Subjt:  HCQICNRPGHTVIDCFNRMNYNFQDRHLPMQLAVMVAVQNQQFQSNFSLPSAPAPWITDSGCNAHVTSNLSNLSIASEYNGDDQVSVGSGQSLPISHSGY

Query:  GVLKTSSTSLHLSNLFCVPHIA
              ST  ++++     HIA
Subjt:  GVLKTSSTSLHLSNLFCVPHIA

A0A6J1D9L6 uncharacterized protein LOC111018892 (e value: 2.0e-73; % identity: 56.15)
Query:  VCSSITSQEVWNVMEKHYSSSGRTNVVNLKTDLQSISKKSSESISDYIIRSKEIKDKLANVSITINEEDLLIYALNGLPIEYNTFRTSMRTRSQPVTFEE
        V  S TS++VW V+EKHYSS+ RTNVVNLK+DLQSI KK+ ESI  Y+ R KEIKDK ANVSITIN+E LLIYALNGL  EYNT  TSMRTR+Q V+FEE
Subjt:  VCSSITSQEVWNVMEKHYSSSGRTNVVNLKTDLQSISKKSSESISDYIIRSKEIKDKLANVSITINEEDLLIYALNGLPIEYNTFRTSMRTRSQPVTFEE

Query:  LYVLLISEESAIEKQGKRDESSPSPTAMIATS-QAHHMQRNFSPQFS--RGKFGGGTGRGRSNHDRGRVYSPT---QSRGCTSSGLFSSNTPSNGRIHCQ
        L+V + SEESAIEKQ KR++    P A+ A+S Q+ +    F P  S  RG+ G   GRG++N      ++PT   Q RG  SSG F ++  ++ R  CQ
Subjt:  LYVLLISEESAIEKQGKRDESSPSPTAMIATS-QAHHMQRNFSPQFS--RGKFGGGTGRGRSNHDRGRVYSPT---QSRGCTSSGLFSSNTPSNGRIHCQ

Query:  ICNRPGHTVIDCFNRMNYNFQDRHLPMQLAVMVAVQNQQFQSNFSLPSAPAPWITDSGCNAHVTSNLSNL---SIASEYNGDDQVSVGSGQSLPISHSGY
        IC + GHT +DC+NRMN++FQ RH P QLA MVAVQN  + +  +  S+P  W+ DS CN H+T++LSNL   SIAS+YNG++ +SVGSGQS PI+H G 
Subjt:  ICNRPGHTVIDCFNRMNYNFQDRHLPMQLAVMVAVQNQQFQSNFSLPSAPAPWITDSGCNAHVTSNLSNL---SIASEYNGDDQVSVGSGQSLPISHSGY

Query:  G
        G
Subjt:  G

SwissProt top hits (e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 2.5e-04; % identity: 22.06)
Query:  LSPSVCSSI----TSQEVWNVMEKHYSSSGRTNVVNLKTDLQSISKKSSESISDYIIRSKEIKDKLANVSITINEEDLLIYALNGLPIEYNTFRTSMRTR
        LS  V ++I    T++ +W  +E  Y S   TN + LK  L ++      +   ++     +  +LAN+ + I EED  I  LN LP  Y+   T++   
Subjt:  LSPSVCSSI----TSQEVWNVMEKHYSSSGRTNVVNLKTDLQSISKKSSESISDYIIRSKEIKDKLANVSITINEEDLLIYALNGLPIEYNTFRTSMRTR

Query:  SQPVTFEELYVLLISEESAIEKQGKRDESSPSPTAMIATSQAHHMQRNFSPQFSRGKFGGGTGRGRS-NHDRGRVYSPTQSRGCTSSGLFSSNTPSNGRI
           +  +++   L+      EK  K+ E+     A+I   +    QR      S   +G    RG+S N  + RV                         
Subjt:  SQPVTFEELYVLLISEESAIEKQGKRDESSPSPTAMIATSQAHHMQRNFSPQFSRGKFGGGTGRGRS-NHDRGRVYSPTQSRGCTSSGLFSSNTPSNGRI

Query:  HCQICNRPGHTVIDCFNRMNYNFQDRHLPMQLAVMVAVQN--------QQFQSNFSLPSAPAPWITDSGCNAHVT--SNLSNLSIASEYNGDDQVSVGSG
        +C  CN+PGH   DC N      +             VQN         + +    L    + W+ D+  + H T   +L    +A ++     V +G+ 
Subjt:  HCQICNRPGHTVIDCFNRMNYNFQDRHLPMQLAVMVAVQN--------QQFQSNFSLPSAPAPWITDSGCNAHVT--SNLSNLSIASEYNGDDQVSVGSG

Query:  QSLPISHSGYGVLKTS-STSLHLSNLFCVPHIAANFTNDV
            I+  G   +KT+   +L L ++  VP +  N  + +
Subjt:  QSLPISHSGYGVLKTS-STSLHLSNLFCVPHIAANFTNDV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 1.8e-18; % identity: 25.75)
Query:  VTLSPSVCSSITSQEVWNVMEKHYSSSGRTNVVNLKTDLQSISKKSSESISDYIIRSKEIKDKLANVSITINEEDLLIYALNGLPIEYNTFRTSMRTRSQ
        +++ P+V  + T+ ++W  + K Y++    +V  L+T L+  + K +++I DY+       D+LA +   ++ ++ +   L  LP EY      +  +  
Subjt:  VTLSPSVCSSITSQEVWNVMEKHYSSSGRTNVVNLKTDLQSISKKSSESISDYIIRSKEIKDKLANVSITINEEDLLIYALNGLPIEYNTFRTSMRTRSQ

Query:  PVTFEELYVLLISEESAIEKQGKRDESSPSPTAMIATSQAHHMQRNFSPQFSRGKFGGGTGRGRSNHDRGRVYSPTQSRGCTSSGLFSSNTPSNGRI-HC
        P T  E++  L++ ES I        SS +   + A + +H   RN +   +     G       N +      P Q    +S+    +N  S   +  C
Subjt:  PVTFEELYVLLISEESAIEKQGKRDESSPSPTAMIATSQAHHMQRNFSPQFSRGKFGGGTGRGRSNHDRGRVYSPTQSRGCTSSGLFSSNTPSNGRI-HC

Query:  QICNRPGHTVIDCFNRMNYNFQDRHLPMQLAVMVAVQNQQFQSNFS-----------LPSAPAPWITDSGCNAHVTSNLSNLSIASEYNGDDQVSVGSGQ
        QIC   GH+   C    ++             + +V +QQ  S F+            P +   W+ DSG   H+TS+ +NLS+   Y G D V V  G 
Subjt:  QICNRPGHTVIDCFNRMNYNFQDRHLPMQLAVMVAVQNQQFQSNFS-----------LPSAPAPWITDSGCNAHVTSNLSNLSIASEYNGDDQVSVGSGQ

Query:  SLPISHSGYGVLKTSSTSLHLSNLFCVPHIAANF--------TNDVDITMAPAC--TTDAHTNVP
        ++PISH+G   L T S  L+L N+  VP+I  N          N V +   PA     D +T VP
Subjt:  SLPISHSGYGVLKTSSTSLHLSNLFCVPHIAANF--------TNDVDITMAPAC--TTDAHTNVP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 1.1e-15; % identity: 26.4)
Query:  VTLSPSVCSSITSQEVWNVMEKHYSSSGRTNVVNLKTDLQSISKKSSESISDYIIRSKEIKDKLANVSITINEEDLLIYALNGLPIEYNTFRTSMRTRSQ
        +++ P+V  + T+ ++W  + K Y++    +V  L+                +I R     D+LA +   ++ ++ +   L  LP +Y      +  +  
Subjt:  VTLSPSVCSSITSQEVWNVMEKHYSSSGRTNVVNLKTDLQSISKKSSESISDYIIRSKEIKDKLANVSITINEEDLLIYALNGLPIEYNTFRTSMRTRSQ

Query:  PVTFEELYVLLISEESAIEKQGKRDESSPSPTAMIATSQAHHMQRNFSPQFSRGKFGGGTGRGRSNHDRGRVYSPTQSRGCTSSGLFSSN---TPSNGRI
        P +  E++  LI+ ES +       E  P  TA + T +  +  RN   Q +R    G      +N++R   + P      +SSG  S N    P  GR 
Subjt:  PVTFEELYVLLISEESAIEKQGKRDESSPSPTAMIATSQAHHMQRNFSPQFSRGKFGGGTGRGRSNHDRGRVYSPTQSRGCTSSGLFSSN---TPSNGRI

Query:  HCQICNRPGHTVIDCFNRMNYNFQDRHLPMQLAVMVAVQNQQFQSNFSLPSAPAPWITDSGCNAHVTSNLSNLSIASEYNGDDQVSVGSGQSLPISHSGY
         CQIC+  GH+   C     + FQ      Q          +     + P     W+ DSG   H+TS+ +NLS    Y G D V +  G ++PI+H+G 
Subjt:  HCQICNRPGHTVIDCFNRMNYNFQDRHLPMQLAVMVAVQNQQFQSNFSLPSAPAPWITDSGCNAHVTSNLSNLSIASEYNGDDQVSVGSGQSLPISHSGY

Query:  GVLKTSSTSLHLSNLFCVPHIAANF--------TNDVDITMAPAC--TTDAHTNVP
          L TSS SL L+ +  VP+I  N         TN V +   PA     D +T VP
Subjt:  GVLKTSSTSLHLSNLFCVPHIAANF--------TNDVDITMAPAC--TTDAHTNVP

Arabidopsis top hits (e value, % identity, alignment)
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) (e value: 3.6e-06; % identity: 25.41)
Query:  TLSP-----SVCSSITSQEVWNVMEKHYSSSGRTNVVNLKTDLQSISKKSSESISDYIIRSKEIKDKLANVSITINEEDLLIYALNGLPIEYNTFRTSMR
        TL+P     S  +S TS+++W  ++  + ++     + L ++L++        ++DY  + K++ D L NV + + + +L++Y LNGL  +++     ++
Subjt:  TLSP-----SVCSSITSQEVWNVMEKHYSSSGRTNVVNLKTDLQSISKKSSESISDYIIRSKEIKDKLANVSITINEEDLLIYALNGLPIEYNTFRTSMR

Query:  TRSQPVTFEELYVLLISEESAIEKQGKRDESSPSPTAMIATSQAHHMQRNFSP-----QFSRGKFGGGTGRGRSNH-DRGR
         R    +F++   +L  EE  +++  K     P+PT +  +S +  +  + +P     Q S G   G  GRGR N+  RGR
Subjt:  TRSQPVTFEELYVLLISEESAIEKQGKRDESSPSPTAMIATSQAHHMQRNFSP-----QFSRGKFGGGTGRGRSNH-DRGR


Sequences
CDS sequence
ATGTCTTCGGCTGGCGCAGCAGTTGAATCTTCGTTCGGTGTTGGTGGAATACGACTCTCGTCAAGCAATCGATCTGGTTCGAGGAGATCAAATTTCACTCCTTGTGGCTG
GAACATGGGTGCGGGACATTCAAGAGCTTGCAAGAAATTTCACCCACATCGACTTTTTGCACAAATTGCCCACAAGCTTGCAACTGATGCTTGGAAGTCGAGGGAAGATA
TTCTTTGGCTTCATTCTTTTCCCATCTGGATTTTGAGAGCTATCACACAAGGGAGTCACGTGTATCTTGTAACCCTCAGCCCTAGCGTATGTAGCAGCATTACTTCACAA
GAAGTCTGGAATGTTATGGAAAAACACTATTCTTCAAGCGGTCGAACGAATGTGGTAAATCTGAAGACTGATTTGCAATCTATTTCGAAGAAATCAAGTGAATCCATCAG
TGATTATATTATACGCAGCAAGGAGATCAAGGATAAACTGGCGAACGTTTCCATTACAATCAATGAAGAAGATCTGTTAATATATGCCCTAAATGGCCTTCCAATTGAGT
ACAACACTTTCCGGACTTCAATGCGCACTCGTTCGCAACCAGTTACTTTTGAAGAACTCTACGTTCTGTTGATTTCTGAGGAGTCTGCTATCGAGAAACAAGGGAAACGT
GATGAATCGTCTCCCTCTCCTACTGCTATGATTGCAACTTCTCAAGCACATCATATGCAACGCAATTTTTCTCCTCAATTTTCTCGAGGAAAGTTTGGAGGCGGAACTGG
TCGAGGACGCTCAAATCACGATCGCGGTCGTGTGTATTCTCCCACACAAAGTCGCGGATGTACCTCCAGCGGTCTTTTCTCCTCAAATACACCTTCTAATGGACGAATTC
ACTGTCAAATTTGTAATCGCCCTGGGCATACTGTGATAGATTGTTTTAATCGCATGAATTACAATTTCCAGGACCGTCATCTGCCTATGCAACTAGCTGTGATGGTTGCT
GTTCAAAATCAGCAATTTCAGTCAAATTTCTCTTTGCCGTCTGCACCTGCACCATGGATTACTGATTCGGGTTGTAATGCTCACGTTACTTCAAACTTGAGCAACTTATC
CATAGCATCAGAGTATAATGGTGATGACCAGGTTTCGGTAGGCAGCGGGCAATCCCTTCCAATATCACATTCAGGTTATGGAGTTCTTAAAACTTCTTCTACTTCTCTTC
ACTTGTCAAATCTTTTCTGTGTTCCTCATATAGCTGCTAACTTTACTAATGATGTTGACATTACAATGGCCCCTGCTTGTACTACTGATGCACATACCAATGTGCCTGCT
GAAATTGCTCCTTGTTATTCGCAATCTGCTGCTGCAAATAGTCTACCTATTGATATGGGTGCCGCTGTTGACAGTCACAATATTGCTATAAATGGCAATGCTACTATTGA
CTTAACACACCTCATTCGTTTCTATGCCTCCGCAACCATTGGTACAAAATAG
mRNA sequence
ATGTCTTCGGCTGGCGCAGCAGTTGAATCTTCGTTCGGTGTTGGTGGAATACGACTCTCGTCAAGCAATCGATCTGGTTCGAGGAGATCAAATTTCACTCCTTGTGGCTG
GAACATGGGTGCGGGACATTCAAGAGCTTGCAAGAAATTTCACCCACATCGACTTTTTGCACAAATTGCCCACAAGCTTGCAACTGATGCTTGGAAGTCGAGGGAAGATA
TTCTTTGGCTTCATTCTTTTCCCATCTGGATTTTGAGAGCTATCACACAAGGGAGTCACGTGTATCTTGTAACCCTCAGCCCTAGCGTATGTAGCAGCATTACTTCACAA
GAAGTCTGGAATGTTATGGAAAAACACTATTCTTCAAGCGGTCGAACGAATGTGGTAAATCTGAAGACTGATTTGCAATCTATTTCGAAGAAATCAAGTGAATCCATCAG
TGATTATATTATACGCAGCAAGGAGATCAAGGATAAACTGGCGAACGTTTCCATTACAATCAATGAAGAAGATCTGTTAATATATGCCCTAAATGGCCTTCCAATTGAGT
ACAACACTTTCCGGACTTCAATGCGCACTCGTTCGCAACCAGTTACTTTTGAAGAACTCTACGTTCTGTTGATTTCTGAGGAGTCTGCTATCGAGAAACAAGGGAAACGT
GATGAATCGTCTCCCTCTCCTACTGCTATGATTGCAACTTCTCAAGCACATCATATGCAACGCAATTTTTCTCCTCAATTTTCTCGAGGAAAGTTTGGAGGCGGAACTGG
TCGAGGACGCTCAAATCACGATCGCGGTCGTGTGTATTCTCCCACACAAAGTCGCGGATGTACCTCCAGCGGTCTTTTCTCCTCAAATACACCTTCTAATGGACGAATTC
ACTGTCAAATTTGTAATCGCCCTGGGCATACTGTGATAGATTGTTTTAATCGCATGAATTACAATTTCCAGGACCGTCATCTGCCTATGCAACTAGCTGTGATGGTTGCT
GTTCAAAATCAGCAATTTCAGTCAAATTTCTCTTTGCCGTCTGCACCTGCACCATGGATTACTGATTCGGGTTGTAATGCTCACGTTACTTCAAACTTGAGCAACTTATC
CATAGCATCAGAGTATAATGGTGATGACCAGGTTTCGGTAGGCAGCGGGCAATCCCTTCCAATATCACATTCAGGTTATGGAGTTCTTAAAACTTCTTCTACTTCTCTTC
ACTTGTCAAATCTTTTCTGTGTTCCTCATATAGCTGCTAACTTTACTAATGATGTTGACATTACAATGGCCCCTGCTTGTACTACTGATGCACATACCAATGTGCCTGCT
GAAATTGCTCCTTGTTATTCGCAATCTGCTGCTGCAAATAGTCTACCTATTGATATGGGTGCCGCTGTTGACAGTCACAATATTGCTATAAATGGCAATGCTACTATTGA
CTTAACACACCTCATTCGTTTCTATGCCTCCGCAACCATTGGTACAAAATAG
Protein sequence
MSSAGAAVESSFGVGGIRLSSSNRSGSRRSNFTPCGWNMGAGHSRACKKFHPHRLFAQIAHKLATDAWKSREDILWLHSFPIWILRAITQGSHVYLVTLSPSVCSSITSQ
EVWNVMEKHYSSSGRTNVVNLKTDLQSISKKSSESISDYIIRSKEIKDKLANVSITINEEDLLIYALNGLPIEYNTFRTSMRTRSQPVTFEELYVLLISEESAIEKQGKR
DESSPSPTAMIATSQAHHMQRNFSPQFSRGKFGGGTGRGRSNHDRGRVYSPTQSRGCTSSGLFSSNTPSNGRIHCQICNRPGHTVIDCFNRMNYNFQDRHLPMQLAVMVA
VQNQQFQSNFSLPSAPAPWITDSGCNAHVTSNLSNLSIASEYNGDDQVSVGSGQSLPISHSGYGVLKTSSTSLHLSNLFCVPHIAANFTNDVDITMAPACTTDAHTNVPA
EIAPCYSQSAAANSLPIDMGAAVDSHNIAINGNATIDLTHLIRFYASATIGTK
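
The protein sequence above should be the frame-0 translation of the CDS sequence. The following is a minimal sketch for checking that correspondence, assuming the standard nuclear genetic code (NCBI translation table 1); the `translate` helper is hypothetical and not part of CuGenDB, and it only handles unambiguous A/C/G/T bases.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
# Assumption: the record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    # Drop line breaks and whitespace so a multi-line sequence can be pasted in.
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 18 bases of the CDS sequence listed above.
print(translate("ATGTCTTCGGCTGGCGCA"))  # MSSAGA, the start of the protein sequence
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence is a quick consistency check on the record; the final codon of the CDS, TAG, is a stop codon and is not included in the output.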