; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g24320 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g24320
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionAB hydrolase-1 domain-containing protein
Genome locationchr4:17524500..17527214
RNA-Seq ExpressionMoc04g24320
SyntenyMoc04g24320
Gene Ontology termsNA
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG55646.1 hypothetical protein EZV62_020902 [Acer yangbiense]2.6e-5028.65Show/hide
Query:  TEGSSSSNLNTLNAMPIATAINNTISSNPFGNPLSTVLAVKLDEKNYLLWKSMITAALHGQKLDGYVMGTIAQPPEMIQG---TGANATTLISNPAFDSW
        T   SSS+  T   +   +  N++  S+PFGN L+   A+KLD +N++LWK+M+T  + G +LDG++  T   PPE +      G + +   SNP ++ W
Subjt:  TEGSSSSNLNTLNAMPIATAINNTISSNPFGNPLSTVLAVKLDEKNYLLWKSMITAALHGQKLDGYVMGTIAQPPEMIQG---TGANATTLISNPAFDSW

Query:  STTDQSLLAWLYGSITPSVACDILNLRTSRDVWKALEDLYGATNKARITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLTDLD
           DQ L+ WLY S+T +VA  ++   T+  +WKALE+L+GA +K++   ++ ++Q TRK    M EYL+ MK  A SLA+AG+P  EN L  N+L  LD
Subjt:  STTDQSLLAWLYGSITPSVACDILNLRTSRDVWKALEDLYGATNKARITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLTDLD

Query:  AEYLPVACQINGKENMTWQEMHATLLAFENTLIHLNVV---------------------------TNNIDVTNASANYASNR------------------
        +EY+P+   I  +E+ TWQE++ TLL++++ L H+N V                           T+N    N   N A NR                  
Subjt:  AEYLPVACQINGKENMTWQEMHATLLAFENTLIHLNVV---------------------------TNNIDVTNASANYASNR------------------

Query:  -----------------------SFNQRGSTPQANNKA--PGAFMATPNVVNDQNWLMDSGATNHTTNDVTYLGQKDEYNGNEML---------------
                                 N  GS P AN+ A  P  F+ATP  V+D  W  DSGATNH TND   L  K  Y G+E L               
Subjt:  -----------------------SFNQRGSTPQANNKA--PGAFMATPNVVNDQNWLMDSGATNHTTNDVTYLGQKDEYNGNEML---------------

Query:  -------------------------------------------------TDKQSGRTILEGRLSEGLYQLDLPKPKAHFSASNKAVNFRPSHPQNPSSTN
                                                          DK +   +L GRL  GLYQL++P  K+ F       N +P   ++ SST 
Subjt:  -------------------------------------------------TDKQSGRTILEGRLSEGLYQLDLPKPKAHFSASNKAVNFRPSHPQNPSSTN

Query:  HTLLPQMSQTSHSNPVGTVTKLATPAEQSSPSKSVSQQ----ENLVVNSTAATNVCGNGIAC
        H     +  ++ S       +  +  + S+  K+V  +    +N +V+  A++       +C
Subjt:  HTLLPQMSQTSHSNPVGTVTKLATPAEQSSPSKSVSQQ----ENLVVNSTAATNVCGNGIAC

TXG69253.1 hypothetical protein EZV62_004188 [Acer yangbiense]1.1e-5029.26Show/hide
Query:  SNLNTLNAMPIATAINNTISSNPFGNPLSTVLAVKLDEKNYLLWKSMITAALHGQKLDGYVMGTIAQPPEMIQG-----------TGANATTLISNPAFD
        S+ +T     +    N++  S+PFGN L+   A+KLD +N++LWK+M+T  + G +LDG++  T   PPE +              G + +   SNP ++
Subjt:  SNLNTLNAMPIATAINNTISSNPFGNPLSTVLAVKLDEKNYLLWKSMITAALHGQKLDGYVMGTIAQPPEMIQG-----------TGANATTLISNPAFD

Query:  SWSTTDQSLLAWLYGSITPSVACDILNLRTSRDVWKALEDLYGATNKARITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLTD
         W   DQ L+ WLY S+T +VA  ++   T+  +WKALE+L+GA +K++   ++ ++Q TRK    M EYL+ MK  A SLA+AG+P  EN L  N L  
Subjt:  SWSTTDQSLLAWLYGSITPSVACDILNLRTSRDVWKALEDLYGATNKARITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLTD

Query:  LDAEYLPVACQINGKENMTWQEMHATLLAFENTLIHLNVV---------------------------TNNIDVTNASANYASNR----------------
        LD+EY+P+   I  +E+ TWQE++ TLL++++ L H+N V                           T+N    N   N A NR                
Subjt:  LDAEYLPVACQINGKENMTWQEMHATLLAFENTLIHLNVV---------------------------TNNIDVTNASANYASNR----------------

Query:  -------------------------SFNQRGSTPQANNKA--PGAFMATPNVVNDQNWLMDSGATNHTTNDVTYLGQKDEYNGNEML-------------
                                   N  GS P AN+ A  P  F+ATP  V+D  W  DSGATNH TND   L  K +Y G+E L             
Subjt:  -------------------------SFNQRGSTPQANNKA--PGAFMATPNVVNDQNWLMDSGATNHTTNDVTYLGQKDEYNGNEML-------------

Query:  ---------------------------------------------------TDKQSGRTILEGRLSEGLYQLDLPKPKAHFSASNKAVNFRPSHPQNPSS
                                                            DK +G  +L GRL  GLYQL++P  K+ F       N +P   ++ SS
Subjt:  ---------------------------------------------------TDKQSGRTILEGRLSEGLYQLDLPKPKAHFSASNKAVNFRPSHPQNPSS

Query:  TNHTL-LPQMSQTSHSNPVGTVTKLATPAEQSSPSKS------VSQQENLVVNSTAA
        T H   LP +S  S+ + + +++       +SS SK       +    N V+N  A+
Subjt:  TNHTL-LPQMSQTSHSNPVGTVTKLATPAEQSSPSKS------VSQQENLVVNSTAA

XP_022142770.1 uncharacterized protein LOC111012809 [Momordica charantia]6.9e-6445.16Show/hide
Query:  ITPSVACDILNLRTSRDVWKALEDLYGATNKARITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLTDLDAEYLPVACQINGKE
        ++P +ACD+L++ TSRDVWKALEDLY   NKARI QLK +LQ TRKNQLKMS+YLSTMKQLA  L LAGEP+S +SL+++VLT L+AEYL + CQIN KE
Subjt:  ITPSVACDILNLRTSRDVWKALEDLYGATNKARITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLTDLDAEYLPVACQINGKE

Query:  NMTWQEMHATLLAFENTLIHLNVVTNNIDVTNASANYASNRSFNQR------------------------------------------------------
        N++WQE+HATL+ FEN LIHLN V +  DV+  SANY  N+S +Q                                                       
Subjt:  NMTWQEMHATLLAFENTLIHLNVVTNNIDVTNASANYASNRSFNQR------------------------------------------------------

Query:  -----GSTPQANNKAPGAFMATPNVVNDQNWLMDSGATNHTTNDVTYLGQKDEYNGNEMLT-----------------------DKQSGRTILEGRLSEG
             G+TPQ  N+AP A++  P V+ D NWL+DSGATNH TND T LGQ+ EY GNE LT                       DK++GR +LEG+L++G
Subjt:  -----GSTPQANNKAPGAFMATPNVVNDQNWLMDSGATNHTTNDVTYLGQKDEYNGNEMLT-----------------------DKQSGRTILEGRLSEG

Query:  LYQLDLPKPKAHFSASNKAVNFRPSHPQNPSSTNHTLLPQM
        LYQLDL KPK      NK       H    +S   +  PQ+
Subjt:  LYQLDLPKPKAHFSASNKAVNFRPSHPQNPSSTNHTLLPQM

XP_022157748.1 uncharacterized protein LOC111024384 isoform X1 [Momordica charantia]5.5e-6941.67Show/hide
Query:  MDTEGSSSSNL--NTLNAMPIATAINNTISSNPFGNPLSTVLAVKLDEKNYLLWKSMITAALHGQKLDGYVMGTIAQPPEMIQGTGANATT--LISNPAF
        M TE + +S++    +  + + T   +   +  FG+PL TVL VKLD+KNY LW+ M+ A L GQK DGYV+GT+A+PP+ +       T+  L  NP +
Subjt:  MDTEGSSSSNL--NTLNAMPIATAINNTISSNPFGNPLSTVLAVKLDEKNYLLWKSMITAALHGQKLDGYVMGTIAQPPEMIQGTGANATT--LISNPAF

Query:  DSWSTTDQSLLAWLYGSITPSVACDILNLRTSRDVWKALEDLYGATNKARITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLT
          W   DQ+LL WL+GS+TPS+ACD+++ R+SR+VWKALEDLYGAT+KARI QL+  LQ T+KN LKMSEYL  MKQ + SL LAGEPV+ N L++ VL+
Subjt:  DSWSTTDQSLLAWLYGSITPSVACDILNLRTSRDVWKALEDLYGATNKARITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLT

Query:  DLDAEYLPVACQINGKENMTWQEMHATLLAFENTLIHLNVVTNNI--DVTNASANY-------ASNRSFNQ--------RGS------------------
         L+AEYLP+ CQI GK++ +WQE+ ATL+ FENTL+ LN+V+      +++ S NY         NR F+Q        RGS                  
Subjt:  DLDAEYLPVACQINGKENMTWQEMHATLLAFENTLIHLNVVTNNI--DVTNASANY-------ASNRSFNQ--------RGS------------------

Query:  -----------------------------------TPQANNKAPGAFMATPNVVNDQNWLMDSGATNHTTNDVTYLGQKDEYNG
                                              +NN    A+MA P +V + +WL DSGAT+H T+D++ L  K +YNG
Subjt:  -----------------------------------TPQANNKAPGAFMATPNVVNDQNWLMDSGATNHTTNDVTYLGQKDEYNG

XP_022157750.1 uncharacterized protein LOC111024384 isoform X2 [Momordica charantia]5.5e-6941.67Show/hide
Query:  MDTEGSSSSNL--NTLNAMPIATAINNTISSNPFGNPLSTVLAVKLDEKNYLLWKSMITAALHGQKLDGYVMGTIAQPPEMIQGTGANATT--LISNPAF
        M TE + +S++    +  + + T   +   +  FG+PL TVL VKLD+KNY LW+ M+ A L GQK DGYV+GT+A+PP+ +       T+  L  NP +
Subjt:  MDTEGSSSSNL--NTLNAMPIATAINNTISSNPFGNPLSTVLAVKLDEKNYLLWKSMITAALHGQKLDGYVMGTIAQPPEMIQGTGANATT--LISNPAF

Query:  DSWSTTDQSLLAWLYGSITPSVACDILNLRTSRDVWKALEDLYGATNKARITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLT
          W   DQ+LL WL+GS+TPS+ACD+++ R+SR+VWKALEDLYGAT+KARI QL+  LQ T+KN LKMSEYL  MKQ + SL LAGEPV+ N L++ VL+
Subjt:  DSWSTTDQSLLAWLYGSITPSVACDILNLRTSRDVWKALEDLYGATNKARITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLT

Query:  DLDAEYLPVACQINGKENMTWQEMHATLLAFENTLIHLNVVTNNI--DVTNASANY-------ASNRSFNQ--------RGS------------------
         L+AEYLP+ CQI GK++ +WQE+ ATL+ FENTL+ LN+V+      +++ S NY         NR F+Q        RGS                  
Subjt:  DLDAEYLPVACQINGKENMTWQEMHATLLAFENTLIHLNVVTNNI--DVTNASANY-------ASNRSFNQ--------RGS------------------

Query:  -----------------------------------TPQANNKAPGAFMATPNVVNDQNWLMDSGATNHTTNDVTYLGQKDEYNG
                                              +NN    A+MA P +V + +WL DSGAT+H T+D++ L  K +YNG
Subjt:  -----------------------------------TPQANNKAPGAFMATPNVVNDQNWLMDSGATNHTTNDVTYLGQKDEYNG

TrEMBL top hitse value%identityAlignment
A0A5C7HHE9 Uncharacterized protein1.2e-5028.65Show/hide
Query:  TEGSSSSNLNTLNAMPIATAINNTISSNPFGNPLSTVLAVKLDEKNYLLWKSMITAALHGQKLDGYVMGTIAQPPEMIQG---TGANATTLISNPAFDSW
        T   SSS+  T   +   +  N++  S+PFGN L+   A+KLD +N++LWK+M+T  + G +LDG++  T   PPE +      G + +   SNP ++ W
Subjt:  TEGSSSSNLNTLNAMPIATAINNTISSNPFGNPLSTVLAVKLDEKNYLLWKSMITAALHGQKLDGYVMGTIAQPPEMIQG---TGANATTLISNPAFDSW

Query:  STTDQSLLAWLYGSITPSVACDILNLRTSRDVWKALEDLYGATNKARITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLTDLD
           DQ L+ WLY S+T +VA  ++   T+  +WKALE+L+GA +K++   ++ ++Q TRK    M EYL+ MK  A SLA+AG+P  EN L  N+L  LD
Subjt:  STTDQSLLAWLYGSITPSVACDILNLRTSRDVWKALEDLYGATNKARITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLTDLD

Query:  AEYLPVACQINGKENMTWQEMHATLLAFENTLIHLNVV---------------------------TNNIDVTNASANYASNR------------------
        +EY+P+   I  +E+ TWQE++ TLL++++ L H+N V                           T+N    N   N A NR                  
Subjt:  AEYLPVACQINGKENMTWQEMHATLLAFENTLIHLNVV---------------------------TNNIDVTNASANYASNR------------------

Query:  -----------------------SFNQRGSTPQANNKA--PGAFMATPNVVNDQNWLMDSGATNHTTNDVTYLGQKDEYNGNEML---------------
                                 N  GS P AN+ A  P  F+ATP  V+D  W  DSGATNH TND   L  K  Y G+E L               
Subjt:  -----------------------SFNQRGSTPQANNKA--PGAFMATPNVVNDQNWLMDSGATNHTTNDVTYLGQKDEYNGNEML---------------

Query:  -------------------------------------------------TDKQSGRTILEGRLSEGLYQLDLPKPKAHFSASNKAVNFRPSHPQNPSSTN
                                                          DK +   +L GRL  GLYQL++P  K+ F       N +P   ++ SST 
Subjt:  -------------------------------------------------TDKQSGRTILEGRLSEGLYQLDLPKPKAHFSASNKAVNFRPSHPQNPSSTN

Query:  HTLLPQMSQTSHSNPVGTVTKLATPAEQSSPSKSVSQQ----ENLVVNSTAATNVCGNGIAC
        H     +  ++ S       +  +  + S+  K+V  +    +N +V+  A++       +C
Subjt:  HTLLPQMSQTSHSNPVGTVTKLATPAEQSSPSKSVSQQ----ENLVVNSTAATNVCGNGIAC

A0A5C7IJ06 Uncharacterized protein5.6e-5129.26Show/hide
Query:  SNLNTLNAMPIATAINNTISSNPFGNPLSTVLAVKLDEKNYLLWKSMITAALHGQKLDGYVMGTIAQPPEMIQG-----------TGANATTLISNPAFD
        S+ +T     +    N++  S+PFGN L+   A+KLD +N++LWK+M+T  + G +LDG++  T   PPE +              G + +   SNP ++
Subjt:  SNLNTLNAMPIATAINNTISSNPFGNPLSTVLAVKLDEKNYLLWKSMITAALHGQKLDGYVMGTIAQPPEMIQG-----------TGANATTLISNPAFD

Query:  SWSTTDQSLLAWLYGSITPSVACDILNLRTSRDVWKALEDLYGATNKARITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLTD
         W   DQ L+ WLY S+T +VA  ++   T+  +WKALE+L+GA +K++   ++ ++Q TRK    M EYL+ MK  A SLA+AG+P  EN L  N L  
Subjt:  SWSTTDQSLLAWLYGSITPSVACDILNLRTSRDVWKALEDLYGATNKARITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLTD

Query:  LDAEYLPVACQINGKENMTWQEMHATLLAFENTLIHLNVV---------------------------TNNIDVTNASANYASNR----------------
        LD+EY+P+   I  +E+ TWQE++ TLL++++ L H+N V                           T+N    N   N A NR                
Subjt:  LDAEYLPVACQINGKENMTWQEMHATLLAFENTLIHLNVV---------------------------TNNIDVTNASANYASNR----------------

Query:  -------------------------SFNQRGSTPQANNKA--PGAFMATPNVVNDQNWLMDSGATNHTTNDVTYLGQKDEYNGNEML-------------
                                   N  GS P AN+ A  P  F+ATP  V+D  W  DSGATNH TND   L  K +Y G+E L             
Subjt:  -------------------------SFNQRGSTPQANNKA--PGAFMATPNVVNDQNWLMDSGATNHTTNDVTYLGQKDEYNGNEML-------------

Query:  ---------------------------------------------------TDKQSGRTILEGRLSEGLYQLDLPKPKAHFSASNKAVNFRPSHPQNPSS
                                                            DK +G  +L GRL  GLYQL++P  K+ F       N +P   ++ SS
Subjt:  ---------------------------------------------------TDKQSGRTILEGRLSEGLYQLDLPKPKAHFSASNKAVNFRPSHPQNPSS

Query:  TNHTL-LPQMSQTSHSNPVGTVTKLATPAEQSSPSKS------VSQQENLVVNSTAA
        T H   LP +S  S+ + + +++       +SS SK       +    N V+N  A+
Subjt:  TNHTL-LPQMSQTSHSNPVGTVTKLATPAEQSSPSKS------VSQQENLVVNSTAA

A0A6J1CLV9 uncharacterized protein LOC1110128093.4e-6445.16Show/hide
Query:  ITPSVACDILNLRTSRDVWKALEDLYGATNKARITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLTDLDAEYLPVACQINGKE
        ++P +ACD+L++ TSRDVWKALEDLY   NKARI QLK +LQ TRKNQLKMS+YLSTMKQLA  L LAGEP+S +SL+++VLT L+AEYL + CQIN KE
Subjt:  ITPSVACDILNLRTSRDVWKALEDLYGATNKARITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLTDLDAEYLPVACQINGKE

Query:  NMTWQEMHATLLAFENTLIHLNVVTNNIDVTNASANYASNRSFNQR------------------------------------------------------
        N++WQE+HATL+ FEN LIHLN V +  DV+  SANY  N+S +Q                                                       
Subjt:  NMTWQEMHATLLAFENTLIHLNVVTNNIDVTNASANYASNRSFNQR------------------------------------------------------

Query:  -----GSTPQANNKAPGAFMATPNVVNDQNWLMDSGATNHTTNDVTYLGQKDEYNGNEMLT-----------------------DKQSGRTILEGRLSEG
             G+TPQ  N+AP A++  P V+ D NWL+DSGATNH TND T LGQ+ EY GNE LT                       DK++GR +LEG+L++G
Subjt:  -----GSTPQANNKAPGAFMATPNVVNDQNWLMDSGATNHTTNDVTYLGQKDEYNGNEMLT-----------------------DKQSGRTILEGRLSEG

Query:  LYQLDLPKPKAHFSASNKAVNFRPSHPQNPSSTNHTLLPQM
        LYQLDL KPK      NK       H    +S   +  PQ+
Subjt:  LYQLDLPKPKAHFSASNKAVNFRPSHPQNPSSTNHTLLPQM

A0A6J1DTZ7 uncharacterized protein LOC111024384 isoform X22.7e-6941.67Show/hide
Query:  MDTEGSSSSNL--NTLNAMPIATAINNTISSNPFGNPLSTVLAVKLDEKNYLLWKSMITAALHGQKLDGYVMGTIAQPPEMIQGTGANATT--LISNPAF
        M TE + +S++    +  + + T   +   +  FG+PL TVL VKLD+KNY LW+ M+ A L GQK DGYV+GT+A+PP+ +       T+  L  NP +
Subjt:  MDTEGSSSSNL--NTLNAMPIATAINNTISSNPFGNPLSTVLAVKLDEKNYLLWKSMITAALHGQKLDGYVMGTIAQPPEMIQGTGANATT--LISNPAF

Query:  DSWSTTDQSLLAWLYGSITPSVACDILNLRTSRDVWKALEDLYGATNKARITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLT
          W   DQ+LL WL+GS+TPS+ACD+++ R+SR+VWKALEDLYGAT+KARI QL+  LQ T+KN LKMSEYL  MKQ + SL LAGEPV+ N L++ VL+
Subjt:  DSWSTTDQSLLAWLYGSITPSVACDILNLRTSRDVWKALEDLYGATNKARITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLT

Query:  DLDAEYLPVACQINGKENMTWQEMHATLLAFENTLIHLNVVTNNI--DVTNASANY-------ASNRSFNQ--------RGS------------------
         L+AEYLP+ CQI GK++ +WQE+ ATL+ FENTL+ LN+V+      +++ S NY         NR F+Q        RGS                  
Subjt:  DLDAEYLPVACQINGKENMTWQEMHATLLAFENTLIHLNVVTNNI--DVTNASANY-------ASNRSFNQ--------RGS------------------

Query:  -----------------------------------TPQANNKAPGAFMATPNVVNDQNWLMDSGATNHTTNDVTYLGQKDEYNG
                                              +NN    A+MA P +V + +WL DSGAT+H T+D++ L  K +YNG
Subjt:  -----------------------------------TPQANNKAPGAFMATPNVVNDQNWLMDSGATNHTTNDVTYLGQKDEYNG

A0A6J1DU77 uncharacterized protein LOC111024384 isoform X12.7e-6941.67Show/hide
Query:  MDTEGSSSSNL--NTLNAMPIATAINNTISSNPFGNPLSTVLAVKLDEKNYLLWKSMITAALHGQKLDGYVMGTIAQPPEMIQGTGANATT--LISNPAF
        M TE + +S++    +  + + T   +   +  FG+PL TVL VKLD+KNY LW+ M+ A L GQK DGYV+GT+A+PP+ +       T+  L  NP +
Subjt:  MDTEGSSSSNL--NTLNAMPIATAINNTISSNPFGNPLSTVLAVKLDEKNYLLWKSMITAALHGQKLDGYVMGTIAQPPEMIQGTGANATT--LISNPAF

Query:  DSWSTTDQSLLAWLYGSITPSVACDILNLRTSRDVWKALEDLYGATNKARITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLT
          W   DQ+LL WL+GS+TPS+ACD+++ R+SR+VWKALEDLYGAT+KARI QL+  LQ T+KN LKMSEYL  MKQ + SL LAGEPV+ N L++ VL+
Subjt:  DSWSTTDQSLLAWLYGSITPSVACDILNLRTSRDVWKALEDLYGATNKARITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLT

Query:  DLDAEYLPVACQINGKENMTWQEMHATLLAFENTLIHLNVVTNNI--DVTNASANY-------ASNRSFNQ--------RGS------------------
         L+AEYLP+ CQI GK++ +WQE+ ATL+ FENTL+ LN+V+      +++ S NY         NR F+Q        RGS                  
Subjt:  DLDAEYLPVACQINGKENMTWQEMHATLLAFENTLIHLNVVTNNI--DVTNASANY-------ASNRSFNQ--------RGS------------------

Query:  -----------------------------------TPQANNKAPGAFMATPNVVNDQNWLMDSGATNHTTNDVTYLGQKDEYNG
                                              +NN    A+MA P +V + +WL DSGAT+H T+D++ L  K +YNG
Subjt:  -----------------------------------TPQANNKAPGAFMATPNVVNDQNWLMDSGATNHTTNDVTYLGQKDEYNG

SwissProt top hitse value%identityAlignment
Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.6e-1525.07Show/hide
Query:  KLDEKNYLLWKSMITAALHGQKLDGYVMGTIAQPPEMIQGTGANATTLISNPAFDSWSTTDQSLLAWLYGSITPSVACDILNLRTSRDVWKALEDLYGAT
        KL   NYL+W   + A   G +L G++ G+   PP  I   G +A   + NP +  W   D+ + + + G+I+ SV   +    T+  +W+ L  +Y   
Subjt:  KLDEKNYLLWKSMITAALHGQKLDGYVMGTIAQPPEMIQGTGANATTLISNPAFDSWSTTDQSLLAWLYGSITPSVACDILNLRTSRDVWKALEDLYGAT

Query:  NKARITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLTDLDAEYLPVACQINGKEN-MTWQEMHATLLAFENTLIHLN-----V
        +   +TQL+    +TR +Q                LAL G+P+  +  +  VL +L  +Y PV  QI  K+   +  E+H  L+  E+ L+ LN      
Subjt:  NKARITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLTDLDAEYLPVACQINGKEN-MTWQEMHATLLAFENTLIHLN-----V

Query:  VTNNIDVTNASANYASNRSFNQRGSTPQANNK---------------------------------------------------------------APGAF
        +T N+ VT+ + N  +NR+ N RG     NN                                                                 P A 
Subjt:  VTNNIDVTNASANYASNRSFNQRGSTPQANNK---------------------------------------------------------------APGAF

Query:  MATPNVVNDQNWLMDSGATNHTTNDVTYLGQKDEYNGNE
        +A  +  N  NWL+DSGAT+H T+D   L     Y G +
Subjt:  MATPNVVNDQNWLMDSGATNHTTNDVTYLGQKDEYNGNE

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.7e-0722.71Show/hide
Query:  LDEKNYLLWKSMITAALHGQKLDGYVMGTIAQPPEMIQGTGANATTLISNPAFDSWSTTDQSLLAWLYGSITP-SVACDILNLRTSRDVWKALEDLYGAT
        ++E NY  W+ +         + G++ GT+         T AN           +W   D  +   LYG++TP       +   TSRD+W  +++ +   
Subjt:  LDEKNYLLWKSMITAALHGQKLDGYVMGTIAQPPEMIQGTGANATTLISNPAFDSWSTTDQSLLAWLYGSITP-SVACDILNLRTSRDVWKALEDLYGAT

Query:  NKARITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLTDLDAEYLPVACQINGKENMTWQEMHATLLAFENTLIHLNVVTNNID
          AR  +L   L+      +++++Y   MK+LA SL     PV++ +L+  VL  L+ ++  +   I  ++     +  AT+L  E   +   +  N   
Subjt:  NKARITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLTDLDAEYLPVACQINGKENMTWQEMHATLLAFENTLIHLNVVTNNID

Query:  VTNASAN
        V ++S++
Subjt:  VTNASAN

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)4.0e-0922.96Show/hide
Query:  LAVKLDEKNYLLWKSMITAALHGQKLDGYVMGTIAQPPEMIQGTGANATTLISNPAFDSWSTTDQSLLAWLYGSITPSVACDILNLR-TSRDVWKALEDL
        + + L++ NY +W+ +         + G++ G+ + P  M +                 W   D  +  W+YG+IT S+   I+ +  T+RD+W +LE+L
Subjt:  LAVKLDEKNYLLWKSMITAALHGQKLDGYVMGTIAQPPEMIQGTGANATTLISNPAFDSWSTTDQSLLAWLYGSITPSVACDILNLR-TSRDVWKALEDL

Query:  YGATNKARITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLTDLDAEYLPVACQINGKENM-TWQEMHATLLAFENTLIHLNVV
        +    +AR  Q +  L+ T  + L + EY   +K L+  L     P+S+  L+ ++L  L  +Y  +   I  K    ++ E  + LL  E+ L + +  
Subjt:  YGATNKARITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLTDLDAEYLPVACQINGKENM-TWQEMHATLLAFENTLIHLNVV

Query:  ----TNNIDVTNA-----------SANYASNRSFNQRGSTPQANNKAPGAFMATPNVVNDQNWLMDSGAT
            TN+  ++N               Y +N S   RG + + N    G   +     N+ NW ++   T
Subjt:  ----TNNIDVTNA-----------SANYASNRSFNQRGSTPQANNKAPGAFMATPNVVNDQNWLMDSGAT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACACCGAAGGTTCTTCCTCCTCAAATCTCAATACCCTAAATGCAATGCCAATAGCCACTGCAATTAACAACACCATATCGTCAAATCCCTTTGGCAACCCACTTAG
TACAGTATTGGCAGTCAAGTTGGATGAAAAGAATTATCTTCTCTGGAAATCCATGATTACTGCTGCTCTTCATGGACAGAAGCTTGATGGCTACGTTATGGGAACAATTG
CTCAACCTCCAGAAATGATTCAAGGTACCGGTGCAAATGCCACCACACTCATTTCAAATCCTGCGTTTGATTCATGGTCTACCACAGATCAATCGCTCCTAGCCTGGTTG
TATGGATCCATAACGCCATCTGTTGCTTGTGACATCCTCAATCTACGCACATCTAGAGATGTATGGAAAGCACTAGAAGATCTCTATGGAGCAACAAACAAGGCCCGAAT
CACCCAACTGAAAAGAAACCTTCAAATGACGAGGAAAAATCAGTTGAAAATGAGCGAATATCTTTCAACAATGAAGCAACTCGCTCACAGTCTTGCTCTAGCAGGCGAAC
CGGTAAGTGAAAATTCTCTCATCACTAATGTTCTCACGGATCTTGATGCAGAATATTTACCGGTAGCTTGCCAAATCAATGGAAAAGAAAATATGACATGGCAAGAGATG
CATGCCACGTTGCTAGCTTTTGAAAACACACTCATTCATCTGAATGTGGTAACCAACAACATTGATGTCACAAATGCATCAGCCAACTATGCCTCAAATAGGAGCTTTAA
TCAAAGAGGAAGCACACCACAAGCAAACAACAAGGCTCCGGGAGCTTTTATGGCAACCCCAAATGTTGTGAATGACCAAAATTGGCTTATGGATAGTGGGGCAACCAACC
ACACTACAAATGATGTCACCTACCTTGGACAAAAAGATGAGTACAATGGTAATGAAATGTTGACAGACAAGCAATCCGGGAGAACCATACTGGAGGGGAGGCTTAGTGAA
GGACTCTATCAGCTGGATCTTCCAAAGCCTAAAGCACATTTTTCTGCTTCAAATAAAGCTGTCAATTTTCGTCCAAGTCATCCTCAAAATCCATCATCCACAAATCATAC
TTTACTTCCACAAATGAGTCAAACAAGTCACTCCAATCCAGTTGGTACAGTCACAAAGTTAGCAACTCCTGCAGAACAAAGTTCTCCCTCAAAGTCTGTAAGTCAACAAG
AAAATTTGGTTGTTAATTCTACTGCCGCTACTAATGTTTGTGGAAATGGTATAGCTTGCTTTGATGTTGAAGATCTTGGACTATCAAGTTGTGAAATGGTATAG
mRNA sequenceShow/hide mRNA sequence
ATGGACACCGAAGGTTCTTCCTCCTCAAATCTCAATACCCTAAATGCAATGCCAATAGCCACTGCAATTAACAACACCATATCGTCAAATCCCTTTGGCAACCCACTTAG
TACAGTATTGGCAGTCAAGTTGGATGAAAAGAATTATCTTCTCTGGAAATCCATGATTACTGCTGCTCTTCATGGACAGAAGCTTGATGGCTACGTTATGGGAACAATTG
CTCAACCTCCAGAAATGATTCAAGGTACCGGTGCAAATGCCACCACACTCATTTCAAATCCTGCGTTTGATTCATGGTCTACCACAGATCAATCGCTCCTAGCCTGGTTG
TATGGATCCATAACGCCATCTGTTGCTTGTGACATCCTCAATCTACGCACATCTAGAGATGTATGGAAAGCACTAGAAGATCTCTATGGAGCAACAAACAAGGCCCGAAT
CACCCAACTGAAAAGAAACCTTCAAATGACGAGGAAAAATCAGTTGAAAATGAGCGAATATCTTTCAACAATGAAGCAACTCGCTCACAGTCTTGCTCTAGCAGGCGAAC
CGGTAAGTGAAAATTCTCTCATCACTAATGTTCTCACGGATCTTGATGCAGAATATTTACCGGTAGCTTGCCAAATCAATGGAAAAGAAAATATGACATGGCAAGAGATG
CATGCCACGTTGCTAGCTTTTGAAAACACACTCATTCATCTGAATGTGGTAACCAACAACATTGATGTCACAAATGCATCAGCCAACTATGCCTCAAATAGGAGCTTTAA
TCAAAGAGGAAGCACACCACAAGCAAACAACAAGGCTCCGGGAGCTTTTATGGCAACCCCAAATGTTGTGAATGACCAAAATTGGCTTATGGATAGTGGGGCAACCAACC
ACACTACAAATGATGTCACCTACCTTGGACAAAAAGATGAGTACAATGGTAATGAAATGTTGACAGACAAGCAATCCGGGAGAACCATACTGGAGGGGAGGCTTAGTGAA
GGACTCTATCAGCTGGATCTTCCAAAGCCTAAAGCACATTTTTCTGCTTCAAATAAAGCTGTCAATTTTCGTCCAAGTCATCCTCAAAATCCATCATCCACAAATCATAC
TTTACTTCCACAAATGAGTCAAACAAGTCACTCCAATCCAGTTGGTACAGTCACAAAGTTAGCAACTCCTGCAGAACAAAGTTCTCCCTCAAAGTCTGTAAGTCAACAAG
AAAATTTGGTTGTTAATTCTACTGCCGCTACTAATGTTTGTGGAAATGGTATAGCTTGCTTTGATGTTGAAGATCTTGGACTATCAAGTTGTGAAATGGTATAG
Protein sequenceShow/hide protein sequence
MDTEGSSSSNLNTLNAMPIATAINNTISSNPFGNPLSTVLAVKLDEKNYLLWKSMITAALHGQKLDGYVMGTIAQPPEMIQGTGANATTLISNPAFDSWSTTDQSLLAWL
YGSITPSVACDILNLRTSRDVWKALEDLYGATNKARITQLKRNLQMTRKNQLKMSEYLSTMKQLAHSLALAGEPVSENSLITNVLTDLDAEYLPVACQINGKENMTWQEM
HATLLAFENTLIHLNVVTNNIDVTNASANYASNRSFNQRGSTPQANNKAPGAFMATPNVVNDQNWLMDSGATNHTTNDVTYLGQKDEYNGNEMLTDKQSGRTILEGRLSE
GLYQLDLPKPKAHFSASNKAVNFRPSHPQNPSSTNHTLLPQMSQTSHSNPVGTVTKLATPAEQSSPSKSVSQQENLVVNSTAATNVCGNGIACFDVEDLGLSSCEMV