; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g26530 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g26530
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionDNA-directed RNA polymerase subunit beta''
Genome locationchr8:19085443..19095185
RNA-Seq ExpressionMoc08g26530
SyntenyMoc08g26530
Gene Ontology termsGO:0000166 - nucleotide binding (molecular function)
GO:0016787 - hydrolase activity (molecular function)
GO:0070569 - uridylyltransferase activity (molecular function)
GO:0140657 - ATP-dependent activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG51590.1 hypothetical protein EZV62_024114 [Acer yangbiense]2.1e-8264.58Show/hide
Query:  ESSKTNENIGVET------TVPTNLGDLQNIQPSYRLDGKNYLKWSQFVRTFLKGKGKISHIMGTGPKVWDPKFEAWDEADSMVMSWLWSSMVPVISDTC
        ES+KT+ + G+ +      + P+  G++QNIQ +YRL+GKNYLKWSQ V+TFLKGKGK+SH++GTGPK  DPK +AWDE DSMVM+WLW+S+ P ISDTC
Subjt:  ESSKTNENIGVET------TVPTNLGDLQNIQPSYRLDGKNYLKWSQFVRTFLKGKGKISHIMGTGPKVWDPKFEAWDEADSMVMSWLWSSMVPVISDTC

Query:  MFLTSAKEIWDMIKQTYSKVQDAAQIYDIKTRILSTRQGNHSNTEYATLLQNLWQELDHYQCITMKDSEDATTHKRFVEKDRIDTFLAGLNADLDVVRVQ
        MFL++AK+IWD I+QTYSKV+DAAQ+YDI+T+  ST+QGN+S TEYA  LQNLWQELDHY+CI MK S+DA T K+ +E+DR+  FLAGLN D D VRVQ
Subjt:  MFLTSAKEIWDMIKQTYSKVQDAAQIYDIKTRILSTRQGNHSNTEYATLLQNLWQELDHYQCITMKDSEDATTHKRFVEKDRIDTFLAGLNADLDVVRVQ

Query:  ILGK-ELPTLNEVIGIIRSEESRRKVMLESQTIDGSAMIA
        ILGK ELP+LN  I ++R+EE+RR VML+SQ +DGSAMI+
Subjt:  ILGK-ELPTLNEVIGIIRSEESRRKVMLESQTIDGSAMIA

TXG54514.1 hypothetical protein EZV62_019770 [Acer yangbiense]1.3e-9260.14Show/hide
Query:  SSKTNENIGVETTVPTNLGDLQNIQPSYRLDGKNYLKWSQFVRTFLKGKGKISHIMGTGPKVWDPKFEAWDEADSMVMSWLWSSMVPVISDTCMFLTSAK
        SS  NE++    + P+  G++QNIQ +YRL+GKNYLKWSQ V+TFLKGKGK+SH++GTGPK  DPKFEAWDE DSMVM+WLW+SM P ISDTCMFL++AK
Subjt:  SSKTNENIGVETTVPTNLGDLQNIQPSYRLDGKNYLKWSQFVRTFLKGKGKISHIMGTGPKVWDPKFEAWDEADSMVMSWLWSSMVPVISDTCMFLTSAK

Query:  EIWDMIKQTYSKVQDAAQIYDIKTRILSTRQGNHSNTEYATLLQNLWQELDHYQCITMKDSEDATTHKRFVEKDRIDTFLAGLNADLDVVRVQILGK-EL
        +IWD I+QTYSKV+DAAQ+YDI+T+  ST+QGN+S TEYA  LQNLWQELDHY+CI MK S+DA T K+ +E+DR+  FLAGLN D D VRVQILGK EL
Subjt:  EIWDMIKQTYSKVQDAAQIYDIKTRILSTRQGNHSNTEYATLLQNLWQELDHYQCITMKDSEDATTHKRFVEKDRIDTFLAGLNADLDVVRVQILGK-EL

Query:  PTLNEVIGIIRSEESRRKVMLESQTIDGSAMIAGMALQTLKSSTEPWIVDSGAIDHMTRTSDSFITYSPCPSNRKVSTADNTMITVAGQGD
        P+LN  I I+R+EES+    +     D             K   + W++DSGA DHMT + + F TY+PCPSNRK+S AD ++ TVAG G+
Subjt:  PTLNEVIGIIRSEESRRKVMLESQTIDGSAMIAGMALQTLKSSTEPWIVDSGAIDHMTRTSDSFITYSPCPSNRKVSTADNTMITVAGQGD

XP_017625595.1 PREDICTED: uncharacterized protein LOC108469210 [Gossypium arboreum]7.1e-8639.15Show/hide
Query:  TIESSKTNENIGVETTVPTNLGDLQNIQPSYRLDGKNYLKWSQFVRTFLKGKGKISHIMGTGPKVWDPKFEAWDEADSMVMSWLWSSMVPVISDTCMFLT
        T+++S T E   +       + + QNIQ + RL+GKNYLKWSQ VRTFLKG+GK+SH++GTGPK  DPKF+AWDE DSMVMSWLW+SM+P ISDTCM L+
Subjt:  TIESSKTNENIGVETTVPTNLGDLQNIQPSYRLDGKNYLKWSQFVRTFLKGKGKISHIMGTGPKVWDPKFEAWDEADSMVMSWLWSSMVPVISDTCMFLT

Query:  SAKEIWDMIKQTYSKVQDAAQIYDIKTRILSTRQGNHSNTEYATLLQNLWQELDHYQCITMKDSEDATTHKRFVEKDRIDTFLAGLNADLDVVRVQILGK
        ++KEIW+ +KQTYSKV+DAAQIY+IKT+I ST+QG+ S TEY+ LLQ+LWQE+DHYQCI MK SEDA   KRFVEKDRI  FLAGLN + D VRVQILGK
Subjt:  SAKEIWDMIKQTYSKVQDAAQIYDIKTRILSTRQGNHSNTEYATLLQNLWQELDHYQCITMKDSEDATTHKRFVEKDRIDTFLAGLNADLDVVRVQILGK

Query:  -ELPTLNEVIGIIRSEESRRKVMLESQTIDGSAMIAGMA-------------------------------------------------------------
         ELP+LNE I I+R+EE RR VM+E+  +D SA++                                                                 
Subjt:  -ELPTLNEVIGIIRSEESRRKVMLESQTIDGSAMIAGMA-------------------------------------------------------------

Query:  ------------------------------------LQTLKSSTE------------------------------PWIVDSGAIDHMTRTSDSFITYSPC
                                            L+ L  S E                               W++DSGA DHMT +S  F++Y+ C
Subjt:  ------------------------------------LQTLKSSTE------------------------------PWIVDSGAIDHMTRTSDSFITYSPC

Query:  PSNRKVSTADNTMITVAGQGD----------------------------------------------DQVSGKMIGLAKERNGLYYLEEIVDGDTNNSDR
         S+RK++ AD ++ITVAGQGD                                              +Q   +MIG AKE NGLYYLEE  +  +  +  
Subjt:  PSNRKVSTADNTMITVAGQGD----------------------------------------------DQVSGKMIGLAKERNGLYYLEEIVDGDTNNSDR

Query:  SISLNSKLSFASADKV
         +SL S+    + D++
Subjt:  SISLNSKLSFASADKV

XP_017630842.1 PREDICTED: uncharacterized protein LOC108473663 [Gossypium arboreum]1.9e-8344.5Show/hide
Query:  VETTVPTNLGDLQNIQPSYRLDGKNYLKWSQFVRTFLKGKGKISHIMGTGPKVWDPKFEAWDEADSMVMSWLWSSMVPVISDTCMFLTSAKEIWDMIKQT
        V  + PT+  + QNIQ +YRL+GKNYLKWSQ VRTFLKG+GK+SH++ TGPK  DPKF+AWDE DSMVMSWLW+SM+P ISDTCMFL ++KEIW+ +KQT
Subjt:  VETTVPTNLGDLQNIQPSYRLDGKNYLKWSQFVRTFLKGKGKISHIMGTGPKVWDPKFEAWDEADSMVMSWLWSSMVPVISDTCMFLTSAKEIWDMIKQT

Query:  YSKVQDAAQIYDIKTRILSTRQGNHSNTEYATLLQNLWQELDHYQCITMKDSEDATTHKRFVEKDRIDTFLAGLNADLDVVRVQILGK-ELPTLNEVIGI
        YSKVQDAAQIY+IKT++ ST+QG+ S TEY+ LLQ+LWQE+D+YQCI MK SEDA   KRFVEKDRI  FLAGLN + D VRVQ+LGK ELP+LNE I I
Subjt:  YSKVQDAAQIYDIKTRILSTRQGNHSNTEYATLLQNLWQELDHYQCITMKDSEDATTHKRFVEKDRIDTFLAGLNADLDVVRVQILGK-ELPTLNEVIGI

Query:  IRSEESRRKVMLESQTIDGSAMI-----------------------------------------------------------------------------
        +R+EE RR VM+E+  +D SA++                                                                             
Subjt:  IRSEESRRKVMLESQTIDGSAMI-----------------------------------------------------------------------------

Query:  ----------------------------------------AGMALQTLKSSTE----------PWIVDSGAIDHMTRTSDSFITYSPCPSNRKVSTADNT
                                                  +A   + SS +           W++DS A DHMT +S  FI+Y+PCPS+RK+  AD +
Subjt:  ----------------------------------------AGMALQTLKSSTE----------PWIVDSGAIDHMTRTSDSFITYSPCPSNRKVSTADNT

Query:  MITVAGQGD
        +ITVAGQGD
Subjt:  MITVAGQGD

XP_022768069.1 UDP-sugar pyrophosphorylase-like isoform X5 [Durio zibethinus]2.8e-8266.23Show/hide
Query:  KTNENIGVETTVPTNLGDLQNIQPSYRLDGKNYLKWSQFVRTFLKGKGKISHIMGTGPKVWDPKFEAWDEADSMVMSWLWSSMVPVISDTCMFLTSAKEI
        K  E IG  T +  + G+LQNIQ +YRL+GKNYLKW Q V+TFLKGKGK+SH++GTGP+  DPKFEAWDE DSMVMSWLW+SM P ISDTCMFL++AK+I
Subjt:  KTNENIGVETTVPTNLGDLQNIQPSYRLDGKNYLKWSQFVRTFLKGKGKISHIMGTGPKVWDPKFEAWDEADSMVMSWLWSSMVPVISDTCMFLTSAKEI

Query:  WDMIKQTYSKVQDAAQIYDIKTRILSTRQGNHSNTEYATLLQNLWQELDHYQCITMKDSEDATTHKRFVEKDRIDTFLAGLNADLDVVRVQILGK-ELPT
        W+ I+QTYSKV+DA Q+Y++K +  + +QGN S TEYA LL+NLWQE+DHYQCI MK SEDATT K+F+EKDR+  FLAGLN + D VRVQILGK +LP+
Subjt:  WDMIKQTYSKVQDAAQIYDIKTRILSTRQGNHSNTEYATLLQNLWQELDHYQCITMKDSEDATTHKRFVEKDRIDTFLAGLNADLDVVRVQILGK-ELPT

Query:  LNEVIGIIRSEESRRKVMLESQTIDGSAMIA
        LNEVI ++R+EESRR VML+S  ++GSAM++
Subjt:  LNEVIGIIRSEESRRKVMLESQTIDGSAMIA

TrEMBL top hitse value%identityAlignment
A0A1U8HQM6 uncharacterized protein LOC1078886302.2e-8839.37Show/hide
Query:  MTKSTIESSKTNE--NIGVETTVPTNLGDLQNIQPSYRLDGKNYLKWSQFVRTFLKGKGKISHIMGTGPKVWDPKFEAWDEADSMVMSWLWSSMVPVISD
        MTK T++ S T +   + V  + PT+  + QNIQ +YRL+GKNYLKWSQ VRTFLKG+GK+SH++ TGPK  DPKF+AWDE DSMVMSWLW+SM+P ISD
Subjt:  MTKSTIESSKTNE--NIGVETTVPTNLGDLQNIQPSYRLDGKNYLKWSQFVRTFLKGKGKISHIMGTGPKVWDPKFEAWDEADSMVMSWLWSSMVPVISD

Query:  TCMFLTSAKEIWDMIKQTYSKVQDAAQIYDIKTRILSTRQGNHSNTEYATLLQNLWQELDHYQCITMKDSEDATTHKRFVEKDRIDTFLAGLNADLDVVR
        TCMFL ++KEIW+ +KQTYSKVQDAAQIY+IKT++ ST+QG+ S TEY+ LLQ+LWQE+D+YQCI MK SEDA   KRFVEKDRI  FLAGLN + D VR
Subjt:  TCMFLTSAKEIWDMIKQTYSKVQDAAQIYDIKTRILSTRQGNHSNTEYATLLQNLWQELDHYQCITMKDSEDATTHKRFVEKDRIDTFLAGLNADLDVVR

Query:  VQILGK-ELPTLNEVIGIIRSEESRRKVMLESQTIDGSAMI-----------------------------------------------------------
        VQ+LGK ELP+LNE I I+R+EE RR VM+E+  +D SA++                                                           
Subjt:  VQILGK-ELPTLNEVIGIIRSEESRRKVMLESQTIDGSAMI-----------------------------------------------------------

Query:  ----------------------------------------------------------AGMALQTLKSSTE----------PWIVDSGAIDHMTRTSDSF
                                                                    +A   + SS +           W++DSGA DHMT +S  F
Subjt:  ----------------------------------------------------------AGMALQTLKSSTE----------PWIVDSGAIDHMTRTSDSF

Query:  ITYSPCPSNRKVSTADNTMITVAGQGD----------------------------------------------DQVSGKMIGLAKERNGLYYLEEIVDGD
        ++Y+PCPS+RK++ AD ++ITVAGQGD                                              +Q + +MIG AKE NGLYYLEE     
Subjt:  ITYSPCPSNRKVSTADNTMITVAGQGD----------------------------------------------DQVSGKMIGLAKERNGLYYLEEIVDGD

Query:  TNNSDRSISLNSKLSFASADKVWCTQEDIIISMIKL
          +S    +LNS  S   ++ +   Q+ I +  ++L
Subjt:  TNNSDRSISLNSKLSFASADKVWCTQEDIIISMIKL

A0A1U8HUX8 uncharacterized protein LOC1078874294.2e-8457.76Show/hide
Query:  TIESSKTNENIGVETTVPTNLGDLQNIQPSYRLDGKNYLKWSQFVRTFLKGKGKISHIMGTGPKVWDPKFEAWDEADSMVMSWLWSSMVPVISDTCMFLT
        T+++S T E   +       + + QNIQ +YRL+GKNYLKWSQ VRTFLKG+GK+SH++GTGPK  DPKF+AWDE DSMVMSWLW+SM+P ISDTCMFL+
Subjt:  TIESSKTNENIGVETTVPTNLGDLQNIQPSYRLDGKNYLKWSQFVRTFLKGKGKISHIMGTGPKVWDPKFEAWDEADSMVMSWLWSSMVPVISDTCMFLT

Query:  SAKEIWDMIKQTYSKVQDAAQIYDIKTRILSTRQGNHSNTEYATLLQNLWQELDHYQCITMKDSEDATTHKRFVEKDRIDTFLAGLNADLDVVRVQILGK
        ++KEIW+ +KQTYSKV+DAAQIY+IKT+I ST+QG+ S TEY+ LLQ+LWQE+DHYQCI MK SEDA   KRFVEKDRI  FLAGLN + D VRVQILGK
Subjt:  SAKEIWDMIKQTYSKVQDAAQIYDIKTRILSTRQGNHSNTEYATLLQNLWQELDHYQCITMKDSEDATTHKRFVEKDRIDTFLAGLNADLDVVRVQILGK

Query:  -ELPTLNEVIGIIRSEESRRKVMLESQTIDGSAMIAGMALQTLKSSTEPWIVDSGAIDHMTRTSDSFITYSPCPSNR
         ELP LNE I I+R+EE RR VM+E+  +D SA++  +  +      +P   D+  I+     +   +  + C   R
Subjt:  -ELPTLNEVIGIIRSEESRRKVMLESQTIDGSAMIAGMALQTLKSSTEPWIVDSGAIDHMTRTSDSFITYSPCPSNR

A0A5C7HC24 DNA helicase6.5e-9360.14Show/hide
Query:  SSKTNENIGVETTVPTNLGDLQNIQPSYRLDGKNYLKWSQFVRTFLKGKGKISHIMGTGPKVWDPKFEAWDEADSMVMSWLWSSMVPVISDTCMFLTSAK
        SS  NE++    + P+  G++QNIQ +YRL+GKNYLKWSQ V+TFLKGKGK+SH++GTGPK  DPKFEAWDE DSMVM+WLW+SM P ISDTCMFL++AK
Subjt:  SSKTNENIGVETTVPTNLGDLQNIQPSYRLDGKNYLKWSQFVRTFLKGKGKISHIMGTGPKVWDPKFEAWDEADSMVMSWLWSSMVPVISDTCMFLTSAK

Query:  EIWDMIKQTYSKVQDAAQIYDIKTRILSTRQGNHSNTEYATLLQNLWQELDHYQCITMKDSEDATTHKRFVEKDRIDTFLAGLNADLDVVRVQILGK-EL
        +IWD I+QTYSKV+DAAQ+YDI+T+  ST+QGN+S TEYA  LQNLWQELDHY+CI MK S+DA T K+ +E+DR+  FLAGLN D D VRVQILGK EL
Subjt:  EIWDMIKQTYSKVQDAAQIYDIKTRILSTRQGNHSNTEYATLLQNLWQELDHYQCITMKDSEDATTHKRFVEKDRIDTFLAGLNADLDVVRVQILGK-EL

Query:  PTLNEVIGIIRSEESRRKVMLESQTIDGSAMIAGMALQTLKSSTEPWIVDSGAIDHMTRTSDSFITYSPCPSNRKVSTADNTMITVAGQGD
        P+LN  I I+R+EES+    +     D             K   + W++DSGA DHMT + + F TY+PCPSNRK+S AD ++ TVAG G+
Subjt:  PTLNEVIGIIRSEESRRKVMLESQTIDGSAMIAGMALQTLKSSTEPWIVDSGAIDHMTRTSDSFITYSPCPSNRKVSTADNTMITVAGQGD

A0A6P4P1F3 uncharacterized protein LOC1084692103.4e-8639.15Show/hide
Query:  TIESSKTNENIGVETTVPTNLGDLQNIQPSYRLDGKNYLKWSQFVRTFLKGKGKISHIMGTGPKVWDPKFEAWDEADSMVMSWLWSSMVPVISDTCMFLT
        T+++S T E   +       + + QNIQ + RL+GKNYLKWSQ VRTFLKG+GK+SH++GTGPK  DPKF+AWDE DSMVMSWLW+SM+P ISDTCM L+
Subjt:  TIESSKTNENIGVETTVPTNLGDLQNIQPSYRLDGKNYLKWSQFVRTFLKGKGKISHIMGTGPKVWDPKFEAWDEADSMVMSWLWSSMVPVISDTCMFLT

Query:  SAKEIWDMIKQTYSKVQDAAQIYDIKTRILSTRQGNHSNTEYATLLQNLWQELDHYQCITMKDSEDATTHKRFVEKDRIDTFLAGLNADLDVVRVQILGK
        ++KEIW+ +KQTYSKV+DAAQIY+IKT+I ST+QG+ S TEY+ LLQ+LWQE+DHYQCI MK SEDA   KRFVEKDRI  FLAGLN + D VRVQILGK
Subjt:  SAKEIWDMIKQTYSKVQDAAQIYDIKTRILSTRQGNHSNTEYATLLQNLWQELDHYQCITMKDSEDATTHKRFVEKDRIDTFLAGLNADLDVVRVQILGK

Query:  -ELPTLNEVIGIIRSEESRRKVMLESQTIDGSAMIAGMA-------------------------------------------------------------
         ELP+LNE I I+R+EE RR VM+E+  +D SA++                                                                 
Subjt:  -ELPTLNEVIGIIRSEESRRKVMLESQTIDGSAMIAGMA-------------------------------------------------------------

Query:  ------------------------------------LQTLKSSTE------------------------------PWIVDSGAIDHMTRTSDSFITYSPC
                                            L+ L  S E                               W++DSGA DHMT +S  F++Y+ C
Subjt:  ------------------------------------LQTLKSSTE------------------------------PWIVDSGAIDHMTRTSDSFITYSPC

Query:  PSNRKVSTADNTMITVAGQGD----------------------------------------------DQVSGKMIGLAKERNGLYYLEEIVDGDTNNSDR
         S+RK++ AD ++ITVAGQGD                                              +Q   +MIG AKE NGLYYLEE  +  +  +  
Subjt:  PSNRKVSTADNTMITVAGQGD----------------------------------------------DQVSGKMIGLAKERNGLYYLEEIVDGDTNNSDR

Query:  SISLNSKLSFASADKV
         +SL S+    + D++
Subjt:  SISLNSKLSFASADKV

A0A6P4P5C3 uncharacterized protein LOC1084736639.4e-8444.5Show/hide
Query:  VETTVPTNLGDLQNIQPSYRLDGKNYLKWSQFVRTFLKGKGKISHIMGTGPKVWDPKFEAWDEADSMVMSWLWSSMVPVISDTCMFLTSAKEIWDMIKQT
        V  + PT+  + QNIQ +YRL+GKNYLKWSQ VRTFLKG+GK+SH++ TGPK  DPKF+AWDE DSMVMSWLW+SM+P ISDTCMFL ++KEIW+ +KQT
Subjt:  VETTVPTNLGDLQNIQPSYRLDGKNYLKWSQFVRTFLKGKGKISHIMGTGPKVWDPKFEAWDEADSMVMSWLWSSMVPVISDTCMFLTSAKEIWDMIKQT

Query:  YSKVQDAAQIYDIKTRILSTRQGNHSNTEYATLLQNLWQELDHYQCITMKDSEDATTHKRFVEKDRIDTFLAGLNADLDVVRVQILGK-ELPTLNEVIGI
        YSKVQDAAQIY+IKT++ ST+QG+ S TEY+ LLQ+LWQE+D+YQCI MK SEDA   KRFVEKDRI  FLAGLN + D VRVQ+LGK ELP+LNE I I
Subjt:  YSKVQDAAQIYDIKTRILSTRQGNHSNTEYATLLQNLWQELDHYQCITMKDSEDATTHKRFVEKDRIDTFLAGLNADLDVVRVQILGK-ELPTLNEVIGI

Query:  IRSEESRRKVMLESQTIDGSAMI-----------------------------------------------------------------------------
        +R+EE RR VM+E+  +D SA++                                                                             
Subjt:  IRSEESRRKVMLESQTIDGSAMI-----------------------------------------------------------------------------

Query:  ----------------------------------------AGMALQTLKSSTE----------PWIVDSGAIDHMTRTSDSFITYSPCPSNRKVSTADNT
                                                  +A   + SS +           W++DS A DHMT +S  FI+Y+PCPS+RK+  AD +
Subjt:  ----------------------------------------AGMALQTLKSSTE----------PWIVDSGAIDHMTRTSDSFITYSPCPSNRKVSTADNT

Query:  MITVAGQGD
        +ITVAGQGD
Subjt:  MITVAGQGD

SwissProt top hitse value%identityAlignment
A1XGM8 DNA-directed RNA polymerase subunit beta''4.8e-4593.88Show/hide
Query:  QDGMSNVFSPGELIGLLRAERTGRALEEAICYRAVLLGITKASLNTQSFISEASFQETARVLAKAALRGRIDWLRGLKENVVLGGMIPVGTGFRELAH
        +DGMSNVFSPGELIGLLRAERTGRALEEAICYRAVLLGIT+ASLNTQSFISEASFQETARVLAKAALRGRIDWL+GLKENVVLGGMIPVGTGF+ L H
Subjt:  QDGMSNVFSPGELIGLLRAERTGRALEEAICYRAVLLGITKASLNTQSFISEASFQETARVLAKAALRGRIDWLRGLKENVVLGGMIPVGTGFRELAH

A6MMT4 DNA-directed RNA polymerase subunit beta''2.2e-4592.93Show/hide
Query:  QDGMSNVFSPGELIGLLRAERTGRALEEAICYRAVLLGITKASLNTQSFISEASFQETARVLAKAALRGRIDWLRGLKENVVLGGMIPVGTGFRELAHH
        +DGMSNVFSPGELIGLLRAERTGRALEEAICYRAVLLGIT+ASLNTQSFISEASFQET RVLAKAALRGRIDWL+GLKENVVLGGMIPVGTGF+ L HH
Subjt:  QDGMSNVFSPGELIGLLRAERTGRALEEAICYRAVLLGITKASLNTQSFISEASFQETARVLAKAALRGRIDWLRGLKENVVLGGMIPVGTGFRELAHH

Q09FX1 DNA-directed RNA polymerase subunit beta''4.8e-4593.88Show/hide
Query:  QDGMSNVFSPGELIGLLRAERTGRALEEAICYRAVLLGITKASLNTQSFISEASFQETARVLAKAALRGRIDWLRGLKENVVLGGMIPVGTGFRELAH
        +DGMSNVFSPGELIGLLRAERTGRALEEAICYRAVLLGIT+ASLNTQSFISEASFQETARVLAKAALRGRIDWL+GLKENVVLGGMIPVGTGF+ L H
Subjt:  QDGMSNVFSPGELIGLLRAERTGRALEEAICYRAVLLGITKASLNTQSFISEASFQETARVLAKAALRGRIDWLRGLKENVVLGGMIPVGTGFRELAH

Q09G56 DNA-directed RNA polymerase subunit beta''1.4e-4492.86Show/hide
Query:  QDGMSNVFSPGELIGLLRAERTGRALEEAICYRAVLLGITKASLNTQSFISEASFQETARVLAKAALRGRIDWLRGLKENVVLGGMIPVGTGFRELAH
        +DGMSNVFSPGELIGL RAERTGRALEEAICYRAVLLGIT+ASLNTQSFISEASFQETARVLAKAALRGRIDWL+GLKENVVLGGMIPVGTGF+ L H
Subjt:  QDGMSNVFSPGELIGLLRAERTGRALEEAICYRAVLLGITKASLNTQSFISEASFQETARVLAKAALRGRIDWLRGLKENVVLGGMIPVGTGFRELAH

Q4VZP3 DNA-directed RNA polymerase subunit beta''2.3e-4798.98Show/hide
Query:  QDGMSNVFSPGELIGLLRAERTGRALEEAICYRAVLLGITKASLNTQSFISEASFQETARVLAKAALRGRIDWLRGLKENVVLGGMIPVGTGFRELAH
        +DGMSNVFSPGELIGLLRAERTGRALEEAICYRAVLLGITKASLNTQSFISEASFQETARVLAKAALRGRIDWLRGLKENVVLGGMIPVGTGFRELAH
Subjt:  QDGMSNVFSPGELIGLLRAERTGRALEEAICYRAVLLGITKASLNTQSFISEASFQETARVLAKAALRGRIDWLRGLKENVVLGGMIPVGTGFRELAH

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.5e-1729.56Show/hide
Query:  DGKNYLKWSQFVRTFLKGKGKISHIMGTGPK--VWDPKFEAWDEADSMVMSWLWSSMVPVISDTCMFLTSAKEIWDMIKQTYSKVQDAAQIYDIKTRILS
        D  NY+ W    R+FL+   K   I GT PK   + P ++ W++ ++MVM WL +SM   + ++ M+  +A ++W+ +++ +    D  +IY ++ R+ +
Subjt:  DGKNYLKWSQFVRTFLKGKGKISHIMGTGPK--VWDPKFEAWDEADSMVMSWLWSSMVPVISDTCMFLTSAKEIWDMIKQTYSKVQDAAQIYDIKTRILS

Query:  TRQGNHSNTEYATLLQNLWQELDHY-----------QCITMKDSEDATTHKRFVEKDRIDTFLAG--LNADLDVVRVQIL-GKELPTLNEVIGIIRSEES
         RQG  S  EY   L  +W EL  Y            C   K +E+A       EK++   FL G  LN   + V  +I+  K  P+L+E   +++  ES
Subjt:  TRQGNHSNTEYATLLQNLWQELDHY-----------QCITMKDSEDATTHKRFVEKDRIDTFLAG--LNADLDVVRVQIL-GKELPTLNEVIGIIRSEES

Query:  RRK
          K
Subjt:  RRK

AT4G35800.1 RNA polymerase II large subunit1.6e-0636.84Show/hide
Query:  EAICYRAVLLGITKASLNTQSF--ISEASFQETARVLAKAALRGRIDWLRGLKENVVLGGMIPVGTGFRELAHHEE
        + + YR  L+ IT+  +N      +   SF+ET  +L  AA     D LRG+ EN++LG + P+GTG  EL  ++E
Subjt:  EAICYRAVLLGITKASLNTQSF--ISEASFQETARVLAKAALRGRIDWLRGLKENVVLGGMIPVGTGFRELAHHEE

ATCG00170.1 DNA-directed RNA polymerase family protein1.0e-4290.53Show/hide
Query:  QDGMSNVFSPGELIGLLRAERTGRALEEAICYRAVLLGITKASLNTQSFISEASFQETARVLAKAALRGRIDWLRGLKENVVLGGMIPVGTGFRE
        ++GMSNVF PGELIGLLRAERTGRALEEAICYRAVLLGIT+ASLNTQSFISEASFQETARVLAKAALRGRIDWL+GLKENVVLGG+IP GTGF +
Subjt:  QDGMSNVFSPGELIGLLRAERTGRALEEAICYRAVLLGITKASLNTQSFISEASFQETARVLAKAALRGRIDWLRGLKENVVLGGMIPVGTGFRE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCAAGAGCACCATTGAGTCTTCAAAGACCAATGAAAATATTGGAGCTGAAACTACTGTTTCCACCAACCTAGGAGATTTACAAAATATTCAACCTAGCTAC
CGTTTGGATGGAAAAAATTATTTGAAATGGTCTCAATTTGTAAGAACTTTTCTGAAAGGAAAAGGAAAAATCAGTCATACTATGGGAACCAGACCAAAAGTTGGG
GACCCAAAATTTGAGGCATGGGATGAAGCTGATTCTATGGTAATGTCATGGTTATGGAGTTCAATGGTTCCTGTCATTAGTGACACCTGTATGTTCTTAACTCCT
TACAACTTTCTGTTTCTGACCTTCAACTTTGTGACCATGACCAAGAGCACCATTGAGTCTTCAAAGACCAATGAAAATATTGGAGTTGAAACTACTGTTCCCACC
AACCTAGGAGATTTACAAAATATTCAACCTAGCTACCGTTTGGATGGAAAAAATTATTTGAAATGGTCTCAATTTGTAAGAACTTTTCTGAAAGGAAAAGGAAAA
ATCAGTCATATTATGGGAACCGGACCAAAAGTTTGGGACCCAAAATTTGAGGCATGGGATGAAGCTGATTCTATGGTAATGTCATGGTTATGGAGTTCAATGGTT
CCTGTCATTAGTGACACCTGTATGTTCTTAACTTCAGCCAAGGAAATTTGGGACATGATTAAGCAAACTTACTCCAAGGTACAAGATGCAGCTCAAATCTATGAC
ATCAAGACTCGGATTTTATCTACTAGACAAGGTAACCATTCTAATACAGAATATGCAACTTTGCTGCAAAATCTTTGGCAAGAACTAGATCATTATCAATGCATT
ACGATGAAAGATAGTGAGGATGCTACCACACATAAAAGGTTTGTTGAGAAAGATAGAATTGATACTTTTCTTGCGGGTTTGAATGCTGATCTTGATGTTGTTAGA
GTACAAATACTTGGCAAAGAGTTACCAACCTTGAATGAAGTTATAGGGATCATCAGAAGTGAGGAAAGTAGACGAAAAGTGATGCTTGAATCGCAAACCATCGAT
GGATCAGCAATGATAGCAGGTATGGCCCTTCAAACCCTTAAATCTAGTACAGAACCATGGATAGTAGATTCAGGAGCAATTGATCACATGACAAGAACTAGTGAT
AGTTTTATTACTTATAGTCCTTGTCCGAGTAATAGAAAAGTTTCCACTGCAGATAACACTATGATAACTGTAGCAGGACAGGGGGATGACCAAGTCAGTGGGAAG
ATGATTGGGCTTGCTAAAGAAAGAAATGGGTTATATTACTTAGAGGAAATAGTTGATGGTGATACCAATAATAGTGATCGATCTATCTCTTTAAATTCTAAATTA
AGTTTTGCTTCCGCTGATAAAGTTTGGTGTACTCAAGAAGACATAATCATCTCTATGATCAAACTGTGCAAGAATCACAATCTGACAAAATCAATGGTAGCTCCA
CTGATGAGGTTTGAGTTGTCGGCACCTGGAGAGGAGGACTTGCCCGTAGCTATTCGGAAGGGTGTTCAAGATGGAATGTCTAATGTTTTTTCACCTGGAGAACTC
ATTGGGTTGTTGCGAGCGGAACGAACAGGGCGTGCTTTGGAAGAAGCGATCTGTTACCGAGCCGTATTATTGGGAATAACGAAAGCATCTCTAAATACTCAAAGT
TTCATATCGGAAGCAAGTTTTCAAGAAACTGCTCGAGTTTTAGCAAAAGCTGCCCTCCGAGGTCGTATCGATTGGTTGAGAGGTCTGAAAGAGAACGTTGTTCTA
GGAGGAATGATACCCGTTGGTACCGGATTCAGAGAATTAGCGCACCACGAAGAAGCTGCTGGTATAGGGCATGAGACCGAAGGAGGTTGCCTTCGAGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGACCAAGAGCACCATTGAGTCTTCAAAGACCAATGAAAATATTGGAGCTGAAACTACTGTTTCCACCAACCTAGGAGATTTACAAAATATTCAACCTAGCTAC
CGTTTGGATGGAAAAAATTATTTGAAATGGTCTCAATTTGTAAGAACTTTTCTGAAAGGAAAAGGAAAAATCAGTCATACTATGGGAACCAGACCAAAAGTTGGG
GACCCAAAATTTGAGGCATGGGATGAAGCTGATTCTATGGTAATGTCATGGTTATGGAGTTCAATGGTTCCTGTCATTAGTGACACCTGTATGTTCTTAACTCCT
TACAACTTTCTGTTTCTGACCTTCAACTTTGTGACCATGACCAAGAGCACCATTGAGTCTTCAAAGACCAATGAAAATATTGGAGTTGAAACTACTGTTCCCACC
AACCTAGGAGATTTACAAAATATTCAACCTAGCTACCGTTTGGATGGAAAAAATTATTTGAAATGGTCTCAATTTGTAAGAACTTTTCTGAAAGGAAAAGGAAAA
ATCAGTCATATTATGGGAACCGGACCAAAAGTTTGGGACCCAAAATTTGAGGCATGGGATGAAGCTGATTCTATGGTAATGTCATGGTTATGGAGTTCAATGGTT
CCTGTCATTAGTGACACCTGTATGTTCTTAACTTCAGCCAAGGAAATTTGGGACATGATTAAGCAAACTTACTCCAAGGTACAAGATGCAGCTCAAATCTATGAC
ATCAAGACTCGGATTTTATCTACTAGACAAGGTAACCATTCTAATACAGAATATGCAACTTTGCTGCAAAATCTTTGGCAAGAACTAGATCATTATCAATGCATT
ACGATGAAAGATAGTGAGGATGCTACCACACATAAAAGGTTTGTTGAGAAAGATAGAATTGATACTTTTCTTGCGGGTTTGAATGCTGATCTTGATGTTGTTAGA
GTACAAATACTTGGCAAAGAGTTACCAACCTTGAATGAAGTTATAGGGATCATCAGAAGTGAGGAAAGTAGACGAAAAGTGATGCTTGAATCGCAAACCATCGAT
GGATCAGCAATGATAGCAGGTATGGCCCTTCAAACCCTTAAATCTAGTACAGAACCATGGATAGTAGATTCAGGAGCAATTGATCACATGACAAGAACTAGTGAT
AGTTTTATTACTTATAGTCCTTGTCCGAGTAATAGAAAAGTTTCCACTGCAGATAACACTATGATAACTGTAGCAGGACAGGGGGATGACCAAGTCAGTGGGAAG
ATGATTGGGCTTGCTAAAGAAAGAAATGGGTTATATTACTTAGAGGAAATAGTTGATGGTGATACCAATAATAGTGATCGATCTATCTCTTTAAATTCTAAATTA
AGTTTTGCTTCCGCTGATAAAGTTTGGTGTACTCAAGAAGACATAATCATCTCTATGATCAAACTGTGCAAGAATCACAATCTGACAAAATCAATGGTAGCTCCA
CTGATGAGGTTTGAGTTGTCGGCACCTGGAGAGGAGGACTTGCCCGTAGCTATTCGGAAGGGTGTTCAAGATGGAATGTCTAATGTTTTTTCACCTGGAGAACTC
ATTGGGTTGTTGCGAGCGGAACGAACAGGGCGTGCTTTGGAAGAAGCGATCTGTTACCGAGCCGTATTATTGGGAATAACGAAAGCATCTCTAAATACTCAAAGT
TTCATATCGGAAGCAAGTTTTCAAGAAACTGCTCGAGTTTTAGCAAAAGCTGCCCTCCGAGGTCGTATCGATTGGTTGAGAGGTCTGAAAGAGAACGTTGTTCTA
GGAGGAATGATACCCGTTGGTACCGGATTCAGAGAATTAGCGCACCACGAAGAAGCTGCTGGTATAGGGCATGAGACCGAAGGAGGTTGCCTTCGAGAATAA
Protein sequenceShow/hide protein sequence
MTKSTIESSKTNENIGAETTVSTNLGDLQNIQPSYRLDGKNYLKWSQFVRTFLKGKGKISHTMGTRPKVGDPKFEAWDEADSMVMSWLWSSMVPVISDTCMFLTP
YNFLFLTFNFVTMTKSTIESSKTNENIGVETTVPTNLGDLQNIQPSYRLDGKNYLKWSQFVRTFLKGKGKISHIMGTGPKVWDPKFEAWDEADSMVMSWLWSSMV
PVISDTCMFLTSAKEIWDMIKQTYSKVQDAAQIYDIKTRILSTRQGNHSNTEYATLLQNLWQELDHYQCITMKDSEDATTHKRFVEKDRIDTFLAGLNADLDVVR
VQILGKELPTLNEVIGIIRSEESRRKVMLESQTIDGSAMIAGMALQTLKSSTEPWIVDSGAIDHMTRTSDSFITYSPCPSNRKVSTADNTMITVAGQGDDQVSGK
MIGLAKERNGLYYLEEIVDGDTNNSDRSISLNSKLSFASADKVWCTQEDIIISMIKLCKNHNLTKSMVAPLMRFELSAPGEEDLPVAIRKGVQDGMSNVFSPGEL
IGLLRAERTGRALEEAICYRAVLLGITKASLNTQSFISEASFQETARVLAKAALRGRIDWLRGLKENVVLGGMIPVGTGFRELAHHEEAAGIGHETEGGCLRE