; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g17110 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g17110
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr2:12850909..12867385
RNA-Seq ExpressionMoc02g17110
SyntenyMoc02g17110
Gene Ontology termsGO:0044238 - primary metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KYP35727.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]1.4e-11565.15Show/hide
Query:  LNGANFKDWKEDILIVLGCMDLDLALRVDHPTSTKENPNKVEIEKWDR------------------GSIVEGTNAKGFLKEMEQYFTKNDKAEASTLMAK
        LNG NFK WKE + I+LGCMDLDLALR + PT   ENP++ ++EKW+R                  GSI E  NAKGFL  +EQYFT N+K +AS+L+AK
Subjt:  LNGANFKDWKEDILIVLGCMDLDLALRVDHPTSTKENPNKVEIEKWDR------------------GSIVEGTNAKGFLKEMEQYFTKNDKAEASTLMAK

Query:  LTSSRYIGEGNIREYIMQMSNVATKLKTLKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWSLNELIFHCVQEEERMQREKTESVHMASTSKSVKRK
        L S RY G+GNIREYIM+MSN+A+KLK LKLE+S+D LVHLVL SLP  +  F+VSYNTQKDKW+LNELI HCVQEEER QREKTES H+AS+S++ KRK
Subjt:  LTSSRYIGEGNIREYIMQMSNVATKLKTLKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWSLNELIFHCVQEEERMQREKTESVHMASTSKSVKRK

Query:  RVNNAAESSKQ-KKEKKHDSGPVCFFCKKTGHMKKQCAKYAAWLLKKGMFLSLVCSEINLASVPMHTWWVDSGATTHISVSMQGCIWSRPPSDAEAFIYV
           +  E + Q KK KK ++ P CFFCKK GHMKK+C KYA+W +KKG FLS VCSE+NLA VP  TWWVDSGATTHISVSMQGC+WSRPPSD E FIYV
Subjt:  RVNNAAESSKQ-KKEKKHDSGPVCFFCKKTGHMKKQCAKYAAWLLKKGMFLSLVCSEINLASVPMHTWWVDSGATTHISVSMQGCIWSRPPSDAEAFIYV

Query:  VDGNRAKVEAIGTFRLSLGTDFHLDLFEEY
         DGN+  VEAIGTFRL L T FHLDLFE +
Subjt:  VDGNRAKVEAIGTFRLSLGTDFHLDLFEEY

KYP39716.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]5.7e-11765.76Show/hide
Query:  LNGANFKDWKEDILIVLGCMDLDLALRVDHPTSTKENPNKVEIEKWD------------------RGSIVEGTNAKGFLKEMEQYFTKNDKAEASTLMAK
        LNG NFK WKE + I+LGCMDLDLALR + PT   ENP++ ++EKW+                  RGSI E  NAKGFL  +EQYFT N+KA+AS+L+AK
Subjt:  LNGANFKDWKEDILIVLGCMDLDLALRVDHPTSTKENPNKVEIEKWD------------------RGSIVEGTNAKGFLKEMEQYFTKNDKAEASTLMAK

Query:  LTSSRYIGEGNIREYIMQMSNVATKLKTLKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWSLNELIFHCVQEEERMQREKTESVHMASTSKSVKRK
        L S RY G+GNIREYIM+MSN+A+KLK LKLE+S+D LVHLVL SLP  +  F+VSYNTQKDKW+LNELI HCVQEEER QREKTES H+AS+S++ KRK
Subjt:  LTSSRYIGEGNIREYIMQMSNVATKLKTLKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWSLNELIFHCVQEEERMQREKTESVHMASTSKSVKRK

Query:  RVNNAAE-SSKQKKEKKHDSGPVCFFCKKTGHMKKQCAKYAAWLLKKGMFLSLVCSEINLASVPMHTWWVDSGATTHISVSMQGCIWSRPPSDAEAFIYV
           +  E +S+QKK KK ++ P CFFCKK GHMKK+C KYA+W +KKG FLS VCSE+NLA VP  TWWVDSGATTHISVSMQGC+WSRPPSD E FIYV
Subjt:  RVNNAAE-SSKQKKEKKHDSGPVCFFCKKTGHMKKQCAKYAAWLLKKGMFLSLVCSEINLASVPMHTWWVDSGATTHISVSMQGCIWSRPPSDAEAFIYV

Query:  VDGNRAKVEAIGTFRLSLGTDFHLDLFEEY
         DGN+  VEAIGTFRL L T FHLDLFE +
Subjt:  VDGNRAKVEAIGTFRLSLGTDFHLDLFEEY

KYP40660.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Cajanus cajan]1.3e-11362.03Show/hide
Query:  IIVKVANSDNMSTQVNNIPKLNGANFKDWKEDILIVLGCMDLDLALRVDHPTSTKENPNKVEIEKWDRG------------------SIVEGTNAKGFLK
        I V VA++ N+S Q+N IP LNG NFK WKE + I+LGCMDLDLALR + PT   ENP++ ++EKW+R                   SI E  NAKG L 
Subjt:  IIVKVANSDNMSTQVNNIPKLNGANFKDWKEDILIVLGCMDLDLALRVDHPTSTKENPNKVEIEKWDRG------------------SIVEGTNAKGFLK

Query:  EMEQYFTKNDKAEASTLMAKLTSSRYIGEGNIREYIMQMSNVATKLKTLKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWSLNELIFHCVQEEERM
         +EQYFT N+KA+AS+L+AKL S R  G+GNIREYIM+MSN+A+KLK LKLE+S+D LVHLVL SLP  +  F+VSYNTQKDKW+LNELI HCVQEEER 
Subjt:  EMEQYFTKNDKAEASTLMAKLTSSRYIGEGNIREYIMQMSNVATKLKTLKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWSLNELIFHCVQEEERM

Query:  QREKTESVHMASTSKSVKRKRVNNAAESSKQ-KKEKKHDSGPVCFFCKKTGHMKKQCAKYAAWLLKKGMFLSLVCSEINLASVPMHTWWVDSGATTHISV
        QREKTES H+AS+S++ KRK   +  E + Q KK KK ++ P CFFCKK GHMKK+C KY +W +KKG F S VCSE+NL  VP  TWWVDSGATTHISV
Subjt:  QREKTESVHMASTSKSVKRKRVNNAAESSKQ-KKEKKHDSGPVCFFCKKTGHMKKQCAKYAAWLLKKGMFLSLVCSEINLASVPMHTWWVDSGATTHISV

Query:  SMQGCIWSRPPSDAEAFIYVVDGNRAKVEAIGTFRLSLGTDFHLD
        SMQGC+W RPP D E FIYV DGN+  VEAIGTFRL L T FHL+
Subjt:  SMQGCIWSRPPSDAEAFIYVVDGNRAKVEAIGTFRLSLGTDFHLD

KYP69815.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]1.0e-11364.55Show/hide
Query:  LNGANFKDWKEDILIVLGCMDLDLALRVDHPTSTKENPNKVEIEKWD------------------RGSIVEGTNAKGFLKEMEQYFTKNDKAEASTLMAK
        LNG NFK WKE + I+LGCMDLDLAL  + PT T ENPN+ ++EKW+                  RG I E  NAKGFL  +EQYFT N+K +AS L+AK
Subjt:  LNGANFKDWKEDILIVLGCMDLDLALRVDHPTSTKENPNKVEIEKWD------------------RGSIVEGTNAKGFLKEMEQYFTKNDKAEASTLMAK

Query:  LTSSRYIGEGNIREYIMQMSNVATKLKTLKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWSLNELIFHCVQEEERMQREKTESVHMASTSKSVKRK
        L S RY G+GNIREYIM+MSN+A+KLK LKLE+S+D LVHLVL SLP  +  F+V YNTQKDKW+LNELI HCVQEEER  REKTES H+AS+S++ KRK
Subjt:  LTSSRYIGEGNIREYIMQMSNVATKLKTLKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWSLNELIFHCVQEEERMQREKTESVHMASTSKSVKRK

Query:  RVNNAAESSKQ-KKEKKHDSGPVCFFCKKTGHMKKQCAKYAAWLLKKGMFLSLVCSEINLASVPMHTWWVDSGATTHISVSMQGCIWSRPPSDAEAFIYV
           +  E + Q KK KK ++ P CFFCKK GHMKK+C KYA+W +KKG FLS VCSE+NLA VP  TWWVDSGATTHISVSMQGC+WSRPPSD E FIYV
Subjt:  RVNNAAESSKQ-KKEKKHDSGPVCFFCKKTGHMKKQCAKYAAWLLKKGMFLSLVCSEINLASVPMHTWWVDSGATTHISVSMQGCIWSRPPSDAEAFIYV

Query:  VDGNRAKVEAIGTFRLSLGTDFHLDLFEEY
         DGN+  VEAIGTFRL L T FHLDLFE +
Subjt:  VDGNRAKVEAIGTFRLSLGTDFHLDLFEEY

XP_022152845.1 uncharacterized protein LOC111020469 [Momordica charantia]5.3e-13190.55Show/hide
Query:  VLDYLHSKELELPLEGKPDDMGEKEWKKLDRKVLGTIRLTLTKNVQTSMAKEMTTMGLMNALANMYEKPSVNNKVYLGTKFFNLKMAKGTSITAHLNEFD
        VLDYLHSKELE PLEGKPDDMGEKEWKKLDRKVLGTIRLTLTKNVQ+S+AK  TTMGLMNALANMYEK SVNNKVYL TKFFNLKMA+ T ITAHLNEFD
Subjt:  VLDYLHSKELELPLEGKPDDMGEKEWKKLDRKVLGTIRLTLTKNVQTSMAKEMTTMGLMNALANMYEKPSVNNKVYLGTKFFNLKMAKGTSITAHLNEFD

Query:  ALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRDAALADEIRRKDSGIASTSGLVLNVDSGRNNNRGYGNQGKSKNNRSRS
         LINKLVAVDLEFS EVYAILLLRSLPDSWEPMRAAISNSC KEKLKFEDVRDAALA+EIRRKDSGIA TSG VLNVD GRNNNRGYGN+GKSKNNRSRS
Subjt:  ALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRDAALADEIRRKDSGIASTSGLVLNVDSGRNNNRGYGNQGKSKNNRSRS

Query:  RNNRFECWNCGKTGHLKRNCKALKKNEGNEAGANVAEQIHDALVLAVESAYDTWVMDSGNHGKGYLADGEPLDII
        RN+RFECWNCGK GHLK NCKA KKNEGNEA ANVAEQIHDALV+AVESA+DTWVMDSGNHGK YLADGEPLDII
Subjt:  RNNRFECWNCGKTGHLKRNCKALKKNEGNEAGANVAEQIHDALVLAVESAYDTWVMDSGNHGKGYLADGEPLDII

TrEMBL top hitse value%identityAlignment
A0A151QZJ9 Retrovirus-related Pol polyprotein from transposon TNT 1-946.7e-11665.15Show/hide
Query:  LNGANFKDWKEDILIVLGCMDLDLALRVDHPTSTKENPNKVEIEKWDR------------------GSIVEGTNAKGFLKEMEQYFTKNDKAEASTLMAK
        LNG NFK WKE + I+LGCMDLDLALR + PT   ENP++ ++EKW+R                  GSI E  NAKGFL  +EQYFT N+K +AS+L+AK
Subjt:  LNGANFKDWKEDILIVLGCMDLDLALRVDHPTSTKENPNKVEIEKWDR------------------GSIVEGTNAKGFLKEMEQYFTKNDKAEASTLMAK

Query:  LTSSRYIGEGNIREYIMQMSNVATKLKTLKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWSLNELIFHCVQEEERMQREKTESVHMASTSKSVKRK
        L S RY G+GNIREYIM+MSN+A+KLK LKLE+S+D LVHLVL SLP  +  F+VSYNTQKDKW+LNELI HCVQEEER QREKTES H+AS+S++ KRK
Subjt:  LTSSRYIGEGNIREYIMQMSNVATKLKTLKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWSLNELIFHCVQEEERMQREKTESVHMASTSKSVKRK

Query:  RVNNAAESSKQ-KKEKKHDSGPVCFFCKKTGHMKKQCAKYAAWLLKKGMFLSLVCSEINLASVPMHTWWVDSGATTHISVSMQGCIWSRPPSDAEAFIYV
           +  E + Q KK KK ++ P CFFCKK GHMKK+C KYA+W +KKG FLS VCSE+NLA VP  TWWVDSGATTHISVSMQGC+WSRPPSD E FIYV
Subjt:  RVNNAAESSKQ-KKEKKHDSGPVCFFCKKTGHMKKQCAKYAAWLLKKGMFLSLVCSEINLASVPMHTWWVDSGATTHISVSMQGCIWSRPPSDAEAFIYV

Query:  VDGNRAKVEAIGTFRLSLGTDFHLDLFEEY
         DGN+  VEAIGTFRL L T FHLDLFE +
Subjt:  VDGNRAKVEAIGTFRLSLGTDFHLDLFEEY

A0A151RB35 Retrovirus-related Pol polyprotein from transposon TNT 1-942.7e-11765.76Show/hide
Query:  LNGANFKDWKEDILIVLGCMDLDLALRVDHPTSTKENPNKVEIEKWD------------------RGSIVEGTNAKGFLKEMEQYFTKNDKAEASTLMAK
        LNG NFK WKE + I+LGCMDLDLALR + PT   ENP++ ++EKW+                  RGSI E  NAKGFL  +EQYFT N+KA+AS+L+AK
Subjt:  LNGANFKDWKEDILIVLGCMDLDLALRVDHPTSTKENPNKVEIEKWD------------------RGSIVEGTNAKGFLKEMEQYFTKNDKAEASTLMAK

Query:  LTSSRYIGEGNIREYIMQMSNVATKLKTLKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWSLNELIFHCVQEEERMQREKTESVHMASTSKSVKRK
        L S RY G+GNIREYIM+MSN+A+KLK LKLE+S+D LVHLVL SLP  +  F+VSYNTQKDKW+LNELI HCVQEEER QREKTES H+AS+S++ KRK
Subjt:  LTSSRYIGEGNIREYIMQMSNVATKLKTLKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWSLNELIFHCVQEEERMQREKTESVHMASTSKSVKRK

Query:  RVNNAAE-SSKQKKEKKHDSGPVCFFCKKTGHMKKQCAKYAAWLLKKGMFLSLVCSEINLASVPMHTWWVDSGATTHISVSMQGCIWSRPPSDAEAFIYV
           +  E +S+QKK KK ++ P CFFCKK GHMKK+C KYA+W +KKG FLS VCSE+NLA VP  TWWVDSGATTHISVSMQGC+WSRPPSD E FIYV
Subjt:  RVNNAAE-SSKQKKEKKHDSGPVCFFCKKTGHMKKQCAKYAAWLLKKGMFLSLVCSEINLASVPMHTWWVDSGATTHISVSMQGCIWSRPPSDAEAFIYV

Query:  VDGNRAKVEAIGTFRLSLGTDFHLDLFEEY
         DGN+  VEAIGTFRL L T FHLDLFE +
Subjt:  VDGNRAKVEAIGTFRLSLGTDFHLDLFEEY

A0A151RDF9 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment)6.3e-11462.03Show/hide
Query:  IIVKVANSDNMSTQVNNIPKLNGANFKDWKEDILIVLGCMDLDLALRVDHPTSTKENPNKVEIEKWDRG------------------SIVEGTNAKGFLK
        I V VA++ N+S Q+N IP LNG NFK WKE + I+LGCMDLDLALR + PT   ENP++ ++EKW+R                   SI E  NAKG L 
Subjt:  IIVKVANSDNMSTQVNNIPKLNGANFKDWKEDILIVLGCMDLDLALRVDHPTSTKENPNKVEIEKWDRG------------------SIVEGTNAKGFLK

Query:  EMEQYFTKNDKAEASTLMAKLTSSRYIGEGNIREYIMQMSNVATKLKTLKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWSLNELIFHCVQEEERM
         +EQYFT N+KA+AS+L+AKL S R  G+GNIREYIM+MSN+A+KLK LKLE+S+D LVHLVL SLP  +  F+VSYNTQKDKW+LNELI HCVQEEER 
Subjt:  EMEQYFTKNDKAEASTLMAKLTSSRYIGEGNIREYIMQMSNVATKLKTLKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWSLNELIFHCVQEEERM

Query:  QREKTESVHMASTSKSVKRKRVNNAAESSKQ-KKEKKHDSGPVCFFCKKTGHMKKQCAKYAAWLLKKGMFLSLVCSEINLASVPMHTWWVDSGATTHISV
        QREKTES H+AS+S++ KRK   +  E + Q KK KK ++ P CFFCKK GHMKK+C KY +W +KKG F S VCSE+NL  VP  TWWVDSGATTHISV
Subjt:  QREKTESVHMASTSKSVKRKRVNNAAESSKQ-KKEKKHDSGPVCFFCKKTGHMKKQCAKYAAWLLKKGMFLSLVCSEINLASVPMHTWWVDSGATTHISV

Query:  SMQGCIWSRPPSDAEAFIYVVDGNRAKVEAIGTFRLSLGTDFHLD
        SMQGC+W RPP D E FIYV DGN+  VEAIGTFRL L T FHL+
Subjt:  SMQGCIWSRPPSDAEAFIYVVDGNRAKVEAIGTFRLSLGTDFHLD

A0A151TRZ9 Retrovirus-related Pol polyprotein from transposon TNT 1-944.8e-11464.55Show/hide
Query:  LNGANFKDWKEDILIVLGCMDLDLALRVDHPTSTKENPNKVEIEKWD------------------RGSIVEGTNAKGFLKEMEQYFTKNDKAEASTLMAK
        LNG NFK WKE + I+LGCMDLDLAL  + PT T ENPN+ ++EKW+                  RG I E  NAKGFL  +EQYFT N+K +AS L+AK
Subjt:  LNGANFKDWKEDILIVLGCMDLDLALRVDHPTSTKENPNKVEIEKWD------------------RGSIVEGTNAKGFLKEMEQYFTKNDKAEASTLMAK

Query:  LTSSRYIGEGNIREYIMQMSNVATKLKTLKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWSLNELIFHCVQEEERMQREKTESVHMASTSKSVKRK
        L S RY G+GNIREYIM+MSN+A+KLK LKLE+S+D LVHLVL SLP  +  F+V YNTQKDKW+LNELI HCVQEEER  REKTES H+AS+S++ KRK
Subjt:  LTSSRYIGEGNIREYIMQMSNVATKLKTLKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWSLNELIFHCVQEEERMQREKTESVHMASTSKSVKRK

Query:  RVNNAAESSKQ-KKEKKHDSGPVCFFCKKTGHMKKQCAKYAAWLLKKGMFLSLVCSEINLASVPMHTWWVDSGATTHISVSMQGCIWSRPPSDAEAFIYV
           +  E + Q KK KK ++ P CFFCKK GHMKK+C KYA+W +KKG FLS VCSE+NLA VP  TWWVDSGATTHISVSMQGC+WSRPPSD E FIYV
Subjt:  RVNNAAESSKQ-KKEKKHDSGPVCFFCKKTGHMKKQCAKYAAWLLKKGMFLSLVCSEINLASVPMHTWWVDSGATTHISVSMQGCIWSRPPSDAEAFIYV

Query:  VDGNRAKVEAIGTFRLSLGTDFHLDLFEEY
         DGN+  VEAIGTFRL L T FHLDLFE +
Subjt:  VDGNRAKVEAIGTFRLSLGTDFHLDLFEEY

A0A6J1DF43 uncharacterized protein LOC1110204692.6e-13190.55Show/hide
Query:  VLDYLHSKELELPLEGKPDDMGEKEWKKLDRKVLGTIRLTLTKNVQTSMAKEMTTMGLMNALANMYEKPSVNNKVYLGTKFFNLKMAKGTSITAHLNEFD
        VLDYLHSKELE PLEGKPDDMGEKEWKKLDRKVLGTIRLTLTKNVQ+S+AK  TTMGLMNALANMYEK SVNNKVYL TKFFNLKMA+ T ITAHLNEFD
Subjt:  VLDYLHSKELELPLEGKPDDMGEKEWKKLDRKVLGTIRLTLTKNVQTSMAKEMTTMGLMNALANMYEKPSVNNKVYLGTKFFNLKMAKGTSITAHLNEFD

Query:  ALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRDAALADEIRRKDSGIASTSGLVLNVDSGRNNNRGYGNQGKSKNNRSRS
         LINKLVAVDLEFS EVYAILLLRSLPDSWEPMRAAISNSC KEKLKFEDVRDAALA+EIRRKDSGIA TSG VLNVD GRNNNRGYGN+GKSKNNRSRS
Subjt:  ALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRDAALADEIRRKDSGIASTSGLVLNVDSGRNNNRGYGNQGKSKNNRSRS

Query:  RNNRFECWNCGKTGHLKRNCKALKKNEGNEAGANVAEQIHDALVLAVESAYDTWVMDSGNHGKGYLADGEPLDII
        RN+RFECWNCGK GHLK NCKA KKNEGNEA ANVAEQIHDALV+AVESA+DTWVMDSGNHGK YLADGEPLDII
Subjt:  RNNRFECWNCGKTGHLKRNCKALKKNEGNEAGANVAEQIHDALVLAVESAYDTWVMDSGNHGKGYLADGEPLDII

SwissProt top hitse value%identityAlignment
A0A1D6KL43 Protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic6.9e-0992.59Show/hide
Query:  CGGIGKWKALNRKRAKDIYEFTECPNC
        CGG GKWKALNRKRAKD+YEFTECPNC
Subjt:  CGGIGKWKALNRKRAKDIYEFTECPNC

O64750 Protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic6.9e-0992.59Show/hide
Query:  CGGIGKWKALNRKRAKDIYEFTECPNC
        CGG GKWKALNRKRAKD+YEFTECPNC
Subjt:  CGGIGKWKALNRKRAKDIYEFTECPNC

P04146 Copia protein6.3e-1023.68Show/hide
Query:  RVLDYLHSKELELPLEGKPDDMGEKEWKKLDRKVLGTIRLTLTKNVQTSMAKEMTTMGLMNALANMYEKPSVNNKVYLGTKFFNLKMAKGTSITAHLNEF
        R+   L  +++   ++G   +  +  WKK +R    TI   L+ +       ++T   ++  L  +YE+ S+ +++ L  +  +LK++   S+ +H + F
Subjt:  RVLDYLHSKELELPLEGKPDDMGEKEWKKLDRKVLGTIRLTLTKNVQTSMAKEMTTMGLMNALANMYEKPSVNNKVYLGTKFFNLKMAKGTSITAHLNEF

Query:  DALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRDAALADEIRRKDSGIASTSGLVLNVDSGRNNNRGYGNQGKSKNNRSR
        D LI++L+A   +  +      LL +LP  ++ +  AI  +  +E L    V++  L  EI+ K+     TS  V+N     NNN    N  K++  + +
Subjt:  DALINKLVAVDLEFSDEVYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRDAALADEIRRKDSGIASTSGLVLNVDSGRNNNRGYGNQGKSKNNRSR

Query:  -----SRNNRFECWNCGKTGHLKRNC---KALKKNEGNEAGANVAEQIHDALVLAVESAYDTWVMDSGNHGKGYLADGEPLDIIVKVANSDNMST-QVNN
             +   + +C +CG+ GH+K++C   K +  N+  E    V       +   V+   +T VMD+     G++ D    D ++   N +++ T  V  
Subjt:  -----SRNNRFECWNCGKTGHLKRNC---KALKKNEGNEAGANVAEQIHDALVLAVESAYDTWVMDSGNHGKGYLADGEPLDIIVKVANSDNMST-QVNN

Query:  IPKL
        +P L
Subjt:  IPKL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.7e-2335.75Show/hide
Query:  KPDDMGEKEWKKLDRKVLGTIRLTLTKNVQTSMAKEMTTMGLMNALANMYEKPSVNNKVYLGTKFFNLKMAKGTSITAHLNEFDALINKLVAVDLEFSDE
        KPD M  ++W  LD +    IRL L+ +V  ++  E T  G+   L ++Y   ++ NK+YL  + + L M++GT+  +HLN F+ LI +L  + ++  +E
Subjt:  KPDDMGEKEWKKLDRKVLGTIRLTLTKNVQTSMAKEMTTMGLMNALANMYEKPSVNNKVYLGTKFFNLKMAKGTSITAHLNEFDALINKLVAVDLEFSDE

Query:  VYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRDAALADEIRRK---DSGIA-STSGLVLNVDSGRNNNRGYGNQGKSKN-NRSRSRNNRFECWNCG
          AILLL SLP S++ +   I +  GK  ++ +DV  A L +E  RK   + G A  T G   +     NN    G +GKSKN ++SR RN    C+NC 
Subjt:  VYAILLLRSLPDSWEPMRAAISNSCGKEKLKFEDVRDAALADEIRRK---DSGIA-STSGLVLNVDSGRNNNRGYGNQGKSKN-NRSRSRNNRFECWNCG

Query:  KTGHLKRNCKALKKNEGNEAG
        + GH KR+C   +K +G  +G
Subjt:  KTGHLKRNCKALKKNEGNEAG

Q6YUA8 Protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic5.9e-0888.89Show/hide
Query:  CGGIGKWKALNRKRAKDIYEFTECPNC
        CGG GKWKALNRKRAKD+Y FTECPNC
Subjt:  CGGIGKWKALNRKRAKDIYEFTECPNC

Arabidopsis top hitse value%identityAlignment
AT2G34860.1 DnaJ/Hsp40 cysteine-rich domain superfamily protein4.9e-1092.59Show/hide
Query:  CGGIGKWKALNRKRAKDIYEFTECPNC
        CGG GKWKALNRKRAKD+YEFTECPNC
Subjt:  CGGIGKWKALNRKRAKDIYEFTECPNC

AT2G34860.2 DnaJ/Hsp40 cysteine-rich domain superfamily protein4.9e-1092.59Show/hide
Query:  CGGIGKWKALNRKRAKDIYEFTECPNC
        CGG GKWKALNRKRAKD+YEFTECPNC
Subjt:  CGGIGKWKALNRKRAKDIYEFTECPNC

AT3G29785.1 unknown protein3.4e-1144.74Show/hide
Query:  RVLDYLHSKELELPLEGKPDDMGEKEWKKLDRKVLGTIRLTLTKNVQTSMAKEMTTMGLMNALANMYEKPSVNNKV
        ++ DYL+ K+L  PL  K + M + +W  L R+VL  IRLT++KN+  ++AKE +  GLM  L+++Y+KPS NN V
Subjt:  RVLDYLHSKELELPLEGKPDDMGEKEWKKLDRKVLGTIRLTLTKNVQTSMAKEMTTMGLMNALANMYEKPSVNNKV

AT5G53670.1 unknown protein7.8e-2434.16Show/hide
Query:  TQVNNIPKLNGANFKDWKEDILIVLGCMDLDLALRVDHPTSTKENPN------------KVEIEKWDRGSIVEG-TNAKGFLKEMEQYFTKNDKAEASTL
        + V++IP L+G+NF +WKE +L+VL  MDLDL+L  + P+S KE  +            K+ I +  RG + +  T AK FL  +E +F KN++AE S +
Subjt:  TQVNNIPKLNGANFKDWKEDILIVLGCMDLDLALRVDHPTSTKENPN------------KVEIEKWDRGSIVEG-TNAKGFLKEMEQYFTKNDKAEASTL

Query:  MAKLTSSRYIGEGNIREYIMQMSNVATKLKTLKLE---VSEDFLVHLVLNSLPAEYSHFRVSYNTQKDK-------------WSLNELIFHCVQEEERMQ
         A+ +S  YI   N+RE IM+M  +  K K L +     ++  L H  +  LP +Y   +  Y+  + K             WS  ELI  C  EEE ++
Subjt:  MAKLTSSRYIGEGNIREYIMQMSNVATKLKTLKLE---VSEDFLVHLVLNSLPAEYSHFRVSYNTQKDK-------------WSLNELIFHCVQEEERMQ

Query:  RE
         E
Subjt:  RE

AT5G53690.1 unknown protein2.8e-0542.86Show/hide
Query:  VLNSLPAEYSHFRVSYNTQKDKWSLNELIFHCVQEEERMQREKTESVHMASTSKSV---KRKRVNNAAES
        VL+SLP++Y   R +Y+  K +WS ++LI HCVQEEER+  EK E  H     K +   KRK+ +   E+
Subjt:  VLNSLPAEYSHFRVSYNTQKDKWSLNELIFHCVQEEERMQREKTESVHMASTSKSV---KRKRVNNAAES


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAACATCTTACAAGTCGGACGGTTCCTCTTCCGACGGAGCAACCAGTCCTATCTCAAACCATCCCTCCTCCTTCACGGAGAACGGAGTGCCAATTAACGGTCCTCT
TCAACAAGCACGGGAAGAAAACGACCAGCTCAAGAGAGAGCTACGTCGAACGCAACACGAGCTCAACAACACGAGGTATAAGTTAGCCCGGGTTGAAGAAACGCGGGACT
TGCTGGAGGGACTGCTGAAGGAAGAGAAGGAGGAACGACTTAGTCTGGAGGACAGAGTATTAGATTACTTGCACTCGAAAGAGTTGGAATTGCCATTAGAAGGAAAGCCA
GATGATATGGGAGAAAAAGAATGGAAGAAGTTGGACAGGAAAGTGTTGGGTACGATTCGCCTGACATTAACTAAAAATGTTCAGACCAGCATGGCGAAGGAGATGACCAC
AATGGGGTTGATGAATGCCCTGGCCAACATGTATGAAAAACCTTCGGTAAATAATAAGGTGTATCTTGGAACTAAATTTTTTAATTTGAAAATGGCTAAAGGTACATCTA
TTACTGCCCATTTAAATGAGTTTGACGCGTTGATTAATAAACTGGTAGCTGTTGATTTAGAATTCAGTGATGAAGTTTATGCTATTTTGTTATTAAGATCTTTGCCTGAT
AGTTGGGAACCCATGCGAGCTGCTATTTCGAATTCTTGTGGGAAAGAGAAATTGAAATTTGAAGATGTTAGAGATGCAGCTCTTGCAGACGAAATTCGTAGGAAGGACTC
TGGTATCGCTTCTACTTCTGGTTTAGTATTGAATGTGGACAGTGGAAGAAATAATAATAGAGGTTATGGGAATCAAGGCAAGTCGAAAAACAACAGAAGCAGGTCGAGAA
ACAATAGGTTTGAGTGTTGGAATTGTGGTAAGACTGGACACTTGAAGAGGAATTGTAAGGCCCTGAAGAAAAATGAAGGGAATGAAGCTGGTGCTAATGTTGCTGAGCAG
ATACATGATGCTTTGGTTCTTGCAGTTGAAAGCGCTTATGACACATGGGTGATGGATTCAGGAAATCATGGAAAGGGCTATCTTGCCGATGGAGAGCCTTTGGATATCAT
TGTTAAGGTTGCTAATTCTGATAATATGTCCACTCAAGTCAACAACATTCCTAAACTGAATGGGGCTAATTTTAAGGACTGGAAAGAAGACATCCTGATAGTACTTGGGT
GTATGGATTTAGACCTTGCATTAAGGGTAGACCATCCTACTTCAACTAAGGAAAATCCTAATAAGGTTGAAATTGAAAAGTGGGATAGAGGCTCTATTGTTGAGGGAACG
AATGCCAAAGGCTTTCTAAAAGAAATGGAGCAGTACTTTACCAAAAACGATAAGGCAGAGGCGAGTACCCTTATGGCAAAACTCACCTCTTCAAGATACATTGGTGAAGG
AAACATAAGGGAATACATAATGCAAATGTCAAATGTTGCAACAAAACTTAAGACACTTAAGTTGGAAGTTTCTGAAGACTTTTTAGTGCATTTAGTTTTGAACTCTCTTC
CAGCAGAGTATAGCCACTTTAGGGTGAGTTACAACACTCAGAAGGATAAATGGTCCCTGAATGAGCTAATCTTTCACTGTGTTCAAGAGGAAGAGAGGATGCAGCGAGAG
AAGACAGAAAGTGTTCACATGGCTTCTACCTCAAAGAGTGTAAAAAGAAAGAGAGTGAATAATGCTGCGGAATCTTCTAAGCAGAAAAAGGAAAAGAAACATGATTCAGG
ACCTGTTTGTTTCTTCTGTAAAAAGACTGGGCACATGAAAAAACAATGTGCCAAATATGCTGCATGGCTACTAAAGAAGGGCATGTTTCTCTCCCTTGTTTGTTCTGAGA
TTAATCTAGCTTCTGTACCTATGCATACGTGGTGGGTAGATTCAGGTGCTACTACTCACATAAGTGTATCTATGCAGGGTTGCATTTGGAGCCGACCACCAAGTGATGCT
GAGGCTTTCATCTACGTGGTTGACGGCAATAGGGCAAAAGTAGAAGCAATAGGAACATTTAGATTATCTTTGGGAACTGATTTTCATTTGGATTTGTTTGAGGAATATAG
GGCAAAAGTGACATGTGGTGGTATAGGCAAATGGAAAGCTCTGAACAGAAAACGGGCTAAGGATATCTACGAGTTTACAGAATGTCCAAATTGTTGTAAATATTCATCTC
GATGTACAAATTACAGTCAGCTCATCTATGGATGCAGCATATCTTTGCATGGTTTCAGTTTGACACAACTTCACATCTGCTGGACATGGAAAGTAATGTCCTCTTATACT
GTTCCTGATAGGCTTGGGAAAATCGAACATTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCAACATCTTACAAGTCGGACGGTTCCTCTTCCGACGGAGCAACCAGTCCTATCTCAAACCATCCCTCCTCCTTCACGGAGAACGGAGTGCCAATTAACGGTCCTCT
TCAACAAGCACGGGAAGAAAACGACCAGCTCAAGAGAGAGCTACGTCGAACGCAACACGAGCTCAACAACACGAGGTATAAGTTAGCCCGGGTTGAAGAAACGCGGGACT
TGCTGGAGGGACTGCTGAAGGAAGAGAAGGAGGAACGACTTAGTCTGGAGGACAGAGTATTAGATTACTTGCACTCGAAAGAGTTGGAATTGCCATTAGAAGGAAAGCCA
GATGATATGGGAGAAAAAGAATGGAAGAAGTTGGACAGGAAAGTGTTGGGTACGATTCGCCTGACATTAACTAAAAATGTTCAGACCAGCATGGCGAAGGAGATGACCAC
AATGGGGTTGATGAATGCCCTGGCCAACATGTATGAAAAACCTTCGGTAAATAATAAGGTGTATCTTGGAACTAAATTTTTTAATTTGAAAATGGCTAAAGGTACATCTA
TTACTGCCCATTTAAATGAGTTTGACGCGTTGATTAATAAACTGGTAGCTGTTGATTTAGAATTCAGTGATGAAGTTTATGCTATTTTGTTATTAAGATCTTTGCCTGAT
AGTTGGGAACCCATGCGAGCTGCTATTTCGAATTCTTGTGGGAAAGAGAAATTGAAATTTGAAGATGTTAGAGATGCAGCTCTTGCAGACGAAATTCGTAGGAAGGACTC
TGGTATCGCTTCTACTTCTGGTTTAGTATTGAATGTGGACAGTGGAAGAAATAATAATAGAGGTTATGGGAATCAAGGCAAGTCGAAAAACAACAGAAGCAGGTCGAGAA
ACAATAGGTTTGAGTGTTGGAATTGTGGTAAGACTGGACACTTGAAGAGGAATTGTAAGGCCCTGAAGAAAAATGAAGGGAATGAAGCTGGTGCTAATGTTGCTGAGCAG
ATACATGATGCTTTGGTTCTTGCAGTTGAAAGCGCTTATGACACATGGGTGATGGATTCAGGAAATCATGGAAAGGGCTATCTTGCCGATGGAGAGCCTTTGGATATCAT
TGTTAAGGTTGCTAATTCTGATAATATGTCCACTCAAGTCAACAACATTCCTAAACTGAATGGGGCTAATTTTAAGGACTGGAAAGAAGACATCCTGATAGTACTTGGGT
GTATGGATTTAGACCTTGCATTAAGGGTAGACCATCCTACTTCAACTAAGGAAAATCCTAATAAGGTTGAAATTGAAAAGTGGGATAGAGGCTCTATTGTTGAGGGAACG
AATGCCAAAGGCTTTCTAAAAGAAATGGAGCAGTACTTTACCAAAAACGATAAGGCAGAGGCGAGTACCCTTATGGCAAAACTCACCTCTTCAAGATACATTGGTGAAGG
AAACATAAGGGAATACATAATGCAAATGTCAAATGTTGCAACAAAACTTAAGACACTTAAGTTGGAAGTTTCTGAAGACTTTTTAGTGCATTTAGTTTTGAACTCTCTTC
CAGCAGAGTATAGCCACTTTAGGGTGAGTTACAACACTCAGAAGGATAAATGGTCCCTGAATGAGCTAATCTTTCACTGTGTTCAAGAGGAAGAGAGGATGCAGCGAGAG
AAGACAGAAAGTGTTCACATGGCTTCTACCTCAAAGAGTGTAAAAAGAAAGAGAGTGAATAATGCTGCGGAATCTTCTAAGCAGAAAAAGGAAAAGAAACATGATTCAGG
ACCTGTTTGTTTCTTCTGTAAAAAGACTGGGCACATGAAAAAACAATGTGCCAAATATGCTGCATGGCTACTAAAGAAGGGCATGTTTCTCTCCCTTGTTTGTTCTGAGA
TTAATCTAGCTTCTGTACCTATGCATACGTGGTGGGTAGATTCAGGTGCTACTACTCACATAAGTGTATCTATGCAGGGTTGCATTTGGAGCCGACCACCAAGTGATGCT
GAGGCTTTCATCTACGTGGTTGACGGCAATAGGGCAAAAGTAGAAGCAATAGGAACATTTAGATTATCTTTGGGAACTGATTTTCATTTGGATTTGTTTGAGGAATATAG
GGCAAAAGTGACATGTGGTGGTATAGGCAAATGGAAAGCTCTGAACAGAAAACGGGCTAAGGATATCTACGAGTTTACAGAATGTCCAAATTGTTGTAAATATTCATCTC
GATGTACAAATTACAGTCAGCTCATCTATGGATGCAGCATATCTTTGCATGGTTTCAGTTTGACACAACTTCACATCTGCTGGACATGGAAAGTAATGTCCTCTTATACT
GTTCCTGATAGGCTTGGGAAAATCGAACATTAG
Protein sequenceShow/hide protein sequence
MSTSYKSDGSSSDGATSPISNHPSSFTENGVPINGPLQQAREENDQLKRELRRTQHELNNTRYKLARVEETRDLLEGLLKEEKEERLSLEDRVLDYLHSKELELPLEGKP
DDMGEKEWKKLDRKVLGTIRLTLTKNVQTSMAKEMTTMGLMNALANMYEKPSVNNKVYLGTKFFNLKMAKGTSITAHLNEFDALINKLVAVDLEFSDEVYAILLLRSLPD
SWEPMRAAISNSCGKEKLKFEDVRDAALADEIRRKDSGIASTSGLVLNVDSGRNNNRGYGNQGKSKNNRSRSRNNRFECWNCGKTGHLKRNCKALKKNEGNEAGANVAEQ
IHDALVLAVESAYDTWVMDSGNHGKGYLADGEPLDIIVKVANSDNMSTQVNNIPKLNGANFKDWKEDILIVLGCMDLDLALRVDHPTSTKENPNKVEIEKWDRGSIVEGT
NAKGFLKEMEQYFTKNDKAEASTLMAKLTSSRYIGEGNIREYIMQMSNVATKLKTLKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWSLNELIFHCVQEEERMQRE
KTESVHMASTSKSVKRKRVNNAAESSKQKKEKKHDSGPVCFFCKKTGHMKKQCAKYAAWLLKKGMFLSLVCSEINLASVPMHTWWVDSGATTHISVSMQGCIWSRPPSDA
EAFIYVVDGNRAKVEAIGTFRLSLGTDFHLDLFEEYRAKVTCGGIGKWKALNRKRAKDIYEFTECPNCCKYSSRCTNYSQLIYGCSISLHGFSLTQLHICWTWKVMSSYT
VPDRLGKIEH