; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g21830 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g21830
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionZf-CCHC domain-containing protein/UBN2 domain-containing protein
Genome locationchr3:15059901..15073327
RNA-Seq ExpressionMoc03g21830
SyntenyMoc03g21830
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046182.1 zf-CCHC domain-containing protein/UBN2 domain-containing protein [Cucumis melo var. makuwa]5.5e-11968.03Show/hide
Query:  KKLLWSLPKQWEPKIIVIQEVKDLKTLSMDELIGLLMMHEIKIKKNMEDEKKKKEKSIALKAITLEVDSEGENALDEDDVTYLSRKYKNFIKRKKQFKKN
        +K+L  LPK W+ K+  IQE KDL  L ++ELIG LM HEI +++++EDE KKK KSIAL  I+LE+  E E+ LDEDD+ Y SRKYKNFIKRKK FKK 
Subjt:  KKLLWSLPKQWEPKIIVIQEVKDLKTLSMDELIGLLMMHEIKIKKNMEDEKKKKEKSIALKAITLEVDSEGENALDEDDVTYLSRKYKNFIKRKKQFKKN

Query:  FSNQKESKSETSKKDEVICYECKKPGHIRIDCPLLKSFKKSKKKAMKATWDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVTLDPLSYDELFEAFEN
         S QK SK E SKKDEVICYECKK  HIR DCP LKS KKSK+KAMKATWDDS E  SESE EE AN   M  SDKEDE DDEVTL+P S +ELFE FEN
Subjt:  FSNQKESKSETSKKDEVICYECKKPGHIRIDCPLLKSFKKSKKKAMKATWDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVTLDPLSYDELFEAFEN

Query:  MQNDLEKLGSKYVMLKKKYKVLTSENKSLLDDIACLK---NNEHDVVNISCDKHVLDCDEKNALLDKIRFLEHDGCEKDNLIKLLKKNESNALVELDKAK
        +QNDLEKL SKYV+LKKKY VL+SENKSLLD IAC K   N + + +N+S DKH+ DC+EK+ALLDK+RFLEHD CEKDNLIK+LK+NE N L +LDKAK
Subjt:  MQNDLEKLGSKYVMLKKKYKVLTSENKSLLDDIACLK---NNEHDVVNISCDKHVLDCDEKNALLDKIRFLEHDGCEKDNLIKLLKKNESNALVELDKAK

Query:  DSIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDEWSTLSSSKTIFVKASPNMSKLVAPKVSQVLPL
        ++IKKLTIGAQRLDKIIEVGK YGDKR LGYIDE STL SSKT FVKASP     + PK +  L +
Subjt:  DSIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDEWSTLSSSKTIFVKASPNMSKLVAPKVSQVLPL

XP_022156978.1 uncharacterized protein LOC111023806 [Momordica charantia]2.6e-13272.25Show/hide
Query:  MFIRFTNIVNALEGLGKEYSNLEKVKKLLWSLPKQWEPKIIVIQEVKDLKTLSMDELIGLLMMHEIKIKKNMEDEKKKKEKSIALKAITLEVDSEGENAL
        MFIRFTNIVNALEGL KEYSNLEKVKKLLWSLPK+WEPK+ +IQE KDLKTLSMDELIG LM HEIKIKKNMEDEKKKK+KSIALKAITLEVD EGEN L
Subjt:  MFIRFTNIVNALEGLGKEYSNLEKVKKLLWSLPKQWEPKIIVIQEVKDLKTLSMDELIGLLMMHEIKIKKNMEDEKKKKEKSIALKAITLEVDSEGENAL

Query:  DEDDVTYLSRKYKNFIKRKKQFKKNFSNQKESKSETSKKDEVICYECKKPGHIRIDCPLLKSFKKSKKKAMKATWDDSDESGSESENEEVANFCFMAHSD
        DEDDV YLSRK                                            DCPLLKS KKSKKKAMKATWDDSDESGSESENEEVANFCFMAHSD
Subjt:  DEDDVTYLSRKYKNFIKRKKQFKKNFSNQKESKSETSKKDEVICYECKKPGHIRIDCPLLKSFKKSKKKAMKATWDDSDESGSESENEEVANFCFMAHSD

Query:  KEDEQDDEVTLDPLSYDELFEAFENMQNDLEKLGSKYVMLKKKYKVLTSENKSLLDDIACLKNNEHDVVNISCDKHVLDCDEKNALLDKIRFLEHDGCEK
        KEDEQDDEV LDPLSYDELFEAFENMQN+LEKLGSKYVMLK K  V TSENKSL DDIACLK NEHDV                                
Subjt:  KEDEQDDEVTLDPLSYDELFEAFENMQNDLEKLGSKYVMLKKKYKVLTSENKSLLDDIACLKNNEHDVVNISCDKHVLDCDEKNALLDKIRFLEHDGCEK

Query:  DNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDEWSTLSSSKTIFVKASPNMSKLVAPKV
        DNLIKLLKKNES+ALVELDKAKD IK+LTIGAQRLDKIIE GKPYGDKRGLGYI+E +T SSSKTIFVKASPNM KLVAPKV
Subjt:  DNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDEWSTLSSSKTIFVKASPNMSKLVAPKV

XP_022158792.1 uncharacterized protein LOC111025259 [Momordica charantia]1.1e-11964.97Show/hide
Query:  MFKIDENEAISDMFIRFTNIVNALEGLGKEYSNLEKVKKLLWSLPKQWEPKIIVIQEVKDLKTLSMDELIGLLMMHEIKIKKNMEDEKKKKEKSIALKAI
        MFKIDENEAISDMFIRFTNIVNALEGLGKEYSNLEKVKKLLWSLPKQWEPK+  IQE KDLKTLSMDELI                              
Subjt:  MFKIDENEAISDMFIRFTNIVNALEGLGKEYSNLEKVKKLLWSLPKQWEPKIIVIQEVKDLKTLSMDELIGLLMMHEIKIKKNMEDEKKKKEKSIALKAI

Query:  TLEVDSEGENALDEDDVTYLSRKYKNFIKRKKQFKKNFSNQKESKSETSKKDEVICYECKKPGHIRIDCPLLKSFKKSKKKAMKATWDDSDESGSESENE
               GENALDEDDV YLSRKYKNFIKRKKQFKKNFSN KE KSE SKKDEVICYECKKPGHIR DCP LKS KKSKKKAMKATWDDSDESG+ESENE
Subjt:  TLEVDSEGENALDEDDVTYLSRKYKNFIKRKKQFKKNFSNQKESKSETSKKDEVICYECKKPGHIRIDCPLLKSFKKSKKKAMKATWDDSDESGSESENE

Query:  EVANFCFMAHSDKEDEQDDEVTLDPLSYDELFEAFENMQNDLEKLGSKYVMLKKKYKVLTSENKSLLDDIACLKNNEHDVVNISCDKHVLDCDEKNALLD
        EVANFCFMAHSDKEDE+DDE+TLDPLSYDELFEAFENMQNDLEK                                                        
Subjt:  EVANFCFMAHSDKEDEQDDEVTLDPLSYDELFEAFENMQNDLEKLGSKYVMLKKKYKVLTSENKSLLDDIACLKNNEHDVVNISCDKHVLDCDEKNALLD

Query:  KIRFLEHDGCEKDNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDEWSTLSSSKTIFVKASPNMSKLVAPKV
                                  LVELDKAKDSIKKLTIGAQRLDKIIE+GKPYGDKRGLGYIDE ST SSSK IFVKASPNM KLVAPKV
Subjt:  KIRFLEHDGCEKDNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDEWSTLSSSKTIFVKASPNMSKLVAPKV

XP_022931810.1 uncharacterized protein LOC111438099 [Cucurbita moschata]3.4e-8463.81Show/hide
Query:  MKFFLLSIDYDLWDVVEEGFKIPIKIVDGVRVVKPKEEWSIIEKKACSLNVKAINCLFCALNEIEYNRVLICKTAKDIWDKLEITHEGTGQVKETKIDML
        MK +L S+DY LW  V  G  IPIKIV+ + V K + E+   + K CSLN  AINCL+CAL+  E+NRV +C +A +IW  LE+THEGT QVKETKI ML
Subjt:  MKFFLLSIDYDLWDVVEEGFKIPIKIVDGVRVVKPKEEWSIIEKKACSLNVKAINCLFCALNEIEYNRVLICKTAKDIWDKLEITHEGTGQVKETKIDML

Query:  VHQYEMFKIDENEAISDMFIRFTNIVNALEGLGKEYSNLEKVKKLLWSLPKQWEPKIIVIQEVKDLKTLSMDELIGLLMMHEIKIKKNMEDEKKKKEKSI
        VH YE+FK++ENE I DMF RFTNI+NAL+ LGK YS  E V+K+L SLPK WE K+  IQE KDL  L +DEL+G LM HEI +  +ME+E KKK KSI
Subjt:  VHQYEMFKIDENEAISDMFIRFTNIVNALEGLGKEYSNLEKVKKLLWSLPKQWEPKIIVIQEVKDLKTLSMDELIGLLMMHEIKIKKNMEDEKKKKEKSI

Query:  ALKAITLEVDSEGENALDEDDVTYLSRKYKNFIKRKKQFKKNFSNQKESKSETSKKDEVICYECKKPG
        ALK+I  +VDSE E+ LDEDDV Y +RKYKNFIKRKKQFKK+F+NQKESK E SK DEVICYECKKPG
Subjt:  ALKAITLEVDSEGENALDEDDVTYLSRKYKNFIKRKKQFKKNFSNQKESKSETSKKDEVICYECKKPG

XP_031741720.1 uncharacterized protein LOC116403915 [Cucumis sativus]4.6e-18266.92Show/hide
Query:  MKFFLLSIDYDLWDVVEEGFKIPIKIVDGVRVVKPKEEWSIIEKKACSLNVKAINCLFCALNEIEYNRVLICKTAKDIWDKLEITHEGTGQVKETKIDML
        MK +L SIDY+LW +V +G  +P+K VD V   K +EE+   E K CS N KAINCL+CAL++ E+NR+ +C +A++IW+ LEITHEGT QVKE+KI M 
Subjt:  MKFFLLSIDYDLWDVVEEGFKIPIKIVDGVRVVKPKEEWSIIEKKACSLNVKAINCLFCALNEIEYNRVLICKTAKDIWDKLEITHEGTGQVKETKIDML

Query:  VHQYEMFKIDENEAISDMFIRFTNIVNALEGLGKEYSNLEKVKKLLWSLPKQWEPKIIVIQEVKDLKTLSMDELIGLLMMHEIKIKKNMEDEKKKKEKSI
        VH YE+FK+D NE I+DMF RFTNI+NAL+GLGK Y+  E V+K+L SLPK WE K+  IQE KDL  L ++ELIG LM HEI +K+++EDE KKK KSI
Subjt:  VHQYEMFKIDENEAISDMFIRFTNIVNALEGLGKEYSNLEKVKKLLWSLPKQWEPKIIVIQEVKDLKTLSMDELIGLLMMHEIKIKKNMEDEKKKKEKSI

Query:  ALKAITLEVDSEGENALDEDDVTYLSRKYKNFIKRKKQFKKNFSNQKESKSETSKKDEVICYECKKPGHIRIDCPLLKSFKKSKKKAMKATWDDSDESGS
        ALK I+LEVD E E+ LDEDD+ Y SRKYKNFIKRKK FKK+ S QKESK E SKKDEVICYECK+ GHIR DCPLLKS KKSKKKAMKATWDDS E  S
Subjt:  ALKAITLEVDSEGENALDEDDVTYLSRKYKNFIKRKKQFKKNFSNQKESKSETSKKDEVICYECKKPGHIRIDCPLLKSFKKSKKKAMKATWDDSDESGS

Query:  ESENEEVANFCFMAHSDKEDEQDDEVTLDPLSYDELFEAFENMQNDLEKLGSKYVMLKKKYKVLTSENKSLLDDIACLKNNEH----DVVNISCDKHVLD
        ESE EE+AN   MAHSDK+DE DD+VTL+PLS DELFE FE+MQNDLEKL SKYV+LKKKY VL SENKSLLD IAC K NE+    + +N+S DKHV  
Subjt:  ESENEEVANFCFMAHSDKEDEQDDEVTLDPLSYDELFEAFENMQNDLEKLGSKYVMLKKKYKVLTSENKSLLDDIACLKNNEH----DVVNISCDKHVLD

Query:  CDEKNALLDKIRFLEHDGCEKDNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDEWSTLSSSKTIFVKASP-----NM
        C EK+ALLDK+RFLEHD CEKDNLIK+LK+NE + L ELDKAK++IKKLTIGAQRLDKIIEVGK YGDKRGLGYIDE ST SSSKT FVKASP     NM
Subjt:  CDEKNALLDKIRFLEHDGCEKDNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDEWSTLSSSKTIFVKASP-----NM

Query:  SKLVAPKV-SQVLPLVRSLGIEDHLK
        S  V+  V S  +P+  + G+E H++
Subjt:  SKLVAPKV-SQVLPLVRSLGIEDHLK

TrEMBL top hitse value%identityAlignment
A0A1Q3BDH6 DUF4219 domain-containing protein/UBN2 domain-containing protein6.3e-8441.24Show/hide
Query:  MKFFLLSIDYDLWDVVEEGFKIP-IKIVDGVRVVKPKEEWSIIEKKACSLNVKAINCLFCALNEIEYNRVLICKTAKDIWDKLEITHEGTGQVKETKIDM
        M  F+ ++D++LWD++ +G ++P I + +G++ +KP+  ++  ++K   LN KA + + CALN  E+NRV  C TAK++WD+LE+T+EGT QVK+ KI+M
Subjt:  MKFFLLSIDYDLWDVVEEGFKIP-IKIVDGVRVVKPKEEWSIIEKKACSLNVKAINCLFCALNEIEYNRVLICKTAKDIWDKLEITHEGTGQVKETKIDM

Query:  LVHQYEMFKIDENEAISDMFIRFTNIVNALEGLGKEYSNLEKVKKLLWSLPKQWEPKIIVIQEVKDLKTLSMDELIGLLMMHEIKIKKNMEDEK--KKKE
        LV +YEMF + ENE IS MF+RFTNI+N+L+ L K Y+N E V+K+L  LPK W PK+  I+E KDL T  ++EL+G LM HE+ IK + +DE+  KKK+
Subjt:  LVHQYEMFKIDENEAISDMFIRFTNIVNALEGLGKEYSNLEKVKKLLWSLPKQWEPKIIVIQEVKDLKTLSMDELIGLLMMHEIKIKKNMEDEK--KKKE

Query:  KSIALKAITLEVDSEGENALDEDDVTYLSRKYKNFIKRKKQFKKNFSNQKESKSETSKKDEVICYECKKPGHIRIDCPLLKSFKKS--KKKAMKATWDDS
        K IA K+ T   DS  E++  +D++  ++R++K ++ +KK   K+F     SKSET KK+E+IC+EC KPGH   +CP LK  K +  KKKAM ATW DS
Subjt:  KSIALKAITLEVDSEGENALDEDDVTYLSRKYKNFIKRKKQFKKNFSNQKESKSETSKKDEVICYECKKPGHIRIDCPLLKSFKKS--KKKAMKATWDDS

Query:  DESGS-ESENEEVANFCFMA--HSDKEDEQDDEVTLDPLSYDELFEAFENMQNDLEKLGSKYVMLKKKYKVLTSENKSLLDDIACLKNNEHDVVNISCDK
        DES S E E++EVA    MA   SDK++++D++   D     EL E        LEK   +   LKK  KVL  EN SL  +I CL N   D        
Subjt:  DESGS-ESENEEVANFCFMA--HSDKEDEQDDEVTLDPLSYDELFEAFENMQNDLEKLGSKYVMLKKKYKVLTSENKSLLDDIACLKNNEHDVVNISCDK

Query:  HVLDCDEKNALLDKIRFLEHDGCEKDNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDEWSTL
                                 D  I L K+NE+   V++D  K +  K +  +++LDK++ + +   +K GLGY DE + +
Subjt:  HVLDCDEKNALLDKIRFLEHDGCEKDNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDEWSTL

A0A5A7TRZ7 Zf-CCHC domain-containing protein/UBN2 domain-containing protein2.7e-11968.03Show/hide
Query:  KKLLWSLPKQWEPKIIVIQEVKDLKTLSMDELIGLLMMHEIKIKKNMEDEKKKKEKSIALKAITLEVDSEGENALDEDDVTYLSRKYKNFIKRKKQFKKN
        +K+L  LPK W+ K+  IQE KDL  L ++ELIG LM HEI +++++EDE KKK KSIAL  I+LE+  E E+ LDEDD+ Y SRKYKNFIKRKK FKK 
Subjt:  KKLLWSLPKQWEPKIIVIQEVKDLKTLSMDELIGLLMMHEIKIKKNMEDEKKKKEKSIALKAITLEVDSEGENALDEDDVTYLSRKYKNFIKRKKQFKKN

Query:  FSNQKESKSETSKKDEVICYECKKPGHIRIDCPLLKSFKKSKKKAMKATWDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVTLDPLSYDELFEAFEN
         S QK SK E SKKDEVICYECKK  HIR DCP LKS KKSK+KAMKATWDDS E  SESE EE AN   M  SDKEDE DDEVTL+P S +ELFE FEN
Subjt:  FSNQKESKSETSKKDEVICYECKKPGHIRIDCPLLKSFKKSKKKAMKATWDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVTLDPLSYDELFEAFEN

Query:  MQNDLEKLGSKYVMLKKKYKVLTSENKSLLDDIACLK---NNEHDVVNISCDKHVLDCDEKNALLDKIRFLEHDGCEKDNLIKLLKKNESNALVELDKAK
        +QNDLEKL SKYV+LKKKY VL+SENKSLLD IAC K   N + + +N+S DKH+ DC+EK+ALLDK+RFLEHD CEKDNLIK+LK+NE N L +LDKAK
Subjt:  MQNDLEKLGSKYVMLKKKYKVLTSENKSLLDDIACLK---NNEHDVVNISCDKHVLDCDEKNALLDKIRFLEHDGCEKDNLIKLLKKNESNALVELDKAK

Query:  DSIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDEWSTLSSSKTIFVKASPNMSKLVAPKVSQVLPL
        ++IKKLTIGAQRLDKIIEVGK YGDKR LGYIDE STL SSKT FVKASP     + PK +  L +
Subjt:  DSIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDEWSTLSSSKTIFVKASPNMSKLVAPKVSQVLPL

A0A6J1DS74 uncharacterized protein LOC1110238061.2e-13272.25Show/hide
Query:  MFIRFTNIVNALEGLGKEYSNLEKVKKLLWSLPKQWEPKIIVIQEVKDLKTLSMDELIGLLMMHEIKIKKNMEDEKKKKEKSIALKAITLEVDSEGENAL
        MFIRFTNIVNALEGL KEYSNLEKVKKLLWSLPK+WEPK+ +IQE KDLKTLSMDELIG LM HEIKIKKNMEDEKKKK+KSIALKAITLEVD EGEN L
Subjt:  MFIRFTNIVNALEGLGKEYSNLEKVKKLLWSLPKQWEPKIIVIQEVKDLKTLSMDELIGLLMMHEIKIKKNMEDEKKKKEKSIALKAITLEVDSEGENAL

Query:  DEDDVTYLSRKYKNFIKRKKQFKKNFSNQKESKSETSKKDEVICYECKKPGHIRIDCPLLKSFKKSKKKAMKATWDDSDESGSESENEEVANFCFMAHSD
        DEDDV YLSRK                                            DCPLLKS KKSKKKAMKATWDDSDESGSESENEEVANFCFMAHSD
Subjt:  DEDDVTYLSRKYKNFIKRKKQFKKNFSNQKESKSETSKKDEVICYECKKPGHIRIDCPLLKSFKKSKKKAMKATWDDSDESGSESENEEVANFCFMAHSD

Query:  KEDEQDDEVTLDPLSYDELFEAFENMQNDLEKLGSKYVMLKKKYKVLTSENKSLLDDIACLKNNEHDVVNISCDKHVLDCDEKNALLDKIRFLEHDGCEK
        KEDEQDDEV LDPLSYDELFEAFENMQN+LEKLGSKYVMLK K  V TSENKSL DDIACLK NEHDV                                
Subjt:  KEDEQDDEVTLDPLSYDELFEAFENMQNDLEKLGSKYVMLKKKYKVLTSENKSLLDDIACLKNNEHDVVNISCDKHVLDCDEKNALLDKIRFLEHDGCEK

Query:  DNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDEWSTLSSSKTIFVKASPNMSKLVAPKV
        DNLIKLLKKNES+ALVELDKAKD IK+LTIGAQRLDKIIE GKPYGDKRGLGYI+E +T SSSKTIFVKASPNM KLVAPKV
Subjt:  DNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDEWSTLSSSKTIFVKASPNMSKLVAPKV

A0A6J1DY46 uncharacterized protein LOC1110252595.4e-12064.97Show/hide
Query:  MFKIDENEAISDMFIRFTNIVNALEGLGKEYSNLEKVKKLLWSLPKQWEPKIIVIQEVKDLKTLSMDELIGLLMMHEIKIKKNMEDEKKKKEKSIALKAI
        MFKIDENEAISDMFIRFTNIVNALEGLGKEYSNLEKVKKLLWSLPKQWEPK+  IQE KDLKTLSMDELI                              
Subjt:  MFKIDENEAISDMFIRFTNIVNALEGLGKEYSNLEKVKKLLWSLPKQWEPKIIVIQEVKDLKTLSMDELIGLLMMHEIKIKKNMEDEKKKKEKSIALKAI

Query:  TLEVDSEGENALDEDDVTYLSRKYKNFIKRKKQFKKNFSNQKESKSETSKKDEVICYECKKPGHIRIDCPLLKSFKKSKKKAMKATWDDSDESGSESENE
               GENALDEDDV YLSRKYKNFIKRKKQFKKNFSN KE KSE SKKDEVICYECKKPGHIR DCP LKS KKSKKKAMKATWDDSDESG+ESENE
Subjt:  TLEVDSEGENALDEDDVTYLSRKYKNFIKRKKQFKKNFSNQKESKSETSKKDEVICYECKKPGHIRIDCPLLKSFKKSKKKAMKATWDDSDESGSESENE

Query:  EVANFCFMAHSDKEDEQDDEVTLDPLSYDELFEAFENMQNDLEKLGSKYVMLKKKYKVLTSENKSLLDDIACLKNNEHDVVNISCDKHVLDCDEKNALLD
        EVANFCFMAHSDKEDE+DDE+TLDPLSYDELFEAFENMQNDLEK                                                        
Subjt:  EVANFCFMAHSDKEDEQDDEVTLDPLSYDELFEAFENMQNDLEKLGSKYVMLKKKYKVLTSENKSLLDDIACLKNNEHDVVNISCDKHVLDCDEKNALLD

Query:  KIRFLEHDGCEKDNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDEWSTLSSSKTIFVKASPNMSKLVAPKV
                                  LVELDKAKDSIKKLTIGAQRLDKIIE+GKPYGDKRGLGYIDE ST SSSK IFVKASPNM KLVAPKV
Subjt:  KIRFLEHDGCEKDNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDEWSTLSSSKTIFVKASPNMSKLVAPKV

A0A6J1F0H1 uncharacterized protein LOC1114380991.6e-8463.81Show/hide
Query:  MKFFLLSIDYDLWDVVEEGFKIPIKIVDGVRVVKPKEEWSIIEKKACSLNVKAINCLFCALNEIEYNRVLICKTAKDIWDKLEITHEGTGQVKETKIDML
        MK +L S+DY LW  V  G  IPIKIV+ + V K + E+   + K CSLN  AINCL+CAL+  E+NRV +C +A +IW  LE+THEGT QVKETKI ML
Subjt:  MKFFLLSIDYDLWDVVEEGFKIPIKIVDGVRVVKPKEEWSIIEKKACSLNVKAINCLFCALNEIEYNRVLICKTAKDIWDKLEITHEGTGQVKETKIDML

Query:  VHQYEMFKIDENEAISDMFIRFTNIVNALEGLGKEYSNLEKVKKLLWSLPKQWEPKIIVIQEVKDLKTLSMDELIGLLMMHEIKIKKNMEDEKKKKEKSI
        VH YE+FK++ENE I DMF RFTNI+NAL+ LGK YS  E V+K+L SLPK WE K+  IQE KDL  L +DEL+G LM HEI +  +ME+E KKK KSI
Subjt:  VHQYEMFKIDENEAISDMFIRFTNIVNALEGLGKEYSNLEKVKKLLWSLPKQWEPKIIVIQEVKDLKTLSMDELIGLLMMHEIKIKKNMEDEKKKKEKSI

Query:  ALKAITLEVDSEGENALDEDDVTYLSRKYKNFIKRKKQFKKNFSNQKESKSETSKKDEVICYECKKPG
        ALK+I  +VDSE E+ LDEDDV Y +RKYKNFIKRKKQFKK+F+NQKESK E SK DEVICYECKKPG
Subjt:  ALKAITLEVDSEGENALDEDDVTYLSRKYKNFIKRKKQFKKNFSNQKESKSETSKKDEVICYECKKPG

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.0e-0728.17Show/hide
Query:  NPQYNIWLNNDGLLTSWLLGIITEDVLATIEGTDSAYQMWKSLEEQLLTVTKENEIHLNEAILSLKKGSLSLDEYLKKTKSFCDQLAAMKKPVDDLTKVF
        NP Y  W   D L+ S +LG I+  V   +    +A Q+W++L +     +  +   L   +    KG+ ++D+Y++   +  DQLA + KP+D   +V 
Subjt:  NPQYNIWLNNDGLLTSWLLGIITEDVLATIEGTDSAYQMWKSLEEQLLTVTKENEIHLNEAILSLKKGSLSLDEYLKKTKSFCDQLAAMKKPVDDLTKVF

Query:  HVARGLGAKYHGFETVMLSKAPYPTYNEFILALKAHEVMINA
         V   L  +Y      + +K   PT  E    L  HE  I A
Subjt:  HVARGLGAKYHGFETVMLSKAPYPTYNEFILALKAHEVMINA

Arabidopsis top hitse value%identityAlignment
AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)5.7e-1328.97Show/hide
Query:  VLTDKDGETIPNPQ-YNIWLNNDGLLTSWLLGIITEDVLATIEGTD-SAYQMWKSLEEQLLTVTKENEIHLNEAILSLKKGSLSLDEYLKKTKSFCDQLA
        VL   DG + P P     W   DGL+  W+ G IT+ +L TI     +A  +W SLE       +   +     + +     LS+ EY +K KS  D L 
Subjt:  VLTDKDGETIPNPQ-YNIWLNNDGLLTSWLLGIITEDVLATIEGTD-SAYQMWKSLEEQLLTVTKENEIHLNEAILSLKKGSLSLDEYLKKTKSFCDQLA

Query:  AMKKPVDDLTKVFHVARGLGAKYHGFETVMLSKAPYPTYNEFILALKAHEVMINANNGEEKMSQLDH------------NQAFYAQK------GKGRGRG
         +  P+ D   V H+  GL  KY     V+  K+P+P++ E    L   E  + +N  +  +S  +H             Q  Y Q+        GRGR 
Subjt:  AMKKPVDDLTKVFHVARGLGAKYHGFETVMLSKAPYPTYNEFILALKAHEVMINANNGEEKMSQLDH------------NQAFYAQK------GKGRGRG

Query:  RNFSSRGRGFSQGR
        +   +RG G S GR
Subjt:  RNFSSRGRGFSQGR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATTTTTCTTGCTTTCTATTGACTATGATTTGTGGGATGTTGTTGAAGAAGGATTTAAAATTCCAATAAAAATTGTTGATGGTGTTAGAGTTGTAAAGCCTAAAGA
AGAATGGTCTATAATTGAAAAGAAAGCATGTTCTTTAAATGTTAAAGCTATTAATTGTTTGTTTTGTGCTTTGAATGAAATTGAGTATAATAGGGTGTTGATTTGTAAAA
CCGCTAAGGATATATGGGATAAATTAGAAATTACTCATGAAGGAACTGGTCAAGTAAAAGAAACAAAGATTGACATGTTAGTTCATCAATATGAAATGTTTAAAATCGAT
GAAAATGAAGCTATTTCCGATATGTTTATTAGATTTACTAACATTGTCAATGCTTTAGAAGGACTTGGAAAAGAATATTCAAATCTTGAGAAGGTAAAGAAACTCTTATG
GTCCTTGCCTAAACAATGGGAGCCTAAAATCATTGTCATTCAAGAGGTAAAGGATCTCAAGACTCTCTCCATGGACGAACTCATTGGTTTGTTGATGATGCATGAGATAA
AGATCAAGAAAAACATGGAAGATGAGAAGAAAAAGAAAGAGAAGAGCATAGCATTAAAGGCCATCACCTTGGAAGTTGACTCCGAAGGTGAGAATGCTCTTGATGAAGAT
GATGTGACCTATCTCTCACGTAAGTATAAAAATTTCATCAAGAGAAAGAAACAATTCAAGAAGAATTTCTCCAACCAAAAAGAGTCAAAAAGTGAAACGAGCAAAAAGGA
TGAGGTAATTTGTTATGAATGCAAAAAACCGGGTCATATTAGAATCGATTGTCCTCTTCTTAAATCATTCAAGAAATCCAAGAAGAAAGCAATGAAGGCTACTTGGGATG
ATAGTGATGAAAGTGGAAGTGAAAGTGAGAATGAAGAAGTGGCCAACTTTTGCTTCATGGCTCATAGTGACAAGGAGGATGAACAAGATGATGAGGTAACTCTTGATCCC
CTTTCTTATGATGAGTTGTTTGAAGCTTTTGAGAATATGCAAAATGATTTAGAAAAGCTTGGTTCTAAATATGTTATGCTTAAAAAGAAATACAAGGTCTTAACTAGTGA
AAATAAGTCTTTACTTGATGATATTGCTTGCTTAAAGAATAATGAGCATGATGTTGTAAATATCTCTTGTGATAAGCATGTTCTTGATTGTGACGAGAAAAATGCATTAC
TTGATAAAATTAGATTTCTTGAGCATGATGGTTGTGAAAAAGATAATTTGATTAAATTGCTTAAGAAAAATGAATCAAATGCTTTAGTGGAACTTGATAAGGCTAAAGAT
TCTATTAAAAAATTAACAATAGGTGCTCAAAGGTTGGACAAGATTATTGAAGTAGGTAAACCTTATGGTGATAAAAGAGGTTTAGGCTATATTGATGAATGGTCTACTCT
CTCAAGTTCTAAAACTATCTTTGTTAAAGCATCTCCTAATATGTCTAAGCTTGTTGCTCCTAAAGTTTCGCAAGTTCTACCACTAGTGAGGAGTTTGGGAATTGAAGACC
ACCTCAAGGAAAGCAAGAAACCAGAGGAGGTCTTGACTGACAAAGATGGAGAAACTATTCCAAATCCTCAATACAACATTTGGCTCAACAATGATGGCCTTCTAACCTCT
TGGCTTCTAGGCATCATAACTGAGGATGTGTTGGCCACAATTGAAGGAACAGATTCAGCATACCAAATGTGGAAATCTCTAGAAGAACAACTCCTCACCGTTACAAAGGA
AAACGAAATTCACCTCAATGAAGCCATCCTGAGTCTAAAAAAGGGAAGTCTCTCTTTAGATGAGTATTTGAAGAAAACTAAATCGTTTTGTGATCAACTTGCAGCTATGA
AGAAACCAGTGGATGATCTCACAAAGGTGTTTCATGTTGCTAGAGGACTAGGAGCTAAATACCATGGATTTGAAACAGTCATGTTGTCCAAAGCACCATATCCAACCTAT
AATGAGTTCATCCTAGCTCTTAAAGCACATGAGGTAATGATAAATGCTAATAACGGCGAGGAAAAGATGTCACAACTTGACCACAATCAAGCATTTTATGCCCAAAAAGG
AAAAGGCAGAGGCAGAGGAAGAAATTTCTCTTCTAGAGGAAGAGGTTTCTCTCAAGGCAGACAAATCGCTCCTGAAAATGCAGGTTACACAAGTCCTCATAAAACGAGCT
CTTTCCAAGGGAGACACAAAGACTCAGGAGCAGACCCTCTCTTGTATGCTGATTCAGGTGCTACCTCACATATCTTGAATGATCCAAGTTTTGAAGAATGGGAGAACACA
TCAGAAGAATCAAACAAGGCAAAATCACAACTAACAACTCTTGGAGAGCCAATCAAAACCTCACACTGTTGTGACATAGAGAATAACAGTAGTTCACAAGAAGGGTCAAT
TGAGGAAGAAGATCATTTTCAAGAAAATAACAAGCCGCAGGAGATTATAATTGAGGAAGAAAATATTTCACCTACTACCAACACAGGAGATGATACTCAAGGAATAGAAG
AGTCTACCTCAAACAACAGAACACCTATCTTCAACAAACACAATGAGCATTCTCATCTTCTTAACCTTCCTAACTCTAATATATCAAATTCTAACAATCTTCCAGATATT
AGCAAACAGTTATATGTGGATCTTAAGTTACATTCTACCGGTGAACAAAATTCTGCAGCAAAGTTGCGCACGTTTTTGCTTTTTTCATGGAGACTGGGTCGGTATTCAGG
AATTCCTGCCAAGATGACGGATCCTGCCTGGCACGCTTTCGGGAAGAACATAGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAATTTTTCTTGCTTTCTATTGACTATGATTTGTGGGATGTTGTTGAAGAAGGATTTAAAATTCCAATAAAAATTGTTGATGGTGTTAGAGTTGTAAAGCCTAAAGA
AGAATGGTCTATAATTGAAAAGAAAGCATGTTCTTTAAATGTTAAAGCTATTAATTGTTTGTTTTGTGCTTTGAATGAAATTGAGTATAATAGGGTGTTGATTTGTAAAA
CCGCTAAGGATATATGGGATAAATTAGAAATTACTCATGAAGGAACTGGTCAAGTAAAAGAAACAAAGATTGACATGTTAGTTCATCAATATGAAATGTTTAAAATCGAT
GAAAATGAAGCTATTTCCGATATGTTTATTAGATTTACTAACATTGTCAATGCTTTAGAAGGACTTGGAAAAGAATATTCAAATCTTGAGAAGGTAAAGAAACTCTTATG
GTCCTTGCCTAAACAATGGGAGCCTAAAATCATTGTCATTCAAGAGGTAAAGGATCTCAAGACTCTCTCCATGGACGAACTCATTGGTTTGTTGATGATGCATGAGATAA
AGATCAAGAAAAACATGGAAGATGAGAAGAAAAAGAAAGAGAAGAGCATAGCATTAAAGGCCATCACCTTGGAAGTTGACTCCGAAGGTGAGAATGCTCTTGATGAAGAT
GATGTGACCTATCTCTCACGTAAGTATAAAAATTTCATCAAGAGAAAGAAACAATTCAAGAAGAATTTCTCCAACCAAAAAGAGTCAAAAAGTGAAACGAGCAAAAAGGA
TGAGGTAATTTGTTATGAATGCAAAAAACCGGGTCATATTAGAATCGATTGTCCTCTTCTTAAATCATTCAAGAAATCCAAGAAGAAAGCAATGAAGGCTACTTGGGATG
ATAGTGATGAAAGTGGAAGTGAAAGTGAGAATGAAGAAGTGGCCAACTTTTGCTTCATGGCTCATAGTGACAAGGAGGATGAACAAGATGATGAGGTAACTCTTGATCCC
CTTTCTTATGATGAGTTGTTTGAAGCTTTTGAGAATATGCAAAATGATTTAGAAAAGCTTGGTTCTAAATATGTTATGCTTAAAAAGAAATACAAGGTCTTAACTAGTGA
AAATAAGTCTTTACTTGATGATATTGCTTGCTTAAAGAATAATGAGCATGATGTTGTAAATATCTCTTGTGATAAGCATGTTCTTGATTGTGACGAGAAAAATGCATTAC
TTGATAAAATTAGATTTCTTGAGCATGATGGTTGTGAAAAAGATAATTTGATTAAATTGCTTAAGAAAAATGAATCAAATGCTTTAGTGGAACTTGATAAGGCTAAAGAT
TCTATTAAAAAATTAACAATAGGTGCTCAAAGGTTGGACAAGATTATTGAAGTAGGTAAACCTTATGGTGATAAAAGAGGTTTAGGCTATATTGATGAATGGTCTACTCT
CTCAAGTTCTAAAACTATCTTTGTTAAAGCATCTCCTAATATGTCTAAGCTTGTTGCTCCTAAAGTTTCGCAAGTTCTACCACTAGTGAGGAGTTTGGGAATTGAAGACC
ACCTCAAGGAAAGCAAGAAACCAGAGGAGGTCTTGACTGACAAAGATGGAGAAACTATTCCAAATCCTCAATACAACATTTGGCTCAACAATGATGGCCTTCTAACCTCT
TGGCTTCTAGGCATCATAACTGAGGATGTGTTGGCCACAATTGAAGGAACAGATTCAGCATACCAAATGTGGAAATCTCTAGAAGAACAACTCCTCACCGTTACAAAGGA
AAACGAAATTCACCTCAATGAAGCCATCCTGAGTCTAAAAAAGGGAAGTCTCTCTTTAGATGAGTATTTGAAGAAAACTAAATCGTTTTGTGATCAACTTGCAGCTATGA
AGAAACCAGTGGATGATCTCACAAAGGTGTTTCATGTTGCTAGAGGACTAGGAGCTAAATACCATGGATTTGAAACAGTCATGTTGTCCAAAGCACCATATCCAACCTAT
AATGAGTTCATCCTAGCTCTTAAAGCACATGAGGTAATGATAAATGCTAATAACGGCGAGGAAAAGATGTCACAACTTGACCACAATCAAGCATTTTATGCCCAAAAAGG
AAAAGGCAGAGGCAGAGGAAGAAATTTCTCTTCTAGAGGAAGAGGTTTCTCTCAAGGCAGACAAATCGCTCCTGAAAATGCAGGTTACACAAGTCCTCATAAAACGAGCT
CTTTCCAAGGGAGACACAAAGACTCAGGAGCAGACCCTCTCTTGTATGCTGATTCAGGTGCTACCTCACATATCTTGAATGATCCAAGTTTTGAAGAATGGGAGAACACA
TCAGAAGAATCAAACAAGGCAAAATCACAACTAACAACTCTTGGAGAGCCAATCAAAACCTCACACTGTTGTGACATAGAGAATAACAGTAGTTCACAAGAAGGGTCAAT
TGAGGAAGAAGATCATTTTCAAGAAAATAACAAGCCGCAGGAGATTATAATTGAGGAAGAAAATATTTCACCTACTACCAACACAGGAGATGATACTCAAGGAATAGAAG
AGTCTACCTCAAACAACAGAACACCTATCTTCAACAAACACAATGAGCATTCTCATCTTCTTAACCTTCCTAACTCTAATATATCAAATTCTAACAATCTTCCAGATATT
AGCAAACAGTTATATGTGGATCTTAAGTTACATTCTACCGGTGAACAAAATTCTGCAGCAAAGTTGCGCACGTTTTTGCTTTTTTCATGGAGACTGGGTCGGTATTCAGG
AATTCCTGCCAAGATGACGGATCCTGCCTGGCACGCTTTCGGGAAGAACATAGAATAG
Protein sequenceShow/hide protein sequence
MKFFLLSIDYDLWDVVEEGFKIPIKIVDGVRVVKPKEEWSIIEKKACSLNVKAINCLFCALNEIEYNRVLICKTAKDIWDKLEITHEGTGQVKETKIDMLVHQYEMFKID
ENEAISDMFIRFTNIVNALEGLGKEYSNLEKVKKLLWSLPKQWEPKIIVIQEVKDLKTLSMDELIGLLMMHEIKIKKNMEDEKKKKEKSIALKAITLEVDSEGENALDED
DVTYLSRKYKNFIKRKKQFKKNFSNQKESKSETSKKDEVICYECKKPGHIRIDCPLLKSFKKSKKKAMKATWDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVTLDP
LSYDELFEAFENMQNDLEKLGSKYVMLKKKYKVLTSENKSLLDDIACLKNNEHDVVNISCDKHVLDCDEKNALLDKIRFLEHDGCEKDNLIKLLKKNESNALVELDKAKD
SIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDEWSTLSSSKTIFVKASPNMSKLVAPKVSQVLPLVRSLGIEDHLKESKKPEEVLTDKDGETIPNPQYNIWLNNDGLLTS
WLLGIITEDVLATIEGTDSAYQMWKSLEEQLLTVTKENEIHLNEAILSLKKGSLSLDEYLKKTKSFCDQLAAMKKPVDDLTKVFHVARGLGAKYHGFETVMLSKAPYPTY
NEFILALKAHEVMINANNGEEKMSQLDHNQAFYAQKGKGRGRGRNFSSRGRGFSQGRQIAPENAGYTSPHKTSSFQGRHKDSGADPLLYADSGATSHILNDPSFEEWENT
SEESNKAKSQLTTLGEPIKTSHCCDIENNSSSQEGSIEEEDHFQENNKPQEIIIEEENISPTTNTGDDTQGIEESTSNNRTPIFNKHNEHSHLLNLPNSNISNSNNLPDI
SKQLYVDLKLHSTGEQNSAAKLRTFLLFSWRLGRYSGIPAKMTDPAWHAFGKNIE