CuGenDBv2

Moc08g27330 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g27330
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: BED-type domain-containing protein
Genome location: chr8:19765266..19775151
RNA-Seq Expression: Moc08g27330
Synteny: Moc08g27330
Gene Ontology terms: NA
InterPro domains: IPR007021 - Domain of unknown function DUF659
IPR012337 - Ribonuclease H-like superfamily
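The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, the snippet below splits such a string into its parts and derives the span of the locus. The helper names (`GenomeLocation`, `parse_location`) are hypothetical and not part of CuGenDB, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'chrom:start..end' location (hypothetical helper type)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'chr8:19765266..19775151'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Normalise so that start <= end even if the input is reversed.
    if start > end:
        start, end = end, start
    return GenomeLocation(chrom, start, end)


loc = parse_location("chr8:19765266..19775151")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the locus spans 9886 positions on chr8. If the coordinates turn out to be 0-based or half-open, the `length` property would need adjusting.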


Homology
GenBank top hits | e value | % identity | Alignment
PKU60096.1 hypothetical protein MA16_Dca020494 [Dendrobium catenatum] | e value: 4.2e-81 | % identity: 44.47
Query:  VKDANMLFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLGELPQHKNALIKAK--------------------K
        VKDA  +F+LLD +V+EIGE+L+VQ+VTDNAS+YK+A  KLMEKRKHL+WTPCAAHC++L+LEKLG+LPQHKNAL KAK                    K
Subjt:  VKDANMLFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLGELPQHKNALIKAK--------------------K

Query:  RGLVRPATTRFATAYLTLQSICQSKQPLQAMFTSKQWQDSVWAKKPEGKEVKRIILN----------------------------------------SAK
        + ++RPA TRFATAYLTL+S+  ++QPL+AMFTS QW    WAKK EGKE+++I+LN                                         AK
Subjt:  RGLVRPATTRFATAYLTLQSICQSKQPLQAMFTSKQWQDSVWAKKPEGKEVKRIILN----------------------------------------SAK

Query:  EEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDNIIVDEAERVKADLQIDSFRRREGFFGFQQANSIL
        E IA NLGG E S++EIWNIID++WE QLHRH HAA Y+LNP +QY +N ST+PEIKL LY CMD +I D  ER  ADLQ+  FR +EGFFG QQA   +
Subjt:  EEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDNIIVDEAERVKADLQIDSFRRREGFFGFQQANSIL

Query:  QKAI----------------------------------------------------------------SRLKDKHLKLKALKEGEDLLVLDDLVSDDE
         K                                                                   RL+D++LK K L + ED L+ DD+ SDDE
Subjt:  QKAI----------------------------------------------------------------SRLKDKHLKLKALKEGEDLLVLDDLVSDDE

XP_022159386.1 uncharacterized protein LOC111025802 [Momordica charantia] | e value: 4.3e-110 | % identity: 72.04
Query:  AVKDANMLFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLGELPQHKNALIKAK--------------------
        A+KDANM+FQLLDDVVDEIGENL+VQVVTDNASNYKSA KKLMEKRKHL+WTPCAAHCL+LMLEKLGELPQHKNALIKAK                    
Subjt:  AVKDANMLFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLGELPQHKNALIKAK--------------------

Query:  KRGLVRPATTRFATAYLTLQSICQSKQPLQAMFTSKQWQDSVWAKKPEGKEVKRIIL-----------------------------------------NS
        KR LVRPATTRFATAYLTLQSICQSKQPLQAMFTSKQ QDSVWAKKPEGKEVKRIIL                                         +S
Subjt:  KRGLVRPATTRFATAYLTLQSICQSKQPLQAMFTSKQWQDSVWAKKPEGKEVKRIIL-----------------------------------------NS

Query:  AKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDNIIVDEAERVKADLQIDSFRRREGFFGFQQANS
        AKEEIAKN GGEEASYKEIWNIIDEKWEFQLHRH HAA YFLNPHFQYD+NFSTHPEIKL LYTC D +IVDE ERVKADLQ DSFRRREGFFGFQQA +
Subjt:  AKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDNIIVDEAERVKADLQIDSFRRREGFFGFQQANS

Query:  ILQK
          +K
Subjt:  ILQK

XP_022159386.1 uncharacterized protein LOC111025802 [Momordica charantia] | e value: 6.8e-07 | % identity: 96.97
Query:  MDRFMVIDVGDVDADGGNKVQNQVTPTNAKEAR
        MDRFMV DVGDVDADGGNKVQNQVTPTNAKEAR
Subjt:  MDRFMVIDVGDVDADGGNKVQNQVTPTNAKEAR

XP_022159386.1 uncharacterized protein LOC111025802 [Momordica charantia] | e value: 2.2e-82 | % identity: 55.48
Query:  AVKDANMLFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLGELPQHKNALIKAKK-----RGLVRPATTRFATA
        A+KDAN++F+LLD+VV+E+GE+++VQVVTDNASNYK A KKLMEKR  L WTPCA+HC++LMLEK+  LPQHKNAL+KAKK     R L+RPA TRFATA
Subjt:  AVKDANMLFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLGELPQHKNALIKAKK-----RGLVRPATTRFATA

Query:  YLTLQSICQSKQPLQAMFTSKQWQDSVWAKKPEGKEVKRII----------------------------------------LNSAKEEIAKNLGGEEASY
        YLTLQSI   +QPL++MF S +W  S +++K EGK VK+II                                        ++ AKEEIAKNLG E+ SY
Subjt:  YLTLQSICQSKQPLQAMFTSKQWQDSVWAKKPEGKEVKRII----------------------------------------LNSAKEEIAKNLGGEEASY

Query:  KEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDNIIVDEAERVKADLQIDSFRRREGFFGFQQANSILQKAISR
        K++WNIIDEKWEFQ+HRH HAAAYFLNPHFQYDD FS+H E+K  LY C++ +I +E++R KADLQ+D FR+R+G F    A S  +K   R
Subjt:  KEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDNIIVDEAERVKADLQIDSFRRREGFFGFQQANSILQKAISR

XP_031094640.1 uncharacterized protein LOC115999046 isoform X1 [Ipomoea triloba] | e value: 4.9e-82 | % identity: 43.96
Query:  VKDANMLFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLGELPQHKNALIKAKK--------------------
        VKDA +LF+LLD+VV+E+GE L+VQV+TDN SNY++A   LMEKRKHL+WTPCAAHCL+LMLEK+GELPQHKNALIKAKK                    
Subjt:  VKDANMLFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLGELPQHKNALIKAKK--------------------

Query:  RGLVRPATTRFATAYLTLQSICQSKQPLQAMFTSKQWQDSVWAKKPEGKEVKRIIL-----------------------------------------NSA
        R L+RPA TRFATAYLTLQSI Q +Q L+AM TS++W  S +A K +GKEV++IIL                                         + A
Subjt:  RGLVRPATTRFATAYLTLQSICQSKQPLQAMFTSKQWQDSVWAKKPEGKEVKRIIL-----------------------------------------NSA

Query:  KEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDNIIVDEAERVKADLQIDSFRRREGFFGFQQANSI
        KE+IAKNLGGEE  YKEIW IIDEKW FQ+HRH HAAAY+LNP   Y  +FSTHPEIKL L+ C+D +I +  +  KADLQ  +F  REGFFG  QA + 
Subjt:  KEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDNIIVDEAERVKADLQIDSFRRREGFFGFQQANSI

Query:  LQKAI----------------------------------------------------------------SRLKDKHLKLKALKEGEDLLVLDDLVSDDEW
        + K                                                                   +LKD+ LKLK+L+  ED L++D+L SDDE 
Subjt:  LQKAI----------------------------------------------------------------SRLKDKHLKLKALKEGEDLLVLDDLVSDDEW

Query:  IV-ENTRDSEFGVDAFIEHDDDPNINIFKLGEGSSTQQT
        +V EN  D+           DD N+++F+ GE S   QT
Subjt:  IV-ENTRDSEFGVDAFIEHDDDPNINIFKLGEGSSTQQT

XP_031097011.1 uncharacterized protein LOC116001266 [Ipomoea triloba] | e value: 1.9e-81 | % identity: 54.79
Query:  VKDANMLFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLGELPQHKNALIKAKK--------------------
        VKDA +LF+LLD+VV+E+GE L+VQV+TDNASNY++A   LMEKRKHL+WTPCAAHCL+LMLEK+GELPQHKNALIKAKK                    
Subjt:  VKDANMLFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLGELPQHKNALIKAKK--------------------

Query:  RGLVRPATTRFATAYLTLQSICQSKQPLQAMFTSKQWQDSVWAKKPEGKEVKRIIL-----------------------------------------NSA
        R L+RPA TRFATAYLTLQSI Q +QPL+AMFTS++W +S +A K +GKEV++IIL                                         + A
Subjt:  RGLVRPATTRFATAYLTLQSICQSKQPLQAMFTSKQWQDSVWAKKPEGKEVKRIIL-----------------------------------------NSA

Query:  KEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDNIIVDEAERVKADLQIDSFRRREGFFGFQQANSI
        KE+IAKNLGGEE  YKEIW IID+KW+FQ+HRH HAAAY+LNP   Y  + STHPEIKL L+ C+D +I ++ +  KADLQ  +F  REGFFG  QA S 
Subjt:  KEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDNIIVDEAERVKADLQIDSFRRREGFFGFQQANSI

Query:  LQK
        L K
Subjt:  LQK

TrEMBL top hits | e value | % identity | Alignment
A0A2I0WUQ6 DUF659 domain-containing protein | e value: 5.0e-80 | % identity: 53.74
Query:  VKDANMLFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLGELPQHKNALIKAK--------------------K
        VKDA  +F+LLD +V+EIGE+L+VQ+VTDNAS+YK+A  KLMEKRKHL+WTPCAAHC++L+LEKLG+LPQHKNAL KAK                    K
Subjt:  VKDANMLFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLGELPQHKNALIKAK--------------------K

Query:  RGLVRPATTRFATAYLTLQSICQSKQPLQAMFTSKQWQDSVWAKKPEGKEVKRIILN----------------------------------------SAK
        + ++RPA TRFATAYLTL+S+  ++QPL+AMFTS QW    WAKK EGKE+++I+LN                                         AK
Subjt:  RGLVRPATTRFATAYLTLQSICQSKQPLQAMFTSKQWQDSVWAKKPEGKEVKRIILN----------------------------------------SAK

Query:  EEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDNIIVDEAERVKADLQIDSFRRREGFFGFQ
        E IA NLGG E S++EIWNIID++WE QLHRH HAA Y+LNP +QY +N ST+PEIKL LY CMD +I D  ER  ADLQ+  FR +EGFFG Q
Subjt:  EEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDNIIVDEAERVKADLQIDSFRRREGFFGFQ

A0A2I0X4E3 Uncharacterized protein | e value: 3.8e-80 | % identity: 41.07
Query:  VKDANMLFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLGELPQHKNALIKAK--------------------K
        VK+A  +F LLD V++EIGE L+VQVVTDNAS YK+A + LMEKR HL+WTPCAAHC++L+LE LG+LPQHK+AL++AK                    K
Subjt:  VKDANMLFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLGELPQHKNALIKAK--------------------K

Query:  RGLVRPATTRFATAYLTLQSICQSKQPLQAMFTSKQWQDSVWAKKPEGKEVKRIILN----------------------------------------SAK
        + ++RPATTRFAT+YLTLQS+ + +QPL+AMFTS +W +S W KK EGKE+++IILN                                         AK
Subjt:  RGLVRPATTRFATAYLTLQSICQSKQPLQAMFTSKQWQDSVWAKKPEGKEVKRIILN----------------------------------------SAK

Query:  EEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDNIIVDEAERVKADLQIDSFRRREGFFGFQQANSIL
        E IA NLGGEEASY+EIW+IID +WE QLHRH HAAAY+LNP FQY +  S++PE+KL LY CM+ +I ++A+R  ADLQ+  FR +EGFFG   A + +
Subjt:  EEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDNIIVDEAERVKADLQIDSFRRREGFFGFQQANSIL

Query:  QKAI----------------------------------------------------------------SRLKDKHLKLKALKEGEDLLVLDDLVSDDEWI
         K                                                                   RL+D+ L+ K LK+ ED LV DDL SD+EW 
Subjt:  QKAI----------------------------------------------------------------SRLKDKHLKLKALKEGEDLLVLDDLVSDDEWI

Query:  VENTRDSEFGVDAFIEHDDDPNINIFKLGE----GSSTQQTLRSGASA
        +++  D     D  +E   D N+++ + GE    G+ST  T  +  S+
Subjt:  VENTRDSEFGVDAFIEHDDDPNINIFKLGE----GSSTQQTLRSGASA

A0A2I0XBK8 Uncharacterized protein | e value: 5.0e-80 | % identity: 40.85
Query:  VKDANMLFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLGELPQHKNALIKAK--------------------K
        VK+A  +F LLD V++EIGE L+VQVVTDNAS YK+A + LMEKR HL+WTPCAAHC++L+LE LG+LPQHK+AL++AK                    K
Subjt:  VKDANMLFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLGELPQHKNALIKAK--------------------K

Query:  RGLVRPATTRFATAYLTLQSICQSKQPLQAMFTSKQWQDSVWAKKPEGKEVKRIILN----------------------------------------SAK
        + ++RPATTRFAT+YLTLQS+ + +QPL+AMFTS +W +S W KK EGKE+++IILN                                         AK
Subjt:  RGLVRPATTRFATAYLTLQSICQSKQPLQAMFTSKQWQDSVWAKKPEGKEVKRIILN----------------------------------------SAK

Query:  EEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDNIIVDEAERVKADLQIDSFRRREGFFGFQQANSIL
        E IA NLGGEEASY+EIW+I+D +WE QLHRH HAAAY+LNP FQY +  S++PE+KL LY CM+ +I ++A+R  ADLQ+  FR +EGFFG   A + +
Subjt:  EEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDNIIVDEAERVKADLQIDSFRRREGFFGFQQANSIL

Query:  QKAI----------------------------------------------------------------SRLKDKHLKLKALKEGEDLLVLDDLVSDDEWI
         K                                                                   RL+D+ L+ K LK+ ED LV DDL SD+EW 
Subjt:  QKAI----------------------------------------------------------------SRLKDKHLKLKALKEGEDLLVLDDLVSDDEWI

Query:  VENTRDSEFGVDAFIEHDDDPNINIFKLGE----GSSTQQTLRSGASA
        +++  D     D  +E   D N+++ + GE    G+ST  T  +  S+
Subjt:  VENTRDSEFGVDAFIEHDDDPNINIFKLGE----GSSTQQTLRSGASA

A0A6J1E3R9 uncharacterized protein LOC111025802 | e value: 2.1e-110 | % identity: 72.04
Query:  AVKDANMLFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLGELPQHKNALIKAK--------------------
        A+KDANM+FQLLDDVVDEIGENL+VQVVTDNASNYKSA KKLMEKRKHL+WTPCAAHCL+LMLEKLGELPQHKNALIKAK                    
Subjt:  AVKDANMLFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLGELPQHKNALIKAK--------------------

Query:  KRGLVRPATTRFATAYLTLQSICQSKQPLQAMFTSKQWQDSVWAKKPEGKEVKRIIL-----------------------------------------NS
        KR LVRPATTRFATAYLTLQSICQSKQPLQAMFTSKQ QDSVWAKKPEGKEVKRIIL                                         +S
Subjt:  KRGLVRPATTRFATAYLTLQSICQSKQPLQAMFTSKQWQDSVWAKKPEGKEVKRIIL-----------------------------------------NS

Query:  AKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDNIIVDEAERVKADLQIDSFRRREGFFGFQQANS
        AKEEIAKN GGEEASYKEIWNIIDEKWEFQLHRH HAA YFLNPHFQYD+NFSTHPEIKL LYTC D +IVDE ERVKADLQ DSFRRREGFFGFQQA +
Subjt:  AKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDNIIVDEAERVKADLQIDSFRRREGFFGFQQANS

Query:  ILQK
          +K
Subjt:  ILQK

A0A6J1E3R9 uncharacterized protein LOC111025802 | e value: 3.3e-07 | % identity: 96.97
Query:  MDRFMVIDVGDVDADGGNKVQNQVTPTNAKEAR
        MDRFMV DVGDVDADGGNKVQNQVTPTNAKEAR
Subjt:  MDRFMVIDVGDVDADGGNKVQNQVTPTNAKEAR

A0A6J1E3R9 uncharacterized protein LOC111025802 | e value: 2.0e-81 | % identity: 44.47
Query:  VKDANMLFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLGELPQHKNALIKAK--------------------K
        VKDA  +F+LLD +V+EIGE+L+VQ+VTDNAS+YK+A  KLMEKRKHL+WTPCAAHC++L+LEKLG+LPQHKNAL KAK                    K
Subjt:  VKDANMLFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLGELPQHKNALIKAK--------------------K

Query:  RGLVRPATTRFATAYLTLQSICQSKQPLQAMFTSKQWQDSVWAKKPEGKEVKRIILN----------------------------------------SAK
        + ++RPA TRFATAYLTL+S+  ++QPL+AMFTS QW    WAKK EGKE+++I+LN                                         AK
Subjt:  RGLVRPATTRFATAYLTLQSICQSKQPLQAMFTSKQWQDSVWAKKPEGKEVKRIILN----------------------------------------SAK

Query:  EEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDNIIVDEAERVKADLQIDSFRRREGFFGFQQANSIL
        E IA NLGG E S++EIWNIID++WE QLHRH HAA Y+LNP +QY +N ST+PEIKL LY CMD +I D  ER  ADLQ+  FR +EGFFG QQA   +
Subjt:  EEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDNIIVDEAERVKADLQIDSFRRREGFFGFQQANSIL

Query:  QKAI----------------------------------------------------------------SRLKDKHLKLKALKEGEDLLVLDDLVSDDE
         K                                                                   RL+D++LK K L + ED L+ DD+ SDDE
Subjt:  QKAI----------------------------------------------------------------SRLKDKHLKLKALKEGEDLLVLDDLVSDDE

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT3G17450.1 hAT dimerisation domain-containing protein | e value: 3.0e-29 | % identity: 29.39
Query:  VKDANMLFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLGELPQHKNALIKAKK--------------------
        V+DA  LF+ LD +VD+IGE  +VQV+T N + ++SA K L EKRK+L+WTPCA HC  L+LE   +L      L KA++                    
Subjt:  VKDANMLFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLGELPQHKNALIKAKK--------------------

Query:  --RGLVRPATTRFATAYLTLQSICQSKQPLQAMFTSKQW-QDSVWAKKPEGKEVKRIILNS---------------------------------------
            L+RPA  R A+ + TLQS+   K  L+ +F S  W      AK  EG+EV++++L++                                       
Subjt:  --RGLVRPATTRFATAYLTLQSICQSKQPLQAMFTSKQW-QDSVWAKKPEGKEVKRIILNS---------------------------------------

Query:  --AKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDNIIVDEAERVKADLQIDSFRRREGFFG
          AK  I      +   Y   W +I+ +W    H   + AAYF NP ++Y  +F    E+   +  C+  +  D   R+ A +QI  +   +  FG
Subjt:  --AKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDNIIVDEAERVKADLQIDSFRRREGFFG

AT3G22220.1 hAT transposon superfamily | e value: 8.6e-24 | % identity: 29.62
Query:  LFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLGELP-----------------QHKNALIKAKK----RGLVR
        L++LL +VV+EIG+  +VQV+T    +Y +A KKLM+    L+W PCAAHC++ MLE+ G++                   H   L   +K      +V+
Subjt:  LFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLGELP-----------------QHKNALIKAKK----RGLVR

Query:  PATTRFATAYLTLQSICQSKQPLQAMFTSKQWQDSVWAKKPEG--------------------------KEVKRIILNS--------------AKEEIAK
        P  T  AT + T+  I   K  LQAM TS +W D  ++K+  G                            V RI+ +               AKE I  
Subjt:  PATTRFATAYLTLQSICQSKQPLQAMFTSKQWQDSVWAKKPEG--------------------------KEVKRIILNS--------------AKEEIAK

Query:  NLGGEEASYKEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDNIIVDEAERVKADLQIDSFRRREGFFG
        NL   E  Y   W IID  W   L +  +AA ++LNP F Y  +     EI L +  C++ ++ D   +      I+S++   G FG
Subjt:  NLGGEEASYKEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDNIIVDEAERVKADLQIDSFRRREGFFG

AT3G22220.2 hAT transposon superfamily | e value: 8.6e-24 | % identity: 29.62
Query:  LFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLGELP-----------------QHKNALIKAKK----RGLVR
        L++LL +VV+EIG+  +VQV+T    +Y +A KKLM+    L+W PCAAHC++ MLE+ G++                   H   L   +K      +V+
Subjt:  LFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLGELP-----------------QHKNALIKAKK----RGLVR

Query:  PATTRFATAYLTLQSICQSKQPLQAMFTSKQWQDSVWAKKPEG--------------------------KEVKRIILNS--------------AKEEIAK
        P  T  AT + T+  I   K  LQAM TS +W D  ++K+  G                            V RI+ +               AKE I  
Subjt:  PATTRFATAYLTLQSICQSKQPLQAMFTSKQWQDSVWAKKPEG--------------------------KEVKRIILNS--------------AKEEIAK

Query:  NLGGEEASYKEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDNIIVDEAERVKADLQIDSFRRREGFFG
        NL   E  Y   W IID  W   L +  +AA ++LNP F Y  +     EI L +  C++ ++ D   +      I+S++   G FG
Subjt:  NLGGEEASYKEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDNIIVDEAERVKADLQIDSFRRREGFFG

AT4G15020.1 hAT transposon superfamily | e value: 3.8e-24 | % identity: 27.3
Query:  VGDVDADGGNKVQNQVTPTNAK----EARNSMYAVKDANMLFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLG
        V ++++D G KV N +     K    ++ ++   +  A+ LF+LL ++V+E+G   +VQV+T     Y  A K+LM     L+W PCAAHC++ MLE+ G
Subjt:  VGDVDADGGNKVQNQVTPTNAK----EARNSMYAVKDANMLFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLG

Query:  ELPQHKNALIKAKK---------------------RGLVRPATTRFATAYLTLQSICQSKQPLQAMFTSKQWQDSVWAKKPEGKEVK-------------
        +L      + +A+                        ++ PA +  AT + TL  I + K  LQAM TS +W +  ++++P G  +              
Subjt:  ELPQHKNALIKAKK---------------------RGLVRPATTRFATAYLTLQSICQSKQPLQAMFTSKQWQDSVWAKKPEGKEVK-------------

Query:  ------------RII--------------LNSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDN
                    RI+              L  AK+ I  +L   E  Y   W IID  WE Q H    AA +FLNP   Y+ N     E+ L++  C++ 
Subjt:  ------------RII--------------LNSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDN

Query:  IIVDEAERVKADLQIDSFRRREGFFG
        ++ D+  + K   ++ S++   G FG
Subjt:  IIVDEAERVKADLQIDSFRRREGFFG

AT4G15020.2 hAT transposon superfamily | e value: 3.8e-24 | % identity: 27.3
Query:  VGDVDADGGNKVQNQVTPTNAK----EARNSMYAVKDANMLFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLG
        V ++++D G KV N +     K    ++ ++   +  A+ LF+LL ++V+E+G   +VQV+T     Y  A K+LM     L+W PCAAHC++ MLE+ G
Subjt:  VGDVDADGGNKVQNQVTPTNAK----EARNSMYAVKDANMLFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLG

Query:  ELPQHKNALIKAKK---------------------RGLVRPATTRFATAYLTLQSICQSKQPLQAMFTSKQWQDSVWAKKPEGKEVK-------------
        +L      + +A+                        ++ PA +  AT + TL  I + K  LQAM TS +W +  ++++P G  +              
Subjt:  ELPQHKNALIKAKK---------------------RGLVRPATTRFATAYLTLQSICQSKQPLQAMFTSKQWQDSVWAKKPEGKEVK-------------

Query:  ------------RII--------------LNSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDN
                    RI+              L  AK+ I  +L   E  Y   W IID  WE Q H    AA +FLNP   Y+ N     E+ L++  C++ 
Subjt:  ------------RII--------------LNSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHFHAAAYFLNPHFQYDDNFSTHPEIKLNLYTCMDN

Query:  IIVDEAERVKADLQIDSFRRREGFFG
        ++ D+  + K   ++ S++   G FG
Subjt:  IIVDEAERVKADLQIDSFRRREGFFG


Sequences
CDS sequence
ATGGATCGATTCATGGTGATTGATGTTGGTGACGTTGATGCCGATGGTGGAAATAAGGTTCAAAATCAAGTCACTCCTACAAATGCGAAGGAGGCTCGTAATTCTATGTA
TGCTGTAAAGGATGCAAATATGTTATTTCAGCTTCTGGATGACGTAGTAGATGAAATTGGAGAGAATCTTATAGTTCAAGTTGTGACCGACAATGCATCAAACTACAAGT
CGGCAAAAAAAAAGTTGATGGAAAAACGAAAGCATTTGCATTGGACTCCATGTGCTGCCCATTGTCTCAATTTGATGCTTGAGAAGCTTGGTGAACTGCCTCAACATAAG
AATGCCTTGATAAAAGCAAAAAAAAGAGGGCTTGTTCGTCCCGCCACCACCAGATTTGCTACTGCTTATTTGACATTGCAAAGCATTTGCCAATCTAAACAACCATTACA
AGCAATGTTTACCTCTAAGCAATGGCAGGATAGTGTGTGGGCAAAAAAGCCAGAAGGGAAGGAAGTTAAGAGAATAATTTTAAATTCAGCAAAGGAGGAGATTGCCAAAA
ATCTTGGAGGGGAGGAAGCAAGCTATAAAGAGATATGGAACATTATTGATGAAAAGTGGGAATTTCAACTTCACCGACATTTTCATGCCGCAGCATATTTCTTGAATCCA
CATTTCCAATATGATGATAATTTTTCCACTCATCCGGAGATCAAGTTGAATTTATATACATGTATGGACAACATAATTGTCGATGAAGCTGAAAGAGTAAAAGCTGATCT
TCAGATTGATTCATTTCGAAGGAGGGAAGGATTTTTTGGCTTCCAACAGGCAAATAGCATCTTGCAAAAAGCGATCTCCAGATTAAAGGATAAGCACTTGAAGCTAAAGG
CTCTCAAAGAGGGAGAAGATCTATTGGTACTAGATGATTTGGTGTCTGATGATGAGTGGATTGTTGAGAACACTAGGGATAGTGAATTTGGAGTTGATGCCTTTATTGAG
CATGATGATGATCCTAACATCAATATTTTTAAGCTTGGAGAAGGAAGTAGTACTCAACAGACACTCAGGAGTGGCGCCTCAGCTTGGTGGGACCAACTGCAAACAAATAG
GTGGCGTTATGAAAAGCCCCCCCGTTCGATCTTGGCCAAAGATGCAAAGGCTAATGAGGAAACGGTTCTTGCCAATGAACTATCAATACTTCTTTACCATAAATATCAAC
ATTGTATGCAAGAGGGAAGAACTATAGCCGACTACATGGAAGAGTTTCTTAGACTAGATTTGGCTCTGTTCCTAGCCATTGTTGGGGTTTTTCAACGGGTTGTTGCTACG
CAGAGGTCTCATCGCACGCTGGAGGAAGTTGTTGTCGTTAGGTCATGTTCCACCGCATTGGTCTGTCACTGTCTCAAGAGGAATCGTGCAAGGGATCAGATGTGGAAAGA
CCTTCATGCCACCATAGGTTCGTTGCCGAAGAAAAGGGTGGCCATGGGGTATGGGTCATCGCTGAACATCGTCGTAGTGGAGAGATCGTCGCTTGAATCTGAATGTCGCA
GGTCTCGATGTCGCAAATCTATTGGTACCGCTAGTCACCGTCGTACAGGGGAGCCGCTGGTCATGGGCCACTCGCTGCCTTCAAGTCATGGGTCTGCTACCACTGCAAGG
AGTCATAGGGTCGAGGGGAAGAGTGGGTCAACCTCGAATGTTGCCGGAACAAAAAATAACACCACGAAACTTGCTGGGCTTCACAAATCTGGGCTGTTCAAGGTCGTCGG
AGGGATCACGTCGCCGGTCACAGCAGAACAGCGCCGGAGAGAGCCGCAGATATGGTTGTTCGGCCGCTACAGGTGTCGGATCTGCTCGAATCTCACCGCCGCAGGCCTAC
TGAAACGTCGAGGTCCGAAGCTGCTGCCGTGGGACGCCGGATCTAAAAGTGAATCGGGCTGTTCGGCGAAGACGCGCCACCGCGAAGGATAG
mRNA sequence
ATGGATCGATTCATGGTGATTGATGTTGGTGACGTTGATGCCGATGGTGGAAATAAGGTTCAAAATCAAGTCACTCCTACAAATGCGAAGGAGGCTCGTAATTCTATGTA
TGCTGTAAAGGATGCAAATATGTTATTTCAGCTTCTGGATGACGTAGTAGATGAAATTGGAGAGAATCTTATAGTTCAAGTTGTGACCGACAATGCATCAAACTACAAGT
CGGCAAAAAAAAAGTTGATGGAAAAACGAAAGCATTTGCATTGGACTCCATGTGCTGCCCATTGTCTCAATTTGATGCTTGAGAAGCTTGGTGAACTGCCTCAACATAAG
AATGCCTTGATAAAAGCAAAAAAAAGAGGGCTTGTTCGTCCCGCCACCACCAGATTTGCTACTGCTTATTTGACATTGCAAAGCATTTGCCAATCTAAACAACCATTACA
AGCAATGTTTACCTCTAAGCAATGGCAGGATAGTGTGTGGGCAAAAAAGCCAGAAGGGAAGGAAGTTAAGAGAATAATTTTAAATTCAGCAAAGGAGGAGATTGCCAAAA
ATCTTGGAGGGGAGGAAGCAAGCTATAAAGAGATATGGAACATTATTGATGAAAAGTGGGAATTTCAACTTCACCGACATTTTCATGCCGCAGCATATTTCTTGAATCCA
CATTTCCAATATGATGATAATTTTTCCACTCATCCGGAGATCAAGTTGAATTTATATACATGTATGGACAACATAATTGTCGATGAAGCTGAAAGAGTAAAAGCTGATCT
TCAGATTGATTCATTTCGAAGGAGGGAAGGATTTTTTGGCTTCCAACAGGCAAATAGCATCTTGCAAAAAGCGATCTCCAGATTAAAGGATAAGCACTTGAAGCTAAAGG
CTCTCAAAGAGGGAGAAGATCTATTGGTACTAGATGATTTGGTGTCTGATGATGAGTGGATTGTTGAGAACACTAGGGATAGTGAATTTGGAGTTGATGCCTTTATTGAG
CATGATGATGATCCTAACATCAATATTTTTAAGCTTGGAGAAGGAAGTAGTACTCAACAGACACTCAGGAGTGGCGCCTCAGCTTGGTGGGACCAACTGCAAACAAATAG
GTGGCGTTATGAAAAGCCCCCCCGTTCGATCTTGGCCAAAGATGCAAAGGCTAATGAGGAAACGGTTCTTGCCAATGAACTATCAATACTTCTTTACCATAAATATCAAC
ATTGTATGCAAGAGGGAAGAACTATAGCCGACTACATGGAAGAGTTTCTTAGACTAGATTTGGCTCTGTTCCTAGCCATTGTTGGGGTTTTTCAACGGGTTGTTGCTACG
CAGAGGTCTCATCGCACGCTGGAGGAAGTTGTTGTCGTTAGGTCATGTTCCACCGCATTGGTCTGTCACTGTCTCAAGAGGAATCGTGCAAGGGATCAGATGTGGAAAGA
CCTTCATGCCACCATAGGTTCGTTGCCGAAGAAAAGGGTGGCCATGGGGTATGGGTCATCGCTGAACATCGTCGTAGTGGAGAGATCGTCGCTTGAATCTGAATGTCGCA
GGTCTCGATGTCGCAAATCTATTGGTACCGCTAGTCACCGTCGTACAGGGGAGCCGCTGGTCATGGGCCACTCGCTGCCTTCAAGTCATGGGTCTGCTACCACTGCAAGG
AGTCATAGGGTCGAGGGGAAGAGTGGGTCAACCTCGAATGTTGCCGGAACAAAAAATAACACCACGAAACTTGCTGGGCTTCACAAATCTGGGCTGTTCAAGGTCGTCGG
AGGGATCACGTCGCCGGTCACAGCAGAACAGCGCCGGAGAGAGCCGCAGATATGGTTGTTCGGCCGCTACAGGTGTCGGATCTGCTCGAATCTCACCGCCGCAGGCCTAC
TGAAACGTCGAGGTCCGAAGCTGCTGCCGTGGGACGCCGGATCTAAAAGTGAATCGGGCTGTTCGGCGAAGACGCGCCACCGCGAAGGATAG
Protein sequence
MDRFMVIDVGDVDADGGNKVQNQVTPTNAKEARNSMYAVKDANMLFQLLDDVVDEIGENLIVQVVTDNASNYKSAKKKLMEKRKHLHWTPCAAHCLNLMLEKLGELPQHK
NALIKAKKRGLVRPATTRFATAYLTLQSICQSKQPLQAMFTSKQWQDSVWAKKPEGKEVKRIILNSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHFHAAAYFLNP
HFQYDDNFSTHPEIKLNLYTCMDNIIVDEAERVKADLQIDSFRRREGFFGFQQANSILQKAISRLKDKHLKLKALKEGEDLLVLDDLVSDDEWIVENTRDSEFGVDAFIE
HDDDPNINIFKLGEGSSTQQTLRSGASAWWDQLQTNRWRYEKPPRSILAKDAKANEETVLANELSILLYHKYQHCMQEGRTIADYMEEFLRLDLALFLAIVGVFQRVVAT
QRSHRTLEEVVVVRSCSTALVCHCLKRNRARDQMWKDLHATIGSLPKKRVAMGYGSSLNIVVVERSSLESECRRSRCRKSIGTASHRRTGEPLVMGHSLPSSHGSATTAR
SHRVEGKSGSTSNVAGTKNNTTKLAGLHKSGLFKVVGGITSPVTAEQRRREPQIWLFGRYRCRICSNLTAAGLLKRRGPKLLPWDAGSKSESGCSAKTRHREG