CuGenDBv2

Lag0029722 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0029722
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr8:41503814..41510813
RNA-Seq Expression: Lag0029722
Synteny: Lag0029722
Gene Ontology terms:
    GO:0005634 - nucleus (cellular component)
    GO:0046983 - protein dimerization activity (molecular function)
InterPro domains:
    IPR011598 - Myc-type, basic helix-loop-helix (bHLH) domain
    IPR036638 - Helix-loop-helix DNA-binding domain superfamily
    IPR043502 - DNA/RNA polymerase superfamily
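
The genome location in this record is written as `chromosome:start..end`. As a minimal sketch, the string can be split into its parts and the span length computed. This assumes the coordinates are 1-based and inclusive, which the page does not state; the names `Locus` and `parse_locus` are hypothetical and not part of any database API.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both end points are counted.
        return self.end - self.start + 1


# Matches strings such as "chr8:41503814..41510813".
_LOCUS_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_locus(text: str) -> Locus:
    """Parse a 'seqid:start..end' string into a Locus."""
    match = _LOCUS_RE.match(text.strip())
    if match is None:
        raise ValueError(f"not a 'seqid:start..end' location: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Locus(match["seqid"], start, end)


locus = parse_locus("chr8:41503814..41510813")
print(locus.seqid, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the gene model spans 7000 bp on chr8.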


Homology
GenBank top hits (e value, % identity, alignment)
CAN81355.1 hypothetical protein VITISV_039158 [Vitis vinifera] (e value: 6.4e-63; % identity: 44.59)
Query:  VTNDCPAHIYQFISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQSK-------------------------KFSAFEGSLMSDLQLYRSVIGALQYL
        VT   P  I Q I +L+++FAL+D+G LSY+LG+EV     ++ L Q K                           S F+G LM D+ +YRSV+GALQY 
Subjt:  VTNDCPAHIYQFISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQSK-------------------------KFSAFEGSLMSDLQLYRSVIGALQYL

Query:  THTRPDISYIVNKLNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDD---------------------------LKQSVVARSNAE
        T TRPDI++ +NK  QF+ +PT  HW  VKRILRYL GTI + L L  S+ F+I AYTD D                            KQ VV+RS+AE
Subjt:  THTRPDISYIVNKLNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDD---------------------------LKQSVVARSNAE

Query:  SEYRSLAQTTAEISWVQAFLREIKLCSSSVPIIWCDNTSAISLAQNPVFHSRSKHVELDIHFVRDKVLQKSIKFWYVPSSDQHADALTKPISKSHFISLR
        SEYR LA  TAEI W QA LRE+ +  +S+P ++ DN SA  +A+NPVFH+R+KH+E+D+HF+RD+VL   ++  Y+PS+DQ AD LTK ++ S F+SLR
Subjt:  SEYRSLAQTTAEISWVQAFLREIKLCSSSVPIIWCDNTSAISLAQNPVFHSRSKHVELDIHFVRDKVLQKSIKFWYVPSSDQHADALTKPISKSHFISLR

Query:  SKLSVHLAPFRLWG
        S L +   PF L G
Subjt:  SKLSVHLAPFRLWG

RVW18104.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e value: 3.8e-63; % identity: 44.62)
Query:  VTNDCPAHIYQFISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQSK-------------------------KFSAFEGSLMSDLQLYRSVIGALQYL
        VT   P  I Q I +L+++FAL+D+G LSY+LG+EV     +M L Q K                           S F+G LM D+ +YRSV+GALQY 
Subjt:  VTNDCPAHIYQFISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQSK-------------------------KFSAFEGSLMSDLQLYRSVIGALQYL

Query:  THTRPDISYIVNKLNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDD---------------------------LKQSVVARSNAE
        T TRPDI++ VNK  QF+ +PT  HW  VKRILRYL GT  + LFL  S+ F+I AYTD D                            KQ VV+RS+AE
Subjt:  THTRPDISYIVNKLNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDD---------------------------LKQSVVARSNAE

Query:  SEYRSLAQTTAEISWVQAFLREIKLCSSSVPIIWCDNTSAISLAQNPVFHSRSKHVELDIHFVRDKVLQKSIKFWYVPSSDQHADALTKPISKSHFISLR
        SEYR LA  TAEI W QA L E+ +  +S+P ++ DN SA  +A+NPVFH+R+KH+E+D+HF+RD+VL   ++  Y+PS+DQ AD LTK ++ S F+SLR
Subjt:  SEYRSLAQTTAEISWVQAFLREIKLCSSSVPIIWCDNTSAISLAQNPVFHSRSKHVELDIHFVRDKVLQKSIKFWYVPSSDQHADALTKPISKSHFISLR

Query:  SKLSVHLAPFRLWGIL
        S L +   PF L G++
Subjt:  SKLSVHLAPFRLWGIL

RVW22017.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] (e value: 3.8e-63; % identity: 44.9)
Query:  VTNDCPAHIYQFISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQSK-------------------------KFSAFEGSLMSDLQLYRSVIGALQYL
        VT   P  I Q I +L+++FAL+D+G LSY+LG+EV     ++ L Q K                           S F+G LM D+ +YRSV+GALQY 
Subjt:  VTNDCPAHIYQFISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQSK-------------------------KFSAFEGSLMSDLQLYRSVIGALQYL

Query:  THTRPDISYIVNKLNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDD---------------------------LKQSVVARSNAE
        T TRPDI++ VNK  QF+ +PT  HW  VKRILRYL GTI + L L  S+ F+I AYTD D                            KQ VV+RS+AE
Subjt:  THTRPDISYIVNKLNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDD---------------------------LKQSVVARSNAE

Query:  SEYRSLAQTTAEISWVQAFLREIKLCSSSVPIIWCDNTSAISLAQNPVFHSRSKHVELDIHFVRDKVLQKSIKFWYVPSSDQHADALTKPISKSHFISLR
        SEYR LA  TAEI W QA LRE+ +  +S+P ++ DN SA  +A+NPVFH+R+KH+E+D+HF+RD+VL   ++  Y+PS+DQ AD LTK ++ S F+SLR
Subjt:  SEYRSLAQTTAEISWVQAFLREIKLCSSSVPIIWCDNTSAISLAQNPVFHSRSKHVELDIHFVRDKVLQKSIKFWYVPSSDQHADALTKPISKSHFISLR

Query:  SKLSVHLAPFRLWG
        S L +   PF L G
Subjt:  SKLSVHLAPFRLWG

RVW52450.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] (e value: 6.4e-63; % identity: 44.9)
Query:  VTNDCPAHIYQFISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQSK-------------------------KFSAFEGSLMSDLQLYRSVIGALQYL
        VT   P  I Q I +L+++FAL+D+G LSY+LG+EV     +M L Q K                           S F+G LM D+ +YRSV+GALQY 
Subjt:  VTNDCPAHIYQFISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQSK-------------------------KFSAFEGSLMSDLQLYRSVIGALQYL

Query:  THTRPDISYIVNKLNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDD---------------------------LKQSVVARSNAE
        T TRPDI++ VNK  QF+ +PT  HW  VKRILRYL GT  + LFL  S+ F+I AYTD D                            KQ VV+RS+AE
Subjt:  THTRPDISYIVNKLNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDD---------------------------LKQSVVARSNAE

Query:  SEYRSLAQTTAEISWVQAFLREIKLCSSSVPIIWCDNTSAISLAQNPVFHSRSKHVELDIHFVRDKVLQKSIKFWYVPSSDQHADALTKPISKSHFISLR
        SEYR LA  TAEI W QA L E+ +  +S+P ++ DN SA  +A+NPVFH+R+KH+E+D+HF+RD+VL   ++  Y+PS+DQ AD LTK ++ S F+SLR
Subjt:  SEYRSLAQTTAEISWVQAFLREIKLCSSSVPIIWCDNTSAISLAQNPVFHSRSKHVELDIHFVRDKVLQKSIKFWYVPSSDQHADALTKPISKSHFISLR

Query:  SKLSVHLAPFRLWG
        S L +   PF L G
Subjt:  SKLSVHLAPFRLWG

XP_030505199.1 uncharacterized protein LOC115720181 [Cannabis sativa] (e value: 3.8e-63; % identity: 44.59)
Query:  VTNDCPAHIYQFISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQSK-------------------------KFSAFEGSLMSDLQLYRSVIGALQYL
        VT +    +  FI+RLN +F+LKD+G L Y+LG+EV+R    ++L Q K                           S  +G  + +   YRS+IGALQYL
Subjt:  VTNDCPAHIYQFISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQSK-------------------------KFSAFEGSLMSDLQLYRSVIGALQYL

Query:  THTRPDISYIVNKLNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDD---------------------------LKQSVVARSNAE
        +HTRPDIS+ VNKL+QFLK PT +HW   KRILRYL GTI + + +  S    +  ++DDD                            KQ+VVA+S+ +
Subjt:  THTRPDISYIVNKLNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDD---------------------------LKQSVVARSNAE

Query:  SEYRSLAQTTAEISWVQAFLREIKLCSSSVPIIWCDNTSAISLAQNPVFHSRSKHVELDIHFVRDKVLQKSIKFWYVPSSDQHADALTKPISKSHFISLR
        SEYR+LAQ  AEI+WVQ  L+E+K+  ++ PIIWCDN SA +LA N VFH+R+KH+ELD+HFVRDKVL+K I+  YVP+ DQ AD LTK +S + F  L 
Subjt:  SEYRSLAQTTAEISWVQAFLREIKLCSSSVPIIWCDNTSAISLAQNPVFHSRSKHVELDIHFVRDKVLQKSIKFWYVPSSDQHADALTKPISKSHFISLR

Query:  SKLSVHLAPFRLWG
         KL V  +P RL G
Subjt:  SKLSVHLAPFRLWG

TrEMBL top hits (e value, % identity, alignment)
A0A438CFM4 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 1.8e-63; % identity: 44.9)
Query:  VTNDCPAHIYQFISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQSK-------------------------KFSAFEGSLMSDLQLYRSVIGALQYL
        VT   P  I Q I +L+++FAL+D+G LSY+LG+EV     ++ L Q K                           S F+G LM D+ +YRSV+GALQY 
Subjt:  VTNDCPAHIYQFISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQSK-------------------------KFSAFEGSLMSDLQLYRSVIGALQYL

Query:  THTRPDISYIVNKLNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDD---------------------------LKQSVVARSNAE
        T TRPDI++ VNK  QF+ +PT  HW  VKRILRYL GTI + L L  S+ F+I AYTD D                            KQ VV+RS+AE
Subjt:  THTRPDISYIVNKLNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDD---------------------------LKQSVVARSNAE

Query:  SEYRSLAQTTAEISWVQAFLREIKLCSSSVPIIWCDNTSAISLAQNPVFHSRSKHVELDIHFVRDKVLQKSIKFWYVPSSDQHADALTKPISKSHFISLR
        SEYR LA  TAEI W QA LRE+ +  +S+P ++ DN SA  +A+NPVFH+R+KH+E+D+HF+RD+VL   ++  Y+PS+DQ AD LTK ++ S F+SLR
Subjt:  SEYRSLAQTTAEISWVQAFLREIKLCSSSVPIIWCDNTSAISLAQNPVFHSRSKHVELDIHFVRDKVLQKSIKFWYVPSSDQHADALTKPISKSHFISLR

Query:  SKLSVHLAPFRLWG
        S L +   PF L G
Subjt:  SKLSVHLAPFRLWG

A0A803NFK9 Uncharacterized protein (e value: 1.4e-63; % identity: 45.49)
Query:  VTNDCPAHIYQFISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQS-------------------------KKFSAFEGSLMSDLQLYRSVIGALQYL
        VT  C   + +F+++L+ VF+LKD+G L ++LG+EV+R     +L QS                         K+ S + G+ M+D   Y+SV+GALQYL
Subjt:  VTNDCPAHIYQFISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQS-------------------------KKFSAFEGSLMSDLQLYRSVIGALQYL

Query:  THTRPDISYIVNKLNQFLKQPTVIHWQGVKRILRYLTGTIDN-ALFLPRSSSFSIIAYTDDDLKQSVVARSNAESEYRSLAQTTAEISWVQAFLREIKLC
        +HTR DIS++ NKL+QFLK PT +HW   KR+LRYL G++D   +FL      S+++++    KQ+VVARS+ ESEYR+L    AE+SW+Q  L+E+K  
Subjt:  THTRPDISYIVNKLNQFLKQPTVIHWQGVKRILRYLTGTIDN-ALFLPRSSSFSIIAYTDDDLKQSVVARSNAESEYRSLAQTTAEISWVQAFLREIKLC

Query:  SSSVPIIWCDNTSAISLAQNPVFHSRSKHVELDIHFVRDKVLQKSIKFWYVPSSDQHADALTKPISKSHFISLRSKLSVHLAPFRLWG
         S+ P+IWCDN SA +LA NPV+H+R+KH+ELD+HFVRDKVL+K ++  Y+PS +Q  + LTK +S S F  L  KL    +PFRL G
Subjt:  SSSVPIIWCDNTSAISLAQNPVFHSRSKHVELDIHFVRDKVLQKSIKFWYVPSSDQHADALTKPISKSHFISLRSKLSVHLAPFRLWG

A0A803NUC9 Uncharacterized protein (e value: 1.7e-64; % identity: 49.64)
Query:  FISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQ-------------------------SKKFSAFEGSLMSDLQLYRSVIGALQYLTHTRPDISYIV
        FI+RLN +F+LKD+G L Y+LG+E +R    ++L Q                         +K  S  +G  M+D  LYRSVIGALQYL+HTRPDIS+ V
Subjt:  FISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQ-------------------------SKKFSAFEGSLMSDLQLYRSVIGALQYLTHTRPDISYIV

Query:  NKLNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDDLKQSVVARSNAESEYRSLAQTTAEISWVQAFLREIKLCSSSVPIIWCDNT
        NKL+QFLK PT +HW   KR+LRYL GT+++   +      ++I+++    KQ+VVARS+ ESEYR+LAQ  AE++W+Q  L+E+K    + PIIWCDN 
Subjt:  NKLNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDDLKQSVVARSNAESEYRSLAQTTAEISWVQAFLREIKLCSSSVPIIWCDNT

Query:  SAISLAQNPVFHSRSKHVELDIHFVRDKVLQKSIKFWYVPSSDQHADALTKPISKSHFISLRSKLSVHLAPFRLWG
        SA +LA NPV+H+R+KH+ELDIHFVRDKVLQK ++  Y PS DQ  D LTK +S S F  L  KL V  +P RL G
Subjt:  SAISLAQNPVFHSRSKHVELDIHFVRDKVLQKSIKFWYVPSSDQHADALTKPISKSHFISLRSKLSVHLAPFRLWG

A0A803PR46 Uncharacterized protein (e value: 1.9e-68; % identity: 47.74)
Query:  VTNDCPAHIYQFISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQ-------------------------SKKFSAFEGSLMSDLQLYRSVIGALQYL
        +T +    +  FI+RL+ VFALKD+G LS++LG+EV+R    M+L Q                          K  S +EG+ + D   YRSVIGALQYL
Subjt:  VTNDCPAHIYQFISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQ-------------------------SKKFSAFEGSLMSDLQLYRSVIGALQYL

Query:  THTRPDISYIVNKLNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDDLKQSVVARSNAESEYRSLAQTTAEISWVQAFLREIKLCS
        THTRPDI++ VNKL+QFLK PT  HW   KR+LRYL GT+ + + +  S   ++ A++    KQ+VVARS+ ESEYR+LAQ TAE++W+++ L+EI    
Subjt:  THTRPDISYIVNKLNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDDLKQSVVARSNAESEYRSLAQTTAEISWVQAFLREIKLCS

Query:  SSVPIIWCDNTSAISLAQNPVFHSRSKHVELDIHFVRDKVLQKSIKFWYVPSSDQHADALTKPISKSHFISLRSKLSVHLAPFRLWG
         SVP++WCDN SA +LA NPV+H+R+KH+ELD+HF+RDKVLQK I+  ++ S DQ AD LTK ++   F  L  KL V  +P RL G
Subjt:  SSVPIIWCDNTSAISLAQNPVFHSRSKHVELDIHFVRDKVLQKSIKFWYVPSSDQHADALTKPISKSHFISLRSKLSVHLAPFRLWG

A0A803QD96 Uncharacterized protein (e value: 1.3e-64; % identity: 44.9)
Query:  VTNDCPAHIYQFISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQ-------------------------SKKFSAFEGSLMSDLQLYRSVIGALQYL
        VT +    + QF  +LN VFALKD+GLL Y+LG+EV+R    M+L Q                          K  S  +G+L+++   YRS+IG LQYL
Subjt:  VTNDCPAHIYQFISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQ-------------------------SKKFSAFEGSLMSDLQLYRSVIGALQYL

Query:  THTRPDISYIVNKLNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDD---------------------------LKQSVVARSNAE
        THTRPDIS+ VNKL+QFLK PT +HW   KRILRYL  T D+ L +  S   ++ ++TD D                            KQ+VV+RS+ E
Subjt:  THTRPDISYIVNKLNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDD---------------------------LKQSVVARSNAE

Query:  SEYRSLAQTTAEISWVQAFLREIKLCSSSVPIIWCDNTSAISLAQNPVFHSRSKHVELDIHFVRDKVLQKSIKFWYVPSSDQHADALTKPISKSHFISLR
        SEYR+LAQ TAE++W+Q+ L+E++      PIIWCDN  A +LA NPV+H+R+KH+ELD+HFVRDKVL K++   Y+PS DQ AD LTK +S S F  L 
Subjt:  SEYRSLAQTTAEISWVQAFLREIKLCSSSVPIIWCDNTSAISLAQNPVFHSRSKHVELDIHFVRDKVLQKSIKFWYVPSSDQHADALTKPISKSHFISLR

Query:  SKLSVHLAPFRLWG
         KL V   P  L G
Subjt:  SKLSVHLAPFRLWG

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 9.1e-28; % identity: 29.37)
Query:  ISRLNNV-------FALKDIGLLSYYLGVEVYRTNDNMFLLQS-------KKF-----SAFEGSLMSDLQL------------YRSVIGALQY-LTHTRP
        ++R+NN        F + D+  + +++G+ +    D ++L QS        KF     +A    L S +               RS+IG L Y +  TRP
Subjt:  ISRLNNV-------FALKDIGLLSYYLGVEVYRTNDNMFLLQS-------KKF-----SAFEGSLMSDLQL------------YRSVIGALQY-LTHTRP

Query:  DISYIVNKLNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSF--SIIAYTDDD----------------------------LKQSVVARSNAESE
        D++  VN L+++  +     WQ +KR+LRYL GTID  L   ++ +F   II Y D D                             +Q+ VA S+ E+E
Subjt:  DISYIVNKLNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSF--SIIAYTDDD----------------------------LKQSVVARSNAESE

Query:  YRSLAQTTAEISWVQAFLREIKLCSSSVPIIWCDNTSAISLAQNPVFHSRSKHVELDIHFVRDKVLQKSIKFWYVPSSDQHADALTKPISKSHFISLRSK
        Y +L +   E  W++  L  I +   +   I+ DN   IS+A NP  H R+KH+++  HF R++V    I   Y+P+ +Q AD  TKP+  + F+ LR K
Subjt:  YRSLAQTTAEISWVQAFLREIKLCSSSVPIIWCDNTSAISLAQNPVFHSRSKHVELDIHFVRDKVLQKSIKFWYVPSSDQHADALTKPISKSHFISLRSK

Query:  LSV
        L +
Subjt:  LSV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 5.5e-25; % identity: 31.19)
Query:  YRSVIGALQY-LTHTRPDISYIVNKLNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDDLK-------------------------
        Y S +G+L Y +  TRPDI++ V  +++FL+ P   HW+ VK ILRYL GT  + L    S    +  YTD D+                          
Subjt:  YRSVIGALQY-LTHTRPDISYIVNKLNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDDLK-------------------------

Query:  --QSVVARSNAESEYRSLAQTTAEISWVQAFLREIKLCSSSVPIIWCDNTSAISLAQNPVFHSRSKHVELDIHFVRDKVLQKSIKFWYVPSSDQHADALT
          Q  VA S  E+EY +  +T  E+ W++ FL+E+ L      +++CD+ SAI L++N ++H+R+KH+++  H++R+ V  +S+K   + +++  AD LT
Subjt:  --QSVVARSNAESEYRSLAQTTAEISWVQAFLREIKLCSSSVPIIWCDNTSAISLAQNPVFHSRSKHVELDIHFVRDKVLQKSIKFWYVPSSDQHADALT

Query:  KPISKSHFISLRSKLSVH
        K + ++ F   +  + +H
Subjt:  KPISKSHFISLRSKLSVH

P92519 Uncharacterized mitochondrial protein AtMg00810 (e value: 1.4e-20; % identity: 31.22)
Query:  ISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQSKKF------------------------SAFEGSLMSDLQLYRSVIGALQYLTHTRPDISYIVNK
        I +L++ F++KD+G + Y+LG+++      +FL Q+K                          S+   +   D   +RS++GALQYLT TRPDISY VN 
Subjt:  ISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQSKKF------------------------SAFEGSLMSDLQLYRSVIGALQYLTHTRPDISYIVNK

Query:  LNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDD---------------------------LKQSVVARSNAESEYRSLAQTTAEI
        + Q + +PT+  +  +KR+LRY+ GTI + L++ ++S  ++ A+ D D                            +Q  V+RS+ E+EYR+LA T AE+
Subjt:  LNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDD---------------------------LKQSVVARSNAESEYRSLAQTTAEI

Query:  SWVQA
        +W  A
Subjt:  SWVQA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 2.5e-49; % identity: 35.6)
Query:  VTNDCPAHIYQFISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQ-------------------------SKKFSAFEGSLMSDLQLYRSVIGALQYL
        +T + P  ++  +  L+  F++KD   L Y+LG+E  R    + L Q                         S K S + G+ ++D   YR ++G+LQYL
Subjt:  VTNDCPAHIYQFISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQ-------------------------SKKFSAFEGSLMSDLQLYRSVIGALQYL

Query:  THTRPDISYIVNKLNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDD---------------------------LKQSVVARSNAE
          TRPDISY VN+L+QF+  PT  H Q +KRILRYL GT ++ +FL + ++ S+ AY+D D                            KQ  V RS+ E
Subjt:  THTRPDISYIVNKLNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDD---------------------------LKQSVVARSNAE

Query:  SEYRSLAQTTAEISWVQAFLREIKLCSSSVPIIWCDNTSAISLAQNPVFHSRSKHVELDIHFVRDKVLQKSIKFWYVPSSDQHADALTKPISKSHFISLR
        +EYRS+A T++E+ W+ + L E+ +  +  P+I+CDN  A  L  NPVFHSR KH+ +D HF+R++V   +++  +V + DQ AD LTKP+S++ F +  
Subjt:  SEYRSLAQTTAEISWVQAFLREIKLCSSSVPIIWCDNTSAISLAQNPVFHSRSKHVELDIHFVRDKVLQKSIKFWYVPSSDQHADALTKPISKSHFISLR

Query:  SKLSVHLAP
        SK+ V   P
Subjt:  SKLSVHLAP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 3.0e-47; % identity: 34.97)
Query:  ISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQ-------------------------SKKFSAFEGSLMSDLQLYRSVIGALQYLTHTRPDISYIVN
        +  L+  F++K+   L Y+LG+E  R    + L Q                         S K +   G+ + D   YR ++G+LQYL  TRPD+SY VN
Subjt:  ISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQ-------------------------SKKFSAFEGSLMSDLQLYRSVIGALQYLTHTRPDISYIVN

Query:  KLNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDD---------------------------LKQSVVARSNAESEYRSLAQTTAE
        +L+Q++  PT  HW  +KR+LRYL GT D+ +FL + ++ S+ AY+D D                            KQ  V RS+ E+EYRS+A T++E
Subjt:  KLNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDD---------------------------LKQSVVARSNAESEYRSLAQTTAE

Query:  ISWVQAFLREIKLCSSSVPIIWCDNTSAISLAQNPVFHSRSKHVELDIHFVRDKVLQKSIKFWYVPSSDQHADALTKPISKSHFISLRSKLSVHLAPFRL
        + W+ + L E+ +  S  P+I+CDN  A  L  NPVFHSR KH+ LD HF+R++V   +++  +V + DQ AD LTKP+S+  F +   K+ V   P   
Subjt:  ISWVQAFLREIKLCSSSVPIIWCDNTSAISLAQNPVFHSRSKHVELDIHFVRDKVLQKSIKFWYVPSSDQHADALTKPISKSHFISLRSKLSVHLAPFRL

Query:  WGILEL
         G+L +
Subjt:  WGILEL

Arabidopsis top hits (e value, % identity, alignment)
AT2G16910.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein (e value: 1.2e-11; % identity: 29.03)
Query:  KSKNLHAERRRRQKLSDRLLLLRA------TMNKATIIDDAITYIQQLQKTVDILKDQLVELEASSE---------------------------KILCPP
        ++KNL AERRRR+KL+DRL  LR+       +++A+I+ DAI Y+++LQ     L+D+L E   + +                            +    
Subjt:  KSKNLHAERRRRQKLSDRLLLLRA------TMNKATIIDDAITYIQQLQKTVDILKDQLVELEASSE---------------------------KILCPP

Query:  PRIDLKKSYTQG-----DVNVSQIDEHRLWIKILFEKRKGAFTKLIQALNSLGFELIDTSVTTVKGAVLLTSIINIH-IANELVSS
          +DL+ S  +G      V+V+Q+D    ++K++ E + G FT+L++AL+SLG E+  T+  T +   L++++  +    NE+V +
Subjt:  PRIDLKKSYTQG-----DVNVSQIDEHRLWIKILFEKRKGAFTKLIQALNSLGFELIDTSVTTVKGAVLLTSIINIH-IANELVSS

AT4G21330.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein (e value: 6.5e-21; % identity: 38.1)
Query:  DESTEYKSKNLHAERRRRQKLSDRLLLLRA------TMNKATIIDDAITYIQQLQKTVDILKDQLVELEASSEKILCPPPRID-----------------
        +E   +KS NL AERRRR+KL  RL+ LR+       M KA+I++DAITYI +LQ  V  L +   E+E +       PP ID                 
Subjt:  DESTEYKSKNLHAERRRRQKLSDRLLLLRA------TMNKATIIDDAITYIQQLQKTVDILKDQLVELEASSEKILCPPPRID-----------------

Query:  ---LKKSYTQGDVNVSQIDEHRLWIKILFEKRKGAFTKLIQALNSLGFELIDTSVTTVKGAVLLTSII
           +KK   + +V + +I E + W+KI+ EKR G FTK ++ +  LGFE+ID S+TT  GA+L+++ +
Subjt:  ---LKKSYTQGDVNVSQIDEHRLWIKILFEKRKGAFTKLIQALNSLGFELIDTSVTTVKGAVLLTSII

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 5.5e-36; % identity: 32.75)
Query:  AHIYQFISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQSK-------------------------KFSAFEGSLMSDLQLYRSVIGALQYLTHTRPD
        A + +  S+L + F L+D+G L Y+LG+E+ R+   + + Q K                          FSA  G    D + YR +IG L YL  TR D
Subjt:  AHIYQFISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQSK-------------------------KFSAFEGSLMSDLQLYRSVIGALQYLTHTRPD

Query:  ISYIVNKLNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDDL---------------------------KQSVVARSNAESEYRSL
        IS+ VNKL+QF + P + H Q V +IL Y+ GT+   LF    +   +  ++D                              KQ VV++S+AE+EYR+L
Subjt:  ISYIVNKLNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDDL---------------------------KQSVVARSNAESEYRSL

Query:  AQTTAEISWVQAFLREIKLCSSSVPIIWCDNTSAISLAQNPVFHSRSKHVELDIHFVRDK-VLQKSIKFWYVPSSDQHADALTKPIS
        +  T E+ W+  F RE++L  S   +++CDNT+AI +A N VFH R+KH+E D H VR++ V Q ++ + +    +Q  D  T+ +S
Subjt:  AQTTAEISWVQAFLREIKLCSSSVPIIWCDNTSAISLAQNPVFHSRSKHVELDIHFVRDK-VLQKSIKFWYVPSSDQHADALTKPIS

AT5G57150.3 basic helix-loop-helix (bHLH) DNA-binding superfamily protein (e value: 1.1e-09; % identity: 28.64)
Query:  DMAYAGSGPDEDDSGGRWITGQRRPTDESTEYKSKNLHAERRRRQKLSDRLLLLRAT------MNKATIIDDAITYIQQLQKTVDILKDQLVELEASS--
        D  Y  S P E+   G + +    P   ++   SKN+ +ER RRQKL+ RL  LR+       M+KA+II DAI+YI+ LQ     L+ ++ ELE++   
Subjt:  DMAYAGSGPDEDDSGGRWITGQRRPTDESTEYKSKNLHAERRRRQKLSDRLLLLRAT------MNKATIIDDAITYIQQLQKTVDILKDQLVELEASS--

Query:  --------EKILCPPPRIDLKKSYTQG---------DVNVSQIDEHRLWIKILFEKRKGAFTKLIQALNSLGFELIDTSVTTVKGAVLLTSIINIHIAN
                ++ L  P      K    G         ++ V+ + E  + + +   KR     KL +   SL  +++ +++T+  G +  T  I I IAN
Subjt:  --------EKILCPPPRIDLKKSYTQG---------DVNVSQIDEHRLWIKILFEKRKGAFTKLIQALNSLGFELIDTSVTTVKGAVLLTSIINIHIAN

ATMG00810.1 DNA/RNA polymerases superfamily protein (e value: 1.0e-21; % identity: 31.22)
Query:  ISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQSKKF------------------------SAFEGSLMSDLQLYRSVIGALQYLTHTRPDISYIVNK
        I +L++ F++KD+G + Y+LG+++      +FL Q+K                          S+   +   D   +RS++GALQYLT TRPDISY VN 
Subjt:  ISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQSKKF------------------------SAFEGSLMSDLQLYRSVIGALQYLTHTRPDISYIVNK

Query:  LNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDD---------------------------LKQSVVARSNAESEYRSLAQTTAEI
        + Q + +PT+  +  +KR+LRY+ GTI + L++ ++S  ++ A+ D D                            +Q  V+RS+ E+EYR+LA T AE+
Subjt:  LNQFLKQPTVIHWQGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDD---------------------------LKQSVVARSNAESEYRSLAQTTAEI

Query:  SWVQA
        +W  A
Subjt:  SWVQA
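
Each alignment block above pairs a Query line with a match line, in which a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. The following is a minimal sketch of how identical positions in one block could be counted from those two lines. It is an illustration only: the function names are hypothetical, the percentage is taken over the columns of a single block, and it is not claimed to reproduce how the database computed the % identity values listed for each hit.

```python
def count_identities(query: str, midline: str) -> int:
    """Count columns where the match line repeats the query residue."""
    # Trailing blanks in the match line are often stripped; pad to query length.
    midline = midline.ljust(len(query))
    return sum(
        1
        for q, m in zip(query, midline)
        if q != "-" and m == q
    )


def block_percent_identity(query: str, midline: str) -> float:
    """Identical columns as a percentage of all columns in this block."""
    if not query:
        raise ValueError("empty alignment block")
    return 100.0 * count_identities(query, midline) / len(query)


# Last block of the first GenBank hit, with the 8-character line prefix removed.
query = "SKLSVHLAPFRLWG"
midline = "S L +   PF L G"
print(count_identities(query, midline))                   # 6
print(round(block_percent_identity(query, midline), 2))   # 42.86
```

Summing identities and columns over all blocks of a hit, rather than averaging per-block percentages, would give a whole-alignment figure.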


Sequences
CDS sequence
ATGGACTTAAGCAGACACCTCGAGCATGTTACCAATGACTGTCCTGCTCACATATATCAATTTATTAGTCGACTGAATAACGTTTTTGCCCTCAAAGATATTGGTTTGTT
GAGTTATTATCTAGGGGTTGAGGTTTATCGGACTAATGACAACATGTTTTTACTCCAGTCCAAGAAGTTCTCCGCTTTTGAAGGCTCTCTCATGTCTGATCTGCAACTTT
ATCGCAGTGTAATTGGGGCTCTCCAATACCTTACTCATACTCGTCCGGACATCTCTTATATTGTCAATAAACTCAACCAGTTTCTAAAGCAGCCCACTGTTATACATTGG
CAAGGAGTGAAACGTATTCTACGGTACTTAACAGGCACTATTGACAATGCTTTGTTTCTCCCTCGGTCTTCGTCATTCTCTATTATTGCTTACACTGATGATGATTTGAA
ACAAAGTGTTGTGGCTCGTTCCAATGCGGAGTCTGAGTATCGTTCTCTTGCTCAAACTACTGCTGAAATATCATGGGTTCAAGCTTTTCTTCGAGAAATTAAGCTATGTT
CGTCCTCTGTTCCTATCATTTGGTGTGATAACACCAGTGCAATCTCTTTGGCTCAAAATCCGGTGTTCCATTCGCGAAGCAAACATGTTGAACTTGACATTCATTTTGTC
CGAGACAAAGTTCTTCAGAAGTCGATTAAGTTTTGGTATGTGCCTTCTTCCGACCAGCATGCTGATGCCTTAACTAAACCGATCTCGAAAAGTCATTTCATCTCTCTTCG
CTCCAAACTCAGTGTGCATCTCGCACCCTTTCGTTTGTGGGGGATATTAGAGCTAAAGATTACTGTGCAGTTGAGGAGAATTTACCAGGCGAGAAGAGCTTTCCGCCCCA
CCAAGAGAGACGAGATCAACATTCTCAACGACATGGCATACGCTGGCTCTGGACCAGACGAAGATGATAGTGGCGGCAGATGGATAACGGGCCAACGACGACCCACCGAT
GAATCCACCGAATACAAATCGAAGAACCTCCATGCAGAGAGACGACGGAGGCAAAAGCTTAGCGATAGACTACTATTACTTCGTGCTACTATGAATAAAGCAACCATCAT
CGACGACGCTATAACCTACATCCAGCAGCTACAGAAGACGGTTGACATTCTCAAAGACCAGCTTGTTGAATTGGAGGCCTCATCTGAAAAAATATTATGCCCACCACCAC
GAATAGACTTAAAGAAGAGTTATACTCAGGGAGACGTGAACGTTTCTCAAATTGACGAACACAGACTCTGGATTAAAATACTTTTCGAGAAGCGGAAAGGCGCATTCACT
AAATTAATTCAAGCGTTGAATTCTCTAGGCTTTGAACTCATTGATACTAGTGTCACGACAGTAAAAGGAGCAGTTCTCTTAACCAGCATTATCAACATCCACATTGCAAA
TGAGTTGGTTTCTTCTGTCGCCGCCGTTGAAGGGAGCGCCACCGCCATCTCTAGCTTGCATTTCCCTCTCCTTTCGTGGCATTCTTTCTTCCTCTCTCTCGCCGAGTCGC
ACCGTCGCCATGAAGCCCAACCCTTTGCGCCTCGCCCCTATGCGTTTCTTCTCTCTCTCGGTGTTTCTCCTCCTGCAGCGCTGTTCTCCACCAGAAATCGAGACCCTTGC
ACCACAGCCGTGAAAACCCAGCCCGAGCCCTGCGTGGAGGTCGTTCAACTTGTGAATCTTTCCCCCTCTCTTGTTCAGTCCATGGCCCAAGCTGCACCTGAAAGTTTTGG
AATTCGCCTAGCGTCGAGACGCTGTAGGGATAGCGTCACGACGCTGTCCGATTTCTTGGCCAGCAAGTCGATGACGTCACAGCGTTGTGACGCTGTCCAATTTCCGGCCT
ATAAATAG
mRNA sequence
ATGGACTTAAGCAGACACCTCGAGCATGTTACCAATGACTGTCCTGCTCACATATATCAATTTATTAGTCGACTGAATAACGTTTTTGCCCTCAAAGATATTGGTTTGTT
GAGTTATTATCTAGGGGTTGAGGTTTATCGGACTAATGACAACATGTTTTTACTCCAGTCCAAGAAGTTCTCCGCTTTTGAAGGCTCTCTCATGTCTGATCTGCAACTTT
ATCGCAGTGTAATTGGGGCTCTCCAATACCTTACTCATACTCGTCCGGACATCTCTTATATTGTCAATAAACTCAACCAGTTTCTAAAGCAGCCCACTGTTATACATTGG
CAAGGAGTGAAACGTATTCTACGGTACTTAACAGGCACTATTGACAATGCTTTGTTTCTCCCTCGGTCTTCGTCATTCTCTATTATTGCTTACACTGATGATGATTTGAA
ACAAAGTGTTGTGGCTCGTTCCAATGCGGAGTCTGAGTATCGTTCTCTTGCTCAAACTACTGCTGAAATATCATGGGTTCAAGCTTTTCTTCGAGAAATTAAGCTATGTT
CGTCCTCTGTTCCTATCATTTGGTGTGATAACACCAGTGCAATCTCTTTGGCTCAAAATCCGGTGTTCCATTCGCGAAGCAAACATGTTGAACTTGACATTCATTTTGTC
CGAGACAAAGTTCTTCAGAAGTCGATTAAGTTTTGGTATGTGCCTTCTTCCGACCAGCATGCTGATGCCTTAACTAAACCGATCTCGAAAAGTCATTTCATCTCTCTTCG
CTCCAAACTCAGTGTGCATCTCGCACCCTTTCGTTTGTGGGGGATATTAGAGCTAAAGATTACTGTGCAGTTGAGGAGAATTTACCAGGCGAGAAGAGCTTTCCGCCCCA
CCAAGAGAGACGAGATCAACATTCTCAACGACATGGCATACGCTGGCTCTGGACCAGACGAAGATGATAGTGGCGGCAGATGGATAACGGGCCAACGACGACCCACCGAT
GAATCCACCGAATACAAATCGAAGAACCTCCATGCAGAGAGACGACGGAGGCAAAAGCTTAGCGATAGACTACTATTACTTCGTGCTACTATGAATAAAGCAACCATCAT
CGACGACGCTATAACCTACATCCAGCAGCTACAGAAGACGGTTGACATTCTCAAAGACCAGCTTGTTGAATTGGAGGCCTCATCTGAAAAAATATTATGCCCACCACCAC
GAATAGACTTAAAGAAGAGTTATACTCAGGGAGACGTGAACGTTTCTCAAATTGACGAACACAGACTCTGGATTAAAATACTTTTCGAGAAGCGGAAAGGCGCATTCACT
AAATTAATTCAAGCGTTGAATTCTCTAGGCTTTGAACTCATTGATACTAGTGTCACGACAGTAAAAGGAGCAGTTCTCTTAACCAGCATTATCAACATCCACATTGCAAA
TGAGTTGGTTTCTTCTGTCGCCGCCGTTGAAGGGAGCGCCACCGCCATCTCTAGCTTGCATTTCCCTCTCCTTTCGTGGCATTCTTTCTTCCTCTCTCTCGCCGAGTCGC
ACCGTCGCCATGAAGCCCAACCCTTTGCGCCTCGCCCCTATGCGTTTCTTCTCTCTCTCGGTGTTTCTCCTCCTGCAGCGCTGTTCTCCACCAGAAATCGAGACCCTTGC
ACCACAGCCGTGAAAACCCAGCCCGAGCCCTGCGTGGAGGTCGTTCAACTTGTGAATCTTTCCCCCTCTCTTGTTCAGTCCATGGCCCAAGCTGCACCTGAAAGTTTTGG
AATTCGCCTAGCGTCGAGACGCTGTAGGGATAGCGTCACGACGCTGTCCGATTTCTTGGCCAGCAAGTCGATGACGTCACAGCGTTGTGACGCTGTCCAATTTCCGGCCT
ATAAATAG
Protein sequence
MDLSRHLEHVTNDCPAHIYQFISRLNNVFALKDIGLLSYYLGVEVYRTNDNMFLLQSKKFSAFEGSLMSDLQLYRSVIGALQYLTHTRPDISYIVNKLNQFLKQPTVIHW
QGVKRILRYLTGTIDNALFLPRSSSFSIIAYTDDDLKQSVVARSNAESEYRSLAQTTAEISWVQAFLREIKLCSSSVPIIWCDNTSAISLAQNPVFHSRSKHVELDIHFV
RDKVLQKSIKFWYVPSSDQHADALTKPISKSHFISLRSKLSVHLAPFRLWGILELKITVQLRRIYQARRAFRPTKRDEINILNDMAYAGSGPDEDDSGGRWITGQRRPTD
ESTEYKSKNLHAERRRRQKLSDRLLLLRATMNKATIIDDAITYIQQLQKTVDILKDQLVELEASSEKILCPPPRIDLKKSYTQGDVNVSQIDEHRLWIKILFEKRKGAFT
KLIQALNSLGFELIDTSVTTVKGAVLLTSIINIHIANELVSSVAAVEGSATAISSLHFPLLSWHSFFLSLAESHRRHEAQPFAPRPYAFLLSLGVSPPAALFSTRNRDPC
TTAVKTQPEPCVEVVQLVNLSPSLVQSMAQAAPESFGIRLASRRCRDSVTTLSDFLASKSMTSQRCDAVQFPAYK
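
The protein sequence above corresponds to the CDS read codon by codon. As a minimal sketch, assuming the standard nuclear genetic code, a translation routine can be used to confirm that the two listings agree. The function name `translate` and its `to_stop` parameter are illustrative choices, and unknown codons are rendered as `X` by assumption.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence in frame 0.

    Whitespace and line breaks are ignored, so a sequence pasted across
    several lines can be passed in directly. Trailing bases that do not
    fill a codon are dropped.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS listed above.
print(translate("ATGGACTTAAGCAGACACCTCGAGCATGTT"))  # MDLSRHLEHV
```

Passing the full CDS text and comparing the result with the protein listing (after removing line breaks from both) is a quick consistency check for this record.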