CuGenDBv2

Lag0027200 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0027200
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr10:45799170..45800117
RNA-Seq Expression: Lag0027200
Synteny: Lag0027200
Gene Ontology terms: GO:0005488 - binding (molecular function)
InterPro domains: NA
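The genome location field can be turned into a span length with a few lines of Python. This is only a sketch: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which the record itself does not state.

```python
import re

def parse_location(location: str):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end = parse_location("chr10:45799170..45800117")
print(chrom, span_length(start, end))  # chr10 948
```

Under that assumption the locus spans 948 bp, a multiple of three, so the span can be compared directly with the length of the CDS given for this gene.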


Homology
GenBank top hits (e value, % identity, alignment)
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] (e value: 8.5e-56; % identity: 42.26)
Query:  MDSPSTEKSGSDLDNASQNLKIVNPGSKITTVKLDEDNFLLWKLQILTALRGHGLKHHIEEDVEIPSKF-VASDSSSAS-TKIPNPAYEHWVRQDNLITA
        M S S+     + + +S   +I   G+KI+ VKL++D FLLWK QILTAL  + L++ +E + E PSK+ ++++SSSAS T  PNPAY+ W RQD LI++
Subjt:  MDSPSTEKSGSDLDNASQNLKIVNPGSKITTVKLDEDNFLLWKLQILTALRGHGLKHHIEEDVEIPSKF-VASDSSSAS-TKIPNPAYEHWVRQDNLITA

Query:  WLLGSMSNSLLSEMLECDSAREVWRILNNQFSSRNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNIVDSLAAAGRKMSHEDHVLHLCAGLGTEYDSAVSV
        WLLGSMS  +L++ML C SA+E+W  L   FSSR +A+ M  K+KL + KKG++ L++YFLK+   VD+LA+  + +S +DH+L++ AGLG++Y S +SV
Subjt:  WLLGSMSNSLLSEMLECDSAREVWRILNNQFSSRNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNIVDSLAAAGRKMSHEDHVLHLCAGLGTEYDSAVSV

Query:  ITEKNETPPLQKVYSLLFAQKNRIARNLSINLDGSTPSVNLTTHSSTSKQASSSNSGESSNGNGNNRNNNTRNGR-NGNNRRFWNTNSNEPQCQLCRRFG
        I+ + ++P +Q+V SLL  Q+++      +  + + PSVN+ T   T+++ + S    + N   NN + N R GR NG + R    N N+PQCQ+C + G
Subjt:  ITEKNETPPLQKVYSLLFAQKNRIARNLSINLDGSTPSVNLTTHSSTSKQASSSNSGESSNGNGNNRNNNTRNGR-NGNNRRFWNTNSNEPQCQLCRRFG

Query:  HTVQRCYYRF
        ++  RC++R+
Subjt:  HTVQRCYYRF

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] (e value: 8.5e-56; % identity: 42.26)
Query:  MDSPSTEKSGSDLDNASQNLKIVNPGSKITTVKLDEDNFLLWKLQILTALRGHGLKHHIEEDVEIPSKF-VASDSSSAS-TKIPNPAYEHWVRQDNLITA
        M S S+     + + +S   +I   G+KI+ VKL++D FLLWK QILTAL  + L++ +E + E PSK+ ++++SSSAS T  PNPAY+ W RQD LI++
Subjt:  MDSPSTEKSGSDLDNASQNLKIVNPGSKITTVKLDEDNFLLWKLQILTALRGHGLKHHIEEDVEIPSKF-VASDSSSAS-TKIPNPAYEHWVRQDNLITA

Query:  WLLGSMSNSLLSEMLECDSAREVWRILNNQFSSRNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNIVDSLAAAGRKMSHEDHVLHLCAGLGTEYDSAVSV
        WLLGSMS  +L++ML C SA+E+W  L   FSSR +A+ M  K+KL + KKG++ L++YFLK+   VD+LA+  + +S +DH+L++ AGLG++Y S +SV
Subjt:  WLLGSMSNSLLSEMLECDSAREVWRILNNQFSSRNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNIVDSLAAAGRKMSHEDHVLHLCAGLGTEYDSAVSV

Query:  ITEKNETPPLQKVYSLLFAQKNRIARNLSINLDGSTPSVNLTTHSSTSKQASSSNSGESSNGNGNNRNNNTRNGR-NGNNRRFWNTNSNEPQCQLCRRFG
        I+ + ++P +Q+V SLL  Q+++      +  + + PSVN+ T   T+++ + S    + N   NN + N R GR NG + R    N N+PQCQ+C + G
Subjt:  ITEKNETPPLQKVYSLLFAQKNRIARNLSINLDGSTPSVNLTTHSSTSKQASSSNSGESSNGNGNNRNNNTRNGR-NGNNRRFWNTNSNEPQCQLCRRFG

Query:  HTVQRCYYRF
        ++  RC++R+
Subjt:  HTVQRCYYRF

XP_022136882.1 dr1-associated corepressor homolog isoform X1 [Momordica charantia] (e value: 4.1e-50; % identity: 53.81)
Query:  VRQDNLITAWLLGSMSNSLLSEMLECDSAREVWRILNNQFSSRNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNIVDSLAAAGRKMSHEDHVLHLCAGLG
        ++QD LIT+WL  SM   +L EM+ C++AREVW+IL N ++SRN+ARVM LKSKL + KKGNL L+DYF KVK +VDSLAAAG+K++ EDH++H+  GL 
Subjt:  VRQDNLITAWLLGSMSNSLLSEMLECDSAREVWRILNNQFSSRNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNIVDSLAAAGRKMSHEDHVLHLCAGLG

Query:  TEYDSAVSVITEKNETPPLQKVYSLLFAQKNRIARNLSINLDGSTPSVNLTTHSSTSKQASSSNSGESSNG-NGNNRNNNTRNGRNGNNRRFWNTNSNEP
        +E++S VSVI+ + +T  LQ+VYSLL + + R  RN SIN DG+ PSVNLT      +Q  +SNS +S +G     +NN ++N  N N RR WN+N N P
Subjt:  TEYDSAVSVITEKNETPPLQKVYSLLFAQKNRIARNLSINLDGSTPSVNLTTHSSTSKQASSSNSGESSNG-NGNNRNNNTRNGRNGNNRRFWNTNSNEP

Query:  QCQLCRRFGHTVQRCYYRFERWF
        QCQ+  +FGHT  RCY RFE+ F
Subjt:  QCQLCRRFGHTVQRCYYRFERWF

XP_022136883.1 dr1-associated corepressor homolog isoform X2 [Momordica charantia] (e value: 4.1e-50; % identity: 53.81)
Query:  VRQDNLITAWLLGSMSNSLLSEMLECDSAREVWRILNNQFSSRNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNIVDSLAAAGRKMSHEDHVLHLCAGLG
        ++QD LIT+WL  SM   +L EM+ C++AREVW+IL N ++SRN+ARVM LKSKL + KKGNL L+DYF KVK +VDSLAAAG+K++ EDH++H+  GL 
Subjt:  VRQDNLITAWLLGSMSNSLLSEMLECDSAREVWRILNNQFSSRNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNIVDSLAAAGRKMSHEDHVLHLCAGLG

Query:  TEYDSAVSVITEKNETPPLQKVYSLLFAQKNRIARNLSINLDGSTPSVNLTTHSSTSKQASSSNSGESSNG-NGNNRNNNTRNGRNGNNRRFWNTNSNEP
        +E++S VSVI+ + +T  LQ+VYSLL + + R  RN SIN DG+ PSVNLT      +Q  +SNS +S +G     +NN ++N  N N RR WN+N N P
Subjt:  TEYDSAVSVITEKNETPPLQKVYSLLFAQKNRIARNLSINLDGSTPSVNLTTHSSTSKQASSSNSGESSNG-NGNNRNNNTRNGRNGNNRRFWNTNSNEP

Query:  QCQLCRRFGHTVQRCYYRFERWF
        QCQ+  +FGHT  RCY RFE+ F
Subjt:  QCQLCRRFGHTVQRCYYRFERWF

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia] (e value: 9.1e-74; % identity: 52.94)
Query:  SDLDNASQNLKIVNPGSKITTVKLDEDNFLLWKLQILTALRGHGLKHHIEEDVEIPSKFV--ASDSSSASTKIPNPAYEHWVRQDNLITAWLLGSMSNSL
        SD     Q  K +NPGSK++ V+L++DN LLWK QI TAL+G+GL+ +I+ + + P++FV    D SS+S+   NPAY  W++QD LI+AWLLGSM+  +
Subjt:  SDLDNASQNLKIVNPGSKITTVKLDEDNFLLWKLQILTALRGHGLKHHIEEDVEIPSKFV--ASDSSSASTKIPNPAYEHWVRQDNLITAWLLGSMSNSL

Query:  LSEMLECDSAREVWRILNNQFSSRNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNIVDSLAAAGRKMSHEDHVLHLCAGLGTEYDSAVSVITEKNETPPL
        LS+ML+C SARE+W +L   F+SR +ARVM LK KL + KKGNL L+DYFLK+KN+VDSLA AG+K+S EDH++H+ AGLG E+D+ +SVIT +N    L
Subjt:  LSEMLECDSAREVWRILNNQFSSRNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNIVDSLAAAGRKMSHEDHVLHLCAGLGTEYDSAVSVITEKNETPPL

Query:  QKVYSLLFAQKNRIARNLSINLDGSTPSVNLTTHSSTSK---QASSSNSGESSNGNGNNRNNNTRNGRNGNNRRFWNTNSNEPQCQLCRRFGHTVQRCYY
        Q+V SLL  Q+ R  RNL IN DGS PSVNLT + S+ K     S   +   SN +   R  N R+    +NRR W T +N+PQCQ+C RFGHT  RCY 
Subjt:  QKVYSLLFAQKNRIARNLSINLDGSTPSVNLTTHSSTSK---QASSSNSGESSNGNGNNRNNNTRNGRNGNNRRFWNTNSNEPQCQLCRRFGHTVQRCYY

Query:  RFERWF
        RFER F
Subjt:  RFERWF

TrEMBL top hits (e value, % identity, alignment)
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 4.1e-56; % identity: 42.26)
Query:  MDSPSTEKSGSDLDNASQNLKIVNPGSKITTVKLDEDNFLLWKLQILTALRGHGLKHHIEEDVEIPSKF-VASDSSSAS-TKIPNPAYEHWVRQDNLITA
        M S S+     + + +S   +I   G+KI+ VKL++D FLLWK QILTAL  + L++ +E + E PSK+ ++++SSSAS T  PNPAY+ W RQD LI++
Subjt:  MDSPSTEKSGSDLDNASQNLKIVNPGSKITTVKLDEDNFLLWKLQILTALRGHGLKHHIEEDVEIPSKF-VASDSSSAS-TKIPNPAYEHWVRQDNLITA

Query:  WLLGSMSNSLLSEMLECDSAREVWRILNNQFSSRNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNIVDSLAAAGRKMSHEDHVLHLCAGLGTEYDSAVSV
        WLLGSMS  +L++ML C SA+E+W  L   FSSR +A+ M  K+KL + KKG++ L++YFLK+   VD+LA+  + +S +DH+L++ AGLG++Y S +SV
Subjt:  WLLGSMSNSLLSEMLECDSAREVWRILNNQFSSRNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNIVDSLAAAGRKMSHEDHVLHLCAGLGTEYDSAVSV

Query:  ITEKNETPPLQKVYSLLFAQKNRIARNLSINLDGSTPSVNLTTHSSTSKQASSSNSGESSNGNGNNRNNNTRNGR-NGNNRRFWNTNSNEPQCQLCRRFG
        I+ + ++P +Q+V SLL  Q+++      +  + + PSVN+ T   T+++ + S    + N   NN + N R GR NG + R    N N+PQCQ+C + G
Subjt:  ITEKNETPPLQKVYSLLFAQKNRIARNLSINLDGSTPSVNLTTHSSTSKQASSSNSGESSNGNGNNRNNNTRNGR-NGNNRRFWNTNSNEPQCQLCRRFG

Query:  HTVQRCYYRF
        ++  RC++R+
Subjt:  HTVQRCYYRF

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 4.1e-56; % identity: 42.26)
Query:  MDSPSTEKSGSDLDNASQNLKIVNPGSKITTVKLDEDNFLLWKLQILTALRGHGLKHHIEEDVEIPSKF-VASDSSSAS-TKIPNPAYEHWVRQDNLITA
        M S S+     + + +S   +I   G+KI+ VKL++D FLLWK QILTAL  + L++ +E + E PSK+ ++++SSSAS T  PNPAY+ W RQD LI++
Subjt:  MDSPSTEKSGSDLDNASQNLKIVNPGSKITTVKLDEDNFLLWKLQILTALRGHGLKHHIEEDVEIPSKF-VASDSSSAS-TKIPNPAYEHWVRQDNLITA

Query:  WLLGSMSNSLLSEMLECDSAREVWRILNNQFSSRNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNIVDSLAAAGRKMSHEDHVLHLCAGLGTEYDSAVSV
        WLLGSMS  +L++ML C SA+E+W  L   FSSR +A+ M  K+KL + KKG++ L++YFLK+   VD+LA+  + +S +DH+L++ AGLG++Y S +SV
Subjt:  WLLGSMSNSLLSEMLECDSAREVWRILNNQFSSRNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNIVDSLAAAGRKMSHEDHVLHLCAGLGTEYDSAVSV

Query:  ITEKNETPPLQKVYSLLFAQKNRIARNLSINLDGSTPSVNLTTHSSTSKQASSSNSGESSNGNGNNRNNNTRNGR-NGNNRRFWNTNSNEPQCQLCRRFG
        I+ + ++P +Q+V SLL  Q+++      +  + + PSVN+ T   T+++ + S    + N   NN + N R GR NG + R    N N+PQCQ+C + G
Subjt:  ITEKNETPPLQKVYSLLFAQKNRIARNLSINLDGSTPSVNLTTHSSTSKQASSSNSGESSNGNGNNRNNNTRNGR-NGNNRRFWNTNSNEPQCQLCRRFG

Query:  HTVQRCYYRF
        ++  RC++R+
Subjt:  HTVQRCYYRF

A0A6J1C6N9 dr1-associated corepressor homolog isoform X1 (e value: 2.0e-50; % identity: 53.81)
Query:  VRQDNLITAWLLGSMSNSLLSEMLECDSAREVWRILNNQFSSRNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNIVDSLAAAGRKMSHEDHVLHLCAGLG
        ++QD LIT+WL  SM   +L EM+ C++AREVW+IL N ++SRN+ARVM LKSKL + KKGNL L+DYF KVK +VDSLAAAG+K++ EDH++H+  GL 
Subjt:  VRQDNLITAWLLGSMSNSLLSEMLECDSAREVWRILNNQFSSRNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNIVDSLAAAGRKMSHEDHVLHLCAGLG

Query:  TEYDSAVSVITEKNETPPLQKVYSLLFAQKNRIARNLSINLDGSTPSVNLTTHSSTSKQASSSNSGESSNG-NGNNRNNNTRNGRNGNNRRFWNTNSNEP
        +E++S VSVI+ + +T  LQ+VYSLL + + R  RN SIN DG+ PSVNLT      +Q  +SNS +S +G     +NN ++N  N N RR WN+N N P
Subjt:  TEYDSAVSVITEKNETPPLQKVYSLLFAQKNRIARNLSINLDGSTPSVNLTTHSSTSKQASSSNSGESSNG-NGNNRNNNTRNGRNGNNRRFWNTNSNEP

Query:  QCQLCRRFGHTVQRCYYRFERWF
        QCQ+  +FGHT  RCY RFE+ F
Subjt:  QCQLCRRFGHTVQRCYYRFERWF

A0A6J1C8R2 dr1-associated corepressor homolog isoform X2 (e value: 2.0e-50; % identity: 53.81)
Query:  VRQDNLITAWLLGSMSNSLLSEMLECDSAREVWRILNNQFSSRNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNIVDSLAAAGRKMSHEDHVLHLCAGLG
        ++QD LIT+WL  SM   +L EM+ C++AREVW+IL N ++SRN+ARVM LKSKL + KKGNL L+DYF KVK +VDSLAAAG+K++ EDH++H+  GL 
Subjt:  VRQDNLITAWLLGSMSNSLLSEMLECDSAREVWRILNNQFSSRNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNIVDSLAAAGRKMSHEDHVLHLCAGLG

Query:  TEYDSAVSVITEKNETPPLQKVYSLLFAQKNRIARNLSINLDGSTPSVNLTTHSSTSKQASSSNSGESSNG-NGNNRNNNTRNGRNGNNRRFWNTNSNEP
        +E++S VSVI+ + +T  LQ+VYSLL + + R  RN SIN DG+ PSVNLT      +Q  +SNS +S +G     +NN ++N  N N RR WN+N N P
Subjt:  TEYDSAVSVITEKNETPPLQKVYSLLFAQKNRIARNLSINLDGSTPSVNLTTHSSTSKQASSSNSGESSNG-NGNNRNNNTRNGRNGNNRRFWNTNSNEP

Query:  QCQLCRRFGHTVQRCYYRFERWF
        QCQ+  +FGHT  RCY RFE+ F
Subjt:  QCQLCRRFGHTVQRCYYRFERWF

A0A6J1DLT9 uncharacterized protein LOC111021757 (e value: 4.4e-74; % identity: 52.94)
Query:  SDLDNASQNLKIVNPGSKITTVKLDEDNFLLWKLQILTALRGHGLKHHIEEDVEIPSKFV--ASDSSSASTKIPNPAYEHWVRQDNLITAWLLGSMSNSL
        SD     Q  K +NPGSK++ V+L++DN LLWK QI TAL+G+GL+ +I+ + + P++FV    D SS+S+   NPAY  W++QD LI+AWLLGSM+  +
Subjt:  SDLDNASQNLKIVNPGSKITTVKLDEDNFLLWKLQILTALRGHGLKHHIEEDVEIPSKFV--ASDSSSASTKIPNPAYEHWVRQDNLITAWLLGSMSNSL

Query:  LSEMLECDSAREVWRILNNQFSSRNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNIVDSLAAAGRKMSHEDHVLHLCAGLGTEYDSAVSVITEKNETPPL
        LS+ML+C SARE+W +L   F+SR +ARVM LK KL + KKGNL L+DYFLK+KN+VDSLA AG+K+S EDH++H+ AGLG E+D+ +SVIT +N    L
Subjt:  LSEMLECDSAREVWRILNNQFSSRNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNIVDSLAAAGRKMSHEDHVLHLCAGLGTEYDSAVSVITEKNETPPL

Query:  QKVYSLLFAQKNRIARNLSINLDGSTPSVNLTTHSSTSK---QASSSNSGESSNGNGNNRNNNTRNGRNGNNRRFWNTNSNEPQCQLCRRFGHTVQRCYY
        Q+V SLL  Q+ R  RNL IN DGS PSVNLT + S+ K     S   +   SN +   R  N R+    +NRR W T +N+PQCQ+C RFGHT  RCY 
Subjt:  QKVYSLLFAQKNRIARNLSINLDGSTPSVNLTTHSSTSK---QASSSNSGESSNGNGNNRNNNTRNGRNGNNRRFWNTNSNEPQCQLCRRFGHTVQRCYY

Query:  RFERWF
        RFER F
Subjt:  RFERWF

SwissProt top hits (e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 9.3e-21; % identity: 26.6)
Query:  NLKIVNPGSKITTVKLDEDNFLLWKLQILTALRGHGLKHHIEEDVEIPSKFVASDSSSASTKIPNPAYEHWVRQDNLITAWLLGSMSNSLLSEMLECDSA
        N  I+N      T KL   N+L+W  Q+     G+ L   ++    +P   + +D++       NP Y  W RQD LI + +LG++S S+   +    +A
Subjt:  NLKIVNPGSKITTVKLDEDNFLLWKLQILTALRGHGLKHHIEEDVEIPSKFVASDSSSASTKIPNPAYEHWVRQDNLITAWLLGSMSNSLLSEMLECDSA

Query:  REVWRILNNQFSSRNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNIVDSLAAAGRKMSHEDHVLHLCAGLGTEYDSAVSVITEKNETPPLQKVYSLLFAQ
         ++W  L   +++ +   V  L+++L    KG   ++DY   +    D LA  G+ M H++ V  +   L  EY   +  I  K+  P L +++  L   
Subjt:  REVWRILNNQFSSRNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNIVDSLAAAGRKMSHEDHVLHLCAGLGTEYDSAVSVITEKNETPPLQKVYSLLFAQ

Query:  KNRIARNLSINLDGSTPSVNLTTHSSTSKQASSSNSGESSNGNGNNRNNNTRNGRNGNNRRFW---------NTNSNEP---QCQLCRRFGHTVQRC
        +++I   L+++        + T    T+   S  N+  ++N N  NRNN   N  N NN + W         N N ++P   +CQ+C   GH+ +RC
Subjt:  KNRIARNLSINLDGSTPSVNLTTHSSTSKQASSSNSGESSNGNGNNRNNNTRNGRNGNNRRFW---------NTNSNEP---QCQLCRRFGHTVQRC

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 1.7e-14; % identity: 25.26)
Query:  NLKIVNPGSKITTVKLDEDNFLLWKLQILTALRGHGLKHHIEEDVEIPSKFVASDSSSASTKIP--NPAYEHWVRQDNLITAWLLGSMSNSLLSEMLECD
        N  I+N      T KL   N+L+W  Q+     G+ L   ++    +P   + +D+      +P  NP Y  W RQD LI + +LG++S S+   +    
Subjt:  NLKIVNPGSKITTVKLDEDNFLLWKLQILTALRGHGLKHHIEEDVEIPSKFVASDSSSASTKIP--NPAYEHWVRQDNLITAWLLGSMSNSLLSEMLECD

Query:  SAREVWRILNNQFSSRNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNIVDSLAAAGRKMSHEDHVLHLCAGLGTEYDSAVSVITEKNETPPLQKVYSLLF
        +A ++W  L   +++ +   V  L+                F+      D LA  G+ M H++ V  +   L  +Y   +  I  K+  P L +++  L 
Subjt:  SAREVWRILNNQFSSRNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNIVDSLAAAGRKMSHEDHVLHLCAGLGTEYDSAVSVITEKNETPPLQKVYSLLF

Query:  AQKNRIARNLSINLDGSTP-SVNLTTHSSTSKQASSSNSGESSN-GNGNNRNNNTRNGRNGNNRRFWNTNSNEPQCQLCRRFGHTVQRC
           NR ++ L++N     P + N+ TH +T+   + +N G++ N  N NNR+N+ +   +G+            +CQ+C   GH+ +RC
Subjt:  AQKNRIARNLSINLDGSTP-SVNLTTHSSTSKQASSSNSGESSN-GNGNNRNNNTRNGRNGNNRRFWNTNSNEPQCQLCRRFGHTVQRC

Arabidopsis top hits (e value, % identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). (e value: 2.2e-09; % identity: 23.53)
Query:  ITTVKLDEDNFLLWKLQILTALRGHGLKHHIEEDVEIPSKFVASDSSSASTKIPNPAYEHWVRQDNLITAWLLGSMSNSLLSEMLECDSAREVWRILNNQ
        I  +  DEDN++ WK++  + LR             +  KF   D +       +P Y+ W + + ++  WL+ SM++ LL  ++  ++A ++W  L   
Subjt:  ITTVKLDEDNFLLWKLQILTALRGHGLKHHIEEDVEIPSKFVASDSSSASTKIPNPAYEHWVRQDNLITAWLLGSMSNSLLSEMLECDSAREVWRILNNQ

Query:  FSSRNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNI
        F      ++  L+ +L + ++G   +E+YF K+  +
Subjt:  FSSRNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNI

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) (e value: 1.5e-10; % identity: 22.27)
Query:  LDEDNFLLWKLQILTALRGHGLKHHIEEDVEIPSKFVASDSSSASTKIPNPAYE-HWVRQDNLITAWLLGSMS-NSLLSEMLECDSAREVWRILNNQFSS
        ++E N+  W+   LT      +  HI+                  T +P  A + +W ++D ++   L G+++        +   ++R++W  + NQF +
Subjt:  LDEDNFLLWKLQILTALRGHGLKHHIEEDVEIPSKFVASDSSSASTKIPNPAYE-HWVRQDNLITAWLLGSMS-NSLLSEMLECDSAREVWRILNNQFSS

Query:  RNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNIVDSLAAAGRKMSHEDHVLHLCAGLGTEYDSAVSVITEKNETPPLQKVYSLLFAQKNRIARNLSIN--
           AR + L S+L +   G++++ DY+ K+K + DSL      ++  + V+++  GL  ++D+ ++VI  +   P      ++L  +++R+ R +  N  
Subjt:  RNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNIVDSLAAAGRKMSHEDHVLHLCAGLGTEYDSAVSVITEKNETPPLQKVYSLLFAQKNRIARNLSIN--

Query:  -LDGSTPSVNLTTHSSTSKQASSSNSGESSNGNGNNRNNNTRNGRNG
         +D S+ S  L    +        + G      G  R NN   GR G
Subjt:  -LDGSTPSVNLTTHSSTSKQASSSNSGESSNGNGNNRNNNTRNGRNG

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) (e value: 8.1e-12; % identity: 24.81)
Query:  TVKLDEDNFLLWKLQILTALRGHGLKHHIEEDVEIPSKFVASDSSSASTKIPNPAYEHWVRQDNLITAWLLGSMSNSLLSEMLECD-SAREVWRILNNQF
        T+ L++ N+ +W+    T     G+  HI             D SS  T +     + W  +D L+  W+ G++++SLL  +++   +AR++W  L N F
Subjt:  TVKLDEDNFLLWKLQILTALRGHGLKHHIEEDVEIPSKFVASDSSSASTKIPNPAYEHWVRQDNLITAWLLGSMSNSLLSEMLECD-SAREVWRILNNQF

Query:  SSRNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNIVDSLAAAGRKMSHEDHVLHLCAGLGTEYDSAVSVITEKNETPPLQKVYSLLFAQKNRIAR----N
             AR +  +++L +T   +L + +Y  K+K++ D L      +S    V+HL  GL  +YD  ++VI  K+  P   +  S+L  +++R++     +
Subjt:  SSRNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNIVDSLAAAGRKMSHEDHVLHLCAGLGTEYDSAVSVITEKNETPPLQKVYSLLFAQKNRIAR----N

Query:  LSINLDGSTPSVNLTTHSSTSKQASSSNSGESSNGNGNNRNNNTRNGRNG---NNRRFWNTN
        LS     S  +V  T      +     ++  S+ G G ++  N   G +    NN   W  N
Subjt:  LSINLDGSTPSVNLTTHSSTSKQASSSNSGESSNGNGNNRNNNTRNGRNG---NNRRFWNTN


Sequences
CDS sequence
ATGGACTCCCCCTCAACTGAGAAGAGTGGCTCAGACCTTGATAATGCTTCTCAGAATCTAAAGATTGTCAATCCAGGGAGCAAGATTACTACTGTAAAGCTTGATGAAGA
TAACTTCCTTCTATGGAAATTGCAAATTCTTACAGCCCTAAGGGGACATGGATTGAAGCACCACATTGAAGAAGATGTAGAAATTCCCTCCAAATTCGTTGCCTCCGACA
GCTCTTCGGCATCAACGAAAATCCCTAATCCTGCGTATGAACACTGGGTACGTCAAGATAATCTCATCACCGCCTGGCTCTTGGGATCAATGTCGAATTCGTTGCTGTCT
GAAATGTTGGAGTGCGACTCTGCCCGTGAGGTATGGAGAATACTTAACAATCAATTTTCATCAAGAAATGTGGCGAGAGTTATGGATCTGAAATCCAAGCTGGGTTCAAC
CAAGAAAGGTAATCTTAAATTAGAAGATTACTTTCTGAAAGTAAAGAATATAGTTGACTCCTTGGCGGCAGCTGGAAGAAAGATGTCGCATGAAGACCATGTCCTGCATC
TCTGTGCTGGGTTGGGGACTGAGTATGACTCTGCTGTTTCGGTCATAACGGAAAAGAATGAGACACCCCCACTTCAAAAGGTATATTCTCTTCTTTTCGCTCAAAAGAAT
AGAATTGCAAGAAATCTTTCTATTAATCTAGATGGATCGACTCCTTCTGTTAATCTTACCACCCATTCCTCCACTTCGAAGCAAGCTTCCTCTTCGAACTCCGGTGAGTC
TTCAAATGGAAATGGCAACAATCGAAACAATAACACTAGAAATGGACGAAATGGAAACAATCGAAGATTCTGGAATACAAACAGCAATGAGCCGCAATGTCAACTCTGTC
GTCGCTTTGGTCACACAGTTCAACGGTGTTACTACCGGTTTGAGAGATGGTTCCAGGTCCAAACATGA
mRNA sequence
ATGGACTCCCCCTCAACTGAGAAGAGTGGCTCAGACCTTGATAATGCTTCTCAGAATCTAAAGATTGTCAATCCAGGGAGCAAGATTACTACTGTAAAGCTTGATGAAGA
TAACTTCCTTCTATGGAAATTGCAAATTCTTACAGCCCTAAGGGGACATGGATTGAAGCACCACATTGAAGAAGATGTAGAAATTCCCTCCAAATTCGTTGCCTCCGACA
GCTCTTCGGCATCAACGAAAATCCCTAATCCTGCGTATGAACACTGGGTACGTCAAGATAATCTCATCACCGCCTGGCTCTTGGGATCAATGTCGAATTCGTTGCTGTCT
GAAATGTTGGAGTGCGACTCTGCCCGTGAGGTATGGAGAATACTTAACAATCAATTTTCATCAAGAAATGTGGCGAGAGTTATGGATCTGAAATCCAAGCTGGGTTCAAC
CAAGAAAGGTAATCTTAAATTAGAAGATTACTTTCTGAAAGTAAAGAATATAGTTGACTCCTTGGCGGCAGCTGGAAGAAAGATGTCGCATGAAGACCATGTCCTGCATC
TCTGTGCTGGGTTGGGGACTGAGTATGACTCTGCTGTTTCGGTCATAACGGAAAAGAATGAGACACCCCCACTTCAAAAGGTATATTCTCTTCTTTTCGCTCAAAAGAAT
AGAATTGCAAGAAATCTTTCTATTAATCTAGATGGATCGACTCCTTCTGTTAATCTTACCACCCATTCCTCCACTTCGAAGCAAGCTTCCTCTTCGAACTCCGGTGAGTC
TTCAAATGGAAATGGCAACAATCGAAACAATAACACTAGAAATGGACGAAATGGAAACAATCGAAGATTCTGGAATACAAACAGCAATGAGCCGCAATGTCAACTCTGTC
GTCGCTTTGGTCACACAGTTCAACGGTGTTACTACCGGTTTGAGAGATGGTTCCAGGTCCAAACATGA
Protein sequence
MDSPSTEKSGSDLDNASQNLKIVNPGSKITTVKLDEDNFLLWKLQILTALRGHGLKHHIEEDVEIPSKFVASDSSSASTKIPNPAYEHWVRQDNLITAWLLGSMSNSLLS
EMLECDSAREVWRILNNQFSSRNVARVMDLKSKLGSTKKGNLKLEDYFLKVKNIVDSLAAAGRKMSHEDHVLHLCAGLGTEYDSAVSVITEKNETPPLQKVYSLLFAQKN
RIARNLSINLDGSTPSVNLTTHSSTSKQASSSNSGESSNGNGNNRNNNTRNGRNGNNRRFWNTNSNEPQCQLCRRFGHTVQRCYYRFERWFQVQT
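The relationship between the CDS and the protein sequence above can be checked by translating the nucleotides codon by codon. The following is a minimal sketch, not the database's own pipeline: the `translate` helper is hypothetical, and it assumes the standard nuclear genetic code (NCBI translation table 1), which is an assumption about how this gene model was translated.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
# ASSUMPTION: the record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the CDS listed for Lag0027200.
print(translate("ATGGACTCCCCCTCAACTGAGAAGAGTGGC"))  # MDSPSTEKSG
```

The ten residues printed match the start of the protein sequence (MDSPSTEKSG...). Passing the full CDS with line breaks removed would allow the same comparison over the whole sequence.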