CuGenDBv2

Lag0007897 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0007897
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr9:7538734..7539636
RNA-Seq Expression: Lag0007897
Synteny: Lag0007897
Gene Ontology terms: NA
InterPro domains: NA
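The genome location can be turned into a feature length with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive; the page does not state the convention, so treat that as an assumption. The helper names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into its three parts."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr9:7538734..7539636")
length = span_length(start, end)
print(chrom, start, end, length)
```

Under that assumption the span is 903 bp, which would correspond to 300 codons plus a stop codon.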


Homology
GenBank top hits | e value | % identity
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | 1.3e-77 | 56.54
Query:  MEDTAVSS--LQTFSPGNKISIVKLTDDNFLLWKFQILMALEGYDLENYL--EEDPPSKTLVSVTEGTTAAEQTRRETPNPAYSVWKKQDRIISSWLVGS
        +E+T  SS   Q F  GNKIS+VKL DD FLLWKFQIL ALE YDLEN+L  E +PPSK L+S    T ++  +   TPNPAY VWK+QDR+ISSWL+GS
Subjt:  MEDTAVSS--LQTFSPGNKISIVKLTDDNFLLWKFQILMALEGYDLENYL--EEDPPSKTLVSVTEGTTAAEQTRRETPNPAYSVWKKQDRIISSWLVGS

Query:  MSEDILYQMINCTSAKEIWDSLHQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYTDALAAVGKPVDIEDHILFILSGLGSEYESMVSVISAKA
        MSE+IL QM++C SAKEIW++L  IF++R LAQ M+ K KL  I+KG M LKEYF KI Q  DALA++ KPV  +DHIL+IL+GLGS+Y+SM+SVISA+ 
Subjt:  MSEDILYQMINCTSAKEIWDSLHQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYTDALAAVGKPVDIEDHILFILSGLGSEYESMVSVISAKA

Query:  GPQSVHEVMSLLLTQENRIESKLTSTKGSLPSANLMVQT--KPPETDVTKNVSNNFTSNANSGNRGRGGVRGGYNTNRGGRSWNNRNRPQCQLCGKFNHT
           SV EVMSLLLTQE++ ESKL S + +LPS N++ QT  K  E+ +  N  NN+ +N +   RG    RG   +NRG R   NRN+PQCQ+C K  ++
Subjt:  GPQSVHEVMSLLLTQENRIESKLTSTKGSLPSANLMVQT--KPPETDVTKNVSNNFTSNANSGNRGRGGVRGGYNTNRGGRSWNNRNRPQCQLCGKFNHT

Query:  APKCFF
        A +CFF
Subjt:  APKCFF

KAA0053143.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa] | 7.3e-65 | 56.13
Query:  MEDTAVSSLQTFSPGNKISIVKLTDDNFLLWKFQILMALEGYDLENYLEE--DPPSKTLVSVTEGTTAAEQTRRETPNPAYSVWKKQDRIISSWLVGSMS
        +E+T VS    F  GNKIS+VKL+DDNFLLWKFQIL ALE YDLEN+ E   +PPSK L S    +T+A +    TPNP Y VWK+ +R+IS WL+GSMS
Subjt:  MEDTAVSSLQTFSPGNKISIVKLTDDNFLLWKFQILMALEGYDLENYLEE--DPPSKTLVSVTEGTTAAEQTRRETPNPAYSVWKKQDRIISSWLVGSMS

Query:  EDILYQMINCTSAKEIWDSLHQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYTDALAAVGKPVDIEDHILFILSGLGSEYESMVSVISAKAGP
        E+IL QM++C SAKEIW +L  IF++R LAQ M+ K KL  I+KG MSLKEYF KIQQ  DALA++ KPV  +DHIL+IL GLG +Y+SM+S+ISA+   
Subjt:  EDILYQMINCTSAKEIWDSLHQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYTDALAAVGKPVDIEDHILFILSGLGSEYESMVSVISAKAGP

Query:  QSVHEVMSLLLTQENRIESKLTSTKGSLPSANLMVQTKPPETD-VTKNVSNNF
         S+ EVMSLLLTQE++ ESKL S + +LP   ++ QT     +   +N  NN+
Subjt:  QSVHEVMSLLLTQENRIESKLTSTKGSLPSANLMVQTKPPETD-VTKNVSNNF

KAA0067213.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa] | 1.5e-57 | 60.98
Query:  MEDTAVSS--LQTFSPGNKISIVKLTDDNFLLWKFQILMALEGYDLENYL--EEDPPSKTLVSVTEGTTAAEQTRRETPNPAYSVWKKQDRIISSWLVGS
        +E+T  SS   Q F   NKIS+VKL+DDNFLLWKFQIL ALE YDLEN+L  E +PPSK L+S   G+++A  TR  TPNP Y VWK+QDR+ISSWL+GS
Subjt:  MEDTAVSS--LQTFSPGNKISIVKLTDDNFLLWKFQILMALEGYDLENYL--EEDPPSKTLVSVTEGTTAAEQTRRETPNPAYSVWKKQDRIISSWLVGS

Query:  MSEDILYQMINCTSAKEIWDSLHQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYTDALAAVGKPVDIEDHILFILSGLGSEYESMVSVISAKA
        MSE+IL QM++C SAKEIW +L  IF++R LAQ MK K KL  I+K  M LKEYF KIQ   DALA++ KPV  +DHIL+IL+GLGS+Y+SM+SVI  + 
Subjt:  MSEDILYQMINCTSAKEIWDSLHQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYTDALAAVGKPVDIEDHILFILSGLGSEYESMVSVISAKA

Query:  GPQSV
           SV
Subjt:  GPQSV

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | 1.3e-77 | 56.54
Query:  MEDTAVSS--LQTFSPGNKISIVKLTDDNFLLWKFQILMALEGYDLENYL--EEDPPSKTLVSVTEGTTAAEQTRRETPNPAYSVWKKQDRIISSWLVGS
        +E+T  SS   Q F  GNKIS+VKL DD FLLWKFQIL ALE YDLEN+L  E +PPSK L+S    T ++  +   TPNPAY VWK+QDR+ISSWL+GS
Subjt:  MEDTAVSS--LQTFSPGNKISIVKLTDDNFLLWKFQILMALEGYDLENYL--EEDPPSKTLVSVTEGTTAAEQTRRETPNPAYSVWKKQDRIISSWLVGS

Query:  MSEDILYQMINCTSAKEIWDSLHQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYTDALAAVGKPVDIEDHILFILSGLGSEYESMVSVISAKA
        MSE+IL QM++C SAKEIW++L  IF++R LAQ M+ K KL  I+KG M LKEYF KI Q  DALA++ KPV  +DHIL+IL+GLGS+Y+SM+SVISA+ 
Subjt:  MSEDILYQMINCTSAKEIWDSLHQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYTDALAAVGKPVDIEDHILFILSGLGSEYESMVSVISAKA

Query:  GPQSVHEVMSLLLTQENRIESKLTSTKGSLPSANLMVQT--KPPETDVTKNVSNNFTSNANSGNRGRGGVRGGYNTNRGGRSWNNRNRPQCQLCGKFNHT
           SV EVMSLLLTQE++ ESKL S + +LPS N++ QT  K  E+ +  N  NN+ +N +   RG    RG   +NRG R   NRN+PQCQ+C K  ++
Subjt:  GPQSVHEVMSLLLTQENRIESKLTSTKGSLPSANLMVQT--KPPETDVTKNVSNNFTSNANSGNRGRGGVRGGYNTNRGGRSWNNRNRPQCQLCGKFNHT

Query:  APKCFF
        A +CFF
Subjt:  APKCFF

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia] | 6.9e-63 | 45.61
Query:  VSSLQTFSPGNKISIVKLTDDNFLLWKFQILMALEGYDLENYLE--EDPPSKTLVSVTEGTTAAEQTRRETPNPAYSVWKKQDRIISSWLVGSMSEDILY
        + + +T +PG+K+SIV+L DDN LLWKFQI  AL+G  LE+Y++  ED P++  V  TE  +++   ++   NPAY  W KQD++IS+WL+GSM+EDIL 
Subjt:  VSSLQTFSPGNKISIVKLTDDNFLLWKFQILMALEGYDLENYLE--EDPPSKTLVSVTEGTTAAEQTRRETPNPAYSVWKKQDRIISSWLVGSMSEDILY

Query:  QMINCTSAKEIWDSLHQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYTDALAAVGKPVDIEDHILFILSGLGSEYESMVSVISAKAGPQSVHE
        QM++C SA+EIW  L  +F +R LA++M++K KL+  +KG +SLK+YF KI+   D+LA  GK +  EDHI+ IL+GLG E+++++SVI+A+  PQ++ E
Subjt:  QMINCTSAKEIWDSLHQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYTDALAAVGKPVDIEDHILFILSGLGSEYESMVSVISAKAGPQSVHE

Query:  VMSLLLTQENRIESKLTSTKGSLPSANLMVQTKPPETDVTKNVSNNFTSNANSGNRGRGGVRGGYNTNRGGRSWNNRNRPQCQLCGKFNHTAPKCF
        V SLLL QE R E  L ++ GSLPS NL +     ++    N+  +   N +  N  + G RG  N +   R+W   N+PQCQ+CG+F HTA +C+
Subjt:  VMSLLLTQENRIESKLTSTKGSLPSANLMVQTKPPETDVTKNVSNNFTSNANSGNRGRGGVRGGYNTNRGGRSWNNRNRPQCQLCGKFNHTAPKCF

TrEMBL top hits | e value | % identity
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 6.3e-78 | 56.54
Query:  MEDTAVSS--LQTFSPGNKISIVKLTDDNFLLWKFQILMALEGYDLENYL--EEDPPSKTLVSVTEGTTAAEQTRRETPNPAYSVWKKQDRIISSWLVGS
        +E+T  SS   Q F  GNKIS+VKL DD FLLWKFQIL ALE YDLEN+L  E +PPSK L+S    T ++  +   TPNPAY VWK+QDR+ISSWL+GS
Subjt:  MEDTAVSS--LQTFSPGNKISIVKLTDDNFLLWKFQILMALEGYDLENYL--EEDPPSKTLVSVTEGTTAAEQTRRETPNPAYSVWKKQDRIISSWLVGS

Query:  MSEDILYQMINCTSAKEIWDSLHQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYTDALAAVGKPVDIEDHILFILSGLGSEYESMVSVISAKA
        MSE+IL QM++C SAKEIW++L  IF++R LAQ M+ K KL  I+KG M LKEYF KI Q  DALA++ KPV  +DHIL+IL+GLGS+Y+SM+SVISA+ 
Subjt:  MSEDILYQMINCTSAKEIWDSLHQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYTDALAAVGKPVDIEDHILFILSGLGSEYESMVSVISAKA

Query:  GPQSVHEVMSLLLTQENRIESKLTSTKGSLPSANLMVQT--KPPETDVTKNVSNNFTSNANSGNRGRGGVRGGYNTNRGGRSWNNRNRPQCQLCGKFNHT
           SV EVMSLLLTQE++ ESKL S + +LPS N++ QT  K  E+ +  N  NN+ +N +   RG    RG   +NRG R   NRN+PQCQ+C K  ++
Subjt:  GPQSVHEVMSLLLTQENRIESKLTSTKGSLPSANLMVQT--KPPETDVTKNVSNNFTSNANSGNRGRGGVRGGYNTNRGGRSWNNRNRPQCQLCGKFNHT

Query:  APKCFF
        A +CFF
Subjt:  APKCFF

A0A5A7UB21 Keratin, type II cytoskeletal 1-like | 3.6e-65 | 56.13
Query:  MEDTAVSSLQTFSPGNKISIVKLTDDNFLLWKFQILMALEGYDLENYLEE--DPPSKTLVSVTEGTTAAEQTRRETPNPAYSVWKKQDRIISSWLVGSMS
        +E+T VS    F  GNKIS+VKL+DDNFLLWKFQIL ALE YDLEN+ E   +PPSK L S    +T+A +    TPNP Y VWK+ +R+IS WL+GSMS
Subjt:  MEDTAVSSLQTFSPGNKISIVKLTDDNFLLWKFQILMALEGYDLENYLEE--DPPSKTLVSVTEGTTAAEQTRRETPNPAYSVWKKQDRIISSWLVGSMS

Query:  EDILYQMINCTSAKEIWDSLHQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYTDALAAVGKPVDIEDHILFILSGLGSEYESMVSVISAKAGP
        E+IL QM++C SAKEIW +L  IF++R LAQ M+ K KL  I+KG MSLKEYF KIQQ  DALA++ KPV  +DHIL+IL GLG +Y+SM+S+ISA+   
Subjt:  EDILYQMINCTSAKEIWDSLHQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYTDALAAVGKPVDIEDHILFILSGLGSEYESMVSVISAKAGP

Query:  QSVHEVMSLLLTQENRIESKLTSTKGSLPSANLMVQTKPPETD-VTKNVSNNF
         S+ EVMSLLLTQE++ ESKL S + +LP   ++ QT     +   +N  NN+
Subjt:  QSVHEVMSLLLTQENRIESKLTSTKGSLPSANLMVQTKPPETD-VTKNVSNNF

A0A5A7VGJ8 Keratin, type II cytoskeletal 1-like | 7.2e-58 | 60.98
Query:  MEDTAVSS--LQTFSPGNKISIVKLTDDNFLLWKFQILMALEGYDLENYL--EEDPPSKTLVSVTEGTTAAEQTRRETPNPAYSVWKKQDRIISSWLVGS
        +E+T  SS   Q F   NKIS+VKL+DDNFLLWKFQIL ALE YDLEN+L  E +PPSK L+S   G+++A  TR  TPNP Y VWK+QDR+ISSWL+GS
Subjt:  MEDTAVSS--LQTFSPGNKISIVKLTDDNFLLWKFQILMALEGYDLENYL--EEDPPSKTLVSVTEGTTAAEQTRRETPNPAYSVWKKQDRIISSWLVGS

Query:  MSEDILYQMINCTSAKEIWDSLHQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYTDALAAVGKPVDIEDHILFILSGLGSEYESMVSVISAKA
        MSE+IL QM++C SAKEIW +L  IF++R LAQ MK K KL  I+K  M LKEYF KIQ   DALA++ KPV  +DHIL+IL+GLGS+Y+SM+SVI  + 
Subjt:  MSEDILYQMINCTSAKEIWDSLHQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYTDALAAVGKPVDIEDHILFILSGLGSEYESMVSVISAKA

Query:  GPQSV
           SV
Subjt:  GPQSV

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 6.3e-78 | 56.54
Query:  MEDTAVSS--LQTFSPGNKISIVKLTDDNFLLWKFQILMALEGYDLENYL--EEDPPSKTLVSVTEGTTAAEQTRRETPNPAYSVWKKQDRIISSWLVGS
        +E+T  SS   Q F  GNKIS+VKL DD FLLWKFQIL ALE YDLEN+L  E +PPSK L+S    T ++  +   TPNPAY VWK+QDR+ISSWL+GS
Subjt:  MEDTAVSS--LQTFSPGNKISIVKLTDDNFLLWKFQILMALEGYDLENYL--EEDPPSKTLVSVTEGTTAAEQTRRETPNPAYSVWKKQDRIISSWLVGS

Query:  MSEDILYQMINCTSAKEIWDSLHQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYTDALAAVGKPVDIEDHILFILSGLGSEYESMVSVISAKA
        MSE+IL QM++C SAKEIW++L  IF++R LAQ M+ K KL  I+KG M LKEYF KI Q  DALA++ KPV  +DHIL+IL+GLGS+Y+SM+SVISA+ 
Subjt:  MSEDILYQMINCTSAKEIWDSLHQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYTDALAAVGKPVDIEDHILFILSGLGSEYESMVSVISAKA

Query:  GPQSVHEVMSLLLTQENRIESKLTSTKGSLPSANLMVQT--KPPETDVTKNVSNNFTSNANSGNRGRGGVRGGYNTNRGGRSWNNRNRPQCQLCGKFNHT
           SV EVMSLLLTQE++ ESKL S + +LPS N++ QT  K  E+ +  N  NN+ +N +   RG    RG   +NRG R   NRN+PQCQ+C K  ++
Subjt:  GPQSVHEVMSLLLTQENRIESKLTSTKGSLPSANLMVQT--KPPETDVTKNVSNNFTSNANSGNRGRGGVRGGYNTNRGGRSWNNRNRPQCQLCGKFNHT

Query:  APKCFF
        A +CFF
Subjt:  APKCFF

A0A6J1DLT9 uncharacterized protein LOC111021757 | 3.3e-63 | 45.61
Query:  VSSLQTFSPGNKISIVKLTDDNFLLWKFQILMALEGYDLENYLE--EDPPSKTLVSVTEGTTAAEQTRRETPNPAYSVWKKQDRIISSWLVGSMSEDILY
        + + +T +PG+K+SIV+L DDN LLWKFQI  AL+G  LE+Y++  ED P++  V  TE  +++   ++   NPAY  W KQD++IS+WL+GSM+EDIL 
Subjt:  VSSLQTFSPGNKISIVKLTDDNFLLWKFQILMALEGYDLENYLE--EDPPSKTLVSVTEGTTAAEQTRRETPNPAYSVWKKQDRIISSWLVGSMSEDILY

Query:  QMINCTSAKEIWDSLHQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYTDALAAVGKPVDIEDHILFILSGLGSEYESMVSVISAKAGPQSVHE
        QM++C SA+EIW  L  +F +R LA++M++K KL+  +KG +SLK+YF KI+   D+LA  GK +  EDHI+ IL+GLG E+++++SVI+A+  PQ++ E
Subjt:  QMINCTSAKEIWDSLHQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYTDALAAVGKPVDIEDHILFILSGLGSEYESMVSVISAKAGPQSVHE

Query:  VMSLLLTQENRIESKLTSTKGSLPSANLMVQTKPPETDVTKNVSNNFTSNANSGNRGRGGVRGGYNTNRGGRSWNNRNRPQCQLCGKFNHTAPKCF
        V SLLL QE R E  L ++ GSLPS NL +     ++    N+  +   N +  N  + G RG  N +   R+W   N+PQCQ+CG+F HTA +C+
Subjt:  VMSLLLTQENRIESKLTSTKGSLPSANLMVQTKPPETDVTKNVSNNFTSNANSGNRGRGGVRGGYNTNRGGRSWNNRNRPQCQLCGKFNHTAPKCF

SwissProt top hits | e value | % identity
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 6.3e-27 | 27.7
Query:  NKISIVKLTDDNFLLWKFQILMALEGYDLENYLEEDPPSKTLVSVTEGTTAAEQTRRETPNPAYSVWKKQDRIISSWLVGSMSEDILYQMINCTSAKEIW
        N  ++ KLT  N+L+W  Q+    +GY+L  +L+    S T+   T GT AA +      NP Y+ WK+QD++I S ++G++S  +   +   T+A +IW
Subjt:  NKISIVKLTDDNFLLWKFQILMALEGYDLENYLEEDPPSKTLVSVTEGTTAAEQTRRETPNPAYSVWKKQDRIISSWLVGSMSEDILYQMINCTSAKEIW

Query:  DSLHQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYTDALAAVGKPVDIEDHILFILSGLGSEYESMVSVISAKAGPQSVHEVMSLLLTQENRI
        ++L +I+   +   + +++T+L+   KG  ++ +Y   +    D LA +GKP+D ++ +  +L  L  EY+ ++  I+AK  P ++ E+   LL  E++I
Subjt:  DSLHQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYTDALAAVGKPVDIEDHILFILSGLGSEYESMVSVISAKAGPQSVHEVMSLLLTQENRI

Query:  ESKLTSTKGSLPSANLMVQTKPPETDVTKNVSNNFTSNANSGNRGRGGVRGGYNTNRGGRSW----------NNRNRP---QCQLCGKFNHTAPKC
         +  ++T              P   +   + +   T+N N+GNR         N N   + W          NN+++P   +CQ+CG   H+A +C
Subjt:  ESKLTSTKGSLPSANLMVQTKPPETDVTKNVSNNFTSNANSGNRGRGGVRGGYNTNRGGRSW----------NNRNRP---QCQLCGKFNHTAPKC

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 4.4e-20 | 26.01
Query:  NKISIVKLTDDNFLLWKFQILMALEGYDLENYLEEDPPSKTLVSVTEGTTAAEQTRRETPNPAYSVWKKQDRIISSWLVGSMSEDILYQMINCTSAKEIW
        N  ++ KLT  N+L+W  Q+    +GY+L  +L+   P   +   T GT A  +      NP Y+ W++QD++I S ++G++S  +   +   T+A +IW
Subjt:  NKISIVKLTDDNFLLWKFQILMALEGYDLENYLEEDPPSKTLVSVTEGTTAAEQTRRETPNPAYSVWKKQDRIISSWLVGSMSEDILYQMINCTSAKEIW

Query:  DSLHQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYTDALAAVGKPVDIEDHILFILSGLGSEYESMVSVISAKAGPQSVHEVMSLLLTQENRI
        ++L +I+   +   +    T+L+ I +                D LA +GKP+D ++ +  +L  L  +Y+ ++  I+AK  P S+ E+   L+ +E+++
Subjt:  DSLHQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYTDALAAVGKPVDIEDHILFILSGLGSEYESMVSVISAKAGPQSVHEVMSLLLTQENRI

Query:  ESKLTSTKGSLPSANLMVQTKPPETDVTKNVSNNFTSNANSGNRGRGGVRGGYNTNRGGRSW----------NNRNRP---QCQLCGKFNHTAPKC
                 +L SA ++         +T NV  +  +N N     RG  R   N N    SW          N + +P   +CQ+C    H+A +C
Subjt:  ESKLTSTKGSLPSANLMVQTKPPETDVTKNVSNNFTSNANSGNRGRGGVRGGYNTNRGGRSW----------NNRNRP---QCQLCGKFNHTAPKC

Arabidopsis top hits | e value | % identity
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | 2.6e-07 | 23.28
Query:  DLENYLEEDPPSKTLVSVTE--GTTAAEQTRRETPNPAYSVWKKQDRIISSWLVGSMSEDILYQMINCTSAKEIWDSLHQIFTTRNLAQMMKIKTKLQTI
        D +NY+      ++ + VT+  G       + +  +P Y  W++ + ++  WL+ SM++ +L  ++   +A ++W+ L ++F      ++ +++ +L T+
Subjt:  DLENYLEEDPPSKTLVSVTE--GTTAAEQTRRETPNPAYSVWKKQDRIISSWLVGSMSEDILYQMINCTSAKEIWDSLHQIFTTRNLAQMMKIKTKLQTI

Query:  QKGGMSLKEYFSKIQQ
        ++GG S++EYF K+ +
Subjt:  QKGGMSLKEYFSKIQQ

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | 1.9e-10 | 23.7
Query:  WKKQDRIISSWLVGSMS-EDILYQMINCTSAKEIWDSLHQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYTDALAAVGKPVDIEDHILFILSG
        W+K+D I+   L G+++ +      +  +++++IW  +   F     A+ +++ ++L+T   G M + +Y+ K+++  D+L  V  PV   + ++++L+G
Subjt:  WKKQDRIISSWLVGSMS-EDILYQMINCTSAKEIWDSLHQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYTDALAAVGKPVDIEDHILFILSG

Query:  LGSEYESMVSVISAKAGPQSVHEVMSLLLTQENRIESKL----TSTKGSLPSANLMVQTKPPETDVTKNVSNNF----TSNANSGNRGRGGVRGGYNTNR
        L  +++++++VI  +    S  +  ++L  +E+R++  +    T    S  S  L     PP T+  ++  N          N+  RGRGG    YN   
Subjt:  LGSEYESMVSVISAKAGPQSVHEVMSLLLTQENRIESKL----TSTKGSLPSANLMVQTKPPETDVTKNVSNNF----TSNANSGNRGRGGVRGGYNTNR

Query:  GGRSWNNRNRP
           ++N+ NRP
Subjt:  GGRSWNNRNRP

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | 1.5e-15 | 28.77
Query:  TPNP-AYSVWKKQDRIISSWLVGSMSEDILYQMI--NCTSAKEIWDSLHQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYTDALAAVGKPVDI
        TP P     WK++D ++  W+ G++++ +L  +I   CT A+++W SL  +F     A+ ++ + +L+T     +S+ EY  K++  +D L  V  P+  
Subjt:  TPNP-AYSVWKKQDRIISSWLVGSMSEDILYQMI--NCTSAKEIWDSLHQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYTDALAAVGKPVDI

Query:  EDHILFILSGLGSEYESMVSVISAKAGPQSVHEVMSLLLTQENRIESKLTSTKG--SLPSANLMVQTKPPETDVTKNVSNNFTSNANSG-----NRGRGG
           ++ +L+GL  +Y+ +++VI  K+   S  E  S+LL +E+R+ +K  S+    + PS + ++ T P + +      +N  SN   G     NRG G 
Subjt:  EDHILFILSGLGSEYESMVSVISAKAGPQSVHEVMSLLLTQENRIESKLTSTKG--SLPSANLMVQTKPPETDVTKNVSNNFTSNANSG-----NRGRGG

Query:  VRGGYNTNRGGR
          G YN N   R
Subjt:  VRGGYNTNRGGR
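The % identity values can be related to the middle line of each alignment block. In BLAST-style pairwise output, a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. The sketch below counts these for one short block taken from the first GenBank hit (query `APKCFF`, middle line `A +CFF`). The function name `midline_stats` is hypothetical. The reported % identity is computed over a whole alignment, so a single block will not reproduce the table values.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, float]:
    """Return (identical, positive, percent_identity) for one alignment block.

    `query` and `midline` must be the aligned strings with the 'Query:'
    label and its padding already removed, so the columns line up.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A letter on the midline repeats the residue shared by both sequences.
    identical = sum(1 for m in midline if m.isalpha())
    # Positives include identities plus '+' (conservative substitutions).
    positive = identical + midline.count("+")
    percent_identity = 100.0 * identical / len(query)
    return identical, positive, percent_identity


identical, positive, pct = midline_stats("APKCFF", "A +CFF")
print(identical, positive, round(pct, 2))
```

To score a full hit, the same counts would be summed across all of its blocks before dividing by the total alignment length.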


Sequences
CDS sequence
ATGGAAGATACCGCCGTCTCCTCTTTACAAACCTTCAGCCCTGGTAATAAAATATCTATAGTTAAGCTAACTGATGACAATTTCTTGTTATGGAAGTTTCAGATTCTAAT
GGCCCTAGAGGGGTACGATTTGGAGAATTACCTCGAAGAAGATCCACCTTCAAAAACCCTAGTCTCTGTGACTGAGGGCACCACGGCTGCTGAACAAACTCGCCGAGAGA
CTCCGAATCCAGCTTATTCCGTATGGAAGAAACAAGATCGAATCATTTCTTCATGGCTGGTCGGTTCGATGTCAGAAGACATCTTATATCAGATGATTAACTGTACGTCC
GCAAAGGAAATTTGGGACAGTCTTCACCAAATATTCACTACTCGCAACCTTGCACAGATGATGAAAATCAAGACCAAACTCCAAACTATACAAAAGGGAGGTATGTCCTT
AAAAGAATACTTCTCTAAAATTCAACAATATACTGATGCTCTCGCTGCTGTGGGAAAACCTGTGGATATCGAGGATCATATTTTGTTTATTTTATCTGGACTGGGCTCTG
AGTATGAATCCATGGTCTCTGTAATCTCTGCTAAAGCTGGACCTCAATCTGTTCATGAGGTTATGTCCTTGCTATTAACGCAAGAAAATCGTATTGAAAGTAAGCTCACA
TCTACGAAAGGCTCTCTTCCCTCTGCGAATCTGATGGTTCAAACTAAACCCCCTGAGACTGATGTTACAAAAAATGTGTCTAACAATTTTACTTCCAATGCTAATAGTGG
AAACAGGGGAAGAGGTGGAGTCCGAGGAGGTTATAACACCAACAGAGGTGGTCGGTCGTGGAACAATAGGAACAGACCTCAATGTCAACTCTGCGGAAAGTTCAATCACA
CAGCACCAAAATGTTTCTTCTGA
mRNA sequence
ATGGAAGATACCGCCGTCTCCTCTTTACAAACCTTCAGCCCTGGTAATAAAATATCTATAGTTAAGCTAACTGATGACAATTTCTTGTTATGGAAGTTTCAGATTCTAAT
GGCCCTAGAGGGGTACGATTTGGAGAATTACCTCGAAGAAGATCCACCTTCAAAAACCCTAGTCTCTGTGACTGAGGGCACCACGGCTGCTGAACAAACTCGCCGAGAGA
CTCCGAATCCAGCTTATTCCGTATGGAAGAAACAAGATCGAATCATTTCTTCATGGCTGGTCGGTTCGATGTCAGAAGACATCTTATATCAGATGATTAACTGTACGTCC
GCAAAGGAAATTTGGGACAGTCTTCACCAAATATTCACTACTCGCAACCTTGCACAGATGATGAAAATCAAGACCAAACTCCAAACTATACAAAAGGGAGGTATGTCCTT
AAAAGAATACTTCTCTAAAATTCAACAATATACTGATGCTCTCGCTGCTGTGGGAAAACCTGTGGATATCGAGGATCATATTTTGTTTATTTTATCTGGACTGGGCTCTG
AGTATGAATCCATGGTCTCTGTAATCTCTGCTAAAGCTGGACCTCAATCTGTTCATGAGGTTATGTCCTTGCTATTAACGCAAGAAAATCGTATTGAAAGTAAGCTCACA
TCTACGAAAGGCTCTCTTCCCTCTGCGAATCTGATGGTTCAAACTAAACCCCCTGAGACTGATGTTACAAAAAATGTGTCTAACAATTTTACTTCCAATGCTAATAGTGG
AAACAGGGGAAGAGGTGGAGTCCGAGGAGGTTATAACACCAACAGAGGTGGTCGGTCGTGGAACAATAGGAACAGACCTCAATGTCAACTCTGCGGAAAGTTCAATCACA
CAGCACCAAAATGTTTCTTCTGA
Protein sequence
MEDTAVSSLQTFSPGNKISIVKLTDDNFLLWKFQILMALEGYDLENYLEEDPPSKTLVSVTEGTTAAEQTRRETPNPAYSVWKKQDRIISSWLVGSMSEDILYQMINCTS
AKEIWDSLHQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYTDALAAVGKPVDIEDHILFILSGLGSEYESMVSVISAKAGPQSVHEVMSLLLTQENRIESKLT
STKGSLPSANLMVQTKPPETDVTKNVSNNFTSNANSGNRGRGGVRGGYNTNRGGRSWNNRNRPQCQLCGKFNHTAPKCFF
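As a consistency check between the CDS and protein sequences, the CDS can be translated codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which the page does not state explicitly, and uses a hypothetical helper named `translate`. Only the first 30 and last 21 nucleotides of the CDS are shown to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGGAAGATACCGCCGTCTCCTCTTTACAA"  # first 30 nt of the CDS
cds_end = "GCACCAAAATGTTTCTTCTGA"             # last 21 nt of the CDS
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MEDTAVSSLQ` and `APKCFF*`, which agree with the start and end of the protein sequence listed here.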