; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000249 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000249
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr4:2234075..2236976
RNA-Seq ExpressionLag0000249
SyntenyLag0000249
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7593230.1 Pentatricopeptide repeat [Arabidopsis thaliana x Arabidopsis arenosa]1.6e-4234.08Show/hide
Query:  NFGYWKMQVKDYLTCKKVHKTLKKRSKEMSDEDWEAVDEEAVATIRMCLSMDVASLVAHETTAVKLMEALTKRYEKPSTNNK------------------
        +F +W+MQ++DYL  KK+H+ L  + ++M+ E+W+ +D + +  IR+ LS +VA  VA E T   LM+ L+  YEKPS NNK                  
Subjt:  NFGYWKMQVKDYLTCKKVHKTLKKRSKEMSDEDWEAVDEEAVATIRMCLSMDVASLVAHETTAVKLMEALTKRYEKPSTNNK------------------

Query:  -----------------------------LLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTVGSALVMTKGKD--KVDEDNEP
                                     LL SLP+SWE M+ AVSNS GN  LKF +V D  + EE+RR  + + ST  +  V  +G+D  + +++   
Subjt:  -----------------------------LLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTVGSALVMTKGKD--KVDEDNEP

Query:  SSSRK---KWKYRNEVECYYCHK---------------------KDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGK
        S SR    + K R  VEC+ C K                       W+LDS AS H   DR++  ++  G++G V + NG      GIGD++LK   G  
Subjt:  SSSRK---KWKYRNEVECYYCHK---------------------KDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGK

Query:  LVLQDVRYVPNIKMNLISIGKLADDGYMCEFGSHQCKLKFGLQVVAVGHRKSTLY
          +  VR+VP +  NLIS+G+L D G+   FG    K+K G  VVA GH++ +LY
Subjt:  LVLQDVRYVPNIKMNLISIGKLADDGYMCEFGSHQCKLKFGLQVVAVGHRKSTLY

TKS02608.1 hypothetical protein D5086_0000161380 [Populus alba]1.5e-4333.5Show/hide
Query:  NFGYWKMQVKDYLTCKKVH-KTLKKRSKEMSDEDWEAVDEEAVATIRMCLSMDVASLVAHETTAVKLMEALTKRYEKPSTNNK-----------------
        +FGYWKMQ++DYL  KK+H   L  + ++M  E+W+ +D + +  IR+ LS  VA  V  E +  KLMEAL+  YEKPS NNK                 
Subjt:  NFGYWKMQVKDYLTCKKVH-KTLKKRSKEMSDEDWEAVDEEAVATIRMCLSMDVASLVAHETTAVKLMEALTKRYEKPSTNNK-----------------

Query:  ------------------------------LLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTVGSAL-VMTKGKDKVDEDNEP
                                      LL SLP SWE M+TAVSNS G + LK+ ++ DL + EE+RR+ S + S+ GSAL + T+G+     D   
Subjt:  ------------------------------LLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTVGSAL-VMTKGKDKVDEDNEP

Query:  SSSRKKWKYRN--------EVECYYCHK--------------------------------------KDWILDSAASVHIASDRSLFTSFTGGHHGLVRMG
        S  R K +YR+        +VEC+ C K                                       +WILDS AS H      +  ++ GG HG+V + 
Subjt:  SSSRKKWKYRN--------EVECYYCHK--------------------------------------KDWILDSAASVHIASDRSLFTSFTGGHHGLVRMG

Query:  NGRTSKTRGIGDVSLKTECGGKLVLQDVRYVPNIKMNLISIGKLADDGYMCEFGSHQCKLKFGLQVVAVGHRKSTLYRC--QLNVAKGSERQWMPVK
        +G   K  GIGDV +KT  G    LQ+VR+VP +K  LIS+G+L D G+   F     K+  G  V+A G +  TL     +L V++ +E +W  +K
Subjt:  NGRTSKTRGIGDVSLKTECGGKLVLQDVRYVPNIKMNLISIGKLADDGYMCEFGSHQCKLKFGLQVVAVGHRKSTLYRC--QLNVAKGSERQWMPVK

TKS09800.1 hypothetical protein D5086_0000089010 [Populus alba]7.4e-4334.04Show/hide
Query:  NFGYWKMQVKDYLTCKKVH-KTLKKRSKEMSDEDWEAVDEEAVATIRMCLSMDVASLVAHETTAVKLMEALTKRYEKPSTNNK-----------------
        +FGYWKMQ++DYL  KK+H   L  + ++M  E+W+ +D + +  IR+ LS  VA  V  E +  KLMEAL+  YEKPS NNK                 
Subjt:  NFGYWKMQVKDYLTCKKVH-KTLKKRSKEMSDEDWEAVDEEAVATIRMCLSMDVASLVAHETTAVKLMEALTKRYEKPSTNNK-----------------

Query:  ------------------------------LLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTVGSAL-VMTKGKDKVDEDNEP
                                      LL SLP SWE M+TAVSNS G + LK+ ++ DL + EE+RR+ S + S+ GSAL + T+G+     D   
Subjt:  ------------------------------LLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTVGSAL-VMTKGKDKVDEDNEP

Query:  SSSRKKWKYRN--------EVECYYCHK--------------------------------------KDWILDSAASVHIASDRSLFTSFTGGHHGLVRMG
        S  R K +YR+        +VEC+ C K                                       +WILDS AS H      +  ++ GG HG+V + 
Subjt:  SSSRKKWKYRN--------EVECYYCHK--------------------------------------KDWILDSAASVHIASDRSLFTSFTGGHHGLVRMG

Query:  NGRTSKTRGIGDVSLKTECGGKLVLQDVRYVPNIKMNLISIGKLADDGYMCEFGSHQCKLKFGLQVVAVGHRKSTL
        +G   K  GIGDV +KT  G    LQ+VR+VP +K  LIS+G+L D G+   F     K+  G  V+A G +  TL
Subjt:  NGRTSKTRGIGDVSLKTECGGKLVLQDVRYVPNIKMNLISIGKLADDGYMCEFGSHQCKLKFGLQVVAVGHRKSTL

TKS15174.1 hypothetical protein D5086_0000036030 [Populus alba]3.9e-4435.71Show/hide
Query:  NFGYWKMQVKDYLTCKKVH-KTLKKRSKEMSDEDWEAVDEEAVATIRMCLSMDVASLVAHETTAVKLMEALTKRYEKPSTNNK-----------------
        +FGYWKMQ++DYL  KK+H   L  + ++M  E+W+ +D + +  I++ LS  VA  V  E +  KLMEAL+  YEK S NN                  
Subjt:  NFGYWKMQVKDYLTCKKVH-KTLKKRSKEMSDEDWEAVDEEAVATIRMCLSMDVASLVAHETTAVKLMEALTKRYEKPSTNNK-----------------

Query:  ---LLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTVGSAL-VMTKGKDKVDEDNEPSSSRKKWKYRN--------EVECYYCH
           LL SLP SWE M+TAVSNS G + LK+ ++ DL + EE+RR+ S + S+ GSAL + T+G+    +D   S  R K +YR+        +VEC+ C 
Subjt:  ---LLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTVGSAL-VMTKGKDKVDEDNEPSSSRKKWKYRN--------EVECYYCH

Query:  K--------------------------------------KDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLQD
        K                                       +WILDS AS H      +  ++ GG HG+V + +G      GIGDV +KT  G    LQ+
Subjt:  K--------------------------------------KDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLQD

Query:  VRYVPNIKMNLISIGKLADDGYMCEFGSHQCKLKFGLQVVAVGHRKSTLY
        VR+VP +K  LIS+G+L D G+   F     K+  G  V+A G +  TLY
Subjt:  VRYVPNIKMNLISIGKLADDGYMCEFGSHQCKLKFGLQVVAVGHRKSTLY

XP_034902342.1 LOW QUALITY PROTEIN: uncharacterized protein LOC118039689 [Populus alba]3.9e-4435.71Show/hide
Query:  NFGYWKMQVKDYLTCKKVH-KTLKKRSKEMSDEDWEAVDEEAVATIRMCLSMDVASLVAHETTAVKLMEALTKRYEKPSTNNK-----------------
        +FGYWKMQ++DYL  KK+H   L  + ++M  E+W+ +D + +  I++ LS  VA  V  E +  KLMEAL+  YEK S NN                  
Subjt:  NFGYWKMQVKDYLTCKKVH-KTLKKRSKEMSDEDWEAVDEEAVATIRMCLSMDVASLVAHETTAVKLMEALTKRYEKPSTNNK-----------------

Query:  ---LLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTVGSAL-VMTKGKDKVDEDNEPSSSRKKWKYRN--------EVECYYCH
           LL SLP SWE M+TAVSNS G + LK+ ++ DL + EE+RR+ S + S+ GSAL + T+G+    +D   S  R K +YR+        +VEC+ C 
Subjt:  ---LLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTVGSAL-VMTKGKDKVDEDNEPSSSRKKWKYRN--------EVECYYCH

Query:  K--------------------------------------KDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLQD
        K                                       +WILDS AS H      +  ++ GG HG+V + +G      GIGDV +KT  G    LQ+
Subjt:  K--------------------------------------KDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLQD

Query:  VRYVPNIKMNLISIGKLADDGYMCEFGSHQCKLKFGLQVVAVGHRKSTLY
        VR+VP +K  LIS+G+L D G+   F     K+  G  V+A G +  TLY
Subjt:  VRYVPNIKMNLISIGKLADDGYMCEFGSHQCKLKFGLQVVAVGHRKSTLY

TrEMBL top hitse value%identityAlignment
A0A4U5P1P0 CCHC-type domain-containing protein1.8e-4233.51Show/hide
Query:  NFGYWKMQVKDYLTCKKVH-KTLKKRSKEMSDEDWEAVDEEAVATIRMCLSMDVASLVAHETTAVKLMEALTKRYEKPSTNNK-----------------
        +FGYWKMQ++DYL  KK+H   L  + ++M  E+W+ +D + +  IR+ LS  VA  V  E +  KLMEAL+  YEKPS NNK                 
Subjt:  NFGYWKMQVKDYLTCKKVH-KTLKKRSKEMSDEDWEAVDEEAVATIRMCLSMDVASLVAHETTAVKLMEALTKRYEKPSTNNK-----------------

Query:  ------------------------------LLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTVGSAL-VMTKGKDKVDEDNEP
                                      LL SLP SWE M+TAVSNS G + LK+ ++ DL + EE+RR+ S + S+ GSAL + T+G+     D   
Subjt:  ------------------------------LLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTVGSAL-VMTKGKDKVDEDNEP

Query:  SSSRKKWKYRN--------EVECYYCHK--------------------------------------KDWILDSAASVHIASDRSLFTSFTGGHHGLVRMG
        S  R K +YR+        +VEC+ C K                                       +WILDS AS H      +  ++ GG HG+V + 
Subjt:  SSSRKKWKYRN--------EVECYYCHK--------------------------------------KDWILDSAASVHIASDRSLFTSFTGGHHGLVRMG

Query:  NGRTSKTRGIGDVSLKTECGGKLVLQDVRYVPNIKMNLISIGKLADDGYMCEFGSHQCKLKFGLQVVAVGHRKSTL
        +G      GIGDV +KT  G    LQ++R+VP +K  LIS+G+L D G+   F     K+  G  V+A G +  TL
Subjt:  NGRTSKTRGIGDVSLKTECGGKLVLQDVRYVPNIKMNLISIGKLADDGYMCEFGSHQCKLKFGLQVVAVGHRKSTL

A0A4U5PY83 CCHC-type domain-containing protein7.2e-4433.5Show/hide
Query:  NFGYWKMQVKDYLTCKKVH-KTLKKRSKEMSDEDWEAVDEEAVATIRMCLSMDVASLVAHETTAVKLMEALTKRYEKPSTNNK-----------------
        +FGYWKMQ++DYL  KK+H   L  + ++M  E+W+ +D + +  IR+ LS  VA  V  E +  KLMEAL+  YEKPS NNK                 
Subjt:  NFGYWKMQVKDYLTCKKVH-KTLKKRSKEMSDEDWEAVDEEAVATIRMCLSMDVASLVAHETTAVKLMEALTKRYEKPSTNNK-----------------

Query:  ------------------------------LLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTVGSAL-VMTKGKDKVDEDNEP
                                      LL SLP SWE M+TAVSNS G + LK+ ++ DL + EE+RR+ S + S+ GSAL + T+G+     D   
Subjt:  ------------------------------LLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTVGSAL-VMTKGKDKVDEDNEP

Query:  SSSRKKWKYRN--------EVECYYCHK--------------------------------------KDWILDSAASVHIASDRSLFTSFTGGHHGLVRMG
        S  R K +YR+        +VEC+ C K                                       +WILDS AS H      +  ++ GG HG+V + 
Subjt:  SSSRKKWKYRN--------EVECYYCHK--------------------------------------KDWILDSAASVHIASDRSLFTSFTGGHHGLVRMG

Query:  NGRTSKTRGIGDVSLKTECGGKLVLQDVRYVPNIKMNLISIGKLADDGYMCEFGSHQCKLKFGLQVVAVGHRKSTLYRC--QLNVAKGSERQWMPVK
        +G   K  GIGDV +KT  G    LQ+VR+VP +K  LIS+G+L D G+   F     K+  G  V+A G +  TL     +L V++ +E +W  +K
Subjt:  NGRTSKTRGIGDVSLKTECGGKLVLQDVRYVPNIKMNLISIGKLADDGYMCEFGSHQCKLKFGLQVVAVGHRKSTLYRC--QLNVAKGSERQWMPVK

A0A4U5QGR0 Uncharacterized protein3.6e-4334.04Show/hide
Query:  NFGYWKMQVKDYLTCKKVH-KTLKKRSKEMSDEDWEAVDEEAVATIRMCLSMDVASLVAHETTAVKLMEALTKRYEKPSTNNK-----------------
        +FGYWKMQ++DYL  KK+H   L  + ++M  E+W+ +D + +  IR+ LS  VA  V  E +  KLMEAL+  YEKPS NNK                 
Subjt:  NFGYWKMQVKDYLTCKKVH-KTLKKRSKEMSDEDWEAVDEEAVATIRMCLSMDVASLVAHETTAVKLMEALTKRYEKPSTNNK-----------------

Query:  ------------------------------LLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTVGSAL-VMTKGKDKVDEDNEP
                                      LL SLP SWE M+TAVSNS G + LK+ ++ DL + EE+RR+ S + S+ GSAL + T+G+     D   
Subjt:  ------------------------------LLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTVGSAL-VMTKGKDKVDEDNEP

Query:  SSSRKKWKYRN--------EVECYYCHK--------------------------------------KDWILDSAASVHIASDRSLFTSFTGGHHGLVRMG
        S  R K +YR+        +VEC+ C K                                       +WILDS AS H      +  ++ GG HG+V + 
Subjt:  SSSRKKWKYRN--------EVECYYCHK--------------------------------------KDWILDSAASVHIASDRSLFTSFTGGHHGLVRMG

Query:  NGRTSKTRGIGDVSLKTECGGKLVLQDVRYVPNIKMNLISIGKLADDGYMCEFGSHQCKLKFGLQVVAVGHRKSTL
        +G   K  GIGDV +KT  G    LQ+VR+VP +K  LIS+G+L D G+   F     K+  G  V+A G +  TL
Subjt:  NGRTSKTRGIGDVSLKTECGGKLVLQDVRYVPNIKMNLISIGKLADDGYMCEFGSHQCKLKFGLQVVAVGHRKSTL

A0A4U5QVL9 CCHC-type domain-containing protein1.9e-4435.71Show/hide
Query:  NFGYWKMQVKDYLTCKKVH-KTLKKRSKEMSDEDWEAVDEEAVATIRMCLSMDVASLVAHETTAVKLMEALTKRYEKPSTNNK-----------------
        +FGYWKMQ++DYL  KK+H   L  + ++M  E+W+ +D + +  I++ LS  VA  V  E +  KLMEAL+  YEK S NN                  
Subjt:  NFGYWKMQVKDYLTCKKVH-KTLKKRSKEMSDEDWEAVDEEAVATIRMCLSMDVASLVAHETTAVKLMEALTKRYEKPSTNNK-----------------

Query:  ---LLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTVGSAL-VMTKGKDKVDEDNEPSSSRKKWKYRN--------EVECYYCH
           LL SLP SWE M+TAVSNS G + LK+ ++ DL + EE+RR+ S + S+ GSAL + T+G+    +D   S  R K +YR+        +VEC+ C 
Subjt:  ---LLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTVGSAL-VMTKGKDKVDEDNEPSSSRKKWKYRN--------EVECYYCH

Query:  K--------------------------------------KDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLQD
        K                                       +WILDS AS H      +  ++ GG HG+V + +G      GIGDV +KT  G    LQ+
Subjt:  K--------------------------------------KDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLQD

Query:  VRYVPNIKMNLISIGKLADDGYMCEFGSHQCKLKFGLQVVAVGHRKSTLY
        VR+VP +K  LIS+G+L D G+   F     K+  G  V+A G +  TLY
Subjt:  VRYVPNIKMNLISIGKLADDGYMCEFGSHQCKLKFGLQVVAVGHRKSTLY

A0A4V6XW18 CCHC-type domain-containing protein1.8e-4233.51Show/hide
Query:  NFGYWKMQVKDYLTCKKVH-KTLKKRSKEMSDEDWEAVDEEAVATIRMCLSMDVASLVAHETTAVKLMEALTKRYEKPSTNNK-----------------
        +FGYWKMQ++DYL  KK+H   L  + ++M +E+W+ +D + +  IR+ LS  VA  V  E +  KLMEAL+  YEKPS NNK                 
Subjt:  NFGYWKMQVKDYLTCKKVH-KTLKKRSKEMSDEDWEAVDEEAVATIRMCLSMDVASLVAHETTAVKLMEALTKRYEKPSTNNK-----------------

Query:  ------------------------------LLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTVGSAL-VMTKGKDKVDEDNEP
                                      LL SLP SWE M+TAVSNS G + LK+ ++ DL + EE+RR+ S + S+ GSAL + T+G+     D   
Subjt:  ------------------------------LLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTVGSAL-VMTKGKDKVDEDNEP

Query:  SSSRKKWKYRN--------EVECYYCHK--------------------------------------KDWILDSAASVHIASDRSLFTSFTGGHHGLVRMG
        S  R K +YR+        +VEC+ C K                                       +WILDS AS H      +  ++ GG HG+V + 
Subjt:  SSSRKKWKYRN--------EVECYYCHK--------------------------------------KDWILDSAASVHIASDRSLFTSFTGGHHGLVRMG

Query:  NGRTSKTRGIGDVSLKTECGGKLVLQDVRYVPNIKMNLISIGKLADDGYMCEFGSHQCKLKFGLQVVAVGHRKSTL
        +G      G GDV +KT  G    LQ+VR+VP +K  LIS+G+L D G+   F     K+  G  V+A G +  TL
Subjt:  NGRTSKTRGIGDVSLKTECGGKLVLQDVRYVPNIKMNLISIGKLADDGYMCEFGSHQCKLKFGLQVVAVGHRKSTL

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.0e-2028.57Show/hide
Query:  LLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTVGSALVMTKGKDKVDEDNE----------PSSSRKKWKYRNEVECYYCH--
        LL SLP S++ + T + +  G  T++  +V    ++ E  R+   K    G AL+ T+G+ +  + +            S +R K + RN   CY C+  
Subjt:  LLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTVGSALVMTKGKDKVDEDNE----------PSSSRKKWKYRNEVECYYCH--

Query:  ----------------------------------------------------KKDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDV
                                                            + +W++D+AAS H    R LF  +  G  G V+MGN   SK  GIGD+
Subjt:  ----------------------------------------------------KKDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDV

Query:  SLKTECGGKLVLQDVRYVPNIKMNLISIGKLADDGYMCEFGSHQCKLKFGLQVVAVGHRKSTLYRCQLNVAKG
         +KT  G  LVL+DVR+VP+++MNLIS   L  DGY   F + + +L  G  V+A G  + TLYR    + +G
Subjt:  SLKTECGGKLVLQDVRYVPNIKMNLISIGKLADDGYMCEFGSHQCKLKFGLQVVAVGHRKSTLYRCQLNVAKG

P25601 Putative transposon Ty5-1 protein YCL075W8.3e-0536.49Show/hide
Query:  DWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLQDVRYVPNIKMNLISI
        +WI D+  + H+  DRS+F+SFT         G G +    G G V++     G + L DV YVP++ +NLIS+
Subjt:  DWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLQDVRYVPNIKMNLISI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.5e-0638.46Show/hide
Query:  DWILDSAASVHIASD---RSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLQDVRYVPNIKMNLISIGKLAD-DGYMCEF--GSHQCK-L
        +W+LDS A+ HI SD    SL   +TGG    V + +G T      G  SL T+    L L ++ YVPNI  NLIS+ +L + +G   EF   S Q K L
Subjt:  DWILDSAASVHIASD---RSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLQDVRYVPNIKMNLISIGKLAD-DGYMCEF--GSHQCK-L

Query:  KFGLQVVAVGHRKSTLY
          G+ ++  G  K  LY
Subjt:  KFGLQVVAVGHRKSTLY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.1e-0440.24Show/hide
Query:  DWILDSAASVHIASD---RSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLQDVRYVPNIKMNLISIGKLAD
        +W+LDS A+ HI SD    S    +TGG    V + +G T      G  SL T     L L  V YVPNI  NLIS+ +L +
Subjt:  DWILDSAASVHIASD---RSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLQDVRYVPNIKMNLISIGKLAD

Arabidopsis top hitse value%identityAlignment
AT3G21000.1 Gag-Pol-related retrotransposon family protein9.4e-0425.58Show/hide
Query:  WILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLQDVRYVPNIKMNLISIGKLADDGYMCEFG
        WI+   A +++      FT+        V   +G      G GDV ++ + G K  +++V +VP +  N++S GK+    Y    G
Subjt:  WILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLQDVRYVPNIKMNLISIGKLADDGYMCEFG

AT3G29785.1 unknown protein4.3e-0935.29Show/hide
Query:  NFGYWKMQVKDYLTCKKVHKTLKKRSKEMSDEDWEAVDEEAVATIRMCLSMDVASLVAHETTAVKLMEALTKRYEKPSTNNKLLT
        ++ + +M+++DYL  KK+H+ L K+ + MS +DW  +  + +  IR+ +S ++A  VA E +   LM+ L+  Y+KPSTNN +++
Subjt:  NFGYWKMQVKDYLTCKKVHKTLKKRSKEMSDEDWEAVDEEAVATIRMCLSMDVASLVAHETTAVKLMEALTKRYEKPSTNNKLLT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTGCAAAATCTTCACCGGAGGAAAAGGAAAGACCACGACCGAAGGGAAGAACAAGAGCTGCGGCGCTGAAGGTTGAAGAAGGGGAGTTGCTGCGCCGAAGGAGTCA
TGGGTTTTGTAGAGCCAAAAAGTTTCGATGGAGTCATGAAGTTCGATGGAAAAATTTTGGATATTGGAAGATGCAAGTCAAGGATTATTTAACTTGCAAGAAAGTGCATA
AGACATTGAAGAAGAGATCGAAAGAGATGTCGGACGAAGATTGGGAAGCTGTAGATGAAGAGGCAGTTGCAACCATAAGGATGTGTTTGTCGATGGATGTGGCAAGTCTA
GTAGCCCATGAGACAACTGCAGTCAAGTTGATGGAAGCGCTTACAAAAAGGTATGAAAAACCCTCTACAAATAATAAGTTGTTAACGTCTTTACCTGATAGTTGGGAAAC
GATGAAGACAGCAGTGTCTAATTCTACTGGAAATAACACTTTAAAATTTTCAGAAGTTTGTGATTTAGCCATAGTTGAGGAAATTCGTAGGCAGGGTAGTAATAAAGAGT
CTACAGTAGGGTCAGCTTTGGTTATGACTAAAGGTAAAGATAAAGTTGATGAAGATAATGAACCGAGTAGCAGTAGGAAAAAGTGGAAATATAGGAATGAGGTAGAATGT
TATTACTGCCATAAGAAAGATTGGATATTAGACAGTGCAGCTTCTGTTCACATAGCTTCAGATAGGAGTTTGTTCACATCATTCACAGGAGGACATCATGGCCTAGTGAG
GATGGGGAATGGTAGAACCTCTAAGACTAGAGGGATTGGAGATGTTAGTCTGAAGACAGAATGTGGAGGTAAATTGGTACTGCAAGATGTCAGGTACGTGCCTAATATCA
AGATGAATCTTATTTCTATTGGTAAGTTGGCAGATGATGGTTACATGTGTGAGTTTGGTAGTCACCAGTGTAAACTCAAGTTCGGATTGCAGGTAGTGGCAGTTGGTCAT
AGGAAATCTACACTGTACAGATGTCAGTTGAATGTTGCCAAAGGTTCAGAGAGACAGTGGATGCCGGTTAAAGCTATAGATGGTAGTTGTAGAGGTACAGTTGAGCCAGC
AACAAGGATAGCCAATTTCGATCAGTTCGATCAAGACCCTTTAGTTCAGAAACAATTGGGAAGTCCAGGAGAGAAAGTTGATGGCTATCGTGAATCCCCAGTTGTCAGAC
GGTCGAATGAATTGAAGAAGTCGCTTAGGCGAGTTGAGGCATCAAAGTGGAAGGCCAGAGCAGTTGCTAAGGTCAAAGGTCAGGTCTCTAGCTTGGTAACAGGTTTGAAT
AGAGGATTCAAGCCATTCTCAGAGTGTATCTTCTTCAGGAACAGTTGTTCGGGTTGGAAGAAGATGACAGGTATTTTGAGCCTTGGAGTTTACCAAGAAGATCAGAGATA
A
mRNA sequenceShow/hide mRNA sequence
ATGTTTGCAAAATCTTCACCGGAGGAAAAGGAAAGACCACGACCGAAGGGAAGAACAAGAGCTGCGGCGCTGAAGGTTGAAGAAGGGGAGTTGCTGCGCCGAAGGAGTCA
TGGGTTTTGTAGAGCCAAAAAGTTTCGATGGAGTCATGAAGTTCGATGGAAAAATTTTGGATATTGGAAGATGCAAGTCAAGGATTATTTAACTTGCAAGAAAGTGCATA
AGACATTGAAGAAGAGATCGAAAGAGATGTCGGACGAAGATTGGGAAGCTGTAGATGAAGAGGCAGTTGCAACCATAAGGATGTGTTTGTCGATGGATGTGGCAAGTCTA
GTAGCCCATGAGACAACTGCAGTCAAGTTGATGGAAGCGCTTACAAAAAGGTATGAAAAACCCTCTACAAATAATAAGTTGTTAACGTCTTTACCTGATAGTTGGGAAAC
GATGAAGACAGCAGTGTCTAATTCTACTGGAAATAACACTTTAAAATTTTCAGAAGTTTGTGATTTAGCCATAGTTGAGGAAATTCGTAGGCAGGGTAGTAATAAAGAGT
CTACAGTAGGGTCAGCTTTGGTTATGACTAAAGGTAAAGATAAAGTTGATGAAGATAATGAACCGAGTAGCAGTAGGAAAAAGTGGAAATATAGGAATGAGGTAGAATGT
TATTACTGCCATAAGAAAGATTGGATATTAGACAGTGCAGCTTCTGTTCACATAGCTTCAGATAGGAGTTTGTTCACATCATTCACAGGAGGACATCATGGCCTAGTGAG
GATGGGGAATGGTAGAACCTCTAAGACTAGAGGGATTGGAGATGTTAGTCTGAAGACAGAATGTGGAGGTAAATTGGTACTGCAAGATGTCAGGTACGTGCCTAATATCA
AGATGAATCTTATTTCTATTGGTAAGTTGGCAGATGATGGTTACATGTGTGAGTTTGGTAGTCACCAGTGTAAACTCAAGTTCGGATTGCAGGTAGTGGCAGTTGGTCAT
AGGAAATCTACACTGTACAGATGTCAGTTGAATGTTGCCAAAGGTTCAGAGAGACAGTGGATGCCGGTTAAAGCTATAGATGGTAGTTGTAGAGGTACAGTTGAGCCAGC
AACAAGGATAGCCAATTTCGATCAGTTCGATCAAGACCCTTTAGTTCAGAAACAATTGGGAAGTCCAGGAGAGAAAGTTGATGGCTATCGTGAATCCCCAGTTGTCAGAC
GGTCGAATGAATTGAAGAAGTCGCTTAGGCGAGTTGAGGCATCAAAGTGGAAGGCCAGAGCAGTTGCTAAGGTCAAAGGTCAGGTCTCTAGCTTGGTAACAGGTTTGAAT
AGAGGATTCAAGCCATTCTCAGAGTGTATCTTCTTCAGGAACAGTTGTTCGGGTTGGAAGAAGATGACAGGTATTTTGAGCCTTGGAGTTTACCAAGAAGATCAGAGATA
A
Protein sequenceShow/hide protein sequence
MFAKSSPEEKERPRPKGRTRAAALKVEEGELLRRRSHGFCRAKKFRWSHEVRWKNFGYWKMQVKDYLTCKKVHKTLKKRSKEMSDEDWEAVDEEAVATIRMCLSMDVASL
VAHETTAVKLMEALTKRYEKPSTNNKLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIVEEIRRQGSNKESTVGSALVMTKGKDKVDEDNEPSSSRKKWKYRNEVEC
YYCHKKDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLQDVRYVPNIKMNLISIGKLADDGYMCEFGSHQCKLKFGLQVVAVGH
RKSTLYRCQLNVAKGSERQWMPVKAIDGSCRGTVEPATRIANFDQFDQDPLVQKQLGSPGEKVDGYRESPVVRRSNELKKSLRRVEASKWKARAVAKVKGQVSSLVTGLN
RGFKPFSECIFFRNSCSGWKKMTGILSLGVYQEDQR