; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0021019 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0021019
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr02:20888953..20890253
RNA-Seq ExpressionPay0021019
SyntenyPay0021019
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050409.1 hypothetical protein E6C27_scaffold1166G00260 [Cucumis melo var. makuwa]3.8e-12264.25Show/hide
Query:  LMSFDLNRPFRFEGAHFKRWKQKMLFFLTLKKVATACTTEKPEVLEKDPTEKQLKNLATWTETDFICKNLILNGLTDELYDYYSTMTTAKEVWDALQKKY
        LMS DLNRPFRFEGAHFKRWKQKMLFFLTLKKVATACTTEKP+V EKDPT++QLKNL TWTETDFICKNLILNGLTDELYDYYSTMTTAK+VW+ALQKKY
Subjt:  LMSFDLNRPFRFEGAHFKRWKQKMLFFLTLKKVATACTTEKPEVLEKDPTEKQLKNLATWTETDFICKNLILNGLTDELYDYYSTMTTAKEVWDALQKKY

Query:  DTEEAWSKKYAISRYLRYQMTDDKS--------------IISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLESLITRLRIEEEARKHDKKE
        DTEEA SKKYA+SRYLRYQMTDD+S              II+EGMPLDDQFQV +IIDKL  LWKDFKNTLRHKTKEFSLE+L TRLRIEEEA+KHDKKE
Subjt:  DTEEAWSKKYAISRYLRYQMTDDKS--------------IISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLESLITRLRIEEEARKHDKKE

Query:  EVNAILRKKLTAVLKPDLKPKGNEMKRESNKQNNPQSKSMVQIVCYNCNKPGHLARNCRNRSRPPAQANLIEDELVAMISEVNVIGGFKGW-----RSSH
        EVNAI RKK TAVLK DLKPKGN+MK+  NKQNNPQS SM                           ANLIE+ELVAMISEVNVIGG +GW      S H
Subjt:  EVNAILRKKLTAVLKPDLKPKGNEMKRESNKQNNPQSKSMVQIVCYNCNKPGHLARNCRNRSRPPAQANLIEDELVAMISEVNVIGGFKGW-----RSSH

Query:  NQRGW-YWRSRNEI----------HIQQDACAE--------GSSAYSRNL-------NEFGLGYLLNKAGFTQTIGSDLFTLTKNNVFMEKSFATDGMFK
               +R  NE+          HI +    E        G +   + +             YLLNKAGFTQTIGS+LFTL+KNNVF+ K ++TDGM +
Subjt:  NQRGW-YWRSRNEI----------HIQQDACAE--------GSSAYSRNL-------NEFGLGYLLNKAGFTQTIGSDLFTLTKNNVFMEKSFATDGMFK

KAA0059670.1 putative Polyprotein [Cucumis melo var. makuwa]3.3e-12663.01Show/hide
Query:  MAGQIQSDLMSFDLNRPFRFEGAHFKRWKQKMLFFLTLKKVATACTTEKPEVLEKDPTEKQLKNLATWTETDFICKNLILNGLTDELYDYYSTMTTAKEV
        MAGQ QSDLMSF+LN PFRF+GAHFKRWKQKMLFFLTLKKVATACT EKP+V EKDPT++QL +LATWTETDFICKNLILNGLTDELYDYYSTM TAKEV
Subjt:  MAGQIQSDLMSFDLNRPFRFEGAHFKRWKQKMLFFLTLKKVATACTTEKPEVLEKDPTEKQLKNLATWTETDFICKNLILNGLTDELYDYYSTMTTAKEV

Query:  WDALQKKYDTEEAWSKKYAISRYLRYQMTDDKSIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLESLITRLRIEEEARKHDKKEEVNAIL
        WD LQ KYD EE  SKKY +SRYLRYQMTDDKS+                        +DFKNTLRHKTKE SL+SLITRLRIEEEARKH+KKEEVNAI 
Subjt:  WDALQKKYDTEEAWSKKYAISRYLRYQMTDDKSIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLESLITRLRIEEEARKHDKKEEVNAIL

Query:  RKKLTAVLKPDLKPKGNEMKRESNKQNNPQSKSMVQIVCYNCNKPGHLARNCRNRSRPPAQANLIEDELVAMISEVNVIGGFKGW----RSSHN--QRGW
        RKK TAVLKPDLKPKGN+MKRESNKQNNPQS+S                            ANLIEDELVAMISEVNVI GF+GW     +SH+      
Subjt:  RKKLTAVLKPDLKPKGNEMKRESNKQNNPQSKSMVQIVCYNCNKPGHLARNCRNRSRPPAQANLIEDELVAMISEVNVIGGFKGW----RSSHN--QRGW

Query:  YWRSRNEIHIQQDACAEGSSAYSRNLNEFGL-------------------------GYLLNKAGFTQTIGSDLFTLTKNNVFMEKSFATDGMFKLNLEIN
         +R  NE+  ++    +  +     + +  L                          YLLNKAGFTQTIGSDLFTLTKNNVF+ K +ATDGMFKLNL+IN
Subjt:  YWRSRNEIHIQQDACAEGSSAYSRNLNEFGL-------------------------GYLLNKAGFTQTIGSDLFTLTKNNVFMEKSFATDGMFKLNLEIN

Query:  KIAFFAYMLTSFNVWHARL
        KIA  AYMLTSFNVWHARL
Subjt:  KIAFFAYMLTSFNVWHARL

KAA0065374.1 uncharacterized protein E6C27_scaffold17G00360 [Cucumis melo var. makuwa]5.5e-12164.48Show/hide
Query:  MAGQIQSDLMSFDLNRPFRFEGAHFKRWKQKMLFFLTLKKVATACTTEKPEVLEKDPTEKQLKNLATWTETDFICKNLILNGLTDELYDYYSTMTTAKEV
        M GQIQSDLMS DLNRPFRFEGAHFKRWKQKMLFFLTLKKVATACTTEK +V EKDP E+QLKNLATWTETDFICKNLILNGLTDELYDYYSTMTT KEV
Subjt:  MAGQIQSDLMSFDLNRPFRFEGAHFKRWKQKMLFFLTLKKVATACTTEKPEVLEKDPTEKQLKNLATWTETDFICKNLILNGLTDELYDYYSTMTTAKEV

Query:  WDALQKKYDTEEAWSKKYAISRYLRYQMTDDKS--------------IISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLESLITRLRIEEE
        WDALQKKYDT+EA SKKYA+SRYLRYQMTDDKS              IISEGMPLDDQFQVAVIIDKLP LWKDFKNTLRHKTKEFSLESLITRL+IEEE
Subjt:  WDALQKKYDTEEAWSKKYAISRYLRYQMTDDKS--------------IISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLESLITRLRIEEE

Query:  ARKHDKKEEVNAILRKKLTAVLKPDLKPKGNEMKRESNKQNNPQSKSMVQIVCYNCNKPGHLARNCRNRSRPPAQANLIEDELVAMISEVNVIGGFKGWR
        ARKHDKKEEVNAI RKK TAVLK DL+    E K++  +    +  ++ +   ++  K                 A + E EL     +  V+       
Subjt:  ARKHDKKEEVNAILRKKLTAVLKPDLKPKGNEMKRESNKQNNPQSKSMVQIVCYNCNKPGHLARNCRNRSRPPAQANLIEDELVAMISEVNVIGGFKGWR

Query:  SSHNQRGWYWRSRNEIHIQQDACAEGSSAYSRNLNEFGLGYLLNKAGFTQTIGSDLFTLTKNNVFMEKSFATDGMFKLNLEINKIAFFAYMLTSFNV
                    +  +H         +    +NL      YLLNKAGFTQTIGSDLFTLTKNNVF+ K +ATDGMFKLN+EINKIA  AYMLTSFN+
Subjt:  SSHNQRGWYWRSRNEIHIQQDACAEGSSAYSRNLNEFGLGYLLNKAGFTQTIGSDLFTLTKNNVFMEKSFATDGMFKLNLEINKIAFFAYMLTSFNV

PON99483.1 Zinc finger, CCHC-type, partial [Trema orientale]3.8e-11453.27Show/hide
Query:  MSFDLNRPFRFEGAHFKRWKQKMLFFLTLKKVATACTTEKPEVLEKDPTEKQLKNLATWTETDFICKNLILNGLTDELYDYYSTMTTAKEVWDALQKKYD
        ++ DLN+PFRFEG HFKRW+QKMLF+LT KKVA  CT+EKP +L  +P E+Q K   +W E DF+CKN ILNGL+D+LYDYY++  +AKE+WDALQKKYD
Subjt:  MSFDLNRPFRFEGAHFKRWKQKMLFFLTLKKVATACTTEKPEVLEKDPTEKQLKNLATWTETDFICKNLILNGLTDELYDYYSTMTTAKEVWDALQKKYD

Query:  TEEAWSKKYAISRYLRYQMTDDKS--------------IISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLESLITRLRIEEEARKHDKKEE
        TEEA +KKYA+SRYL+YQMTDDKS              IISEGM LD+QFQVAV+IDKLPP WKDFK+ LRHKTKEFSLESLITRLRIEEEARK D+K+E
Subjt:  TEEAWSKKYAISRYLRYQMTDDKS--------------IISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLESLITRLRIEEEARKHDKKEE

Query:  VNAI---LRKKLTAVLKPDLKPKGNEMKRESNKQNN----------------PQSKSMVQIVCYNCNKPGHLARNCRNRSRPPAQANLIEDELVAMISEV
        V  +    +K   AVLKP+ K   N+ +  +  +NN                P      Q +CYNCNKPGH+ARNCRNR RP  QANL E++L+AMISE+
Subjt:  VNAI---LRKKLTAVLKPDLKPKGNEMKRESNKQNN----------------PQSKSMVQIVCYNCNKPGHLARNCRNRSRPPAQANLIEDELVAMISEV

Query:  NVIGGFKGW-------RSSHNQRGWYWRSRNEIHIQQDACAEGSSAYSRNLNEFGL-------------------------GYLLNKAGFTQTIGSDLFT
        N++GG +GW       R   N R  + ++ +E   ++    +  +       E  L                         GYLLNK GFTQTIG+DLFT
Subjt:  NVIGGFKGW-------RSSHNQRGWYWRSRNEIHIQQDACAEGSSAYSRNLNEFGL-------------------------GYLLNKAGFTQTIGSDLFT

Query:  LTKNNVFMEKSFATDGMFKLNLEINKIA
        +TKNNVF+ K +ATDGMFKLN++ NKIA
Subjt:  LTKNNVFMEKSFATDGMFKLNLEINKIA

TYJ98000.1 hypothetical protein E5676_scaffold487G00230 [Cucumis melo var. makuwa]8.5e-12263Show/hide
Query:  LMSFDLNRPFRFEGAHFKRWKQKMLFFLTLKKVATACTTEKPEVLEKDPTEKQLKNLATWTETDFICKNLILNGLTDELYDYYSTMTTAKEVWDALQKKY
        LMS DLNRPFRFEGAHFKRWKQKMLFFLTLKKVATACT EKP+V EKDPT++QLKNL TWTETDFICKNLILNGLTDELYDYYSTMTTAK+VW+ALQKKY
Subjt:  LMSFDLNRPFRFEGAHFKRWKQKMLFFLTLKKVATACTTEKPEVLEKDPTEKQLKNLATWTETDFICKNLILNGLTDELYDYYSTMTTAKEVWDALQKKY

Query:  DTEEAWSKKYAISRYLRYQMTDDKS--------------IISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLESLITRLRIEEEARKHDKKE
        DTEEA SKKYA+SRYLRYQMTDD+S              II+EGMPLDDQFQV +IIDKLP LWKDFKNTLRHKTKEFSLE+LITRL+IEEEA+K DKK+
Subjt:  DTEEAWSKKYAISRYLRYQMTDDKS--------------IISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLESLITRLRIEEEARKHDKKE

Query:  EVNAILRKKLTAVLKPDLKPKGNEMKRESNKQNNPQSKSMVQIVCYNCNKPGHLARNCRNRSRPPAQANLIEDELVAMISEVNVIGGFKGWRSSHNQRGW
        EVNAI RKK TAVLK DLKPKGN+MK+  NKQNNPQS SM                           ANLIE+ELVAMISEVNVIGG +GW    N    
Subjt:  EVNAILRKKLTAVLKPDLKPKGNEMKRESNKQNNPQSKSMVQIVCYNCNKPGHLARNCRNRSRPPAQANLIEDELVAMISEVNVIGGFKGWRSSHNQRGW

Query:  ------YWRSRNEIHIQQDACAEGSSAYSRNLNEFGL-------------------------GYLLNKAGFTQTIGSDLFTLTKNNVFMEKSFATDGMFK
               +R  NE+  +     +        + E  L                          YLLNKAGFTQTIGS+LFTL+KNNVF+ K ++TDGM +
Subjt:  ------YWRSRNEIHIQQDACAEGSSAYSRNLNEFGL-------------------------GYLLNKAGFTQTIGSDLFTLTKNNVFMEKSFATDGMFK

TrEMBL top hitse value%identityAlignment
A0A2P5FP19 Zinc finger, CCHC-type (Fragment)1.8e-11453.27Show/hide
Query:  MSFDLNRPFRFEGAHFKRWKQKMLFFLTLKKVATACTTEKPEVLEKDPTEKQLKNLATWTETDFICKNLILNGLTDELYDYYSTMTTAKEVWDALQKKYD
        ++ DLN+PFRFEG HFKRW+QKMLF+LT KKVA  CT+EKP +L  +P E+Q K   +W E DF+CKN ILNGL+D+LYDYY++  +AKE+WDALQKKYD
Subjt:  MSFDLNRPFRFEGAHFKRWKQKMLFFLTLKKVATACTTEKPEVLEKDPTEKQLKNLATWTETDFICKNLILNGLTDELYDYYSTMTTAKEVWDALQKKYD

Query:  TEEAWSKKYAISRYLRYQMTDDKS--------------IISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLESLITRLRIEEEARKHDKKEE
        TEEA +KKYA+SRYL+YQMTDDKS              IISEGM LD+QFQVAV+IDKLPP WKDFK+ LRHKTKEFSLESLITRLRIEEEARK D+K+E
Subjt:  TEEAWSKKYAISRYLRYQMTDDKS--------------IISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLESLITRLRIEEEARKHDKKEE

Query:  VNAI---LRKKLTAVLKPDLKPKGNEMKRESNKQNN----------------PQSKSMVQIVCYNCNKPGHLARNCRNRSRPPAQANLIEDELVAMISEV
        V  +    +K   AVLKP+ K   N+ +  +  +NN                P      Q +CYNCNKPGH+ARNCRNR RP  QANL E++L+AMISE+
Subjt:  VNAI---LRKKLTAVLKPDLKPKGNEMKRESNKQNN----------------PQSKSMVQIVCYNCNKPGHLARNCRNRSRPPAQANLIEDELVAMISEV

Query:  NVIGGFKGW-------RSSHNQRGWYWRSRNEIHIQQDACAEGSSAYSRNLNEFGL-------------------------GYLLNKAGFTQTIGSDLFT
        N++GG +GW       R   N R  + ++ +E   ++    +  +       E  L                         GYLLNK GFTQTIG+DLFT
Subjt:  NVIGGFKGW-------RSSHNQRGWYWRSRNEIHIQQDACAEGSSAYSRNLNEFGL-------------------------GYLLNKAGFTQTIGSDLFT

Query:  LTKNNVFMEKSFATDGMFKLNLEINKIA
        +TKNNVF+ K +ATDGMFKLN++ NKIA
Subjt:  LTKNNVFMEKSFATDGMFKLNLEINKIA

A0A5A7UA92 Uncharacterized protein1.8e-12264.25Show/hide
Query:  LMSFDLNRPFRFEGAHFKRWKQKMLFFLTLKKVATACTTEKPEVLEKDPTEKQLKNLATWTETDFICKNLILNGLTDELYDYYSTMTTAKEVWDALQKKY
        LMS DLNRPFRFEGAHFKRWKQKMLFFLTLKKVATACTTEKP+V EKDPT++QLKNL TWTETDFICKNLILNGLTDELYDYYSTMTTAK+VW+ALQKKY
Subjt:  LMSFDLNRPFRFEGAHFKRWKQKMLFFLTLKKVATACTTEKPEVLEKDPTEKQLKNLATWTETDFICKNLILNGLTDELYDYYSTMTTAKEVWDALQKKY

Query:  DTEEAWSKKYAISRYLRYQMTDDKS--------------IISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLESLITRLRIEEEARKHDKKE
        DTEEA SKKYA+SRYLRYQMTDD+S              II+EGMPLDDQFQV +IIDKL  LWKDFKNTLRHKTKEFSLE+L TRLRIEEEA+KHDKKE
Subjt:  DTEEAWSKKYAISRYLRYQMTDDKS--------------IISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLESLITRLRIEEEARKHDKKE

Query:  EVNAILRKKLTAVLKPDLKPKGNEMKRESNKQNNPQSKSMVQIVCYNCNKPGHLARNCRNRSRPPAQANLIEDELVAMISEVNVIGGFKGW-----RSSH
        EVNAI RKK TAVLK DLKPKGN+MK+  NKQNNPQS SM                           ANLIE+ELVAMISEVNVIGG +GW      S H
Subjt:  EVNAILRKKLTAVLKPDLKPKGNEMKRESNKQNNPQSKSMVQIVCYNCNKPGHLARNCRNRSRPPAQANLIEDELVAMISEVNVIGGFKGW-----RSSH

Query:  NQRGW-YWRSRNEI----------HIQQDACAE--------GSSAYSRNL-------NEFGLGYLLNKAGFTQTIGSDLFTLTKNNVFMEKSFATDGMFK
               +R  NE+          HI +    E        G +   + +             YLLNKAGFTQTIGS+LFTL+KNNVF+ K ++TDGM +
Subjt:  NQRGW-YWRSRNEI----------HIQQDACAE--------GSSAYSRNL-------NEFGLGYLLNKAGFTQTIGSDLFTLTKNNVFMEKSFATDGMFK

A0A5D3BDS3 Uncharacterized protein4.1e-12263Show/hide
Query:  LMSFDLNRPFRFEGAHFKRWKQKMLFFLTLKKVATACTTEKPEVLEKDPTEKQLKNLATWTETDFICKNLILNGLTDELYDYYSTMTTAKEVWDALQKKY
        LMS DLNRPFRFEGAHFKRWKQKMLFFLTLKKVATACT EKP+V EKDPT++QLKNL TWTETDFICKNLILNGLTDELYDYYSTMTTAK+VW+ALQKKY
Subjt:  LMSFDLNRPFRFEGAHFKRWKQKMLFFLTLKKVATACTTEKPEVLEKDPTEKQLKNLATWTETDFICKNLILNGLTDELYDYYSTMTTAKEVWDALQKKY

Query:  DTEEAWSKKYAISRYLRYQMTDDKS--------------IISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLESLITRLRIEEEARKHDKKE
        DTEEA SKKYA+SRYLRYQMTDD+S              II+EGMPLDDQFQV +IIDKLP LWKDFKNTLRHKTKEFSLE+LITRL+IEEEA+K DKK+
Subjt:  DTEEAWSKKYAISRYLRYQMTDDKS--------------IISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLESLITRLRIEEEARKHDKKE

Query:  EVNAILRKKLTAVLKPDLKPKGNEMKRESNKQNNPQSKSMVQIVCYNCNKPGHLARNCRNRSRPPAQANLIEDELVAMISEVNVIGGFKGWRSSHNQRGW
        EVNAI RKK TAVLK DLKPKGN+MK+  NKQNNPQS SM                           ANLIE+ELVAMISEVNVIGG +GW    N    
Subjt:  EVNAILRKKLTAVLKPDLKPKGNEMKRESNKQNNPQSKSMVQIVCYNCNKPGHLARNCRNRSRPPAQANLIEDELVAMISEVNVIGGFKGWRSSHNQRGW

Query:  ------YWRSRNEIHIQQDACAEGSSAYSRNLNEFGL-------------------------GYLLNKAGFTQTIGSDLFTLTKNNVFMEKSFATDGMFK
               +R  NE+  +     +        + E  L                          YLLNKAGFTQTIGS+LFTL+KNNVF+ K ++TDGM +
Subjt:  ------YWRSRNEIHIQQDACAEGSSAYSRNLNEFGL-------------------------GYLLNKAGFTQTIGSDLFTLTKNNVFMEKSFATDGMFK

A0A5D3DC59 Reverse transcriptase Ty1/copia-type domain-containing protein2.7e-12164.48Show/hide
Query:  MAGQIQSDLMSFDLNRPFRFEGAHFKRWKQKMLFFLTLKKVATACTTEKPEVLEKDPTEKQLKNLATWTETDFICKNLILNGLTDELYDYYSTMTTAKEV
        M GQIQSDLMS DLNRPFRFEGAHFKRWKQKMLFFLTLKKVATACTTEK +V EKDP E+QLKNLATWTETDFICKNLILNGLTDELYDYYSTMTT KEV
Subjt:  MAGQIQSDLMSFDLNRPFRFEGAHFKRWKQKMLFFLTLKKVATACTTEKPEVLEKDPTEKQLKNLATWTETDFICKNLILNGLTDELYDYYSTMTTAKEV

Query:  WDALQKKYDTEEAWSKKYAISRYLRYQMTDDKS--------------IISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLESLITRLRIEEE
        WDALQKKYDT+EA SKKYA+SRYLRYQMTDDKS              IISEGMPLDDQFQVAVIIDKLP LWKDFKNTLRHKTKEFSLESLITRL+IEEE
Subjt:  WDALQKKYDTEEAWSKKYAISRYLRYQMTDDKS--------------IISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLESLITRLRIEEE

Query:  ARKHDKKEEVNAILRKKLTAVLKPDLKPKGNEMKRESNKQNNPQSKSMVQIVCYNCNKPGHLARNCRNRSRPPAQANLIEDELVAMISEVNVIGGFKGWR
        ARKHDKKEEVNAI RKK TAVLK DL+    E K++  +    +  ++ +   ++  K                 A + E EL     +  V+       
Subjt:  ARKHDKKEEVNAILRKKLTAVLKPDLKPKGNEMKRESNKQNNPQSKSMVQIVCYNCNKPGHLARNCRNRSRPPAQANLIEDELVAMISEVNVIGGFKGWR

Query:  SSHNQRGWYWRSRNEIHIQQDACAEGSSAYSRNLNEFGLGYLLNKAGFTQTIGSDLFTLTKNNVFMEKSFATDGMFKLNLEINKIAFFAYMLTSFNV
                    +  +H         +    +NL      YLLNKAGFTQTIGSDLFTLTKNNVF+ K +ATDGMFKLN+EINKIA  AYMLTSFN+
Subjt:  SSHNQRGWYWRSRNEIHIQQDACAEGSSAYSRNLNEFGLGYLLNKAGFTQTIGSDLFTLTKNNVFMEKSFATDGMFKLNLEINKIAFFAYMLTSFNV

A0A5D3DRT2 Putative Polyprotein1.6e-12663.01Show/hide
Query:  MAGQIQSDLMSFDLNRPFRFEGAHFKRWKQKMLFFLTLKKVATACTTEKPEVLEKDPTEKQLKNLATWTETDFICKNLILNGLTDELYDYYSTMTTAKEV
        MAGQ QSDLMSF+LN PFRF+GAHFKRWKQKMLFFLTLKKVATACT EKP+V EKDPT++QL +LATWTETDFICKNLILNGLTDELYDYYSTM TAKEV
Subjt:  MAGQIQSDLMSFDLNRPFRFEGAHFKRWKQKMLFFLTLKKVATACTTEKPEVLEKDPTEKQLKNLATWTETDFICKNLILNGLTDELYDYYSTMTTAKEV

Query:  WDALQKKYDTEEAWSKKYAISRYLRYQMTDDKSIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLESLITRLRIEEEARKHDKKEEVNAIL
        WD LQ KYD EE  SKKY +SRYLRYQMTDDKS+                        +DFKNTLRHKTKE SL+SLITRLRIEEEARKH+KKEEVNAI 
Subjt:  WDALQKKYDTEEAWSKKYAISRYLRYQMTDDKSIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLESLITRLRIEEEARKHDKKEEVNAIL

Query:  RKKLTAVLKPDLKPKGNEMKRESNKQNNPQSKSMVQIVCYNCNKPGHLARNCRNRSRPPAQANLIEDELVAMISEVNVIGGFKGW----RSSHN--QRGW
        RKK TAVLKPDLKPKGN+MKRESNKQNNPQS+S                            ANLIEDELVAMISEVNVI GF+GW     +SH+      
Subjt:  RKKLTAVLKPDLKPKGNEMKRESNKQNNPQSKSMVQIVCYNCNKPGHLARNCRNRSRPPAQANLIEDELVAMISEVNVIGGFKGW----RSSHN--QRGW

Query:  YWRSRNEIHIQQDACAEGSSAYSRNLNEFGL-------------------------GYLLNKAGFTQTIGSDLFTLTKNNVFMEKSFATDGMFKLNLEIN
         +R  NE+  ++    +  +     + +  L                          YLLNKAGFTQTIGSDLFTLTKNNVF+ K +ATDGMFKLNL+IN
Subjt:  YWRSRNEIHIQQDACAEGSSAYSRNLNEFGL-------------------------GYLLNKAGFTQTIGSDLFTLTKNNVFMEKSFATDGMFKLNLEIN

Query:  KIAFFAYMLTSFNVWHARL
        KIA  AYMLTSFNVWHARL
Subjt:  KIAFFAYMLTSFNVWHARL

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.2e-0720Show/hide
Query:  RFEGAH-FKRWKQKMLFFLTLKKVATACTTEKPEVLEKDPTEKQLKNLATWTETDFICKNLILNGLTDELYDYYSTMTTAKEVWDALQKKYDTEEAWSKK
        +F G + F  W+++M   L  + +         +VL+ D  +        W + D    + I   L+D++ +      TA+ +W  L+  Y ++   +K 
Subjt:  RFEGAH-FKRWKQKMLFFLTLKKVATACTTEKPEVLEKDPTEKQLKNLATWTETDFICKNLILNGLTDELYDYYSTMTTAKEVWDALQKKYDTEEAWSKK

Query:  YAISRYLRYQMTDDKSIISE--------------GMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLESLITRLRIEEEARKHDKKEEVNAILRKK
        Y   +     M++  + +S               G+ ++++ +  ++++ LP  + +   T+ H      L+ + + L + E+ RK  + +    I    
Subjt:  YAISRYLRYQMTDDKSIISE--------------GMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLESLITRLRIEEEARKHDKKEEVNAILRKK

Query:  LTAVLKPDLKPKGNEMKRESNK--------QNNPQSKSMVQIVCYNCNKPGHLARNCRN--RSRPPAQANLIEDELVAMI
                 + +G   +R SN         ++  +SKS V+  CYNCN+PGH  R+C N  + +        +D   AM+
Subjt:  LTAVLKPDLKPKGNEMKRESNK--------QNNPQSKSMVQIVCYNCNKPGHLARNCRN--RSRPPAQANLIEDELVAMI

Arabidopsis top hitse value%identityAlignment
AT4G00980.1 zinc knuckle (CCHC-type) family protein2.8e-2226.46Show/hide
Query:  FDLNRPFRFEGAHFKRWKQKMLFFLTLKKVATACTTEKPEVLEKDPTEKQLKNLA-------TWTETDFICKNLILNGLTDELYDYYS-TMTTAKEVWDA
        F + +  RF+G  +  W  +M  FL   K+    +   P +      E   + +         W   D++C   ++N L+D LY  YS     AKE+WD 
Subjt:  FDLNRPFRFEGAHFKRWKQKMLFFLTLKKVATACTTEKPEVLEKDPTEKQLKNLA-------TWTETDFICKNLILNGLTDELYDYYS-TMTTAKEVWDA

Query:  LQKKYDTEEAWSKKYAISRYLRYQMTDDK--------------SIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLESLITRLRIEEEARK
        L+  Y  +E+ SK+  + +Y+ ++M +++              SI+S GM LD+ F V+ II K PP W+ F   L  + +   +  L+ R++ EEE  +
Subjt:  LQKKYDTEEAWSKKYAISRYLRYQMTDDK--------------SIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLESLITRLRIEEEARK

Query:  HDKKEEVNAILRKKLTAVLKPDLKPKGNEMKRESN----KQNNPQSKSMVQIVCYNCNKPGHLARNC---RNRSRPPAQANLIEDELVAMI
        +  K     +  +  T   + +  P      R S     K+  P+    V IVC NC + GHLA++C   ++  R   ++N I   + A +
Subjt:  HDKKEEVNAILRKKLTAVLKPDLKPKGNEMKRESN----KQNNPQSKSMVQIVCYNCNKPGHLARNC---RNRSRPPAQANLIEDELVAMI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGGACAAATCCAATCCGACTTGATGTCTTTTGATCTCAACCGTCCATTCCGTTTTGAAGGAGCACACTTCAAAAGGTGGAAACAAAAAATGTTATTTTTCCTCAC
GCTGAAGAAGGTGGCCACTGCTTGTACTACTGAAAAGCCGGAGGTTTTAGAAAAAGATCCTACAGAAAAACAACTGAAGAACCTCGCCACCTGGACAGAAACTGACTTCA
TTTGTAAGAACCTAATTCTTAATGGTCTTACTGATGAACTATATGATTATTACAGTACCATGACTACTGCAAAAGAAGTGTGGGACGCGCTACAAAAGAAGTACGATACT
GAAGAAGCATGGTCGAAGAAGTACGCTATCAGTCGATACCTGAGATATCAAATGACTGATGACAAATCCATTATCAGTGAAGGTATGCCACTCGATGATCAATTTCAAGT
TGCTGTTATTATTGATAAATTACCTCCACTGTGGAAGGATTTCAAGAACACTCTAAGGCACAAAACCAAGGAGTTCTCACTAGAAAGTCTAATCACGAGGCTAAGGATAG
AGGAGGAAGCAAGGAAGCATGATAAAAAAGAAGAGGTGAACGCTATCCTCAGAAAGAAGCTCACTGCAGTTCTGAAACCGGACCTGAAACCGAAAGGAAACGAGATGAAA
CGAGAATCTAACAAACAAAACAACCCACAGTCCAAAAGTATGGTACAAATTGTTTGTTATAATTGTAATAAGCCTGGTCATTTAGCTAGAAATTGTAGAAACAGGAGTCG
TCCTCCTGCGCAGGCAAACCTGATAGAAGATGAATTAGTAGCTATGATATCTGAAGTTAATGTGATTGGGGGGTTTAAAGGTTGGAGATCATCACACAACCAACGTGGCT
GGTATTGGAGAAGTAGAAATGAAATTCACATCCAGCAAGACGCTTGTGCTGAAGGAAGTTCTGCATACTCCAGAAATTTGAATGAATTTGGTCTCGGATATCTCCTTAAC
AAAGCTGGATTCACACAAACCATAGGATCAGACTTGTTTACTTTAACTAAAAACAATGTATTTATGGAGAAGAGTTTCGCTACTGATGGCATGTTCAAATTGAATCTAGA
AATTAATAAGATTGCATTTTTTGCTTACATGTTGACTTCTTTCAATGTTTGGCATGCTAGACTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGGACAAATCCAATCCGACTTGATGTCTTTTGATCTCAACCGTCCATTCCGTTTTGAAGGAGCACACTTCAAAAGGTGGAAACAAAAAATGTTATTTTTCCTCAC
GCTGAAGAAGGTGGCCACTGCTTGTACTACTGAAAAGCCGGAGGTTTTAGAAAAAGATCCTACAGAAAAACAACTGAAGAACCTCGCCACCTGGACAGAAACTGACTTCA
TTTGTAAGAACCTAATTCTTAATGGTCTTACTGATGAACTATATGATTATTACAGTACCATGACTACTGCAAAAGAAGTGTGGGACGCGCTACAAAAGAAGTACGATACT
GAAGAAGCATGGTCGAAGAAGTACGCTATCAGTCGATACCTGAGATATCAAATGACTGATGACAAATCCATTATCAGTGAAGGTATGCCACTCGATGATCAATTTCAAGT
TGCTGTTATTATTGATAAATTACCTCCACTGTGGAAGGATTTCAAGAACACTCTAAGGCACAAAACCAAGGAGTTCTCACTAGAAAGTCTAATCACGAGGCTAAGGATAG
AGGAGGAAGCAAGGAAGCATGATAAAAAAGAAGAGGTGAACGCTATCCTCAGAAAGAAGCTCACTGCAGTTCTGAAACCGGACCTGAAACCGAAAGGAAACGAGATGAAA
CGAGAATCTAACAAACAAAACAACCCACAGTCCAAAAGTATGGTACAAATTGTTTGTTATAATTGTAATAAGCCTGGTCATTTAGCTAGAAATTGTAGAAACAGGAGTCG
TCCTCCTGCGCAGGCAAACCTGATAGAAGATGAATTAGTAGCTATGATATCTGAAGTTAATGTGATTGGGGGGTTTAAAGGTTGGAGATCATCACACAACCAACGTGGCT
GGTATTGGAGAAGTAGAAATGAAATTCACATCCAGCAAGACGCTTGTGCTGAAGGAAGTTCTGCATACTCCAGAAATTTGAATGAATTTGGTCTCGGATATCTCCTTAAC
AAAGCTGGATTCACACAAACCATAGGATCAGACTTGTTTACTTTAACTAAAAACAATGTATTTATGGAGAAGAGTTTCGCTACTGATGGCATGTTCAAATTGAATCTAGA
AATTAATAAGATTGCATTTTTTGCTTACATGTTGACTTCTTTCAATGTTTGGCATGCTAGACTTTAA
Protein sequenceShow/hide protein sequence
MAGQIQSDLMSFDLNRPFRFEGAHFKRWKQKMLFFLTLKKVATACTTEKPEVLEKDPTEKQLKNLATWTETDFICKNLILNGLTDELYDYYSTMTTAKEVWDALQKKYDT
EEAWSKKYAISRYLRYQMTDDKSIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLESLITRLRIEEEARKHDKKEEVNAILRKKLTAVLKPDLKPKGNEMK
RESNKQNNPQSKSMVQIVCYNCNKPGHLARNCRNRSRPPAQANLIEDELVAMISEVNVIGGFKGWRSSHNQRGWYWRSRNEIHIQQDACAEGSSAYSRNLNEFGLGYLLN
KAGFTQTIGSDLFTLTKNNVFMEKSFATDGMFKLNLEINKIAFFAYMLTSFNVWHARL