CuGenDBv2

Cmc03g0066191 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc03g0066191
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: CMiso1.1chr03:8041669..8043195
RNA-Seq Expression: Cmc03g0066191
Synteny: Cmc03g0066191
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR025724 - GAG-pre-integrase domain
  IPR036397 - Ribonuclease H superfamily
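
The genome location string above can be split into a sequence name and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention that this record does not state); `GenomicInterval` and `parse_location` are hypothetical names introduced only for illustration.

```python
import re
from typing import NamedTuple


class GenomicInterval(NamedTuple):
    """A region on a named sequence (assumed 1-based, inclusive coordinates)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Both endpoints are counted under the inclusive-coordinate assumption.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomicInterval:
    """Parse a 'seqid:start..end' string such as the one in this record."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return GenomicInterval(seqid, int(start), int(end))


interval = parse_location("CMiso1.1chr03:8041669..8043195")
print(interval.seqid, interval.length)  # CMiso1.1chr03 1527
```

Under that assumption the locus spans 1527 bases on CMiso1.1chr03.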


Homology
GenBank top hits (e value, % identity, alignment)
KAA0033740.1 hypothetical protein E6C27_scaffold239G002160 [Cucumis melo var. makuwa] (e value: 5.1e-127; % identity: 77.99)
Query:  MTDDKSMEAQSHEIQKIAHEIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLKSLITRLRIEKEARKHDKKEKVNVIPRKKPTAVLKPDLK
        MTDDKS+EAQSHEIQKIA  +I                        +FK         FSL+SLITRLRIE+EARK++KKE+VN IP+KKPTAVLKPDLK
Subjt:  MTDDKSMEAQSHEIQKIAHEIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLKSLITRLRIEKEARKHDKKEKVNVIPRKKPTAVLKPDLK

Query:  PKGNKMKREFNKQNNPQSRSTVQIVYYNCNKPGYLARNCRNKSRPAAQANLIEDELVAMISEVNVIGGFEGWWLDTGASHRVCHDLSLFRKCNEVKDKNI
         KGNKMKRE NKQ NP S+ TVQIV YNCNK G+LARNCRN+SRPAAQAN IEDELVAMISEVNVIGG EGWWLDTGAS  VCHDLSLFRK NEVKDK+I
Subjt:  PKGNKMKREFNKQNNPQSRSTVQIVYYNCNKPGYLARNCRNKSRPAAQANLIEDELVAMISEVNVIGGFEGWWLDTGASHRVCHDLSLFRKCNEVKDKNI

Query:  LLGDHHTTKVAGIGEVELKFTSDKTLVLKEVLHTREIRKNLVSGYLLNKAGFTQTIGSDLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLASFN
        LLGDHHTTKVA IGEVELKFTS K LVLKEVLHT EIRKNLV GYLLNK GFTQTIGSDLFTLTKNNVFVGKGYATD MFKLNLEINKIASSAYML SFN
Subjt:  LLGDHHTTKVAGIGEVELKFTSDKTLVLKEVLHTREIRKNLVSGYLLNKAGFTQTIGSDLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLASFN

Query:  VWHARLCHVNKKLISNMS
        VWHARLCHVNK+LISNMS
Subjt:  VWHARLCHVNKKLISNMS
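
In alignments of this style the unlabeled middle line conventionally marks identical residues with the residue letter, conservative substitutions with `+`, and mismatches or gaps with a space. The sketch below tallies those marks for a single alignment segment, assuming that convention holds here; `midline_stats` is a hypothetical helper, and the values it returns describe one segment only, not the whole-alignment percentage reported next to each hit.

```python
from typing import Tuple


def midline_stats(query: str, midline: str) -> Tuple[int, int, int]:
    """Return (identities, positives, segment_length) for one alignment segment.

    Assumes a BLAST-style midline: a letter marks an identical residue,
    '+' a conservative substitution, and a space a mismatch or gap.
    """
    # Trailing spaces are often lost in plain text, so pad to the query length.
    midline = midline.ljust(len(query))
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Last segment of the KAA0033740.1 alignment shown above.
identities, positives, length = midline_stats(
    "VWHARLCHVNKKLISNMS",
    "VWHARLCHVNK+LISNMS",
)
print(f"{identities}/{length} identical, {positives}/{length} positive")
```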

KAA0034938.1 putative Polyprotein [Cucumis melo var. makuwa] (e value: 1.3e-250; % identity: 87.02)
Query:  MTDDKSMEAQSHEIQKIAHEIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLKSLITRLRIEKEARKHDKKEKVNVIPRKKPTAVLKPDLK
        MTDDKS+EAQSHEIQKIAHEII+EGMPL DQFQVAVIIDKLP LWKDFKNTLRHKTKEFSL+S+ITRL+IE+E RKHDKKE+VN IPRKKPTA+LKP+LK
Subjt:  MTDDKSMEAQSHEIQKIAHEIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLKSLITRLRIEKEARKHDKKEKVNVIPRKKPTAVLKPDLK

Query:  PKGNKMKREFNKQNNPQSRSTVQIVYYNCNKPGYLARNCRNKSRPAAQANLIEDELVAMISEVNVIGGFEGWWLDTGASHRVCHDLSLFRKCNEVKDKNI
         KGNKMKR  NKQNN QSRSTVQI  YNCNKPG+LA+NCRN+SRPAAQANLIEDELVAMIS+VNVIGG EGWWLDTGASH VCH+LSLFRK NEVKDKNI
Subjt:  PKGNKMKREFNKQNNPQSRSTVQIVYYNCNKPGYLARNCRNKSRPAAQANLIEDELVAMISEVNVIGGFEGWWLDTGASHRVCHDLSLFRKCNEVKDKNI

Query:  LLGDHHTTKVAGIGEVELKFTSDKTLVLKEVLHTREIRKNLVSGYLLNKAGFTQTIGSDLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLASFN
        LLGDHHTTKV GIGEVELKFTSDKTLV+KE LHT EIRKNLV GYLLNKAGFTQTIGS+LFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYML SFN
Subjt:  LLGDHHTTKVAGIGEVELKFTSDKTLVLKEVLHTREIRKNLVSGYLLNKAGFTQTIGSDLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLASFN

Query:  VWHARLCHVNKKLISNMSRLNLIPKLSLHDFEKCACCNQAKITKTSHKSVTRVTEPLELIHSDLCEFDGTLTRN-----------CSDYTFIFKQINKSD
        VWHARLCHVNK+LISNMSRLNLIPKLSLHDFEKCACC+QAKITKTSHK VTRVT+PLELIHSDLCEFDGTLTRN           CSDYTFI+   NKSD
Subjt:  VWHARLCHVNKKLISNMSRLNLIPKLSLHDFEKCACCNQAKITKTSHKSVTRVTEPLELIHSDLCEFDGTLTRN-----------CSDYTFIFKQINKSD

Query:  AYEIFKVFVTEIENQFNKRIKRLRSDREIEYDSVAFNEVYNSKGIIHETTTPYSPEMNEKAETKNRTLTELVVAILLESGAAPSWWGEIIKTVNYVLNRI
        AYE+FKVFVTEIENQFNKRIKRLRSDR  EYDSVAFNE YNSKGIIHETTTPYSPEMN K E KNRTLTEL VAILLES AAPSWWGEIIKTVNYVLNRI
Subjt:  AYEIFKVFVTEIENQFNKRIKRLRSDREIEYDSVAFNEVYNSKGIIHETTTPYSPEMNEKAETKNRTLTELVVAILLESGAAPSWWGEIIKTVNYVLNRI

Query:  PKSNSKTSPSKSLNIK
        PKSNSKTSP + L  K
Subjt:  PKSNSKTSPSKSLNIK

KAA0055815.1 putative Polyprotein [Cucumis melo var. makuwa] (e value: 6.8e-172; % identity: 89.97)
Query:  MPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLKSLITRLRIEKEARKHDKKEKVNVIPRKKPTAVLKPDLKPKGNKMKREFNKQNNPQSRSTVQIV
        MPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSL+SLITRLRIE+EARKHDKKE+ NVIPRKK TAVLK DLK KGNKMKR FNKQNNPQSRSTVQIV
Subjt:  MPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLKSLITRLRIEKEARKHDKKEKVNVIPRKKPTAVLKPDLKPKGNKMKREFNKQNNPQSRSTVQIV

Query:  YYNCNKPGYLARNCRNKSRPAAQANLIEDELVAMISEVNVIGGFEGWWLDTGASHRVCHDLSLFRKCNEVKDKNILLGDHHTTKVAGIGEVELKFTSDKT
         YNCNKPG+LARNCRN+SRPAAQANLIEDELVAMI EVNVIG  EGWWLDTGAS  V HDLSLFRK NEVKDKNILLGDHH TKV GIGEVELKFTS KT
Subjt:  YYNCNKPGYLARNCRNKSRPAAQANLIEDELVAMISEVNVIGGFEGWWLDTGASHRVCHDLSLFRKCNEVKDKNILLGDHHTTKVAGIGEVELKFTSDKT

Query:  LVLKEVLHTREIRKNLVSGYLLNKAGFTQTIGSDLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLASFNVWHARLCHVNKKLISNMSRLNLIPK
        LVLKEVLHT E RKNLVSGYLLNK G TQTIG DLFTLTKNNVFVGKGYATD MFKLNL+INKIASSAYML  FN WHARLCHVNK+LISNMSRLNLIPK
Subjt:  LVLKEVLHTREIRKNLVSGYLLNKAGFTQTIGSDLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLASFNVWHARLCHVNKKLISNMSRLNLIPK

Query:  LSLHDFEKCACCNQAKITKTSHKSVTRVTEPLELIHSDLCEFDGTLTRN
        LSLHDFEKCA C+QAKITKT HKSVTRVTEPLELIHSDLCEFDGTLTRN
Subjt:  LSLHDFEKCACCNQAKITKTSHKSVTRVTEPLELIHSDLCEFDGTLTRN

KAA0067915.1 putative Polyprotein [Cucumis melo var. makuwa] (e value: 4.0e-140; % identity: 87.29)
Query:  MTDDKSMEAQSHEIQKIAHEIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLKSLITRLRIEKEARKHDKKEKVNVIPRKKPTAVLKPDLK
        MTD+K +EAQSHEIQKIAHEIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEF L+SLITRL IE+E RKHDKKE+VN IP+KKPT VLK DLK
Subjt:  MTDDKSMEAQSHEIQKIAHEIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLKSLITRLRIEKEARKHDKKEKVNVIPRKKPTAVLKPDLK

Query:  PKGNKMKREFNKQNNPQSRSTVQIVYYNCNKPGYLARNCRNKSRPAAQANLIEDELVAMISEVNVIGGFEGWWLDTGASHRVCHDLSLFRKCNEVKDKNI
        PKGNKMKR  NKQNNPQS+STVQIV YNCNK G+LARNCRN+S P AQANLIE+ELVAMI EVNVIGG EGWWLDTGA   VCHDLSLFRK NEVKDKNI
Subjt:  PKGNKMKREFNKQNNPQSRSTVQIVYYNCNKPGYLARNCRNKSRPAAQANLIEDELVAMISEVNVIGGFEGWWLDTGASHRVCHDLSLFRKCNEVKDKNI

Query:  LLGDHHTTKVAGIGEVELKFTSDKTLVLKEVLHTREIRKNLVSGYLLNKAGFTQTIGSDLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLASF
        LLGDHHTTKV  IGEVELKFT DK LVLKEVLHT +IRKNLVS YLLNKAGFTQTIGSDLFTLTKNNVFV KGYATDGMFKLNLEINKIASSAYML SF
Subjt:  LLGDHHTTKVAGIGEVELKFTSDKTLVLKEVLHTREIRKNLVSGYLLNKAGFTQTIGSDLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLASF

XP_021732277.1 uncharacterized protein LOC110699091 [Chenopodium quinoa] (e value: 2.3e-143; % identity: 56.44)
Query:  MTDDKSMEAQSHEIQKIAHEIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLKSLITRLRIEKEARKHDKKEKV-----NVIPRKKPTAVL
        MTDDKS+E QSH++QKIAHEIISEGM LD+QFQ+AVIIDKLPP WKDFKN LRHKTKEFSL+SLITRLRIE+E+RK D KE++     N  PR+   AVL
Subjt:  MTDDKSMEAQSHEIQKIAHEIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLKSLITRLRIEKEARKHDKKEKV-----NVIPRKKPTAVL

Query:  KP---DLKP--------KGNKMKREFNKQNNPQSRSTVQIVYYNCNKPGYLARNCRNKSRPA-AQANLIEDELVAMISEVNVIGGFEGWWLDTGASHRVC
        KP   + KP        K N  K    +Q+ P    T Q + Y C KPG++AR CRN   P  AQA++IE+  VAMI+E+N+ GG +GWW+DTGA+  VC
Subjt:  KP---DLKP--------KGNKMKREFNKQNNPQSRSTVQIVYYNCNKPGYLARNCRNKSRPA-AQANLIEDELVAMISEVNVIGGFEGWWLDTGASHRVC

Query:  HDLSLFRKCNE-VKDKNILLGDHHTTKVAGIGEVELKFTSDKTLVLKEVLHTREIRKNLVSGYLLNKAGFTQTIGSDLFTLTKNNVFVGKGYATDGMFKL
        +D  +F+   E   DK +LLGD H+T +AG+G VELKFTS +TL+LK+VLHT E+RKNLVSG+LLNKAGF QTIGSDLFTLTKN +FVGKGYATDGMFKL
Subjt:  HDLSLFRKCNE-VKDKNILLGDHHTTKVAGIGEVELKFTSDKTLVLKEVLHTREIRKNLVSGYLLNKAGFTQTIGSDLFTLTKNNVFVGKGYATDGMFKL

Query:  NLEINKIASSAYMLAS-FNVWHARLCHVNKKLISNMSRLNLIPKLSLHDFEKCACCNQAKITKTSHKSVTRVTEPLELIHSDLCEFDGTLTRN-------
        N+E+NKI++SAYML S  NVWH RLCHVNK+LI NMS L LIP +SL+DF+KC  C+QAKITKT HKSV R +EPL+LIHSD+CE +GTLTRN       
Subjt:  NLEINKIASSAYMLAS-FNVWHARLCHVNKKLISNMSRLNLIPKLSLHDFEKCACCNQAKITKTSHKSVTRVTEPLELIHSDLCEFDGTLTRN-------

Query:  ----CSDYTFIFKQINKSDAYEIFKVFVTEIENQFNKRIKRLRSDREIEYDSVAFNEVYNSKGIIHETTTPYSPEMNEKAETKNRTLTELVVAILLESGA
            CSDYT I+   NKSDA+E+                                                        AE KNRT TELVVAI L SGA
Subjt:  ----CSDYTFIFKQINKSDAYEIFKVFVTEIENQFNKRIKRLRSDREIEYDSVAFNEVYNSKGIIHETTTPYSPEMNEKAETKNRTLTELVVAILLESGA

Query:  APSWW
        A  W+
Subjt:  APSWW

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SS30 CCHC-type domain-containing protein (e value: 2.5e-127; % identity: 77.99)
Query:  MTDDKSMEAQSHEIQKIAHEIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLKSLITRLRIEKEARKHDKKEKVNVIPRKKPTAVLKPDLK
        MTDDKS+EAQSHEIQKIA  +I                        +FK         FSL+SLITRLRIE+EARK++KKE+VN IP+KKPTAVLKPDLK
Subjt:  MTDDKSMEAQSHEIQKIAHEIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLKSLITRLRIEKEARKHDKKEKVNVIPRKKPTAVLKPDLK

Query:  PKGNKMKREFNKQNNPQSRSTVQIVYYNCNKPGYLARNCRNKSRPAAQANLIEDELVAMISEVNVIGGFEGWWLDTGASHRVCHDLSLFRKCNEVKDKNI
         KGNKMKRE NKQ NP S+ TVQIV YNCNK G+LARNCRN+SRPAAQAN IEDELVAMISEVNVIGG EGWWLDTGAS  VCHDLSLFRK NEVKDK+I
Subjt:  PKGNKMKREFNKQNNPQSRSTVQIVYYNCNKPGYLARNCRNKSRPAAQANLIEDELVAMISEVNVIGGFEGWWLDTGASHRVCHDLSLFRKCNEVKDKNI

Query:  LLGDHHTTKVAGIGEVELKFTSDKTLVLKEVLHTREIRKNLVSGYLLNKAGFTQTIGSDLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLASFN
        LLGDHHTTKVA IGEVELKFTS K LVLKEVLHT EIRKNLV GYLLNK GFTQTIGSDLFTLTKNNVFVGKGYATD MFKLNLEINKIASSAYML SFN
Subjt:  LLGDHHTTKVAGIGEVELKFTSDKTLVLKEVLHTREIRKNLVSGYLLNKAGFTQTIGSDLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLASFN

Query:  VWHARLCHVNKKLISNMS
        VWHARLCHVNK+LISNMS
Subjt:  VWHARLCHVNKKLISNMS

A0A5A7UQC7 Putative Polyprotein (e value: 3.3e-172; % identity: 89.97)
Query:  MPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLKSLITRLRIEKEARKHDKKEKVNVIPRKKPTAVLKPDLKPKGNKMKREFNKQNNPQSRSTVQIV
        MPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSL+SLITRLRIE+EARKHDKKE+ NVIPRKK TAVLK DLK KGNKMKR FNKQNNPQSRSTVQIV
Subjt:  MPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLKSLITRLRIEKEARKHDKKEKVNVIPRKKPTAVLKPDLKPKGNKMKREFNKQNNPQSRSTVQIV

Query:  YYNCNKPGYLARNCRNKSRPAAQANLIEDELVAMISEVNVIGGFEGWWLDTGASHRVCHDLSLFRKCNEVKDKNILLGDHHTTKVAGIGEVELKFTSDKT
         YNCNKPG+LARNCRN+SRPAAQANLIEDELVAMI EVNVIG  EGWWLDTGAS  V HDLSLFRK NEVKDKNILLGDHH TKV GIGEVELKFTS KT
Subjt:  YYNCNKPGYLARNCRNKSRPAAQANLIEDELVAMISEVNVIGGFEGWWLDTGASHRVCHDLSLFRKCNEVKDKNILLGDHHTTKVAGIGEVELKFTSDKT

Query:  LVLKEVLHTREIRKNLVSGYLLNKAGFTQTIGSDLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLASFNVWHARLCHVNKKLISNMSRLNLIPK
        LVLKEVLHT E RKNLVSGYLLNK G TQTIG DLFTLTKNNVFVGKGYATD MFKLNL+INKIASSAYML  FN WHARLCHVNK+LISNMSRLNLIPK
Subjt:  LVLKEVLHTREIRKNLVSGYLLNKAGFTQTIGSDLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLASFNVWHARLCHVNKKLISNMSRLNLIPK

Query:  LSLHDFEKCACCNQAKITKTSHKSVTRVTEPLELIHSDLCEFDGTLTRN
        LSLHDFEKCA C+QAKITKT HKSVTRVTEPLELIHSDLCEFDGTLTRN
Subjt:  LSLHDFEKCACCNQAKITKTSHKSVTRVTEPLELIHSDLCEFDGTLTRN

A0A5A7VQD4 Putative Polyprotein (e value: 2.0e-140; % identity: 87.29)
Query:  MTDDKSMEAQSHEIQKIAHEIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLKSLITRLRIEKEARKHDKKEKVNVIPRKKPTAVLKPDLK
        MTD+K +EAQSHEIQKIAHEIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEF L+SLITRL IE+E RKHDKKE+VN IP+KKPT VLK DLK
Subjt:  MTDDKSMEAQSHEIQKIAHEIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLKSLITRLRIEKEARKHDKKEKVNVIPRKKPTAVLKPDLK

Query:  PKGNKMKREFNKQNNPQSRSTVQIVYYNCNKPGYLARNCRNKSRPAAQANLIEDELVAMISEVNVIGGFEGWWLDTGASHRVCHDLSLFRKCNEVKDKNI
        PKGNKMKR  NKQNNPQS+STVQIV YNCNK G+LARNCRN+S P AQANLIE+ELVAMI EVNVIGG EGWWLDTGA   VCHDLSLFRK NEVKDKNI
Subjt:  PKGNKMKREFNKQNNPQSRSTVQIVYYNCNKPGYLARNCRNKSRPAAQANLIEDELVAMISEVNVIGGFEGWWLDTGASHRVCHDLSLFRKCNEVKDKNI

Query:  LLGDHHTTKVAGIGEVELKFTSDKTLVLKEVLHTREIRKNLVSGYLLNKAGFTQTIGSDLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLASF
        LLGDHHTTKV  IGEVELKFT DK LVLKEVLHT +IRKNLVS YLLNKAGFTQTIGSDLFTLTKNNVFV KGYATDGMFKLNLEINKIASSAYML SF
Subjt:  LLGDHHTTKVAGIGEVELKFTSDKTLVLKEVLHTREIRKNLVSGYLLNKAGFTQTIGSDLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLASF

A0A5D3DCJ1 Putative Polyprotein (e value: 6.1e-251; % identity: 87.02)
Query:  MTDDKSMEAQSHEIQKIAHEIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLKSLITRLRIEKEARKHDKKEKVNVIPRKKPTAVLKPDLK
        MTDDKS+EAQSHEIQKIAHEII+EGMPL DQFQVAVIIDKLP LWKDFKNTLRHKTKEFSL+S+ITRL+IE+E RKHDKKE+VN IPRKKPTA+LKP+LK
Subjt:  MTDDKSMEAQSHEIQKIAHEIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLKSLITRLRIEKEARKHDKKEKVNVIPRKKPTAVLKPDLK

Query:  PKGNKMKREFNKQNNPQSRSTVQIVYYNCNKPGYLARNCRNKSRPAAQANLIEDELVAMISEVNVIGGFEGWWLDTGASHRVCHDLSLFRKCNEVKDKNI
         KGNKMKR  NKQNN QSRSTVQI  YNCNKPG+LA+NCRN+SRPAAQANLIEDELVAMIS+VNVIGG EGWWLDTGASH VCH+LSLFRK NEVKDKNI
Subjt:  PKGNKMKREFNKQNNPQSRSTVQIVYYNCNKPGYLARNCRNKSRPAAQANLIEDELVAMISEVNVIGGFEGWWLDTGASHRVCHDLSLFRKCNEVKDKNI

Query:  LLGDHHTTKVAGIGEVELKFTSDKTLVLKEVLHTREIRKNLVSGYLLNKAGFTQTIGSDLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLASFN
        LLGDHHTTKV GIGEVELKFTSDKTLV+KE LHT EIRKNLV GYLLNKAGFTQTIGS+LFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYML SFN
Subjt:  LLGDHHTTKVAGIGEVELKFTSDKTLVLKEVLHTREIRKNLVSGYLLNKAGFTQTIGSDLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLASFN

Query:  VWHARLCHVNKKLISNMSRLNLIPKLSLHDFEKCACCNQAKITKTSHKSVTRVTEPLELIHSDLCEFDGTLTRN-----------CSDYTFIFKQINKSD
        VWHARLCHVNK+LISNMSRLNLIPKLSLHDFEKCACC+QAKITKTSHK VTRVT+PLELIHSDLCEFDGTLTRN           CSDYTFI+   NKSD
Subjt:  VWHARLCHVNKKLISNMSRLNLIPKLSLHDFEKCACCNQAKITKTSHKSVTRVTEPLELIHSDLCEFDGTLTRN-----------CSDYTFIFKQINKSD

Query:  AYEIFKVFVTEIENQFNKRIKRLRSDREIEYDSVAFNEVYNSKGIIHETTTPYSPEMNEKAETKNRTLTELVVAILLESGAAPSWWGEIIKTVNYVLNRI
        AYE+FKVFVTEIENQFNKRIKRLRSDR  EYDSVAFNE YNSKGIIHETTTPYSPEMN K E KNRTLTEL VAILLES AAPSWWGEIIKTVNYVLNRI
Subjt:  AYEIFKVFVTEIENQFNKRIKRLRSDREIEYDSVAFNEVYNSKGIIHETTTPYSPEMNEKAETKNRTLTELVVAILLESGAAPSWWGEIIKTVNYVLNRI

Query:  PKSNSKTSPSKSLNIK
        PKSNSKTSP + L  K
Subjt:  PKSNSKTSPSKSLNIK

A0A7N2L531 Uncharacterized protein (e value: 3.5e-129; % identity: 49.81)
Query:  MTDDKSMEAQSHEIQKIAHEIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLKSLITRLRIEKEARKHD----------KKEKVNVI----
        M D KS+  Q+ + Q I  E+ SEG+ + D   VA IIDKLP  W++F+ TLRHK KE SL++LITR+R+E+EAR  D             KVN+I    
Subjt:  MTDDKSMEAQSHEIQKIAHEIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLKSLITRLRIEKEARKHD----------KKEKVNVI----

Query:  -------PR----KKPTAVLKPDLKPKG-NKMKREFNKQNNPQSRSTVQIVYYNCNKPGYLARNCRNKSRPAA-QANLIEDELVAMISEVNVIGGFEGWW
               PR    K      K + +P+G     + +NK   P S+       + C K G++AR C+ + R +  QAN+ E+ LVAMI+++N++   EGWW
Subjt:  -------PR----KKPTAVLKPDLKPKG-NKMKREFNKQNNPQSRSTVQIVYYNCNKPGYLARNCRNKSRPAA-QANLIEDELVAMISEVNVIGGFEGWW

Query:  LDTGASHRVCHDLSLFRKCNEV-KDKNILLGDHHTTKVAGIGEVELKFTSDKTLVLKEVLHTREIRKNLVSGYLLNKAGFTQTIGSDLFTLTKNNVFVGK
         D+GA+  VC+D + F+      ++K ++LGD   TKV G GEVELKFTS + L LK+VL+T  +RKNL+S +LLNKAGF QT+ SD + +TK  +FVGK
Subjt:  LDTGASHRVCHDLSLFRKCNEV-KDKNILLGDHHTTKVAGIGEVELKFTSDKTLVLKEVLHTREIRKNLVSGYLLNKAGFTQTIGSDLFTLTKNNVFVGK

Query:  GYATDGMFKLNLEINKIA-SSAYMLASFNVWHARLCHVNKKLISNMSRLNLIPKLSLHDFEKCACCNQAKITKTSHKSVTRVTEPLELIHSDLCEFDGTL
        GYA DGMFKLN+E NK + SS YML+S N WHARLCH+N + +  MS L LIP+LS  DFEKC  C+QAKITK  HK+V R TE LELIHSDLCEF+G L
Subjt:  GYATDGMFKLNLEINKIA-SSAYMLASFNVWHARLCHVNKKLISNMSRLNLIPKLSLHDFEKCACCNQAKITKTSHKSVTRVTEPLELIHSDLCEFDGTL

Query:  TR-----------NCSDYTFIFKQINKSDAYEIFKVFVTEIENQFNKRIKRLRSDREIEYDSVAFNEVYNSKGIIHETTTPYSPEMNEKAETKNRTLTEL
        TR           + S YT I+   NKSDA+E F+ F+ E+ENQF ++IKR+RSDR  EY+S AFN    S GIIHETT PYSP  N  AE KNRTL EL
Subjt:  TR-----------NCSDYTFIFKQINKSDAYEIFKVFVTEIENQFNKRIKRLRSDREIEYDSVAFNEVYNSKGIIHETTTPYSPEMNEKAETKNRTLTEL

Query:  VVAILLESGAAPSWWGEIIKTVNYVLNRIPKSNSKTSP
          A+L+ESGA   +WGE I T  +VLNR+P   S T+P
Subjt:  VVAILLESGAAPSWWGEIIKTVNYVLNRIPKSNSKTSP

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 3.0e-29; % identity: 23.67)
Query:  MTDDKSMEAQSHEIQKIAHEIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLKSLITRLRIEKEAR-KHD----KKEKVNVIPRKKPTAVL
        ++ + S+ +  H   ++  E+++ G  +++  +++ ++  LP  +      +   ++E    + +    +++E + K+D     K+ +N I         
Subjt:  MTDDKSMEAQSHEIQKIAHEIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLKSLITRLRIEKEAR-KHD----KKEKVNVIPRKKPTAVL

Query:  KPDLKPKGNKMKREFNKQNNPQSRSTVQIVYYNCNKPGYLARNC---------RNKSRPAAQANLIEDELVAMISEVNVIGGFE--GWWLDTGASHRVCH
            K +  K K+ F      +  S  ++  ++C + G++ ++C         +NK             +  M+ EVN     +  G+ LD+GAS  + +
Subjt:  KPDLKPKGNKMKREFNKQNNPQSRSTVQIVYYNCNKPGYLARNC---------RNKSRPAAQANLIEDELVAMISEVNVIGGFE--GWWLDTGASHRVCH

Query:  DLSLFRKCNEVKDKNILLGDHHTTKVAGIGEVELKFTSDKTLVLKEVLHTREIRKNLVSGYLLNKAGFTQTIGSDLFTLTKNNVFVGKGYATDGMFKLNL
        D SL+    EV     +        +       ++  +D  + L++VL  +E   NL+S   L +AG +        T++KN + V K     GM     
Subjt:  DLSLFRKCNEVKDKNILLGDHHTTKVAGIGEVELKFTSDKTLVLKEVLHTREIRKNLVSGYLLNKAGFTQTIGSDLFTLTKNNVFVGKGYATDGMFKLNL

Query:  EINKIASS--AYMLASFNVWHARLCHVN---------KKLISNMSRLNLIPKLSLHDFEKCACCNQAKITKTSHKSVTRVTEPLELIHSDLCEFDGTLTR
         IN  A S  A    +F +WH R  H++         K + S+ S LN + +LS    E C    QA++     K  T +  PL ++HSD+C     +T 
Subjt:  EINKIASS--AYMLASFNVWHARLCHVN---------KKLISNMSRLNLIPKLSLHDFEKCACCNQAKITKTSHKSVTRVTEPLELIHSDLCEFDGTLTR

Query:  NCSDYTFIFKQ-----------INKSDAYEIFKVFVTEIENQFNKRIKRLRSDREIEYDSVAFNEVYNSKGIIHETTTPYSPEMNEKAETKNRTLTELVV
        +  +Y  IF               KSD + +F+ FV + E  FN ++  L  D   EY S    +    KGI +  T P++P++N  +E   RT+TE   
Subjt:  NCSDYTFIFKQ-----------INKSDAYEIFKVFVTEIENQFNKRIKRLRSDREIEYDSVAFNEVYNSKGIIHETTTPYSPEMNEKAETKNRTLTELVV

Query:  AILLESGAAPSWWGEIIKTVNYVLNRIP
         ++  +    S+WGE + T  Y++NRIP
Subjt:  AILLESGAAPSWWGEIIKTVNYVLNRIP

P0C2J7 Transposon Ty4-H Gag-Pol polyprotein (e value: 7.5e-04; % identity: 19.68)
Query:  CACCNQAKITKTSHKSVTR----------------VTEPLELIHSDLCEFDGTLTRNCSDYTFIFKQINKSDAYEIFKV--FVTEIENQFNKRIKRLRSD
        C  C  +K TK +H + +                 +  P+   ++D   +   +  N + Y       NK+    + ++   +  +E QF+++++ + SD
Subjt:  CACCNQAKITKTSHKSVTR----------------VTEPLELIHSDLCEFDGTLTRNCSDYTFIFKQINKSDAYEIFKV--FVTEIENQFNKRIKRLRSD

Query:  REIEYDSVAFNEVYNSKGIIHETTTPYSPEMNEKAETKNRTLTELVVAILLESGAAPSWWGEIIKTVNYVLNRIPKSNSKTSPSKSLN
        R  E+ +    E + SKGI H  T+      N +AE   RT+      +L +S     +W   + +   + N +   ++   P K+++
Subjt:  REIEYDSVAFNEVYNSKGIIHETTTPYSPEMNEKAETKNRTLTELVVAILLESGAAPSWWGEIIKTVNYVLNRIPKSNSKTSPSKSLN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 5.1e-45; % identity: 26.6)
Query:  EIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLKSLITRLRIEKEARKHDKKEKVNVIPRKKPTAVLKPDLKPKGNKMKREFNKQNNPQSR
        ++ + G+ ++++ +  ++++ LP  + +   T+ H      LK + + L + ++ RK  + +   +I   +  +      +   N  +     ++  +S+
Subjt:  EIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLKSLITRLRIEKEARKHDKKEKVNVIPRKKPTAVLKPDLKPKGNKMKREFNKQNNPQSR

Query:  STVQIVYYNCNKPGYLARNCRN----KSRPAAQAN------LIE--DELVAMISE----VNVIGGFEGWWLDTGASHRVCHDLSLFRKCNEVKDKNILLG
        S V+   YNCN+PG+  R+C N    K   + Q N      +++  D +V  I+E    +++ G    W +DT ASH       LF +        + +G
Subjt:  STVQIVYYNCNKPGYLARNCRN----KSRPAAQAN------LIE--DELVAMISE----VNVIGGFEGWWLDTGASHRVCHDLSLFRKCNEVKDKNILLG

Query:  DHHTTKVAGIGEVELKFTSDKTLVLKEVLHTREIRKNLVSGYLLNKAGFTQTIGSDLFTLTKNNVFVGKGYATDGMFKLNLEI-NKIASSAYMLASFNVW
        +   +K+AGIG++ +K     TLVLK+V H  ++R NL+SG  L++ G+     +  + LTK ++ + KG A   +++ N EI     ++A    S ++W
Subjt:  DHHTTKVAGIGEVELKFTSDKTLVLKEVLHTREIRKNLVSGYLLNKAGFTQTIGSDLFTLTKNNVFVGKGYATDGMFKLNLEI-NKIASSAYMLASFNVW

Query:  HARLCHVNKKLISNMSRLNLIPKLSLHDFEKCACCNQAKITKTSHK-SVTRVTEPLELIHSDLC-----------EFDGTLTRNCSDYTFIFKQINKSDA
        H R+ H+++K +  +++ +LI        + C  C   K  + S + S  R    L+L++SD+C           ++  T   + S   +++    K   
Subjt:  HARLCHVNKKLISNMSRLNLIPKLSLHDFEKCACCNQAKITKTSHK-SVTRVTEPLELIHSDLC-----------EFDGTLTRNCSDYTFIFKQINKSDA

Query:  YEIFKVFVTEIENQFNKRIKRLRSDREIEYDSVAFNEVYNSKGIIHETTTPYSPEMNEKAETKNRTLTELVVAILLESGAAPSWWGEIIKTVNYVLNRIP
        +++F+ F   +E +  +++KRLRSD   EY S  F E  +S GI HE T P +P+ N  AE  NRT+ E V ++L  +    S+WGE ++T  Y++NR P
Subjt:  YEIFKVFVTEIENQFNKRIKRLRSDREIEYDSVAFNEVYNSKGIIHETTTPYSPEMNEKAETKNRTLTELVVAILLESGAAPSWWGEIIKTVNYVLNRIP

P47024 Transposon Ty4-J Gag-Pol polyprotein (e value: 5.7e-04; % identity: 20.21)
Query:  CACCNQAKITKTSHKSVTR----------------VTEPLELIHSDLCEFDGTLTRNCSDYTFIFKQINKSDAYEIFKV--FVTEIENQFNKRIKRLRSD
        C  C  +K TK +H + +                 +  P+   ++D   +   +  N + Y       NK+    + +V   +  +E QF+++++ + SD
Subjt:  CACCNQAKITKTSHKSVTR----------------VTEPLELIHSDLCEFDGTLTRNCSDYTFIFKQINKSDAYEIFKV--FVTEIENQFNKRIKRLRSD

Query:  REIEYDSVAFNEVYNSKGIIHETTTPYSPEMNEKAETKNRTLTELVVAILLESGAAPSWWGEIIKTVNYVLNRIPKSNSKTSPSKSLN
        R  E+ +    E + SKGI H  T+      N +AE   RT+      +L +S     +W   + +   + N +   ++   P K+++
Subjt:  REIEYDSVAFNEVYNSKGIIHETTTPYSPEMNEKAETKNRTLTELVVAILLESGAAPSWWGEIIKTVNYVLNRIPKSNSKTSPSKSLN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 4.4e-20; % identity: 23.08)
Query:  GMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLKSLITRLRIEKEAR--KHDKKEKV----NVIPRKKPTAVLKPDLKPKGNKMKREFNKQNNPQS
        G P+D   QV  +++ LP  +K   + +  K    SL  +  RL I +E++    +  E V    NV+  +        + +          N+ N+ Q 
Subjt:  GMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLKSLITRLRIEKEAR--KHDKKEKV----NVIPRKKPTAVLKPDLKPKGNKMKREFNKQNNPQS

Query:  RST---------------VQIVYYNCNKPGYLARNCRNKSRPAAQANLIEDELVAMIS--------EVNVIGGFEGWWLDTGASHRVCHD---LSLFRKC
         S+                QI    C+  G+ A+ C    +  +  N  + +  +  +         VN       W LD+GA+H +  D   LS  +  
Subjt:  RST---------------VQIVYYNCNKPGYLARNCRNKSRPAAQANLIEDELVAMIS--------EVNVIGGFEGWWLDTGASHRVCHD---LSLFRKC

Query:  NEVKDKNILLGDHHTTKVAGIGEVELKFTSDKTLVLKEVLHTREIRKNLVSGYLLNKAGFTQTIGSDLFTLT------KNNVFVGKGYATDGMFKLNLEI
            D  +++ D  T  +   G   L  TS ++L L +VL+   I KNL+S Y L     T  +  + F  +         V + +G   D +++  +  
Subjt:  NEVKDKNILLGDHHTTKVAGIGEVELKFTSDKTLVLKEVLHTREIRKNLVSGYLLNKAGFTQTIGSDLFTLT------KNNVFVGKGYATDGMFKLNLEI

Query:  NKIAS---SAYMLASFNVWHARLCHVNKKLISNMSRLNLIPKLS-LHDFEKCACC--NQAKITKTSHKSVTRVTEPLELIHSDL----------CEFDGT
        ++  S   S    A+ + WH+RL H +  +++++   + +P L+  H    C+ C  N++     S+ ++T  ++PLE I+SD+            +   
Subjt:  NKIAS---SAYMLASFNVWHARLCHVNKKLISNMSRLNLIPKLS-LHDFEKCACC--NQAKITKTSHKSVTRVTEPLELIHSDL----------CEFDGT

Query:  LTRNCSDYTFIFKQINKSDAYEIFKVFVTEIENQFNKRIKRLRSDREIEYDSVAFNEVYNSKGIIHETTTPYSPEMNEKAETKNRTLTELVVAILLESGA
           + + YT+++    KS   + F +F + +EN+F  RI  L SD   E+  V   +  +  GI H T+ P++PE N  +E K+R + E+ + +L  +  
Subjt:  LTRNCSDYTFIFKQINKSDAYEIFKVFVTEIENQFNKRIKRLRSDREIEYDSVAFNEVYNSKGIIHETTTPYSPEMNEKAETKNRTLTELVVAILLESGA

Query:  APSWWGEIIKTVNYVLNRIP
          ++W        Y++NR+P
Subjt:  APSWWGEIIKTVNYVLNRIP

Arabidopsis top hits (e value, % identity, alignment)
AT4G00980.1 zinc knuckle (CCHC-type) family protein (e value: 1.6e-06; % identity: 26.38)
Query:  MTDDKSMEAQSHEIQKIAHEIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLKSLITRLRIEKEARKHDKKEKVNVIPRKKPTAVLKPDLK
        M +++ +  Q     KIA  I+S GM LD+ F V+ II K PP W+ F   L  + +   +  L+ R++ E+E  ++  K                P L 
Subjt:  MTDDKSMEAQSHEIQKIAHEIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLKSLITRLRIEKEARKHDKKEKVNVIPRKKPTAVLKPDLK

Query:  PKGNKMKREFNKQNNPQSRSTVQIVYYNCNKPGYLARNC---RNKSRPAAQANLIEDELVAMI
              +    K+  P+    V IV  NC + G+LA++C   ++  R + ++N I   + A +
Subjt:  PKGNKMKREFNKQNNPQSRSTVQIVYYNCNKPGYLARNC---RNKSRPAAQANLIEDELVAMI


Sequences
CDS sequence
ATGACTGATGACAAATCCATGGAGGCACAGTCGCATGAAATCCAGAAAATAGCCCACGAGATTATCAGTGAAGGTATGCCACTCGATGATCAATTTCAAGTTGCTGTTAT
TATTGATAAATTACCTCCATTGTGGAAGGATTTCAAGAACACCCTAAGGCACAAAACCAAGGAGTTCTCATTGAAAAGTCTAATCACGAGGCTAAGGATAGAGAAGGAAG
CAAGGAAGCATGATAAAAAAGAAAAGGTGAACGTTATCCCCAGAAAGAAGCCCACTGCAGTTTTGAAACCGGACCTAAAGCCGAAAGGAAACAAGATGAAACGAGAATTC
AACAAACAAAACAACCCACAGTCCAGAAGTACGGTACAAATTGTTTATTATAATTGTAATAAGCCTGGTTATTTAGCTAGAAATTGTAGAAACAAGAGTCGTCCTGCTGC
GCAGGCAAATCTAATAGAAGATGAATTAGTAGCTATGATATCTGAAGTTAATGTGATTGGGGGGTTTGAAGGTTGGTGGCTAGACACCGGTGCATCCCACCGTGTCTGCC
ACGACCTTAGTCTTTTTAGAAAATGTAATGAAGTTAAGGATAAAAATATCCTTCTAGGAGATCATCACACAACCAAGGTGGCCGGTATTGGAGAAGTAGAACTGAAATTC
ACATCCGACAAGACGCTTGTGCTGAAGGAAGTTCTGCATACTCGAGAAATTCGAAAGAATTTGGTCTCCGGATATCTCCTCAACAAAGCTGGATTCACACAAACCATAGG
ATCAGACTTGTTTACTTTAACTAAAAATAATGTATTTGTGGGGAAGGGTTACGCTACTGATGGCATGTTCAAATTGAATCTGGAAATTAATAAGATTGCATCTTCTGCTT
ACATGTTGGCTTCTTTTAATGTTTGGCATGCTAGACTTTGTCATGTTAATAAAAAATTGATTAGTAACATGAGTAGGTTAAATCTTATACCTAAGTTATCTCTGCATGAT
TTTGAGAAATGTGCATGTTGTAATCAAGCTAAGATAACTAAAACCTCGCATAAGTCTGTAACTAGAGTAACAGAGCCTTTAGAATTAATTCATTCTGACTTATGTGAATT
TGATGGCACTTTAACTAGAAACTGTTCTGACTACACTTTTATTTTTAAGCAGATAAATAAAAGTGATGCTTATGAAATATTCAAAGTCTTTGTGACTGAAATAGAGAACC
AGTTTAATAAAAGAATTAAGAGACTTCGTAGTGATAGAGAAATTGAATATGATTCGGTTGCTTTCAATGAAGTTTATAACTCAAAAGGAATAATACATGAAACTACTACG
CCTTATTCTCCTGAAATGAATGAAAAAGCAGAAACAAAGAATAGAACTCTAACTGAGTTAGTAGTTGCTATCTTACTTGAGTCAGGAGCAGCACCATCTTGGTGGGGTGA
AATAATTAAGACTGTTAATTATGTTCTTAATAGGATTCCTAAATCTAACAGTAAAACTTCACCATCGAAGTCCTTAAACATAAAACACCAAACTTGA
mRNA sequence
ATGACTGATGACAAATCCATGGAGGCACAGTCGCATGAAATCCAGAAAATAGCCCACGAGATTATCAGTGAAGGTATGCCACTCGATGATCAATTTCAAGTTGCTGTTAT
TATTGATAAATTACCTCCATTGTGGAAGGATTTCAAGAACACCCTAAGGCACAAAACCAAGGAGTTCTCATTGAAAAGTCTAATCACGAGGCTAAGGATAGAGAAGGAAG
CAAGGAAGCATGATAAAAAAGAAAAGGTGAACGTTATCCCCAGAAAGAAGCCCACTGCAGTTTTGAAACCGGACCTAAAGCCGAAAGGAAACAAGATGAAACGAGAATTC
AACAAACAAAACAACCCACAGTCCAGAAGTACGGTACAAATTGTTTATTATAATTGTAATAAGCCTGGTTATTTAGCTAGAAATTGTAGAAACAAGAGTCGTCCTGCTGC
GCAGGCAAATCTAATAGAAGATGAATTAGTAGCTATGATATCTGAAGTTAATGTGATTGGGGGGTTTGAAGGTTGGTGGCTAGACACCGGTGCATCCCACCGTGTCTGCC
ACGACCTTAGTCTTTTTAGAAAATGTAATGAAGTTAAGGATAAAAATATCCTTCTAGGAGATCATCACACAACCAAGGTGGCCGGTATTGGAGAAGTAGAACTGAAATTC
ACATCCGACAAGACGCTTGTGCTGAAGGAAGTTCTGCATACTCGAGAAATTCGAAAGAATTTGGTCTCCGGATATCTCCTCAACAAAGCTGGATTCACACAAACCATAGG
ATCAGACTTGTTTACTTTAACTAAAAATAATGTATTTGTGGGGAAGGGTTACGCTACTGATGGCATGTTCAAATTGAATCTGGAAATTAATAAGATTGCATCTTCTGCTT
ACATGTTGGCTTCTTTTAATGTTTGGCATGCTAGACTTTGTCATGTTAATAAAAAATTGATTAGTAACATGAGTAGGTTAAATCTTATACCTAAGTTATCTCTGCATGAT
TTTGAGAAATGTGCATGTTGTAATCAAGCTAAGATAACTAAAACCTCGCATAAGTCTGTAACTAGAGTAACAGAGCCTTTAGAATTAATTCATTCTGACTTATGTGAATT
TGATGGCACTTTAACTAGAAACTGTTCTGACTACACTTTTATTTTTAAGCAGATAAATAAAAGTGATGCTTATGAAATATTCAAAGTCTTTGTGACTGAAATAGAGAACC
AGTTTAATAAAAGAATTAAGAGACTTCGTAGTGATAGAGAAATTGAATATGATTCGGTTGCTTTCAATGAAGTTTATAACTCAAAAGGAATAATACATGAAACTACTACG
CCTTATTCTCCTGAAATGAATGAAAAAGCAGAAACAAAGAATAGAACTCTAACTGAGTTAGTAGTTGCTATCTTACTTGAGTCAGGAGCAGCACCATCTTGGTGGGGTGA
AATAATTAAGACTGTTAATTATGTTCTTAATAGGATTCCTAAATCTAACAGTAAAACTTCACCATCGAAGTCCTTAAACATAAAACACCAAACTTGA
Protein sequence
MTDDKSMEAQSHEIQKIAHEIISEGMPLDDQFQVAVIIDKLPPLWKDFKNTLRHKTKEFSLKSLITRLRIEKEARKHDKKEKVNVIPRKKPTAVLKPDLKPKGNKMKREF
NKQNNPQSRSTVQIVYYNCNKPGYLARNCRNKSRPAAQANLIEDELVAMISEVNVIGGFEGWWLDTGASHRVCHDLSLFRKCNEVKDKNILLGDHHTTKVAGIGEVELKF
TSDKTLVLKEVLHTREIRKNLVSGYLLNKAGFTQTIGSDLFTLTKNNVFVGKGYATDGMFKLNLEINKIASSAYMLASFNVWHARLCHVNKKLISNMSRLNLIPKLSLHD
FEKCACCNQAKITKTSHKSVTRVTEPLELIHSDLCEFDGTLTRNCSDYTFIFKQINKSDAYEIFKVFVTEIENQFNKRIKRLRSDREIEYDSVAFNEVYNSKGIIHETTT
PYSPEMNEKAETKNRTLTELVVAILLESGAAPSWWGEIIKTVNYVLNRIPKSNSKTSPSKSLNIKHQT
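
The relationship between the CDS and the protein sequence above can be checked by translating codons. This is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; `translate` and `CODON_TABLE` are hypothetical names introduced here, and the two test fragments are the first 30 and last 18 bases of the CDS listed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGACTGATGACAAATCCATGGAGGCACAG"))  # MTDDKSMEAQ
print(translate("ATAAAACACCAAACTTGA"))              # IKHQT
```

Both fragments reproduce the corresponding ends of the protein sequence: it begins with MTDDKSMEAQ and ends with IKHQT, followed by the TGA stop codon.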