; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0010756 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0010756
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr06:11964066..11966552
RNA-Seq ExpressionPay0010756
SyntenyPay0010756
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR025314 - Domain of unknown function DUF4219


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036574.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]5.9e-22857.83Show/hide
Query:  MQKNNVARLISTILDGTNYITWAHLMRSFLIGRKLWRIVIGDITKPLKP--------------------------TTPMKNIDNTANTSDLDAFDTAKEL
        M+KN+VAR ISTILDGTNYITWAH MRSFLIGRKLWRIV GDITKP+KP                           T + N    A  +  DAFD+AKEL
Subjt:  MQKNNVARLISTILDGTNYITWAHLMRSFLIGRKLWRIVIGDITKPLKP--------------------------TTPMKNIDNTANTSDLDAFDTAKEL

Query:  WDFLSTRFQSIGLAHYYQLYSTLINLNQEVGQSVNEYLATLQPIWTQLDQAKINPDHIRLIKVLMGLRPEYESVRAALLHRDPLPSLDAAVQEILFEEKR
        WDFL TRFQSIGLAHYYQL+STL++LNQE+GQSVNEYLATLQPIWTQLDQAKI+PDHIRLIKVLMGLRPEYE VRAALLHR+PLPSLDAAVQEILFEEKR
Subjt:  WDFLSTRFQSIGLAHYYQLYSTLINLNQEVGQSVNEYLATLQPIWTQLDQAKINPDHIRLIKVLMGLRPEYESVRAALLHRDPLPSLDAAVQEILFEEKR

Query:  LGIVSALPSDP--NGTNI-VTNEVTFWIIVQL-------------------VHLV----------------PLVTHRNPSSP------------------
        LGIVS+L SD     T++   NE  F    +L                    H++                P  +H+  SS                   
Subjt:  LGIVSALPSDP--NGTNI-VTNEVTFWIIVQL-------------------VHLV----------------PLVTHRNPSSP------------------

Query:  -----LKLVISSNSTALAVTPGTSWLLDSTYCNHMTSGISLLSSHIPVHSLPPIHSTDGNHMSISHIGTVNTPTIKLSNTYHVPNLTYNLASVGQLC-LG
             LK VISS STALAVTPGTSWLLDS  CNHMTS ISLLSSHIPV SLPPIHS DGN MSISHIGTVNTPTIKLSNTYHVPNLT+NLASVGQLC LG
Subjt:  -----LKLVISSNSTALAVTPGTSWLLDSTYCNHMTSGISLLSSHIPVHSLPPIHSTDGNHMSISHIGTVNTPTIKLSNTYHVPNLTYNLASVGQLC-LG

Query:  L------------------------------------------------SDGTI-DWYGTEGHASSDKLRSLASNGHLNNVSKFSTLDYLNCKLAKQLAL
        L                                                +D T+  W+   GHASS+KLRSL S G LNNVS+F+T D L+CK+AKQ AL
Subjt:  L------------------------------------------------SDGTI-DWYGTEGHASSDKLRSLASNGHLNNVSKFSTLDYLNCKLAKQLAL

Query:  SFPNSASLCDKPFGLIHSDIWGPAPCAT------------------------------------------------------------------------
        SFP SASLCDKPFGLIHSDIWGPAPC T                                                                        
Subjt:  SFPNSASLCDKPFGLIHSDIWGPAPCAT------------------------------------------------------------------------

Query:  ------------QNGRAERKHRHILDSVRAQLLSASCPKIFWGEAALTS----------VIHNISPFERLHGTPPSYSNLKIFGCACFVLLHPHEHTKLE
                    QNGRAERKHRHILDSVRAQLLS SCP+ FWGEAALTS          VIHNISPFERL+GTPP+YS+LK+FGCACFVLLH HEHTKLE
Subjt:  ------------QNGRAERKHRHILDSVRAQLLSASCPKIFWGEAALTS----------VIHNISPFERLHGTPPSYSNLKIFGCACFVLLHPHEHTKLE

Query:  PRARLCCFLGYGTKHKGFRCWDPISQRLRISRHVTFWEHHMFSSLSSFHASLSSSQSFFTDPSTTLFPTPDSPPNTIPYPPLSSELTPSHTTSTLPDLPS
        PRARLCCFLGYGT+HKGFRCWDPISQRLRISRHVTFWEH MFSSLSSFHASLSS  SFFTDPST LFPTPDSP NT   PPL+SELT SHTTS LP+LPS
Subjt:  PRARLCCFLGYGTKHKGFRCWDPISQRLRISRHVTFWEHHMFSSLSSFHASLSSSQSFFTDPSTTLFPTPDSPPNTIPYPPLSSELTPSHTTSTLPDLPS

Query:  ISSEESEPTPV
        +  EE E  PV
Subjt:  ISSEESEPTPV

KAA0037189.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.7e-25771.66Show/hide
Query:  MQKNNVARLISTILDGTNYITWAHLMRSFLIGRKLWRIVIGDITKPLKPTTPMKNIDNTANTSD------------------------------------
        MQKNNVARLISTILD                         GDITKP+KPTTPMKNIDNTANTSD                                    
Subjt:  MQKNNVARLISTILDGTNYITWAHLMRSFLIGRKLWRIVIGDITKPLKPTTPMKNIDNTANTSD------------------------------------

Query:  ------LDAFDTAKELWDFLSTRFQSIGLAHYYQLYSTLINLNQEVGQSVNEYLATLQPIWTQLDQAKINPDHIRLIKVLMGLRPEYESVRAALLHRDPL
              LD+FDTAKELWDFL+TRFQSIGLAHYYQLYSTLINLNQEVGQSVNEYLATLQPIWTQLDQ KINPDHIRLIKVLMGLRPEYESVRAALLHR+PL
Subjt:  ------LDAFDTAKELWDFLSTRFQSIGLAHYYQLYSTLINLNQEVGQSVNEYLATLQPIWTQLDQAKINPDHIRLIKVLMGLRPEYESVRAALLHRDPL

Query:  PSLDAAVQEILFEEKRLGIVSALPSD----------PNGTNIVTNEVTFWIIVQLVHLVPLVTHRNPSSPLKLVISSNSTALAVTPGTSWLLDSTYCNHM
        PSLDAAVQEILFEEKRLGIVSALPSD          PNGTNIVTNEVTFWIIVQLVHLVPLVTHRNPSSPLKLVISSNSTALAVTPGTSWLLDSTYCNHM
Subjt:  PSLDAAVQEILFEEKRLGIVSALPSD----------PNGTNIVTNEVTFWIIVQLVHLVPLVTHRNPSSPLKLVISSNSTALAVTPGTSWLLDSTYCNHM

Query:  TSGISLLSSHIPVHSLPPIHSTDGNHMSISHIGTVNTPTIKLSNTYHVPNLTYNLASVGQ------LCLGLSDGTI-DWYGTEGHASSDKLRSLASNGHL
        TSGISLLSSHIPVHSLPPIHSTDG+HMSISHIGT ++ T ++  T       + L S+        +    +D TI  W+   GHASSDKLRSLASNGHL
Subjt:  TSGISLLSSHIPVHSLPPIHSTDGNHMSISHIGTVNTPTIKLSNTYHVPNLTYNLASVGQ------LCLGLSDGTI-DWYGTEGHASSDKLRSLASNGHL

Query:  NNVSKFSTLDYLNCKLAKQLALSFPNSASLCDKPFGLIHSDIWGPAPCAT--------------------------------------------------
        NNVSKFSTLDYLNCKLAKQLALSFPNSASLCDKPFGLIHSDIWGPAPCAT                                                  
Subjt:  NNVSKFSTLDYLNCKLAKQLALSFPNSASLCDKPFGLIHSDIWGPAPCAT--------------------------------------------------

Query:  ----QNGRAERKHRHILDSVRAQLLSASCPKIFWGEAALTS----------VIHNISPFERLHGTPPSYSNLKIFGCACFVLLHPHEHTKLEPRARLCCF
            QNGRAERKHRHILDSVRAQLLSASCPK FWGEAA+TS          VIHNISPFERLHGTPPSYSNLKIFGC CFVLLHPHEHTKLEPRARLCCF
Subjt:  ----QNGRAERKHRHILDSVRAQLLSASCPKIFWGEAALTS----------VIHNISPFERLHGTPPSYSNLKIFGCACFVLLHPHEHTKLEPRARLCCF

Query:  LGYGTKHKGFRCWDPISQRLRISRHVTFWEHHMFSSLSSFHASLSSSQSFFTDPSTTLFPTPDSPPNTIPYPPLSSELTPS
        LGYGT+HKGFRCWDPISQRLRISRHVTFWEH MFSSLSSFHASLSSSQSFFTDPSTTLFPTPDSPPNTIPYPPLSSELTPS
Subjt:  LGYGTKHKGFRCWDPISQRLRISRHVTFWEHHMFSSLSSFHASLSSSQSFFTDPSTTLFPTPDSPPNTIPYPPLSSELTPS

KAA0043149.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.5e-22857.95Show/hide
Query:  MQKNNVARLISTILDGTNYITWAHLMRSFLIGRKLWRIVIGDITKPLKP--------------------------TTPMKNIDNTANTSDLDAFDTAKEL
        M+KN+VAR ISTILDGTNYITWAH MRSFLIGRKLWRIV GDITKP+KP                           T + N    A  +  DAFD+AKEL
Subjt:  MQKNNVARLISTILDGTNYITWAHLMRSFLIGRKLWRIVIGDITKPLKP--------------------------TTPMKNIDNTANTSDLDAFDTAKEL

Query:  WDFLSTRFQSIGLAHYYQLYSTLINLNQEVGQSVNEYLATLQPIWTQLDQAKINPDHIRLIKVLMGLRPEYESVRAALLHRDPLPSLDAAVQEILFEEKR
        WDFL TRFQSIGLAHYYQL+STL++LNQE+GQSVNEYLATLQPIWTQLDQAKI+PDHIRLIKVLMGLRPEYESVRAALLHR+PLPSLDAAVQEILFEEKR
Subjt:  WDFLSTRFQSIGLAHYYQLYSTLINLNQEVGQSVNEYLATLQPIWTQLDQAKINPDHIRLIKVLMGLRPEYESVRAALLHRDPLPSLDAAVQEILFEEKR

Query:  LGIVSALPSDP--NGTNI-VTNEVTFWIIVQL-------------------VHLV----------------PLVTHRNPSSP------------------
        LGIVS+L SD     T++   NE  F    +L                    H++                P  +H+  SS                   
Subjt:  LGIVSALPSDP--NGTNI-VTNEVTFWIIVQL-------------------VHLV----------------PLVTHRNPSSP------------------

Query:  -----LKLVISSNSTALAVTPGTSWLLDSTYCNHMTSGISLLSSHIPVHSLPPIHSTDGNHMSISHIGTVNTPTIKLSNTYHVPNLTYNLASVGQLC-LG
             LK VISS STALAVTPGTSWLLDS  CNHMTS ISLLSSHIPV SLPPIHS DGN MSISHIGTVNTPTIKLSNTYHVPNLT+NLASVGQLC LG
Subjt:  -----LKLVISSNSTALAVTPGTSWLLDSTYCNHMTSGISLLSSHIPVHSLPPIHSTDGNHMSISHIGTVNTPTIKLSNTYHVPNLTYNLASVGQLC-LG

Query:  L------------------------------------------------SDGTI-DWYGTEGHASSDKLRSLASNGHLNNVSKFSTLDYLNCKLAKQLAL
        L                                                +D T+  W+   GHASS+KLRSL S G LNNVS+F+T D L+CK+AKQ AL
Subjt:  L------------------------------------------------SDGTI-DWYGTEGHASSDKLRSLASNGHLNNVSKFSTLDYLNCKLAKQLAL

Query:  SFPNSASLCDKPFGLIHSDIWGPAPCAT------------------------------------------------------------------------
        SFP SASLCDKPFGLIHSDIWGPAPC T                                                                        
Subjt:  SFPNSASLCDKPFGLIHSDIWGPAPCAT------------------------------------------------------------------------

Query:  ------------QNGRAERKHRHILDSVRAQLLSASCPKIFWGEAALTS----------VIHNISPFERLHGTPPSYSNLKIFGCACFVLLHPHEHTKLE
                    QNGRAERKHRHILDSVRAQLLS SCP+ FWGEAALTS          VIHNISPFERL+GTPP+YS+LK+FGCACFVLLH HEHTKLE
Subjt:  ------------QNGRAERKHRHILDSVRAQLLSASCPKIFWGEAALTS----------VIHNISPFERLHGTPPSYSNLKIFGCACFVLLHPHEHTKLE

Query:  PRARLCCFLGYGTKHKGFRCWDPISQRLRISRHVTFWEHHMFSSLSSFHASLSSSQSFFTDPSTTLFPTPDSPPNTIPYPPLSSELTPSHTTSTLPDLPS
        PRARLCCFLGYGT+HKGFRCWDPISQRLRISRHVTFWEH MFSSLSSFHASLSS  SFFTDPST LFPTPDSP NT   PPL+SELT SHTTS LP+LPS
Subjt:  PRARLCCFLGYGTKHKGFRCWDPISQRLRISRHVTFWEHHMFSSLSSFHASLSSSQSFFTDPSTTLFPTPDSPPNTIPYPPLSSELTPSHTTSTLPDLPS

Query:  ISSEESEPTPV
        +  EE E  PV
Subjt:  ISSEESEPTPV

KAA0058316.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]7.7e-22059.73Show/hide
Query:  MQKNNVARLISTILDGTNYITWAHLMRSFLIGRKLWRIVIGDITKPLKPTTPMKNIDN--------------------TANTS------DLDAFDTAKEL
        M+KN+VAR ISTILDGTNYITWAH MRSFLIGRKLWRIV GDITKP+KP     + +N                      NTS        DAFD+AK+L
Subjt:  MQKNNVARLISTILDGTNYITWAHLMRSFLIGRKLWRIVIGDITKPLKPTTPMKNIDN--------------------TANTS------DLDAFDTAKEL

Query:  WDFLSTRFQSIGLAHYYQLYSTLINLNQEVGQSVNEYLATLQPIWTQLDQAKINPDHIRLIKVLMGLRPEYESVRAALLHRDPLPSLDAAVQEILFEEKR
        WDFL TRFQSIGLAHYYQL+STL++LNQEVGQSVNEYLATLQPIWTQLDQAKI+PDHIRLIKVLMGLRPEYE VRAALLHR+ LPSLDAAVQEILFEEKR
Subjt:  WDFLSTRFQSIGLAHYYQLYSTLINLNQEVGQSVNEYLATLQPIWTQLDQAKINPDHIRLIKVLMGLRPEYESVRAALLHRDPLPSLDAAVQEILFEEKR

Query:  LGIVSALPSDP--NGTNI-VTNEVTFWIIVQL-------------------VHLVPLVTHRNPSSP----------------------------------
        LGIVS+L SD     T++   NE  F    +L                    H++     R P  P                                  
Subjt:  LGIVSALPSDP--NGTNI-VTNEVTFWIIVQL-------------------VHLVPLVTHRNPSSP----------------------------------

Query:  -----LKLVISSNSTALAVTPGTSWLLDSTYCNHMTSGISLLSSHIPVHSLPPIHSTDGNHMSISHIGTVNTPTIKLSNTYHVPNLTYNLASVGQLC-LG
             LK VISS STALAVTPGTSWLLDS  CNHMTS ISLLSSH PV SLPPIHS DGN MSISHIGTVNTPTIKLSNTYHVPNLT+NLASVGQLC LG
Subjt:  -----LKLVISSNSTALAVTPGTSWLLDSTYCNHMTSGISLLSSHIPVHSLPPIHSTDGNHMSISHIGTVNTPTIKLSNTYHVPNLTYNLASVGQLC-LG

Query:  L------------------------------------------------SDGTI-DWYGTEGHASSDKLRSLASNGHLNNVSKFSTLDYLNCKLAKQLAL
        L                                                +D T+  W+   GHASS+KLRSL S G LNNVS+F+T D L+CK+AKQ AL
Subjt:  L------------------------------------------------SDGTI-DWYGTEGHASSDKLRSLASNGHLNNVSKFSTLDYLNCKLAKQLAL

Query:  SFPNSASLCDKPFGLIHSDIWGPAPCAT---------------------------------QNGRAERKHRHILDSVRAQLLSASCPKIFWGEAALTSVI
        SFP SASLCDKPFGLIHSDIWGPAPC T                                 QNGRAERKHRHILD                      SVI
Subjt:  SFPNSASLCDKPFGLIHSDIWGPAPCAT---------------------------------QNGRAERKHRHILDSVRAQLLSASCPKIFWGEAALTSVI

Query:  HNISPFERLHGTPPSYSNLKIFGCACFVLLHPHEHTKLEPRARLCCFLGYGTKHKGFRCWDPISQRLRISRHVTFWEHHMFSSLSSFHASLSSSQSFFTD
        HNISPFERL+GTPP+YS+LK+FGCACFVLLH HEHTKLEPRARLCCFLGYGT+HKGFRCWDPISQRLRISRHVTFWEH MFSSLSSFHASLSS  SFFTD
Subjt:  HNISPFERLHGTPPSYSNLKIFGCACFVLLHPHEHTKLEPRARLCCFLGYGTKHKGFRCWDPISQRLRISRHVTFWEHHMFSSLSSFHASLSSSQSFFTD

Query:  PSTTLFPTPDSPPNTIPYPPLSSELTPSHTTSTLPDLPSISSEESEPTPV
        PST LFPTPDSP NT   PPL+SELT SHTTS LP+LPS+  EE E  PV
Subjt:  PSTTLFPTPDSPPNTIPYPPLSSELTPSHTTSTLPDLPSISSEESEPTPV

XP_016901896.1 PREDICTED: uncharacterized protein LOC107991463 [Cucumis melo]4.5e-31087.58Show/hide
Query:  MQKNNVARLISTILDGTNYITWAHLMRSFLIGRKLWRIVIGDITKPLKPTTPMKNIDNTANTSD------------------------------------
        MQKNNVARLISTILDGTNYITWAHLMRSFLIGRKLWRIV GDITKP+KPTTPMKNIDNTANTSD                                    
Subjt:  MQKNNVARLISTILDGTNYITWAHLMRSFLIGRKLWRIVIGDITKPLKPTTPMKNIDNTANTSD------------------------------------

Query:  ------LDAFDTAKELWDFLSTRFQSIGLAHYYQLYSTLINLNQEVGQSVNEYLATLQPIWTQLDQAKINPDHIRLIKVLMGLRPEYESVRAALLHRDPL
              LD+FDTAKELWDFL+TRFQSIGLAHYYQLYSTLINLNQEVGQSVNEYLATLQPIWTQLDQ KINPDHIRLIKVLMGLRPEYESVRAALLHR+PL
Subjt:  ------LDAFDTAKELWDFLSTRFQSIGLAHYYQLYSTLINLNQEVGQSVNEYLATLQPIWTQLDQAKINPDHIRLIKVLMGLRPEYESVRAALLHRDPL

Query:  PSLDAAVQEILFEEKRLGIVSALPSD----------PNGTNIVTNEVTFWIIVQLVHLVPLVTHRNPSSPLKLVISSNSTALAVTPGTSWLLDSTYCNHM
        PSLDAAVQEILFEEKRLGIVSALPSD          PNGTNIVTNEVTFWIIVQLVHLVPLVTHRNPSSPLKLVISSNSTALAVTPGTSWLLDSTYCNHM
Subjt:  PSLDAAVQEILFEEKRLGIVSALPSD----------PNGTNIVTNEVTFWIIVQLVHLVPLVTHRNPSSPLKLVISSNSTALAVTPGTSWLLDSTYCNHM

Query:  TSGISLLSSHIPVHSLPPIHSTDGNHMSISHIGTVNTPTIKLSNTYHVPNLTYNLASVGQLCLGLSDGTIDWYGTEGHASSDKLRSLASNGHLNNVSKFS
        TSGISLLSSHIPVHSLPPIHSTDG+HMSISHIGTVNTPTIKLSNTYHVPNLT+NLASVGQLCLGLSDGT DWYGTEGHASSDKLRSLASNGHLNNVSKFS
Subjt:  TSGISLLSSHIPVHSLPPIHSTDGNHMSISHIGTVNTPTIKLSNTYHVPNLTYNLASVGQLCLGLSDGTIDWYGTEGHASSDKLRSLASNGHLNNVSKFS

Query:  TLDYLNCKLAKQLALSFPNSASLCDKPFGLIHSDIWGPAPCATQNGRAERKHRHILDSVRAQLLSASCPKIFWGEAALTS----------VIHNISPFER
        TLDYLNCKLAKQLALSFPNSASLCDKPFGLIHSDIWGPAPCATQNGRAERKHRHILDSVRAQLLSASCPK FWGEAA+TS          VIHNISPFER
Subjt:  TLDYLNCKLAKQLALSFPNSASLCDKPFGLIHSDIWGPAPCATQNGRAERKHRHILDSVRAQLLSASCPKIFWGEAALTS----------VIHNISPFER

Query:  LHGTPPSYSNLKIFGCACFVLLHPHEHTKLEPRARLCCFLGYGTKHKGFRCWDPISQRLRISRHVTFWEHHMFSSLSSFHASLSSSQSFFTDPSTTLFPT
        LHGTPPSYSNLKIFG  CFVLLHPHEHTKLEPRARLCCFLGYGT+HKGFRCWDPISQRLRISRHVTFWEH MFSSLSSFHASLSSSQSFFTDPSTTLFPT
Subjt:  LHGTPPSYSNLKIFGCACFVLLHPHEHTKLEPRARLCCFLGYGTKHKGFRCWDPISQRLRISRHVTFWEHHMFSSLSSFHASLSSSQSFFTDPSTTLFPT

Query:  PDSPPNTIPYPPLSSELTPS
        PDSPPNTIPYPPLSSELTPS
Subjt:  PDSPPNTIPYPPLSSELTPS

TrEMBL top hitse value%identityAlignment
A0A1S4E0Y9 uncharacterized protein LOC1079914632.2e-31087.58Show/hide
Query:  MQKNNVARLISTILDGTNYITWAHLMRSFLIGRKLWRIVIGDITKPLKPTTPMKNIDNTANTSD------------------------------------
        MQKNNVARLISTILDGTNYITWAHLMRSFLIGRKLWRIV GDITKP+KPTTPMKNIDNTANTSD                                    
Subjt:  MQKNNVARLISTILDGTNYITWAHLMRSFLIGRKLWRIVIGDITKPLKPTTPMKNIDNTANTSD------------------------------------

Query:  ------LDAFDTAKELWDFLSTRFQSIGLAHYYQLYSTLINLNQEVGQSVNEYLATLQPIWTQLDQAKINPDHIRLIKVLMGLRPEYESVRAALLHRDPL
              LD+FDTAKELWDFL+TRFQSIGLAHYYQLYSTLINLNQEVGQSVNEYLATLQPIWTQLDQ KINPDHIRLIKVLMGLRPEYESVRAALLHR+PL
Subjt:  ------LDAFDTAKELWDFLSTRFQSIGLAHYYQLYSTLINLNQEVGQSVNEYLATLQPIWTQLDQAKINPDHIRLIKVLMGLRPEYESVRAALLHRDPL

Query:  PSLDAAVQEILFEEKRLGIVSALPSD----------PNGTNIVTNEVTFWIIVQLVHLVPLVTHRNPSSPLKLVISSNSTALAVTPGTSWLLDSTYCNHM
        PSLDAAVQEILFEEKRLGIVSALPSD          PNGTNIVTNEVTFWIIVQLVHLVPLVTHRNPSSPLKLVISSNSTALAVTPGTSWLLDSTYCNHM
Subjt:  PSLDAAVQEILFEEKRLGIVSALPSD----------PNGTNIVTNEVTFWIIVQLVHLVPLVTHRNPSSPLKLVISSNSTALAVTPGTSWLLDSTYCNHM

Query:  TSGISLLSSHIPVHSLPPIHSTDGNHMSISHIGTVNTPTIKLSNTYHVPNLTYNLASVGQLCLGLSDGTIDWYGTEGHASSDKLRSLASNGHLNNVSKFS
        TSGISLLSSHIPVHSLPPIHSTDG+HMSISHIGTVNTPTIKLSNTYHVPNLT+NLASVGQLCLGLSDGT DWYGTEGHASSDKLRSLASNGHLNNVSKFS
Subjt:  TSGISLLSSHIPVHSLPPIHSTDGNHMSISHIGTVNTPTIKLSNTYHVPNLTYNLASVGQLCLGLSDGTIDWYGTEGHASSDKLRSLASNGHLNNVSKFS

Query:  TLDYLNCKLAKQLALSFPNSASLCDKPFGLIHSDIWGPAPCATQNGRAERKHRHILDSVRAQLLSASCPKIFWGEAALTS----------VIHNISPFER
        TLDYLNCKLAKQLALSFPNSASLCDKPFGLIHSDIWGPAPCATQNGRAERKHRHILDSVRAQLLSASCPK FWGEAA+TS          VIHNISPFER
Subjt:  TLDYLNCKLAKQLALSFPNSASLCDKPFGLIHSDIWGPAPCATQNGRAERKHRHILDSVRAQLLSASCPKIFWGEAALTS----------VIHNISPFER

Query:  LHGTPPSYSNLKIFGCACFVLLHPHEHTKLEPRARLCCFLGYGTKHKGFRCWDPISQRLRISRHVTFWEHHMFSSLSSFHASLSSSQSFFTDPSTTLFPT
        LHGTPPSYSNLKIFG  CFVLLHPHEHTKLEPRARLCCFLGYGT+HKGFRCWDPISQRLRISRHVTFWEH MFSSLSSFHASLSSSQSFFTDPSTTLFPT
Subjt:  LHGTPPSYSNLKIFGCACFVLLHPHEHTKLEPRARLCCFLGYGTKHKGFRCWDPISQRLRISRHVTFWEHHMFSSLSSFHASLSSSQSFFTDPSTTLFPT

Query:  PDSPPNTIPYPPLSSELTPS
        PDSPPNTIPYPPLSSELTPS
Subjt:  PDSPPNTIPYPPLSSELTPS

A0A5A7SZ66 Retrovirus-related Pol polyprotein from transposon TNT 1-942.8e-22857.83Show/hide
Query:  MQKNNVARLISTILDGTNYITWAHLMRSFLIGRKLWRIVIGDITKPLKP--------------------------TTPMKNIDNTANTSDLDAFDTAKEL
        M+KN+VAR ISTILDGTNYITWAH MRSFLIGRKLWRIV GDITKP+KP                           T + N    A  +  DAFD+AKEL
Subjt:  MQKNNVARLISTILDGTNYITWAHLMRSFLIGRKLWRIVIGDITKPLKP--------------------------TTPMKNIDNTANTSDLDAFDTAKEL

Query:  WDFLSTRFQSIGLAHYYQLYSTLINLNQEVGQSVNEYLATLQPIWTQLDQAKINPDHIRLIKVLMGLRPEYESVRAALLHRDPLPSLDAAVQEILFEEKR
        WDFL TRFQSIGLAHYYQL+STL++LNQE+GQSVNEYLATLQPIWTQLDQAKI+PDHIRLIKVLMGLRPEYE VRAALLHR+PLPSLDAAVQEILFEEKR
Subjt:  WDFLSTRFQSIGLAHYYQLYSTLINLNQEVGQSVNEYLATLQPIWTQLDQAKINPDHIRLIKVLMGLRPEYESVRAALLHRDPLPSLDAAVQEILFEEKR

Query:  LGIVSALPSDP--NGTNI-VTNEVTFWIIVQL-------------------VHLV----------------PLVTHRNPSSP------------------
        LGIVS+L SD     T++   NE  F    +L                    H++                P  +H+  SS                   
Subjt:  LGIVSALPSDP--NGTNI-VTNEVTFWIIVQL-------------------VHLV----------------PLVTHRNPSSP------------------

Query:  -----LKLVISSNSTALAVTPGTSWLLDSTYCNHMTSGISLLSSHIPVHSLPPIHSTDGNHMSISHIGTVNTPTIKLSNTYHVPNLTYNLASVGQLC-LG
             LK VISS STALAVTPGTSWLLDS  CNHMTS ISLLSSHIPV SLPPIHS DGN MSISHIGTVNTPTIKLSNTYHVPNLT+NLASVGQLC LG
Subjt:  -----LKLVISSNSTALAVTPGTSWLLDSTYCNHMTSGISLLSSHIPVHSLPPIHSTDGNHMSISHIGTVNTPTIKLSNTYHVPNLTYNLASVGQLC-LG

Query:  L------------------------------------------------SDGTI-DWYGTEGHASSDKLRSLASNGHLNNVSKFSTLDYLNCKLAKQLAL
        L                                                +D T+  W+   GHASS+KLRSL S G LNNVS+F+T D L+CK+AKQ AL
Subjt:  L------------------------------------------------SDGTI-DWYGTEGHASSDKLRSLASNGHLNNVSKFSTLDYLNCKLAKQLAL

Query:  SFPNSASLCDKPFGLIHSDIWGPAPCAT------------------------------------------------------------------------
        SFP SASLCDKPFGLIHSDIWGPAPC T                                                                        
Subjt:  SFPNSASLCDKPFGLIHSDIWGPAPCAT------------------------------------------------------------------------

Query:  ------------QNGRAERKHRHILDSVRAQLLSASCPKIFWGEAALTS----------VIHNISPFERLHGTPPSYSNLKIFGCACFVLLHPHEHTKLE
                    QNGRAERKHRHILDSVRAQLLS SCP+ FWGEAALTS          VIHNISPFERL+GTPP+YS+LK+FGCACFVLLH HEHTKLE
Subjt:  ------------QNGRAERKHRHILDSVRAQLLSASCPKIFWGEAALTS----------VIHNISPFERLHGTPPSYSNLKIFGCACFVLLHPHEHTKLE

Query:  PRARLCCFLGYGTKHKGFRCWDPISQRLRISRHVTFWEHHMFSSLSSFHASLSSSQSFFTDPSTTLFPTPDSPPNTIPYPPLSSELTPSHTTSTLPDLPS
        PRARLCCFLGYGT+HKGFRCWDPISQRLRISRHVTFWEH MFSSLSSFHASLSS  SFFTDPST LFPTPDSP NT   PPL+SELT SHTTS LP+LPS
Subjt:  PRARLCCFLGYGTKHKGFRCWDPISQRLRISRHVTFWEHHMFSSLSSFHASLSSSQSFFTDPSTTLFPTPDSPPNTIPYPPLSSELTPSHTTSTLPDLPS

Query:  ISSEESEPTPV
        +  EE E  PV
Subjt:  ISSEESEPTPV

A0A5A7UVX4 Retrovirus-related Pol polyprotein from transposon TNT 1-943.7e-22059.73Show/hide
Query:  MQKNNVARLISTILDGTNYITWAHLMRSFLIGRKLWRIVIGDITKPLKPTTPMKNIDN--------------------TANTS------DLDAFDTAKEL
        M+KN+VAR ISTILDGTNYITWAH MRSFLIGRKLWRIV GDITKP+KP     + +N                      NTS        DAFD+AK+L
Subjt:  MQKNNVARLISTILDGTNYITWAHLMRSFLIGRKLWRIVIGDITKPLKPTTPMKNIDN--------------------TANTS------DLDAFDTAKEL

Query:  WDFLSTRFQSIGLAHYYQLYSTLINLNQEVGQSVNEYLATLQPIWTQLDQAKINPDHIRLIKVLMGLRPEYESVRAALLHRDPLPSLDAAVQEILFEEKR
        WDFL TRFQSIGLAHYYQL+STL++LNQEVGQSVNEYLATLQPIWTQLDQAKI+PDHIRLIKVLMGLRPEYE VRAALLHR+ LPSLDAAVQEILFEEKR
Subjt:  WDFLSTRFQSIGLAHYYQLYSTLINLNQEVGQSVNEYLATLQPIWTQLDQAKINPDHIRLIKVLMGLRPEYESVRAALLHRDPLPSLDAAVQEILFEEKR

Query:  LGIVSALPSDP--NGTNI-VTNEVTFWIIVQL-------------------VHLVPLVTHRNPSSP----------------------------------
        LGIVS+L SD     T++   NE  F    +L                    H++     R P  P                                  
Subjt:  LGIVSALPSDP--NGTNI-VTNEVTFWIIVQL-------------------VHLVPLVTHRNPSSP----------------------------------

Query:  -----LKLVISSNSTALAVTPGTSWLLDSTYCNHMTSGISLLSSHIPVHSLPPIHSTDGNHMSISHIGTVNTPTIKLSNTYHVPNLTYNLASVGQLC-LG
             LK VISS STALAVTPGTSWLLDS  CNHMTS ISLLSSH PV SLPPIHS DGN MSISHIGTVNTPTIKLSNTYHVPNLT+NLASVGQLC LG
Subjt:  -----LKLVISSNSTALAVTPGTSWLLDSTYCNHMTSGISLLSSHIPVHSLPPIHSTDGNHMSISHIGTVNTPTIKLSNTYHVPNLTYNLASVGQLC-LG

Query:  L------------------------------------------------SDGTI-DWYGTEGHASSDKLRSLASNGHLNNVSKFSTLDYLNCKLAKQLAL
        L                                                +D T+  W+   GHASS+KLRSL S G LNNVS+F+T D L+CK+AKQ AL
Subjt:  L------------------------------------------------SDGTI-DWYGTEGHASSDKLRSLASNGHLNNVSKFSTLDYLNCKLAKQLAL

Query:  SFPNSASLCDKPFGLIHSDIWGPAPCAT---------------------------------QNGRAERKHRHILDSVRAQLLSASCPKIFWGEAALTSVI
        SFP SASLCDKPFGLIHSDIWGPAPC T                                 QNGRAERKHRHILD                      SVI
Subjt:  SFPNSASLCDKPFGLIHSDIWGPAPCAT---------------------------------QNGRAERKHRHILDSVRAQLLSASCPKIFWGEAALTSVI

Query:  HNISPFERLHGTPPSYSNLKIFGCACFVLLHPHEHTKLEPRARLCCFLGYGTKHKGFRCWDPISQRLRISRHVTFWEHHMFSSLSSFHASLSSSQSFFTD
        HNISPFERL+GTPP+YS+LK+FGCACFVLLH HEHTKLEPRARLCCFLGYGT+HKGFRCWDPISQRLRISRHVTFWEH MFSSLSSFHASLSS  SFFTD
Subjt:  HNISPFERLHGTPPSYSNLKIFGCACFVLLHPHEHTKLEPRARLCCFLGYGTKHKGFRCWDPISQRLRISRHVTFWEHHMFSSLSSFHASLSSSQSFFTD

Query:  PSTTLFPTPDSPPNTIPYPPLSSELTPSHTTSTLPDLPSISSEESEPTPV
        PST LFPTPDSP NT   PPL+SELT SHTTS LP+LPS+  EE E  PV
Subjt:  PSTTLFPTPDSPPNTIPYPPLSSELTPSHTTSTLPDLPSISSEESEPTPV

A0A5D3CPU5 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-25771.66Show/hide
Query:  MQKNNVARLISTILDGTNYITWAHLMRSFLIGRKLWRIVIGDITKPLKPTTPMKNIDNTANTSD------------------------------------
        MQKNNVARLISTILD                         GDITKP+KPTTPMKNIDNTANTSD                                    
Subjt:  MQKNNVARLISTILDGTNYITWAHLMRSFLIGRKLWRIVIGDITKPLKPTTPMKNIDNTANTSD------------------------------------

Query:  ------LDAFDTAKELWDFLSTRFQSIGLAHYYQLYSTLINLNQEVGQSVNEYLATLQPIWTQLDQAKINPDHIRLIKVLMGLRPEYESVRAALLHRDPL
              LD+FDTAKELWDFL+TRFQSIGLAHYYQLYSTLINLNQEVGQSVNEYLATLQPIWTQLDQ KINPDHIRLIKVLMGLRPEYESVRAALLHR+PL
Subjt:  ------LDAFDTAKELWDFLSTRFQSIGLAHYYQLYSTLINLNQEVGQSVNEYLATLQPIWTQLDQAKINPDHIRLIKVLMGLRPEYESVRAALLHRDPL

Query:  PSLDAAVQEILFEEKRLGIVSALPSD----------PNGTNIVTNEVTFWIIVQLVHLVPLVTHRNPSSPLKLVISSNSTALAVTPGTSWLLDSTYCNHM
        PSLDAAVQEILFEEKRLGIVSALPSD          PNGTNIVTNEVTFWIIVQLVHLVPLVTHRNPSSPLKLVISSNSTALAVTPGTSWLLDSTYCNHM
Subjt:  PSLDAAVQEILFEEKRLGIVSALPSD----------PNGTNIVTNEVTFWIIVQLVHLVPLVTHRNPSSPLKLVISSNSTALAVTPGTSWLLDSTYCNHM

Query:  TSGISLLSSHIPVHSLPPIHSTDGNHMSISHIGTVNTPTIKLSNTYHVPNLTYNLASVGQ------LCLGLSDGTI-DWYGTEGHASSDKLRSLASNGHL
        TSGISLLSSHIPVHSLPPIHSTDG+HMSISHIGT ++ T ++  T       + L S+        +    +D TI  W+   GHASSDKLRSLASNGHL
Subjt:  TSGISLLSSHIPVHSLPPIHSTDGNHMSISHIGTVNTPTIKLSNTYHVPNLTYNLASVGQ------LCLGLSDGTI-DWYGTEGHASSDKLRSLASNGHL

Query:  NNVSKFSTLDYLNCKLAKQLALSFPNSASLCDKPFGLIHSDIWGPAPCAT--------------------------------------------------
        NNVSKFSTLDYLNCKLAKQLALSFPNSASLCDKPFGLIHSDIWGPAPCAT                                                  
Subjt:  NNVSKFSTLDYLNCKLAKQLALSFPNSASLCDKPFGLIHSDIWGPAPCAT--------------------------------------------------

Query:  ----QNGRAERKHRHILDSVRAQLLSASCPKIFWGEAALTS----------VIHNISPFERLHGTPPSYSNLKIFGCACFVLLHPHEHTKLEPRARLCCF
            QNGRAERKHRHILDSVRAQLLSASCPK FWGEAA+TS          VIHNISPFERLHGTPPSYSNLKIFGC CFVLLHPHEHTKLEPRARLCCF
Subjt:  ----QNGRAERKHRHILDSVRAQLLSASCPKIFWGEAALTS----------VIHNISPFERLHGTPPSYSNLKIFGCACFVLLHPHEHTKLEPRARLCCF

Query:  LGYGTKHKGFRCWDPISQRLRISRHVTFWEHHMFSSLSSFHASLSSSQSFFTDPSTTLFPTPDSPPNTIPYPPLSSELTPS
        LGYGT+HKGFRCWDPISQRLRISRHVTFWEH MFSSLSSFHASLSSSQSFFTDPSTTLFPTPDSPPNTIPYPPLSSELTPS
Subjt:  LGYGTKHKGFRCWDPISQRLRISRHVTFWEHHMFSSLSSFHASLSSSQSFFTDPSTTLFPTPDSPPNTIPYPPLSSELTPS

A0A5D3DG18 Retrovirus-related Pol polyprotein from transposon TNT 1-947.5e-22957.95Show/hide
Query:  MQKNNVARLISTILDGTNYITWAHLMRSFLIGRKLWRIVIGDITKPLKP--------------------------TTPMKNIDNTANTSDLDAFDTAKEL
        M+KN+VAR ISTILDGTNYITWAH MRSFLIGRKLWRIV GDITKP+KP                           T + N    A  +  DAFD+AKEL
Subjt:  MQKNNVARLISTILDGTNYITWAHLMRSFLIGRKLWRIVIGDITKPLKP--------------------------TTPMKNIDNTANTSDLDAFDTAKEL

Query:  WDFLSTRFQSIGLAHYYQLYSTLINLNQEVGQSVNEYLATLQPIWTQLDQAKINPDHIRLIKVLMGLRPEYESVRAALLHRDPLPSLDAAVQEILFEEKR
        WDFL TRFQSIGLAHYYQL+STL++LNQE+GQSVNEYLATLQPIWTQLDQAKI+PDHIRLIKVLMGLRPEYESVRAALLHR+PLPSLDAAVQEILFEEKR
Subjt:  WDFLSTRFQSIGLAHYYQLYSTLINLNQEVGQSVNEYLATLQPIWTQLDQAKINPDHIRLIKVLMGLRPEYESVRAALLHRDPLPSLDAAVQEILFEEKR

Query:  LGIVSALPSDP--NGTNI-VTNEVTFWIIVQL-------------------VHLV----------------PLVTHRNPSSP------------------
        LGIVS+L SD     T++   NE  F    +L                    H++                P  +H+  SS                   
Subjt:  LGIVSALPSDP--NGTNI-VTNEVTFWIIVQL-------------------VHLV----------------PLVTHRNPSSP------------------

Query:  -----LKLVISSNSTALAVTPGTSWLLDSTYCNHMTSGISLLSSHIPVHSLPPIHSTDGNHMSISHIGTVNTPTIKLSNTYHVPNLTYNLASVGQLC-LG
             LK VISS STALAVTPGTSWLLDS  CNHMTS ISLLSSHIPV SLPPIHS DGN MSISHIGTVNTPTIKLSNTYHVPNLT+NLASVGQLC LG
Subjt:  -----LKLVISSNSTALAVTPGTSWLLDSTYCNHMTSGISLLSSHIPVHSLPPIHSTDGNHMSISHIGTVNTPTIKLSNTYHVPNLTYNLASVGQLC-LG

Query:  L------------------------------------------------SDGTI-DWYGTEGHASSDKLRSLASNGHLNNVSKFSTLDYLNCKLAKQLAL
        L                                                +D T+  W+   GHASS+KLRSL S G LNNVS+F+T D L+CK+AKQ AL
Subjt:  L------------------------------------------------SDGTI-DWYGTEGHASSDKLRSLASNGHLNNVSKFSTLDYLNCKLAKQLAL

Query:  SFPNSASLCDKPFGLIHSDIWGPAPCAT------------------------------------------------------------------------
        SFP SASLCDKPFGLIHSDIWGPAPC T                                                                        
Subjt:  SFPNSASLCDKPFGLIHSDIWGPAPCAT------------------------------------------------------------------------

Query:  ------------QNGRAERKHRHILDSVRAQLLSASCPKIFWGEAALTS----------VIHNISPFERLHGTPPSYSNLKIFGCACFVLLHPHEHTKLE
                    QNGRAERKHRHILDSVRAQLLS SCP+ FWGEAALTS          VIHNISPFERL+GTPP+YS+LK+FGCACFVLLH HEHTKLE
Subjt:  ------------QNGRAERKHRHILDSVRAQLLSASCPKIFWGEAALTS----------VIHNISPFERLHGTPPSYSNLKIFGCACFVLLHPHEHTKLE

Query:  PRARLCCFLGYGTKHKGFRCWDPISQRLRISRHVTFWEHHMFSSLSSFHASLSSSQSFFTDPSTTLFPTPDSPPNTIPYPPLSSELTPSHTTSTLPDLPS
        PRARLCCFLGYGT+HKGFRCWDPISQRLRISRHVTFWEH MFSSLSSFHASLSS  SFFTDPST LFPTPDSP NT   PPL+SELT SHTTS LP+LPS
Subjt:  PRARLCCFLGYGTKHKGFRCWDPISQRLRISRHVTFWEHHMFSSLSSFHASLSSSQSFFTDPSTTLFPTPDSPPNTIPYPPLSSELTPSHTTSTLPDLPS

Query:  ISSEESEPTPV
        +  EE E  PV
Subjt:  ISSEESEPTPV

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.1e-1837.58Show/hide
Query:  NGRAERKHRHILDSVRAQLLSASCPKIFWGEAALTS-VIHNISPFERLHGTPP---------SYSNLKIFGCACFVLLHPHEHTKLEPRARLCCFLGYGT
        NG AER +R I++ VR+ L  A  PK FWGEA  T+  + N SP   L    P         SYS+LK+FGC  F  +   + TKL+ ++  C F+GYG 
Subjt:  NGRAERKHRHILDSVRAQLLSASCPKIFWGEAALTS-VIHNISPFERLHGTPP---------SYSNLKIFGCACFVLLHPHEHTKLEPRARLCCFLGYGT

Query:  KHKGFRCWDPISQRLRISRHVTFWEHHMFSSLS-SFHASLSSSQSFFTDPSTTLFPT
        +  G+R WDP+ +++  SR V F E  + ++   S         +F T PST+  PT
Subjt:  KHKGFRCWDPISQRLRISRHVTFWEHHMFSSLS-SFHASLSSSQSFFTDPSTTLFPT

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.0e-2034.95Show/hide
Query:  SDIWGPAPCATQNGRAERKHRHILDSVRAQLLSASCPKIFWGEAAL----------TSVIHNISPFERLHGTPPSYSNLKIFGCACFVLLHPHEHTKLEP
        S +  P      NG +ERKHRHI+++    L  AS PK +W  A            T ++   SPF++L GT P+Y  L++FGCAC+  L P+   KL+ 
Subjt:  SDIWGPAPCATQNGRAERKHRHILDSVRAQLLSASCPKIFWGEAAL----------TSVIHNISPFERLHGTPPSYSNLKIFGCACFVLLHPHEHTKLEP

Query:  RARLCCFLGYGTKHKGFRCWDPISQRLRISRHVTFWEHHMFSSLSSFHASLSSSQSFFTDPSTTLFPTPDSPPNTIPYPPLSSELTPSHTTSTLPDLPSI
        ++R C FLGY      + C    + RL ISRHV F E+      S++ A+LS  Q    + S    P    P  T P  P  S   P H  +T P  PS 
Subjt:  RARLCCFLGYGTKHKGFRCWDPISQRLRISRHVTFWEHHMFSSLSSFHASLSSSQSFFTDPSTTLFPTPDSPPNTIPYPPLSSELTPSHTTSTLPDLPSI

Query:  SSEESE
            S+
Subjt:  SSEESE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.3e-2235.43Show/hide
Query:  SDIWGPAPCATQNGRAERKHRHILDSVRAQLLSASCPKIFWGEA----------ALTSVIHNISPFERLHGTPPSYSNLKIFGCACFVLLHPHEHTKLEP
        S    P      NG +ERKHRHI++     L  AS PK +W  A            T ++   SPF++L G PP+Y  LK+FGCAC+  L P+   KLE 
Subjt:  SDIWGPAPCATQNGRAERKHRHILDSVRAQLLSASCPKIFWGEA----------ALTSVIHNISPFERLHGTPPSYSNLKIFGCACFVLLHPHEHTKLEP

Query:  RARLCCFLGYGTKHKGFRCWDPISQRLRISRHVTFWEHHMFSSLSSFHASLSSSQSFFTDPS----TTLFPTP---DSPP------NTIPYPPLSSE--L
        +++ C F+GY      + C    + RL  SRHV F E     S ++F  S S  Q   + P+    TTL  TP    +PP      +T P PP S     
Subjt:  RARLCCFLGYGTKHKGFRCWDPISQRLRISRHVTFWEHHMFSSLSSFHASLSSSQSFFTDPS----TTLFPTP---DSPP------NTIPYPPLSSE--L

Query:  TPSHTTSTLPDLPSISSEESEPT
        T   ++S LP     S   SEPT
Subjt:  TPSHTTSTLPDLPSISSEESEPT

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGAAAAACAATGTTGCACGCCTCATTAGCACCATTCTTGATGGTACAAATTATATTACATGGGCACACCTAATGAGGAGTTTTTTGATAGGTAGGAAATTATGGCG
TATTGTTATAGGAGATATCACCAAACCTCTCAAACCCACTACCCCAATGAAAAACATCGATAATACTGCAAACACTTCAGATCTTGATGCTTTCGATACTGCAAAAGAAC
TTTGGGATTTTTTGTCTACACGTTTTCAGTCTATAGGTCTTGCTCATTATTATCAGTTGTATTCTACACTCATTAATCTAAATCAGGAAGTGGGCCAATCTGTGAACGAG
TATCTAGCTACTCTTCAACCCATTTGGACTCAGTTAGACCAGGCAAAAATTAACCCTGATCATATTCGTCTTATCAAAGTCTTAATGGGTCTCAGACCAGAATATGAATC
CGTTCGTGCTGCTCTTTTACATCGCGACCCTCTACCTTCTCTTGACGCTGCTGTTCAGGAAATCTTATTTGAGGAGAAAAGACTTGGCATTGTCTCTGCTCTGCCATCTG
ACCCTAATGGAACCAATATTGTCACAAACGAGGTCACATTTTGGATTATTGTCCAACTCGTCCACCTCGTCCCTCTGGTCACTCACAGAAACCCAAGTTCTCCTTTAAAA
CTGGTGATCTCTTCCAACTCTACTGCTCTTGCTGTCACCCCAGGTACCTCTTGGCTTCTTGACTCAACTTATTGTAATCACATGACTTCTGGAATTTCATTGTTATCCTC
TCATATCCCTGTTCACTCACTTCCTCCAATTCACTCTACTGATGGTAATCACATGTCTATTTCTCACATTGGCACTGTTAATACACCCACCATAAAACTTTCCAACACCT
ACCATGTCCCCAATCTTACATACAACCTAGCCTCTGTTGGCCAATTATGTTTAGGACTCTCAGACGGGACAATTGATTGGTACGGGACAGAAGGTCATGCATCCTCTGAT
AAACTCCGTAGTTTAGCTTCTAATGGTCATTTGAATAATGTCTCTAAGTTTAGTACTCTTGACTATTTAAATTGCAAACTAGCCAAACAACTTGCTTTGTCCTTTCCTAA
CTCTGCTTCTTTATGTGATAAACCTTTTGGCCTAATTCACTCTGACATTTGGGGACCTGCTCCATGTGCTACGCAAAATGGACGAGCAGAACGTAAACACCGTCACATTC
TAGACTCTGTTCGTGCTCAACTCCTCTCTGCCTCATGCCCTAAAATTTTTTGGGGAGAAGCTGCCCTCACCTCTGTCATACACAATATTTCCCCATTTGAACGCCTACAC
GGTACTCCACCCTCCTACTCTAATCTCAAAATCTTTGGTTGTGCATGTTTTGTATTATTACACCCTCATGAACATACAAAACTTGAACCTCGTGCTCGTCTATGTTGTTT
CTTGGGTTATGGCACTAAACACAAAGGTTTTCGTTGTTGGGATCCTATCTCTCAACGATTACGTATTTCTCGTCATGTCACATTTTGGGAACATCATATGTTTTCTAGTC
TTTCTTCATTTCATGCCTCTCTATCTAGTTCTCAATCATTCTTTACTGATCCTTCTACTACTCTCTTCCCTACACCTGATTCACCACCCAACACTATCCCTTATCCTCCA
CTCTCATCTGAGCTCACTCCATCTCACACTACCTCTACGCTCCCGGATCTTCCATCTATCTCCTCTGAGGAATCTGAACCTACACCTGTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAGAAAAACAATGTTGCACGCCTCATTAGCACCATTCTTGATGGTACAAATTATATTACATGGGCACACCTAATGAGGAGTTTTTTGATAGGTAGGAAATTATGGCG
TATTGTTATAGGAGATATCACCAAACCTCTCAAACCCACTACCCCAATGAAAAACATCGATAATACTGCAAACACTTCAGATCTTGATGCTTTCGATACTGCAAAAGAAC
TTTGGGATTTTTTGTCTACACGTTTTCAGTCTATAGGTCTTGCTCATTATTATCAGTTGTATTCTACACTCATTAATCTAAATCAGGAAGTGGGCCAATCTGTGAACGAG
TATCTAGCTACTCTTCAACCCATTTGGACTCAGTTAGACCAGGCAAAAATTAACCCTGATCATATTCGTCTTATCAAAGTCTTAATGGGTCTCAGACCAGAATATGAATC
CGTTCGTGCTGCTCTTTTACATCGCGACCCTCTACCTTCTCTTGACGCTGCTGTTCAGGAAATCTTATTTGAGGAGAAAAGACTTGGCATTGTCTCTGCTCTGCCATCTG
ACCCTAATGGAACCAATATTGTCACAAACGAGGTCACATTTTGGATTATTGTCCAACTCGTCCACCTCGTCCCTCTGGTCACTCACAGAAACCCAAGTTCTCCTTTAAAA
CTGGTGATCTCTTCCAACTCTACTGCTCTTGCTGTCACCCCAGGTACCTCTTGGCTTCTTGACTCAACTTATTGTAATCACATGACTTCTGGAATTTCATTGTTATCCTC
TCATATCCCTGTTCACTCACTTCCTCCAATTCACTCTACTGATGGTAATCACATGTCTATTTCTCACATTGGCACTGTTAATACACCCACCATAAAACTTTCCAACACCT
ACCATGTCCCCAATCTTACATACAACCTAGCCTCTGTTGGCCAATTATGTTTAGGACTCTCAGACGGGACAATTGATTGGTACGGGACAGAAGGTCATGCATCCTCTGAT
AAACTCCGTAGTTTAGCTTCTAATGGTCATTTGAATAATGTCTCTAAGTTTAGTACTCTTGACTATTTAAATTGCAAACTAGCCAAACAACTTGCTTTGTCCTTTCCTAA
CTCTGCTTCTTTATGTGATAAACCTTTTGGCCTAATTCACTCTGACATTTGGGGACCTGCTCCATGTGCTACGCAAAATGGACGAGCAGAACGTAAACACCGTCACATTC
TAGACTCTGTTCGTGCTCAACTCCTCTCTGCCTCATGCCCTAAAATTTTTTGGGGAGAAGCTGCCCTCACCTCTGTCATACACAATATTTCCCCATTTGAACGCCTACAC
GGTACTCCACCCTCCTACTCTAATCTCAAAATCTTTGGTTGTGCATGTTTTGTATTATTACACCCTCATGAACATACAAAACTTGAACCTCGTGCTCGTCTATGTTGTTT
CTTGGGTTATGGCACTAAACACAAAGGTTTTCGTTGTTGGGATCCTATCTCTCAACGATTACGTATTTCTCGTCATGTCACATTTTGGGAACATCATATGTTTTCTAGTC
TTTCTTCATTTCATGCCTCTCTATCTAGTTCTCAATCATTCTTTACTGATCCTTCTACTACTCTCTTCCCTACACCTGATTCACCACCCAACACTATCCCTTATCCTCCA
CTCTCATCTGAGCTCACTCCATCTCACACTACCTCTACGCTCCCGGATCTTCCATCTATCTCCTCTGAGGAATCTGAACCTACACCTGTCTGA
Protein sequenceShow/hide protein sequence
MQKNNVARLISTILDGTNYITWAHLMRSFLIGRKLWRIVIGDITKPLKPTTPMKNIDNTANTSDLDAFDTAKELWDFLSTRFQSIGLAHYYQLYSTLINLNQEVGQSVNE
YLATLQPIWTQLDQAKINPDHIRLIKVLMGLRPEYESVRAALLHRDPLPSLDAAVQEILFEEKRLGIVSALPSDPNGTNIVTNEVTFWIIVQLVHLVPLVTHRNPSSPLK
LVISSNSTALAVTPGTSWLLDSTYCNHMTSGISLLSSHIPVHSLPPIHSTDGNHMSISHIGTVNTPTIKLSNTYHVPNLTYNLASVGQLCLGLSDGTIDWYGTEGHASSD
KLRSLASNGHLNNVSKFSTLDYLNCKLAKQLALSFPNSASLCDKPFGLIHSDIWGPAPCATQNGRAERKHRHILDSVRAQLLSASCPKIFWGEAALTSVIHNISPFERLH
GTPPSYSNLKIFGCACFVLLHPHEHTKLEPRARLCCFLGYGTKHKGFRCWDPISQRLRISRHVTFWEHHMFSSLSSFHASLSSSQSFFTDPSTTLFPTPDSPPNTIPYPP
LSSELTPSHTTSTLPDLPSISSEESEPTPV