CuGenDBv2

Lag0021886 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0021886
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase Ty1/copia-type domain-containing protein
Genome location: chr7:13486058..13495274
RNA-Seq Expression: Lag0021886
Synteny: Lag0021886
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
    IPR012337 - Ribonuclease H-like superfamily
    IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
    IPR029472 - Retrotransposon Copia-like, N-terminal
    IPR036397 - Ribonuclease H superfamily
    IPR043502 - DNA/RNA polymerase superfamily


Homology

GenBank top hits (e value, % identity, alignment)
KAA0045362.1 putative mitochondrial protein [Cucumis melo var. makuwa] (e value: 4.7e-141; % identity: 54.34)
Query:  DPTPLLTLLNLSQTPPQNPYSVIHTQL-----PTVDNSSPSTVELLDNVTCAD-SLCQNADTVHSMPTQTEPASNAPIDGSSLQPTNTHSMQTRAKSGIF
        +P+ LLTLLN+ Q  P N   + + Q      PTVD +  S ++   N TC   ++    DT + +   T    + P       P NTH+MQTRAKS IF
Subjt:  DPTPLLTLLNLSQTPPQNPYSVIHTQL-----PTVDNSSPSTVELLDNVTCAD-SLCQNADTVHSMPTQTEPASNAPIDGSSLQPTNTHSMQTRAKSGIF

Query:  KPKAFLSTMMAFVPTDPSSFTEASKYLEWRNAMCEEFNALQEQGTWTLVPRLPSMNVVGCKWVFRTKYHPDGSIARHKAAWLQK----------------
        KPKAF  T    +PT P+S+TEASKY EWR AM EEFNALQ QGTW+LVPRLPS NVVGCKWVFR KY+PDG+IARHKA  + K                
Subjt:  KPKAFLSTMMAFVPTDPSSFTEASKYLEWRNAMCEEFNALQEQGTWTLVPRLPSMNVVGCKWVFRTKYHPDGSIARHKAAWLQK----------------

Query:  VITK-----------------LKLDVKNAFLHGHLEEEVYMSQPSGFLDKSCPTSVCLLHKSLYGLKQAPRAWFDR-----------FTSDLLTLGF---
        V+ K                  +LDVKN F HG L+E VYM+Q + F DK+CP  VCLLHKSLY    A  + F R           +  D++  G    
Subjt:  VITK-----------------LKLDVKNAFLHGHLEEEVYMSQPSGFLDKSCPTSVCLLHKSLYGLKQAPRAWFDR-----------FTSDLLTLGF---

Query:  --------IASAADPSCLYVLEIFPWLEVHSSADGIFVNQAKYLNDLLHTSGMTSAKSCLTPMSTTIDLYASAPLFNDATLYRQLVGSLQYLTFTRPDIT
                +A     S L  L+ F  LE+ SS DGIFVNQAKYLNDLLHTSGMTSAKSC+TPMST++DLY  AP FND  LYR+LVGSLQYLTFTRPDI 
Subjt:  --------IASAADPSCLYVLEIFPWLEVHSSADGIFVNQAKYLNDLLHTSGMTSAKSCLTPMSTTIDLYASAPLFNDATLYRQLVGSLQYLTFTRPDIT

Query:  FAANRVS----------------------------------DMSLTAYCDSDWAGDTSDRRSTSGFIAFLGSSPISWSAKKQPTVSRSSTEAEYRSLATT
         + NRVS                                    SL  +CDSDWA DTSDRRSTSGFIAFLGS+PISWS+KKQ  VSRSSTEAEY SLATT
Subjt:  FAANRVS----------------------------------DMSLTAYCDSDWAGDTSDRRSTSGFIAFLGSSPISWSAKKQPTVSRSSTEAEYRSLATT

Query:  TADLYWIRQLLCDLHIPLTTPPTLWCDNVSAISLANNPVFHARTKHIEIDYHFVREKVVRKDISM
        TADLYWIRQLLCDLH+PL T PTLWCDNVSAISLA+NPVFHARTKHIEIDYHFV EKV+ KDIS+
Subjt:  TADLYWIRQLLCDLHIPLTTPPTLWCDNVSAISLANNPVFHARTKHIEIDYHFVREKVVRKDISM

KAA0048199.1 putative mitochondrial protein [Cucumis melo var. makuwa] (e value: 4.6e-152; % identity: 44.74)
Query:  NSSSMSSPLFLLSNICNLVPLRLDSTNYVLWKFQISSILKAHSLFGHIDDSLPQPRKFIRSSTQTDTSDIDNTNTNTNTNTNTSDTDTATEISPDYLQWV
        N S  +S LFLLSNICNLVP+RLDSTNYVLWK+Q+SSILKAHSLFGHIDD+LP P K + SST                        T ++I+P YLQW+
Subjt:  NSSSMSSPLFLLSNICNLVPLRLDSTNYVLWKFQISSILKAHSLFGHIDDSLPQPRKFIRSSTQTDTSDIDNTNTNTNTNTNTSDTDTATEISPDYLQWV

Query:  ARDQALITLINATLSPSALAHVVGTASAKELWKS---------IKDLVDRLAAASITIDDEEILVHTLNGLPDEFGAFRTSIRTRSGSLSLEELHALLDA
        +RDQALITLINATLS SAL HVV +   K+   S         IK LVD+L AASI+++DEEILVHTLNGLP  F AF TSIRTRS +  L+        
Subjt:  ARDQALITLINATLSPSALAHVVGTASAKELWKS---------IKDLVDRLAAASITIDDEEILVHTLNGLPDEFGAFRTSIRTRSGSLSLEELHALLDA

Query:  EEKTLMAAAGEEDDNLRGRANHFGQYHGWSWSNQLGILGSGLNSGQSIGSGAALGIKSLEKLYTGASINGLYPIPSPSALSSSERKHRHIVETAMSLLFH
                                                                                                            
Subjt:  EEKTLMAAAGEEDDNLRGRANHFGQYHGWSWSNQLGILGSGLNSGQSIGSGAALGIKSLEKLYTGASINGLYPIPSPSALSSSERKHRHIVETAMSLLFH

Query:  ASVPLEFWPYAFSTAVFLINRMSSSSLNMSSPFETLFGYTPDLHHLKVFGDPTPLLTLLNLSQTPPQNPYSVIHTQLPTVDNSSPSTVELLDNVTCADSL
              F P++F                   P+       P L  LK F  P    T L+++                  D  + ST   LD VT     
Subjt:  ASVPLEFWPYAFSTAVFLINRMSSSSLNMSSPFETLFGYTPDLHHLKVFGDPTPLLTLLNLSQTPPQNPYSVIHTQLPTVDNSSPSTVELLDNVTCADSL

Query:  CQNADTVHSMPTQTEPASNAPIDGSSLQPTNTHSMQTRAKSGIFKPKAFLSTMMAFVPTDPSSFTEASKYLEWRNAMCEEFNALQEQGTWTLVPRLPSMN
                               G +L                                   S  EASKY EWR AM EEFNALQ QGTW LVPRLPSMN
Subjt:  CQNADTVHSMPTQTEPASNAPIDGSSLQPTNTHSMQTRAKSGIFKPKAFLSTMMAFVPTDPSSFTEASKYLEWRNAMCEEFNALQEQGTWTLVPRLPSMN

Query:  VVGCKWVFRTKYHPDGSIARHKAAWLQKVITKLKLDVKNAFLHGHLEEEVYMSQPSGFLDKSCPTSVCLLHKSLYGLKQAPRAWFDRFTSDLLTLGFIAS
        VVGCKWVFR KY+PDG+IA HKA          +L VK     G+ +E VYM+QP+GF DK+CP  V LLHKSLY L     +W                
Subjt:  VVGCKWVFRTKYHPDGSIARHKAAWLQKVITKLKLDVKNAFLHGHLEEEVYMSQPSGFLDKSCPTSVCLLHKSLYGLKQAPRAWFDRFTSDLLTLGFIAS

Query:  AADPSCLYVLEIFPWLEVHSSADGIFVNQAKYLNDLLHTSGMTSAKSCLTPMSTTIDLYASAPLFNDATLYRQLVGSLQYLTFTRPDITFAANRVSD---
        + +P             + SS DGIF NQAKYLNDLLHTSGMTSAKSC+TPMST++DLY  AP FND +LYR+LVGSLQYLTFTRPDI F+ NRVS    
Subjt:  AADPSCLYVLEIFPWLEVHSSADGIFVNQAKYLNDLLHTSGMTSAKSCLTPMSTTIDLYASAPLFNDATLYRQLVGSLQYLTFTRPDITFAANRVSD---

Query:  ------MSLTAYCDSDWAGDTSDRRSTSGFIAFLGSSPISWSAKKQPTVSRSSTEAEYRSLATTTADLYWIRQLLCDLHIPLTTPPTLWCDNVSAISLAN
               S          GDTSD+RSTSGFIAFL S+PISWS+KKQPTVSRSSTEAEYRSLATTT DLYWI+QLLCDLH+PL T  TLWCDNV AISLA+
Subjt:  ------MSLTAYCDSDWAGDTSDRRSTSGFIAFLGSSPISWSAKKQPTVSRSSTEAEYRSLATTTADLYWIRQLLCDLHIPLTTPPTLWCDNVSAISLAN

Query:  NPVFHARTKHIEIDYHFVREKVVRKDI
        NPVFHARTKHIEIDYHFVREKV+RKDI
Subjt:  NPVFHARTKHIEIDYHFVREKVVRKDI

KAA0056771.1 putative mitochondrial protein [Cucumis melo var. makuwa] (e value: 2.1e-136; % identity: 40.46)
Query:  SSTTTTVNSSSM-----SSPLFLLSNICNLVPLRLDSTNYVLWKFQISSILKAHSLFGHIDDSLPQPRKFIRSSTQTDTSDIDNTNTNTNTNTNTSDTDT
        S T+  +  SS      +S LFLLSNICNLVP+RLDSTNYVLWK+Q+SSILKAHSLFGHIDD+LP P K + SST                        T
Subjt:  SSTTTTVNSSSM-----SSPLFLLSNICNLVPLRLDSTNYVLWKFQISSILKAHSLFGHIDDSLPQPRKFIRSSTQTDTSDIDNTNTNTNTNTNTSDTDT

Query:  ATEISPDYLQWVARDQALITLINATLSPSALAHVVGTASAKELWKSI-KDLVDRLAAASITIDDEEILVHTLNGLPDEFGAFRTSIRTRSGSLSLEELHA
         +EI+P YLQW++RDQALITLINATLS SAL HVVG+ ++K LW S+ K LV +LAAASI++ DEEILVHTLNGL   F AF TSIRTRSG++SLEELH 
Subjt:  ATEISPDYLQWVARDQALITLINATLSPSALAHVVGTASAKELWKSI-KDLVDRLAAASITIDDEEILVHTLNGLPDEFGAFRTSIRTRSGSLSLEELHA

Query:  LLDAEEKTLMAAAGEEDDNLRGRANHFGQYHGWSWSNQLGILGSGLNSGQSIGSGAALGIKSLEKLYTGASINGLYPIPSPSALSSSERKHRHIVETAMS
        LL  EE T++  +  E       A H  Q+HG              N G+     + + I S   + T A+   L                         
Subjt:  LLDAEEKTLMAAAGEEDDNLRGRANHFGQYHGWSWSNQLGILGSGLNSGQSIGSGAALGIKSLEKLYTGASINGLYPIPSPSALSSSERKHRHIVETAMS

Query:  LLFHASVPLEFWPYAFSTAVFLINRMSSSSLNMSSPFETLFGYTPDLHHLKVFGDPTPLLTLLNLSQTPPQNPYSVIHTQLPTVDNSSPSTVELLDNVTC
                        S+ V   N ++++++N+  P                                PP                              
Subjt:  LLFHASVPLEFWPYAFSTAVFLINRMSSSSLNMSSPFETLFGYTPDLHHLKVFGDPTPLLTLLNLSQTPPQNPYSVIHTQLPTVDNSSPSTVELLDNVTC

Query:  ADSLCQNADTVHSMPTQTEPASNAPIDGSSLQPTNTHSMQTRAKSGIFKPKAFLSTMMAFVPTDPSSFTEASKYLEWRNAMCEEFNALQEQGTWTLVPRL
                                        P N H++QTRAKS                                                       
Subjt:  ADSLCQNADTVHSMPTQTEPASNAPIDGSSLQPTNTHSMQTRAKSGIFKPKAFLSTMMAFVPTDPSSFTEASKYLEWRNAMCEEFNALQEQGTWTLVPRL

Query:  PSMNVVGCKWVFRTKYHPDGSIARHKAAWLQKVITKLKLDVKNAFLHGHLEEEVYMSQPSGFLDKSCPTSVCLLHKSLYGLKQAPRAWFDRFTSDLLTLG
                                               DVKNAFLHG L+E VYM+QP+ F DK+CP  VCLLHKSLYGLKQAPRAWF+RFTS L TLG
Subjt:  PSMNVVGCKWVFRTKYHPDGSIARHKAAWLQKVITKLKLDVKNAFLHGHLEEEVYMSQPSGFLDKSCPTSVCLLHKSLYGLKQAPRAWFDRFTSDLLTLG

Query:  FIASAADPS-------------CLYVLEI-------------------------------FPWLEVHSSADGIFVNQAKYLNDLLHTSGMTSAKSCLTPM
        F+AS  DPS              LYV +I                               F  LE+ SS DGIFVNQAKYLNDLLHTS MTSAKSC+T M
Subjt:  FIASAADPS-------------CLYVLEI-------------------------------FPWLEVHSSADGIFVNQAKYLNDLLHTSGMTSAKSCLTPM

Query:  STTIDLYASAPLFNDATLYRQLVGSLQYLTFTRPDITFAANRVSDMSLTAYCDSDWAGDTSDRRSTSGFIAFLGSSPISW--SAKKQPTVSRSSTEAEYR
        ST++DLY   P FND +LYR+LVGSLQY TFT P+I F+ NRVS                    +    + +L  +P       K+Q TVSRSSTEA+YR
Subjt:  STTIDLYASAPLFNDATLYRQLVGSLQYLTFTRPDITFAANRVSDMSLTAYCDSDWAGDTSDRRSTSGFIAFLGSSPISW--SAKKQPTVSRSSTEAEYR

Query:  SLATTTADLYWIRQLLCDLHIPLTTPPTLWCDNVSAISLANNPVFHARTKHIEIDYHFVREKVVRKDISM
        SLATTTADLYWIRQLLCDLH+PL T PTLWC NVSAISLA+NPVF ARTKHIEIDYHFVREKV+RKDIS+
Subjt:  SLATTTADLYWIRQLLCDLHIPLTTPPTLWCDNVSAISLANNPVFHARTKHIEIDYHFVREKVVRKDISM

KAA0061282.1 putative mitochondrial protein [Cucumis melo var. makuwa] (e value: 1.6e-181; % identity: 47.09)
Query:  NSSSMSSPLFLLSNICNLVPLRLDSTNYVLWKFQISSILKAHSLFGHIDDSLPQPRKFIRSSTQTDTSDIDNTNTNTNTNTNTSDTDTATEISPDYLQWV
        N S  +S LFLLSNICNLVP+RLDSTNYVLWK+Q+SSILKAHSLFGHIDD+LP P K + SST                        T +EI+P+YLQW+
Subjt:  NSSSMSSPLFLLSNICNLVPLRLDSTNYVLWKFQISSILKAHSLFGHIDDSLPQPRKFIRSSTQTDTSDIDNTNTNTNTNTNTSDTDTATEISPDYLQWV

Query:  ARDQALITLINATLSPSALAHVVGTASAKELWKS-------------------------------------IKDLVDRLAAASITIDDEEILVHTLNGLP
        +RDQALITLIN TLS SALAHVV + S+K LW S                                     IK LVD+LAAAS++++DEEILVHTLNGLP
Subjt:  ARDQALITLINATLSPSALAHVVGTASAKELWKS-------------------------------------IKDLVDRLAAASITIDDEEILVHTLNGLP

Query:  DEFGAFRTSIRTRSGSLSLEELHALLDAEEKTLMAAAGEEDDNLRGRANHFGQYHGWSWSNQLGILGSGLNSGQSIGSGAALGIKSLEKLYTGASINGLY
          F AFRTSIRTRSG++SLEELH LL +EE  +   +  E       A H  Q+HG                  S G G                    +
Subjt:  DEFGAFRTSIRTRSGSLSLEELHALLDAEEKTLMAAAGEEDDNLRGRANHFGQYHGWSWSNQLGILGSGLNSGQSIGSGAALGIKSLEKLYTGASINGLY

Query:  PIPSPSALSSSERKHRHIVETAMSLLFHASVPLEFWPYAFSTAVFLINRMSSSSLNMSSPFETLFGYTPDLHHLKVFGDPTPLLTLLNLSQTPPQNPYSV
           +P+  SSS   +R                                  +S S  + S                                         
Subjt:  PIPSPSALSSSERKHRHIVETAMSLLFHASVPLEFWPYAFSTAVFLINRMSSSSLNMSSPFETLFGYTPDLHHLKVFGDPTPLLTLLNLSQTPPQNPYSV

Query:  IHTQLPTVDNSSPSTVELLDNVTCADSLCQNADTVHSMPTQTEPASNAPIDGSSLQPTNTHSMQTRAKSGIFKPKAFLSTMMAFVPTDPSSFTEASKYLE
         +  + T +  + +T+ L D                       P S          P NTH+MQTRAKSGIFKPKAF  T    +PT P+S+TEASKY E
Subjt:  IHTQLPTVDNSSPSTVELLDNVTCADSLCQNADTVHSMPTQTEPASNAPIDGSSLQPTNTHSMQTRAKSGIFKPKAFLSTMMAFVPTDPSSFTEASKYLE

Query:  WRNAMCEEFNALQEQGTWTLVPRLPSMNVVGCKWVFRTKYHPDGSIARHKAAWLQKVITKLKLDVKNAFLHGHLEEEVYMSQPSGFLDKSCPTSVCLLHK
        WR  M EEFNALQ QGTW+LVPRLPSMNVVGCKWVFR KY+ DG+IARHKA  + K               G+ +E VYM+QP+GF +K+CP  VCLLHK
Subjt:  WRNAMCEEFNALQEQGTWTLVPRLPSMNVVGCKWVFRTKYHPDGSIARHKAAWLQKVITKLKLDVKNAFLHGHLEEEVYMSQPSGFLDKSCPTSVCLLHK

Query:  SLYGLKQAPRAWFDRFTSDLLTLGFIASAADPS-------------CLYV-------------------------------LEIFPWLEVHSSADGIFVN
        SLYGLKQAPRAWF+RFTS L TLGF+AS ADPS              LYV                               L+ F  LE+ SS DGI VN
Subjt:  SLYGLKQAPRAWFDRFTSDLLTLGFIASAADPS-------------CLYV-------------------------------LEIFPWLEVHSSADGIFVN

Query:  QAKYLNDLLHTSGMTSAKSCLTPMSTTIDLYASAPLFNDATLYRQLVGSLQYLTFTRPDITF--AANRVSDMSLTAYCDSDWAGDTSDRRSTSGFIAFLG
        QA+YLNDLLHTSGMTSAKSC+TP+ST++DLY  AP FND +LYR+L          +P +    A  R+    L   C     GDTSDRRSTSGFIAFL 
Subjt:  QAKYLNDLLHTSGMTSAKSCLTPMSTTIDLYASAPLFNDATLYRQLVGSLQYLTFTRPDITF--AANRVSDMSLTAYCDSDWAGDTSDRRSTSGFIAFLG

Query:  SSPISWSAKKQPTVSRSSTEAEYRSLATTTADLYWIRQLLCDLHIPLTTPPTLWCDNVSAISLANNPVFHARTKHIEIDYHFVREKVVRKDISM
        S+PISWS+KKQ T+SRSSTEAEYRSLATTTADLYWIRQLL DLH+PL T P LWCDN+SAISLA+NPVFHARTKHIEIDYHFVREKV+RKDIS+
Subjt:  SSPISWSAKKQPTVSRSSTEAEYRSLATTTADLYWIRQLLCDLHIPLTTPPTLWCDNVSAISLANNPVFHARTKHIEIDYHFVREKVVRKDISM

TYJ97594.1 putative mitochondrial protein [Cucumis melo var. makuwa] (e value: 9.2e-137; % identity: 40.46)
Query:  SSTTTTVNSSSM-----SSPLFLLSNICNLVPLRLDSTNYVLWKFQISSILKAHSLFGHIDDSLPQPRKFIRSSTQTDTSDIDNTNTNTNTNTNTSDTDT
        S T+  +  SS      +S LFLLSNICNLVP+RLDSTNYVLWK+Q+SSILKAHSLFGHIDD+LP P K + SST                        T
Subjt:  SSTTTTVNSSSM-----SSPLFLLSNICNLVPLRLDSTNYVLWKFQISSILKAHSLFGHIDDSLPQPRKFIRSSTQTDTSDIDNTNTNTNTNTNTSDTDT

Query:  ATEISPDYLQWVARDQALITLINATLSPSALAHVVGTASAKELWKSI-KDLVDRLAAASITIDDEEILVHTLNGLPDEFGAFRTSIRTRSGSLSLEELHA
         +EI+P+YLQW++RDQALITLINATLS SAL HVVG+ ++K LW S+ K LV +LAAASI++ DEEILVHTLNGL   F AF TSIRTRSG++SLEELH 
Subjt:  ATEISPDYLQWVARDQALITLINATLSPSALAHVVGTASAKELWKSI-KDLVDRLAAASITIDDEEILVHTLNGLPDEFGAFRTSIRTRSGSLSLEELHA

Query:  LLDAEEKTLMAAAGEEDDNLRGRANHFGQYHGWSWSNQLGILGSGLNSGQSIGSGAALGIKSLEKLYTGASINGLYPIPSPSALSSSERKHRHIVETAMS
        LL  EE T++  +  E       A H  Q+HG              N G+     + + I S   + T A+   L                         
Subjt:  LLDAEEKTLMAAAGEEDDNLRGRANHFGQYHGWSWSNQLGILGSGLNSGQSIGSGAALGIKSLEKLYTGASINGLYPIPSPSALSSSERKHRHIVETAMS

Query:  LLFHASVPLEFWPYAFSTAVFLINRMSSSSLNMSSPFETLFGYTPDLHHLKVFGDPTPLLTLLNLSQTPPQNPYSVIHTQLPTVDNSSPSTVELLDNVTC
                        S+ V   N ++++++N+  P                                PP                              
Subjt:  LLFHASVPLEFWPYAFSTAVFLINRMSSSSLNMSSPFETLFGYTPDLHHLKVFGDPTPLLTLLNLSQTPPQNPYSVIHTQLPTVDNSSPSTVELLDNVTC

Query:  ADSLCQNADTVHSMPTQTEPASNAPIDGSSLQPTNTHSMQTRAKSGIFKPKAFLSTMMAFVPTDPSSFTEASKYLEWRNAMCEEFNALQEQGTWTLVPRL
                                        P N H++QTRAKS                                                       
Subjt:  ADSLCQNADTVHSMPTQTEPASNAPIDGSSLQPTNTHSMQTRAKSGIFKPKAFLSTMMAFVPTDPSSFTEASKYLEWRNAMCEEFNALQEQGTWTLVPRL

Query:  PSMNVVGCKWVFRTKYHPDGSIARHKAAWLQKVITKLKLDVKNAFLHGHLEEEVYMSQPSGFLDKSCPTSVCLLHKSLYGLKQAPRAWFDRFTSDLLTLG
                                               DVKNAFLHG L+E VYM+QP+ F DK+CP  VCLLHKSLYGLKQAPRAWF+RFTS L TLG
Subjt:  PSMNVVGCKWVFRTKYHPDGSIARHKAAWLQKVITKLKLDVKNAFLHGHLEEEVYMSQPSGFLDKSCPTSVCLLHKSLYGLKQAPRAWFDRFTSDLLTLG

Query:  FIASAADPS-------------CLYVLEI-------------------------------FPWLEVHSSADGIFVNQAKYLNDLLHTSGMTSAKSCLTPM
        F+AS  DPS              LYV +I                               F  LE+ SS DGIFVNQAKYLNDLLHTS MTSAKSC+T M
Subjt:  FIASAADPS-------------CLYVLEI-------------------------------FPWLEVHSSADGIFVNQAKYLNDLLHTSGMTSAKSCLTPM

Query:  STTIDLYASAPLFNDATLYRQLVGSLQYLTFTRPDITFAANRVSDMSLTAYCDSDWAGDTSDRRSTSGFIAFLGSSPISW--SAKKQPTVSRSSTEAEYR
        ST++DLY   P FND +LYR+LVGSLQY TFT P+I F+ NRVS                    +    + +L  +P       K+Q TVSRSSTEA+YR
Subjt:  STTIDLYASAPLFNDATLYRQLVGSLQYLTFTRPDITFAANRVSDMSLTAYCDSDWAGDTSDRRSTSGFIAFLGSSPISW--SAKKQPTVSRSSTEAEYR

Query:  SLATTTADLYWIRQLLCDLHIPLTTPPTLWCDNVSAISLANNPVFHARTKHIEIDYHFVREKVVRKDISM
        SLATTTADLYWIRQLLCDLH+PL T PTLWC NVSAISLA+NPVF ARTKHIEIDYHFVREKV+RKDIS+
Subjt:  SLATTTADLYWIRQLLCDLHIPLTTPPTLWCDNVSAISLANNPVFHARTKHIEIDYHFVREKVVRKDISM

TrEMBL top hits (e value, % identity, alignment)
A0A2N9EBV4 Integrase catalytic domain-containing protein (e value: 2.7e-142; % identity: 32.96)
Query:  SSTTTTVNSSSMSSPLFLLSNICNLVPLRLDSTNYVLWKFQISSILKAHSLFGHIDDSLPQPRKFIRSSTQTDTSDIDNTNTNTNTNTNTSDTDTATEIS
        SST +++N++S+ SPL LL+N+ NL+  +LDSTNY++WK QIS++L A+S+  H+D S+PQP +F+ S     +                        ++
Subjt:  SSTTTTVNSSSMSSPLFLLSNICNLVPLRLDSTNYVLWKFQISSILKAHSLFGHIDDSLPQPRKFIRSSTQTDTSDIDNTNTNTNTNTNTSDTDTATEIS

Query:  PDYLQWVARDQALITLINATLSPSALAHVVGTASAKELWKS------------------------------------IKDLVDRLAAASITIDDEEILVH
        P +L W  +D+A++TL+ +TL+   LA V+G ++++E+W +                                    IK   DRL+A  + ID+EE+L  
Subjt:  PDYLQWVARDQALITLINATLSPSALAHVVGTASAKELWKS------------------------------------IKDLVDRLAAASITIDDEEILVH

Query:  TLNGLPDEFGAFRTSIRTRSGSLSLEELHALLDAEEKT---------------------------------------------------LMAAAGEED--
         L GLP E+  F ++IRTR G LSLE L  LL  EE++                                                   L  +AG +D  
Subjt:  TLNGLPDEFGAFRTSIRTRSGSLSLEELHALLDAEEKT---------------------------------------------------LMAAAGEED--

Query:  ------------------DNLRGRANHFGQYH----GWSWSNQLGIL------GSGLN--------SGQSIGSGAALGIKSLEKLY--------------
                            L   A+     H      SW    G         + LN           S+G+G  L I+++ K++              
Subjt:  ------------------DNLRGRANHFGQYH----GWSWSNQLGIL------GSGLN--------SGQSIGSGAALGIKSLEKLY--------------

Query:  ---TGASINGLYPIPSPSAL--------------------------------------------------------------------------------
             A + G  P+ S +A                                                                                 
Subjt:  ---TGASINGLYPIPSPSAL--------------------------------------------------------------------------------

Query:  --SSSERKHRHIVETAMSLLFHASVPLEFWPYAFSTAVFLINRMSSSSLNMSSPFETLFGYTPDLHHLKVFGDPTPLLTLLNLSQTPPQNPYSVIHTQLP
            +ERKHRH+V+ A++LL  +++P+ +W YA STA  LINR+ + +L   SP++    +  +   L   G   P   L+  +   PQ+P  +  +  P
Subjt:  --SSSERKHRHIVETAMSLLFHASVPLEFWPYAFSTAVFLINRMSSSSLNMSSPFETLFGYTPDLHHLKVFGDPTPLLTLLNLSQTPPQNPYSVIHTQLP

Query:  TVDNSSPSTVELLDNVTCADSLCQNA--DTVHSMPTQTEP-ASNAPIDGSSLQPTNTHSMQTRAKSGIFKPKAFLSTMMAFVPTDPSSFTEASKYLEWRN
        T  N + ST         A+S  Q+A    ++ +PT   P    +P     L P  TH MQTR+KSGIFKPK   +  + +  T+P+S+T ASK+ +W  
Subjt:  TVDNSSPSTVELLDNVTCADSLCQNA--DTVHSMPTQTEP-ASNAPIDGSSLQPTNTHSMQTRAKSGIFKPKAFLSTMMAFVPTDPSSFTEASKYLEWRN

Query:  AMCEEFNALQEQGTWTLVPRLPSMNVVGCKWVFRTKYHPDGSIARHKAAWLQK------------------------VITKL---------KLDVKNAFL
        AM EEF ALQ+QGTW+LVP   + NVVGCKWV++ K++ DG+IAR+KA  + K                        +I  L         +LDVKNAFL
Subjt:  AMCEEFNALQEQGTWTLVPRLPSMNVVGCKWVFRTKYHPDGSIARHKAAWLQK------------------------VITKL---------KLDVKNAFL

Query:  HGHLEEEVYMSQPSGFLDKSCPTSVCLLHKSLYGLKQAPRAWFDRFTSDLLTLGFIASAADPS-------------CLYV--------------------
        HG L+EEVYM+QP G++D + P  VC LHKS+YGLKQAPRAWF+ FTS LL LGF AS AD S              LYV                    
Subjt:  HGHLEEEVYMSQPSGFLDKSCPTSVCLLHKSLYGLKQAPRAWFDRFTSDLLTLGFIASAADPS-------------CLYV--------------------

Query:  -----------LEIFPWLEVHSSADGIFVNQAKYLNDLLHTSGMTSAKSCLTPM--STTIDLYASAPLFNDATLYRQLVGSLQYLTFTRPDITFAANRV-
                   L  F  L+V  +A  +F+ Q KY  DLL    M   K+  +P   +T + L+A  PL  D   YR +VG+L YLTFTRPDI+FA ++V 
Subjt:  -----------LEIFPWLEVHSSADGIFVNQAKYLNDLLHTSGMTSAKSCLTPM--STTIDLYASAPLFNDATLYRQLVGSLQYLTFTRPDITFAANRV-

Query:  ---------------------------------SDMSLTAYCDSDWAGDTSDRRSTSGFIAFLGSSPISWSAKKQPTVSRSSTEAEYRSLATTTADLYWI
                                           ++L+AY D+DWAGD  DRRSTSGF+ +LGS+ I+WSAKKQPTVSRSSTE+EYR+LA  +A+L W+
Subjt:  ---------------------------------SDMSLTAYCDSDWAGDTSDRRSTSGFIAFLGSSPISWSAKKQPTVSRSSTEAEYRSLATTTADLYWI

Query:  RQLLCDLHIPLTTPPTLWCDNVSAISLANNPVFHARTKHIEIDYHFVREKVVRKDI
        R LL DL I +  PP LWCDNVSA+++A+NPVFHARTKHIE+D+HFVRE+V+RKD+
Subjt:  RQLLCDLHIPLTTPPTLWCDNVSAISLANNPVFHARTKHIEIDYHFVREKVVRKDI

A0A2N9H491 Uncharacterized protein (e value: 6.0e-142; % identity: 32.22)
Query:  SSSTTTTVNS----SSMSSPLFLLSNICNLVPLRLDSTNYVLWKFQISSILKAHSLFGHIDDSLPQPRKFIRSSTQTDTSDIDNTNTNTNTNTNTSDTDT
        SS+T TT  +    +S+ +PL LLSN+ NL+ ++LDSTN+++WK Q+SSILKA+S+   +D ++P P +F+                        ++ + 
Subjt:  SSSTTTTVNS----SSMSSPLFLLSNICNLVPLRLDSTNYVLWKFQISSILKAHSLFGHIDDSLPQPRKFIRSSTQTDTSDIDNTNTNTNTNTNTSDTDT

Query:  ATEISPDYLQWVARDQALITLINATLSPSALAHVVGTASAKELWKS------------------------------------IKDLVDRLAAASITIDDE
         T ++PD+  W  RDQAL+TLIN+TLSP+ L+ VVG  SA+ +WK+                                    +K+  D+L A    ID+E
Subjt:  ATEISPDYLQWVARDQALITLINATLSPSALAHVVGTASAKELWKS------------------------------------IKDLVDRLAAASITIDDE

Query:  EILVHTLNGLPDEFGAFRTSIRTRSGSLSLEELHALLDAEEKTLM----------------AAAGEEDDNLR--------------------------GR
        E+L   L GLP E+G F ++IRTR+  ++ EE+  LL  EE++++                A+A + + N                            GR
Subjt:  EILVHTLNGLPDEFGAFRTSIRTRSGSLSLEELHALLDAEEKTLM----------------AAAGEEDDNLR--------------------------GR

Query:  ANHFGQYHGWSWSNQ----------------------LGILG--------------SGLNSGQSIGSGAALGIKS-LEKLYTGA-----------SINGL
         NH  QY   + SNQ                       G LG               G +    + + A++   S ++  Y G+           S    
Subjt:  ANHFGQYHGWSWSNQ----------------------LGILG--------------SGLNSGQSIGSGAALGIKS-LEKLYTGA-----------SINGL

Query:  YPIPSPSALSSS-------------------------------------------------ERKHRHIVETAMSLLFHASVPLEFWPYAFSTAVFLINRM
        + + +P ALS++                                                 ERKHRHIVE A++LL HAS+P+  W YA + A+ LINR+
Subjt:  YPIPSPSALSSS-------------------------------------------------ERKHRHIVETAMSLLFHASVPLEFWPYAFSTAVFLINRM

Query:  SSSSLNMSSPFETLFGYTPDLHHLKVFG-------------------------------------DPT---------------PLLTLLNLSQTPPQNPY
         +  L+  SP+E LF   PD+ HL+ FG                                     DPT                 L  L+L+ TP   P 
Subjt:  SSSSLNMSSPFETLFGYTPDLHHLKVFG-------------------------------------DPT---------------PLLTLLNLSQTPPQNPY

Query:  SVIHTQLP----------TVDNSSPSTVELLD----------NVTCADSLCQNADTV------------------HSMPTQTEPASNAPIDGSSLQP-TN
        +     LP          +  NS+ + +E L+          ++  + SL  + D +                   S+P+ + P S+ P       P  N
Subjt:  SVIHTQLP----------TVDNSSPSTVELLD----------NVTCADSLCQNADTV------------------HSMPTQTEPASNAPIDGSSLQP-TN

Query:  THSMQTRAKSGIFKPKAFLSTMMAFVPTDPSSFTEASKYLEWRNAMCEEFNALQEQGTWTLVPRLPSMNVVGCKWVFRTKYHPDGSIARHKAAWLQK---
        TH M TR+K+GIFKPK+F +T + +  T+P +   ASKY  W  AM +E+ ALQ Q TW+LVP     N+VGCKWVF+ K + DGSI+R+KA  + K   
Subjt:  THSMQTRAKSGIFKPKAFLSTMMAFVPTDPSSFTEASKYLEWRNAMCEEFNALQEQGTWTLVPRLPSMNVVGCKWVFRTKYHPDGSIARHKAAWLQK---

Query:  ---------------------VITKL---------KLDVKNAFLHGHLEEEVYMSQPSGFLDKSCPTSVCLLHKSLYGLKQAPRAWFDRFTSDLLTLGFI
                             +I  L         +LD++NAFLHG L+EEVYM+QP G+++ S P  VC LHKS+YGLKQAPRAWF+ FT+ LL LGFI
Subjt:  ---------------------VITKL---------KLDVKNAFLHGHLEEEVYMSQPSGFLDKSCPTSVCLLHKSLYGLKQAPRAWFDRFTSDLLTLGFI

Query:  ASAADPS-------------CLYV-------------------------------LEIFPWLEVHSSADGIFVNQAKYLNDLLHTSGMTSAKSCLTPMST
        +S+AD S              LYV                               L  F  L++  S+ G+ + Q KY  DLL    M     C TP   
Subjt:  ASAADPS-------------CLYV-------------------------------LEIFPWLEVHSSADGIFVNQAKYLNDLLHTSGMTSAKSCLTPMST

Query:  TIDLYASAPL-FNDATLYRQLVGSLQYLTFTRPDITFAANRV----------------------------------SDMSLTAYCDSDWAGDTSDRRSTS
         + L ++  +   D   YR LVG+L YLTFTRPD++FA ++V                                    ++L+A+ D+DWAGD  DRRSTS
Subjt:  TIDLYASAPL-FNDATLYRQLVGSLQYLTFTRPDITFAANRV----------------------------------SDMSLTAYCDSDWAGDTSDRRSTS

Query:  GFIAFLGSSPISWSAKKQPTVSRSSTEAEYRSLATTTADLYWIRQLLCDLHIPLTTPPTLWCDNVSAISLANNPVFHARTKHIEIDYHFVREKVVRKDIS
        G + FLG +P++WSAKKQ TVSRSSTEAEYR+LA+ +A+L W+R L+ DL I L  PP LWCDNVSA+++A+NPVFHARTKHIE+D+HF+RE+V+RKD+ 
Subjt:  GFIAFLGSSPISWSAKKQPTVSRSSTEAEYRSLATTTADLYWIRQLLCDLHIPLTTPPTLWCDNVSAISLANNPVFHARTKHIEIDYHFVREKVVRKDIS

Query:  M
        +
Subjt:  M

A0A5A7TPR4 Putative mitochondrial protein (e value: 2.3e-141; % identity: 54.34)
Query:  DPTPLLTLLNLSQTPPQNPYSVIHTQL-----PTVDNSSPSTVELLDNVTCAD-SLCQNADTVHSMPTQTEPASNAPIDGSSLQPTNTHSMQTRAKSGIF
        +P+ LLTLLN+ Q  P N   + + Q      PTVD +  S ++   N TC   ++    DT + +   T    + P       P NTH+MQTRAKS IF
Subjt:  DPTPLLTLLNLSQTPPQNPYSVIHTQL-----PTVDNSSPSTVELLDNVTCAD-SLCQNADTVHSMPTQTEPASNAPIDGSSLQPTNTHSMQTRAKSGIF

Query:  KPKAFLSTMMAFVPTDPSSFTEASKYLEWRNAMCEEFNALQEQGTWTLVPRLPSMNVVGCKWVFRTKYHPDGSIARHKAAWLQK----------------
        KPKAF  T    +PT P+S+TEASKY EWR AM EEFNALQ QGTW+LVPRLPS NVVGCKWVFR KY+PDG+IARHKA  + K                
Subjt:  KPKAFLSTMMAFVPTDPSSFTEASKYLEWRNAMCEEFNALQEQGTWTLVPRLPSMNVVGCKWVFRTKYHPDGSIARHKAAWLQK----------------

Query:  VITK-----------------LKLDVKNAFLHGHLEEEVYMSQPSGFLDKSCPTSVCLLHKSLYGLKQAPRAWFDR-----------FTSDLLTLGF---
        V+ K                  +LDVKN F HG L+E VYM+Q + F DK+CP  VCLLHKSLY    A  + F R           +  D++  G    
Subjt:  VITK-----------------LKLDVKNAFLHGHLEEEVYMSQPSGFLDKSCPTSVCLLHKSLYGLKQAPRAWFDR-----------FTSDLLTLGF---

Query:  --------IASAADPSCLYVLEIFPWLEVHSSADGIFVNQAKYLNDLLHTSGMTSAKSCLTPMSTTIDLYASAPLFNDATLYRQLVGSLQYLTFTRPDIT
                +A     S L  L+ F  LE+ SS DGIFVNQAKYLNDLLHTSGMTSAKSC+TPMST++DLY  AP FND  LYR+LVGSLQYLTFTRPDI 
Subjt:  --------IASAADPSCLYVLEIFPWLEVHSSADGIFVNQAKYLNDLLHTSGMTSAKSCLTPMSTTIDLYASAPLFNDATLYRQLVGSLQYLTFTRPDIT

Query:  FAANRVS----------------------------------DMSLTAYCDSDWAGDTSDRRSTSGFIAFLGSSPISWSAKKQPTVSRSSTEAEYRSLATT
         + NRVS                                    SL  +CDSDWA DTSDRRSTSGFIAFLGS+PISWS+KKQ  VSRSSTEAEY SLATT
Subjt:  FAANRVS----------------------------------DMSLTAYCDSDWAGDTSDRRSTSGFIAFLGSSPISWSAKKQPTVSRSSTEAEYRSLATT

Query:  TADLYWIRQLLCDLHIPLTTPPTLWCDNVSAISLANNPVFHARTKHIEIDYHFVREKVVRKDISM
        TADLYWIRQLLCDLH+PL T PTLWCDNVSAISLA+NPVFHARTKHIEIDYHFV EKV+ KDIS+
Subjt:  TADLYWIRQLLCDLHIPLTTPPTLWCDNVSAISLANNPVFHARTKHIEIDYHFVREKVVRKDISM

A0A5A7U426 Putative mitochondrial protein (e value: 2.2e-152; % identity: 44.74)
Query:  NSSSMSSPLFLLSNICNLVPLRLDSTNYVLWKFQISSILKAHSLFGHIDDSLPQPRKFIRSSTQTDTSDIDNTNTNTNTNTNTSDTDTATEISPDYLQWV
        N S  +S LFLLSNICNLVP+RLDSTNYVLWK+Q+SSILKAHSLFGHIDD+LP P K + SST                        T ++I+P YLQW+
Subjt:  NSSSMSSPLFLLSNICNLVPLRLDSTNYVLWKFQISSILKAHSLFGHIDDSLPQPRKFIRSSTQTDTSDIDNTNTNTNTNTNTSDTDTATEISPDYLQWV

Query:  ARDQALITLINATLSPSALAHVVGTASAKELWKS---------IKDLVDRLAAASITIDDEEILVHTLNGLPDEFGAFRTSIRTRSGSLSLEELHALLDA
        +RDQALITLINATLS SAL HVV +   K+   S         IK LVD+L AASI+++DEEILVHTLNGLP  F AF TSIRTRS +  L+        
Subjt:  ARDQALITLINATLSPSALAHVVGTASAKELWKS---------IKDLVDRLAAASITIDDEEILVHTLNGLPDEFGAFRTSIRTRSGSLSLEELHALLDA

Query:  EEKTLMAAAGEEDDNLRGRANHFGQYHGWSWSNQLGILGSGLNSGQSIGSGAALGIKSLEKLYTGASINGLYPIPSPSALSSSERKHRHIVETAMSLLFH
                                                                                                            
Subjt:  EEKTLMAAAGEEDDNLRGRANHFGQYHGWSWSNQLGILGSGLNSGQSIGSGAALGIKSLEKLYTGASINGLYPIPSPSALSSSERKHRHIVETAMSLLFH

Query:  ASVPLEFWPYAFSTAVFLINRMSSSSLNMSSPFETLFGYTPDLHHLKVFGDPTPLLTLLNLSQTPPQNPYSVIHTQLPTVDNSSPSTVELLDNVTCADSL
              F P++F                   P+       P L  LK F  P    T L+++                  D  + ST   LD VT     
Subjt:  ASVPLEFWPYAFSTAVFLINRMSSSSLNMSSPFETLFGYTPDLHHLKVFGDPTPLLTLLNLSQTPPQNPYSVIHTQLPTVDNSSPSTVELLDNVTCADSL

Query:  CQNADTVHSMPTQTEPASNAPIDGSSLQPTNTHSMQTRAKSGIFKPKAFLSTMMAFVPTDPSSFTEASKYLEWRNAMCEEFNALQEQGTWTLVPRLPSMN
                               G +L                                   S  EASKY EWR AM EEFNALQ QGTW LVPRLPSMN
Subjt:  CQNADTVHSMPTQTEPASNAPIDGSSLQPTNTHSMQTRAKSGIFKPKAFLSTMMAFVPTDPSSFTEASKYLEWRNAMCEEFNALQEQGTWTLVPRLPSMN

Query:  VVGCKWVFRTKYHPDGSIARHKAAWLQKVITKLKLDVKNAFLHGHLEEEVYMSQPSGFLDKSCPTSVCLLHKSLYGLKQAPRAWFDRFTSDLLTLGFIAS
        VVGCKWVFR KY+PDG+IA HKA          +L VK     G+ +E VYM+QP+GF DK+CP  V LLHKSLY L     +W                
Subjt:  VVGCKWVFRTKYHPDGSIARHKAAWLQKVITKLKLDVKNAFLHGHLEEEVYMSQPSGFLDKSCPTSVCLLHKSLYGLKQAPRAWFDRFTSDLLTLGFIAS

Query:  AADPSCLYVLEIFPWLEVHSSADGIFVNQAKYLNDLLHTSGMTSAKSCLTPMSTTIDLYASAPLFNDATLYRQLVGSLQYLTFTRPDITFAANRVSD---
        + +P             + SS DGIF NQAKYLNDLLHTSGMTSAKSC+TPMST++DLY  AP FND +LYR+LVGSLQYLTFTRPDI F+ NRVS    
Subjt:  AADPSCLYVLEIFPWLEVHSSADGIFVNQAKYLNDLLHTSGMTSAKSCLTPMSTTIDLYASAPLFNDATLYRQLVGSLQYLTFTRPDITFAANRVSD---

Query:  ------MSLTAYCDSDWAGDTSDRRSTSGFIAFLGSSPISWSAKKQPTVSRSSTEAEYRSLATTTADLYWIRQLLCDLHIPLTTPPTLWCDNVSAISLAN
               S          GDTSD+RSTSGFIAFL S+PISWS+KKQPTVSRSSTEAEYRSLATTT DLYWI+QLLCDLH+PL T  TLWCDNV AISLA+
Subjt:  ------MSLTAYCDSDWAGDTSDRRSTSGFIAFLGSSPISWSAKKQPTVSRSSTEAEYRSLATTTADLYWIRQLLCDLHIPLTTPPTLWCDNVSAISLAN

Query:  NPVFHARTKHIEIDYHFVREKVVRKDI
        NPVFHARTKHIEIDYHFVREKV+RKDI
Subjt:  NPVFHARTKHIEIDYHFVREKVVRKDI

A0A5A7UZE5 Putative mitochondrial protein (e value: 7.8e-182; % identity: 47.09)
Query:  NSSSMSSPLFLLSNICNLVPLRLDSTNYVLWKFQISSILKAHSLFGHIDDSLPQPRKFIRSSTQTDTSDIDNTNTNTNTNTNTSDTDTATEISPDYLQWV
        N S  +S LFLLSNICNLVP+RLDSTNYVLWK+Q+SSILKAHSLFGHIDD+LP P K + SST                        T +EI+P+YLQW+
Subjt:  NSSSMSSPLFLLSNICNLVPLRLDSTNYVLWKFQISSILKAHSLFGHIDDSLPQPRKFIRSSTQTDTSDIDNTNTNTNTNTNTSDTDTATEISPDYLQWV

Query:  ARDQALITLINATLSPSALAHVVGTASAKELWKS-------------------------------------IKDLVDRLAAASITIDDEEILVHTLNGLP
        +RDQALITLIN TLS SALAHVV + S+K LW S                                     IK LVD+LAAAS++++DEEILVHTLNGLP
Subjt:  ARDQALITLINATLSPSALAHVVGTASAKELWKS-------------------------------------IKDLVDRLAAASITIDDEEILVHTLNGLP

Query:  DEFGAFRTSIRTRSGSLSLEELHALLDAEEKTLMAAAGEEDDNLRGRANHFGQYHGWSWSNQLGILGSGLNSGQSIGSGAALGIKSLEKLYTGASINGLY
          F AFRTSIRTRSG++SLEELH LL +EE  +   +  E       A H  Q+HG                  S G G                    +
Subjt:  DEFGAFRTSIRTRSGSLSLEELHALLDAEEKTLMAAAGEEDDNLRGRANHFGQYHGWSWSNQLGILGSGLNSGQSIGSGAALGIKSLEKLYTGASINGLY

Query:  PIPSPSALSSSERKHRHIVETAMSLLFHASVPLEFWPYAFSTAVFLINRMSSSSLNMSSPFETLFGYTPDLHHLKVFGDPTPLLTLLNLSQTPPQNPYSV
           +P+  SSS   +R                                  +S S  + S                                         
Subjt:  PIPSPSALSSSERKHRHIVETAMSLLFHASVPLEFWPYAFSTAVFLINRMSSSSLNMSSPFETLFGYTPDLHHLKVFGDPTPLLTLLNLSQTPPQNPYSV

Query:  IHTQLPTVDNSSPSTVELLDNVTCADSLCQNADTVHSMPTQTEPASNAPIDGSSLQPTNTHSMQTRAKSGIFKPKAFLSTMMAFVPTDPSSFTEASKYLE
         +  + T +  + +T+ L D                       P S          P NTH+MQTRAKSGIFKPKAF  T    +PT P+S+TEASKY E
Subjt:  IHTQLPTVDNSSPSTVELLDNVTCADSLCQNADTVHSMPTQTEPASNAPIDGSSLQPTNTHSMQTRAKSGIFKPKAFLSTMMAFVPTDPSSFTEASKYLE

Query:  WRNAMCEEFNALQEQGTWTLVPRLPSMNVVGCKWVFRTKYHPDGSIARHKAAWLQKVITKLKLDVKNAFLHGHLEEEVYMSQPSGFLDKSCPTSVCLLHK
        WR  M EEFNALQ QGTW+LVPRLPSMNVVGCKWVFR KY+ DG+IARHKA  + K               G+ +E VYM+QP+GF +K+CP  VCLLHK
Subjt:  WRNAMCEEFNALQEQGTWTLVPRLPSMNVVGCKWVFRTKYHPDGSIARHKAAWLQKVITKLKLDVKNAFLHGHLEEEVYMSQPSGFLDKSCPTSVCLLHK

Query:  SLYGLKQAPRAWFDRFTSDLLTLGFIASAADPS-------------CLYV-------------------------------LEIFPWLEVHSSADGIFVN
        SLYGLKQAPRAWF+RFTS L TLGF+AS ADPS              LYV                               L+ F  LE+ SS DGI VN
Subjt:  SLYGLKQAPRAWFDRFTSDLLTLGFIASAADPS-------------CLYV-------------------------------LEIFPWLEVHSSADGIFVN

Query:  QAKYLNDLLHTSGMTSAKSCLTPMSTTIDLYASAPLFNDATLYRQLVGSLQYLTFTRPDITF--AANRVSDMSLTAYCDSDWAGDTSDRRSTSGFIAFLG
        QA+YLNDLLHTSGMTSAKSC+TP+ST++DLY  AP FND +LYR+L          +P +    A  R+    L   C     GDTSDRRSTSGFIAFL 
Subjt:  QAKYLNDLLHTSGMTSAKSCLTPMSTTIDLYASAPLFNDATLYRQLVGSLQYLTFTRPDITF--AANRVSDMSLTAYCDSDWAGDTSDRRSTSGFIAFLG

Query:  SSPISWSAKKQPTVSRSSTEAEYRSLATTTADLYWIRQLLCDLHIPLTTPPTLWCDNVSAISLANNPVFHARTKHIEIDYHFVREKVVRKDISM
        S+PISWS+KKQ T+SRSSTEAEYRSLATTTADLYWIRQLL DLH+PL T P LWCDN+SAISLA+NPVFHARTKHIEIDYHFVREKV+RKDIS+
Subjt:  SSPISWSAKKQPTVSRSSTEAEYRSLATTTADLYWIRQLLCDLHIPLTTPPTLWCDNVSAISLANNPVFHARTKHIEIDYHFVREKVVRKDISM

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 1.6e-46; % identity: 29.41)
Query:  PSSFTE---ASKYLEWRNAMCEEFNALQEQGTWTLVPRLPSMNVVGCKWVFRTKYHPDGSIARHKAAWLQKVITKL------------------------
        P+SF E         W  A+  E NA +   TWT+  R  + N+V  +WVF  KY+  G+  R+KA  + +  T+                         
Subjt:  PSSFTE---ASKYLEWRNAMCEEFNALQEQGTWTLVPRLPSMNVVGCKWVFRTKYHPDGSIARHKAAWLQKVITKL------------------------

Query:  ---------KLDVKNAFLHGHLEEEVYMSQPSGFLDKSCPT-SVCLLHKSLYGLKQAPRAWFDRFTSDLLTLGFIASAADPSCLYVLE------------
                 ++DVK AFL+G L+EE+YM  P G    SC + +VC L+K++YGLKQA R WF+ F   L    F+ S+ D  C+Y+L+            
Subjt:  ---------KLDVKNAFLHGHLEEEVYMSQPSGFLDKSCPT-SVCLLHKSLYGLKQAPRAWFDRFTSDLLTLGFIASAADPSCLYVLE------------

Query:  -----------------------------------IFPWLEVHSSADGIFVNQAKYLNDLLHTSGMTSAKSCLTPMSTTIDLYASAPLFNDATLYRQLVG
                                            F  + +    D I+++Q+ Y+  +L    M +  +  TP+ + I+        +  T  R L+G
Subjt:  -----------------------------------IFPWLEVHSSADGIFVNQAKYLNDLLHTSGMTSAKSCLTPMSTTIDLYASAPLFNDATLYRQLVG

Query:  SLQYLTF-TRPDITFAANRVS-------------------------DMSL------------TAYCDSDWAGDTSDRRSTSGFI-AFLGSSPISWSAKKQ
         L Y+   TRPD+T A N +S                         DM L              Y DSDWAG   DR+ST+G++      + I W+ K+Q
Subjt:  SLQYLTF-TRPDITFAANRVS-------------------------DMSL------------TAYCDSDWAGDTSDRRSTSGFI-AFLGSSPISWSAKKQ

Query:  PTVSRSSTEAEYRSLATTTADLYWIRQLLCDLHIPLTTPPTLWCDNVSAISLANNPVFHARTKHIEIDYHFVREKV
         +V+ SSTEAEY +L     +  W++ LL  ++I L  P  ++ DN   IS+ANNP  H R KHI+I YHF RE+V
Subjt:  PTVSRSSTEAEYRSLATTTADLYWIRQLLCDLHIPLTTPPTLWCDNVSAISLANNPVFHARTKHIEIDYHFVREKV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 2.7e-46; % identity: 26.54)
Query:  SPSALSSSERKHRHIVETAMSLLFHASVPLEFWPYAFSTAVFLINRMSSSSLNMSSPFETLFGYTPDLHHLKVFG------DPTPLLTLLNLSQTP----
        +P     +ER +R IVE   S+L  A +P  FW  A  TA +LINR  S  L    P            HLKVFG       P    T L+    P    
Subjt:  SPSALSSSERKHRHIVETAMSLLFHASVPLEFWPYAFSTAVFLINRMSSSSLNMSSPFETLFGYTPDLHHLKVFG------DPTPLLTLLNLSQTP----

Query:  ------------PQNPYSVIHTQ---------LPTVDNSSPSTVELLDNVTCADSLCQNADTVHSMPTQTEPASNAPID----GSSL-----------QP
                          VI ++             D S      ++ N     S   N  +  S   +       P +    G  L           Q 
Subjt:  ------------PQNPYSVIHTQ---------LPTVDNSSPSTVELLDNVTCADSLCQNADTVHSMPTQTEPASNAPID----GSSL-----------QP

Query:  TNTHSMQTRAKSGIFKPKAFLSTMMAFVPTD--PSSFTEASKYLE---WRNAMCEEFNALQEQGTWTLVPRLPSMNVVGCKWVFRTKYHPDGSIARHKAA
           H    R++    + + + ST    +  D  P S  E   + E      AM EE  +LQ+ GT+ LV        + CKWVF+ K   D  + R+KA 
Subjt:  TNTHSMQTRAKSGIFKPKAFLSTMMAFVPTD--PSSFTEASKYLE---WRNAMCEEFNALQEQGTWTLVPRLPSMNVVGCKWVFRTKYHPDGSIARHKAA

Query:  WLQKVITKLK---------------------------------LDVKNAFLHGHLEEEVYMSQPSGFLDKSCPTSVCLLHKSLYGLKQAPRAWFDRFTSD
         + K   + K                                 LDVK AFLHG LEEE+YM QP GF        VC L+KSLYGLKQAPR W+ +F S 
Subjt:  WLQKVITKLK---------------------------------LDVKNAFLHGHLEEEVYMSQPSGFLDKSCPTSVCLLHKSLYGLKQAPRAWFDRFTSD

Query:  LLTLGFIASAADPSCLY-----------------------------------------VLEIFPWLEV-------HSSADGIFVNQAKYLNDLLHTSGMT
        + +  ++ + +DP C+Y                                         + ++ P  ++         ++  ++++Q KY+  +L    M 
Subjt:  LLTLGFIASAADPSCLY-----------------------------------------VLEIFPWLEV-------HSSADGIFVNQAKYLNDLLHTSGMT

Query:  SAKSCLTPMSTTIDLYAS------APLFNDATL-YRQLVGSLQY-LTFTRPDITFAANRV----------------------------------SDMSLT
        +AK   TP++  + L             N A + Y   VGSL Y +  TRPDI  A   V                                  SD  L 
Subjt:  SAKSCLTPMSTTIDLYAS------APLFNDATL-YRQLVGSLQY-LTFTRPDITFAANRV----------------------------------SDMSLT

Query:  AYCDSDWAGDTSDRRSTSGFIAFLGSSPISWSAKKQPTVSRSSTEAEYRSLATTTADLYWIRQLLCDLHIPLTTPPTLWCDNVSAISLANNPVFHARTKH
         Y D+D AGD  +R+S++G++       ISW +K Q  V+ S+TEAEY +   T  ++ W+++ L +L +       ++CD+ SAI L+ N ++HARTKH
Subjt:  AYCDSDWAGDTSDRRSTSGFIAFLGSSPISWSAKKQPTVSRSSTEAEYRSLATTTADLYWIRQLLCDLHIPLTTPPTLWCDNVSAISLANNPVFHARTKH

Query:  IEIDYHFVREKV
        I++ YH++RE V
Subjt:  IEIDYHFVREKV

P92519 Uncharacterized mitochondrial protein AtMg00810    2.1e-30    40.76
Query:  FPWLEVHSSADGIFVNQAKYLNDLLHTSGMTSAKSCLTPMSTTIDLYASAPLFNDATLYRQLVGSLQYLTFTRPDITFAANRV-----------------
        F  +++ +   G+F++Q KY   +L+ +GM   K   TP+   ++   S   + D + +R +VG+LQYLT TRPDI++A N V                 
Subjt:  FPWLEVHSSADGIFVNQAKYLNDLLHTSGMTSAKSCLTPMSTTIDLYASAPLFNDATLYRQLVGSLQYLTFTRPDITFAANRV-----------------

Query:  ------------------SDMSLTAYCDSDWAGDTSDRRSTSGFIAFLGSSPISWSAKKQPTVSRSSTEAEYRSLATTTADLYW
                          S +++ A+CDSDWAG TS RRST+GF  FLG + ISWSAK+QPTVSRSSTE EYR+LA T A+L W
Subjt:  ------------------SDMSLTAYCDSDWAGDTSDRRSTSGFIAFLGSSPISWSAKKQPTVSRSSTEAEYRSLATTTADLYW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1    1.0e-98    33.17
Query:  PIPSPSALSSSERKHRHIVETAMSLLFHASVPLEFWPYAFSTAVFLINRMSSSSLNMSSPFETLFGYTPDLHHLKVFG----------------------
        P  +P     SERKHRHIVET ++LL HAS+P  +WPYAF+ AV+LINR+ +  L + SPF+ LFG +P+   L+VFG                      
Subjt:  PIPSPSALSSSERKHRHIVETAMSLLFHASVPLEFWPYAFSTAVFLINRMSSSSLNMSSPFETLFGYTPDLHHLKVFG----------------------

Query:  -----------------------------------------------------------------DPTPLLTLLNLSQ-----TPPQNPYSVIH------
                                                                           TP+L   + S      TPP +P +         
Subjt:  -----------------------------------------------------------------DPTPLLTLLNLSQ-----TPPQNPYSVIH------

Query:  -------------------------------TQLPTVDNSSPSTVELLDNVT------CADSLCQNADTVHSMPTQTEPASNAP----------------
                                       TQ  T  +SS +T +  +N T       A SL   A +  S P+ T  AS++                 
Subjt:  -------------------------------TQLPTVDNSSPSTVELLDNVT------CADSLCQNADTVHSMPTQTEPASNAP----------------

Query:  ---IDGSSLQPTNTHSMQTRAKSGIFKPKAFLSTMMAF-VPTDPSSFTEASKYLEWRNAMCEEFNALQEQGTWTLVPRLPS-MNVVGCKWVFRTKYHPDG
           ++ ++  P NTHSM TRAK+GI KP    S  ++    ++P +  +A K   WRNAM  E NA     TW LVP  PS + +VGC+W+F  KY+ DG
Subjt:  ---IDGSSLQPTNTHSMQTRAKSGIFKPKAFLSTMMAF-VPTDPSSFTEASKYLEWRNAMCEEFNALQEQGTWTLVPRLPS-MNVVGCKWVFRTKYHPDG

Query:  SIARHKAAWLQKVITK---------------------------------LKLDVKNAFLHGHLEEEVYMSQPSGFLDKSCPTSVCLLHKSLYGLKQAPRA
        S+ R+KA  + K   +                                  +LDV NAFL G L ++VYMSQP GF+DK  P  VC L K+LYGLKQAPRA
Subjt:  SIARHKAAWLQKVITK---------------------------------LKLDVKNAFLHGHLEEEVYMSQPSGFLDKSCPTSVCLLHKSLYGLKQAPRA

Query:  WFDRFTSDLLTLGFIASAADPS-------------CLYV-------------------------------LEIFPWLEVHSSADGIFVNQAKYLNDLLHT
        W+    + LLT+GF+ S +D S              +YV                               L  F  +E      G+ ++Q +Y+ DLL  
Subjt:  WFDRFTSDLLTLGFIASAADPS-------------CLYV-------------------------------LEIFPWLEVHSSADGIFVNQAKYLNDLLHT

Query:  SGMTSAKSCLTPM--STTIDLYASAPLFNDATLYRQLVGSLQYLTFTRPDITFAANRVSD-----------------------------------MSLTA
        + M +AK   TPM  S  + LY+   L  D T YR +VGSLQYL FTRPDI++A NR+S                                    +SL A
Subjt:  SGMTSAKSCLTPM--STTIDLYASAPLFNDATLYRQLVGSLQYLTFTRPDITFAANRVSD-----------------------------------MSLTA

Query:  YCDSDWAGDTSDRRSTSGFIAFLGSSPISWSAKKQPTVSRSSTEAEYRSLATTTADLYWIRQLLCDLHIPLTTPPTLWCDNVSAISLANNPVFHARTKHI
        Y D+DWAGD  D  ST+G+I +LG  PISWS+KKQ  V RSSTEAEYRS+A T++++ WI  LL +L I LT PP ++CDNV A  L  NPVFH+R KHI
Subjt:  YCDSDWAGDTSDRRSTSGFIAFLGSSPISWSAKKQPTVSRSSTEAEYRSLATTTADLYWIRQLLCDLHIPLTTPPTLWCDNVSAISLANNPVFHARTKHI

Query:  EIDYHFVREKV
         IDYHF+R +V
Subjt:  EIDYHFVREKV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2    5.7e-97    33.33
Query:  PIPSPSALSSSERKHRHIVETAMSLLFHASVPLEFWPYAFSTAVFLINRMSSSSLNMSSPFETLFGYTPDLHHLKVFGD---------------------
        P  +P     SERKHRHIVE  ++LL HASVP  +WPYAFS AV+LINR+ +  L + SPF+ LFG  P+   LKVFG                      
Subjt:  PIPSPSALSSSERKHRHIVETAMSLLFHASVPLEFWPYAFSTAVFLINRMSSSSLNMSSPFETLFGYTPDLHHLKVFGD---------------------

Query:  ----------------------------------------------------------------PTPLLTL---------LNLSQTPPQNPYSVIHTQ--
                                                                        PT  L L         L+ S  PP +P  +  TQ  
Subjt:  ----------------------------------------------------------------PTPLLTL---------LNLSQTPPQNPYSVIHTQ--

Query:  ---LPTVDNSSPSTVELL------DNVTCADSLCQNADT---VHSMPTQTEPASNAPIDGSSL-------------------------------------
           LP+   SSPS+ E           T      QN+++   + + P    P+ N+P   S L                                     
Subjt:  ---LPTVDNSSPSTVELL------DNVTCADSLCQNADT---VHSMPTQTEPASNAPIDGSSL-------------------------------------

Query:  -----------QPTNTHSMQTRAKSGIFKP--KAFLSTMMAFVPTDPSSFTEASKYLEWRNAMCEEFNALQEQGTWTLV-PRLPSMNVVGCKWVFRTKYH
                    P NTHSM TRAK GI KP  K   +T +A   ++P +  +A K   WR AM  E NA     TW LV P  PS+ +VGC+W+F  K++
Subjt:  -----------QPTNTHSMQTRAKSGIFKP--KAFLSTMMAFVPTDPSSFTEASKYLEWRNAMCEEFNALQEQGTWTLV-PRLPSMNVVGCKWVFRTKYH

Query:  PDGSIARHKAAWLQKVITK---------------------------------LKLDVKNAFLHGHLEEEVYMSQPSGFLDKSCPTSVCLLHKSLYGLKQA
         DGS+ R+KA  + K   +                                  +LDV NAFL G L +EVYMSQP GF+DK  P  VC L K++YGLKQA
Subjt:  PDGSIARHKAAWLQKVITK---------------------------------LKLDVKNAFLHGHLEEEVYMSQPSGFLDKSCPTSVCLLHKSLYGLKQA

Query:  PRAWFDRFTSDLLTLGFIASAADPS-------------CLYV-------------------------------LEIFPWLEVHSSADGIFVNQAKYLNDL
        PRAW+    + LLT+GF+ S +D S              +YV                               L  F  +E      G+ ++Q +Y  DL
Subjt:  PRAWFDRFTSDLLTLGFIASAADPS-------------CLYV-------------------------------LEIFPWLEVHSSADGIFVNQAKYLNDL

Query:  LHTSGMTSAKSCLTPMSTTIDL-YASAPLFNDATLYRQLVGSLQYLTFTRPDITFAANRVSD-----------------------------------MSL
        L  + M +AK   TPM+T+  L   S     D T YR +VGSLQYL FTRPD+++A NR+S                                    +SL
Subjt:  LHTSGMTSAKSCLTPMSTTIDL-YASAPLFNDATLYRQLVGSLQYLTFTRPDITFAANRVSD-----------------------------------MSL

Query:  TAYCDSDWAGDTSDRRSTSGFIAFLGSSPISWSAKKQPTVSRSSTEAEYRSLATTTADLYWIRQLLCDLHIPLTTPPTLWCDNVSAISLANNPVFHARTK
         AY D+DWAGDT D  ST+G+I +LG  PISWS+KKQ  V RSSTEAEYRS+A T+++L WI  LL +L I L+ PP ++CDNV A  L  NPVFH+R K
Subjt:  TAYCDSDWAGDTSDRRSTSGFIAFLGSSPISWSAKKQPTVSRSSTEAEYRSLATTTADLYWIRQLLCDLHIPLTTPPTLWCDNVSAISLANNPVFHARTK

Query:  HIEIDYHFVREKV
        HI +DYHF+R +V
Subjt:  HIEIDYHFVREKV

Arabidopsis top hits    e value    % identity    Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8    1.5e-68    34.32
Query:  DPSSFTEASKYLEWRNAMCEEFNALQEQGTWTLVPRLPSMNVVGCKWVFRTKYHPDGSIARHKAAWLQKVITKL--------------------------
        +PS++ EA ++L W  AM +E  A++   TW +    P+   +GCKWV++ KY+ DG+I R+KA  + K  T+                           
Subjt:  DPSSFTEASKYLEWRNAMCEEFNALQEQGTWTLVPRLPSMNVVGCKWVFRTKYHPDGSIARHKAAWLQKVITKL--------------------------

Query:  -------KLDVKNAFLHGHLEEEVYMSQPSGFL----DKSCPTSVCLLHKSLYGLKQAPRAWFDRFTSDLLTLGFIASAADPS-----------CLYV--
               +LD+ NAFL+G L+EE+YM  P G+     D   P +VC L KS+YGLKQA R WF +F+  L+  GF+ S +D +           C+ V  
Subjt:  -------KLDVKNAFLHGHLEEEVYMSQPSGFL----DKSCPTSVCLLHKSLYGLKQAPRAWFDRFTSDLLTLGFIASAADPS-----------CLYV--

Query:  -------------------------------LEIFPWLEVHSSADGIFVNQAKYLNDLLHTSGMTSAKSCLTPMSTTIDLYA-SAPLFNDATLYRQLVGS
                                       L+ F  LE+  SA GI + Q KY  DLL  +G+   K    PM  ++   A S   F DA  YR+L+G 
Subjt:  -------------------------------LEIFPWLEVHSSADGIFVNQAKYLNDLLHTSGMTSAKSCLTPMSTTIDLYA-SAPLFNDATLYRQLVGS

Query:  LQYLTFTRPDITFAANRVS-----------------------------------DMSLTAYCDSDWAGDTSDRRSTSGFIAFLGSSPISWSAKKQPTVSR
        L YL  TR DI+FA N++S                                   +M L  + D+ +      RRST+G+  FLG+S ISW +KKQ  VS+
Subjt:  LQYLTFTRPDITFAANRVS-----------------------------------DMSLTAYCDSDWAGDTSDRRSTSGFIAFLGSSPISWSAKKQPTVSR

Query:  SSTEAEYRSLATTTADLYWIRQLLCDLHIPLTTPPTLWCDNVSAISLANNPVFHARTKHIEIDYHFVREKVV
        SS EAEYR+L+  T ++ W+ Q   +L +PL+ P  L+CDN +AI +A N VFH RTKHIE D H VRE+ V
Subjt:  SSTEAEYRSLATTTADLYWIRQLLCDLHIPLTTPPTLWCDNVSAISLANNPVFHARTKHIEIDYHFVREKVV

ATMG00810.1 DNA/RNA polymerases superfamily protein    1.5e-31    40.76
Query:  FPWLEVHSSADGIFVNQAKYLNDLLHTSGMTSAKSCLTPMSTTIDLYASAPLFNDATLYRQLVGSLQYLTFTRPDITFAANRV-----------------
        F  +++ +   G+F++Q KY   +L+ +GM   K   TP+   ++   S   + D + +R +VG+LQYLT TRPDI++A N V                 
Subjt:  FPWLEVHSSADGIFVNQAKYLNDLLHTSGMTSAKSCLTPMSTTIDLYASAPLFNDATLYRQLVGSLQYLTFTRPDITFAANRV-----------------

Query:  ------------------SDMSLTAYCDSDWAGDTSDRRSTSGFIAFLGSSPISWSAKKQPTVSRSSTEAEYRSLATTTADLYW
                          S +++ A+CDSDWAG TS RRST+GF  FLG + ISWSAK+QPTVSRSSTE EYR+LA T A+L W
Subjt:  ------------------SDMSLTAYCDSDWAGDTSDRRSTSGFIAFLGSSPISWSAKKQPTVSRSSTEAEYRSLATTTADLYW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)    1.8e-13    44.79
Query:  MQTRAKSGIFK--PKAFLSTMMAFVPTDPSSFTEASKYLEWRNAMCEEFNALQEQGTWTLVPRLPSMNVVGCKWVFRTKYHPDGSIARHKAAWLQK
        M TR+K+GI K  PK  L T+   +  +P S   A K   W  AM EE +AL    TW LVP   + N++GCKWVF+TK H DG++ R KA  + K
Subjt:  MQTRAKSGIFK--PKAFLSTMMAFVPTDPSSFTEASKYLEWRNAMCEEFNALQEQGTWTLVPRLPSMNVVGCKWVFRTKYHPDGSIARHKAAWLQK


Sequences
CDS sequence
ATGGCTTCTAGCTCCACAACTACCACTGTCAATTCCTCCTCCATGTCTTCTCCTCTGTTCCTTTTGTCAAATATATGCAATTTGGTTCCACTTCGCTTGGATTCTACTAA
TTATGTTCTTTGGAAGTTCCAAATCTCTTCCATTCTGAAGGCCCACTCTCTTTTTGGACATATTGATGATTCTCTTCCCCAACCACGCAAATTCATTCGTTCTTCGACCC
AGACAGACACCAGCGACATAGACAACACCAACACCAACACCAACACCAACACCAACACCAGCGACACAGACACCGCCACAGAGATCAGTCCTGATTATCTCCAATGGGTT
GCTCGAGATCAAGCCCTTATTACATTAATCAACGCCACGTTGTCACCATCAGCACTTGCCCATGTCGTTGGAACAGCGTCGGCCAAAGAGCTCTGGAAATCAATCAAAGA
TCTTGTTGACAGGCTTGCTGCAGCTTCAATCACCATAGATGATGAAGAAATTTTGGTTCATACCTTGAATGGATTACCAGATGAATTTGGCGCCTTTAGAACGTCCATAC
GGACTCGCAGTGGATCTTTGTCGCTTGAGGAGCTTCATGCCCTTCTTGATGCTGAAGAAAAAACGCTCATGGCGGCCGCGGGCGAGGAAGACGACAATTTGCGAGGGCGT
GCAAATCATTTTGGGCAATATCACGGCTGGTCGTGGTCAAATCAATTGGGCATTCTCGGATCTGGCTTAAATTCGGGTCAATCCATCGGATCTGGAGCAGCACTCGGGAT
AAAGTCTCTGGAAAAATTATATACTGGTGCCAGCATCAACGGTCTCTACCCAATTCCAAGCCCTTCAGCTCTATCCAGTTCCGAACGCAAACACCGTCACATTGTCGAAA
CTGCAATGTCATTACTTTTTCATGCTTCTGTTCCTCTTGAATTCTGGCCCTATGCTTTCTCCACTGCAGTTTTTCTCATAAATCGAATGTCTTCTTCATCTCTAAATATG
TCTTCTCCGTTTGAAACACTGTTTGGTTACACTCCTGATTTGCATCATTTAAAAGTTTTTGGTGACCCAACTCCATTACTTACCCTTCTAAATCTCTCTCAGACCCCTCC
TCAAAACCCCTACTCTGTGATACATACACAATTACCCACTGTTGACAATTCGAGTCCATCCACGGTTGAGTTACTCGATAACGTTACTTGTGCTGATTCTCTTTGTCAGA
ATGCAGATACTGTACATTCGATGCCTACTCAAACTGAGCCTGCCAGTAATGCTCCAATTGATGGGAGTTCACTGCAACCTACTAATACTCACTCCATGCAAACTCGGGCG
AAGTCTGGTATTTTCAAGCCAAAGGCCTTTTTATCTACTATGATGGCCTTTGTTCCCACCGATCCTTCATCTTTCACTGAAGCCTCCAAGTATCTCGAGTGGAGAAATGC
CATGTGTGAAGAATTCAATGCTCTTCAAGAACAAGGTACGTGGACTTTAGTACCTCGATTGCCATCCATGAATGTTGTAGGTTGCAAATGGGTTTTTCGAACTAAATACC
ATCCTGATGGCTCCATTGCTCGACATAAGGCCGCCTGGTTGCAAAAGGTTATCACCAAGTTGAAGTTGGATGTGAAAAATGCATTTCTCCATGGACATCTCGAAGAAGAA
GTCTACATGTCTCAACCCTCTGGCTTTCTAGATAAATCTTGTCCAACCAGTGTTTGTTTGCTTCACAAGAGCCTGTATGGTCTTAAGCAAGCTCCTCGAGCTTGGTTTGA
TCGCTTTACATCAGACCTATTAACCTTGGGATTTATTGCTTCTGCTGCTGATCCATCTTGTTTATACGTCCTTGAAATATTTCCTTGGTTGGAAGTTCACTCATCTGCTG
ATGGTATTTTTGTTAATCAAGCTAAGTACCTCAATGATTTGCTTCATACATCAGGCATGACATCTGCCAAATCTTGTTTGACCCCTATGTCTACTACAATTGATCTATAT
GCTTCAGCACCCTTGTTCAATGACGCAACTCTTTATCGTCAGCTGGTTGGTTCTCTTCAGTATCTCACATTCACAAGGCCGGATATCACATTTGCAGCAAATCGAGTAAG
TGATATGTCTCTCACTGCATATTGTGACTCTGATTGGGCTGGTGATACATCCGATCGTCGTTCTACATCTGGCTTTATTGCTTTTCTTGGTTCCAGCCCCATCTCCTGGT
CTGCTAAAAAGCAACCGACAGTTTCCCGCTCTTCTACAGAAGCTGAATATCGATCACTAGCAACTACAACAGCTGACTTGTATTGGATACGTCAACTTTTGTGCGATCTT
CATATACCTCTGACTACTCCTCCCACGTTATGGTGTGACAATGTATCAGCCATTTCTCTTGCCAACAATCCAGTTTTCCATGCTCGCACAAAACACATCGAGATAGACTA
CCATTTCGTTCGAGAGAAAGTGGTGCGAAAGGATATTTCTATGGCAGCATCGAGATTAGGATCCAAGAAGGAGAAAAATGGCTGGCCAGGAGGCGGACGCAGTCCTTGTG
CAATGCCTAATCCACTTGGTACAAATAGGAGGGTAAAGCTCCTGAAAAAACAATACGTGGGAATTGCTGAGATGATGGGACCAACTTGTAGTGGGTTTGGGTGGAACGAG
GAGAGGAAGTGCATCGAGGCAGAGAAGAAGATCTTTGATGCGTGGGTTGAGTGTTTGGGAAGGACAATGCAAGAAGGGGGAGCTTGCACTCCTATTGAGCTTACACCAGA
ACCGAAACCGGTAGTCGATCTTGGAGAGGACATGAACGTAGACTATGAAAATTGTTACGTCCCCAGTCCACCTGTTATTGATCCCACATCTGGGGAAGAATTTTGTGGGA
CATTGACTGGCAGAGCAGCTGATGCAGGATTGTTTAAGACACAGCAGAGAGAGAGACGGAAGCAGCGGAGGGGTATTGCGTCGTCTGCACGTCGCTGTCGCCGGAGATAC
GTCTGGAGAGAGGGGTCGACGCGAGTGGGTTTCGCAGGTGGGTTTCTTGCAGGTGGGTTTCGTGGGTTTCACAGGTGGGTTTCAGAGAGAGAGACGACGACGGTGGAGGG
GTGCTGCGTCGTTGCTGTCGCGCGTAGATGGCGTCTGGAGGGAGGGGTTGGTCGGCGATTAGTGACTGCAAGGAGAAGAAAGAAAAAAGCGAAAGAAAAGAAGAAGAGAA
AAAATGGAGTAAGGGAAGAAGAAATGAGAAGAAAAAAAAAAAAAAAGGTCGCCGGCGACCGGCGGTGGCGGCGGCAGTGGCCGCCGGTCGCCGGACATAAGAGGAAGAAG
AATAAGGAAGAGGTGAAGAGGAAGAAGATGAAGGAGAAGAAAGAGGAGGAGAGAGAGGGATAA
mRNA sequence
ATGGCTTCTAGCTCCACAACTACCACTGTCAATTCCTCCTCCATGTCTTCTCCTCTGTTCCTTTTGTCAAATATATGCAATTTGGTTCCACTTCGCTTGGATTCTACTAA
TTATGTTCTTTGGAAGTTCCAAATCTCTTCCATTCTGAAGGCCCACTCTCTTTTTGGACATATTGATGATTCTCTTCCCCAACCACGCAAATTCATTCGTTCTTCGACCC
AGACAGACACCAGCGACATAGACAACACCAACACCAACACCAACACCAACACCAACACCAGCGACACAGACACCGCCACAGAGATCAGTCCTGATTATCTCCAATGGGTT
GCTCGAGATCAAGCCCTTATTACATTAATCAACGCCACGTTGTCACCATCAGCACTTGCCCATGTCGTTGGAACAGCGTCGGCCAAAGAGCTCTGGAAATCAATCAAAGA
TCTTGTTGACAGGCTTGCTGCAGCTTCAATCACCATAGATGATGAAGAAATTTTGGTTCATACCTTGAATGGATTACCAGATGAATTTGGCGCCTTTAGAACGTCCATAC
GGACTCGCAGTGGATCTTTGTCGCTTGAGGAGCTTCATGCCCTTCTTGATGCTGAAGAAAAAACGCTCATGGCGGCCGCGGGCGAGGAAGACGACAATTTGCGAGGGCGT
GCAAATCATTTTGGGCAATATCACGGCTGGTCGTGGTCAAATCAATTGGGCATTCTCGGATCTGGCTTAAATTCGGGTCAATCCATCGGATCTGGAGCAGCACTCGGGAT
AAAGTCTCTGGAAAAATTATATACTGGTGCCAGCATCAACGGTCTCTACCCAATTCCAAGCCCTTCAGCTCTATCCAGTTCCGAACGCAAACACCGTCACATTGTCGAAA
CTGCAATGTCATTACTTTTTCATGCTTCTGTTCCTCTTGAATTCTGGCCCTATGCTTTCTCCACTGCAGTTTTTCTCATAAATCGAATGTCTTCTTCATCTCTAAATATG
TCTTCTCCGTTTGAAACACTGTTTGGTTACACTCCTGATTTGCATCATTTAAAAGTTTTTGGTGACCCAACTCCATTACTTACCCTTCTAAATCTCTCTCAGACCCCTCC
TCAAAACCCCTACTCTGTGATACATACACAATTACCCACTGTTGACAATTCGAGTCCATCCACGGTTGAGTTACTCGATAACGTTACTTGTGCTGATTCTCTTTGTCAGA
ATGCAGATACTGTACATTCGATGCCTACTCAAACTGAGCCTGCCAGTAATGCTCCAATTGATGGGAGTTCACTGCAACCTACTAATACTCACTCCATGCAAACTCGGGCG
AAGTCTGGTATTTTCAAGCCAAAGGCCTTTTTATCTACTATGATGGCCTTTGTTCCCACCGATCCTTCATCTTTCACTGAAGCCTCCAAGTATCTCGAGTGGAGAAATGC
CATGTGTGAAGAATTCAATGCTCTTCAAGAACAAGGTACGTGGACTTTAGTACCTCGATTGCCATCCATGAATGTTGTAGGTTGCAAATGGGTTTTTCGAACTAAATACC
ATCCTGATGGCTCCATTGCTCGACATAAGGCCGCCTGGTTGCAAAAGGTTATCACCAAGTTGAAGTTGGATGTGAAAAATGCATTTCTCCATGGACATCTCGAAGAAGAA
GTCTACATGTCTCAACCCTCTGGCTTTCTAGATAAATCTTGTCCAACCAGTGTTTGTTTGCTTCACAAGAGCCTGTATGGTCTTAAGCAAGCTCCTCGAGCTTGGTTTGA
TCGCTTTACATCAGACCTATTAACCTTGGGATTTATTGCTTCTGCTGCTGATCCATCTTGTTTATACGTCCTTGAAATATTTCCTTGGTTGGAAGTTCACTCATCTGCTG
ATGGTATTTTTGTTAATCAAGCTAAGTACCTCAATGATTTGCTTCATACATCAGGCATGACATCTGCCAAATCTTGTTTGACCCCTATGTCTACTACAATTGATCTATAT
GCTTCAGCACCCTTGTTCAATGACGCAACTCTTTATCGTCAGCTGGTTGGTTCTCTTCAGTATCTCACATTCACAAGGCCGGATATCACATTTGCAGCAAATCGAGTAAG
TGATATGTCTCTCACTGCATATTGTGACTCTGATTGGGCTGGTGATACATCCGATCGTCGTTCTACATCTGGCTTTATTGCTTTTCTTGGTTCCAGCCCCATCTCCTGGT
CTGCTAAAAAGCAACCGACAGTTTCCCGCTCTTCTACAGAAGCTGAATATCGATCACTAGCAACTACAACAGCTGACTTGTATTGGATACGTCAACTTTTGTGCGATCTT
CATATACCTCTGACTACTCCTCCCACGTTATGGTGTGACAATGTATCAGCCATTTCTCTTGCCAACAATCCAGTTTTCCATGCTCGCACAAAACACATCGAGATAGACTA
CCATTTCGTTCGAGAGAAAGTGGTGCGAAAGGATATTTCTATGGCAGCATCGAGATTAGGATCCAAGAAGGAGAAAAATGGCTGGCCAGGAGGCGGACGCAGTCCTTGTG
CAATGCCTAATCCACTTGGTACAAATAGGAGGGTAAAGCTCCTGAAAAAACAATACGTGGGAATTGCTGAGATGATGGGACCAACTTGTAGTGGGTTTGGGTGGAACGAG
GAGAGGAAGTGCATCGAGGCAGAGAAGAAGATCTTTGATGCGTGGGTTGAGTGTTTGGGAAGGACAATGCAAGAAGGGGGAGCTTGCACTCCTATTGAGCTTACACCAGA
ACCGAAACCGGTAGTCGATCTTGGAGAGGACATGAACGTAGACTATGAAAATTGTTACGTCCCCAGTCCACCTGTTATTGATCCCACATCTGGGGAAGAATTTTGTGGGA
CATTGACTGGCAGAGCAGCTGATGCAGGATTGTTTAAGACACAGCAGAGAGAGAGACGGAAGCAGCGGAGGGGTATTGCGTCGTCTGCACGTCGCTGTCGCCGGAGATAC
GTCTGGAGAGAGGGGTCGACGCGAGTGGGTTTCGCAGGTGGGTTTCTTGCAGGTGGGTTTCGTGGGTTTCACAGGTGGGTTTCAGAGAGAGAGACGACGACGGTGGAGGG
GTGCTGCGTCGTTGCTGTCGCGCGTAGATGGCGTCTGGAGGGAGGGGTTGGTCGGCGATTAGTGACTGCAAGGAGAAGAAAGAAAAAAGCGAAAGAAAAGAAGAAGAGAA
AAAATGGAGTAAGGGAAGAAGAAATGAGAAGAAAAAAAAAAAAAAAGGTCGCCGGCGACCGGCGGTGGCGGCGGCAGTGGCCGCCGGTCGCCGGACATAAGAGGAAGAAG
AATAAGGAAGAGGTGAAGAGGAAGAAGATGAAGGAGAAGAAAGAGGAGGAGAGAGAGGGATAA
Protein sequence
MASSSTTTTVNSSSMSSPLFLLSNICNLVPLRLDSTNYVLWKFQISSILKAHSLFGHIDDSLPQPRKFIRSSTQTDTSDIDNTNTNTNTNTNTSDTDTATEISPDYLQWV
ARDQALITLINATLSPSALAHVVGTASAKELWKSIKDLVDRLAAASITIDDEEILVHTLNGLPDEFGAFRTSIRTRSGSLSLEELHALLDAEEKTLMAAAGEEDDNLRGR
ANHFGQYHGWSWSNQLGILGSGLNSGQSIGSGAALGIKSLEKLYTGASINGLYPIPSPSALSSSERKHRHIVETAMSLLFHASVPLEFWPYAFSTAVFLINRMSSSSLNM
SSPFETLFGYTPDLHHLKVFGDPTPLLTLLNLSQTPPQNPYSVIHTQLPTVDNSSPSTVELLDNVTCADSLCQNADTVHSMPTQTEPASNAPIDGSSLQPTNTHSMQTRA
KSGIFKPKAFLSTMMAFVPTDPSSFTEASKYLEWRNAMCEEFNALQEQGTWTLVPRLPSMNVVGCKWVFRTKYHPDGSIARHKAAWLQKVITKLKLDVKNAFLHGHLEEE
VYMSQPSGFLDKSCPTSVCLLHKSLYGLKQAPRAWFDRFTSDLLTLGFIASAADPSCLYVLEIFPWLEVHSSADGIFVNQAKYLNDLLHTSGMTSAKSCLTPMSTTIDLY
ASAPLFNDATLYRQLVGSLQYLTFTRPDITFAANRVSDMSLTAYCDSDWAGDTSDRRSTSGFIAFLGSSPISWSAKKQPTVSRSSTEAEYRSLATTTADLYWIRQLLCDL
HIPLTTPPTLWCDNVSAISLANNPVFHARTKHIEIDYHFVREKVVRKDISMAASRLGSKKEKNGWPGGGRSPCAMPNPLGTNRRVKLLKKQYVGIAEMMGPTCSGFGWNE
ERKCIEAEKKIFDAWVECLGRTMQEGGACTPIELTPEPKPVVDLGEDMNVDYENCYVPSPPVIDPTSGEEFCGTLTGRAADAGLFKTQQRERRKQRRGIASSARRCRRRY
VWREGSTRVGFAGGFLAGGFRGFHRWVSERETTTVEGCCVVAVARRWRLEGGVGRRLVTARRRKKKAKEKKKRKNGVREEEMRRKKKKKVAGDRRWRRQWPPVAGHKRKK
NKEEVKRKKMKEKKEEEREG