; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018094 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018094
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr5:15997413..16000061
RNA-Seq ExpressionLag0018094
SyntenyLag0018094
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU44375.1 hypothetical protein TSUD_243070 [Trifolium subterraneum]3.3e-7743.75Show/hide
Query:  EPKNYKMALSFPHWKDAMEEEMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPV--------------
        EPK YK AL + +W+ AM++E+ AL  NNTW LVQRP D NV+ SKW+F+TK   DG+I+R+KARLVA+GYTQI GL++ ET+SPV              
Subjt:  EPKNYKMALSFPHWKDAMEEEMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPV--------------

Query:  --------------------------------------NQTNNNQRQIY---------------------------------------LTLMLNYVDEII
                                              N      + +Y                                        TL+L YVD+II
Subjt:  --------------------------------------NQTNNNQRQIY---------------------------------------LTLMLNYVDEII

Query:  LTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLT
        LTGN  S I  L+  L  +F L+ LGQL  FL IE+KH   GI +SQ KYA ++L ++ MLG + INTPI + P +   D  P DA EYR + G LQYLT
Subjt:  LTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLT

Query:  LTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLT-RSTTGFCVFLRSNCIS
         TRPD+ +AVN VCQH Q P  KDL+AVKRILRYI+GTL + +     +SLNL AFCDADW GCP T RSTTGFC++L S+CIS
Subjt:  LTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLT-RSTTGFCVFLRSNCIS

PNY16899.1 copia-like polyprotein, partial [Trifolium pratense]2.0e-7443.23Show/hide
Query:  EPKNYKMALSFPHWKDAMEEEMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPV--------------
        EPK YK AL + +W++AM+EE+ AL  NNTW LV RP D NVV SKW+F+TK   DG+I+R+KARLVA+GYTQI GL++ ET+SPV              
Subjt:  EPKNYKMALSFPHWKDAMEEEMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPV--------------

Query:  --------------------------------------NQTNNNQRQIY---------------------------------------LTLMLNYVDEII
                                              N      + +Y                                        TL+L YVD+II
Subjt:  --------------------------------------NQTNNNQRQIY---------------------------------------LTLMLNYVDEII

Query:  LTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLT
        LTGN  S I  LI  L  +F L+ LGQL  FL IE+K+   GI +SQ KYA ++L ++ ML  + INTPI   P     D +  DA EYR + G LQYLT
Subjt:  LTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLT

Query:  LTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLT-RSTTGFCVFLRSNCIS
         TRPD+ +AVN VCQH Q P  KDL+AVKRILRYI+GTL + +     +SLNL AFCDADW GCP T RSTTGFC++L  +CIS
Subjt:  LTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLT-RSTTGFCVFLRSNCIS

RVW19921.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.5e-7440.6Show/hide
Query:  NNQLSDISKNLHIDLPTLIAGTIPKQGVLAQVNDISSHHMVTRSKLLAKNPVLDPTLAHEISKLKNSRQQRTLIAMSHKYEPKNYKMALSFPHWKDAMEE
        ++++ D SK + +D+        P QG   Q  D    HM+TRSKL  KN   DP+L  ++     +R        S   EPK Y+ AL  PHW  AM+E
Subjt:  NNQLSDISKNLHIDLPTLIAGTIPKQGVLAQVNDISSHHMVTRSKLLAKNPVLDPTLAHEISKLKNSRQQRTLIAMSHKYEPKNYKMALSFPHWKDAMEE

Query:  EMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPVNQTNN----------------------------N
        E+KAL+ N TW LV RP  TN+V SKW+FKTK K DGTI+RYKARLVA+G++QI GL++ ET+SPV +                                
Subjt:  EMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPVNQTNN----------------------------N

Query:  QRQIYLTLMLNYVDE------------------------------IILTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYARE
        + ++++     +++E                                + GN+++ I  LI TL S+F L+ LG L  FL +EVK+   G+ +SQ KY R+
Subjt:  QRQIYLTLMLNYVDE------------------------------IILTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYARE

Query:  ILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLTLTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNL
        +L  + M+    INTP+      T  D QP D  +YR +VG LQYLT TRPDIV+AVN  CQH Q P   DL+AVKRILRY++GT+++ I  +K +SL L
Subjt:  ILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLTLTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNL

Query:  YAFCDADWGGCPLT-RSTTGFCVFLRSNCIS
          FCDADW GC  T RST+G+C+FL +NCIS
Subjt:  YAFCDADWGGCPLT-RSTTGFCVFLRSNCIS

RVW43526.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]8.1e-7639.66Show/hide
Query:  NNQLSDISKNLHIDLPTLIAGTIPKQGVLAQVNDISSHHMVTRSKLLAKNPVLDPTLAHEISKLKNSRQQRTLIAMSHKYEPKNYKMALSFPHWKDAMEE
        ++++ D SK + +D+        P QG   Q  D    HM+TRSKL  KN   DP+L  ++     +R        S   EPK Y+  L  PHW  AM+E
Subjt:  NNQLSDISKNLHIDLPTLIAGTIPKQGVLAQVNDISSHHMVTRSKLLAKNPVLDPTLAHEISKLKNSRQQRTLIAMSHKYEPKNYKMALSFPHWKDAMEE

Query:  EMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPV----------------------------------
        E+KAL+ N TW LV RP  TN+V SKW+FKTK K DGTI+RYKARLVA+G++QI GL++ ET+SPV                                  
Subjt:  EMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPV----------------------------------

Query:  ------------------NQTNNNQRQIY---------------------------------------LTLMLNYVDEIILTGNNSSHIQQLIGTLRSQF
                          N      R +Y                                       + L+L YVD+II+TGN+++ I  LI TL S+F
Subjt:  ------------------NQTNNNQRQIY---------------------------------------LTLMLNYVDEIILTGNNSSHIQQLIGTLRSQF

Query:  CLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLTLTRPDIVYAVNCVCQHLQQP
         L+ LG L  FL +EVK+   G+ +SQ KY R++L  + M+    INTP+      T  D QP D  +YR +VG LQYLT TRPDIV+AVN  CQH Q P
Subjt:  CLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLTLTRPDIVYAVNCVCQHLQQP

Query:  IVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLT-RSTTGFCVFLRSNCIS
           DL+AVKRILRY++GT+++ I  +K +SL L  FCDADW GC  T RST+G+C+FL +NCIS
Subjt:  IVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLT-RSTTGFCVFLRSNCIS

RVX04589.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]6.2e-7639.66Show/hide
Query:  NNQLSDISKNLHIDLPTLIAGTIPKQGVLAQVNDISSHHMVTRSKLLAKNPVLDPTLAHEISKLKNSRQQRTLIAMSHKYEPKNYKMALSFPHWKDAMEE
        ++++ D SK + +D+        P QG   Q  D    HM+TRSKL  KN   DP+L  ++     +R        S   EPK Y+  L  PHW  AM+E
Subjt:  NNQLSDISKNLHIDLPTLIAGTIPKQGVLAQVNDISSHHMVTRSKLLAKNPVLDPTLAHEISKLKNSRQQRTLIAMSHKYEPKNYKMALSFPHWKDAMEE

Query:  EMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPV----------------------------------
        E+KAL+ N TW LV RP  TN+V SKW+FKTK K DGTI+RYKARLVA+G++QI GL++ ET+SPV                                  
Subjt:  EMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPV----------------------------------

Query:  ------------------NQTNNNQRQIY---------------------------------------LTLMLNYVDEIILTGNNSSHIQQLIGTLRSQF
                          N      R +Y                                       + L+L YVD+II+TGN+++ I  LI TL S+F
Subjt:  ------------------NQTNNNQRQIY---------------------------------------LTLMLNYVDEIILTGNNSSHIQQLIGTLRSQF

Query:  CLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLTLTRPDIVYAVNCVCQHLQQP
         L+ LG L  FL +EVK+   G+ +SQ KY R++L  + M+    INTP+      T  D QP D  +YR +VG LQYLT TRPDIV+AVN  CQH Q P
Subjt:  CLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLTLTRPDIVYAVNCVCQHLQQP

Query:  IVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLT-RSTTGFCVFLRSNCIS
           DL+AVKRILRY++GT+++ I  +K +SL L  FCDADW GC  T RST+G+C+FL +NCIS
Subjt:  IVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLT-RSTTGFCVFLRSNCIS

TrEMBL top hitse value%identityAlignment
A0A2K3PNP5 Copia-like polyprotein (Fragment)9.6e-7543.23Show/hide
Query:  EPKNYKMALSFPHWKDAMEEEMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPV--------------
        EPK YK AL + +W++AM+EE+ AL  NNTW LV RP D NVV SKW+F+TK   DG+I+R+KARLVA+GYTQI GL++ ET+SPV              
Subjt:  EPKNYKMALSFPHWKDAMEEEMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPV--------------

Query:  --------------------------------------NQTNNNQRQIY---------------------------------------LTLMLNYVDEII
                                              N      + +Y                                        TL+L YVD+II
Subjt:  --------------------------------------NQTNNNQRQIY---------------------------------------LTLMLNYVDEII

Query:  LTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLT
        LTGN  S I  LI  L  +F L+ LGQL  FL IE+K+   GI +SQ KYA ++L ++ ML  + INTPI   P     D +  DA EYR + G LQYLT
Subjt:  LTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLT

Query:  LTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLT-RSTTGFCVFLRSNCIS
         TRPD+ +AVN VCQH Q P  KDL+AVKRILRYI+GTL + +     +SLNL AFCDADW GCP T RSTTGFC++L  +CIS
Subjt:  LTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLT-RSTTGFCVFLRSNCIS

A0A2Z6P7T0 Reverse transcriptase Ty1/copia-type domain-containing protein1.6e-7743.75Show/hide
Query:  EPKNYKMALSFPHWKDAMEEEMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPV--------------
        EPK YK AL + +W+ AM++E+ AL  NNTW LVQRP D NV+ SKW+F+TK   DG+I+R+KARLVA+GYTQI GL++ ET+SPV              
Subjt:  EPKNYKMALSFPHWKDAMEEEMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPV--------------

Query:  --------------------------------------NQTNNNQRQIY---------------------------------------LTLMLNYVDEII
                                              N      + +Y                                        TL+L YVD+II
Subjt:  --------------------------------------NQTNNNQRQIY---------------------------------------LTLMLNYVDEII

Query:  LTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLT
        LTGN  S I  L+  L  +F L+ LGQL  FL IE+KH   GI +SQ KYA ++L ++ MLG + INTPI + P +   D  P DA EYR + G LQYLT
Subjt:  LTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLT

Query:  LTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLT-RSTTGFCVFLRSNCIS
         TRPD+ +AVN VCQH Q P  KDL+AVKRILRYI+GTL + +     +SLNL AFCDADW GCP T RSTTGFC++L S+CIS
Subjt:  LTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLT-RSTTGFCVFLRSNCIS

A0A438C9J9 Retrovirus-related Pol polyprotein from transposon RE17.4e-7540.6Show/hide
Query:  NNQLSDISKNLHIDLPTLIAGTIPKQGVLAQVNDISSHHMVTRSKLLAKNPVLDPTLAHEISKLKNSRQQRTLIAMSHKYEPKNYKMALSFPHWKDAMEE
        ++++ D SK + +D+        P QG   Q  D    HM+TRSKL  KN   DP+L  ++     +R        S   EPK Y+ AL  PHW  AM+E
Subjt:  NNQLSDISKNLHIDLPTLIAGTIPKQGVLAQVNDISSHHMVTRSKLLAKNPVLDPTLAHEISKLKNSRQQRTLIAMSHKYEPKNYKMALSFPHWKDAMEE

Query:  EMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPVNQTNN----------------------------N
        E+KAL+ N TW LV RP  TN+V SKW+FKTK K DGTI+RYKARLVA+G++QI GL++ ET+SPV +                                
Subjt:  EMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPVNQTNN----------------------------N

Query:  QRQIYLTLMLNYVDE------------------------------IILTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYARE
        + ++++     +++E                                + GN+++ I  LI TL S+F L+ LG L  FL +EVK+   G+ +SQ KY R+
Subjt:  QRQIYLTLMLNYVDE------------------------------IILTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYARE

Query:  ILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLTLTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNL
        +L  + M+    INTP+      T  D QP D  +YR +VG LQYLT TRPDIV+AVN  CQH Q P   DL+AVKRILRY++GT+++ I  +K +SL L
Subjt:  ILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLTLTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNL

Query:  YAFCDADWGGCPLT-RSTTGFCVFLRSNCIS
          FCDADW GC  T RST+G+C+FL +NCIS
Subjt:  YAFCDADWGGCPLT-RSTTGFCVFLRSNCIS

A0A438E6Z5 Retrovirus-related Pol polyprotein from transposon TNT 1-943.9e-7639.66Show/hide
Query:  NNQLSDISKNLHIDLPTLIAGTIPKQGVLAQVNDISSHHMVTRSKLLAKNPVLDPTLAHEISKLKNSRQQRTLIAMSHKYEPKNYKMALSFPHWKDAMEE
        ++++ D SK + +D+        P QG   Q  D    HM+TRSKL  KN   DP+L  ++     +R        S   EPK Y+  L  PHW  AM+E
Subjt:  NNQLSDISKNLHIDLPTLIAGTIPKQGVLAQVNDISSHHMVTRSKLLAKNPVLDPTLAHEISKLKNSRQQRTLIAMSHKYEPKNYKMALSFPHWKDAMEE

Query:  EMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPV----------------------------------
        E+KAL+ N TW LV RP  TN+V SKW+FKTK K DGTI+RYKARLVA+G++QI GL++ ET+SPV                                  
Subjt:  EMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPV----------------------------------

Query:  ------------------NQTNNNQRQIY---------------------------------------LTLMLNYVDEIILTGNNSSHIQQLIGTLRSQF
                          N      R +Y                                       + L+L YVD+II+TGN+++ I  LI TL S+F
Subjt:  ------------------NQTNNNQRQIY---------------------------------------LTLMLNYVDEIILTGNNSSHIQQLIGTLRSQF

Query:  CLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLTLTRPDIVYAVNCVCQHLQQP
         L+ LG L  FL +EVK+   G+ +SQ KY R++L  + M+    INTP+      T  D QP D  +YR +VG LQYLT TRPDIV+AVN  CQH Q P
Subjt:  CLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLTLTRPDIVYAVNCVCQHLQQP

Query:  IVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLT-RSTTGFCVFLRSNCIS
           DL+AVKRILRY++GT+++ I  +K +SL L  FCDADW GC  T RST+G+C+FL +NCIS
Subjt:  IVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLT-RSTTGFCVFLRSNCIS

A0A438J6K3 Retrovirus-related Pol polyprotein from transposon RE13.0e-7639.66Show/hide
Query:  NNQLSDISKNLHIDLPTLIAGTIPKQGVLAQVNDISSHHMVTRSKLLAKNPVLDPTLAHEISKLKNSRQQRTLIAMSHKYEPKNYKMALSFPHWKDAMEE
        ++++ D SK + +D+        P QG   Q  D    HM+TRSKL  KN   DP+L  ++     +R        S   EPK Y+  L  PHW  AM+E
Subjt:  NNQLSDISKNLHIDLPTLIAGTIPKQGVLAQVNDISSHHMVTRSKLLAKNPVLDPTLAHEISKLKNSRQQRTLIAMSHKYEPKNYKMALSFPHWKDAMEE

Query:  EMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPV----------------------------------
        E+KAL+ N TW LV RP  TN+V SKW+FKTK K DGTI+RYKARLVA+G++QI GL++ ET+SPV                                  
Subjt:  EMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPV----------------------------------

Query:  ------------------NQTNNNQRQIY---------------------------------------LTLMLNYVDEIILTGNNSSHIQQLIGTLRSQF
                          N      R +Y                                       + L+L YVD+II+TGN+++ I  LI TL S+F
Subjt:  ------------------NQTNNNQRQIY---------------------------------------LTLMLNYVDEIILTGNNSSHIQQLIGTLRSQF

Query:  CLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLTLTRPDIVYAVNCVCQHLQQP
         L+ LG L  FL +EVK+   G+ +SQ KY R++L  + M+    INTP+      T  D QP D  +YR +VG LQYLT TRPDIV+AVN  CQH Q P
Subjt:  CLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLTLTRPDIVYAVNCVCQHLQQP

Query:  IVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLT-RSTTGFCVFLRSNCIS
           DL+AVKRILRY++GT+++ I  +K +SL L  FCDADW GC  T RST+G+C+FL +NCIS
Subjt:  IVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLT-RSTTGFCVFLRSNCIS

SwissProt top hitse value%identityAlignment
P04146 Copia protein6.5e-2824.73Show/hide
Query:  WKDAMEEEMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPVNQTNN----------------------
        W++A+  E+ A  +NNTW + +RP++ N+V+S+W+F  K    G   RYKARLVA+G+TQ   ++YEET++PV + ++                      
Subjt:  WKDAMEEEMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPVNQTNN----------------------

Query:  ------NQRQIYLTL---------------------------------------------------------------MLNYVDEIILTGNNSSHIQQLI
               + +IY+ L                                                               +L YVD++++   + + +    
Subjt:  ------NQRQIYLTL---------------------------------------------------------------MLNYVDEIILTGNNSSHIQQLI

Query:  GTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLTL-TRPDIVYAVNC
          L  +F +  L ++  F+ I ++     I LSQ  Y ++IL+K  M    A++TP+ +       ++        RS++G L Y+ L TRPD+  AVN 
Subjt:  GTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLTL-TRPDIVYAVNC

Query:  VCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSL--NLYAFCDADWGGCPLTR-STTGF
        + ++  +   +  + +KR+LRY++GT+D  +   KN +    +  + D+DW G  + R STTG+
Subjt:  VCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSL--NLYAFCDADWGGCPLTR-STTGF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.7e-2426.06Show/hide
Query:  PTLAHEISKLKNSRQQRT-LIAMSHKYEPKNYKMALSFPHWKD---AMEEEMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQG
        P    E  ++++ R   T  + +S   EP++ K  LS P       AM+EEM++L  N T+ LV+ P+    ++ KW+FK K  GD  + RYKARLV +G
Subjt:  PTLAHEISKLKNSRQQRT-LIAMSHKYEPKNYKMALSFPHWKD---AMEEEMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQG

Query:  YTQIEGLNYEETYSPVNQTNNNQ--------------------------------------------------------------RQIYL----------
        + Q +G++++E +SPV +  + +                                                              RQ Y+          
Subjt:  YTQIEGLNYEETYSPVNQTNNNQ--------------------------------------------------------------RQIYL----------

Query:  --------------------TLMLNYVDEIILTGNNSSHIQQLIGTLRSQFCLERLG--QLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAIN
                             ++L YVD++++ G +   I +L G L   F ++ LG  Q  L + I  + T++ + LSQ KY   +L +  M     ++
Subjt:  --------------------TLMLNYVDEIILTGNNSSHIQQLIGTLRSQFCLERLG--QLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAIN

Query:  TPIMTSPQDTHKDTQPTDAKE--------YRSIVGLLQY-LTLTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCD
        TP +       K   PT  +E        Y S VG L Y +  TRPDI +AV  V + L+ P  +  +AVK ILRY++GT   D   +  +   L  + D
Subjt:  TPIMTSPQDTHKDTQPTDAKE--------YRSIVGLLQY-LTLTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCD

Query:  ADWGG-CPLTRSTTGFCVFLRSNCIS
        AD  G     +S+TG+        IS
Subjt:  ADWGG-CPLTRSTTGFCVFLRSNCIS

P92519 Uncharacterized mitochondrial protein AtMg008105.8e-4545.24Show/hide
Query:  MLNYVDEIILTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRS
        +L YVD+I+LTG++++ +  LI  L S F ++ LG +  FL I++K    G+ LSQ KYA +IL  +GML    ++TP+      +    +  D  ++RS
Subjt:  MLNYVDEIILTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRS

Query:  IVGLLQYLTLTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLT-RSTTGFCVFLRSNCISSLQESRD
        IVG LQYLTLTRPDI YAVN VCQ + +P + D   +KR+LRY++GT+ + + ++KN+ LN+ AFCD+DW GC  T RSTTGFC FL  N IS   + + 
Subjt:  IVGLLQYLTLTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLT-RSTTGFCVFLRSNCISSLQESRD

Query:  ----SSTQTD
            SST+T+
Subjt:  ----SSTQTD

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.1e-4628.49Show/hide
Query:  SPVNQTNNNQLSDISKNLHIDLPTLIAGTIPKQGVLAQVNDISSHHMVTRSKLLAKNPVLDPTLAHEISKLKNSRQQRTLIAMSHKYEPKNYKMALSFPH
        SP    +++  S    ++ I  P  +A  +           +++H M TR    AK  ++ P   + ++           ++++ + EP+    AL    
Subjt:  SPVNQTNNNQLSDISKNLHIDLPTLIAGTIPKQGVLAQVNDISSHHMVTRSKLLAKNPVLDPTLAHEISKLKNSRQQRTLIAMSHKYEPKNYKMALSFPH

Query:  WKDAMEEEMKALMLNNTWILV-QRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPVNQTN---------------------N
        W++AM  E+ A + N+TW LV   P    +V  +WIF  K   DG++NRYKARLVA+GY Q  GL+Y ET+SPV ++                      N
Subjt:  WKDAMEEEMKALMLNNTWILV-QRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPVNQTN---------------------N

Query:  N----------------------------------------------------------------------QRQIYLTLMLNYVDEIILTGNNSSHIQQL
        N                                                                      QR   +  ML YVD+I++TGN+ + +   
Subjt:  N----------------------------------------------------------------------QRQIYLTLMLNYVDEIILTGNNSSHIQQL

Query:  IGTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQ-DTHKDTQPTDAKEYRSIVGLLQYLTLTRPDIVYAVN
        +  L  +F ++   +L  FL IE K    G+ LSQ +Y  ++L ++ M+    + TP+  SP+   +  T+ TD  EYR IVG LQYL  TRPDI YAVN
Subjt:  IGTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQ-DTHKDTQPTDAKEYRSIVGLLQYLTLTRPDIVYAVN

Query:  CVCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLTR-STTGFCVFLRSNCISSLQESRDSSTQTDVGKSLFCVGKKETT
         + Q +  P  + L+A+KRILRY+ GT ++ I L K N+L+L+A+ DADW G      ST G+ V+L  + IS   + +    ++        V    + 
Subjt:  CVCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLTR-STTGFCVFLRSNCISSLQESRDSSTQTDVGKSLFCVGKKETT

Query:  MQ
        MQ
Subjt:  MQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.4e-4631.71Show/hide
Query:  PTLIAGTIPKQGVLAQVNDISSHHMVTRSKLLAKNPVLDPTLAHEISKLKNSRQQRTLIAMSHKYEPKNYKMALSFPHWKDAMEEEMKALMLNNTWILV-
        P L A  I +    A VN   +H M TR+K   + P  +   ++  S   NS             EP+    A+    W+ AM  E+ A + N+TW LV 
Subjt:  PTLIAGTIPKQGVLAQVNDISSHHMVTRSKLLAKNPVLDPTLAHEISKLKNSRQQRTLIAMSHKYEPKNYKMALSFPHWKDAMEEEMKALMLNNTWILV-

Query:  QRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPVNQTN---------------------NN---------------------
          P    +V  +WIF  K   DG++NRYKARLVA+GY Q  GL+Y ET+SPV ++                      NN                     
Subjt:  QRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPVNQTN---------------------NN---------------------

Query:  -------------------------------------------------QRQIYLTLMLNYVDEIILTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSI
                                                         QR   +  ML YVD+I++TGN++  ++  +  L  +F ++    L  FL I
Subjt:  -------------------------------------------------QRQIYLTLMLNYVDEIILTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSI

Query:  EVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDT-HKDTQPTDAKEYRSIVGLLQYLTLTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILR
        E K   +G+ LSQ +Y  ++L ++ ML    + TP+ TSP+ T H  T+  D  EYR IVG LQYL  TRPD+ YAVN + Q++  P      A+KR+LR
Subjt:  EVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDT-HKDTQPTDAKEYRSIVGLLQYLTLTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILR

Query:  YIQGTLDYDISLYKNNSLNLYAFCDADWGG-CPLTRSTTGFCVFLRSNCIS
        Y+ GT D+ I L K N+L+L+A+ DADW G      ST G+ V+L  + IS
Subjt:  YIQGTLDYDISLYKNNSLNLYAFCDADWGG-CPLTRSTTGFCVFLRSNCIS

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 88.1e-4229.72Show/hide
Query:  LIAMSHKYEPKNYKMALSFPHWKDAMEEEMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPVNQ----
        L+ ++   EP  Y  A  F  W  AM++E+ A+   +TW +   P +   +  KW++K K   DGTI RYKARLVA+GYTQ EG+++ ET+SPV +    
Subjt:  LIAMSHKYEPKNYKMALSFPHWKDAMEEEMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPVNQ----

Query:  ------------------------TNNNQRQIYLTL----------------------------------------------------------------
                                  +   +IY+ L                                                                
Subjt:  ------------------------TNNNQRQIYLTL----------------------------------------------------------------

Query:  ---MLNYVDEIILTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQ-DTHKDTQPTDAK
           +L YVD+II+  NN + + +L   L+S F L  LG L  FL +E+  +A GI + Q KYA ++L ++G+LG    + P+  S     H      DAK
Subjt:  ---MLNYVDEIILTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQ-DTHKDTQPTDAK

Query:  EYRSIVGLLQYLTLTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLT-RSTTGFCVFLRSNCIS
         YR ++G L YL +TR DI +AVN + Q  + P +   +AV +IL YI+GT+   +       + L  F DA +  C  T RST G+C+FL ++ IS
Subjt:  EYRSIVGLLQYLTLTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLT-RSTTGFCVFLRSNCIS

ATMG00240.1 Gag-Pol-related retrotransposon family protein8.7e-1238.04Show/hide
Query:  YLTLTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLT-RSTTGFCVFLRSNCISSLQES
        YLT+TRPD+ +AVN + Q         ++AV ++L Y++GT+   +     + L L AF D+DW  CP T RS TGFC  +    + +L++S
Subjt:  YLTLTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLT-RSTTGFCVFLRSNCISSLQES

ATMG00810.1 DNA/RNA polymerases superfamily protein4.2e-4645.24Show/hide
Query:  MLNYVDEIILTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRS
        +L YVD+I+LTG++++ +  LI  L S F ++ LG +  FL I++K    G+ LSQ KYA +IL  +GML    ++TP+      +    +  D  ++RS
Subjt:  MLNYVDEIILTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRS

Query:  IVGLLQYLTLTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLT-RSTTGFCVFLRSNCISSLQESRD
        IVG LQYLTLTRPDI YAVN VCQ + +P + D   +KR+LRY++GT+ + + ++KN+ LN+ AFCD+DW GC  T RSTTGFC FL  N IS   + + 
Subjt:  IVGLLQYLTLTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLT-RSTTGFCVFLRSNCISSLQESRD

Query:  ----SSTQTD
            SST+T+
Subjt:  ----SSTQTD

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)6.0e-2152.08Show/hide
Query:  IAMSHKYEPKNYKMALSFPHWKDAMEEEMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPVNQT
        I  + K EPK+   AL  P W  AM+EE+ AL  N TWILV  P + N++  KW+FKTK   DGT++R KARLVA+G+ Q EG+ + ETYSPV +T
Subjt:  IAMSHKYEPKNYKMALSFPHWKDAMEEEMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPVNQT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTGTGCTCAAGGATACACCAGAAGAAACACATACACCAACAACACCAACAGATCAAGACGTCAACACAACTATAATCCTCAATTTGAGACATGATACTCCAATTTT
GCAAGTGAACCAAGGCATATTTGAGACCAATGAAACTGATATATCAAAATTGGGCAAAGACTTCACTCTCTTGGAAAATCAAGTTCATTACCAGGATGGAACATATGCTC
ATGACGATTCCATGACCATGACACCTACGCTCATAACTCCTAATGAAGCAGCTAAGTTATCTGATATTAGTAAAAATCTTCATATTGACTTGCCCACTTTGATTGCAGGA
ACTATACCAAAGCAAGGAGTATTGGCACAAGTCAATGGTTTTAGCTCTCACCACATGGTCACAACAAGCAAGTTAGCAAAGAATCTGGTCCTTGATCCCACTCTTGCACA
TCAAATTAGCAAACTCAAAAACTCAAGACAACAAAGAACACTAATAGCAATGAGTCACAAATCTGAGCCACAAAACTACAAAATGGCCCTATCTTTGCCTCATTGGAAGG
ATGCCATGGATGAAGAAATGAAGGCTTTAATGTTGAATAACACATGGATCCTAGTTCAAAGACCACAAGATACAAATGTAGTAGAGTCAAAGTCGATCTTCAAGATAAAG
TGCAAGGAAGATGGAACGATTAATCGCTACAAGGCTAGACTTGTAGCTCAAGGATACACACAAATTGAGGGTCTTAACTATGAAGAAACTTATAGCCCAGTAAATCAAAC
CAACAACAATCAGTTATCTGATATTAGTAAAAATCTTCATATTGACTTGCCCACTTTGATTGCAGGAACTATACCAAAGCAAGGAGTATTGGCACAAGTCAATGATATTA
GCTCTCACCACATGGTCACAAGAAGCAAGTTATTAGCAAAGAATCCAGTCCTTGATCCCACTCTTGCACATGAAATTAGCAAACTCAAAAACTCAAGACAACAAAGAACA
CTTATAGCAATGAGTCACAAATATGAGCCAAAAAACTACAAAATGGCCCTATCTTTTCCTCATTGGAAGGATGCCATGGAGGAAGAAATGAAGGCTTTAATGTTGAACAA
CACATGGATCCTAGTTCAAAGACCACAAGATACAAATGTAGTAGAGTCAAAGTGGATCTTCAAGACAAAGTGCAAGGGAGATGGAACGATTAATCGCTACAAGGCTAGAC
TTGTAGCTCAAGGATACACACAAATTGAGGGTCTTAACTATGAAGAAACTTATAGCCCAGTAAATCAAACCAACAACAATCAGAGACAAATCTATTTAACACTAATGCTT
AATTACGTTGATGAAATTATATTGACAGGAAACAACTCATCACATATCCAACAACTCATAGGAACACTACGTTCACAATTTTGCCTTGAAAGACTTGGACAACTTACACT
ATTTCTTAGCATAGAAGTGAAGCACACTGCAAAAGGAATTATGCTATCACAAGGAAAATATGCTCGTGAAATACTCACCAAGTCAGGAATGTTGGGAGTTGCTGCCATTA
ATACTCCAATCATGACCTCACCTCAAGATACCCATAAGGACACTCAACCCACTGATGCAAAGGAATACAGGAGCATCGTTGGATTGTTGCAATACCTTACACTCACACGA
CCTGATATAGTATATGCAGTTAATTGTGTATGCCAACACCTACAACAGCCCATCGTTAAGGATCTCAAAGCTGTGAAAAGGATTCTTCGATACATACAAGGAACCCTCGA
CTACGATATCTCTCTTTATAAAAACAATTCACTAAATTTATATGCTTTTTGTGATGCAGATTGGGGTGGCTGCCCTCTTACACGAAGTACTACAGGCTTTTGTGTATTCC
TCAGATCCAATTGTATCTCATCACTACAAGAATCTCGGGATTCGTCGACGCAAACAGACGTCGGCAAATCCCTCTTTTGCGTCGGCAAAAAGGAAACGACGATGCAATCA
TGCGTCGGCTTAAATGTCGCTAAATCCTTGTTGGCATCCAAAAAACTGACGAAAATGGCGGTTCGTGGGCAGAAAGGTACATTTTCGCCAACGCAAACTCTTGCGTTGCC
TAAACCTGTAATAACCGATGCAACATGCGTCGGTTATAATACGGTTTCATTTATCAGATTTTTGAGTTTAGCCCACGCAAGAAGCGTCGGTGAAAGTCATAAATTGCGTC
GGCTACAATACGACTTCATTCCACAGATTTTCGAGATGAGGCCACGAAATTGCGTCGGTGAAAATAAAAATTGCGTCGGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTGTGCTCAAGGATACACCAGAAGAAACACATACACCAACAACACCAACAGATCAAGACGTCAACACAACTATAATCCTCAATTTGAGACATGATACTCCAATTTT
GCAAGTGAACCAAGGCATATTTGAGACCAATGAAACTGATATATCAAAATTGGGCAAAGACTTCACTCTCTTGGAAAATCAAGTTCATTACCAGGATGGAACATATGCTC
ATGACGATTCCATGACCATGACACCTACGCTCATAACTCCTAATGAAGCAGCTAAGTTATCTGATATTAGTAAAAATCTTCATATTGACTTGCCCACTTTGATTGCAGGA
ACTATACCAAAGCAAGGAGTATTGGCACAAGTCAATGGTTTTAGCTCTCACCACATGGTCACAACAAGCAAGTTAGCAAAGAATCTGGTCCTTGATCCCACTCTTGCACA
TCAAATTAGCAAACTCAAAAACTCAAGACAACAAAGAACACTAATAGCAATGAGTCACAAATCTGAGCCACAAAACTACAAAATGGCCCTATCTTTGCCTCATTGGAAGG
ATGCCATGGATGAAGAAATGAAGGCTTTAATGTTGAATAACACATGGATCCTAGTTCAAAGACCACAAGATACAAATGTAGTAGAGTCAAAGTCGATCTTCAAGATAAAG
TGCAAGGAAGATGGAACGATTAATCGCTACAAGGCTAGACTTGTAGCTCAAGGATACACACAAATTGAGGGTCTTAACTATGAAGAAACTTATAGCCCAGTAAATCAAAC
CAACAACAATCAGTTATCTGATATTAGTAAAAATCTTCATATTGACTTGCCCACTTTGATTGCAGGAACTATACCAAAGCAAGGAGTATTGGCACAAGTCAATGATATTA
GCTCTCACCACATGGTCACAAGAAGCAAGTTATTAGCAAAGAATCCAGTCCTTGATCCCACTCTTGCACATGAAATTAGCAAACTCAAAAACTCAAGACAACAAAGAACA
CTTATAGCAATGAGTCACAAATATGAGCCAAAAAACTACAAAATGGCCCTATCTTTTCCTCATTGGAAGGATGCCATGGAGGAAGAAATGAAGGCTTTAATGTTGAACAA
CACATGGATCCTAGTTCAAAGACCACAAGATACAAATGTAGTAGAGTCAAAGTGGATCTTCAAGACAAAGTGCAAGGGAGATGGAACGATTAATCGCTACAAGGCTAGAC
TTGTAGCTCAAGGATACACACAAATTGAGGGTCTTAACTATGAAGAAACTTATAGCCCAGTAAATCAAACCAACAACAATCAGAGACAAATCTATTTAACACTAATGCTT
AATTACGTTGATGAAATTATATTGACAGGAAACAACTCATCACATATCCAACAACTCATAGGAACACTACGTTCACAATTTTGCCTTGAAAGACTTGGACAACTTACACT
ATTTCTTAGCATAGAAGTGAAGCACACTGCAAAAGGAATTATGCTATCACAAGGAAAATATGCTCGTGAAATACTCACCAAGTCAGGAATGTTGGGAGTTGCTGCCATTA
ATACTCCAATCATGACCTCACCTCAAGATACCCATAAGGACACTCAACCCACTGATGCAAAGGAATACAGGAGCATCGTTGGATTGTTGCAATACCTTACACTCACACGA
CCTGATATAGTATATGCAGTTAATTGTGTATGCCAACACCTACAACAGCCCATCGTTAAGGATCTCAAAGCTGTGAAAAGGATTCTTCGATACATACAAGGAACCCTCGA
CTACGATATCTCTCTTTATAAAAACAATTCACTAAATTTATATGCTTTTTGTGATGCAGATTGGGGTGGCTGCCCTCTTACACGAAGTACTACAGGCTTTTGTGTATTCC
TCAGATCCAATTGTATCTCATCACTACAAGAATCTCGGGATTCGTCGACGCAAACAGACGTCGGCAAATCCCTCTTTTGCGTCGGCAAAAAGGAAACGACGATGCAATCA
TGCGTCGGCTTAAATGTCGCTAAATCCTTGTTGGCATCCAAAAAACTGACGAAAATGGCGGTTCGTGGGCAGAAAGGTACATTTTCGCCAACGCAAACTCTTGCGTTGCC
TAAACCTGTAATAACCGATGCAACATGCGTCGGTTATAATACGGTTTCATTTATCAGATTTTTGAGTTTAGCCCACGCAAGAAGCGTCGGTGAAAGTCATAAATTGCGTC
GGCTACAATACGACTTCATTCCACAGATTTTCGAGATGAGGCCACGAAATTGCGTCGGTGAAAATAAAAATTGCGTCGGCTAA
Protein sequenceShow/hide protein sequence
MVVLKDTPEETHTPTTPTDQDVNTTIILNLRHDTPILQVNQGIFETNETDISKLGKDFTLLENQVHYQDGTYAHDDSMTMTPTLITPNEAAKLSDISKNLHIDLPTLIAG
TIPKQGVLAQVNGFSSHHMVTTSKLAKNLVLDPTLAHQISKLKNSRQQRTLIAMSHKSEPQNYKMALSLPHWKDAMDEEMKALMLNNTWILVQRPQDTNVVESKSIFKIK
CKEDGTINRYKARLVAQGYTQIEGLNYEETYSPVNQTNNNQLSDISKNLHIDLPTLIAGTIPKQGVLAQVNDISSHHMVTRSKLLAKNPVLDPTLAHEISKLKNSRQQRT
LIAMSHKYEPKNYKMALSFPHWKDAMEEEMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPVNQTNNNQRQIYLTLML
NYVDEIILTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLTLTR
PDIVYAVNCVCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLTRSTTGFCVFLRSNCISSLQESRDSSTQTDVGKSLFCVGKKETTMQS
CVGLNVAKSLLASKKLTKMAVRGQKGTFSPTQTLALPKPVITDATCVGYNTVSFIRFLSLAHARSVGESHKLRRLQYDFIPQIFEMRPRNCVGENKNCVG