; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008951 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008951
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr9:32914556..32916641
RNA-Seq ExpressionLag0008951
SyntenyLag0008951
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]7.2e-9039.18Show/hide
Query:  YDLESFL--KIDPPEQFILTLGASSLAEDTTPTITPNPEYSRWKRQDHIISSWLVGSMSEDILHQ-----------------------------------
        YDLE+FL  + +PP +++++  +SS     + T TPNP Y  WKRQD +ISSWL+GSMSE+IL+Q                                   
Subjt:  YDLESFL--KIDPPEQFILTLGASSLAEDTTPTITPNPEYSRWKRQDHIISSWLVGSMSEDILHQ-----------------------------------

Query:  ----------------QYVDALAAVGKPVDTEDHILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGFKPPEA
                        Q VDALA++ KPV ++DHIL+IL+GLGSDY+SM+SVISA+    SV EVMSLLLTQE++NESKL +  +LPSVN+ T      A
Subjt:  ----------------QYVDALAAVGKPVDTEDHILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGFKPPEA

Query:  KPSKVVSNPYPQNFNGGNRDRGG-GRFGSNRGGRTWNNRGHIQCQLCSKFGHTAQRCYFRYAPSSAPSAPGSFSPAFNQYNRQLVGKPVDTEDHILFILS
        + S + +N    + N     RGG G   SNRG R   NR   QCQ+C+K G++A RC+FRY P S                                   
Subjt:  KPSKVVSNPYPQNFNGGNRDRGG-GRFGSNRGGRTWNNRGHIQCQLCSKFGHTAQRCYFRYAPSSAPSAPGSFSPAFNQYNRQLVGKPVDTEDHILFILS

Query:  GLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGSKLPEAEPPKAPSSATSAPGSFSPAFNQYNRQPAFPQMSALLTAPD
                                                                          +S+  +P S + ++   N     PQMSA++ A D
Subjt:  GLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGSKLPEAEPPKAPSSATSAPGSFSPAFNQYNRQPAFPQMSALLTAPD

Query:  INHDTNWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNCDFSSFSSPI--NRIFHLNNLLHVPSITRNLISVSQFTKDNGVFFEFHPTLC
        +N D+NWYPDSGATNHLTHS  NLS+G+EYGGGNQ++  NG+GLPI +    SF+S     + F LNNLL VPSIT+NLISVSQF KDN VFFEFHPTLC
Subjt:  INHDTNWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNCDFSSFSSPI--NRIFHLNNLLHVPSITRNLISVSQFTKDNGVFFEFHPTLC

Query:  YVKDQASGRVLLQGTLHEGLYRFNLFINPSLPVSEKAAVKALCSTWSSSPTPYVLSVSTMSNLDIWHRRLGHPHIATVKHVL
        YVKD  +G+VLLQG L++GLY+F +  +        +  K + +T        V+  S    LD+WHRRLGHPH+  VK VL
Subjt:  YVKDQASGRVLLQGTLHEGLYRFNLFINPSLPVSEKAAVKALCSTWSSSPTPYVLSVSTMSNLDIWHRRLGHPHIATVKHVL

KZV26181.1 hypothetical protein F511_06348 [Dorcoceras hygrometricum]2.9e-5433.21Show/hide
Query:  NPEYSRWKRQDHIISSWLVGSMSEDILHQQ---------------------------------------------------YVDALAAVGKPVDTEDHIL
        NP +  W RQD ++ S+L+ SMSE    Q                                                    Y+D LAA G  +  +D IL
Subjt:  NPEYSRWKRQDHIISSWLVGSMSEDILHQQ---------------------------------------------------YVDALAAVGKPVDTEDHIL

Query:  FILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLST--PESLPSVNLTTGFKPPEAKPSKVVSNPYPQNFNGGNRDRGGGRFGSNRGGR-
         IL G+G +YES+V  ++++V   S+ EV +LLL  E R E+   T    + PSVN+TT    P  + ++  S   P         RG GR  + RGGR 
Subjt:  FILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLST--PESLPSVNLTTGFKPPEAKPSKVVSNPYPQNFNGGNRDRGGGRFGSNRGGR-

Query:  TWNNRGHIQCQLCSKFGHTAQRCYFRYAPSSAPSAPGSFSPAFNQYNRQLVGKPVDTEDHILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQEN
         W+N G   CQ+C   GH A+ CY+R+     P + G    +  Q+NR                                                    
Subjt:  TWNNRGHIQCQLCSKFGHTAQRCYFRYAPSSAPSAPGSFSPAFNQYNRQLVGKPVDTEDHILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQEN

Query:  RNESKLSTPESLPSVNLTTGSKLPEAEPPKAPSSATSAPGSFSPAFNQYNRQPAFPQMSALLTAPDINHDTNWYPDSGATNHLTHSFGNLSVGTEYGGGN
                  S PS             PP A +S  S   S                            +  WYPDSGA++H+T+  GNLSV +EY GG+
Subjt:  RNESKLSTPESLPSVNLTTGSKLPEAEPPKAPSSATSAPGSFSPAFNQYNRQPAFPQMSALLTAPDINHDTNWYPDSGATNHLTHSFGNLSVGTEYGGGN

Query:  QVHVGNGAGLPILNCDFSSFSS-PINRIFHLNNLLHVPSITRNLISVSQFTKDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLFINPSLPVSE
        +V VGNGAGL I N   S+ +  P +R F L NLLHVP IT+NLISVS+F  DN V+FEFHP+ C VKD A+  VLL+GTLH GLYRFNL    S P+  
Subjt:  QVHVGNGAGLPILNCDFSSFSS-PINRIFHLNNLLHVPSITRNLISVSQFTKDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLFINPSLPVSE

Query:  KAAVKALCSTWS-SSPTPYVLSVSTMSNLDIWHRRLGHPHIATVKHVL
         A +++  S       +P  L  +T   LD WH RLGHP IATVK VL
Subjt:  KAAVKALCSTWS-SSPTPYVLSVSTMSNLDIWHRRLGHPHIATVKHVL

RVW44519.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]3.0e-5932.39Show/hide
Query:  GYDLESFLKIDPPEQFILTLGASSLAEDTTPTITPNPEYSRWKRQDHIISSWLVGSMSEDILHQ----------------------QYVDALAAVGKPVD
        GY LE FL       F        +  D    + PNP++  ++RQDH++ SWL+ S+    L Q                       Y D LA  G  + 
Subjt:  GYDLESFLKIDPPEQFILTLGASSLAEDTTPTITPNPEYSRWKRQDHIISSWLVGSMSEDILHQ----------------------QYVDALAAVGKPVD

Query:  TEDHILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGFKPPEAKPSKVVSNPYP----QNFN--GGNRDRGGG
          DHIL I+ GLG +YES+++VIS+K    S+  V S L+  E R   K+S+ +   SVN T+ +       S   SN YP    QN N  GGN+   G 
Subjt:  TEDHILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGFKPPEAKPSKVVSNPYP----QNFN--GGNRDRGGG

Query:  RFGSNRGGRTWNNRGHI--QCQLCSKFGHTAQRCYFRYAPSSAPSAPGSFSPAFNQYNRQLVGKPVDTEDHILFILSGLGSDYESMVSVISAKVGPTSVH
         F  NRG      +G I  QCQLC+KFGHT  RC++RY P+   + P +                                             GPT   
Subjt:  RFGSNRGGRTWNNRGHI--QCQLCSKFGHTAQRCYFRYAPSSAPSAPGSFSPAFNQYNRQLVGKPVDTEDHILFILSGLGSDYESMVSVISAKVGPTSVH

Query:  EVMSLLLTQENRNESKLSTPESLPSVNLTTGSKLPEAEPPKAPSSATSAPGSFSPAFNQYNRQPAFPQMSALLTAPDINHDTNWYPDSGATNHLTHSFGN
                               P V L +G++          S + S+ G+ +           + +M A++  P+   +  W+PDSGATNH+TH  GN
Subjt:  EVMSLLLTQENRNESKLSTPESLPSVNLTTGSKLPEAEPPKAPSSATSAPGSFSPAFNQYNRQPAFPQMSALLTAPDINHDTNWYPDSGATNHLTHSFGN

Query:  LSVGTEYGGGNQVHVGNGAGLPILNCDFSSF--SSPINRIFHLNNLLHVPSITRNLISVSQFTKDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRF
        L+ G EY G +++H+GNG GL I +   S F  SS  N++  L N+L VP+I +NL+SVSQF +DN V+FEFHP +C+VKD+++  +LLQG LH+GLY+F
Subjt:  LSVGTEYGGGNQVHVGNGAGLPILNCDFSSF--SSPINRIFHLNNLLHVPSITRNLISVSQFTKDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRF

Query:  NLF---------INPSLPVSEKAAVKALCSTWSSSPTPYVLSVSTMSNLDIWHRRLGHPHIATVKHVL
        NL          ++ S   +E     A      +S  P   + S+    D+WH+RLGHP    V  VL
Subjt:  NLF---------INPSLPVSEKAAVKALCSTWSSSPTPYVLSVSTMSNLDIWHRRLGHPHIATVKHVL

RVW60229.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]6.8e-5630.82Show/hide
Query:  GYDLESFLKIDPPEQFILTLGASSLAEDTTPTITPNPEYSRWKRQDHIISSWLVGSMSEDILHQ------------------------------------
        GY LE FL       F        +  D    + PNP++  ++RQDH++ SWL+ S+    L Q                                    
Subjt:  GYDLESFLKIDPPEQFILTLGASSLAEDTTPTITPNPEYSRWKRQDHIISSWLVGSMSEDILHQ------------------------------------

Query:  ---------------QYVDALAAVGKPVDTEDHILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGFKPPEAK
                        Y D LA  G  +   DHIL I+ GLG +YES+++VIS+K    S+  V S L+  E R   K+S+ +   SVN T+ +      
Subjt:  ---------------QYVDALAAVGKPVDTEDHILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGFKPPEAK

Query:  PSKVVSNPYP----QNFN--GGNRDRGGGRFGSNRGGRTWNNRGHI--QCQLCSKFGHTAQRCYFRYAPSSAPSAPGSFSPAFNQYNRQLVGKPVDTEDH
         S   SN YP    QN N  GGN+   G  F  NRG      +G I  QCQLC+KFGHT  RC++RY P+   + P +                      
Subjt:  PSKVVSNPYP----QNFN--GGNRDRGGGRFGSNRGGRTWNNRGHI--QCQLCSKFGHTAQRCYFRYAPSSAPSAPGSFSPAFNQYNRQLVGKPVDTEDH

Query:  ILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGSKLPEAEPPKAPSSATSAPGSFSPAFNQYNRQPAFPQMSA
                               GPT                          P V L +G++          S + S+ G+ +           + +M A
Subjt:  ILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGSKLPEAEPPKAPSSATSAPGSFSPAFNQYNRQPAFPQMSA

Query:  LLTAPDINHDTNWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNCDFSSF--SSPINRIFHLNNLLHVPSITRNLISVSQFTKDNGVFFE
        ++  P+   +  W+PDSGATNH+TH  GNL+ G EY G +++H+GNG GL I +   S F  SS  N++  L N+L VP+I +NL+SVSQF +DN V+FE
Subjt:  LLTAPDINHDTNWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNCDFSSF--SSPINRIFHLNNLLHVPSITRNLISVSQFTKDNGVFFE

Query:  FHPTLCYVKDQASGRVLLQGTLHEGLYRFNLF---------INPSLPVSEKAAVKALCSTWSSSPTPYVLSVSTMSNLDIWHRRLGHPHIATVKHVL
        FHP +C+VKD+++  +LLQG LH+GLY+FNL          ++ S   +E     A      +S  P   + S+    D+WH+RLGHP    V  VL
Subjt:  FHPTLCYVKDQASGRVLLQGTLHEGLYRFNLF---------INPSLPVSEKAAVKALCSTWSSSPTPYVLSVSTMSNLDIWHRRLGHPHIATVKHVL

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]7.2e-9039.18Show/hide
Query:  YDLESFL--KIDPPEQFILTLGASSLAEDTTPTITPNPEYSRWKRQDHIISSWLVGSMSEDILHQ-----------------------------------
        YDLE+FL  + +PP +++++  +SS     + T TPNP Y  WKRQD +ISSWL+GSMSE+IL+Q                                   
Subjt:  YDLESFL--KIDPPEQFILTLGASSLAEDTTPTITPNPEYSRWKRQDHIISSWLVGSMSEDILHQ-----------------------------------

Query:  ----------------QYVDALAAVGKPVDTEDHILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGFKPPEA
                        Q VDALA++ KPV ++DHIL+IL+GLGSDY+SM+SVISA+    SV EVMSLLLTQE++NESKL +  +LPSVN+ T      A
Subjt:  ----------------QYVDALAAVGKPVDTEDHILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGFKPPEA

Query:  KPSKVVSNPYPQNFNGGNRDRGG-GRFGSNRGGRTWNNRGHIQCQLCSKFGHTAQRCYFRYAPSSAPSAPGSFSPAFNQYNRQLVGKPVDTEDHILFILS
        + S + +N    + N     RGG G   SNRG R   NR   QCQ+C+K G++A RC+FRY P S                                   
Subjt:  KPSKVVSNPYPQNFNGGNRDRGG-GRFGSNRGGRTWNNRGHIQCQLCSKFGHTAQRCYFRYAPSSAPSAPGSFSPAFNQYNRQLVGKPVDTEDHILFILS

Query:  GLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGSKLPEAEPPKAPSSATSAPGSFSPAFNQYNRQPAFPQMSALLTAPD
                                                                          +S+  +P S + ++   N     PQMSA++ A D
Subjt:  GLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGSKLPEAEPPKAPSSATSAPGSFSPAFNQYNRQPAFPQMSALLTAPD

Query:  INHDTNWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNCDFSSFSSPI--NRIFHLNNLLHVPSITRNLISVSQFTKDNGVFFEFHPTLC
        +N D+NWYPDSGATNHLTHS  NLS+G+EYGGGNQ++  NG+GLPI +    SF+S     + F LNNLL VPSIT+NLISVSQF KDN VFFEFHPTLC
Subjt:  INHDTNWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNCDFSSFSSPI--NRIFHLNNLLHVPSITRNLISVSQFTKDNGVFFEFHPTLC

Query:  YVKDQASGRVLLQGTLHEGLYRFNLFINPSLPVSEKAAVKALCSTWSSSPTPYVLSVSTMSNLDIWHRRLGHPHIATVKHVL
        YVKD  +G+VLLQG L++GLY+F +  +        +  K + +T        V+  S    LD+WHRRLGHPH+  VK VL
Subjt:  YVKDQASGRVLLQGTLHEGLYRFNLFINPSLPVSEKAAVKALCSTWSSSPTPYVLSVSTMSNLDIWHRRLGHPHIATVKHVL

TrEMBL top hitse value%identityAlignment
A0A2Z7AWA7 Integrase catalytic domain-containing protein1.4e-5433.21Show/hide
Query:  NPEYSRWKRQDHIISSWLVGSMSEDILHQQ---------------------------------------------------YVDALAAVGKPVDTEDHIL
        NP +  W RQD ++ S+L+ SMSE    Q                                                    Y+D LAA G  +  +D IL
Subjt:  NPEYSRWKRQDHIISSWLVGSMSEDILHQQ---------------------------------------------------YVDALAAVGKPVDTEDHIL

Query:  FILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLST--PESLPSVNLTTGFKPPEAKPSKVVSNPYPQNFNGGNRDRGGGRFGSNRGGR-
         IL G+G +YES+V  ++++V   S+ EV +LLL  E R E+   T    + PSVN+TT    P  + ++  S   P         RG GR  + RGGR 
Subjt:  FILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLST--PESLPSVNLTTGFKPPEAKPSKVVSNPYPQNFNGGNRDRGGGRFGSNRGGR-

Query:  TWNNRGHIQCQLCSKFGHTAQRCYFRYAPSSAPSAPGSFSPAFNQYNRQLVGKPVDTEDHILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQEN
         W+N G   CQ+C   GH A+ CY+R+     P + G    +  Q+NR                                                    
Subjt:  TWNNRGHIQCQLCSKFGHTAQRCYFRYAPSSAPSAPGSFSPAFNQYNRQLVGKPVDTEDHILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQEN

Query:  RNESKLSTPESLPSVNLTTGSKLPEAEPPKAPSSATSAPGSFSPAFNQYNRQPAFPQMSALLTAPDINHDTNWYPDSGATNHLTHSFGNLSVGTEYGGGN
                  S PS             PP A +S  S   S                            +  WYPDSGA++H+T+  GNLSV +EY GG+
Subjt:  RNESKLSTPESLPSVNLTTGSKLPEAEPPKAPSSATSAPGSFSPAFNQYNRQPAFPQMSALLTAPDINHDTNWYPDSGATNHLTHSFGNLSVGTEYGGGN

Query:  QVHVGNGAGLPILNCDFSSFSS-PINRIFHLNNLLHVPSITRNLISVSQFTKDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLFINPSLPVSE
        +V VGNGAGL I N   S+ +  P +R F L NLLHVP IT+NLISVS+F  DN V+FEFHP+ C VKD A+  VLL+GTLH GLYRFNL    S P+  
Subjt:  QVHVGNGAGLPILNCDFSSFSS-PINRIFHLNNLLHVPSITRNLISVSQFTKDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLFINPSLPVSE

Query:  KAAVKALCSTWS-SSPTPYVLSVSTMSNLDIWHRRLGHPHIATVKHVL
         A +++  S       +P  L  +T   LD WH RLGHP IATVK VL
Subjt:  KAAVKALCSTWS-SSPTPYVLSVSTMSNLDIWHRRLGHPHIATVKHVL

A0A438EA49 Retrovirus-related Pol polyprotein from transposon TNT 1-941.4e-5932.39Show/hide
Query:  GYDLESFLKIDPPEQFILTLGASSLAEDTTPTITPNPEYSRWKRQDHIISSWLVGSMSEDILHQ----------------------QYVDALAAVGKPVD
        GY LE FL       F        +  D    + PNP++  ++RQDH++ SWL+ S+    L Q                       Y D LA  G  + 
Subjt:  GYDLESFLKIDPPEQFILTLGASSLAEDTTPTITPNPEYSRWKRQDHIISSWLVGSMSEDILHQ----------------------QYVDALAAVGKPVD

Query:  TEDHILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGFKPPEAKPSKVVSNPYP----QNFN--GGNRDRGGG
          DHIL I+ GLG +YES+++VIS+K    S+  V S L+  E R   K+S+ +   SVN T+ +       S   SN YP    QN N  GGN+   G 
Subjt:  TEDHILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGFKPPEAKPSKVVSNPYP----QNFN--GGNRDRGGG

Query:  RFGSNRGGRTWNNRGHI--QCQLCSKFGHTAQRCYFRYAPSSAPSAPGSFSPAFNQYNRQLVGKPVDTEDHILFILSGLGSDYESMVSVISAKVGPTSVH
         F  NRG      +G I  QCQLC+KFGHT  RC++RY P+   + P +                                             GPT   
Subjt:  RFGSNRGGRTWNNRGHI--QCQLCSKFGHTAQRCYFRYAPSSAPSAPGSFSPAFNQYNRQLVGKPVDTEDHILFILSGLGSDYESMVSVISAKVGPTSVH

Query:  EVMSLLLTQENRNESKLSTPESLPSVNLTTGSKLPEAEPPKAPSSATSAPGSFSPAFNQYNRQPAFPQMSALLTAPDINHDTNWYPDSGATNHLTHSFGN
                               P V L +G++          S + S+ G+ +           + +M A++  P+   +  W+PDSGATNH+TH  GN
Subjt:  EVMSLLLTQENRNESKLSTPESLPSVNLTTGSKLPEAEPPKAPSSATSAPGSFSPAFNQYNRQPAFPQMSALLTAPDINHDTNWYPDSGATNHLTHSFGN

Query:  LSVGTEYGGGNQVHVGNGAGLPILNCDFSSF--SSPINRIFHLNNLLHVPSITRNLISVSQFTKDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRF
        L+ G EY G +++H+GNG GL I +   S F  SS  N++  L N+L VP+I +NL+SVSQF +DN V+FEFHP +C+VKD+++  +LLQG LH+GLY+F
Subjt:  LSVGTEYGGGNQVHVGNGAGLPILNCDFSSF--SSPINRIFHLNNLLHVPSITRNLISVSQFTKDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRF

Query:  NLF---------INPSLPVSEKAAVKALCSTWSSSPTPYVLSVSTMSNLDIWHRRLGHPHIATVKHVL
        NL          ++ S   +E     A      +S  P   + S+    D+WH+RLGHP    V  VL
Subjt:  NLF---------INPSLPVSEKAAVKALCSTWSSSPTPYVLSVSTMSNLDIWHRRLGHPHIATVKHVL

A0A438FJP6 Retrovirus-related Pol polyprotein from transposon TNT 1-943.3e-5630.82Show/hide
Query:  GYDLESFLKIDPPEQFILTLGASSLAEDTTPTITPNPEYSRWKRQDHIISSWLVGSMSEDILHQ------------------------------------
        GY LE FL       F        +  D    + PNP++  ++RQDH++ SWL+ S+    L Q                                    
Subjt:  GYDLESFLKIDPPEQFILTLGASSLAEDTTPTITPNPEYSRWKRQDHIISSWLVGSMSEDILHQ------------------------------------

Query:  ---------------QYVDALAAVGKPVDTEDHILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGFKPPEAK
                        Y D LA  G  +   DHIL I+ GLG +YES+++VIS+K    S+  V S L+  E R   K+S+ +   SVN T+ +      
Subjt:  ---------------QYVDALAAVGKPVDTEDHILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGFKPPEAK

Query:  PSKVVSNPYP----QNFN--GGNRDRGGGRFGSNRGGRTWNNRGHI--QCQLCSKFGHTAQRCYFRYAPSSAPSAPGSFSPAFNQYNRQLVGKPVDTEDH
         S   SN YP    QN N  GGN+   G  F  NRG      +G I  QCQLC+KFGHT  RC++RY P+   + P +                      
Subjt:  PSKVVSNPYP----QNFN--GGNRDRGGGRFGSNRGGRTWNNRGHI--QCQLCSKFGHTAQRCYFRYAPSSAPSAPGSFSPAFNQYNRQLVGKPVDTEDH

Query:  ILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGSKLPEAEPPKAPSSATSAPGSFSPAFNQYNRQPAFPQMSA
                               GPT                          P V L +G++          S + S+ G+ +           + +M A
Subjt:  ILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGSKLPEAEPPKAPSSATSAPGSFSPAFNQYNRQPAFPQMSA

Query:  LLTAPDINHDTNWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNCDFSSF--SSPINRIFHLNNLLHVPSITRNLISVSQFTKDNGVFFE
        ++  P+   +  W+PDSGATNH+TH  GNL+ G EY G +++H+GNG GL I +   S F  SS  N++  L N+L VP+I +NL+SVSQF +DN V+FE
Subjt:  LLTAPDINHDTNWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNCDFSSF--SSPINRIFHLNNLLHVPSITRNLISVSQFTKDNGVFFE

Query:  FHPTLCYVKDQASGRVLLQGTLHEGLYRFNLF---------INPSLPVSEKAAVKALCSTWSSSPTPYVLSVSTMSNLDIWHRRLGHPHIATVKHVL
        FHP +C+VKD+++  +LLQG LH+GLY+FNL          ++ S   +E     A      +S  P   + S+    D+WH+RLGHP    V  VL
Subjt:  FHPTLCYVKDQASGRVLLQGTLHEGLYRFNLF---------INPSLPVSEKAAVKALCSTWSSSPTPYVLSVSTMSNLDIWHRRLGHPHIATVKHVL

A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-943.5e-9039.18Show/hide
Query:  YDLESFL--KIDPPEQFILTLGASSLAEDTTPTITPNPEYSRWKRQDHIISSWLVGSMSEDILHQ-----------------------------------
        YDLE+FL  + +PP +++++  +SS     + T TPNP Y  WKRQD +ISSWL+GSMSE+IL+Q                                   
Subjt:  YDLESFL--KIDPPEQFILTLGASSLAEDTTPTITPNPEYSRWKRQDHIISSWLVGSMSEDILHQ-----------------------------------

Query:  ----------------QYVDALAAVGKPVDTEDHILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGFKPPEA
                        Q VDALA++ KPV ++DHIL+IL+GLGSDY+SM+SVISA+    SV EVMSLLLTQE++NESKL +  +LPSVN+ T      A
Subjt:  ----------------QYVDALAAVGKPVDTEDHILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGFKPPEA

Query:  KPSKVVSNPYPQNFNGGNRDRGG-GRFGSNRGGRTWNNRGHIQCQLCSKFGHTAQRCYFRYAPSSAPSAPGSFSPAFNQYNRQLVGKPVDTEDHILFILS
        + S + +N    + N     RGG G   SNRG R   NR   QCQ+C+K G++A RC+FRY P S                                   
Subjt:  KPSKVVSNPYPQNFNGGNRDRGG-GRFGSNRGGRTWNNRGHIQCQLCSKFGHTAQRCYFRYAPSSAPSAPGSFSPAFNQYNRQLVGKPVDTEDHILFILS

Query:  GLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGSKLPEAEPPKAPSSATSAPGSFSPAFNQYNRQPAFPQMSALLTAPD
                                                                          +S+  +P S + ++   N     PQMSA++ A D
Subjt:  GLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGSKLPEAEPPKAPSSATSAPGSFSPAFNQYNRQPAFPQMSALLTAPD

Query:  INHDTNWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNCDFSSFSSPI--NRIFHLNNLLHVPSITRNLISVSQFTKDNGVFFEFHPTLC
        +N D+NWYPDSGATNHLTHS  NLS+G+EYGGGNQ++  NG+GLPI +    SF+S     + F LNNLL VPSIT+NLISVSQF KDN VFFEFHPTLC
Subjt:  INHDTNWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNCDFSSFSSPI--NRIFHLNNLLHVPSITRNLISVSQFTKDNGVFFEFHPTLC

Query:  YVKDQASGRVLLQGTLHEGLYRFNLFINPSLPVSEKAAVKALCSTWSSSPTPYVLSVSTMSNLDIWHRRLGHPHIATVKHVL
        YVKD  +G+VLLQG L++GLY+F +  +        +  K + +T        V+  S    LD+WHRRLGHPH+  VK VL
Subjt:  YVKDQASGRVLLQGTLHEGLYRFNLFINPSLPVSEKAAVKALCSTWSSSPTPYVLSVSTMSNLDIWHRRLGHPHIATVKHVL

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-943.5e-9039.18Show/hide
Query:  YDLESFL--KIDPPEQFILTLGASSLAEDTTPTITPNPEYSRWKRQDHIISSWLVGSMSEDILHQ-----------------------------------
        YDLE+FL  + +PP +++++  +SS     + T TPNP Y  WKRQD +ISSWL+GSMSE+IL+Q                                   
Subjt:  YDLESFL--KIDPPEQFILTLGASSLAEDTTPTITPNPEYSRWKRQDHIISSWLVGSMSEDILHQ-----------------------------------

Query:  ----------------QYVDALAAVGKPVDTEDHILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGFKPPEA
                        Q VDALA++ KPV ++DHIL+IL+GLGSDY+SM+SVISA+    SV EVMSLLLTQE++NESKL +  +LPSVN+ T      A
Subjt:  ----------------QYVDALAAVGKPVDTEDHILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGFKPPEA

Query:  KPSKVVSNPYPQNFNGGNRDRGG-GRFGSNRGGRTWNNRGHIQCQLCSKFGHTAQRCYFRYAPSSAPSAPGSFSPAFNQYNRQLVGKPVDTEDHILFILS
        + S + +N    + N     RGG G   SNRG R   NR   QCQ+C+K G++A RC+FRY P S                                   
Subjt:  KPSKVVSNPYPQNFNGGNRDRGG-GRFGSNRGGRTWNNRGHIQCQLCSKFGHTAQRCYFRYAPSSAPSAPGSFSPAFNQYNRQLVGKPVDTEDHILFILS

Query:  GLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGSKLPEAEPPKAPSSATSAPGSFSPAFNQYNRQPAFPQMSALLTAPD
                                                                          +S+  +P S + ++   N     PQMSA++ A D
Subjt:  GLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGSKLPEAEPPKAPSSATSAPGSFSPAFNQYNRQPAFPQMSALLTAPD

Query:  INHDTNWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNCDFSSFSSPI--NRIFHLNNLLHVPSITRNLISVSQFTKDNGVFFEFHPTLC
        +N D+NWYPDSGATNHLTHS  NLS+G+EYGGGNQ++  NG+GLPI +    SF+S     + F LNNLL VPSIT+NLISVSQF KDN VFFEFHPTLC
Subjt:  INHDTNWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNCDFSSFSSPI--NRIFHLNNLLHVPSITRNLISVSQFTKDNGVFFEFHPTLC

Query:  YVKDQASGRVLLQGTLHEGLYRFNLFINPSLPVSEKAAVKALCSTWSSSPTPYVLSVSTMSNLDIWHRRLGHPHIATVKHVL
        YVKD  +G+VLLQG L++GLY+F +  +        +  K + +T        V+  S    LD+WHRRLGHPH+  VK VL
Subjt:  YVKDQASGRVLLQGTLHEGLYRFNLFINPSLPVSEKAAVKALCSTWSSSPTPYVLSVSTMSNLDIWHRRLGHPHIATVKHVL

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.5e-2423.63Show/hide
Query:  GYDLESFLKIDPPEQFILTLGASSLAEDTTPTITPNPEYSRWKRQDHIISSWLVGSMSEDI----------------LHQQYV-----------------
        GY+L  FL          T+  +++  D  P +  NP+Y+RWKRQD +I S ++G++S  +                L + Y                  
Subjt:  GYDLESFLKIDPPEQFILTLGASSLAEDTTPTITPNPEYSRWKRQDHIISSWLVGSMSEDI----------------LHQQYV-----------------

Query:  ------------------DALAAVGKPVDTEDHILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGFKPPEAK
                          D LA +GKP+D ++ +  +L  L  +Y+ ++  I+AK  P ++ E+   LL     +ESK+    S   + +T       A 
Subjt:  ------------------DALAAVGKPVDTEDHILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGFKPPEAK

Query:  PSKVVSNPYPQNFNGGNRDRGGGRFGSNRGGRTW----------NNRGHI---QCQLCSKFGHTAQRCYFRYAPSSAPSAPGSFSPAFNQYNRQLVGKPV
             +     N N GNR+       +N   + W          NN+      +CQ+C   GH+A+RC                                
Subjt:  PSKVVSNPYPQNFNGGNRDRGGGRFGSNRGGRTW----------NNRGHI---QCQLCSKFGHTAQRCYFRYAPSSAPSAPGSFSPAFNQYNRQLVGKPV

Query:  DTEDHILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGSKLPEAEPPKAPSSATSAPGSFSPAFNQYNRQPAF
                      S  +  +S ++++  P+                     TP   P  NL  GS          P S+                    
Subjt:  DTEDHILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGSKLPEAEPPKAPSSATSAPGSFSPAFNQYNRQPAF

Query:  PQMSALLTAPDINHDTNWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNCDFSSFSSPINRIFHLNNLLHVPSITRNLISVSQFTKDNGV
                        NW  DSGAT+H+T  F NLS+   Y GG+ V V +G+ +PI +   +S S+  +R  +L+N+L+VP+I +NLISV +    NGV
Subjt:  PQMSALLTAPDINHDTNWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNCDFSSFSSPINRIFHLNNLLHVPSITRNLISVSQFTKDNGV

Query:  FFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLFINPSLPVSEKAAVKALCSTWSSSPTPYVLSVSTMSNLDIWHRRLGHPHIATVKHVLKMFRPSMSI
          EF P    VKD  +G  LLQG   + LY +        P++            SS P     S S+ +    WH RLGHP  + +  V+  +  S+S+
Subjt:  FFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLFINPSLPVSEKAAVKALCSTWSSSPTPYVLSVSTMSNLDIWHRRLGHPHIATVKHVLKMFRPSMSI

Query:  I
        +
Subjt:  I

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.7e-2624.66Show/hide
Query:  GYDLESFLKIDPPEQFILTLGASSLAEDTTPTITPNPEYSRWKRQDHIISSWLVGSMSEDI----------------LHQQYV----------------D
        GY+L  FL    P      +  +++  D  P +  NP+Y+RW+RQD +I S ++G++S  +                L + Y                 D
Subjt:  GYDLESFLKIDPPEQFILTLGASSLAEDTTPTITPNPEYSRWKRQDHIISSWLVGSMSEDI----------------LHQQYV----------------D

Query:  ALAAVGKPVDTEDHILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGFKPPEAKPSKVVSNPYPQNFN-GGNR
         LA +GKP+D ++ +  +L  L  DY+ ++  I+AK  P S+ E+   L+ +    ESKL    S   V +T          +    N    N N   N 
Subjt:  ALAAVGKPVDTEDHILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGFKPPEAKPSKVVSNPYPQNFN-GGNR

Query:  DRGGGRFGSNRGGRTWNNRGHI---QCQLCSKFGHTAQRCYFRYAPSSAPSAPGSFSPAFNQYNRQLVGKPVDTEDHILFILSGLGSDYESMVSVISAKV
        +R      S+ G R+ N +      +CQ+CS  GH+A+RC                 P  +Q+                                     
Subjt:  DRGGGRFGSNRGGRTWNNRGHI---QCQLCSKFGHTAQRCYFRYAPSSAPSAPGSFSPAFNQYNRQLVGKPVDTEDHILFILSGLGSDYESMVSVISAKV

Query:  GPTSVHEVMSLLLTQENRNESKLSTPES--LPSVNLTTGSKLPEAEPPKAPSSATSAPGSFSPAFNQYNRQPAFPQMSALLTAPDINHDTNWYPDSGATN
                      Q   N+ + ++P +   P  NL   S                           YN                     NW  DSGAT+
Subjt:  GPTSVHEVMSLLLTQENRNESKLSTPES--LPSVNLTTGSKLPEAEPPKAPSSATSAPGSFSPAFNQYNRQPAFPQMSALLTAPDINHDTNWYPDSGATN

Query:  HLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNCDFSSFSSPINRIFHLNNLLHVPSITRNLISVSQFTKDNGVFFEFHPTLCYVKDQASGRVLLQGTLH
        H+T  F NLS    Y GG+ V + +G+ +PI +   +S  +  +R   LN +L+VP+I +NLISV +    N V  EF P    VKD  +G  LLQG   
Subjt:  HLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNCDFSSFSSPINRIFHLNNLLHVPSITRNLISVSQFTKDNGVFFEFHPTLCYVKDQASGRVLLQGTLH

Query:  EGLYRFNLFINPSLPVSEKAAVKALCSTWSSSPTPYVLSVSTMSNLDIWHRRLGHPHIATV-----KHVLKMFRPSMSII
        + LY +        P++   AV    S  S          +T S+   WH RLGHP +A +      H L +  PS  ++
Subjt:  EGLYRFNLFINPSLPVSEKAAVKALCSTWSSSPTPYVLSVSTMSNLDIWHRRLGHPHIATV-----KHVLKMFRPSMSII

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGTCTCAGGGTTCTTCTGGCACCACGATTGATGATGCTTCGCAAGCATCTTCTCAGACATTCAGTCTGGGATATGATTTAGAATCCTTCCTTAAAATCGATCCACC
GGAACAATTTATCCTCACTCTTGGAGCATCTTCTTTAGCCGAAGACACTACTCCAACTATAACTCCGAACCCTGAGTATAGTCGATGGAAACGACAGGATCACATTATAT
CTTCCTGGCTTGTTGGTTCGATGTCCGAAGACATCTTGCATCAGCAATATGTGGATGCTTTGGCTGCTGTGGGTAAACCTGTTGATACTGAGGATCATATTTTGTTTATC
CTGTCTGGTCTCGGTTCTGATTATGAGTCTATGGTCTCTGTTATTTCTGCTAAAGTTGGTCCTACTTCTGTTCACGAAGTTATGTCTTTACTTCTGACTCAAGAAAATCG
CAATGAATCCAAATTGTCTACACCTGAATCTCTTCCATCTGTGAATCTTACTACGGGCTTTAAACCTCCTGAGGCGAAACCATCGAAGGTTGTATCCAACCCCTATCCAC
AAAACTTCAATGGTGGTAATAGAGATCGTGGGGGTGGTCGTTTTGGCTCAAATCGGGGTGGAAGAACCTGGAACAATCGTGGTCATATTCAGTGTCAGCTTTGCAGTAAG
TTTGGGCATACTGCTCAGAGATGCTACTTTCGGTATGCTCCCTCCAGTGCTCCATCAGCTCCAGGTTCTTTCTCTCCTGCTTTCAATCAGTATAATAGACAACTTGTGGG
TAAACCTGTTGATACTGAGGATCATATTTTGTTTATCCTGTCTGGTCTTGGTTCTGATTATGAGTCTATGGTCTCTGTTATTTCTGCTAAAGTTGGTCCTACTTCTGTTC
ACGAAGTTATGTCTTTACTTCTGACTCAAGAAAATCGCAATGAATCCAAATTGTCTACACCTGAATCTCTTCCATCTGTGAATCTTACTACGGGCTCTAAACTTCCTGAG
GCAGAACCACCGAAGGCTCCCTCCAGTGCTACATCAGCTCCAGGTTCTTTCTCTCCTGCTTTCAATCAGTATAATAGACAACCTGCCTTTCCTCAAATGTCTGCCTTACT
TACTGCTCCGGACATCAATCATGATACCAACTGGTATCCAGACTCGGGTGCCACTAATCATTTAACTCACAGCTTTGGCAACCTCTCGGTAGGTACCGAATATGGAGGTG
GTAATCAAGTTCATGTAGGCAATGGAGCAGGTTTGCCGATACTTAACTGTGACTTCTCTTCCTTTTCTTCTCCTATTAATCGTATTTTTCATTTGAATAATCTTCTTCAT
GTTCCTTCCATCACAAGGAATTTAATTAGTGTAAGTCAATTTACCAAAGATAATGGTGTCTTCTTTGAATTTCACCCTACTTTGTGTTATGTGAAGGATCAAGCATCTGG
TCGGGTTCTGCTCCAAGGGACTCTCCATGAAGGTCTTTATCGTTTCAATCTATTCATCAATCCTTCCTTGCCTGTTTCTGAGAAAGCTGCTGTGAAAGCTCTATGTTCTA
CTTGGAGTTCTTCTCCCACTCCTTATGTTTTATCTGTTTCTACTATGTCTAATCTGGATATATGGCATAGGCGTTTAGGCCACCCTCATATTGCCACTGTTAAACATGTT
CTTAAAATGTTTAGGCCAAGCATGTCCATAATATGA
mRNA sequenceShow/hide mRNA sequence
ATGATGTCTCAGGGTTCTTCTGGCACCACGATTGATGATGCTTCGCAAGCATCTTCTCAGACATTCAGTCTGGGATATGATTTAGAATCCTTCCTTAAAATCGATCCACC
GGAACAATTTATCCTCACTCTTGGAGCATCTTCTTTAGCCGAAGACACTACTCCAACTATAACTCCGAACCCTGAGTATAGTCGATGGAAACGACAGGATCACATTATAT
CTTCCTGGCTTGTTGGTTCGATGTCCGAAGACATCTTGCATCAGCAATATGTGGATGCTTTGGCTGCTGTGGGTAAACCTGTTGATACTGAGGATCATATTTTGTTTATC
CTGTCTGGTCTCGGTTCTGATTATGAGTCTATGGTCTCTGTTATTTCTGCTAAAGTTGGTCCTACTTCTGTTCACGAAGTTATGTCTTTACTTCTGACTCAAGAAAATCG
CAATGAATCCAAATTGTCTACACCTGAATCTCTTCCATCTGTGAATCTTACTACGGGCTTTAAACCTCCTGAGGCGAAACCATCGAAGGTTGTATCCAACCCCTATCCAC
AAAACTTCAATGGTGGTAATAGAGATCGTGGGGGTGGTCGTTTTGGCTCAAATCGGGGTGGAAGAACCTGGAACAATCGTGGTCATATTCAGTGTCAGCTTTGCAGTAAG
TTTGGGCATACTGCTCAGAGATGCTACTTTCGGTATGCTCCCTCCAGTGCTCCATCAGCTCCAGGTTCTTTCTCTCCTGCTTTCAATCAGTATAATAGACAACTTGTGGG
TAAACCTGTTGATACTGAGGATCATATTTTGTTTATCCTGTCTGGTCTTGGTTCTGATTATGAGTCTATGGTCTCTGTTATTTCTGCTAAAGTTGGTCCTACTTCTGTTC
ACGAAGTTATGTCTTTACTTCTGACTCAAGAAAATCGCAATGAATCCAAATTGTCTACACCTGAATCTCTTCCATCTGTGAATCTTACTACGGGCTCTAAACTTCCTGAG
GCAGAACCACCGAAGGCTCCCTCCAGTGCTACATCAGCTCCAGGTTCTTTCTCTCCTGCTTTCAATCAGTATAATAGACAACCTGCCTTTCCTCAAATGTCTGCCTTACT
TACTGCTCCGGACATCAATCATGATACCAACTGGTATCCAGACTCGGGTGCCACTAATCATTTAACTCACAGCTTTGGCAACCTCTCGGTAGGTACCGAATATGGAGGTG
GTAATCAAGTTCATGTAGGCAATGGAGCAGGTTTGCCGATACTTAACTGTGACTTCTCTTCCTTTTCTTCTCCTATTAATCGTATTTTTCATTTGAATAATCTTCTTCAT
GTTCCTTCCATCACAAGGAATTTAATTAGTGTAAGTCAATTTACCAAAGATAATGGTGTCTTCTTTGAATTTCACCCTACTTTGTGTTATGTGAAGGATCAAGCATCTGG
TCGGGTTCTGCTCCAAGGGACTCTCCATGAAGGTCTTTATCGTTTCAATCTATTCATCAATCCTTCCTTGCCTGTTTCTGAGAAAGCTGCTGTGAAAGCTCTATGTTCTA
CTTGGAGTTCTTCTCCCACTCCTTATGTTTTATCTGTTTCTACTATGTCTAATCTGGATATATGGCATAGGCGTTTAGGCCACCCTCATATTGCCACTGTTAAACATGTT
CTTAAAATGTTTAGGCCAAGCATGTCCATAATATGA
Protein sequenceShow/hide protein sequence
MMSQGSSGTTIDDASQASSQTFSLGYDLESFLKIDPPEQFILTLGASSLAEDTTPTITPNPEYSRWKRQDHIISSWLVGSMSEDILHQQYVDALAAVGKPVDTEDHILFI
LSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGFKPPEAKPSKVVSNPYPQNFNGGNRDRGGGRFGSNRGGRTWNNRGHIQCQLCSK
FGHTAQRCYFRYAPSSAPSAPGSFSPAFNQYNRQLVGKPVDTEDHILFILSGLGSDYESMVSVISAKVGPTSVHEVMSLLLTQENRNESKLSTPESLPSVNLTTGSKLPE
AEPPKAPSSATSAPGSFSPAFNQYNRQPAFPQMSALLTAPDINHDTNWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNCDFSSFSSPINRIFHLNNLLH
VPSITRNLISVSQFTKDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLFINPSLPVSEKAAVKALCSTWSSSPTPYVLSVSTMSNLDIWHRRLGHPHIATVKHV
LKMFRPSMSII