; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg24599 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg24599
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCarg_Chr19:2560354..2567556
RNA-Seq ExpressionCarg24599
SyntenyCarg24599
Gene Ontology termsNA
InterPro domainsIPR025724 - GAG-pre-integrase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN66898.1 hypothetical protein VITISV_037436 [Vitis vinifera]2.2e-11337.66Show/hide
Query:  GIEKFDGSDFDFWKMQIEDYLYKKDLHEPLWGVKLDTMTTEQWKLKDRQALWLIRLTLSRNAAFNIIKEKTTSDLLKALSNMYEKLSAMNKVYLMRRLFN
        GIEKFDG+DF +W+MQIEDYLY + LH PL G K ++M  E+W L DRQ L +IRLTLSR  A N++KEKTT+DL+KALS MYEK SA NKV+LM++LFN
Subjt:  GIEKFDGSDFDFWKMQIEDYLYKKDLHEPLWGVKLDTMTTEQWKLKDRQALWLIRLTLSRNAAFNIIKEKTTSDLLKALSNMYEKLSAMNKVYLMRRLFN

Query:  LQMSEGGSIADYINEFNMIVSRLSLVEINFKDEIKALILMSSLPESWDTVVAAINSSRGSDKLKFDEIRDLVLGESIRTRKTGDS--SGKALSTDCT---
        L+M+E  SIA ++NEFN I ++LS VEI+F DEI+ALI+++SLP SW+ +  A+++S G +KLK+++IRDL+L E IR R  G++  SG AL+ +     
Subjt:  LQMSEGGSIADYINEFNMIVSRLSLVEINFKDEIKALILMSSLPESWDTVVAAINSSRGSDKLKFDEIRDLVLGESIRTRKTGDS--SGKALSTDCT---

Query:  --KLKKKQNHKSEDDDDSIYTTEDTEDALILSVDSPAESGFW----------------IQQGYAAEFGK-------------------------------
          K + K   K  +D  +   TE+ +DAL+L+VDSP +   W                IQ   A +FGK                               
Subjt:  --KLKKKQNHKSEDDDDSIYTTEDTEDALILSVDSPAESGFW----------------IQQGYAAEFGK-------------------------------

Query:  ----------------------------SSWKIVKGAMVVARGTKSGTLYTTAECINMTATDGSTSNSSLWHNRLGHLSVKGMKMLIAKGALEGLKSVDI
                                     +WK+ KGA V+ARG K+GTLY T+   +  A   ++++++LWH RLGH+S KGMKML++KG L  LKS+D 
Subjt:  ----------------------------SSWKIVKGAMVVARGTKSGTLYTTAECINMTATDGSTSNSSLWHNRLGHLSVKGMKMLIAKGALEGLKSVDI

Query:  ---------------------------------DVWGPSPVSSLGGSRLYVTFINDFSRK----------------------------------------
                                         D+WGPSPV+SLGGSR Y+TFI+D SRK                                        
Subjt:  ---------------------------------DVWGPSPVSSLGGSRLYVTFINDFSRK----------------------------------------

Query:  --------------------------------------------ANAVNT-TYLINRGSSVPLKFKLPEEVWTGKELKYSHLRIFGCTAYVHVDLEKRDK
                                                     +AV+T TYLINRG SVP++F+LPEEVW+GKE+K+SHL++FGC +YVH+D + R K
Subjt:  --------------------------------------------ANAVNT-TYLINRGSSVPLKFKLPEEVWTGKELKYSHLRIFGCTAYVHVDLEKRDK

Query:  LDVKAVKCYFIGYGSNIFGYRFWDDKNRKILRHYDMTFDENVLYKNKEKINSETTKQVRVELEW---QENSPSDVTVEAQETPNPVAEELDVEQVTPEEK
        LD K+  C+FIGYG   FGYRFWD++NRKI+R  ++ F+E V+YK++  + S+ T+  + + E+    E + S V    +E    V  ++D+   TP  +
Subjt:  LDVKAVKCYFIGYGSNIFGYRFWDDKNRKILRHYDMTFDENVLYKNKEKINSETTKQVRVELEW---QENSPSDVTVEAQETPNPVAEELDVEQVTPEEK

Query:  IIQNYQSTR
        + ++ ++ R
Subjt:  IIQNYQSTR

CAN73240.1 hypothetical protein VITISV_035336 [Vitis vinifera]4.8e-11637.87Show/hide
Query:  GIEKFDGSDFDFWKMQIEDYLYKKDLHEPLWGVKLDTMTTEQWKLKDRQALWLIRLTLSRNAAFNIIKEKTTSDLLKALSNMYEKLSAMNKVYLMRRLFN
        GIEKFDG+DF +W+MQIEDYLY + LH PL G K ++M  E+W L DRQ L +IRLTLSR+ A N++KEKTT+DL+KALS MYEK SA NKV+LM++LFN
Subjt:  GIEKFDGSDFDFWKMQIEDYLYKKDLHEPLWGVKLDTMTTEQWKLKDRQALWLIRLTLSRNAAFNIIKEKTTSDLLKALSNMYEKLSAMNKVYLMRRLFN

Query:  LQMSEGGSIADYINEFNMIVSRLSLVEINFKDEIKALILMSSLPESWDTVVAAINSSRGSDKLKFDEIRDLVLGESIRTRKTGDSSGKALS---------
        L+M+E  S+  ++NEFN I ++LS VEI+F DEI+ALI+++SLP SW+ +  A+++S G +KLK+++IRDL+L E IR R  G++SG   +         
Subjt:  LQMSEGGSIADYINEFNMIVSRLSLVEINFKDEIKALILMSSLPESWDTVVAAINSSRGSDKLKFDEIRDLVLGESIRTRKTGDSSGKALS---------

Query:  -----------------------------------TDCTKLKKKQNHKSEDDDDSIYTTEDTEDALILSVDSPAESGFW----------------IQQGY
                                           T   K + K   K  +DD +   TE+ +DAL+L+VDSP +   W                IQ   
Subjt:  -----------------------------------TDCTKLKKKQNHKSEDDDDSIYTTEDTEDALILSVDSPAESGFW----------------IQQGY

Query:  AAEFGK-----------------------------------------------------------SSWKIVKGAMVVARGTKSGTLYTTAECINMTATDG
        A +FGK                                                            +WK+ KGA V+ARG K+GTLY T+   +  A   
Subjt:  AAEFGK-----------------------------------------------------------SSWKIVKGAMVVARGTKSGTLYTTAECINMTATDG

Query:  STSNSSLWHNRLGHLSVKGMKMLIAKGALEGLKSVDI---------------------------------DVWGPSPVSSLGGSRLYVTFINDFSRK---
        +++++SLWH RLGH+S KGMKML++KG L  LKS+D                                  D+WGPSPV+SLGGSR Y+TFI+D SRK   
Subjt:  STSNSSLWHNRLGHLSVKGMKMLIAKGALEGLKSVDI---------------------------------DVWGPSPVSSLGGSRLYVTFINDFSRK---

Query:  ----------------------------------------ANAVNT-TYLINRGSSVPLKFKLPEEVWTGKELKYSHLRIFGCTAYVHVDLEKRDKLDVK
                                                A+AV+T  YLINRG SVP++F+LPEEVW+GKE+K+SHL++FGC +YVH+D + R KLD K
Subjt:  ----------------------------------------ANAVNT-TYLINRGSSVPLKFKLPEEVWTGKELKYSHLRIFGCTAYVHVDLEKRDKLDVK

Query:  AVKCYFIGYGSNIFGYRFWDDKNRKILRHYDMTFDENVLYKNKEKINSETTKQVRVELEW---QENSPSDVTVEAQETPNPVAEELDVEQVTPEEKIIQN
        +  C+FIGYG   FGYRFWD++NRKI+R  ++ F+E V+YK++  + S+ T+  + + E+    E + S V    +E    V  ++D+   TP  ++ ++
Subjt:  AVKCYFIGYGSNIFGYRFWDDKNRKILRHYDMTFDENVLYKNKEKINSETTKQVRVELEW---QENSPSDVTVEAQETPNPVAEELDVEQVTPEEKIIQN

Query:  YQSTR
         ++TR
Subjt:  YQSTR

CAN76274.1 hypothetical protein VITISV_008497 [Vitis vinifera]1.8e-11840.5Show/hide
Query:  GIEKFDGSDFDFWKMQIEDYLYKKDLHEPLWGVKLDTMTTEQWKLKDRQALWLIRLTLSRNAAFNIIKEKTTSDLLKALSNMYEKLSAMNKVYLMRRLFN
        GIEKFDG+DF +W+MQIEDYLY + LH PL G K ++M  E+W L DRQ L +IRLTLSR+ A N++KEKTT+DL+KALS MYEK SA NKV+LM++LFN
Subjt:  GIEKFDGSDFDFWKMQIEDYLYKKDLHEPLWGVKLDTMTTEQWKLKDRQALWLIRLTLSRNAAFNIIKEKTTSDLLKALSNMYEKLSAMNKVYLMRRLFN

Query:  LQMSEGGSIADYINEFNMIVSRLSLVEINFKDEIKALILMSSLPESWDTVVAAINSSRGSDKLKFDEIRDLVLGESIRTRKTGDS--SGKALSTDCT---
        L+M+E   +A ++NEFN I ++LS VEI+F DEI+ALI+++SLP SW+ +  A+++S G +K K+++IRDL+L E IR R  G++  SG AL+ +     
Subjt:  LQMSEGGSIADYINEFNMIVSRLSLVEINFKDEIKALILMSSLPESWDTVVAAINSSRGSDKLKFDEIRDLVLGESIRTRKTGDS--SGKALSTDCT---

Query:  --KLKKKQNHKSEDDDDSIYTTEDTEDALILSVDSPAE------------------------SGF----------------------------WI-----
          K + K   K  +DD +   TE+ +DAL+L+VDSP +                         GF                            W+     
Subjt:  --KLKKKQNHKSEDDDDSIYTTEDTEDALILSVDSPAE------------------------SGF----------------------------WI-----

Query:  ----------------QQGYAAEFGKSSWKIVKGAMVVARGTKSGTLYTTAECINMTATDGSTSNSSLWHNRLGHLSVKGMKMLIAKGALEGLKSVDI--
                         +G+A  F   +WK+ KGA V+ARG K+GTLY T+   N  A    ++++SLWH RLGH+S KGMKML++KG L  LKS+D   
Subjt:  ----------------QQGYAAEFGKSSWKIVKGAMVVARGTKSGTLYTTAECINMTATDGSTSNSSLWHNRLGHLSVKGMKMLIAKGALEGLKSVDI--

Query:  -------------------------------DVWGPSPVSSLGGSRLYVTFINDFSRK------------------------------------------
                                       D+WGPSPV+SLGGSR Y+TFI+D SRK                                          
Subjt:  -------------------------------DVWGPSPVSSLGGSRLYVTFINDFSRK------------------------------------------

Query:  -ANAVNT-TYLINRGSSVPLKFKLPEEVWTGKELKYSHLRIFGCTAYVHVDLEKRDKLDVKAVKCYFIGYGSNIFGYRFWDDKNRKILRHYDMTFDENVL
         A+AV+T  YLINRG SVP++F+LPEEVW+GKE+K+SHL++F C +YVH+D + R KLD K+  C+FIGYG   FGYRFWD++NRKI+R  ++ F+E V+
Subjt:  -ANAVNT-TYLINRGSSVPLKFKLPEEVWTGKELKYSHLRIFGCTAYVHVDLEKRDKLDVKAVKCYFIGYGSNIFGYRFWDDKNRKILRHYDMTFDENVL

Query:  YKNKEKINSETTKQVRVELEWQENSPSDVTVEAQETPNPVAE
        YK++  + S+ T     E++ ++    ++    +    PVAE
Subjt:  YKNKEKINSETTKQVRVELEWQENSPSDVTVEAQETPNPVAE

KAG7011443.1 hypothetical protein SDJN02_26349, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+00100Show/hide
Query:  MNKSFRIPNGAHFPRKDRQIPLPMFISVLQPTITFFSAFELGSTIAQLDSLFDLLKDIWILFHYDEYGCVYYETDYDHHVHHDDTMVAILSGRSAPVRQR
        MNKSFRIPNGAHFPRKDRQIPLPMFISVLQPTITFFSAFELGSTIAQLDSLFDLLKDIWILFHYDEYGCVYYETDYDHHVHHDDTMVAILSGRSAPVRQR
Subjt:  MNKSFRIPNGAHFPRKDRQIPLPMFISVLQPTITFFSAFELGSTIAQLDSLFDLLKDIWILFHYDEYGCVYYETDYDHHVHHDDTMVAILSGRSAPVRQR

Query:  VPQPIGTWDVRACLVGPHLHIWIVCREGKPEREIPKRTISTSSGSRPLQMVTEPDTGRCASLLAVPRRRVDTRRCASKDAGPQRGVDLGAVPHRLEEGKS
        VPQPIGTWDVRACLVGPHLHIWIVCREGKPEREIPKRTISTSSGSRPLQMVTEPDTGRCASLLAVPRRRVDTRRCASKDAGPQRGVDLGAVPHRLEEGKS
Subjt:  VPQPIGTWDVRACLVGPHLHIWIVCREGKPEREIPKRTISTSSGSRPLQMVTEPDTGRCASLLAVPRRRVDTRRCASKDAGPQRGVDLGAVPHRLEEGKS

Query:  ATVDAGPLRGGKPEKESPKRTISAGGGSGLLRDDTCPLALVECVRCCAILTTGIRARWKSPRSMKMESSKIGIEKFDGSDFDFWKMQIEDYLYKKDLHEP
        ATVDAGPLRGGKPEKESPKRTISAGGGSGLLRDDTCPLALVECVRCCAILTTGIRARWKSPRSMKMESSKIGIEKFDGSDFDFWKMQIEDYLYKKDLHEP
Subjt:  ATVDAGPLRGGKPEKESPKRTISAGGGSGLLRDDTCPLALVECVRCCAILTTGIRARWKSPRSMKMESSKIGIEKFDGSDFDFWKMQIEDYLYKKDLHEP

Query:  LWGVKLDTMTTEQWKLKDRQALWLIRLTLSRNAAFNIIKEKTTSDLLKALSNMYEKLSAMNKVYLMRRLFNLQMSEGGSIADYINEFNMIVSRLSLVEIN
        LWGVKLDTMTTEQWKLKDRQALWLIRLTLSRNAAFNIIKEKTTSDLLKALSNMYEKLSAMNKVYLMRRLFNLQMSEGGSIADYINEFNMIVSRLSLVEIN
Subjt:  LWGVKLDTMTTEQWKLKDRQALWLIRLTLSRNAAFNIIKEKTTSDLLKALSNMYEKLSAMNKVYLMRRLFNLQMSEGGSIADYINEFNMIVSRLSLVEIN

Query:  FKDEIKALILMSSLPESWDTVVAAINSSRGSDKLKFDEIRDLVLGESIRTRKTGDSSGKALSTDCTKLKKKQNHKSEDDDDSIYTTEDTEDALILSVDSP
        FKDEIKALILMSSLPESWDTVVAAINSSRGSDKLKFDEIRDLVLGESIRTRKTGDSSGKALSTDCTKLKKKQNHKSEDDDDSIYTTEDTEDALILSVDSP
Subjt:  FKDEIKALILMSSLPESWDTVVAAINSSRGSDKLKFDEIRDLVLGESIRTRKTGDSSGKALSTDCTKLKKKQNHKSEDDDDSIYTTEDTEDALILSVDSP

Query:  AESGFWIQQGYAAEFGKSSWKIVKGAMVVARGTKSGTLYTTAECINMTATDGSTSNSSLWHNRLGHLSVKGMKMLIAKGALEGLKSVDIDVWGPSPVSSL
        AESGFWIQQGYAAEFGKSSWKIVKGAMVVARGTKSGTLYTTAECINMTATDGSTSNSSLWHNRLGHLSVKGMKMLIAKGALEGLKSVDIDVWGPSPVSSL
Subjt:  AESGFWIQQGYAAEFGKSSWKIVKGAMVVARGTKSGTLYTTAECINMTATDGSTSNSSLWHNRLGHLSVKGMKMLIAKGALEGLKSVDIDVWGPSPVSSL

Query:  GGSRLYVTFINDFSRKANAVNTTYLINRGSSVPLKFKLPEEVWTGKELKYSHLRIFGCTAYVHVDLEKRDKLDVKAVKCYFIGYGSNIFGYRFWDDKNRK
        GGSRLYVTFINDFSRKANAVNTTYLINRGSSVPLKFKLPEEVWTGKELKYSHLRIFGCTAYVHVDLEKRDKLDVKAVKCYFIGYGSNIFGYRFWDDKNRK
Subjt:  GGSRLYVTFINDFSRKANAVNTTYLINRGSSVPLKFKLPEEVWTGKELKYSHLRIFGCTAYVHVDLEKRDKLDVKAVKCYFIGYGSNIFGYRFWDDKNRK

Query:  ILRHYDMTFDENVLYKNKEKINSETTKQVRVELEWQENSPSDVTVEAQETPNPVAEELDVEQVTPEEKIIQNYQSTR
        ILRHYDMTFDENVLYKNKEKINSETTKQVRVELEWQENSPSDVTVEAQETPNPVAEELDVEQVTPEEKIIQNYQSTR
Subjt:  ILRHYDMTFDENVLYKNKEKINSETTKQVRVELEWQENSPSDVTVEAQETPNPVAEELDVEQVTPEEKIIQNYQSTR

RVW69626.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]2.0e-11437.98Show/hide
Query:  GIEKFDGSDFDFWKMQIEDYLYKKDLHEPLWGVKLDTMTTEQWKLKDRQALWLIRLTLSRNAAFNIIKEKTTSDLLKALSNMYEKLSAMNKVYLMRRLFN
        GIEKFDG+DF +W+MQIEDYLY + LH PL G K ++M  E+W L DRQ L +IRLTLSR+ A N++KEKTT+DL+KALS MYEK SA NKV+LM++LFN
Subjt:  GIEKFDGSDFDFWKMQIEDYLYKKDLHEPLWGVKLDTMTTEQWKLKDRQALWLIRLTLSRNAAFNIIKEKTTSDLLKALSNMYEKLSAMNKVYLMRRLFN

Query:  LQMSEGGSIADYINEFNMIVSRLSLVEINFKDEIKALILMSSLPESWDTVVAAINSSRGSDKLKFDEIRDLVLGESIRTRKTGDSSGKALS---------
        L+M+E  S+  ++NEFN I ++LS VEI+F DEI+ALI+++SLP SW+ +  A+++S G +KLK+++IRDL+L E IR R  G++SG   +         
Subjt:  LQMSEGGSIADYINEFNMIVSRLSLVEINFKDEIKALILMSSLPESWDTVVAAINSSRGSDKLKFDEIRDLVLGESIRTRKTGDSSGKALS---------

Query:  -----------------------------------TDCTKLKKKQNHKSEDDDDSIYTTEDTEDALILSVDSPAESGFW----------------IQQGY
                                           T   K + K   K  +DD +   TE+ +DAL+L+VDSP +   W                IQ   
Subjt:  -----------------------------------TDCTKLKKKQNHKSEDDDDSIYTTEDTEDALILSVDSPAESGFW----------------IQQGY

Query:  AAEFGK------------------------SSW---------------------------------KIVKGAMVVARGTKSGTLYTTAECINMTATDGST
        A +FGK                        S W                                 K+ KGA V+ARG K+GTLY T+   +  A   ++
Subjt:  AAEFGK------------------------SSW---------------------------------KIVKGAMVVARGTKSGTLYTTAECINMTATDGST

Query:  SNSSLWHNRLGHLSVKGMKMLIAKGALEGLKSVDI---------------------------------DVWGPSPVSSLGGSRLYVTFINDFSRK-----
        +++SLWH RLGH+S KGMKML++KG L  LKS+D                                  D+WGPSPV+SLGGSR Y+TFI+D SRK     
Subjt:  SNSSLWHNRLGHLSVKGMKMLIAKGALEGLKSVDI---------------------------------DVWGPSPVSSLGGSRLYVTFINDFSRK-----

Query:  --------------------------------------ANAVNT-TYLINRGSSVPLKFKLPEEVWTGKELKYSHLRIFGCTAYVHVDLEKRDKLDVKAV
                                              A+AV+T  YLINRG SVP++F+LPEEVW+GKE+K+SHL++FGC +YVH+D +   KLD K+ 
Subjt:  --------------------------------------ANAVNT-TYLINRGSSVPLKFKLPEEVWTGKELKYSHLRIFGCTAYVHVDLEKRDKLDVKAV

Query:  KCYFIGYGSNIFGYRFWDDKNRKILRHYDMTFDENVLYKNKEKINSETTKQVRVELEW---QENSPSDVTVEAQETPNPVAEELDVEQVTPEEKIIQNYQ
         C+FIGYG   FGYRFWD++NRKI+R  ++ F+E V+YK++  + S+ T+  + + E+    E + S V    +E    V  ++D+   TP  ++ ++ +
Subjt:  KCYFIGYGSNIFGYRFWDDKNRKILRHYDMTFDENVLYKNKEKINSETTKQVRVELEW---QENSPSDVTVEAQETPNPVAEELDVEQVTPEEKIIQNYQ

Query:  STR
        +TR
Subjt:  STR

TrEMBL top hitse value%identityAlignment
A0A2N9FNY4 Uncharacterized protein4.4e-11537.72Show/hide
Query:  MKMESSKI-GIEKFDGSDFDFWKMQIEDYLYKKDLHEPLWGVKLDTMTTEQWKLKDRQALWLIRLTLSRNAAFNIIKEKTTSDLLKALSNMYEKLSAMNK
        M  E  K+ GIEKFDG+DF +W+MQIEDYLY K LH PL G K + M   +W L DRQ L +IRLTLSR  A N++KEKTT++L+ AL  MYEK SA NK
Subjt:  MKMESSKI-GIEKFDGSDFDFWKMQIEDYLYKKDLHEPLWGVKLDTMTTEQWKLKDRQALWLIRLTLSRNAAFNIIKEKTTSDLLKALSNMYEKLSAMNK

Query:  VYLMRRLFNLQMSEGGSIADYINEFNMIVSRLSLVEINFKDEIKALILMSSLPESWDTVVAAINSSRGSDKLKFDEIRDLVLGESIRTRKTGD--SSGKA
        V+LM++LFNL+M+EG ++A ++NEFN I ++LS VEI F DEI+ALI+++SLP SW+ +  A+++S G  KLK+++IRDL+LGE +R R  G+  SSG A
Subjt:  VYLMRRLFNLQMSEGGSIADYINEFNMIVSRLSLVEINFKDEIKALILMSSLPESWDTVVAAINSSRGSDKLKFDEIRDLVLGESIRTRKTGD--SSGKA

Query:  LSTDCTKLKKKQNH---------------------------------------KSEDDDDSIYTTEDTEDALILSVDSPAESGFW---------------
        L+ +     K +N+                                       K  ++D +   TE+  DAL+LSVDSP ES  W               
Subjt:  LSTDCTKLKKKQNH---------------------------------------KSEDDDDSIYTTEDTEDALILSVDSPAESGFW---------------

Query:  -IQQGYAAEFGK--------------------SSWKIVKGAMVVARGTKSGTLYTTAECINMTATDGSTSNSSLWHNRLGHLSVKGMKMLIAKGALEGLK
         IQ   A +FGK                     +WKI KGAMVVARG K+GTLY T    +  A   + ++++LWH RLGH+S KGMK+L++KG L  LK
Subjt:  -IQQGYAAEFGK--------------------SSWKIVKGAMVVARGTKSGTLYTTAECINMTATDGSTSNSSLWHNRLGHLSVKGMKMLIAKGALEGLK

Query:  SVDI---------------------------------DVWGPSPVSSLGGSRLYVTFINDFSRK------------------------------------
        SV+                                  D+WGPSP++SLGGSR YVTFI+D SRK                                    
Subjt:  SVDI---------------------------------DVWGPSPVSSLGGSRLYVTFINDFSRK------------------------------------

Query:  -------------------------------------------------------------ANAVNT-TYLINRGSSVPLKFKLPEEVWTGKELKYSHLR
                                                                     A+AVNT  YLINRG SVPL+F++PEEVW+GKE+  S+L+
Subjt:  -------------------------------------------------------------ANAVNT-TYLINRGSSVPLKFKLPEEVWTGKELKYSHLR

Query:  IFGCTAYVHVDLEKRDKLDVKAVKCYFIGYGSNIFGYRFWDDKNRKILRHYDMTFDENVLYKNKE--KINSETTKQVRVEL----EWQENSPSDVTVEAQ
        +FGC +YVH+D + R KLD K+ KC+FIGYG   FGYRFWDD+NRK++R  ++ F+E V+YK++   K++    +Q + E     E+  N+  +   E +
Subjt:  IFGCTAYVHVDLEKRDKLDVKAVKCYFIGYGSNIFGYRFWDDKNRKILRHYDMTFDENVLYKNKE--KINSETTKQVRVEL----EWQENSPSDVTVEAQ

Query:  ETPNPVAEELDVEQVTPEEKIIQNYQSTR
        E  NP      VEQ TP   + ++ ++ R
Subjt:  ETPNPVAEELDVEQVTPEEKIIQNYQSTR

A0A2N9I1L5 Uncharacterized protein4.4e-11537.72Show/hide
Query:  MKMESSKI-GIEKFDGSDFDFWKMQIEDYLYKKDLHEPLWGVKLDTMTTEQWKLKDRQALWLIRLTLSRNAAFNIIKEKTTSDLLKALSNMYEKLSAMNK
        M  E  K+ GIEKFDG+DF +W+MQIEDYLY K LH PL G K + M   +W L DRQ L +IRLTLSR  A N++KEKTT++L+ AL  MYEK SA NK
Subjt:  MKMESSKI-GIEKFDGSDFDFWKMQIEDYLYKKDLHEPLWGVKLDTMTTEQWKLKDRQALWLIRLTLSRNAAFNIIKEKTTSDLLKALSNMYEKLSAMNK

Query:  VYLMRRLFNLQMSEGGSIADYINEFNMIVSRLSLVEINFKDEIKALILMSSLPESWDTVVAAINSSRGSDKLKFDEIRDLVLGESIRTRKTGD--SSGKA
        V+LM++LFNL+M+EG ++A ++NEFN I ++LS VEI F DEI+ALI+++SLP SW+ +  A+++S G  KLK+++IRDL+LGE +R R  G+  SSG A
Subjt:  VYLMRRLFNLQMSEGGSIADYINEFNMIVSRLSLVEINFKDEIKALILMSSLPESWDTVVAAINSSRGSDKLKFDEIRDLVLGESIRTRKTGD--SSGKA

Query:  LSTDCTKLKKKQNH---------------------------------------KSEDDDDSIYTTEDTEDALILSVDSPAESGFW---------------
        L+ +     K +N+                                       K  ++D +   TE+  DAL+LSVDSP ES  W               
Subjt:  LSTDCTKLKKKQNH---------------------------------------KSEDDDDSIYTTEDTEDALILSVDSPAESGFW---------------

Query:  -IQQGYAAEFGK--------------------SSWKIVKGAMVVARGTKSGTLYTTAECINMTATDGSTSNSSLWHNRLGHLSVKGMKMLIAKGALEGLK
         IQ   A +FGK                     +WKI KGAMVVARG K+GTLY T    +  A   + ++++LWH RLGH+S KGMK+L++KG L  LK
Subjt:  -IQQGYAAEFGK--------------------SSWKIVKGAMVVARGTKSGTLYTTAECINMTATDGSTSNSSLWHNRLGHLSVKGMKMLIAKGALEGLK

Query:  SVDI---------------------------------DVWGPSPVSSLGGSRLYVTFINDFSRK------------------------------------
        SV+                                  D+WGPSP++SLGGSR YVTFI+D SRK                                    
Subjt:  SVDI---------------------------------DVWGPSPVSSLGGSRLYVTFINDFSRK------------------------------------

Query:  -------------------------------------------------------------ANAVNT-TYLINRGSSVPLKFKLPEEVWTGKELKYSHLR
                                                                     A+AVNT  YLINRG SVPL+F++PEEVW+GKE+  S+L+
Subjt:  -------------------------------------------------------------ANAVNT-TYLINRGSSVPLKFKLPEEVWTGKELKYSHLR

Query:  IFGCTAYVHVDLEKRDKLDVKAVKCYFIGYGSNIFGYRFWDDKNRKILRHYDMTFDENVLYKNKE--KINSETTKQVRVEL----EWQENSPSDVTVEAQ
        +FGC +YVH+D + R KLD K+ KC+FIGYG   FGYRFWDD+NRK++R  ++ F+E V+YK++   K++    +Q + E     E+  N+  +   E +
Subjt:  IFGCTAYVHVDLEKRDKLDVKAVKCYFIGYGSNIFGYRFWDDKNRKILRHYDMTFDENVLYKNKE--KINSETTKQVRVEL----EWQENSPSDVTVEAQ

Query:  ETPNPVAEELDVEQVTPEEKIIQNYQSTR
        E  NP      VEQ TP   + ++ ++ R
Subjt:  ETPNPVAEELDVEQVTPEEKIIQNYQSTR

A0A5B7BAK4 Uncharacterized protein1.4e-11838.6Show/hide
Query:  MKMESSKIGIEKFDGSDFDFWKMQIEDYLYKKDLHEPLWGVKLDTMTTEQWKLKDRQALWLIRLTLSRNAAFNIIKEKTTSDLLKALSNMYEKLSAMNKV
        M  E  K+ I+KFDG+DF FWKMQIEDYLY+K L++PL G K D M  E W L DRQAL ++RLTL+RN AFNI KEKTT+ L+ ALSNMYEK SA NKV
Subjt:  MKMESSKIGIEKFDGSDFDFWKMQIEDYLYKKDLHEPLWGVKLDTMTTEQWKLKDRQALWLIRLTLSRNAAFNIIKEKTTSDLLKALSNMYEKLSAMNKV

Query:  YLMRRLFNLQMSEGGSIADYINEFNMIVSRLSLVEINFKDEIKALILMSSLPESWDTVVAAINSSRGSDKLKFDEIRDLVLGESIRTRKTGDSSGKALST
        YLMRRLFNL+MSEG S+A+++NEFN++ ++LS VEI F DEI+ALIL+SSLPESW+  V A++SS G+ KLK+D++RDL+L E IR R++G+SSG AL+ 
Subjt:  YLMRRLFNLQMSEGGSIADYINEFNMIVSRLSLVEINFKDEIKALILMSSLPESWDTVVAAINSSRGSDKLKFDEIRDLVLGESIRTRKTGDSSGKALST

Query:  -------------------------------------------DCTKLKKKQNHK-----SEDDDDSIYTTEDTEDALILSVDSPAESGFWIQQGYAA--
                                                   DC   KK++  K     +++ + +   TE+ +DALILS+D   ES  W+    A+  
Subjt:  -------------------------------------------DCTKLKKKQNHK-----SEDDDDSIYTTEDTEDALILSVDSPAESGFWIQQGYAA--

Query:  --------------EFGK-----------------------------------------------------------SSWKIVKGAMVVARGTKSGTLYT
                      +FGK                                                            SWK+ KGAMV+ARG K GTLY 
Subjt:  --------------EFGK-----------------------------------------------------------SSWKIVKGAMVVARGTKSGTLYT

Query:  TAECINMTATDGSTSNSSLWHNRLGHLSVKGMKMLIAKGALEGLKSVDI---------------------------------DVWGPSPVSSLGGSRLYV
        T    +      + S+S+LWH RLGH+S KGMK+L +KG L+GLKSVDI                                 DVWGPSPVSSLGGS  YV
Subjt:  TAECINMTATDGSTSNSSLWHNRLGHLSVKGMKMLIAKGALEGLKSVDI---------------------------------DVWGPSPVSSLGGSRLYV

Query:  TFINDFSRK-------------------------------------------------------------------------------------------
        TFI+D +RK                                                                                           
Subjt:  TFINDFSRK-------------------------------------------------------------------------------------------

Query:  ------ANAVNT-TYLINRGSSVPLKFKLPEEVWTGKELKYSHLRIFGCTAYVHVDLEKRDKLDVKAVKCYFIGYGSNIFGYRFWDDKNRKILRHYDMTF
              A+AVNT  YLINRG SVPL   LPEE W+GKE+  SHL++FGC +YVH+D + R KLD K+ KC FIGYG++ FGYRFWDD+NRKI R  D+ F
Subjt:  ------ANAVNT-TYLINRGSSVPLKFKLPEEVWTGKELKYSHLRIFGCTAYVHVDLEKRDKLDVKAVKCYFIGYGSNIFGYRFWDDKNRKILRHYDMTF

Query:  DENVLYKNK---EKINSETTKQVRVELEWQENSPSDVTVEAQETPNPVAEELDVEQVTP
        +E VLYK++   E  N++T  +    +E +E S S+V    Q  P  +  +  VE VTP
Subjt:  DENVLYKNK---EKINSETTKQVRVELEWQENSPSDVTVEAQETPNPVAEELDVEQVTP

A5B0V5 Integrase catalytic domain-containing protein8.5e-11940.5Show/hide
Query:  GIEKFDGSDFDFWKMQIEDYLYKKDLHEPLWGVKLDTMTTEQWKLKDRQALWLIRLTLSRNAAFNIIKEKTTSDLLKALSNMYEKLSAMNKVYLMRRLFN
        GIEKFDG+DF +W+MQIEDYLY + LH PL G K ++M  E+W L DRQ L +IRLTLSR+ A N++KEKTT+DL+KALS MYEK SA NKV+LM++LFN
Subjt:  GIEKFDGSDFDFWKMQIEDYLYKKDLHEPLWGVKLDTMTTEQWKLKDRQALWLIRLTLSRNAAFNIIKEKTTSDLLKALSNMYEKLSAMNKVYLMRRLFN

Query:  LQMSEGGSIADYINEFNMIVSRLSLVEINFKDEIKALILMSSLPESWDTVVAAINSSRGSDKLKFDEIRDLVLGESIRTRKTGDS--SGKALSTDCT---
        L+M+E   +A ++NEFN I ++LS VEI+F DEI+ALI+++SLP SW+ +  A+++S G +K K+++IRDL+L E IR R  G++  SG AL+ +     
Subjt:  LQMSEGGSIADYINEFNMIVSRLSLVEINFKDEIKALILMSSLPESWDTVVAAINSSRGSDKLKFDEIRDLVLGESIRTRKTGDS--SGKALSTDCT---

Query:  --KLKKKQNHKSEDDDDSIYTTEDTEDALILSVDSPAE------------------------SGF----------------------------WI-----
          K + K   K  +DD +   TE+ +DAL+L+VDSP +                         GF                            W+     
Subjt:  --KLKKKQNHKSEDDDDSIYTTEDTEDALILSVDSPAE------------------------SGF----------------------------WI-----

Query:  ----------------QQGYAAEFGKSSWKIVKGAMVVARGTKSGTLYTTAECINMTATDGSTSNSSLWHNRLGHLSVKGMKMLIAKGALEGLKSVDI--
                         +G+A  F   +WK+ KGA V+ARG K+GTLY T+   N  A    ++++SLWH RLGH+S KGMKML++KG L  LKS+D   
Subjt:  ----------------QQGYAAEFGKSSWKIVKGAMVVARGTKSGTLYTTAECINMTATDGSTSNSSLWHNRLGHLSVKGMKMLIAKGALEGLKSVDI--

Query:  -------------------------------DVWGPSPVSSLGGSRLYVTFINDFSRK------------------------------------------
                                       D+WGPSPV+SLGGSR Y+TFI+D SRK                                          
Subjt:  -------------------------------DVWGPSPVSSLGGSRLYVTFINDFSRK------------------------------------------

Query:  -ANAVNT-TYLINRGSSVPLKFKLPEEVWTGKELKYSHLRIFGCTAYVHVDLEKRDKLDVKAVKCYFIGYGSNIFGYRFWDDKNRKILRHYDMTFDENVL
         A+AV+T  YLINRG SVP++F+LPEEVW+GKE+K+SHL++F C +YVH+D + R KLD K+  C+FIGYG   FGYRFWD++NRKI+R  ++ F+E V+
Subjt:  -ANAVNT-TYLINRGSSVPLKFKLPEEVWTGKELKYSHLRIFGCTAYVHVDLEKRDKLDVKAVKCYFIGYGSNIFGYRFWDDKNRKILRHYDMTFDENVL

Query:  YKNKEKINSETTKQVRVELEWQENSPSDVTVEAQETPNPVAE
        YK++  + S+ T     E++ ++    ++    +    PVAE
Subjt:  YKNKEKINSETTKQVRVELEWQENSPSDVTVEAQETPNPVAE

A5CAX7 Uncharacterized protein2.3e-11637.87Show/hide
Query:  GIEKFDGSDFDFWKMQIEDYLYKKDLHEPLWGVKLDTMTTEQWKLKDRQALWLIRLTLSRNAAFNIIKEKTTSDLLKALSNMYEKLSAMNKVYLMRRLFN
        GIEKFDG+DF +W+MQIEDYLY + LH PL G K ++M  E+W L DRQ L +IRLTLSR+ A N++KEKTT+DL+KALS MYEK SA NKV+LM++LFN
Subjt:  GIEKFDGSDFDFWKMQIEDYLYKKDLHEPLWGVKLDTMTTEQWKLKDRQALWLIRLTLSRNAAFNIIKEKTTSDLLKALSNMYEKLSAMNKVYLMRRLFN

Query:  LQMSEGGSIADYINEFNMIVSRLSLVEINFKDEIKALILMSSLPESWDTVVAAINSSRGSDKLKFDEIRDLVLGESIRTRKTGDSSGKALS---------
        L+M+E  S+  ++NEFN I ++LS VEI+F DEI+ALI+++SLP SW+ +  A+++S G +KLK+++IRDL+L E IR R  G++SG   +         
Subjt:  LQMSEGGSIADYINEFNMIVSRLSLVEINFKDEIKALILMSSLPESWDTVVAAINSSRGSDKLKFDEIRDLVLGESIRTRKTGDSSGKALS---------

Query:  -----------------------------------TDCTKLKKKQNHKSEDDDDSIYTTEDTEDALILSVDSPAESGFW----------------IQQGY
                                           T   K + K   K  +DD +   TE+ +DAL+L+VDSP +   W                IQ   
Subjt:  -----------------------------------TDCTKLKKKQNHKSEDDDDSIYTTEDTEDALILSVDSPAESGFW----------------IQQGY

Query:  AAEFGK-----------------------------------------------------------SSWKIVKGAMVVARGTKSGTLYTTAECINMTATDG
        A +FGK                                                            +WK+ KGA V+ARG K+GTLY T+   +  A   
Subjt:  AAEFGK-----------------------------------------------------------SSWKIVKGAMVVARGTKSGTLYTTAECINMTATDG

Query:  STSNSSLWHNRLGHLSVKGMKMLIAKGALEGLKSVDI---------------------------------DVWGPSPVSSLGGSRLYVTFINDFSRK---
        +++++SLWH RLGH+S KGMKML++KG L  LKS+D                                  D+WGPSPV+SLGGSR Y+TFI+D SRK   
Subjt:  STSNSSLWHNRLGHLSVKGMKMLIAKGALEGLKSVDI---------------------------------DVWGPSPVSSLGGSRLYVTFINDFSRK---

Query:  ----------------------------------------ANAVNT-TYLINRGSSVPLKFKLPEEVWTGKELKYSHLRIFGCTAYVHVDLEKRDKLDVK
                                                A+AV+T  YLINRG SVP++F+LPEEVW+GKE+K+SHL++FGC +YVH+D + R KLD K
Subjt:  ----------------------------------------ANAVNT-TYLINRGSSVPLKFKLPEEVWTGKELKYSHLRIFGCTAYVHVDLEKRDKLDVK

Query:  AVKCYFIGYGSNIFGYRFWDDKNRKILRHYDMTFDENVLYKNKEKINSETTKQVRVELEW---QENSPSDVTVEAQETPNPVAEELDVEQVTPEEKIIQN
        +  C+FIGYG   FGYRFWD++NRKI+R  ++ F+E V+YK++  + S+ T+  + + E+    E + S V    +E    V  ++D+   TP  ++ ++
Subjt:  AVKCYFIGYGSNIFGYRFWDDKNRKILRHYDMTFDENVLYKNKEKINSETTKQVRVELEW---QENSPSDVTVEAQETPNPVAEELDVEQVTPEEKIIQN

Query:  YQSTR
         ++TR
Subjt:  YQSTR

SwissProt top hitse value%identityAlignment
P04146 Copia protein8.2e-1034.62Show/hide
Query:  VNTTYLINRGSSVPL--KFKLPEEVWTGKELKYSHLRIFGCTAYVHVDLEKRDKLDVKAVKCYFIGYGSNIFGYRFWDDKNRKILRHYDMTFDENVLYKN
        +  TYLINR  S  L    K P E+W  K+    HLR+FG T YVH+   K+ K D K+ K  F+GY  N  G++ WD  N K +   D+  DE  +  N
Subjt:  VNTTYLINRGSSVPL--KFKLPEEVWTGKELKYSHLRIFGCTAYVHVDLEKRDKLDVKAVKCYFIGYGSNIFGYRFWDDKNRKILRHYDMTFDENVLYKN

Query:  KEKINSETTKQVRVELEWQENSPSDVTVEAQ-ETPNPVAEELDVEQVTPEEKIIQN
           +  ET      +    +N P+D     Q E PN  ++E D  Q   + K  +N
Subjt:  KEKINSETTKQVRVELEWQENSPSDVTVEAQ-ETPNPVAEELDVEQVTPEEKIIQN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.9e-4724.24Show/hide
Query:  MESSKIGIEKFDGSD-FDFWKMQIEDYLYKKDLHEPL--WGVKLDTMTTEQWKLKDRQALWLIRLTLSRNAAFNIIKEKTTSDLLKALSNMYEKLSAMNK
        M   K  + KF+G + F  W+ ++ D L ++ LH+ L     K DTM  E W   D +A   IRL LS +   NII E T   +   L ++Y   +  NK
Subjt:  MESSKIGIEKFDGSD-FDFWKMQIEDYLYKKDLHEPL--WGVKLDTMTTEQWKLKDRQALWLIRLTLSRNAAFNIIKEKTTSDLLKALSNMYEKLSAMNK

Query:  VYLMRRLFNLQMSEGGSIADYINEFNMIVSRLSLVEINFKDEIKALILMSSLPESWDTVVAAINSSRGSDKLKFDEIRDLVLGESIRTRKTGDSSGKALS
        +YL ++L+ L MSEG +   ++N FN ++++L+ + +  ++E KA++L++SLP S+D +   I   + + +LK D    L+L E  + RK  ++ G+AL 
Subjt:  VYLMRRLFNLQMSEGGSIADYINEFNMIVSRLSLVEINFKDEIKALILMSSLPESWDTVVAAINSSRGSDKLKFDEIRDLVLGESIRTRKTGDSSGKALS

Query:  T---------------------------------------------DCTKLKKKQNHKS-EDDDDSIYTTEDTEDALILSVDSPAE-------SGFWI--
        T                                             DC   +K +   S + +DD+        D ++L ++   E          W+  
Subjt:  T---------------------------------------------DCTKLKKKQNHKS-EDDDDSIYTTEDTEDALILSVDSPAE-------SGFWI--

Query:  -------------------------------------------------------------------------QQGYAAEFGKSSWKIVKGAMVVARGTK
                                                                                 + GY + F    W++ KG++V+A+G  
Subjt:  -------------------------------------------------------------------------QQGYAAEFGKSSWKIVKGAMVVARGTK

Query:  SGTLYTTAECINMTATDGSTSNSS--LWHNRLGHLSVKGMKMLIAKGALEGLKSVDI--------------------------------DVWGPSPVSSL
         GTLY T   I     + +    S  LWH R+GH+S KG+++L  K  +   K   +                                DV GP  + S+
Subjt:  SGTLYTTAECINMTATDGSTSNSS--LWHNRLGHLSVKGMKMLIAKGALEGLKSVDI--------------------------------DVWGPSPVSSL

Query:  GGSRLYVTFINDFSRK-----------------------------------------------------------------------ANAVNTT------
        GG++ +VTFI+D SRK                                                                       A  +N T      
Subjt:  GGSRLYVTFINDFSRK-----------------------------------------------------------------------ANAVNTT------

Query:  ---------------------YLINRGSSVPLKFKLPEEVWTGKELKYSHLRIFGCTAYVHVDLEKRDKLDVKAVKCYFIGYGSNIFGYRFWDDKNRKIL
                             YLINR  SVPL F++PE VWT KE+ YSHL++FGC A+ HV  E+R KLD K++ C FIGYG   FGYR WD   +K++
Subjt:  ---------------------YLINRGSSVPLKFKLPEEVWTGKELKYSHLRIFGCTAYVHVDLEKRDKLDVKAVKCYFIGYGSNIFGYRFWDDKNRKIL

Query:  RHYDMTFDENVLYKNKEKINSETTKQVRVELEWQENSPSDVTVEAQETPNPVAEE
        R  D+ F E+ +    +   SE  K   +       S S+    A+ T + V+E+
Subjt:  RHYDMTFDENVLYKNKEKINSETTKQVRVELEWQENSPSDVTVEAQETPNPVAEE

P92512 Uncharacterized mitochondrial protein AtMg007103.5e-0545.1Show/hide
Query:  KANAVNT-TYLINRGSSVPLKFKLPEEVWTGKELKYSHLRIFGCTAYVHVD
        +A+A NT  ++IN+  S  + F +P+EVW      YS+LR FGC AY+H D
Subjt:  KANAVNT-TYLINRGSSVPLKFKLPEEVWTGKELKYSHLRIFGCTAYVHVD

Arabidopsis top hitse value%identityAlignment
AT3G29785.1 unknown protein3.2e-1748.89Show/hide
Query:  EKFDGSDFDFWKMQIEDYLYKKDLHEPLWGVKLDTMTTEQWKLKDRQALWLIRLTLSRNAAFNIIKEKTTSDLLKALSNMYEKLSAMNKV
        +K DG+ + F +M+IEDYLY K LH+PL G K++TM+ + W +  RQ L +IRLT+S+N A N+ KEK+   L+K LS++Y+K S  N V
Subjt:  EKFDGSDFDFWKMQIEDYLYKKDLHEPLWGVKLDTMTTEQWKLKDRQALWLIRLTLSRNAAFNIIKEKTTSDLLKALSNMYEKLSAMNKV

ATMG00300.1 Gag-Pol-related retrotransposon family protein4.0e-0430.88Show/hide
Query:  KIVKGAMVVARGTKSGTLYT---TAECINMTATDGSTSNSSLWHNRLGHLSVKGMKMLIAKGALEGLK
        K++KG   + +G +  +LY    + E       + +   + LWH+RL H+S +GM++L+ KG L+  K
Subjt:  KIVKGAMVVARGTKSGTLYT---TAECINMTATDGSTSNSSLWHNRLGHLSVKGMKMLIAKGALEGLK

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.5e-0645.1Show/hide
Query:  KANAVNT-TYLINRGSSVPLKFKLPEEVWTGKELKYSHLRIFGCTAYVHVD
        +A+A NT  ++IN+  S  + F +P+EVW      YS+LR FGC AY+H D
Subjt:  KANAVNT-TYLINRGSSVPLKFKLPEEVWTGKELKYSHLRIFGCTAYVHVD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAAATCCTTCCGGATTCCTAATGGTGCACATTTTCCCAGGAAAGATCGCCAAATACCTCTTCCAATGTTCATTTCAGTGCTGCAGCCCACTATAACTTTCTTCTC
GGCCTTTGAGCTCGGCAGCACCATCGCGCAACTTGATAGCTTATTCGACTTGTTAAAGGATATTTGGATACTTTTCCATTATGATGAGTATGGATGCGTATATTATGAGA
CTGATTATGACCATCATGTTCACCATGATGATACGATGGTCGCTATTCTCTCTGGAAGGTCAGCACCAGTCCGTCAGCGAGTGCCCCAACCAATAGGCACATGGGATGTG
CGAGCCTGCCTAGTGGGTCCACATTTGCACATATGGATCGTGTGTAGAGAAGGGAAGCCCGAAAGGGAAATCCCAAAGAGGACAATATCTACTAGCAGTGGATCTAGGCC
GTTACAAATGGTAACAGAGCCAGACACTGGACGATGTGCTAGCCTTCTCGCTGTTCCCCGAAGGAGGGTAGACACGAGGCGGTGTGCCAGTAAGGATGCTGGGCCTCAAA
GGGGGGTGGATTTGGGGGCGGTCCCACATCGATTGGAGGAAGGAAAGAGCGCCACCGTGGACGCTGGGCCCCTAAGGGGGGGGAAGCCTGAAAAGGAAAGCCCAAAGAGG
ACAATATCTGCTGGCGGTGGATCTGGGCTGTTACGTGATGACACATGCCCATTGGCCCTGGTCGAGTGTGTGCGTTGTTGTGCGATCCTAACAACTGGTATCAGAGCTAG
GTGGAAGTCACCACGATCTATGAAGATGGAAAGTTCAAAGATTGGAATTGAGAAGTTCGATGGATCCGATTTCGATTTCTGGAAGATGCAGATTGAAGATTATCTGTACA
AGAAAGATCTTCATGAACCCCTGTGGGGGGTGAAGCTGGATACTATGACCACGGAACAGTGGAAGCTCAAGGATCGACAAGCCTTATGGCTGATTCGGTTGACGCTATCT
AGAAACGCGGCGTTCAACATTATCAAGGAGAAGACAACGTCAGATCTGTTGAAGGCGTTATCAAATATGTATGAAAAACTGTCAGCTATGAACAAGGTGTATTTGATGCG
GAGATTGTTCAATCTACAGATGTCTGAAGGTGGATCTATTGCTGATTATATAAATGAATTCAATATGATCGTAAGTCGACTGAGTTTGGTGGAAATTAATTTCAAGGATG
AAATTAAAGCGTTGATTTTGATGTCATCTTTACCCGAGTCATGGGATACTGTTGTTGCCGCAATCAACAGTTCTCGAGGATCTGATAAACTGAAGTTTGATGAAATTCGA
GATCTAGTTCTCGGCGAAAGTATTCGCACACGGAAAACTGGAGATTCATCAGGCAAAGCTCTTAGTACAGATTGTACAAAACTGAAGAAGAAGCAGAATCATAAATCTGA
AGATGATGATGATTCTATATATACAACAGAAGATACTGAGGACGCTTTAATCCTTAGTGTGGATAGCCCAGCTGAATCTGGATTTTGGATTCAGCAAGGCTATGCAGCAG
AGTTTGGAAAGAGTTCGTGGAAGATTGTGAAAGGCGCCATGGTTGTTGCACGTGGCACAAAATCCGGAACCTTATACACTACTGCAGAGTGTATAAACATGACTGCTACT
GATGGCAGTACTTCCAATTCAAGTCTATGGCACAATAGACTTGGACATTTGAGCGTCAAAGGAATGAAGATGCTGATTGCAAAAGGAGCTTTAGAAGGCTTAAAATCTGT
TGATATAGACGTCTGGGGTCCATCTCCAGTTTCATCACTTGGTGGATCAAGGTTATACGTCACCTTCATCAATGACTTCAGTAGGAAGGCTAATGCTGTGAACACAACAT
ATTTGATTAATAGAGGGTCGTCAGTACCCTTAAAGTTCAAATTGCCTGAAGAAGTATGGACAGGAAAAGAACTCAAATACTCTCACTTGAGAATTTTTGGTTGTACTGCG
TATGTTCATGTTGATTTAGAGAAGAGAGATAAGCTTGATGTTAAGGCTGTAAAATGCTACTTCATTGGCTATGGCTCTAACATATTCGGGTACAGGTTTTGGGATGACAA
GAATAGGAAAATCCTAAGACACTACGATATGACCTTTGATGAAAATGTTCTGTACAAGAACAAAGAGAAGATAAACTCTGAGACTACTAAGCAAGTGAGAGTTGAGCTTG
AGTGGCAAGAAAATTCACCTAGTGATGTTACAGTAGAAGCTCAAGAAACTCCTAATCCTGTTGCTGAAGAACTAGACGTGGAGCAAGTTACACCTGAGGAGAAAATCATC
CAGAACTATCAGAGCACCAGATAG
mRNA sequenceShow/hide mRNA sequence
ATGAACAAATCCTTCCGGATTCCTAATGGTGCACATTTTCCCAGGAAAGATCGCCAAATACCTCTTCCAATGTTCATTTCAGTGCTGCAGCCCACTATAACTTTCTTCTC
GGCCTTTGAGCTCGGCAGCACCATCGCGCAACTTGATAGCTTATTCGACTTGTTAAAGGATATTTGGATACTTTTCCATTATGATGAGTATGGATGCGTATATTATGAGA
CTGATTATGACCATCATGTTCACCATGATGATACGATGGTCGCTATTCTCTCTGGAAGGTCAGCACCAGTCCGTCAGCGAGTGCCCCAACCAATAGGCACATGGGATGTG
CGAGCCTGCCTAGTGGGTCCACATTTGCACATATGGATCGTGTGTAGAGAAGGGAAGCCCGAAAGGGAAATCCCAAAGAGGACAATATCTACTAGCAGTGGATCTAGGCC
GTTACAAATGGTAACAGAGCCAGACACTGGACGATGTGCTAGCCTTCTCGCTGTTCCCCGAAGGAGGGTAGACACGAGGCGGTGTGCCAGTAAGGATGCTGGGCCTCAAA
GGGGGGTGGATTTGGGGGCGGTCCCACATCGATTGGAGGAAGGAAAGAGCGCCACCGTGGACGCTGGGCCCCTAAGGGGGGGGAAGCCTGAAAAGGAAAGCCCAAAGAGG
ACAATATCTGCTGGCGGTGGATCTGGGCTGTTACGTGATGACACATGCCCATTGGCCCTGGTCGAGTGTGTGCGTTGTTGTGCGATCCTAACAACTGGTATCAGAGCTAG
GTGGAAGTCACCACGATCTATGAAGATGGAAAGTTCAAAGATTGGAATTGAGAAGTTCGATGGATCCGATTTCGATTTCTGGAAGATGCAGATTGAAGATTATCTGTACA
AGAAAGATCTTCATGAACCCCTGTGGGGGGTGAAGCTGGATACTATGACCACGGAACAGTGGAAGCTCAAGGATCGACAAGCCTTATGGCTGATTCGGTTGACGCTATCT
AGAAACGCGGCGTTCAACATTATCAAGGAGAAGACAACGTCAGATCTGTTGAAGGCGTTATCAAATATGTATGAAAAACTGTCAGCTATGAACAAGGTGTATTTGATGCG
GAGATTGTTCAATCTACAGATGTCTGAAGGTGGATCTATTGCTGATTATATAAATGAATTCAATATGATCGTAAGTCGACTGAGTTTGGTGGAAATTAATTTCAAGGATG
AAATTAAAGCGTTGATTTTGATGTCATCTTTACCCGAGTCATGGGATACTGTTGTTGCCGCAATCAACAGTTCTCGAGGATCTGATAAACTGAAGTTTGATGAAATTCGA
GATCTAGTTCTCGGCGAAAGTATTCGCACACGGAAAACTGGAGATTCATCAGGCAAAGCTCTTAGTACAGATTGTACAAAACTGAAGAAGAAGCAGAATCATAAATCTGA
AGATGATGATGATTCTATATATACAACAGAAGATACTGAGGACGCTTTAATCCTTAGTGTGGATAGCCCAGCTGAATCTGGATTTTGGATTCAGCAAGGCTATGCAGCAG
AGTTTGGAAAGAGTTCGTGGAAGATTGTGAAAGGCGCCATGGTTGTTGCACGTGGCACAAAATCCGGAACCTTATACACTACTGCAGAGTGTATAAACATGACTGCTACT
GATGGCAGTACTTCCAATTCAAGTCTATGGCACAATAGACTTGGACATTTGAGCGTCAAAGGAATGAAGATGCTGATTGCAAAAGGAGCTTTAGAAGGCTTAAAATCTGT
TGATATAGACGTCTGGGGTCCATCTCCAGTTTCATCACTTGGTGGATCAAGGTTATACGTCACCTTCATCAATGACTTCAGTAGGAAGGCTAATGCTGTGAACACAACAT
ATTTGATTAATAGAGGGTCGTCAGTACCCTTAAAGTTCAAATTGCCTGAAGAAGTATGGACAGGAAAAGAACTCAAATACTCTCACTTGAGAATTTTTGGTTGTACTGCG
TATGTTCATGTTGATTTAGAGAAGAGAGATAAGCTTGATGTTAAGGCTGTAAAATGCTACTTCATTGGCTATGGCTCTAACATATTCGGGTACAGGTTTTGGGATGACAA
GAATAGGAAAATCCTAAGACACTACGATATGACCTTTGATGAAAATGTTCTGTACAAGAACAAAGAGAAGATAAACTCTGAGACTACTAAGCAAGTGAGAGTTGAGCTTG
AGTGGCAAGAAAATTCACCTAGTGATGTTACAGTAGAAGCTCAAGAAACTCCTAATCCTGTTGCTGAAGAACTAGACGTGGAGCAAGTTACACCTGAGGAGAAAATCATC
CAGAACTATCAGAGCACCAGATAG
Protein sequenceShow/hide protein sequence
MNKSFRIPNGAHFPRKDRQIPLPMFISVLQPTITFFSAFELGSTIAQLDSLFDLLKDIWILFHYDEYGCVYYETDYDHHVHHDDTMVAILSGRSAPVRQRVPQPIGTWDV
RACLVGPHLHIWIVCREGKPEREIPKRTISTSSGSRPLQMVTEPDTGRCASLLAVPRRRVDTRRCASKDAGPQRGVDLGAVPHRLEEGKSATVDAGPLRGGKPEKESPKR
TISAGGGSGLLRDDTCPLALVECVRCCAILTTGIRARWKSPRSMKMESSKIGIEKFDGSDFDFWKMQIEDYLYKKDLHEPLWGVKLDTMTTEQWKLKDRQALWLIRLTLS
RNAAFNIIKEKTTSDLLKALSNMYEKLSAMNKVYLMRRLFNLQMSEGGSIADYINEFNMIVSRLSLVEINFKDEIKALILMSSLPESWDTVVAAINSSRGSDKLKFDEIR
DLVLGESIRTRKTGDSSGKALSTDCTKLKKKQNHKSEDDDDSIYTTEDTEDALILSVDSPAESGFWIQQGYAAEFGKSSWKIVKGAMVVARGTKSGTLYTTAECINMTAT
DGSTSNSSLWHNRLGHLSVKGMKMLIAKGALEGLKSVDIDVWGPSPVSSLGGSRLYVTFINDFSRKANAVNTTYLINRGSSVPLKFKLPEEVWTGKELKYSHLRIFGCTA
YVHVDLEKRDKLDVKAVKCYFIGYGSNIFGYRFWDDKNRKILRHYDMTFDENVLYKNKEKINSETTKQVRVELEWQENSPSDVTVEAQETPNPVAEELDVEQVTPEEKII
QNYQSTR