; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g03790 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g03790
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr7:3315933..3321629
RNA-Seq ExpressionMoc07g03790
SyntenyMoc07g03790
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0140640 - catalytic activity, acting on a nucleic acid (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]9.0e-15452.28Show/hide
Query:  TMEEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVKVSDKRKNIMQEKQ-HEHSVSIASLSVQQLQDMITNSIRAQYGGPSQNS
        T E  + +M++K+N L K+VE+RD++IA+L +Q++       +ESSQ  VVK +DK KN+++E Q  + SVS+ASLSVQQLQDMI NSIRAQYGGP Q S
Subjt:  TMEEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVKVSDKRKNIMQEKQ-HEHSVSIASLSVQQLQDMITNSIRAQYGGPSQNS

Query:  FTYSKPYTKRIDNLIMPMGYQPPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDW----YTDLEPESINGWEQLEREFLNRFYST
        F YSKPYTKRIDNL MP+GYQP KFQQFD KGNPKQHI HFVETCENAG+RGD LV++FVR LKGNAF+      + +E  +     +   +++NR+ + 
Subjt:  FTYSKPYTKRIDNLIMPMGYQPPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDW----YTDLEPESINGWEQLEREFLNRFYST

Query:  SLDCKDRLMELSAIELCIQGMHWGLLYILQGIKPRTFEELATRGHDMELSIANRGDKDLLVP----------NLKRISKNTEMTLEESMVINTTSLKSSS
        SLDCKD+L ELSA+E+C QGMHW LLYILQGIKPRTFEELATR HDM+LSIANRG KD LV           + K+I+ N    L ESM++  T LKS S
Subjt:  SLDCKDRLMELSAIELCIQGMHWGLLYILQGIKPRTFEELATRGHDMELSIANRGDKDLLVP----------NLKRISKNTEMTLEESMVINTTSLKSSS

Query:  KRKDKKVEKRQE-NERRRLTLKDRQEKVYPFPDSDISDMLEQLLKMQLIELPECKRPEDMGKVNDPNYCKYHRVVGHPVGKCFVLKELILKLAQEGKIEL
        KRK+ K ++  + +E+RR TL++RQ+KVYPFPDSD++DMLEQL++ QLI+LPECKRPE +GKV+DPNYCKYHRV+ H V KCFVLKELI KLA+E KIEL
Subjt:  KRKDKKVEKRQE-NERRRLTLKDRQEKVYPFPDSDISDMLEQLLKMQLIELPECKRPEDMGKVNDPNYCKYHRVVGHPVGKCFVLKELILKLAQEGKIEL

Query:  NIDELAQANHIGV----------------------------------ATNTCN----------------QICSKTF---HGQNVEELATTH---------
        +IDE+AQ NH+ V                                   T T N                +   ++F   H + + E+   H         
Subjt:  NIDELAQANHIGV----------------------------------ATNTCN----------------QICSKTF---HGQNVEELATTH---------

Query:  -YINVKEVDDSKEIKQKTSVFDRIKPSTTRVSVFQRMSMAAMKEENQCLT----RPSVFQRLSVSTSRKNQTSTYTLDCLKVGANDRNKGRMKILEMKIF
         Y + +E+D+S EIKQ+TSVFD IKP TTR SVFQR+SMA  KEENQC T    + S F+RLS+S S+K++ STYT D LK+  ND+ +  MK L+ K F
Subjt:  -YINVKEVDDSKEIKQKTSVFDRIKPSTTRVSVFQRMSMAAMKEENQCLT----RPSVFQRLSVSTSRKNQTSTYTLDCLKVGANDRNKGRMKILEMKIF

Query:  ----DGKRIHN-LASRMKRKLSLVIKPKLHGTPSP
            D  +IH+ + SRMKRKLS+ I  +   T  P
Subjt:  ----DGKRIHN-LASRMKRKLSLVIKPKLHGTPSP

KAA0033746.1 retrotransposon gag protein [Cucumis melo var. makuwa]2.6e-15349.07Show/hide
Query:  MASKEAPSTMEEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVKVSDKRKNIMQEKQ-HEHSVSIASLSVQQLQDMITNSIRAQ
        M    A +TMEE    M++K+N L K  E+RD++IA+L +Q++        ESSQ  VVK +DK KN++QE Q  + SVS+ASLSVQQLQDMI NSIRAQ
Subjt:  MASKEAPSTMEEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVKVSDKRKNIMQEKQ-HEHSVSIASLSVQQLQDMITNSIRAQ

Query:  YGGPSQNSFTYSKPYTKRIDNLIMPMGYQPPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDWYTDLEPESINGWEQLEREFLNR
        YGGP Q SF YSK YTKRIDNL MP+GYQPPKFQQFD KGNPKQHIAHFVETCENAG+RGD LV++FVR LKGNAF+WYTDLEPE      +   +++NR
Subjt:  YGGPSQNSFTYSKPYTKRIDNLIMPMGYQPPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDWYTDLEPESINGWEQLEREFLNR

Query:  FYSTSLDCKDRLMELSAIELCIQGMHWGLLYILQGIKPRTFEELATRGHDMELSIANRGDKDLLVPNLKRISKN--------TEMTLEESMVINTTSLKS
        + + SLDCKD+L ELSA+E+C QGMHW LLYILQGIKPRTFEEL+TR HDMELSIAN G KD LV   KR  KN            L ESM++  T LKS
Subjt:  FYSTSLDCKDRLMELSAIELCIQGMHWGLLYILQGIKPRTFEELATRGHDMELSIANRGDKDLLVPNLKRISKN--------TEMTLEESMVINTTSLKS

Query:  SSKRKDKKVEKRQE-NERRRLTLKDRQEKVYPFPDSDISDMLEQLLKMQLIELPECKRPEDMGKVNDPNYCKYHRVVGHPVGKCFVLKELILKLAQEGKI
         SKRK+ K E+  + +E+RR TL++RQ+KVYPFPDSD++DMLEQL++ QLI+LPECKRPE  GKV+DPNYCKYHRV+ HPV KCFVLKELILKLA+E KI
Subjt:  SSKRKDKKVEKRQE-NERRRLTLKDRQEKVYPFPDSDISDMLEQLLKMQLIELPECKRPEDMGKVNDPNYCKYHRVVGHPVGKCFVLKELILKLAQEGKI

Query:  ELNIDELAQANHIGV----------------------------------ATNTCN---------------------------------------------
        EL+IDE+AQ NH+ V                                   T T N                                             
Subjt:  ELNIDELAQANHIGV----------------------------------ATNTCN---------------------------------------------

Query:  ----------------------------------QICSKTF---HGQNVEELATTH----------YINVKEVDDSKEIKQKTSVFDRIKPSTTRVSVFQ
                                          +   ++F   H + + E+   H          Y + +EVD+S EIKQ+T VF RIKP T R SVFQ
Subjt:  ----------------------------------QICSKTF---HGQNVEELATTH----------YINVKEVDDSKEIKQKTSVFDRIKPSTTRVSVFQ

Query:  RMSMAAMKEENQCLT----RPSVFQRLSVSTSRKNQTSTYTLDCLKVGANDRNKGRMKILEMKIF----DGKRIHN-LASRMKRKL--------SLVIKP
        R+SMA  +EENQC T    R S F+RLS+ST +K++ ST   D LK+  ND+ +  MK L+ K F    D  +IH+ + SRMKRKL        SL++KP
Subjt:  RMSMAAMKEENQCLT----RPSVFQRLSVSTSRKNQTSTYTLDCLKVGANDRNKGRMKILEMKIF----DGKRIHN-LASRMKRKL--------SLVIKP

Query:  K
        +
Subjt:  K

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.9e-17256.64Show/hide
Query:  TMEEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVKVSDKRKNIMQEKQHEHSVSIASLSVQQLQDMITNSIRAQYGGPSQNSF
        T E ++ +++KK+NML K+VE+RDY+IA L N IE +D    AESS  H VK +DK K +MQE Q ++S SIASLSVQQLQ+MI +SI+ QYGGP+Q   
Subjt:  TMEEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVKVSDKRKNIMQEKQHEHSVSIASLSVQQLQDMITNSIRAQYGGPSQNSF

Query:  TYSKPYTKRIDNLIMPMGYQPPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDWYTDLEPESINGWEQLEREFLNRFYST-----
         Y KPYTKRIDNL MP GYQPPKFQQFD KGNPKQH+AHF++TCE AGTRGDLLVK+FVR LKGNA DWY DLEPESI+ WEQLER+FLNRFYST     
Subjt:  TYSKPYTKRIDNLIMPMGYQPPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDWYTDLEPESINGWEQLEREFLNRFYST-----

Query:  -------------------------SLDCKDRLMELSAIELCIQGMHWGLLYILQGIKPRTFEELATRGHDMELSIANRGDKDLLVP-------NLKRIS
                                 SLDCKDRL ELSA+E+C QGMHWGLLYILQGIKPRTFEELATR HDMELSIANRG KD L+P        L    
Subjt:  -------------------------SLDCKDRLMELSAIELCIQGMHWGLLYILQGIKPRTFEELATRGHDMELSIANRGDKDLLVP-------NLKRIS

Query:  KNTEMTLEESMVINTTSLKSSSKRKDKKVEKRQE-NERRRLTLKDRQEKVYPFPDSDISDMLEQLLKMQLIELPECKRPEDMGKVNDPNYCKYHRVVGHP
        K     ++ESMV++ T LKS SKRK+ K+E++ + +E+R+ TLK+RQEKVYPFPDSD++DMLEQLL+ QLI+LPECKRPE  GKV+DPNYCKYHRV+ HP
Subjt:  KNTEMTLEESMVINTTSLKSSSKRKDKKVEKRQE-NERRRLTLKDRQEKVYPFPDSDISDMLEQLLKMQLIELPECKRPEDMGKVNDPNYCKYHRVVGHP

Query:  VGKCFVLKELILKLAQEGKIELNIDELAQANH-IGVATN----------------TCNQICSKTFHGQNVEEL-------------ATTHYINVKEVDDS
        V KCFVLKELILKLA+E KIEL+IDE+AQ NH I + +N                T  +   ++F   + EE+                +Y + KEV++S
Subjt:  VGKCFVLKELILKLAQEGKIELNIDELAQANH-IGVATN----------------TCNQICSKTFHGQNVEEL-------------ATTHYINVKEVDDS

Query:  KEIKQKTSVFDRIKPSTTRVSVFQRMSMAAMKEENQC----LTRPSVFQRLSVSTSRKNQTSTYTLDCLKVGANDRNKGRMKILEMKIF----DGKRIHN
         EI Q+TSVFDRIKPSTTR SVFQR+S+A  +EENQC     TR S  +RLS+ST +K++ ST + D LK+  ND+ +  MK  + K F    D  +IH+
Subjt:  KEIKQKTSVFDRIKPSTTRVSVFQRMSMAAMKEENQC----LTRPSVFQRLSVSTSRKNQTSTYTLDCLKVGANDRNKGRMKILEMKIF----DGKRIHN

Query:  -LASRMKRKL--------SLVIKPK
         + SRMKRKL        SL +KP+
Subjt:  -LASRMKRKL--------SLVIKPK

TYK03695.1 retrotransposon gag protein [Cucumis melo var. makuwa]2.8e-16349.79Show/hide
Query:  TMEEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVKVSDKRKNIMQEKQHEHSVSIASLSVQQLQDMITNSIRAQYGGPSQNSF
        T E ++ +++KK+NML K+VE+RDY+IA L N IE +D    AESS  H VK +DK K +MQE Q ++S SIASLSVQQLQ+MI +SI+ QYGGP+Q   
Subjt:  TMEEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVKVSDKRKNIMQEKQHEHSVSIASLSVQQLQDMITNSIRAQYGGPSQNSF

Query:  TYSKPYTKRIDNLIMPMGYQPPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDWYTDLEPESINGWEQLEREFLNRFYST-----
         YSKPYTKRIDNL MP GYQPPKFQQFD KGNPKQH+AHF+ETCE AGTRGDLLVK+FVR LKGNAFD Y DLEPESI+ WEQLER+FLNRFYST     
Subjt:  TYSKPYTKRIDNLIMPMGYQPPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDWYTDLEPESINGWEQLEREFLNRFYST-----

Query:  -------------------------SLDCKDRLMELSAIELCIQGMHWGLLYILQGIKPRTFEELATRGHDMELSIANRGDKDLLVP-------NLKRIS
                                 SLDCKDRL ELSA+E+C QGMHWGLLYILQGIKPRTFEELATR HDMELSI NRG KD L+P        L    
Subjt:  -------------------------SLDCKDRLMELSAIELCIQGMHWGLLYILQGIKPRTFEELATRGHDMELSIANRGDKDLLVP-------NLKRIS

Query:  KNTEMTLEESMVINTTSLKSSSKRKDKKVEKRQE-NERRRLTLKDRQEKVYPFPDSDISDMLEQLLKMQLIELPECKRPEDMGKVNDPNYCKYHRVVGHP
        K     ++ESMV++ T LKS SKRK+ K+E++ + +E+R+ TLK+RQEKVYPF DSD++DMLEQLL+ QLI+LP+CKRP+   KV+DPNYCKYHRV+ HP
Subjt:  KNTEMTLEESMVINTTSLKSSSKRKDKKVEKRQE-NERRRLTLKDRQEKVYPFPDSDISDMLEQLLKMQLIELPECKRPEDMGKVNDPNYCKYHRVVGHP

Query:  VGKCFVLKELILKLAQEGKIELNIDELAQANHIGVATN--------------------------------------------------------------
        V KCFVLKELILKLA+E KIELNIDE+AQ NH+ +                                                                 
Subjt:  VGKCFVLKELILKLAQEGKIELNIDELAQANHIGVATN--------------------------------------------------------------

Query:  -----------------------------------------------TCNQICSKTF---HGQNVEELATTH----------YINVKEVDDSKEIKQKTS
                                                       T  +   ++F   H + + E+ T H          Y + KEV++  EI Q+TS
Subjt:  -----------------------------------------------TCNQICSKTF---HGQNVEELATTH----------YINVKEVDDSKEIKQKTS

Query:  VFDRIKPSTTRVSVFQRMSMAAMKEENQC----LTRPSVFQRLSVSTSRKNQTSTYTLDCLKVGANDRNKGRMKILEMKIF----DGKRIHN-LASRMKR
        VFDRIKPSTTR SVFQR+SMA  +EENQC     TR S F+RLS+S S+KN+ ST   D LK+  ND+ +  MK L+ K F    D  +IH+ + SRMKR
Subjt:  VFDRIKPSTTRVSVFQRMSMAAMKEENQC----LTRPSVFQRLSVSTSRKNQTSTYTLDCLKVGANDRNKGRMKILEMKIF----DGKRIHN-LASRMKR

Query:  KLSLVIKPK
        KL + I  K
Subjt:  KLSLVIKPK

XP_031742032.1 uncharacterized protein LOC116404025 [Cucumis sativus]2.2e-15265.32Show/hide
Query:  MASKEAPSTMEEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVKVSDKRKNIMQEKQ-HEHSVSIASLSVQQLQDMITNSIRAQ
        M+   A   +E  + +M++K+N+L K+V++RD++IA+L  Q++ ++    AESSQ  VVKV DK KN++QE Q  + S S+ASLSVQQLQDMIT+SIRAQ
Subjt:  MASKEAPSTMEEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVKVSDKRKNIMQEKQ-HEHSVSIASLSVQQLQDMITNSIRAQ

Query:  YGGPSQNSFTYSKPYTKRIDNLIMPMGYQPPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDWYTDLEPESINGWEQLEREFLNR
        YGGPSQ SF YSKPYTKRIDNL MP+GYQPPKFQQFD KGNPKQH+AHFVETCENAG+RGD LV++FVR LKGNAF+WYTDLEPESI  WEQLE+EFLNR
Subjt:  YGGPSQNSFTYSKPYTKRIDNLIMPMGYQPPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDWYTDLEPESINGWEQLEREFLNR

Query:  FYST------------------------------SLDCKDRLMELSAIELCIQGMHWGLLYILQGIKPRTFEELATRGHDMELSIANRGDKDLLVPNLKR
        FYST                              SLDCKDRL ELSA+E+C QGMHWGLLYILQGIKPRTFEELATR HDMELSIA+RG KD LVP +K+
Subjt:  FYST------------------------------SLDCKDRLMELSAIELCIQGMHWGLLYILQGIKPRTFEELATRGHDMELSIANRGDKDLLVPNLKR

Query:  ISKN-------TEMTLEESMVINTTSLKSSSKRKDKKVEKRQE-NERRRLTLKDRQEKVYPFPDSDISDMLEQLLKMQLIELPECKRPEDMGKVNDPNYC
          K         + TL+ESMV+NTT LK  SK K+ +VEK+ + +ERRRLTLK+RQEKVYPFPDSDI+DMLEQLL+ QLI+LPECKRPE  GKV+DPNYC
Subjt:  ISKN-------TEMTLEESMVINTTSLKSSSKRKDKKVEKRQE-NERRRLTLKDRQEKVYPFPDSDISDMLEQLLKMQLIELPECKRPEDMGKVNDPNYC

Query:  KYHRVVGHPVGKCFVLKELILKLAQEGKIELNIDELAQANHIGV
        KYHRV+ HPV KCFVLKELIL+LA+E +IEL+++E+AQ NH  V
Subjt:  KYHRVVGHPVGKCFVLKELILKLAQEGKIELNIDELAQANHIGV

TrEMBL top hitse value%identityAlignment
A0A5A7SRE2 Ty3-gypsy retrotransposon protein4.3e-15452.28Show/hide
Query:  TMEEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVKVSDKRKNIMQEKQ-HEHSVSIASLSVQQLQDMITNSIRAQYGGPSQNS
        T E  + +M++K+N L K+VE+RD++IA+L +Q++       +ESSQ  VVK +DK KN+++E Q  + SVS+ASLSVQQLQDMI NSIRAQYGGP Q S
Subjt:  TMEEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVKVSDKRKNIMQEKQ-HEHSVSIASLSVQQLQDMITNSIRAQYGGPSQNS

Query:  FTYSKPYTKRIDNLIMPMGYQPPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDW----YTDLEPESINGWEQLEREFLNRFYST
        F YSKPYTKRIDNL MP+GYQP KFQQFD KGNPKQHI HFVETCENAG+RGD LV++FVR LKGNAF+      + +E  +     +   +++NR+ + 
Subjt:  FTYSKPYTKRIDNLIMPMGYQPPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDW----YTDLEPESINGWEQLEREFLNRFYST

Query:  SLDCKDRLMELSAIELCIQGMHWGLLYILQGIKPRTFEELATRGHDMELSIANRGDKDLLVP----------NLKRISKNTEMTLEESMVINTTSLKSSS
        SLDCKD+L ELSA+E+C QGMHW LLYILQGIKPRTFEELATR HDM+LSIANRG KD LV           + K+I+ N    L ESM++  T LKS S
Subjt:  SLDCKDRLMELSAIELCIQGMHWGLLYILQGIKPRTFEELATRGHDMELSIANRGDKDLLVP----------NLKRISKNTEMTLEESMVINTTSLKSSS

Query:  KRKDKKVEKRQE-NERRRLTLKDRQEKVYPFPDSDISDMLEQLLKMQLIELPECKRPEDMGKVNDPNYCKYHRVVGHPVGKCFVLKELILKLAQEGKIEL
        KRK+ K ++  + +E+RR TL++RQ+KVYPFPDSD++DMLEQL++ QLI+LPECKRPE +GKV+DPNYCKYHRV+ H V KCFVLKELI KLA+E KIEL
Subjt:  KRKDKKVEKRQE-NERRRLTLKDRQEKVYPFPDSDISDMLEQLLKMQLIELPECKRPEDMGKVNDPNYCKYHRVVGHPVGKCFVLKELILKLAQEGKIEL

Query:  NIDELAQANHIGV----------------------------------ATNTCN----------------QICSKTF---HGQNVEELATTH---------
        +IDE+AQ NH+ V                                   T T N                +   ++F   H + + E+   H         
Subjt:  NIDELAQANHIGV----------------------------------ATNTCN----------------QICSKTF---HGQNVEELATTH---------

Query:  -YINVKEVDDSKEIKQKTSVFDRIKPSTTRVSVFQRMSMAAMKEENQCLT----RPSVFQRLSVSTSRKNQTSTYTLDCLKVGANDRNKGRMKILEMKIF
         Y + +E+D+S EIKQ+TSVFD IKP TTR SVFQR+SMA  KEENQC T    + S F+RLS+S S+K++ STYT D LK+  ND+ +  MK L+ K F
Subjt:  -YINVKEVDDSKEIKQKTSVFDRIKPSTTRVSVFQRMSMAAMKEENQCLT----RPSVFQRLSVSTSRKNQTSTYTLDCLKVGANDRNKGRMKILEMKIF

Query:  ----DGKRIHN-LASRMKRKLSLVIKPKLHGTPSP
            D  +IH+ + SRMKRKLS+ I  +   T  P
Subjt:  ----DGKRIHN-LASRMKRKLSLVIKPKLHGTPSP

A0A5A7SUW1 Retrotransposon gag protein1.3e-15349.07Show/hide
Query:  MASKEAPSTMEEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVKVSDKRKNIMQEKQ-HEHSVSIASLSVQQLQDMITNSIRAQ
        M    A +TMEE    M++K+N L K  E+RD++IA+L +Q++        ESSQ  VVK +DK KN++QE Q  + SVS+ASLSVQQLQDMI NSIRAQ
Subjt:  MASKEAPSTMEEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVKVSDKRKNIMQEKQ-HEHSVSIASLSVQQLQDMITNSIRAQ

Query:  YGGPSQNSFTYSKPYTKRIDNLIMPMGYQPPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDWYTDLEPESINGWEQLEREFLNR
        YGGP Q SF YSK YTKRIDNL MP+GYQPPKFQQFD KGNPKQHIAHFVETCENAG+RGD LV++FVR LKGNAF+WYTDLEPE      +   +++NR
Subjt:  YGGPSQNSFTYSKPYTKRIDNLIMPMGYQPPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDWYTDLEPESINGWEQLEREFLNR

Query:  FYSTSLDCKDRLMELSAIELCIQGMHWGLLYILQGIKPRTFEELATRGHDMELSIANRGDKDLLVPNLKRISKN--------TEMTLEESMVINTTSLKS
        + + SLDCKD+L ELSA+E+C QGMHW LLYILQGIKPRTFEEL+TR HDMELSIAN G KD LV   KR  KN            L ESM++  T LKS
Subjt:  FYSTSLDCKDRLMELSAIELCIQGMHWGLLYILQGIKPRTFEELATRGHDMELSIANRGDKDLLVPNLKRISKN--------TEMTLEESMVINTTSLKS

Query:  SSKRKDKKVEKRQE-NERRRLTLKDRQEKVYPFPDSDISDMLEQLLKMQLIELPECKRPEDMGKVNDPNYCKYHRVVGHPVGKCFVLKELILKLAQEGKI
         SKRK+ K E+  + +E+RR TL++RQ+KVYPFPDSD++DMLEQL++ QLI+LPECKRPE  GKV+DPNYCKYHRV+ HPV KCFVLKELILKLA+E KI
Subjt:  SSKRKDKKVEKRQE-NERRRLTLKDRQEKVYPFPDSDISDMLEQLLKMQLIELPECKRPEDMGKVNDPNYCKYHRVVGHPVGKCFVLKELILKLAQEGKI

Query:  ELNIDELAQANHIGV----------------------------------ATNTCN---------------------------------------------
        EL+IDE+AQ NH+ V                                   T T N                                             
Subjt:  ELNIDELAQANHIGV----------------------------------ATNTCN---------------------------------------------

Query:  ----------------------------------QICSKTF---HGQNVEELATTH----------YINVKEVDDSKEIKQKTSVFDRIKPSTTRVSVFQ
                                          +   ++F   H + + E+   H          Y + +EVD+S EIKQ+T VF RIKP T R SVFQ
Subjt:  ----------------------------------QICSKTF---HGQNVEELATTH----------YINVKEVDDSKEIKQKTSVFDRIKPSTTRVSVFQ

Query:  RMSMAAMKEENQCLT----RPSVFQRLSVSTSRKNQTSTYTLDCLKVGANDRNKGRMKILEMKIF----DGKRIHN-LASRMKRKL--------SLVIKP
        R+SMA  +EENQC T    R S F+RLS+ST +K++ ST   D LK+  ND+ +  MK L+ K F    D  +IH+ + SRMKRKL        SL++KP
Subjt:  RMSMAAMKEENQCLT----RPSVFQRLSVSTSRKNQTSTYTLDCLKVGANDRNKGRMKILEMKIF----DGKRIHN-LASRMKRKL--------SLVIKP

Query:  K
        +
Subjt:  K

A0A5A7URH1 Ty3-gypsy retrotransposon protein9.3e-17356.64Show/hide
Query:  TMEEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVKVSDKRKNIMQEKQHEHSVSIASLSVQQLQDMITNSIRAQYGGPSQNSF
        T E ++ +++KK+NML K+VE+RDY+IA L N IE +D    AESS  H VK +DK K +MQE Q ++S SIASLSVQQLQ+MI +SI+ QYGGP+Q   
Subjt:  TMEEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVKVSDKRKNIMQEKQHEHSVSIASLSVQQLQDMITNSIRAQYGGPSQNSF

Query:  TYSKPYTKRIDNLIMPMGYQPPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDWYTDLEPESINGWEQLEREFLNRFYST-----
         Y KPYTKRIDNL MP GYQPPKFQQFD KGNPKQH+AHF++TCE AGTRGDLLVK+FVR LKGNA DWY DLEPESI+ WEQLER+FLNRFYST     
Subjt:  TYSKPYTKRIDNLIMPMGYQPPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDWYTDLEPESINGWEQLEREFLNRFYST-----

Query:  -------------------------SLDCKDRLMELSAIELCIQGMHWGLLYILQGIKPRTFEELATRGHDMELSIANRGDKDLLVP-------NLKRIS
                                 SLDCKDRL ELSA+E+C QGMHWGLLYILQGIKPRTFEELATR HDMELSIANRG KD L+P        L    
Subjt:  -------------------------SLDCKDRLMELSAIELCIQGMHWGLLYILQGIKPRTFEELATRGHDMELSIANRGDKDLLVP-------NLKRIS

Query:  KNTEMTLEESMVINTTSLKSSSKRKDKKVEKRQE-NERRRLTLKDRQEKVYPFPDSDISDMLEQLLKMQLIELPECKRPEDMGKVNDPNYCKYHRVVGHP
        K     ++ESMV++ T LKS SKRK+ K+E++ + +E+R+ TLK+RQEKVYPFPDSD++DMLEQLL+ QLI+LPECKRPE  GKV+DPNYCKYHRV+ HP
Subjt:  KNTEMTLEESMVINTTSLKSSSKRKDKKVEKRQE-NERRRLTLKDRQEKVYPFPDSDISDMLEQLLKMQLIELPECKRPEDMGKVNDPNYCKYHRVVGHP

Query:  VGKCFVLKELILKLAQEGKIELNIDELAQANH-IGVATN----------------TCNQICSKTFHGQNVEEL-------------ATTHYINVKEVDDS
        V KCFVLKELILKLA+E KIEL+IDE+AQ NH I + +N                T  +   ++F   + EE+                +Y + KEV++S
Subjt:  VGKCFVLKELILKLAQEGKIELNIDELAQANH-IGVATN----------------TCNQICSKTFHGQNVEEL-------------ATTHYINVKEVDDS

Query:  KEIKQKTSVFDRIKPSTTRVSVFQRMSMAAMKEENQC----LTRPSVFQRLSVSTSRKNQTSTYTLDCLKVGANDRNKGRMKILEMKIF----DGKRIHN
         EI Q+TSVFDRIKPSTTR SVFQR+S+A  +EENQC     TR S  +RLS+ST +K++ ST + D LK+  ND+ +  MK  + K F    D  +IH+
Subjt:  KEIKQKTSVFDRIKPSTTRVSVFQRMSMAAMKEENQC----LTRPSVFQRLSVSTSRKNQTSTYTLDCLKVGANDRNKGRMKILEMKIF----DGKRIHN

Query:  -LASRMKRKL--------SLVIKPK
         + SRMKRKL        SL +KP+
Subjt:  -LASRMKRKL--------SLVIKPK

A0A5D3BX77 Retrotransposon gag protein1.3e-16349.79Show/hide
Query:  TMEEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVKVSDKRKNIMQEKQHEHSVSIASLSVQQLQDMITNSIRAQYGGPSQNSF
        T E ++ +++KK+NML K+VE+RDY+IA L N IE +D    AESS  H VK +DK K +MQE Q ++S SIASLSVQQLQ+MI +SI+ QYGGP+Q   
Subjt:  TMEEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVKVSDKRKNIMQEKQHEHSVSIASLSVQQLQDMITNSIRAQYGGPSQNSF

Query:  TYSKPYTKRIDNLIMPMGYQPPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDWYTDLEPESINGWEQLEREFLNRFYST-----
         YSKPYTKRIDNL MP GYQPPKFQQFD KGNPKQH+AHF+ETCE AGTRGDLLVK+FVR LKGNAFD Y DLEPESI+ WEQLER+FLNRFYST     
Subjt:  TYSKPYTKRIDNLIMPMGYQPPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDWYTDLEPESINGWEQLEREFLNRFYST-----

Query:  -------------------------SLDCKDRLMELSAIELCIQGMHWGLLYILQGIKPRTFEELATRGHDMELSIANRGDKDLLVP-------NLKRIS
                                 SLDCKDRL ELSA+E+C QGMHWGLLYILQGIKPRTFEELATR HDMELSI NRG KD L+P        L    
Subjt:  -------------------------SLDCKDRLMELSAIELCIQGMHWGLLYILQGIKPRTFEELATRGHDMELSIANRGDKDLLVP-------NLKRIS

Query:  KNTEMTLEESMVINTTSLKSSSKRKDKKVEKRQE-NERRRLTLKDRQEKVYPFPDSDISDMLEQLLKMQLIELPECKRPEDMGKVNDPNYCKYHRVVGHP
        K     ++ESMV++ T LKS SKRK+ K+E++ + +E+R+ TLK+RQEKVYPF DSD++DMLEQLL+ QLI+LP+CKRP+   KV+DPNYCKYHRV+ HP
Subjt:  KNTEMTLEESMVINTTSLKSSSKRKDKKVEKRQE-NERRRLTLKDRQEKVYPFPDSDISDMLEQLLKMQLIELPECKRPEDMGKVNDPNYCKYHRVVGHP

Query:  VGKCFVLKELILKLAQEGKIELNIDELAQANHIGVATN--------------------------------------------------------------
        V KCFVLKELILKLA+E KIELNIDE+AQ NH+ +                                                                 
Subjt:  VGKCFVLKELILKLAQEGKIELNIDELAQANHIGVATN--------------------------------------------------------------

Query:  -----------------------------------------------TCNQICSKTF---HGQNVEELATTH----------YINVKEVDDSKEIKQKTS
                                                       T  +   ++F   H + + E+ T H          Y + KEV++  EI Q+TS
Subjt:  -----------------------------------------------TCNQICSKTF---HGQNVEELATTH----------YINVKEVDDSKEIKQKTS

Query:  VFDRIKPSTTRVSVFQRMSMAAMKEENQC----LTRPSVFQRLSVSTSRKNQTSTYTLDCLKVGANDRNKGRMKILEMKIF----DGKRIHN-LASRMKR
        VFDRIKPSTTR SVFQR+SMA  +EENQC     TR S F+RLS+S S+KN+ ST   D LK+  ND+ +  MK L+ K F    D  +IH+ + SRMKR
Subjt:  VFDRIKPSTTRVSVFQRMSMAAMKEENQC----LTRPSVFQRLSVSTSRKNQTSTYTLDCLKVGANDRNKGRMKILEMKIF----DGKRIHN-LASRMKR

Query:  KLSLVIKPK
        KL + I  K
Subjt:  KLSLVIKPK

A0A5D3D4X3 Ty3-gypsy retrotransposon protein9.1e-15265.36Show/hide
Query:  EEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVKVSDKRKNIMQEKQHEHSVSIASLSVQQLQDMITNSIRAQYGGPSQNSFTY
        E+++ +++KK+NML K+VE+RDY+IA L N IE +D    AESS  H VK +DK K +MQE Q ++S SIASLSVQQLQ+MI +SI+ QYGGP+Q    Y
Subjt:  EEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVKVSDKRKNIMQEKQHEHSVSIASLSVQQLQDMITNSIRAQYGGPSQNSFTY

Query:  SKPYTKRIDNLIMPMGYQPPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDWYTDLEPESINGWEQLEREFLNRFYST-------
        SKPYTKRIDNL MP GYQPPKFQQFD KGNPKQH+AHF+ETCE AGTRGDLLVK+FVR LKGNAFDWY DLEPESI+ WEQLER+FLNRFYST       
Subjt:  SKPYTKRIDNLIMPMGYQPPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDWYTDLEPESINGWEQLEREFLNRFYST-------

Query:  -----------------------SLDCKDRLMELSAIELCIQGMHWGLLYILQGIKPRTFEELATRGHDMELSIANRGDKDLLVP-------NLKRISKN
                               SLDCKDRL ELSA+E+C QGMHWGLLYILQGIKPRTFEELATR HDMELSIANRG KD L+P        L    K 
Subjt:  -----------------------SLDCKDRLMELSAIELCIQGMHWGLLYILQGIKPRTFEELATRGHDMELSIANRGDKDLLVP-------NLKRISKN

Query:  TEMTLEESMVINTTSLKSSSKRKDKKVEKRQE-NERRRLTLKDRQEKVYPFPDSDISDMLEQLLKMQLIELPECKRPEDMGKVNDPNYCKYHRVVGHPVG
            ++ESMV++ T LKS SKRK+ K+E++ + +E+R+ TLK+RQEKVYPFPDSD++DMLEQLL+ QLI+LPECKRPE  GKV+DPNYCKYHRV+ HPV 
Subjt:  TEMTLEESMVINTTSLKSSSKRKDKKVEKRQE-NERRRLTLKDRQEKVYPFPDSDISDMLEQLLKMQLIELPECKRPEDMGKVNDPNYCKYHRVVGHPVG

Query:  KCFVLKELILKLAQEGKIELNIDELAQANHIGV
        KCFVLKELILKLA+E KIEL+IDE+AQ NH+ +
Subjt:  KCFVLKELILKLAQEGKIELNIDELAQANHIGV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCCAAGGAAGCTCCATCCACCATGGAAGAAAAGTTGGTAAAAATGGACAAGAAACTTAACATGCTGACAAAAATGGTCGAAAAGAGGGACTATAAGATT
GCTTCCCTTATGAACCAGATCGAAATTCAAGATGTTGCAAATGTCGCTGAGTCAAGCCAAAATCATGTTGTGAAAGTCAGCGACAAAAGAAAGAACATCATGCAG
GAAAAACAACATGAACATTCTGTTTCAATTGCTTCACTGTCTGTCCAGCAGCTCCAAGATATGATCACAAATTCGATCAGAGCTCAATATGGCGGACCTTCTCAA
AACTCCTTCACGTACTCCAAACCGTATACCAAGAGGATCGACAACTTGATAATGCCTATGGGATATCAGCCTCCAAAGTTCCAACAATTTGACGAAAAGGGCAAT
CCTAAGCAACATATTGCTCATTTCGTTGAGACATGTGAAAATGCTGGCACTCGAGGAGATCTACTCGTCAAAGAATTCGTTCGAAAACTAAAAGGTAATGCTTTT
GACTGGTATACAGATTTAGAGCCTGAATCCATCAACGGTTGGGAGCAATTGGAAAGAGAGTTCTTAAATCGCTTCTACAGCACAAGTCTCGACTGCAAAGATCGA
CTAATGGAATTATCTGCTATCGAGTTGTGCATTCAAGGTATGCATTGGGGACTTCTCTATATCCTTCAAGGAATAAAACCCCGCACGTTTGAAGAATTAGCAACC
CGCGGCCACGACATGGAGCTAAGTATTGCTAATCGAGGAGATAAAGATCTTTTAGTCCCTAACTTAAAGAGAATAAGCAAGAATACTGAGATGACTCTTGAAGAA
TCAATGGTTATCAACACAACTTCTCTCAAGTCTTCTTCAAAAAGAAAGGATAAGAAGGTCGAAAAACGACAAGAGAATGAAAGGCGTCGTCTAACTCTAAAGGAT
AGACAAGAAAAAGTTTATCCTTTTCCTGATTCTGATATCTCTGACATGTTAGAACAACTACTAAAAATGCAATTGATCGAACTCCCTGAATGCAAACGACCAGAA
GACATGGGAAAAGTAAACGATCCAAACTACTGCAAATACCATAGAGTTGTTGGCCATCCAGTGGGGAAATGCTTCGTCTTAAAGGAACTAATTCTAAAGTTAGCT
CAAGAAGGAAAAATCGAGCTGAACATTGATGAATTAGCTCAAGCAAATCATATTGGAGTAGCGACAAACACATGCAATCAAATATGCTCAAAGACATTTCATGGT
CAAAATGTGGAAGAACTTGCAACTACACATTACATAAATGTGAAAGAAGTCGACGATTCAAAAGAAATCAAACAAAAGACTTCTGTCTTTGATCGCATCAAGCCT
TCAACTACTCGAGTTTCAGTCTTCCAAAGAATGAGTATGGCCGCGATGAAAGAAGAAAATCAATGTTTGACTCGACCTTCAGTCTTCCAAAGGTTAAGTGTCTCC
ACATCAAGAAAAAATCAAACTTCAACATATACTTTGGATTGTCTTAAAGTTGGTGCAAATGATCGAAACAAAGGAAGAATGAAGATCTTGGAGATGAAAATATTT
GATGGCAAAAGGATTCACAACCTTGCTTCACGTATGAAGAGAAAATTATCTCTTGTCATAAAGCCTAAACTTCACGGTACTCCTAGCCCACATGAGTTTAAAAGA
AATTGCACCCGCTTGATCATTGGCGGAGAAGCTGCACCCATTGGATCATTGGCAAAGAAGTTGCACCTGTTTGATCGTTGGCGCTGCACCCGTTGGATTATTGGC
GAAGACGTTGCACTTGCTCGATCCTTGGCGAAGAAATTGCACTCGCTCGATCCTTGGCAAAGAAATTGCACTCGCTCAATCGTCGGCGAAGAGGCTGCACTCGCT
CGATCCTTTGGCGAAGAAGTTACACCCGCTCGATCATTGGTAGAGAAGCTGCACCCATTGGATCATTGGCAAAGAAGTTGCACCCGTTTGATCGTTGGCGCTGCA
CCCGTTGGATCATTGGCGAAGACGTTGCACTCGCTCGATCGTTGGCGAAGAAGTTGCACCCACTCGATTGTTGGCGAAGACGTTGCACCCCCTCGATCACAAGCT
GCACCCATTGGATCGTTGGCGAAGAGGTTGCACCCGCTTGATCGTTGGCGAAGAAGTTGCACCCGCTCGATCGTTGGTGAAGATGAAATGACTGCAAGGAAGGTG
ACCACCGCTGACCTAGGAACACCTCAACCAGCGGAAGAGATGTTAACCTACTCAGGAATGAAATCACTGCAAGGAAGGGAAGGTGACCACTGTGACCTAGGAACA
CTTCAACCAGTGGAAAAAAAAGCGTTAACCTACTCAGCAGTGAAATCACTGCAAGGAAGGGAAGGTGACCACTGTGACCTAGGAACACCTCAACTAGCAGAAGAG
GCATTAACCTACTCAACAATGAAATCACTACAAGGAAGATTCCATCAAACTGCAAAGTCTTCATCAAAACTGACTGAAGAACTTCAACAAGCAGTGCAAAGCCGA
AGAACTTCACAAGCTACAAAGCCGAAGAACTTCAACAAGCTGCAAAGTCGAAGCACTCCACAAGCTGCAAAGTCGAAACACTTCACAAGCTGCAAAGTCAAAGAA
CTTCACCAGCGCGAAGCCGAAGACTGTATAAAGTCTTCATCAAGCTACAAGCCAAAGAACTTTATCAACAAGTCAAAGTCTTCAAACAAACTCAAAGTTGAAGAC
TTCGAAGATCCTTCGTCAAGCGAGAAGCTGAAGGCATCAACCATTCAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCATCCAAGGAAGCTCCATCCACCATGGAAGAAAAGTTGGTAAAAATGGACAAGAAACTTAACATGCTGACAAAAATGGTCGAAAAGAGGGACTATAAGATT
GCTTCCCTTATGAACCAGATCGAAATTCAAGATGTTGCAAATGTCGCTGAGTCAAGCCAAAATCATGTTGTGAAAGTCAGCGACAAAAGAAAGAACATCATGCAG
GAAAAACAACATGAACATTCTGTTTCAATTGCTTCACTGTCTGTCCAGCAGCTCCAAGATATGATCACAAATTCGATCAGAGCTCAATATGGCGGACCTTCTCAA
AACTCCTTCACGTACTCCAAACCGTATACCAAGAGGATCGACAACTTGATAATGCCTATGGGATATCAGCCTCCAAAGTTCCAACAATTTGACGAAAAGGGCAAT
CCTAAGCAACATATTGCTCATTTCGTTGAGACATGTGAAAATGCTGGCACTCGAGGAGATCTACTCGTCAAAGAATTCGTTCGAAAACTAAAAGGTAATGCTTTT
GACTGGTATACAGATTTAGAGCCTGAATCCATCAACGGTTGGGAGCAATTGGAAAGAGAGTTCTTAAATCGCTTCTACAGCACAAGTCTCGACTGCAAAGATCGA
CTAATGGAATTATCTGCTATCGAGTTGTGCATTCAAGGTATGCATTGGGGACTTCTCTATATCCTTCAAGGAATAAAACCCCGCACGTTTGAAGAATTAGCAACC
CGCGGCCACGACATGGAGCTAAGTATTGCTAATCGAGGAGATAAAGATCTTTTAGTCCCTAACTTAAAGAGAATAAGCAAGAATACTGAGATGACTCTTGAAGAA
TCAATGGTTATCAACACAACTTCTCTCAAGTCTTCTTCAAAAAGAAAGGATAAGAAGGTCGAAAAACGACAAGAGAATGAAAGGCGTCGTCTAACTCTAAAGGAT
AGACAAGAAAAAGTTTATCCTTTTCCTGATTCTGATATCTCTGACATGTTAGAACAACTACTAAAAATGCAATTGATCGAACTCCCTGAATGCAAACGACCAGAA
GACATGGGAAAAGTAAACGATCCAAACTACTGCAAATACCATAGAGTTGTTGGCCATCCAGTGGGGAAATGCTTCGTCTTAAAGGAACTAATTCTAAAGTTAGCT
CAAGAAGGAAAAATCGAGCTGAACATTGATGAATTAGCTCAAGCAAATCATATTGGAGTAGCGACAAACACATGCAATCAAATATGCTCAAAGACATTTCATGGT
CAAAATGTGGAAGAACTTGCAACTACACATTACATAAATGTGAAAGAAGTCGACGATTCAAAAGAAATCAAACAAAAGACTTCTGTCTTTGATCGCATCAAGCCT
TCAACTACTCGAGTTTCAGTCTTCCAAAGAATGAGTATGGCCGCGATGAAAGAAGAAAATCAATGTTTGACTCGACCTTCAGTCTTCCAAAGGTTAAGTGTCTCC
ACATCAAGAAAAAATCAAACTTCAACATATACTTTGGATTGTCTTAAAGTTGGTGCAAATGATCGAAACAAAGGAAGAATGAAGATCTTGGAGATGAAAATATTT
GATGGCAAAAGGATTCACAACCTTGCTTCACGTATGAAGAGAAAATTATCTCTTGTCATAAAGCCTAAACTTCACGGTACTCCTAGCCCACATGAGTTTAAAAGA
AATTGCACCCGCTTGATCATTGGCGGAGAAGCTGCACCCATTGGATCATTGGCAAAGAAGTTGCACCTGTTTGATCGTTGGCGCTGCACCCGTTGGATTATTGGC
GAAGACGTTGCACTTGCTCGATCCTTGGCGAAGAAATTGCACTCGCTCGATCCTTGGCAAAGAAATTGCACTCGCTCAATCGTCGGCGAAGAGGCTGCACTCGCT
CGATCCTTTGGCGAAGAAGTTACACCCGCTCGATCATTGGTAGAGAAGCTGCACCCATTGGATCATTGGCAAAGAAGTTGCACCCGTTTGATCGTTGGCGCTGCA
CCCGTTGGATCATTGGCGAAGACGTTGCACTCGCTCGATCGTTGGCGAAGAAGTTGCACCCACTCGATTGTTGGCGAAGACGTTGCACCCCCTCGATCACAAGCT
GCACCCATTGGATCGTTGGCGAAGAGGTTGCACCCGCTTGATCGTTGGCGAAGAAGTTGCACCCGCTCGATCGTTGGTGAAGATGAAATGACTGCAAGGAAGGTG
ACCACCGCTGACCTAGGAACACCTCAACCAGCGGAAGAGATGTTAACCTACTCAGGAATGAAATCACTGCAAGGAAGGGAAGGTGACCACTGTGACCTAGGAACA
CTTCAACCAGTGGAAAAAAAAGCGTTAACCTACTCAGCAGTGAAATCACTGCAAGGAAGGGAAGGTGACCACTGTGACCTAGGAACACCTCAACTAGCAGAAGAG
GCATTAACCTACTCAACAATGAAATCACTACAAGGAAGATTCCATCAAACTGCAAAGTCTTCATCAAAACTGACTGAAGAACTTCAACAAGCAGTGCAAAGCCGA
AGAACTTCACAAGCTACAAAGCCGAAGAACTTCAACAAGCTGCAAAGTCGAAGCACTCCACAAGCTGCAAAGTCGAAACACTTCACAAGCTGCAAAGTCAAAGAA
CTTCACCAGCGCGAAGCCGAAGACTGTATAAAGTCTTCATCAAGCTACAAGCCAAAGAACTTTATCAACAAGTCAAAGTCTTCAAACAAACTCAAAGTTGAAGAC
TTCGAAGATCCTTCGTCAAGCGAGAAGCTGAAGGCATCAACCATTCAATAA
Protein sequenceShow/hide protein sequence
MASKEAPSTMEEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVKVSDKRKNIMQEKQHEHSVSIASLSVQQLQDMITNSIRAQYGGPSQ
NSFTYSKPYTKRIDNLIMPMGYQPPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDWYTDLEPESINGWEQLEREFLNRFYSTSLDCKDR
LMELSAIELCIQGMHWGLLYILQGIKPRTFEELATRGHDMELSIANRGDKDLLVPNLKRISKNTEMTLEESMVINTTSLKSSSKRKDKKVEKRQENERRRLTLKD
RQEKVYPFPDSDISDMLEQLLKMQLIELPECKRPEDMGKVNDPNYCKYHRVVGHPVGKCFVLKELILKLAQEGKIELNIDELAQANHIGVATNTCNQICSKTFHG
QNVEELATTHYINVKEVDDSKEIKQKTSVFDRIKPSTTRVSVFQRMSMAAMKEENQCLTRPSVFQRLSVSTSRKNQTSTYTLDCLKVGANDRNKGRMKILEMKIF
DGKRIHNLASRMKRKLSLVIKPKLHGTPSPHEFKRNCTRLIIGGEAAPIGSLAKKLHLFDRWRCTRWIIGEDVALARSLAKKLHSLDPWQRNCTRSIVGEEAALA
RSFGEEVTPARSLVEKLHPLDHWQRSCTRLIVGAAPVGSLAKTLHSLDRWRRSCTHSIVGEDVAPPRSQAAPIGSLAKRLHPLDRWRRSCTRSIVGEDEMTARKV
TTADLGTPQPAEEMLTYSGMKSLQGREGDHCDLGTLQPVEKKALTYSAVKSLQGREGDHCDLGTPQLAEEALTYSTMKSLQGRFHQTAKSSSKLTEELQQAVQSR
RTSQATKPKNFNKLQSRSTPQAAKSKHFTSCKVKELHQREAEDCIKSSSSYKPKNFINKSKSSNKLKVEDFEDPSSSEKLKASTIQ