; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc11g09150 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc11g09150
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr11:6507266..6510975
RNA-Seq ExpressionMoc11g09150
SyntenyMoc11g09150
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR025314 - Domain of unknown function DUF4219
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVX02815.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.9e-14051.15Show/hide
Query:  EGSNNLSSLAPPVFDGENYQAWAIKIQAYMEGCDYWEAIEQDYEIAPLPDNPTMHQIKTHKERVTRKAKARACLYAAVSPAIFNRIMALKSAKEIWEFLK
        E     S +APP+FDG+NYQAWA+++  ++E  D WEAIE+DY++ PLP NPTM Q+KTHKER TRK+KA+A L++ VS  IF RIM L+SAK+IW++LK
Subjt:  EGSNNLSSLAPPVFDGENYQAWAIKIQAYMEGCDYWEAIEQDYEIAPLPDNPTMHQIKTHKERVTRKAKARACLYAAVSPAIFNRIMALKSAKEIWEFLK

Query:  SEYEGDERIKGMKVLNLVREFERMQMKDSESIKEYSDKLIGIANKARALGTDLSDNRLVQKILVSVPERYGTTIASLENSKDLTKLKVIEVVSALQAQEQ
         EY+G+ER K MKVLNL+REFE ++MK++E+IK+YSDKL+GI NK R LG D SD R+VQKILV++PE+Y + I+SLE SKDL+ + + E+V+ALQAQEQ
Subjt:  SEYEGDERIKGMKVLNLVREFERMQMKDSESIKEYSDKLIGIANKARALGTDLSDNRLVQKILVSVPERYGTTIASLENSKDLTKLKVIEVVSALQAQEQ

Query:  RRLMRQEGSIEGALKARMQQ---GEYEREKKWKKGSGSNSSESVVKDAISACKHCGKHNHPHFRCWRRPDVKCRRCHLLGHIERFCKEAKTQQQGGAHAA
        RR++R+E S+EGAL+A+ +    G+ ++    KK + ++++          C HC K NHP  +CW RPDVKCR+C  +GH+ER C   K+QQ   A A+
Subjt:  RRLMRQEGSIEGALKARMQQ---GEYEREKKWKKGSGSNSSESVVKDAISACKHCGKHNHPHFRCWRRPDVKCRRCHLLGHIERFCKEAKTQQQGGAHAA

Query:  VQQ-EEDQLFVATCF-SSVTQCDSWLVDSGCTNHMTSDKELFKDLDRSFKSRVKIRNGEYLEVKRKGTVSIESCVGTKLITEVLFVPGIAQNLLSVGQL-
         +Q +E+QLFVATCF +S +  DSWL+DSGCTNHMT+D+ELFK+LD++  S+VKI NGE++ VK KGTV+IES  G K IT+VL+VP I QNLLSVGQL 
Subjt:  VQQ-EEDQLFVATCF-SSVTQCDSWLVDSGCTNHMTSDKELFKDLDRSFKSRVKIRNGEYLEVKRKGTVSIESCVGTKLITEVLFVPGIAQNLLSVGQL-

Query:  ---------------------------MRHKSFSLDPLEKEQIAFKCQVNDTELRHKRLRHFHQKGLQYRQENEMTIGVPILEELKTNCGVC--------
                                   MR KSF+L+ +E EQIAF   V++ EL H+RL HFH  GL Y Q+  +  GVP+LE+   +C  C        
Subjt:  ---------------------------MRHKSFSLDPLEKEQIAFKCQVNDTELRHKRLRHFHQKGLQYRQENEMTIGVPILEELKTNCGVC--------

Query:  -LQKSIWKEAEKLQLSHTELCG
           ++ W+   KLQL HT++ G
Subjt:  -LQKSIWKEAEKLQLSHTELCG

XP_003613757.4 uncharacterized protein LOC11413243 [Medicago truncatula]8.9e-14350.84Show/hide
Query:  EGSNNLSSLAPPVFDGENYQAWAIKIQAYMEGCDYWEAIEQDYEIAPLPDNPTMHQIKTHKERVTRKAKARACLYAAVSPAIFNRIMALKSAKEIWEFLK
        +  ++ S +APP+FDG NY  WA+K++AY+E  D WEAIE+DYE+ PLP+NPTM Q+K HKE+ T+KAKA++CL+A VS  +F RIM LK+ K IW++LK
Subjt:  EGSNNLSSLAPPVFDGENYQAWAIKIQAYMEGCDYWEAIEQDYEIAPLPDNPTMHQIKTHKERVTRKAKARACLYAAVSPAIFNRIMALKSAKEIWEFLK

Query:  SEYEGDERIKGMKVLNLVREFERMQMKDSESIKEYSDKLIGIANKARALGTDLSDNRLVQKILVSVPERYGTTIASLENSKDLTKLKVIEVVSALQAQEQ
         EY GDERI+ M+VLNL+REFE  +MK+SE+IKEYSDKL+ IANK R LGT  +D+R+V+KILV+VPERY  ++ASLEN+KDL K+ + EV+ ALQAQEQ
Subjt:  SEYEGDERIKGMKVLNLVREFERMQMKDSESIKEYSDKLIGIANKARALGTDLSDNRLVQKILVSVPERYGTTIASLENSKDLTKLKVIEVVSALQAQEQ

Query:  RRLMRQEGSIEGALKARMQQGEYEREKKWKK-----------GSGSNSSESVV--KDAISACKHCGKHNHPHFRCWRRPDVKCRRCHLLGHIERFCKEAK
        RR MRQEG +EGAL  + Q    +REK WKK            + SN+   +V  K     C+HC K  HP FRCWRRP+ KC +C+ +GH    C+   
Subjt:  RRLMRQEGSIEGALKARMQQGEYEREKKWKK-----------GSGSNSSESVV--KDAISACKHCGKHNHPHFRCWRRPDVKCRRCHLLGHIERFCKEAK

Query:  TQQQGGAHAAVQQEEDQ---LFVATCFSSVTQCDSWLVDSGCTNHMTSDKELFKDLDRSFKSRVKIRNGEYLEVKRKGTVSIESCVGTKLITEVLFVPGI
         +Q+  A  A Q+EED+   LFVATCFS     DSWL+DSGCTNHMT DKE+FK+L  S  S+V+I NG+ + VK KGT++I SC GTKLI++VL+VP I
Subjt:  TQQQGGAHAAVQQEEDQ---LFVATCFSSVTQCDSWLVDSGCTNHMTSDKELFKDLDRSFKSRVKIRNGEYLEVKRKGTVSIESCVGTKLITEVLFVPGI

Query:  AQNLLSVGQL----------------------------MRHKSFSLDPLEKEQIAFKCQVNDTELRHKRLRHFHQKGLQYRQENEMTIGVPILEELKTNC
         QNLLSVGQL                            MR KSF+L+PLE++Q AF  + + TE+ HKRL H+H +GL   Q  ++   +P+LE+   +C
Subjt:  AQNLLSVGQL----------------------------MRHKSFSLDPLEKEQIAFKCQVNDTELRHKRLRHFHQKGLQYRQENEMTIGVPILEELKTNC

Query:  GVC---------LQKSIWKEAEKLQLSHTELCG
          C           KS W+  +KLQL HT+LCG
Subjt:  GVC---------LQKSIWKEAEKLQLSHTELCG

XP_022148138.1 uncharacterized protein LOC111016891 [Momordica charantia]1.4e-17281.44Show/hide
Query:  MEGCDYWEAIEQDYEIAPLPDNPTMHQIKTHKERVTRKAKARACLYAAVSPAIFNRIMALKSAKEIWEFLKSEYEGDERIKGMKVLNLVREFERMQMKDS
        MEGCDYWEAI++DYEIAP+PDNPTMH+I THKERVTRK KA ACL AAVSPAIFNRIMALKSAKEIWEFLK+EYEG+ERIKGMKVLNLVREFE MQMKDS
Subjt:  MEGCDYWEAIEQDYEIAPLPDNPTMHQIKTHKERVTRKAKARACLYAAVSPAIFNRIMALKSAKEIWEFLKSEYEGDERIKGMKVLNLVREFERMQMKDS

Query:  ESIKEYSDKLIGIANKARALGTDLSDNRLVQKILVSVPERYGTTIASLENSKDLTKLKVIEVVSALQAQEQRRLMRQEGSIEGALKARMQQGEYEREKKW
        ESIKEYSDKLIGIANKARALGTDLS NRLVQKILVSVPERY  TIASLEN+KDL+KLKVIEVVS LQAQEQRRL+ QEGS+EGALKARMQ GE  RE KW
Subjt:  ESIKEYSDKLIGIANKARALGTDLSDNRLVQKILVSVPERYGTTIASLENSKDLTKLKVIEVVSALQAQEQRRLMRQEGSIEGALKARMQQGEYEREKKW

Query:  --KKGSGSNSSESVVKDAISACKHCGKHNHPHFRCWRRPDVKCRRCHLLGHIERFCKEAKTQQQGGAHAAVQQEEDQLFVATCFSSVTQCDSWLVDSGCT
          KK SGS+SSE   KD  SACKHCGKHNHPHFRCWRRP VKCR C+LLGHIERF K AKTQQQGGAHAAVQQEEDQLFVATCFSS  QCD WLVDSGCT
Subjt:  --KKGSGSNSSESVVKDAISACKHCGKHNHPHFRCWRRPDVKCRRCHLLGHIERFCKEAKTQQQGGAHAAVQQEEDQLFVATCFSSVTQCDSWLVDSGCT

Query:  NHMTSDKELFKDLDRSFKSRVKIRNGEYLEVKRKGTVSIESCVGTKLITEVLFVPGIAQNLLSVGQLMRHKSFSLDPLEKEQIAFKCQVNDT---ELRHK
        NHMTSDKELFKDLD+SFKSRVKI NGEYLEVK KGTVSIESCVGTKLI EVLFVP I QNLLSVGQL+  K F +  L +++   KC ++D+   EL   
Subjt:  NHMTSDKELFKDLDRSFKSRVKIRNGEYLEVKRKGTVSIESCVGTKLITEVLFVPGIAQNLLSVGQLMRHKSFSLDPLEKEQIAFKCQVNDT---ELRHK

Query:  RLRH
        +++H
Subjt:  RLRH

XP_022158688.1 uncharacterized protein LOC111025149 [Momordica charantia]3.7e-19776.91Show/hide
Query:  GSNNLSSLAPPVFDGENYQAWAIKIQAYMEGCDYWEAIEQDYEIAPLPDNPTMHQIKTHKERVTRKAKARACLYAAVSPAIFNRIMALKSAKEIWEFLKS
        GSNNLSSLAPPVFDGENYQ WAI+IQAYMEGCDYWEAIEQDYEIAPLPDNPTMHQIKTHKERVTRKAKARACLYAAVSPAIFNRIMALKSAKEIWEFLKS
Subjt:  GSNNLSSLAPPVFDGENYQAWAIKIQAYMEGCDYWEAIEQDYEIAPLPDNPTMHQIKTHKERVTRKAKARACLYAAVSPAIFNRIMALKSAKEIWEFLKS

Query:  EYEGDERIKGMKVLNLVREFERMQMKDSESIKEYSDKLIGIANKARALGTDLSDNRLVQKILVSVPERYGTTIASLENSKDLTKLKVIEVVSALQAQEQR
        EYEGDERIKGMKVLNLVREFER+QMKDSESIKEYSDKLIGIANKARALG DLS+NRLVQKI+VSVPERY  TIASLENSKDLTKLKVIEVVSALQAQ QR
Subjt:  EYEGDERIKGMKVLNLVREFERMQMKDSESIKEYSDKLIGIANKARALGTDLSDNRLVQKILVSVPERYGTTIASLENSKDLTKLKVIEVVSALQAQEQR

Query:  RLMRQEGSIEGALKARMQQGEYEREKKWKKGSGSNSSESVVKDAISACKHCGKHNHPHFRCWRRPDVKCRRCHLLGHIERFCKEAKTQQQGGAHAAVQQE
                                                                   RCWRRPDVK RRCHLLGHIERFCKEAKTQQQGGAHAAVQQE
Subjt:  RLMRQEGSIEGALKARMQQGEYEREKKWKKGSGSNSSESVVKDAISACKHCGKHNHPHFRCWRRPDVKCRRCHLLGHIERFCKEAKTQQQGGAHAAVQQE

Query:  EDQLFVATCFSSVTQCDSWLVDSGCTNHMTSDKELFKDLDRSFKSRVKIRNGEYLEVKRKGTVSIESCVGTKLITEVLFVPGIAQNLLSVGQL-------
        EDQLFVATCFSSVTQCDSWLVDSGCTN M SDKELFKDLDRSFKSRVKI NGEYLEVK K T SIESCVGTKLITEVLFVP IAQNLLSVGQL       
Subjt:  EDQLFVATCFSSVTQCDSWLVDSGCTNHMTSDKELFKDLDRSFKSRVKIRNGEYLEVKRKGTVSIESCVGTKLITEVLFVPGIAQNLLSVGQL-------

Query:  ---------------------MRHKSFSLDPLEKEQIAFKCQVNDTELRHKRLRHFHQKGLQYRQENEMTIGVPILEELKTNCGV
                             M+HKSFSLDP+EK+QI  KCQVNDTELRHKRL HFHQ+GLQY+QENEMTIGVPILEELKTNCGV
Subjt:  ---------------------MRHKSFSLDPLEKEQIAFKCQVNDTELRHKRLRHFHQKGLQYRQENEMTIGVPILEELKTNCGV

XP_038889190.1 uncharacterized protein LOC120079069 [Benincasa hispida]1.9e-14867.73Show/hide
Query:  QEGSNNLSSLAPPVFDGENYQAWAIKIQAYMEGCDYWEAIEQDYEIAPLPDNPTMHQIKTHKERVTRKAKARACLYAAVSPAIFNRIMALKSAKEIWEFL
        + GSN+LSSL P VFDGENYQAWAI++QAYME CDYWE IEQDYEIAPLPDNPT++QIKTHKERVTRK KAR CLY AVSPAIFNRIMALKS KEIWEFL
Subjt:  QEGSNNLSSLAPPVFDGENYQAWAIKIQAYMEGCDYWEAIEQDYEIAPLPDNPTMHQIKTHKERVTRKAKARACLYAAVSPAIFNRIMALKSAKEIWEFL

Query:  KSEYEGDERIKGMKVLNLVREFERMQMKDSESIKEYSDKLIGIANKARALGTDLSDNRLVQKILVSVPERYGTTIASLENSKDLTKLKVIEVVSALQAQE
        KSEYEGDERIKGMKVLNLVREFERMQMKD +SIKEYSDKLIGIANKARALG DLSDNRL                             VIEVVSALQAQE
Subjt:  KSEYEGDERIKGMKVLNLVREFERMQMKDSESIKEYSDKLIGIANKARALGTDLSDNRLVQKILVSVPERYGTTIASLENSKDLTKLKVIEVVSALQAQE

Query:  QRRLMRQEGSIEGALKARMQQGEYEREKKWKKGSGSNSSESVVKDAISACKHCGKHNHPHFRCWRRPDVKCRRCHLLGHIERFCKEAKTQQQGGAHAAVQ
        Q RL+RQEGSIEGALKAR+ QGE  RE   KKGSGS+SSES +KDA S CKHCGK NHPHFRCWRRP+VKCR CHLLGHIER                  
Subjt:  QRRLMRQEGSIEGALKARMQQGEYEREKKWKKGSGSNSSESVVKDAISACKHCGKHNHPHFRCWRRPDVKCRRCHLLGHIERFCKEAKTQQQGGAHAAVQ

Query:  QEEDQLFVATCFSSVTQCDSWLVDSGCTNHMTSDKELFKDLDRSFKSRVKIRNGEYLEVKRKGTVSIESCVGTKLITEVLFVPGIAQNLLSVGQL-----
                       TQCD WLVDSGCTNHMT+DKELFKD+D+SFK RVKI NGEYLEVK K TVSIESC G KLIT+VLFVP I QNLLSVGQL     
Subjt:  QEEDQLFVATCFSSVTQCDSWLVDSGCTNHMTSDKELFKDLDRSFKSRVKIRNGEYLEVKRKGTVSIESCVGTKLITEVLFVPGIAQNLLSVGQL-----

Query:  -----------------------MRHKSFSLDPLEKE
                               M+HK FSLD LEKE
Subjt:  -----------------------MRHKSFSLDPLEKE

TrEMBL top hitse value%identityAlignment
A0A438CDR2 Retrovirus-related Pol polyprotein from transposon RE11.0e-13950.77Show/hide
Query:  EGSNNLSSLAPPVFDGENYQAWAIKIQAYMEGCDYWEAIEQDYEIAPLPDNPTMHQIKTHKERVTRKAKARACLYAAVSPAIFNRIMALKSAKEIWEFLK
        E     S +APP+FDG+NYQAWA+++  ++E  D WEAIE+DY++ PLP NPTM Q+KTHKE+ TRK+KA+A L++ VS  IF RIM L+ AK+IW++LK
Subjt:  EGSNNLSSLAPPVFDGENYQAWAIKIQAYMEGCDYWEAIEQDYEIAPLPDNPTMHQIKTHKERVTRKAKARACLYAAVSPAIFNRIMALKSAKEIWEFLK

Query:  SEYEGDERIKGMKVLNLVREFERMQMKDSESIKEYSDKLIGIANKARALGTDLSDNRLVQKILVSVPERYGTTIASLENSKDLTKLKVIEVVSALQAQEQ
         EY+G+ER K MKVLNL+REFE ++MK++E+IK+YSDKL+GI NK R LG D SD R+VQKILV++PE+Y + I+SLE SKDL+ + + E+V+ALQAQEQ
Subjt:  SEYEGDERIKGMKVLNLVREFERMQMKDSESIKEYSDKLIGIANKARALGTDLSDNRLVQKILVSVPERYGTTIASLENSKDLTKLKVIEVVSALQAQEQ

Query:  RRLMRQEGSIEGALKARMQQ---GEYEREKKWKKGSGSNSSESVVKDAISACKHCGKHNHPHFRCWRRPDVKCRRCHLLGHIERFCKEAKTQQQGGAHAA
        RR++R+E S+EGAL+A+ +    G+ ++    KK + ++++          C HC K NHP  +CW RPDVKCR+C  +GH+ER C   K+QQ   A A+
Subjt:  RRLMRQEGSIEGALKARMQQ---GEYEREKKWKKGSGSNSSESVVKDAISACKHCGKHNHPHFRCWRRPDVKCRRCHLLGHIERFCKEAKTQQQGGAHAA

Query:  VQQ-EEDQLFVATCF-SSVTQCDSWLVDSGCTNHMTSDKELFKDLDRSFKSRVKIRNGEYLEVKRKGTVSIESCVGTKLITEVLFVPGIAQNLLSVGQL-
         +Q +E+QLFVATCF +S +  DSWL+DSGCTNHMT+D+ELFK+LD++  S+VKI NGE++ VK KGTV+IES  G K IT+VL+VP I QNLLSVGQL 
Subjt:  VQQ-EEDQLFVATCF-SSVTQCDSWLVDSGCTNHMTSDKELFKDLDRSFKSRVKIRNGEYLEVKRKGTVSIESCVGTKLITEVLFVPGIAQNLLSVGQL-

Query:  ---------------------------MRHKSFSLDPLEKEQIAFKCQVNDTELRHKRLRHFHQKGLQYRQENEMTIGVPILEELKTNCGVC--------
                                   MR KSF+L+ +E EQIAF   V++ EL H+RL HFH  GL Y Q+  +  GVP+LE+   +C  C        
Subjt:  ---------------------------MRHKSFSLDPLEKEQIAFKCQVNDTELRHKRLRHFHQKGLQYRQENEMTIGVPILEELKTNCGVC--------

Query:  -LQKSIWKEAEKLQLSHTELCG
           ++ W+   KLQL HT++ G
Subjt:  -LQKSIWKEAEKLQLSHTELCG

A0A438J1J0 Retrovirus-related Pol polyprotein from transposon TNT 1-949.0e-14151.15Show/hide
Query:  EGSNNLSSLAPPVFDGENYQAWAIKIQAYMEGCDYWEAIEQDYEIAPLPDNPTMHQIKTHKERVTRKAKARACLYAAVSPAIFNRIMALKSAKEIWEFLK
        E     S +APP+FDG+NYQAWA+++  ++E  D WEAIE+DY++ PLP NPTM Q+KTHKER TRK+KA+A L++ VS  IF RIM L+SAK+IW++LK
Subjt:  EGSNNLSSLAPPVFDGENYQAWAIKIQAYMEGCDYWEAIEQDYEIAPLPDNPTMHQIKTHKERVTRKAKARACLYAAVSPAIFNRIMALKSAKEIWEFLK

Query:  SEYEGDERIKGMKVLNLVREFERMQMKDSESIKEYSDKLIGIANKARALGTDLSDNRLVQKILVSVPERYGTTIASLENSKDLTKLKVIEVVSALQAQEQ
         EY+G+ER K MKVLNL+REFE ++MK++E+IK+YSDKL+GI NK R LG D SD R+VQKILV++PE+Y + I+SLE SKDL+ + + E+V+ALQAQEQ
Subjt:  SEYEGDERIKGMKVLNLVREFERMQMKDSESIKEYSDKLIGIANKARALGTDLSDNRLVQKILVSVPERYGTTIASLENSKDLTKLKVIEVVSALQAQEQ

Query:  RRLMRQEGSIEGALKARMQQ---GEYEREKKWKKGSGSNSSESVVKDAISACKHCGKHNHPHFRCWRRPDVKCRRCHLLGHIERFCKEAKTQQQGGAHAA
        RR++R+E S+EGAL+A+ +    G+ ++    KK + ++++          C HC K NHP  +CW RPDVKCR+C  +GH+ER C   K+QQ   A A+
Subjt:  RRLMRQEGSIEGALKARMQQ---GEYEREKKWKKGSGSNSSESVVKDAISACKHCGKHNHPHFRCWRRPDVKCRRCHLLGHIERFCKEAKTQQQGGAHAA

Query:  VQQ-EEDQLFVATCF-SSVTQCDSWLVDSGCTNHMTSDKELFKDLDRSFKSRVKIRNGEYLEVKRKGTVSIESCVGTKLITEVLFVPGIAQNLLSVGQL-
         +Q +E+QLFVATCF +S +  DSWL+DSGCTNHMT+D+ELFK+LD++  S+VKI NGE++ VK KGTV+IES  G K IT+VL+VP I QNLLSVGQL 
Subjt:  VQQ-EEDQLFVATCF-SSVTQCDSWLVDSGCTNHMTSDKELFKDLDRSFKSRVKIRNGEYLEVKRKGTVSIESCVGTKLITEVLFVPGIAQNLLSVGQL-

Query:  ---------------------------MRHKSFSLDPLEKEQIAFKCQVNDTELRHKRLRHFHQKGLQYRQENEMTIGVPILEELKTNCGVC--------
                                   MR KSF+L+ +E EQIAF   V++ EL H+RL HFH  GL Y Q+  +  GVP+LE+   +C  C        
Subjt:  ---------------------------MRHKSFSLDPLEKEQIAFKCQVNDTELRHKRLRHFHQKGLQYRQENEMTIGVPILEELKTNCGVC--------

Query:  -LQKSIWKEAEKLQLSHTELCG
           ++ W+   KLQL HT++ G
Subjt:  -LQKSIWKEAEKLQLSHTELCG

A0A5J5C8E9 DUF4219 domain-containing protein9.0e-14154.37Show/hide
Query:  EGSNNLSSLAPPVFDGENYQAWAIKIQAYMEGCDYWEAIEQDYEIAPLPDNPTMHQIKTHKERVTRKAKARACLYAAVSPAIFNRIMALKSAKEIWEFLK
        E S + SS+APPVFDGENYQ WA++++ Y++  D WEA+E+DYE+ PLPDNPTM QIK HK+R TRK+KA+ACL+AAVS  IF R+M+LKSAK IW++LK
Subjt:  EGSNNLSSLAPPVFDGENYQAWAIKIQAYMEGCDYWEAIEQDYEIAPLPDNPTMHQIKTHKERVTRKAKARACLYAAVSPAIFNRIMALKSAKEIWEFLK

Query:  SEYEGDERIKGMKVLNLVREFERMQMKDSESIKEYSDKLIGIANKARALGTDLSDNRLVQKILVSVPERYGTTIASLENSKDLTKLKVIEVVSALQAQEQ
        +EYEG+ERIKGM+VLNL+R+FE  +MK+SE+IKEYSDKL  IANK R LG+DL D+R+V+KILV+VPE++  TI +LEN+KDL+K+ + E+++ALQAQEQ
Subjt:  SEYEGDERIKGMKVLNLVREFERMQMKDSESIKEYSDKLIGIANKARALGTDLSDNRLVQKILVSVPERYGTTIASLENSKDLTKLKVIEVVSALQAQEQ

Query:  RRLMRQEGSIEGALKARMQQGEYEREKKWKK----------GSGSNSSESVVKDAISACKHCGKHNHPHFRCWRRPDVKCRRCHLLGHIERFCKEAKTQQ
        RR+MRQ G++EGAL A+ Q+    R+KK KK           S  N +E   K     C+HCG+  HP F+CWRRPD KC +C+ LGH    CK    QQ
Subjt:  RRLMRQEGSIEGALKARMQQGEYEREKKWKK----------GSGSNSSESVVKDAISACKHCGKHNHPHFRCWRRPDVKCRRCHLLGHIERFCKEAKTQQ

Query:  QGGAHAAVQQEEDQLFVATCFSSVTQCDSWLVDSGCTNHMTSDKELFKDLDRSFKSRVKIRNGEYLEVKRKGTVSIESCVGTKLITEVLFVPGIAQNLLS
        +  A  A Q+ EDQLFVATCF+S +  +SWL+DSGCTNHMT DKE+FK+L+ +  ++V+I NG+++ VK KGT++IESC+GTK I++VL+VP I QNLLS
Subjt:  QGGAHAAVQQEEDQLFVATCFSSVTQCDSWLVDSGCTNHMTSDKELFKDLDRSFKSRVKIRNGEYLEVKRKGTVSIESCVGTKLITEVLFVPGIAQNLLS

Query:  VGQL----------------------------MRHKSFSLDPLEKEQIAFKCQVNDTELRHKRLRHFHQKGLQYRQENEM
        VGQL                            MR KSFSLDP+E+ Q AF    + T++ HKRL HFH  G+ Y Q  ++
Subjt:  VGQL----------------------------MRHKSFSLDPLEKEQIAFKCQVNDTELRHKRLRHFHQKGLQYRQENEM

A0A6J1D394 uncharacterized protein LOC1110168916.8e-17381.44Show/hide
Query:  MEGCDYWEAIEQDYEIAPLPDNPTMHQIKTHKERVTRKAKARACLYAAVSPAIFNRIMALKSAKEIWEFLKSEYEGDERIKGMKVLNLVREFERMQMKDS
        MEGCDYWEAI++DYEIAP+PDNPTMH+I THKERVTRK KA ACL AAVSPAIFNRIMALKSAKEIWEFLK+EYEG+ERIKGMKVLNLVREFE MQMKDS
Subjt:  MEGCDYWEAIEQDYEIAPLPDNPTMHQIKTHKERVTRKAKARACLYAAVSPAIFNRIMALKSAKEIWEFLKSEYEGDERIKGMKVLNLVREFERMQMKDS

Query:  ESIKEYSDKLIGIANKARALGTDLSDNRLVQKILVSVPERYGTTIASLENSKDLTKLKVIEVVSALQAQEQRRLMRQEGSIEGALKARMQQGEYEREKKW
        ESIKEYSDKLIGIANKARALGTDLS NRLVQKILVSVPERY  TIASLEN+KDL+KLKVIEVVS LQAQEQRRL+ QEGS+EGALKARMQ GE  RE KW
Subjt:  ESIKEYSDKLIGIANKARALGTDLSDNRLVQKILVSVPERYGTTIASLENSKDLTKLKVIEVVSALQAQEQRRLMRQEGSIEGALKARMQQGEYEREKKW

Query:  --KKGSGSNSSESVVKDAISACKHCGKHNHPHFRCWRRPDVKCRRCHLLGHIERFCKEAKTQQQGGAHAAVQQEEDQLFVATCFSSVTQCDSWLVDSGCT
          KK SGS+SSE   KD  SACKHCGKHNHPHFRCWRRP VKCR C+LLGHIERF K AKTQQQGGAHAAVQQEEDQLFVATCFSS  QCD WLVDSGCT
Subjt:  --KKGSGSNSSESVVKDAISACKHCGKHNHPHFRCWRRPDVKCRRCHLLGHIERFCKEAKTQQQGGAHAAVQQEEDQLFVATCFSSVTQCDSWLVDSGCT

Query:  NHMTSDKELFKDLDRSFKSRVKIRNGEYLEVKRKGTVSIESCVGTKLITEVLFVPGIAQNLLSVGQLMRHKSFSLDPLEKEQIAFKCQVNDT---ELRHK
        NHMTSDKELFKDLD+SFKSRVKI NGEYLEVK KGTVSIESCVGTKLI EVLFVP I QNLLSVGQL+  K F +  L +++   KC ++D+   EL   
Subjt:  NHMTSDKELFKDLDRSFKSRVKIRNGEYLEVKRKGTVSIESCVGTKLITEVLFVPGIAQNLLSVGQLMRHKSFSLDPLEKEQIAFKCQVNDT---ELRHK

Query:  RLRH
        +++H
Subjt:  RLRH

A0A6J1DWT9 uncharacterized protein LOC1110251491.8e-19776.91Show/hide
Query:  GSNNLSSLAPPVFDGENYQAWAIKIQAYMEGCDYWEAIEQDYEIAPLPDNPTMHQIKTHKERVTRKAKARACLYAAVSPAIFNRIMALKSAKEIWEFLKS
        GSNNLSSLAPPVFDGENYQ WAI+IQAYMEGCDYWEAIEQDYEIAPLPDNPTMHQIKTHKERVTRKAKARACLYAAVSPAIFNRIMALKSAKEIWEFLKS
Subjt:  GSNNLSSLAPPVFDGENYQAWAIKIQAYMEGCDYWEAIEQDYEIAPLPDNPTMHQIKTHKERVTRKAKARACLYAAVSPAIFNRIMALKSAKEIWEFLKS

Query:  EYEGDERIKGMKVLNLVREFERMQMKDSESIKEYSDKLIGIANKARALGTDLSDNRLVQKILVSVPERYGTTIASLENSKDLTKLKVIEVVSALQAQEQR
        EYEGDERIKGMKVLNLVREFER+QMKDSESIKEYSDKLIGIANKARALG DLS+NRLVQKI+VSVPERY  TIASLENSKDLTKLKVIEVVSALQAQ QR
Subjt:  EYEGDERIKGMKVLNLVREFERMQMKDSESIKEYSDKLIGIANKARALGTDLSDNRLVQKILVSVPERYGTTIASLENSKDLTKLKVIEVVSALQAQEQR

Query:  RLMRQEGSIEGALKARMQQGEYEREKKWKKGSGSNSSESVVKDAISACKHCGKHNHPHFRCWRRPDVKCRRCHLLGHIERFCKEAKTQQQGGAHAAVQQE
                                                                   RCWRRPDVK RRCHLLGHIERFCKEAKTQQQGGAHAAVQQE
Subjt:  RLMRQEGSIEGALKARMQQGEYEREKKWKKGSGSNSSESVVKDAISACKHCGKHNHPHFRCWRRPDVKCRRCHLLGHIERFCKEAKTQQQGGAHAAVQQE

Query:  EDQLFVATCFSSVTQCDSWLVDSGCTNHMTSDKELFKDLDRSFKSRVKIRNGEYLEVKRKGTVSIESCVGTKLITEVLFVPGIAQNLLSVGQL-------
        EDQLFVATCFSSVTQCDSWLVDSGCTN M SDKELFKDLDRSFKSRVKI NGEYLEVK K T SIESCVGTKLITEVLFVP IAQNLLSVGQL       
Subjt:  EDQLFVATCFSSVTQCDSWLVDSGCTNHMTSDKELFKDLDRSFKSRVKIRNGEYLEVKRKGTVSIESCVGTKLITEVLFVPGIAQNLLSVGQL-------

Query:  ---------------------MRHKSFSLDPLEKEQIAFKCQVNDTELRHKRLRHFHQKGLQYRQENEMTIGVPILEELKTNCGV
                             M+HKSFSLDP+EK+QI  KCQVNDTELRHKRL HFHQ+GLQY+QENEMTIGVPILEELKTNCGV
Subjt:  ---------------------MRHKSFSLDPLEKEQIAFKCQVNDTELRHKRLRHFHQKGLQYRQENEMTIGVPILEELKTNCGV

SwissProt top hitse value%identityAlignment
D3UAG1 UDP-glycosyltransferase 71A161.4e-1031.29Show/hide
Query:  ELVFIPGPGRGDLPSTIEMANILVTRDHRLSITLLIQKLPFDTKLLQHIQSLPVAFANKSIRFIVLPETPFPKETTNNFLLNTLRI-IKSYKPHVREA--
        +LVF+P PG G + ST+EMA  LV RD +L IT+L+ KLP+D        S+     +  I F+ LPE    K+ T     +  R+ ++++K HVR+A  
Subjt:  ELVFIPGPGRGDLPSTIEMANILVTRDHRLSITLLIQKLPFDTKLLQHIQSLPVAFANKSIRFIVLPETPFPKETTNNFLLNTLRI-IKSYKPHVREA--

Query:  -------------------------------------VPSYVFYTSSAPSLAITFDLQELYDQ
                                             VPSYVF+TS++ +LA+    Q L D+
Subjt:  -------------------------------------VPSYVFYTSSAPSLAITFDLQELYDQ

D3UAG1 UDP-glycosyltransferase 71A164.3e-0764.71Show/hide
Query:  ILAHPAMGRFLSHCGWNLTLESVWRGVPMLTFEM
        IL HPA+G F+SHCGWN TLES+W GVP+  + M
Subjt:  ILAHPAMGRFLSHCGWNLTLESVWRGVPMLTFEM

Q2V6K0 UDP-glucose flavonoid 3-O-glucosyltransferase 61.2e-1235.19Show/hide
Query:  ELVFIPGPGRGDLPSTIEMANILVTRDHRLSITLLIQKLPFDTKLLQ-HIQSLPV--AFANKSIRFIVLPETPFPKETTNNFLLNTLRIIKSYKPHVREA
        EL+FIP PG G + ST+E+A +L+ RD  L IT+LI K PF       +I+SL V  +   + IRF+ LP+  F       F       I S+K HV++A
Subjt:  ELVFIPGPGRGDLPSTIEMANILVTRDHRLSITLLIQKLPFDTKLLQ-HIQSLPV--AFANKSIRFIVLPETPFPKETTNNFLLNTLRIIKSYKPHVREA

Query:  V----------------------------------PSYVFYTSSAPSLAITFDLQELYDQNN
        V                                  PSYVFYTS A  L + F LQ L D+ N
Subjt:  V----------------------------------PSYVFYTSSAPSLAITFDLQELYDQNN

Q2V6K0 UDP-glucose flavonoid 3-O-glucosyltransferase 62.5e-0771.88Show/hide
Query:  ILAHPAMGRFLSHCGWNLTLESVWRGVPMLTF
        ILAHPA+G F+SHCGWN TLES+W GVP+ T+
Subjt:  ILAHPAMGRFLSHCGWNLTLESVWRGVPMLTF

Q66PF3 Putative UDP-glucose flavonoid 3-O-glucosyltransferase 31.5e-1236.65Show/hide
Query:  ELVFIPGPGRGDLPSTIEMANILVTRDHRLSITLLIQKLPFDTKLLQ-HIQSLPVAFA--NKSIRFIVLPETPFPKETTNNFLLNTL-RIIKSYKPHVRE
        ELV IP PG G L ST+E+A +LV+RD +L IT+LI   P  +K    ++QSL  + +  ++ I FI LP T    + T   + N+L   ++S +PHV++
Subjt:  ELVFIPGPGRGDLPSTIEMANILVTRDHRLSITLLIQKLPFDTKLLQ-HIQSLPVAFA--NKSIRFIVLPETPFPKETTNNFLLNTL-RIIKSYKPHVRE

Query:  A--------------------------------VPSYVFYTSSAPSLAITFDLQELYDQNN
        A                                VPSYVF+TS A +L + F LQEL DQ N
Subjt:  A--------------------------------VPSYVFYTSSAPSLAITFDLQELYDQNN

Q66PF3 Putative UDP-glucose flavonoid 3-O-glucosyltransferase 31.5e-0761.11Show/hide
Query:  VEILAHPAMGRFLSHCGWNLTLESVWRGVPMLTFEM
        V +LAHP++G F+SHCGWN TLES+W GVP+ T+ +
Subjt:  VEILAHPAMGRFLSHCGWNLTLESVWRGVPMLTFEM

Q6VAB2 UDP-glycosyltransferase 71E12.1e-0929.38Show/hide
Query:  MNKFELVFIPGPGRGDLPSTIEMANILVTRDHRLSITLLIQKLPFDTKLLQHIQSLPVAFANKSIRFIVLP--ETPFPKETTNNFLLNTLRIIKSYKPHV
        M+  ELVFIP PG G LP T+E+A +L+ RD RLS+T+++  L    K   + ++ P      S+RF+ +P  E+     + N F+      ++ +KP V
Subjt:  MNKFELVFIPGPGRGDLPSTIEMANILVTRDHRLSITLLIQKLPFDTKLLQHIQSLPVAFANKSIRFIVLP--ETPFPKETTNNFLLNTLRIIKSYKPHV

Query:  RE--------------------------------AVPSYVFYTSSAPSLAITFDLQELYDQNNSSKAVEVERLQNSD
        R+                                 VPSY ++TS A +L + F LQ   D     +  +   L+NSD
Subjt:  RE--------------------------------AVPSYVFYTSSAPSLAITFDLQELYDQNNSSKAVEVERLQNSD

Q6VAB2 UDP-glycosyltransferase 71E12.8e-0658.82Show/hide
Query:  ILAHPAMGRFLSHCGWNLTLESVWRGVPMLTFEM
        +L+HP++G F+SHCGWN TLES+W GVPM  + +
Subjt:  ILAHPAMGRFLSHCGWNLTLESVWRGVPMLTFEM

Q9LSY5 UDP-glycosyltransferase 71B79.3e-1033.93Show/hide
Query:  KFELVFIPGPGRGDLPSTIEMANILVTRDHRLSITLLIQKLPF----DTKLLQHIQSLPVAFANKSIRFIVLPETPFPKETTNNFLLNTLRI-IKSYKPH
        KFELVFIP PG G L ST+EMA +LV R+ RLSI+++I  LPF    +     +I +L  A +N  +R+ V+     P        + T+ I +K+ +P 
Subjt:  KFELVFIPGPGRGDLPSTIEMANILVTRDHRLSITLLIQKLPF----DTKLLQHIQSLPVAFANKSIRFIVLPETPFPKETTNNFLLNTLRI-IKSYKPH

Query:  VREAV-------------------------------------PSYVFYTSSAPSLAITFDLQELYDQN
        VR  V                                     PSY+FYTSSA  L++T+ +Q L D+N
Subjt:  VREAV-------------------------------------PSYVFYTSSAPSLAITFDLQELYDQN

Q9LSY5 UDP-glycosyltransferase 71B71.8e-0566.67Show/hide
Query:  VEILAHPAMGRFLSHCGWNLTLESVWRGVP
        V +LA+PA+G F++HCGWN TLES+W GVP
Subjt:  VEILAHPAMGRFLSHCGWNLTLESVWRGVP

Arabidopsis top hitse value%identityAlignment
AT1G07250.1 UDP-glucosyl transferase 71C48.1e-0960.47Show/hide
Query:  LCG---AVEILAHPAMGRFLSHCGWNLTLESVWRGVPMLTFEM
        +CG    VE+LAH A+G F+SHCGWN TLES+W GVP+ T+ M
Subjt:  LCG---AVEILAHPAMGRFLSHCGWNLTLESVWRGVPMLTFEM

AT1G07260.1 UDP-glucosyl transferase 71C32.4e-0863.89Show/hide
Query:  VEILAHPAMGRFLSHCGWNLTLESVWRGVPMLTFEM
        VE+LAH A+G F+SHCGWN  LES+W GVP+ T+ M
Subjt:  VEILAHPAMGRFLSHCGWNLTLESVWRGVPMLTFEM

AT2G29710.1 UDP-Glycosyltransferase superfamily protein3.1e-0858.14Show/hide
Query:  LCG---AVEILAHPAMGRFLSHCGWNLTLESVWRGVPMLTFEM
        +CG    VEILAH A+G F+SHCGWN  +ES+W GVP++T+ M
Subjt:  LCG---AVEILAHPAMGRFLSHCGWNLTLESVWRGVPMLTFEM

AT2G29730.1 UDP-glucosyl transferase 71D13.1e-0858.14Show/hide
Query:  LCG---AVEILAHPAMGRFLSHCGWNLTLESVWRGVPMLTFEM
        +CG    VEILAH A+G F+SHCGWN  +ES+W GVP++T+ M
Subjt:  LCG---AVEILAHPAMGRFLSHCGWNLTLESVWRGVPMLTFEM

AT2G29730.1 UDP-glucosyl transferase 71D13.2e-0537.5Show/hide
Query:  MNKFELVFIPGPGRGDLPSTIEMANILVTRDHRLSITLLIQKLPFDTKLLQHIQSLPVAFANKSIRFIVLPE
        M   EL+FIP P  G L   +E A  L+ +D R+ IT+L+ KL   + L  +++S  +A +   +RFI +PE
Subjt:  MNKFELVFIPGPGRGDLPSTIEMANILVTRDHRLSITLLIQKLPFDTKLLQHIQSLPVAFANKSIRFIVLPE

AT3G21790.1 UDP-Glycosyltransferase superfamily protein6.6e-1133.93Show/hide
Query:  KFELVFIPGPGRGDLPSTIEMANILVTRDHRLSITLLIQKLPF----DTKLLQHIQSLPVAFANKSIRFIVLPETPFPKETTNNFLLNTLRI-IKSYKPH
        KFELVFIP PG G L ST+EMA +LV R+ RLSI+++I  LPF    +     +I +L  A +N  +R+ V+     P        + T+ I +K+ +P 
Subjt:  KFELVFIPGPGRGDLPSTIEMANILVTRDHRLSITLLIQKLPF----DTKLLQHIQSLPVAFANKSIRFIVLPETPFPKETTNNFLLNTLRI-IKSYKPH

Query:  VREAV-------------------------------------PSYVFYTSSAPSLAITFDLQELYDQN
        VR  V                                     PSY+FYTSSA  L++T+ +Q L D+N
Subjt:  VREAV-------------------------------------PSYVFYTSSAPSLAITFDLQELYDQN

AT3G21790.1 UDP-Glycosyltransferase superfamily protein1.3e-0666.67Show/hide
Query:  VEILAHPAMGRFLSHCGWNLTLESVWRGVP
        V +LA+PA+G F++HCGWN TLES+W GVP
Subjt:  VEILAHPAMGRFLSHCGWNLTLESVWRGVP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAAGTTTGAGCTGGTTTTCATACCCGGCCCGGGGCGGGGCGACCTCCCATCCACCATTGAAATGGCTAATATTCTCGTCACTCGAGATCATCGTCTCTCCATCAC
ACTGCTCATCCAAAAACTGCCCTTTGACACCAAATTACTCCAACATATCCAATCACTCCCTGTGGCTTTTGCCAATAAATCTATCCGCTTCATCGTCCTCCCTGAAACGC
CCTTTCCCAAGGAAACTACAAACAATTTTTTGTTGAACACACTCCGCATCATCAAAAGCTACAAACCCCACGTTAGAGAGGCCGTTCCCAGTTATGTGTTCTACACATCC
AGTGCTCCCTCTCTAGCCATTACTTTTGATCTTCAAGAGCTTTATGATCAGAATAACAGCAGCAAGGCAGTGGAAGTGGAACGGTTACAGAACTCGGATGATGCCAAGTT
TTGTCAATCCGATTCCCAGGAAGGATCAAACAATCTTTCTTCACTGGCTCCACCTGTGTTTGATGGTGAAAACTATCAAGCATGGGCAATCAAAATACAAGCCTACATGG
AGGGTTGTGATTATTGGGAAGCAATTGAGCAAGATTATGAAATTGCTCCACTTCCTGATAATCCAACAATGCATCAGATCAAGACTCACAAGGAGAGGGTCACCAGGAAG
GCAAAGGCTCGAGCTTGCCTATATGCAGCTGTGTCTCCCGCCATATTCAACAGAATTATGGCATTGAAGTCAGCAAAGGAGATCTGGGAGTTCCTCAAAAGTGAGTATGA
AGGTGATGAGAGGATTAAAGGCATGAAGGTGTTGAACTTGGTAAGGGAATTCGAAAGAATGCAGATGAAGGATTCTGAGTCCATCAAAGAGTACTCAGACAAGTTGATCG
GGATTGCTAACAAGGCAAGAGCATTAGGAACTGATCTATCTGACAATAGATTGGTTCAGAAGATTTTGGTTTCAGTACCTGAGAGATATGGAACAACCATTGCTTCCTTA
GAAAATTCTAAAGACCTCACTAAGCTTAAAGTGATAGAAGTAGTGAGTGCTTTACAAGCACAGGAGCAGAGGAGGTTGATGCGACAAGAAGGAAGCATTGAAGGGGCACT
GAAAGCTAGAATGCAGCAGGGAGAATATGAAAGAGAGAAGAAGTGGAAGAAGGGAAGTGGCAGCAATAGCTCAGAGTCTGTTGTAAAGGATGCTATTAGTGCATGCAAGC
ACTGTGGAAAGCATAATCATCCACACTTTAGATGCTGGAGAAGGCCAGATGTGAAGTGTAGAAGGTGTCATTTATTGGGGCACATTGAGAGGTTCTGTAAAGAAGCAAAA
ACTCAACAGCAAGGAGGAGCGCATGCTGCAGTACAACAAGAGGAAGATCAACTCTTTGTGGCTACTTGTTTCTCATCAGTCACTCAATGTGACAGTTGGTTGGTTGATAG
TGGGTGTACCAATCACATGACAAGTGACAAAGAGTTGTTTAAGGACCTTGACAGGTCATTTAAGTCAAGGGTGAAGATAAGGAATGGTGAGTATCTTGAAGTAAAGAGGA
AGGGCACAGTGTCAATAGAAAGCTGTGTTGGAACCAAGTTGATTACTGAAGTATTGTTTGTCCCTGGGATTGCTCAAAACTTGTTAAGTGTTGGTCAACTAATGCGACAC
AAGAGCTTCTCATTAGATCCACTAGAGAAGGAGCAGATAGCTTTCAAGTGTCAAGTGAATGACACTGAGTTAAGGCACAAAAGATTGAGGCATTTTCATCAAAAAGGGTT
GCAGTATCGGCAGGAAAATGAGATGACAATAGGAGTACCAATTCTGGAGGAGCTTAAAACAAACTGCGGAGTTTGTTTGCAGAAGTCAATCTGGAAAGAAGCTGAGAAAC
TGCAGCTCAGCCACACTGAATTATGTGGAGCCGTGGAGATATTAGCCCACCCAGCCATGGGAAGATTTCTATCGCATTGTGGTTGGAACTTGACGCTGGAGAGCGTGTGG
CGAGGCGTGCCAATGTTGACCTTTGAAATGGTGTTGAAAGTTGATGGAAGAGGGTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAACAAGTTTGAGCTGGTTTTCATACCCGGCCCGGGGCGGGGCGACCTCCCATCCACCATTGAAATGGCTAATATTCTCGTCACTCGAGATCATCGTCTCTCCATCAC
ACTGCTCATCCAAAAACTGCCCTTTGACACCAAATTACTCCAACATATCCAATCACTCCCTGTGGCTTTTGCCAATAAATCTATCCGCTTCATCGTCCTCCCTGAAACGC
CCTTTCCCAAGGAAACTACAAACAATTTTTTGTTGAACACACTCCGCATCATCAAAAGCTACAAACCCCACGTTAGAGAGGCCGTTCCCAGTTATGTGTTCTACACATCC
AGTGCTCCCTCTCTAGCCATTACTTTTGATCTTCAAGAGCTTTATGATCAGAATAACAGCAGCAAGGCAGTGGAAGTGGAACGGTTACAGAACTCGGATGATGCCAAGTT
TTGTCAATCCGATTCCCAGGAAGGATCAAACAATCTTTCTTCACTGGCTCCACCTGTGTTTGATGGTGAAAACTATCAAGCATGGGCAATCAAAATACAAGCCTACATGG
AGGGTTGTGATTATTGGGAAGCAATTGAGCAAGATTATGAAATTGCTCCACTTCCTGATAATCCAACAATGCATCAGATCAAGACTCACAAGGAGAGGGTCACCAGGAAG
GCAAAGGCTCGAGCTTGCCTATATGCAGCTGTGTCTCCCGCCATATTCAACAGAATTATGGCATTGAAGTCAGCAAAGGAGATCTGGGAGTTCCTCAAAAGTGAGTATGA
AGGTGATGAGAGGATTAAAGGCATGAAGGTGTTGAACTTGGTAAGGGAATTCGAAAGAATGCAGATGAAGGATTCTGAGTCCATCAAAGAGTACTCAGACAAGTTGATCG
GGATTGCTAACAAGGCAAGAGCATTAGGAACTGATCTATCTGACAATAGATTGGTTCAGAAGATTTTGGTTTCAGTACCTGAGAGATATGGAACAACCATTGCTTCCTTA
GAAAATTCTAAAGACCTCACTAAGCTTAAAGTGATAGAAGTAGTGAGTGCTTTACAAGCACAGGAGCAGAGGAGGTTGATGCGACAAGAAGGAAGCATTGAAGGGGCACT
GAAAGCTAGAATGCAGCAGGGAGAATATGAAAGAGAGAAGAAGTGGAAGAAGGGAAGTGGCAGCAATAGCTCAGAGTCTGTTGTAAAGGATGCTATTAGTGCATGCAAGC
ACTGTGGAAAGCATAATCATCCACACTTTAGATGCTGGAGAAGGCCAGATGTGAAGTGTAGAAGGTGTCATTTATTGGGGCACATTGAGAGGTTCTGTAAAGAAGCAAAA
ACTCAACAGCAAGGAGGAGCGCATGCTGCAGTACAACAAGAGGAAGATCAACTCTTTGTGGCTACTTGTTTCTCATCAGTCACTCAATGTGACAGTTGGTTGGTTGATAG
TGGGTGTACCAATCACATGACAAGTGACAAAGAGTTGTTTAAGGACCTTGACAGGTCATTTAAGTCAAGGGTGAAGATAAGGAATGGTGAGTATCTTGAAGTAAAGAGGA
AGGGCACAGTGTCAATAGAAAGCTGTGTTGGAACCAAGTTGATTACTGAAGTATTGTTTGTCCCTGGGATTGCTCAAAACTTGTTAAGTGTTGGTCAACTAATGCGACAC
AAGAGCTTCTCATTAGATCCACTAGAGAAGGAGCAGATAGCTTTCAAGTGTCAAGTGAATGACACTGAGTTAAGGCACAAAAGATTGAGGCATTTTCATCAAAAAGGGTT
GCAGTATCGGCAGGAAAATGAGATGACAATAGGAGTACCAATTCTGGAGGAGCTTAAAACAAACTGCGGAGTTTGTTTGCAGAAGTCAATCTGGAAAGAAGCTGAGAAAC
TGCAGCTCAGCCACACTGAATTATGTGGAGCCGTGGAGATATTAGCCCACCCAGCCATGGGAAGATTTCTATCGCATTGTGGTTGGAACTTGACGCTGGAGAGCGTGTGG
CGAGGCGTGCCAATGTTGACCTTTGAAATGGTGTTGAAAGTTGATGGAAGAGGGTGGTGA
Protein sequenceShow/hide protein sequence
MNKFELVFIPGPGRGDLPSTIEMANILVTRDHRLSITLLIQKLPFDTKLLQHIQSLPVAFANKSIRFIVLPETPFPKETTNNFLLNTLRIIKSYKPHVREAVPSYVFYTS
SAPSLAITFDLQELYDQNNSSKAVEVERLQNSDDAKFCQSDSQEGSNNLSSLAPPVFDGENYQAWAIKIQAYMEGCDYWEAIEQDYEIAPLPDNPTMHQIKTHKERVTRK
AKARACLYAAVSPAIFNRIMALKSAKEIWEFLKSEYEGDERIKGMKVLNLVREFERMQMKDSESIKEYSDKLIGIANKARALGTDLSDNRLVQKILVSVPERYGTTIASL
ENSKDLTKLKVIEVVSALQAQEQRRLMRQEGSIEGALKARMQQGEYEREKKWKKGSGSNSSESVVKDAISACKHCGKHNHPHFRCWRRPDVKCRRCHLLGHIERFCKEAK
TQQQGGAHAAVQQEEDQLFVATCFSSVTQCDSWLVDSGCTNHMTSDKELFKDLDRSFKSRVKIRNGEYLEVKRKGTVSIESCVGTKLITEVLFVPGIAQNLLSVGQLMRH
KSFSLDPLEKEQIAFKCQVNDTELRHKRLRHFHQKGLQYRQENEMTIGVPILEELKTNCGVCLQKSIWKEAEKLQLSHTELCGAVEILAHPAMGRFLSHCGWNLTLESVW
RGVPMLTFEMVLKVDGRGW