; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g18100 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g18100
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr8:13708337..13716109
RNA-Seq ExpressionMoc08g18100
SyntenyMoc08g18100
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]2.1e-24281.7Show/hide
Query:  LKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAAT
        +KA+S   P  P  VITREEFD ++ + D QVEALKA+CE+K+   +DGDLGESPF +D+LEAPIPPKFK P +KPYDGSKDPKDYVEVFE LMDFQAA+
Subjt:  LKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAAT

Query:  DAIKCRAFQIALTGSARLWYRRLTARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEA
        DAIKCRAF+IALTGSARLWYRRL A SISTYSQLR+EF++ FSSRHYD+KTATHLATIRQKEGETLREYVTRFQEEQLKV HCSDDSAMCYFLTGLADEA
Subjt:  DAIKCRAFQIALTGSARLWYRRLTARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEA

Query:  LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPKKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTN
        LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP+++I + +  ++    D KSKDKG S SSGR EYRR+E G  RSRPYER+TPTTIPISEILTN
Subjt:  LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPKKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTN

Query:  IEENGMEKLLKRPEKLRGDPENRNKDKYCRFHRDNGHNTISCWELKRQIEDLIQDGYFKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVINTIFGGPN
        IEE+GMEKLLKRPEKLRG PE R+KDKYCRFHR++GHNT   WELKRQIE+LIQDGYFKKFVGKPR SS EKKEERKRSRTPPRR DRPAVINTIFGGP+
Subjt:  IEENGMEKLLKRPEKLRGDPENRNKDKYCRFHRDNGHNTISCWELKRQIEDLIQDGYFKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVINTIFGGPN

Query:  GGQSGNKRNELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTWSQLKKSPTPLVGFS
        GGQSG KR ELAR ARREVCIIREQ+PTC ITF  ADL  +HLPHNDALVIAPLIDHV+V RVLVDGG SANILSLPTYLALGWT SQLKKSPTPLVGFS
Subjt:  GGQSGNKRNELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTWSQLKKSPTPLVGFS

Query:  GESVSPEGCINLPVTIGQDATQVTQMAEFV
        GESV PEG I+LPVT+GQD TQVTQMAEFV
Subjt:  GESVSPEGCINLPVTIGQDATQVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]1.6e-23469.92Show/hide
Query:  NSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQ
        +SN +A+S + P  P+ VITREEFD ++ K + QVEALKA+CE+K+   +DGDLGESPF +D+LEA        P +K YDGSKDPKDYVEVFEGLMDFQ
Subjt:  NSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AATDAIKCRAFQIALTGSARLWYRRLTARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLA
        AA+DAIKCRAFQIALTGSARLW                                                     FQE+QLKV   SDDSAMCYFLTGLA
Subjt:  AATDAIKCRAFQIALTGSARLWYRRLTARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPKKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEI
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRP++ ID+ +  ++  + D+KSKDKG S SSGR E+RR+  G  RSRPYER+TPTTIPISEI
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPKKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEI

Query:  LTNIEENGMEKLLKRPEKLRGDPENRNKDKYCRFHRDNGHNTISCWELKRQIEDLIQDGYFKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVINTIFG
        LTNIEE+GMEKLLKRPEKLRG PE RNKDKYCRFHR++ HNT   WELKRQIEDLIQD YFKKFVGKPR SS EKKEERK SRTP RR DRPAVINTIFG
Subjt:  LTNIEENGMEKLLKRPEKLRGDPENRNKDKYCRFHRDNGHNTISCWELKRQIEDLIQDGYFKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVINTIFG

Query:  GPNGGQSGNKRNELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTWSQLKKSPTPLV
        GP+GGQSG+KR ELAR ARREVCIIREQ+PTC ITF  ADL  +HLPHNDALVIAPLIDHV+VRRVLVD G SANI+SL TYLALGWT SQLKKS TPLV
Subjt:  GPNGGQSGNKRNELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTWSQLKKSPTPLV

Query:  GFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASELKGSSVCALEE
        GFS ESV PEGCI+LPVT+G D TQVTQMAEFVV+DG+SAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVG VRGEQ  SRECYAS LKGSSVCALE 
Subjt:  GFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASELKGSSVCALEE

Query:  QINHGKQQESGTDLPKEGKRQFSPPTEELKLVPLL
         ++     E   +LP   +R+F+ PTEEL+LVPLL
Subjt:  QINHGKQQESGTDLPKEGKRQFSPPTEELKLVPLL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]4.7e-27166.45Show/hide
Query:  MVHPANSANTTEQRGVNADHDPQQDLGARIVEDQVRAGQEGDLPRRSARHANQELPPAHPKPSKANRGRRGTSRKTSQRANQAADPEALSALQRELDDMH
        MV PANS NT ++R + A+H  Q+++GA +VE Q       +   RSAR     LPPAHPKPS                                     
Subjt:  MVHPANSANTTEQRGVNADHDPQQDLGARIVEDQVRAGQEGDLPRRSARHANQELPPAHPKPSKANRGRRGTSRKTSQRANQAADPEALSALQRELDDMH

Query:  HRLRTVEEMYAEATRVNRTASPSMAPGAPGEKGVPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLA
                                                                                                  KA+S Y P+ 
Subjt:  HRLRTVEEMYAEATRVNRTASPSMAPGAPGEKGVPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLA

Query:  PEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIA
        P  VITREEFD +K KFD QVEALKARCEKK+ SFDDGDLGE  F +DILEA IPPKFKTP MKPYDGSKDPKDYVEVFE LMDFQAATDAIKC AFQIA
Subjt:  PEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIA

Query:  LTGSARLWYRRLTARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPA
        LTGSARLWYRRL AR ISTYSQLRKEFISQFSSRHYDRKT THLATIRQKEGETLREYVTRF EEQLKV HCSDDSAMCYFLTGLADE LTVKL EEAPA
Subjt:  LTGSARLWYRRLTARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPA

Query:  TFAEVLQKAKKVIDGQELLRTKTGRPKKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNIEENGMEKLLK
        TFAEVLQK KKVIDGQELLRTKTGRP+K IDQ +  +++ + D KS+DKGPSSSS R +YRRS     +SRPYE YTPTTIPI EILTNIEE GMEKLLK
Subjt:  TFAEVLQKAKKVIDGQELLRTKTGRPKKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNIEENGMEKLLK

Query:  RPEKLRGDPENRNKDKYCRFHRDNGHNTISCWELKRQIEDLIQDGYFKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVINTIFGGPNGGQSGNKRNEL
        RPEKLRGDPE RN DKYCRFHRD+GHNT + WELKRQIEDLIQDGYFKKFVGKPR +S+EKKEERKR RTPPRR+DRPAVI             NK+ EL
Subjt:  RPEKLRGDPENRNKDKYCRFHRDNGHNTISCWELKRQIEDLIQDGYFKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVINTIFGGPNGGQSGNKRNEL

Query:  AREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTWSQLKKSPTPLVGFSGESVSPEGCIN
        AREARREVCIIREQ+PT SI F  ADL G+HLPHNDALVIAPLID VLVRR+LVDGGASANILSL TYLALGWT SQLKKSPTPLVGFSGES+S EGCI+
Subjt:  AREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTWSQLKKSPTPLVGFSGESVSPEGCIN

Query:  LPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASELKGSSVCALEEQ
        LPV+I QD TQVTQMAEFVV+DG+SAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGE KTSRECYAS  K SSVCALEEQ
Subjt:  LPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASELKGSSVCALEEQ

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]4.1e-26772.9Show/hide
Query:  PGAPGEKGVPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALK
        PGAPGEKG PSIQPG+REPIPND GVDYSLRDNDLRKHLT+KKK+AS EPEDS SYSREFSNSNLKAQSKYKPL PEAVI REEFDLMKH+FDEQVEALK
Subjt:  PGAPGEKGVPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALK

Query:  ARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLTARSISTYSQLRK
        ARCEKK+  FDD DLGESPF +DI+EAPIPPKFKTP MKPYDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGSARLW RRL ARSISTYSQLRK
Subjt:  ARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLTARSISTYSQLRK

Query:  EFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGR
        EFI QFS RHYDRKTATHLATIRQKE                                   DE LTVKLGEEAPATFAEVLQ AKKVIDGQELLRTKT R
Subjt:  EFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGR

Query:  PKKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNIEENGMEKLLKRPEKLRGDPENRNKDKYCRFHRDNG
        P+KQIDQK+LSQ++ + D KSKDKG SSS  RTEYRRSE G  RSRPYER                                                  
Subjt:  PKKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNIEENGMEKLLKRPEKLRGDPENRNKDKYCRFHRDNG

Query:  HNTISCWELKRQIEDLIQDGYFKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVINTIFGGPNGGQSGNKRNELAREARREVCIIREQKPTCSITFGDA
             CWELKRQIEDLIQD YFKKFVGKPR +S+EKKEERKRSRTPPRR DRPAVINTIFGGP+GGQ  NKR ELA EARR+V IIREQKPTCSITF D 
Subjt:  HNTISCWELKRQIEDLIQDGYFKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVINTIFGGPNGGQSGNKRNELAREARREVCIIREQKPTCSITFGDA

Query:  DLGGIHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTWSQLKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKS
        DL G+HLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLAL  T SQLKKSPTPLVGFS ESVSPEGCI+LPVTIGQD+TQVTQMAEFVV+DG+ 
Subjt:  DLGGIHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTWSQLKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKS

Query:  AYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASELKGSSVCALEEQINHGKQQESGTDLPKEGKRQFSPPTEELKLVPLLSLEK
        AYNAIF RPIIHSF+AVPS LHQVLKYST NGVGTVRGEQKTSRECYAS LK SSVCALEE       Q S  DLP+E K           L P L+L+ 
Subjt:  AYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASELKGSSVCALEEQINHGKQQESGTDLPKEGKRQFSPPTEELKLVPLLSLEK

Query:  Q
        +
Subjt:  Q

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]3.2e-24381.68Show/hide
Query:  MDFQAATDAIKCRAFQIALTGSARLWYRRLTARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFL
        MDFQAATDAIKCRAFQIALTGSARLWYRRL ARSISTYSQLRKEFISQFSS HYDRKTATHLATIRQKE ETLREYVTRFQEEQLKV HCSDDSAMCYFL
Subjt:  MDFQAATDAIKCRAFQIALTGSARLWYRRLTARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPKKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIP
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRP+KQIDQKKLSQE+ + D KS+DKG SSS+ RTEYRR E G  RSRPYERYT +TIP
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPKKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIP

Query:  ISEILTNIEENGMEKLLKRPEKLRGDPENRNKDKYCRFHRDNGHNTISCWELKRQIEDLIQDGYFKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVIN
        ISEILTNIEE+GMEKLLKRPEKLRGD E RNK+KYCRFHRD+GHNT SCWELKRQIEDLIQDGYFKKFVGKPR +S+EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEENGMEKLLKRPEKLRGDPENRNKDKYCRFHRDNGHNTISCWELKRQIEDLIQDGYFKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVIN

Query:  TIFGGPNGGQSGNKRNELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTWSQLKKSP
        TIFGGPNGGQSGNKR ELAREARREVCIIRE KPTCSITFGDADL G+HLPHNDALVIA LIDH LVRRVL+DG                          
Subjt:  TIFGGPNGGQSGNKRNELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTWSQLKKSP

Query:  TPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASELKGSSVC
                      GCI+LPVTIGQDATQVTQMAEFVV+DG+SAYNAIFGRPIIHSFRAVPSTLHQVLKYST N VG VRGEQKTSRECYAS LKGS+VC
Subjt:  TPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASELKGSSVC

Query:  ALEEQINHGKQQESGTDLPKEGKRQFSPPTEELKLVPLLSLEKQVS
        ALEEQ N GK QES  DLPKEGKRQF PPTEEL+LVPLLS E+Q +
Subjt:  ALEEQINHGKQQESGTDLPKEGKRQFSPPTEELKLVPLLSLEKQVS

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088131.0e-24281.7Show/hide
Query:  LKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAAT
        +KA+S   P  P  VITREEFD ++ + D QVEALKA+CE+K+   +DGDLGESPF +D+LEAPIPPKFK P +KPYDGSKDPKDYVEVFE LMDFQAA+
Subjt:  LKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAAT

Query:  DAIKCRAFQIALTGSARLWYRRLTARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEA
        DAIKCRAF+IALTGSARLWYRRL A SISTYSQLR+EF++ FSSRHYD+KTATHLATIRQKEGETLREYVTRFQEEQLKV HCSDDSAMCYFLTGLADEA
Subjt:  DAIKCRAFQIALTGSARLWYRRLTARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEA

Query:  LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPKKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTN
        LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP+++I + +  ++    D KSKDKG S SSGR EYRR+E G  RSRPYER+TPTTIPISEILTN
Subjt:  LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPKKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTN

Query:  IEENGMEKLLKRPEKLRGDPENRNKDKYCRFHRDNGHNTISCWELKRQIEDLIQDGYFKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVINTIFGGPN
        IEE+GMEKLLKRPEKLRG PE R+KDKYCRFHR++GHNT   WELKRQIE+LIQDGYFKKFVGKPR SS EKKEERKRSRTPPRR DRPAVINTIFGGP+
Subjt:  IEENGMEKLLKRPEKLRGDPENRNKDKYCRFHRDNGHNTISCWELKRQIEDLIQDGYFKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVINTIFGGPN

Query:  GGQSGNKRNELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTWSQLKKSPTPLVGFS
        GGQSG KR ELAR ARREVCIIREQ+PTC ITF  ADL  +HLPHNDALVIAPLIDHV+V RVLVDGG SANILSLPTYLALGWT SQLKKSPTPLVGFS
Subjt:  GGQSGNKRNELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTWSQLKKSPTPLVGFS

Query:  GESVSPEGCINLPVTIGQDATQVTQMAEFV
        GESV PEG I+LPVT+GQD TQVTQMAEFV
Subjt:  GESVSPEGCINLPVTIGQDATQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188237.7e-23569.92Show/hide
Query:  NSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQ
        +SN +A+S + P  P+ VITREEFD ++ K + QVEALKA+CE+K+   +DGDLGESPF +D+LEA        P +K YDGSKDPKDYVEVFEGLMDFQ
Subjt:  NSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AATDAIKCRAFQIALTGSARLWYRRLTARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLA
        AA+DAIKCRAFQIALTGSARLW                                                     FQE+QLKV   SDDSAMCYFLTGLA
Subjt:  AATDAIKCRAFQIALTGSARLWYRRLTARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPKKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEI
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRP++ ID+ +  ++  + D+KSKDKG S SSGR E+RR+  G  RSRPYER+TPTTIPISEI
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPKKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEI

Query:  LTNIEENGMEKLLKRPEKLRGDPENRNKDKYCRFHRDNGHNTISCWELKRQIEDLIQDGYFKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVINTIFG
        LTNIEE+GMEKLLKRPEKLRG PE RNKDKYCRFHR++ HNT   WELKRQIEDLIQD YFKKFVGKPR SS EKKEERK SRTP RR DRPAVINTIFG
Subjt:  LTNIEENGMEKLLKRPEKLRGDPENRNKDKYCRFHRDNGHNTISCWELKRQIEDLIQDGYFKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVINTIFG

Query:  GPNGGQSGNKRNELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTWSQLKKSPTPLV
        GP+GGQSG+KR ELAR ARREVCIIREQ+PTC ITF  ADL  +HLPHNDALVIAPLIDHV+VRRVLVD G SANI+SL TYLALGWT SQLKKS TPLV
Subjt:  GPNGGQSGNKRNELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTWSQLKKSPTPLV

Query:  GFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASELKGSSVCALEE
        GFS ESV PEGCI+LPVT+G D TQVTQMAEFVV+DG+SAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVG VRGEQ  SRECYAS LKGSSVCALE 
Subjt:  GFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASELKGSSVCALEE

Query:  QINHGKQQESGTDLPKEGKRQFSPPTEELKLVPLL
         ++     E   +LP   +R+F+ PTEEL+LVPLL
Subjt:  QINHGKQQESGTDLPKEGKRQFSPPTEELKLVPLL

A0A6J1DHB3 uncharacterized protein LOC1110204792.3e-27166.45Show/hide
Query:  MVHPANSANTTEQRGVNADHDPQQDLGARIVEDQVRAGQEGDLPRRSARHANQELPPAHPKPSKANRGRRGTSRKTSQRANQAADPEALSALQRELDDMH
        MV PANS NT ++R + A+H  Q+++GA +VE Q       +   RSAR     LPPAHPKPS                                     
Subjt:  MVHPANSANTTEQRGVNADHDPQQDLGARIVEDQVRAGQEGDLPRRSARHANQELPPAHPKPSKANRGRRGTSRKTSQRANQAADPEALSALQRELDDMH

Query:  HRLRTVEEMYAEATRVNRTASPSMAPGAPGEKGVPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLA
                                                                                                  KA+S Y P+ 
Subjt:  HRLRTVEEMYAEATRVNRTASPSMAPGAPGEKGVPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLA

Query:  PEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIA
        P  VITREEFD +K KFD QVEALKARCEKK+ SFDDGDLGE  F +DILEA IPPKFKTP MKPYDGSKDPKDYVEVFE LMDFQAATDAIKC AFQIA
Subjt:  PEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIA

Query:  LTGSARLWYRRLTARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPA
        LTGSARLWYRRL AR ISTYSQLRKEFISQFSSRHYDRKT THLATIRQKEGETLREYVTRF EEQLKV HCSDDSAMCYFLTGLADE LTVKL EEAPA
Subjt:  LTGSARLWYRRLTARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPA

Query:  TFAEVLQKAKKVIDGQELLRTKTGRPKKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNIEENGMEKLLK
        TFAEVLQK KKVIDGQELLRTKTGRP+K IDQ +  +++ + D KS+DKGPSSSS R +YRRS     +SRPYE YTPTTIPI EILTNIEE GMEKLLK
Subjt:  TFAEVLQKAKKVIDGQELLRTKTGRPKKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNIEENGMEKLLK

Query:  RPEKLRGDPENRNKDKYCRFHRDNGHNTISCWELKRQIEDLIQDGYFKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVINTIFGGPNGGQSGNKRNEL
        RPEKLRGDPE RN DKYCRFHRD+GHNT + WELKRQIEDLIQDGYFKKFVGKPR +S+EKKEERKR RTPPRR+DRPAVI             NK+ EL
Subjt:  RPEKLRGDPENRNKDKYCRFHRDNGHNTISCWELKRQIEDLIQDGYFKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVINTIFGGPNGGQSGNKRNEL

Query:  AREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTWSQLKKSPTPLVGFSGESVSPEGCIN
        AREARREVCIIREQ+PT SI F  ADL G+HLPHNDALVIAPLID VLVRR+LVDGGASANILSL TYLALGWT SQLKKSPTPLVGFSGES+S EGCI+
Subjt:  AREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTWSQLKKSPTPLVGFSGESVSPEGCIN

Query:  LPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASELKGSSVCALEEQ
        LPV+I QD TQVTQMAEFVV+DG+SAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGE KTSRECYAS  K SSVCALEEQ
Subjt:  LPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASELKGSSVCALEEQ

A0A6J1DPC9 uncharacterized protein LOC1110222802.0e-26772.9Show/hide
Query:  PGAPGEKGVPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALK
        PGAPGEKG PSIQPG+REPIPND GVDYSLRDNDLRKHLT+KKK+AS EPEDS SYSREFSNSNLKAQSKYKPL PEAVI REEFDLMKH+FDEQVEALK
Subjt:  PGAPGEKGVPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALK

Query:  ARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLTARSISTYSQLRK
        ARCEKK+  FDD DLGESPF +DI+EAPIPPKFKTP MKPYDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGSARLW RRL ARSISTYSQLRK
Subjt:  ARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLTARSISTYSQLRK

Query:  EFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGR
        EFI QFS RHYDRKTATHLATIRQKE                                   DE LTVKLGEEAPATFAEVLQ AKKVIDGQELLRTKT R
Subjt:  EFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGR

Query:  PKKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNIEENGMEKLLKRPEKLRGDPENRNKDKYCRFHRDNG
        P+KQIDQK+LSQ++ + D KSKDKG SSS  RTEYRRSE G  RSRPYER                                                  
Subjt:  PKKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNIEENGMEKLLKRPEKLRGDPENRNKDKYCRFHRDNG

Query:  HNTISCWELKRQIEDLIQDGYFKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVINTIFGGPNGGQSGNKRNELAREARREVCIIREQKPTCSITFGDA
             CWELKRQIEDLIQD YFKKFVGKPR +S+EKKEERKRSRTPPRR DRPAVINTIFGGP+GGQ  NKR ELA EARR+V IIREQKPTCSITF D 
Subjt:  HNTISCWELKRQIEDLIQDGYFKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVINTIFGGPNGGQSGNKRNELAREARREVCIIREQKPTCSITFGDA

Query:  DLGGIHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTWSQLKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKS
        DL G+HLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLAL  T SQLKKSPTPLVGFS ESVSPEGCI+LPVTIGQD+TQVTQMAEFVV+DG+ 
Subjt:  DLGGIHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTWSQLKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKS

Query:  AYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASELKGSSVCALEEQINHGKQQESGTDLPKEGKRQFSPPTEELKLVPLLSLEK
        AYNAIF RPIIHSF+AVPS LHQVLKYST NGVGTVRGEQKTSRECYAS LK SSVCALEE       Q S  DLP+E K           L P L+L+ 
Subjt:  AYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASELKGSSVCALEEQINHGKQQESGTDLPKEGKRQFSPPTEELKLVPLLSLEK

Query:  Q
        +
Subjt:  Q

A0A6J1DZB9 uncharacterized protein LOC1110249041.5e-24381.68Show/hide
Query:  MDFQAATDAIKCRAFQIALTGSARLWYRRLTARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFL
        MDFQAATDAIKCRAFQIALTGSARLWYRRL ARSISTYSQLRKEFISQFSS HYDRKTATHLATIRQKE ETLREYVTRFQEEQLKV HCSDDSAMCYFL
Subjt:  MDFQAATDAIKCRAFQIALTGSARLWYRRLTARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPKKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIP
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRP+KQIDQKKLSQE+ + D KS+DKG SSS+ RTEYRR E G  RSRPYERYT +TIP
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPKKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIP

Query:  ISEILTNIEENGMEKLLKRPEKLRGDPENRNKDKYCRFHRDNGHNTISCWELKRQIEDLIQDGYFKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVIN
        ISEILTNIEE+GMEKLLKRPEKLRGD E RNK+KYCRFHRD+GHNT SCWELKRQIEDLIQDGYFKKFVGKPR +S+EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEENGMEKLLKRPEKLRGDPENRNKDKYCRFHRDNGHNTISCWELKRQIEDLIQDGYFKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVIN

Query:  TIFGGPNGGQSGNKRNELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTWSQLKKSP
        TIFGGPNGGQSGNKR ELAREARREVCIIRE KPTCSITFGDADL G+HLPHNDALVIA LIDH LVRRVL+DG                          
Subjt:  TIFGGPNGGQSGNKRNELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTWSQLKKSP

Query:  TPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASELKGSSVC
                      GCI+LPVTIGQDATQVTQMAEFVV+DG+SAYNAIFGRPIIHSFRAVPSTLHQVLKYST N VG VRGEQKTSRECYAS LKGS+VC
Subjt:  TPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASELKGSSVC

Query:  ALEEQINHGKQQESGTDLPKEGKRQFSPPTEELKLVPLLSLEKQVS
        ALEEQ N GK QES  DLPKEGKRQF PPTEEL+LVPLLS E+Q +
Subjt:  ALEEQINHGKQQESGTDLPKEGKRQFSPPTEELKLVPLLSLEKQVS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTCCTTCAGCAGCTGCAGCCATGATGACAAATGGGGGCTGGGTGAGCATTGGTTGGTTTTTGGCCGATCCCGACACCGACATCTCCCTTCCTATGGATTGCGCCAA
TCGACTGACTGATCGGTTCAGTCGGTCGGTCCACCTCGGCCCCACCGAGGTGGACCGAGTTTGCCCCTTAAAAGATGAAGTATGGGCACCAAAGCTCGACCTGATTGTGG
TCCGACCTGCTCGGAACCCGACAGGTCCATCTAGTGTTCAGGTCGAAACCGGAGACCGGGTTCGAGCTCGATTCGTGAAGAACCGTTGTGCAGATTCCTGCATAAACATT
TGGCGCCGTCTGTGGGAACGACATCTTAAGTTATCCCGATTTAAAAAAAAAATATACGCAAAGATGGTGCATCCAGCAAACTCTGCCAATACGACAGAACAGAGGGGTGT
GAACGCTGATCACGACCCTCAGCAAGACCTCGGTGCAAGAATAGTCGAGGACCAGGTCCGAGCAGGGCAAGAGGGAGATCTGCCGCGCAGATCTGCCCGTCATGCGAACC
AAGAGCTACCACCTGCTCACCCGAAACCCTCAAAAGCCAACAGAGGCCGAAGAGGGACATCGAGAAAAACCTCCCAAAGGGCTAACCAAGCAGCAGACCCTGAAGCTCTG
TCTGCTCTTCAGCGCGAGTTGGATGATATGCACCATCGGTTGCGCACAGTGGAAGAGATGTACGCCGAGGCAACGCGCGTTAACCGAACTGCGTCTCCCTCTATGGCTCC
AGGCGCACCCGGTGAAAAGGGAGTTCCATCTATCCAACCTGGCGATCGCGAGCCCATTCCGAACGATGGGGGAGTGGATTACAGCTTGCGAGACAACGATTTGAGAAAGC
ATCTCACTGAAAAGAAGAAGAGAGCATCTCGGGAGCCGGAAGACTCTCCTTCCTACTCCCGAGAATTCTCCAACTCGAACCTAAAGGCTCAATCAAAATACAAGCCCCTG
GCACCAGAAGCTGTGATCACGAGGGAAGAGTTCGACCTGATGAAGCACAAGTTCGATGAGCAGGTCGAGGCGCTTAAGGCCAGGTGCGAGAAGAAAGACTGTTCGTTCGA
CGATGGCGACTTGGGAGAATCGCCATTCATCGCAGACATCCTGGAGGCTCCAATCCCTCCGAAGTTCAAGACTCCCGCCATGAAGCCCTATGATGGGTCTAAGGACCCTA
AAGATTATGTTGAGGTTTTCGAGGGCCTCATGGACTTTCAAGCGGCGACAGATGCGATCAAATGCCGCGCCTTCCAGATCGCGCTTACCGGTAGCGCGCGCCTGTGGTAT
CGAAGACTGACGGCCAGGTCGATCTCGACCTACTCCCAGCTGAGAAAGGAGTTCATTAGTCAATTCTCCTCTCGGCATTATGATAGAAAGACAGCGACTCACCTTGCCAC
CATCAGGCAGAAGGAAGGAGAGACGCTGAGAGAGTATGTCACACGGTTCCAGGAAGAGCAGCTTAAGGTCGTGCACTGCTCTGACGATTCGGCTATGTGCTACTTTCTCA
CCGGCCTGGCCGATGAGGCCTTAACCGTAAAACTTGGAGAGGAAGCTCCAGCCACTTTCGCCGAAGTTTTACAGAAGGCGAAGAAAGTCATTGATGGGCAGGAGCTCCTC
CGAACCAAGACTGGCCGACCTAAAAAACAGATCGATCAGAAGAAACTTAGCCAGGAGAGGATGAGGATTGATGTCAAGTCCAAAGATAAGGGACCATCCTCATCCAGTGG
CAGAACAGAGTACCGAAGGTCGGAGGGCGGCTCCATCCGGAGCCGACCTTATGAGCGGTATACTCCAACCACCATCCCCATCTCCGAGATACTCACGAACATTGAGGAAA
ATGGGATGGAAAAGCTCCTCAAGCGACCTGAGAAGCTCCGAGGAGACCCAGAGAATCGCAACAAAGATAAGTACTGCCGTTTTCACCGCGATAACGGCCATAATACGATA
AGCTGCTGGGAACTGAAGCGCCAGATTGAAGATCTCATTCAAGATGGCTACTTCAAAAAATTTGTGGGCAAACCGAGGTTTAGCTCAATCGAAAAGAAAGAAGAGAGGAA
GCGTTCAAGAACTCCGCCCCGCCGGAATGACCGACCTGCAGTCATCAACACTATTTTCGGAGGCCCGAACGGGGGCCAGTCCGGAAACAAGAGGAATGAGCTAGCTCGCG
AAGCCAGGCGCGAGGTATGCATCATTAGGGAGCAGAAACCTACTTGCTCCATCACTTTCGGCGATGCCGATTTGGGGGGGATTCATTTGCCCCACAATGACGCGCTCGTG
ATCGCCCCTCTCATTGATCACGTCCTGGTCCGAAGGGTATTGGTTGATGGAGGCGCATCTGCGAACATCTTGTCCCTCCCAACATATTTAGCATTGGGATGGACCTGGTC
ACAATTGAAGAAGAGTCCAACACCCCTGGTTGGATTCTCTGGTGAATCGGTCTCCCCAGAAGGGTGCATCAACCTGCCGGTAACTATAGGGCAAGATGCCACCCAAGTAA
CGCAGATGGCCGAGTTCGTGGTAGTCGACGGCAAATCGGCCTACAACGCCATTTTTGGGAGACCCATTATCCACTCATTTCGAGCTGTTCCTTCCACGTTGCATCAAGTC
TTGAAGTACTCAACCCTTAATGGAGTGGGCACGGTCCGAGGTGAACAAAAAACTTCAAGAGAGTGCTACGCGTCCGAGCTCAAGGGATCGTCGGTATGTGCCCTGGAGGA
ACAAATCAATCATGGCAAGCAGCAGGAGTCAGGGACCGACCTGCCAAAAGAAGGTAAAAGGCAGTTCTCCCCGCCAACAGAAGAGCTCAAGCTTGTTCCTTTACTTAGCC
TCGAAAAACAAGTAAGTATAGGAACCAAGCTGGGGGCCACTGACAGGGAAGAACTGATCAACTTCCTCAGGTCTAACTCGGACGTCTTCGCATGGTCTCACGAGGACATG
CCTGGTATTGACCCGAAGATTATGGTCGGTCCCGATCGAAATCCTAGACACTCCTTCAATCTTAGAACCAGATGTGATGAAGGTGGATACTCCGTCACCCACTTGGATGG
ACCCAATCGTGGAGTTCATCAAAGGAAGCCCACCGCAAGAGCCGAAGGAGCAAAAGAAGATGACGAGAAGAGCAGCTCGGTTCACGCTCCGAGAAGGAGTGTTGTACCGA
CGTGGCTTCTCCCTGCCTCTCCTCAAGTGTTCAAGAAATTCAATCCTCTAAACCTACGGGTTCGAGGTGCGATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCTCCTTCAGCAGCTGCAGCCATGATGACAAATGGGGGCTGGGTGAGCATTGGTTGGTTTTTGGCCGATCCCGACACCGACATCTCCCTTCCTATGGATTGCGCCAA
TCGACTGACTGATCGGTTCAGTCGGTCGGTCCACCTCGGCCCCACCGAGGTGGACCGAGTTTGCCCCTTAAAAGATGAAGTATGGGCACCAAAGCTCGACCTGATTGTGG
TCCGACCTGCTCGGAACCCGACAGGTCCATCTAGTGTTCAGGTCGAAACCGGAGACCGGGTTCGAGCTCGATTCGTGAAGAACCGTTGTGCAGATTCCTGCATAAACATT
TGGCGCCGTCTGTGGGAACGACATCTTAAGTTATCCCGATTTAAAAAAAAAATATACGCAAAGATGGTGCATCCAGCAAACTCTGCCAATACGACAGAACAGAGGGGTGT
GAACGCTGATCACGACCCTCAGCAAGACCTCGGTGCAAGAATAGTCGAGGACCAGGTCCGAGCAGGGCAAGAGGGAGATCTGCCGCGCAGATCTGCCCGTCATGCGAACC
AAGAGCTACCACCTGCTCACCCGAAACCCTCAAAAGCCAACAGAGGCCGAAGAGGGACATCGAGAAAAACCTCCCAAAGGGCTAACCAAGCAGCAGACCCTGAAGCTCTG
TCTGCTCTTCAGCGCGAGTTGGATGATATGCACCATCGGTTGCGCACAGTGGAAGAGATGTACGCCGAGGCAACGCGCGTTAACCGAACTGCGTCTCCCTCTATGGCTCC
AGGCGCACCCGGTGAAAAGGGAGTTCCATCTATCCAACCTGGCGATCGCGAGCCCATTCCGAACGATGGGGGAGTGGATTACAGCTTGCGAGACAACGATTTGAGAAAGC
ATCTCACTGAAAAGAAGAAGAGAGCATCTCGGGAGCCGGAAGACTCTCCTTCCTACTCCCGAGAATTCTCCAACTCGAACCTAAAGGCTCAATCAAAATACAAGCCCCTG
GCACCAGAAGCTGTGATCACGAGGGAAGAGTTCGACCTGATGAAGCACAAGTTCGATGAGCAGGTCGAGGCGCTTAAGGCCAGGTGCGAGAAGAAAGACTGTTCGTTCGA
CGATGGCGACTTGGGAGAATCGCCATTCATCGCAGACATCCTGGAGGCTCCAATCCCTCCGAAGTTCAAGACTCCCGCCATGAAGCCCTATGATGGGTCTAAGGACCCTA
AAGATTATGTTGAGGTTTTCGAGGGCCTCATGGACTTTCAAGCGGCGACAGATGCGATCAAATGCCGCGCCTTCCAGATCGCGCTTACCGGTAGCGCGCGCCTGTGGTAT
CGAAGACTGACGGCCAGGTCGATCTCGACCTACTCCCAGCTGAGAAAGGAGTTCATTAGTCAATTCTCCTCTCGGCATTATGATAGAAAGACAGCGACTCACCTTGCCAC
CATCAGGCAGAAGGAAGGAGAGACGCTGAGAGAGTATGTCACACGGTTCCAGGAAGAGCAGCTTAAGGTCGTGCACTGCTCTGACGATTCGGCTATGTGCTACTTTCTCA
CCGGCCTGGCCGATGAGGCCTTAACCGTAAAACTTGGAGAGGAAGCTCCAGCCACTTTCGCCGAAGTTTTACAGAAGGCGAAGAAAGTCATTGATGGGCAGGAGCTCCTC
CGAACCAAGACTGGCCGACCTAAAAAACAGATCGATCAGAAGAAACTTAGCCAGGAGAGGATGAGGATTGATGTCAAGTCCAAAGATAAGGGACCATCCTCATCCAGTGG
CAGAACAGAGTACCGAAGGTCGGAGGGCGGCTCCATCCGGAGCCGACCTTATGAGCGGTATACTCCAACCACCATCCCCATCTCCGAGATACTCACGAACATTGAGGAAA
ATGGGATGGAAAAGCTCCTCAAGCGACCTGAGAAGCTCCGAGGAGACCCAGAGAATCGCAACAAAGATAAGTACTGCCGTTTTCACCGCGATAACGGCCATAATACGATA
AGCTGCTGGGAACTGAAGCGCCAGATTGAAGATCTCATTCAAGATGGCTACTTCAAAAAATTTGTGGGCAAACCGAGGTTTAGCTCAATCGAAAAGAAAGAAGAGAGGAA
GCGTTCAAGAACTCCGCCCCGCCGGAATGACCGACCTGCAGTCATCAACACTATTTTCGGAGGCCCGAACGGGGGCCAGTCCGGAAACAAGAGGAATGAGCTAGCTCGCG
AAGCCAGGCGCGAGGTATGCATCATTAGGGAGCAGAAACCTACTTGCTCCATCACTTTCGGCGATGCCGATTTGGGGGGGATTCATTTGCCCCACAATGACGCGCTCGTG
ATCGCCCCTCTCATTGATCACGTCCTGGTCCGAAGGGTATTGGTTGATGGAGGCGCATCTGCGAACATCTTGTCCCTCCCAACATATTTAGCATTGGGATGGACCTGGTC
ACAATTGAAGAAGAGTCCAACACCCCTGGTTGGATTCTCTGGTGAATCGGTCTCCCCAGAAGGGTGCATCAACCTGCCGGTAACTATAGGGCAAGATGCCACCCAAGTAA
CGCAGATGGCCGAGTTCGTGGTAGTCGACGGCAAATCGGCCTACAACGCCATTTTTGGGAGACCCATTATCCACTCATTTCGAGCTGTTCCTTCCACGTTGCATCAAGTC
TTGAAGTACTCAACCCTTAATGGAGTGGGCACGGTCCGAGGTGAACAAAAAACTTCAAGAGAGTGCTACGCGTCCGAGCTCAAGGGATCGTCGGTATGTGCCCTGGAGGA
ACAAATCAATCATGGCAAGCAGCAGGAGTCAGGGACCGACCTGCCAAAAGAAGGTAAAAGGCAGTTCTCCCCGCCAACAGAAGAGCTCAAGCTTGTTCCTTTACTTAGCC
TCGAAAAACAAGTAAGTATAGGAACCAAGCTGGGGGCCACTGACAGGGAAGAACTGATCAACTTCCTCAGGTCTAACTCGGACGTCTTCGCATGGTCTCACGAGGACATG
CCTGGTATTGACCCGAAGATTATGGTCGGTCCCGATCGAAATCCTAGACACTCCTTCAATCTTAGAACCAGATGTGATGAAGGTGGATACTCCGTCACCCACTTGGATGG
ACCCAATCGTGGAGTTCATCAAAGGAAGCCCACCGCAAGAGCCGAAGGAGCAAAAGAAGATGACGAGAAGAGCAGCTCGGTTCACGCTCCGAGAAGGAGTGTTGTACCGA
CGTGGCTTCTCCCTGCCTCTCCTCAAGTGTTCAAGAAATTCAATCCTCTAAACCTACGGGTTCGAGGTGCGATGTGA
Protein sequenceShow/hide protein sequence
MPPSAAAAMMTNGGWVSIGWFLADPDTDISLPMDCANRLTDRFSRSVHLGPTEVDRVCPLKDEVWAPKLDLIVVRPARNPTGPSSVQVETGDRVRARFVKNRCADSCINI
WRRLWERHLKLSRFKKKIYAKMVHPANSANTTEQRGVNADHDPQQDLGARIVEDQVRAGQEGDLPRRSARHANQELPPAHPKPSKANRGRRGTSRKTSQRANQAADPEAL
SALQRELDDMHHRLRTVEEMYAEATRVNRTASPSMAPGAPGEKGVPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPL
APEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWY
RRLTARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELL
RTKTGRPKKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNIEENGMEKLLKRPEKLRGDPENRNKDKYCRFHRDNGHNTI
SCWELKRQIEDLIQDGYFKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVINTIFGGPNGGQSGNKRNELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALV
IAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTWSQLKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQV
LKYSTLNGVGTVRGEQKTSRECYASELKGSSVCALEEQINHGKQQESGTDLPKEGKRQFSPPTEELKLVPLLSLEKQVSIGTKLGATDREELINFLRSNSDVFAWSHEDM
PGIDPKIMVGPDRNPRHSFNLRTRCDEGGYSVTHLDGPNRGVHQRKPTARAEGAKEDDEKSSSVHAPRRSVVPTWLLPASPQVFKKFNPLNLRVRGAM