; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g30130 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g30130
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr6:22677391..22682696
RNA-Seq ExpressionMoc06g30130
SyntenyMoc06g30130
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]2.4e-18066.98Show/hide
Query:  LKAQSKYEPLTPEAVITRKEFDLMKHKFDEQVEALKARCEKKECSFDDGDLGESPFTSDILEAPIPSKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAAT
        +KA+S   P TP  VITR+EFD ++ + D QVEALKA+CE+KE   +DGDLGESPFTSD+LEAPIP KFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+
Subjt:  LKAQSKYEPLTPEAVITRKEFDLMKHKFDEQVEALKARCEKKECSFDDGDLGESPFTSDILEAPIPSKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAAT

Query:  DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSQHYDKKTATHLATIRQKEGETLREY-------------------------------
        DAIKCRAF+IALTGSARLWYRRLPA SISTYSQLR+EF++ FSS+HYDKKTATHLATIRQKEGETLREY                               
Subjt:  DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSQHYDKKTATHLATIRQKEGETLREY-------------------------------

Query:  ------------------KAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSFSASRTEYRRSESGPSRSRPYEQYTPTTIPISEILTN
                          KAKKVIDGQELLRTKTGRPE++I + +  ++   AD KS+DKG SFS+ R EYRR+E+GP+RSRPYE++TPTTIPISEILTN
Subjt:  ------------------KAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSFSASRTEYRRSESGPSRSRPYEQYTPTTIPISEILTN

Query:  IEESEMEKLLKRPEKLQGDPEKRNKDK---------------------------------------SNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPS
        IEES MEKLLKRPEKL+G PE+R+KDK                                       ++S EKKEERKRSRTPPRR DRPAVINTIFGGPS
Subjt:  IEESEMEKLLKRPEKLQGDPEKRNKDK---------------------------------------SNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPS

Query:  GGQSGNKRKVLAREARREVCIIREQKPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFS
        GGQSG KRK LAR ARREVCIIREQ+PTC ITF   DLE VHLPHNDALVIAPLIDHV+V RVL+DGG SANILSLPTYLALGWTRSQLKKSPTPLVGFS
Subjt:  GGQSGNKRKVLAREARREVCIIREQKPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFS

Query:  GETVSPEGCIDLPVTIGQEATQVTQMAEFV
        GE+V PEG IDLPVT+GQ+ TQVTQMAEFV
Subjt:  GETVSPEGCIDLPVTIGQEATQVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]6.9e-18865.65Show/hide
Query:  NSNLKAQSKYEPLTPEAVITRKEFDLMKHKFDEQVEALKARCEKKECSFDDGDLGESPFTSDILEAPIPSKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQ
        +SN +A+S + P TP+ VITR+EFD ++ K + QVEALKA+CE+KE   +DGDLGESPFTSD+LEA        PT+K YDGSKDPKDYVEVFEGLMDFQ
Subjt:  NSNLKAQSKYEPLTPEAVITRKEFDLMKHKFDEQVEALKARCEKKECSFDDGDLGESPFTSDILEAPIPSKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSQ-HYDKKTATHLATIR--QKEGETLRE--YKAKKVIDGQELLRTKTGRPEKQI
        AA+DAIKCRAFQIALTGSARLW++           QL+    S  S+  ++    A    T++  ++   T  E   KAKKVIDGQELLRTKTGRPE+ I
Subjt:  AATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSQ-HYDKKTATHLATIR--QKEGETLRE--YKAKKVIDGQELLRTKTGRPEKQI

Query:  DQKKLSQEKRKADSKSRDKGSSFSASRTEYRRSESGPSRSRPYEQYTPTTIPISEILTNIEESEMEKLLKRPEKLQGDPEKRNKDK--------------
        D+ + S +  KAD KS+DKG SFS+ R E+RR+ +GP+RSRPYE++TPTTIPISEILTNIEES MEKLLKRPEKL+G PE+RNKDK              
Subjt:  DQKKLSQEKRKADSKSRDKGSSFSASRTEYRRSESGPSRSRPYEQYTPTTIPISEILTNIEESEMEKLLKRPEKLQGDPEKRNKDK--------------

Query:  -------------------------SNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKVLAREARREVCIIREQKPTCSITFGDTDLEGV
                                 ++S EKKEERK SRTP RR DRPAVINTIFGGPSGGQSG+KRK LAR ARREVCIIREQ+PTC ITF   DLE V
Subjt:  -------------------------SNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKVLAREARREVCIIREQKPTCSITFGDTDLEGV

Query:  HLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGETVSPEGCIDLPVTIGQEATQVTQMAEFVVIDGMSAYNAI
        HLPHNDALVIAPLIDHV+VRRVL+D G SANI+SL TYLALGWTRSQLKKS TPLVGFS E+V PEGCIDLPVT+G + TQVTQMAEFVVIDG SAYNAI
Subjt:  HLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGETVSPEGCIDLPVTIGQEATQVTQMAEFVVIDGMSAYNAI

Query:  FGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQD-------DLPRKSKSQFSPPTEELELVPLL
        FGRPIIHSFRA+PSTLHQVLKYST NGVG VRGEQ  SRECYASALKGSSVCALE   S+D       +LPR+   +F+ PTEELELVPLL
Subjt:  FGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQD-------DLPRKSKSQFSPPTEELELVPLL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]3.6e-21357.76Show/hide
Query:  MVQPANSANSTERRGVNADNGTQRDLDARIVEDQVRAGQEGYLLHRSARHANQELPPAHPKPSKANRGRGGTTRKTSQRASQAADPEALSTLQRELDDMH
        MVQPANS N+ +RR + A++G QR++ A +VE Q         L RSAR     LPPAHPKPS                                     
Subjt:  MVQPANSANSTERRGVNADNGTQRDLDARIVEDQVRAGQEGYLLHRSARHANQELPPAHPKPSKANRGRGGTTRKTSQRASQAADPEALSTLQRELDDMH

Query:  HRLRTMEEMYVEATRANRTTSPSRVPGAPGQKGAPSIQPGDREPIPNDEGVDYSLWDNNLRKYLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYEPLT
                                                                                                  KA+S Y P+T
Subjt:  HRLRTMEEMYVEATRANRTTSPSRVPGAPGQKGAPSIQPGDREPIPNDEGVDYSLWDNNLRKYLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYEPLT

Query:  PEAVITRKEFDLMKHKFDEQVEALKARCEKKECSFDDGDLGESPFTSDILEAPIPSKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIA
        P  VITR+EFD +K KFD QVEALKARCEKKE SFDDGDLGE  F+SDILEA IP KFKTPTMKPYDGSKDPKDYVEVFE LMDFQAATDAIKC AFQIA
Subjt:  PEAVITRKEFDLMKHKFDEQVEALKARCEKKECSFDDGDLGESPFTSDILEAPIPSKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIA

Query:  LTGSARLWYRRLPARSISTYSQLRKEFISQFSSQHYDKKTATHLATIRQKEGETLREY------------------------------------------
        LTGSARLWYRRLPAR ISTYSQLRKEFISQFSS+HYD+KT THLATIRQKEGETLREY                                          
Subjt:  LTGSARLWYRRLPARSISTYSQLRKEFISQFSSQHYDKKTATHLATIRQKEGETLREY------------------------------------------

Query:  -------KAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSFSASRTEYRRSESGPSRSRPYEQYTPTTIPISEILTNIEESEMEKLLK
               K KKVIDGQELLRTKTGRPEK IDQ +  ++K KADSKSRDKG S S+SR +YRRS S  ++SRPYE YTPTTIPI EILTNIEE+ MEKLLK
Subjt:  -------KAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSFSASRTEYRRSESGPSRSRPYEQYTPTTIPISEILTNIEESEMEKLLK

Query:  RPEKLQGDPEKRNKDK---------------------------------------SNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKVL
        RPEKL+GDPEKRN DK                                       SNSVEKKEERKR RTPPRRDDRPAVI             NK+K L
Subjt:  RPEKLQGDPEKRNKDK---------------------------------------SNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKVL

Query:  AREARREVCIIREQKPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGETVSPEGCID
        AREARREVCIIREQ+PT SI F   DLEGVHLPHNDALVIAPLID VLVRR+L+DGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGE++S EGCID
Subjt:  AREARREVCIIREQKPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGETVSPEGCID

Query:  LPVTIGQEATQVTQMAEFVVIDGMSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQDDL
        LPV+I Q+ TQVTQMAEFVVIDG SAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGE KTSRECYAS  K SSVCALEEQT +D+L
Subjt:  LPVTIGQEATQVTQMAEFVVIDGMSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQDDL

XP_022154846.1 uncharacterized protein LOC111022006 [Momordica charantia]2.0e-18781.04Show/hide
Query:  KAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSFSASRTEYRRSESGPSRSRPYEQYTPTTIPISEILTNIEESEMEKLLKRPEKLQG
        KAKKVIDGQELLRTKTGRPEKQ+DQKK  Q K +AD +S+DKG S S+SRTEYRR+ESGP+RSRP+E+YTPTTIPISE+LTNIEES MEKLLKRPEKL+G
Subjt:  KAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSFSASRTEYRRSESGPSRSRPYEQYTPTTIPISEILTNIEESEMEKLLKRPEKLQG

Query:  DPEKRNKD-------------------------KSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKVLAREARREVCIIREQKPTCSIT
        DPEK NKD                         +SNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQ GNKR  LAR  RREVCIIREQKPTC IT
Subjt:  DPEKRNKD-------------------------KSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKVLAREARREVCIIREQKPTCSIT

Query:  FGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGETVSPEGCIDLPVTIGQEATQVTQMAEFVVI
        FGD DLEGVHLPHNDALVIAPLIDH+LVRRVLIDGGASANI SLPTYLALGWTRSQLKKSPTPLVGFSGE+VSPEGCIDL VTIGQ+ATQVTQMAEFVVI
Subjt:  FGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGETVSPEGCIDLPVTIGQEATQVTQMAEFVVI

Query:  DGMSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSSVCALEEQT-------SQDDLPRKSKSQFSPPTEELELVPLL
        D  SAYNAIFGRPIIHSF AV STLHQVLKYST NGVGTVRGEQKTSR+CYAS LKG +VC LEEQT       S+ DLP+KSK QFSPPTEELELVPLL
Subjt:  DGMSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSSVCALEEQT-------SQDDLPRKSKSQFSPPTEELELVPLL

Query:  SPEKQVSIGTKLGATNRKELINFLRSNSDVFAWSHEDMSGIDP
        SPEK V+IGTKL AT+RKELINFLRSNSDVFAWSHEDM GIDP
Subjt:  SPEKQVSIGTKLGATNRKELINFLRSNSDVFAWSHEDMSGIDP

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]4.4e-25979.42Show/hide
Query:  VPGAPGQKGAPSIQPGDREPIPNDEGVDYSLWDNNLRKYLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYEPLTPEAVITRKEFDLMKHKFDEQVEAL
        +PGAPG+KGAPSIQPG+REPIPNDEGVDYSL DN+LRK+LT+KKK+AS EPEDS SYSREFSNSNLKAQSKY+PL PEAVI R+EFDLMKH+FDEQVEAL
Subjt:  VPGAPGQKGAPSIQPGDREPIPNDEGVDYSLWDNNLRKYLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYEPLTPEAVITRKEFDLMKHKFDEQVEAL

Query:  KARCEKKECSFDDGDLGESPFTSDILEAPIPSKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR
        KARCEKKE  FDD DLGESPFTSDI+EAPIP KFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGSARLW RRLPARSISTYSQLR
Subjt:  KARCEKKECSFDDGDLGESPFTSDILEAPIPSKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR

Query:  KEFISQFSSQHYDKKTATHLATIRQKEGETLRE--------------YKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSFSASRTE
        KEFI QFS +HYD+KTATHLATIRQKE ETL                  AKKVIDGQELLRTKT RPEKQIDQK+LSQ+KRK DSKS+DKGSS S SRTE
Subjt:  KEFISQFSSQHYDKKTATHLATIRQKEGETLRE--------------YKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSFSASRTE

Query:  YRRSESGPSRSRPYEQYTPTTIPISEILTNIEESEMEKLLKRPEKLQGDPEKRNKDKSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRK
        YRRSESGPSRSRPYE+       I ++   I++S  +K + +P             +SNSVEKKEERKRSRTPPRR+DRPAVINTIFGGPSGGQ  NKRK
Subjt:  YRRSESGPSRSRPYEQYTPTTIPISEILTNIEESEMEKLLKRPEKLQGDPEKRNKDKSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRK

Query:  VLAREARREVCIIREQKPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGETVSPEGC
         LA EARR+V IIREQKPTCSITF DTDLEGVHLPHNDALVIAPLIDHVLVRRVL+DGGASANILSLPTYLAL  TRSQLKKSPTPLVGFS E+VSPEGC
Subjt:  VLAREARREVCIIREQKPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGETVSPEGC

Query:  IDLPVTIGQEATQVTQMAEFVVIDGMSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQDDLPRKSK
        IDLPVTIGQ++TQVTQMAEFVVIDG  AYNAIF RPIIHSF+AVPS LHQVLKYST NGVGTVRGEQKTSRECYASALK SSVCALEEQTSQDDLPR++K
Subjt:  IDLPVTIGQEATQVTQMAEFVVIDGMSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQDDLPRKSK

Query:  SQFSPPTEELELVPLLS
                   L P L+
Subjt:  SQFSPPTEELELVPLLS

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088131.2e-18066.98Show/hide
Query:  LKAQSKYEPLTPEAVITRKEFDLMKHKFDEQVEALKARCEKKECSFDDGDLGESPFTSDILEAPIPSKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAAT
        +KA+S   P TP  VITR+EFD ++ + D QVEALKA+CE+KE   +DGDLGESPFTSD+LEAPIP KFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+
Subjt:  LKAQSKYEPLTPEAVITRKEFDLMKHKFDEQVEALKARCEKKECSFDDGDLGESPFTSDILEAPIPSKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAAT

Query:  DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSQHYDKKTATHLATIRQKEGETLREY-------------------------------
        DAIKCRAF+IALTGSARLWYRRLPA SISTYSQLR+EF++ FSS+HYDKKTATHLATIRQKEGETLREY                               
Subjt:  DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSQHYDKKTATHLATIRQKEGETLREY-------------------------------

Query:  ------------------KAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSFSASRTEYRRSESGPSRSRPYEQYTPTTIPISEILTN
                          KAKKVIDGQELLRTKTGRPE++I + +  ++   AD KS+DKG SFS+ R EYRR+E+GP+RSRPYE++TPTTIPISEILTN
Subjt:  ------------------KAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSFSASRTEYRRSESGPSRSRPYEQYTPTTIPISEILTN

Query:  IEESEMEKLLKRPEKLQGDPEKRNKDK---------------------------------------SNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPS
        IEES MEKLLKRPEKL+G PE+R+KDK                                       ++S EKKEERKRSRTPPRR DRPAVINTIFGGPS
Subjt:  IEESEMEKLLKRPEKLQGDPEKRNKDK---------------------------------------SNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPS

Query:  GGQSGNKRKVLAREARREVCIIREQKPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFS
        GGQSG KRK LAR ARREVCIIREQ+PTC ITF   DLE VHLPHNDALVIAPLIDHV+V RVL+DGG SANILSLPTYLALGWTRSQLKKSPTPLVGFS
Subjt:  GGQSGNKRKVLAREARREVCIIREQKPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFS

Query:  GETVSPEGCIDLPVTIGQEATQVTQMAEFV
        GE+V PEG IDLPVT+GQ+ TQVTQMAEFV
Subjt:  GETVSPEGCIDLPVTIGQEATQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188233.3e-18865.65Show/hide
Query:  NSNLKAQSKYEPLTPEAVITRKEFDLMKHKFDEQVEALKARCEKKECSFDDGDLGESPFTSDILEAPIPSKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQ
        +SN +A+S + P TP+ VITR+EFD ++ K + QVEALKA+CE+KE   +DGDLGESPFTSD+LEA        PT+K YDGSKDPKDYVEVFEGLMDFQ
Subjt:  NSNLKAQSKYEPLTPEAVITRKEFDLMKHKFDEQVEALKARCEKKECSFDDGDLGESPFTSDILEAPIPSKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSQ-HYDKKTATHLATIR--QKEGETLRE--YKAKKVIDGQELLRTKTGRPEKQI
        AA+DAIKCRAFQIALTGSARLW++           QL+    S  S+  ++    A    T++  ++   T  E   KAKKVIDGQELLRTKTGRPE+ I
Subjt:  AATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSQ-HYDKKTATHLATIR--QKEGETLRE--YKAKKVIDGQELLRTKTGRPEKQI

Query:  DQKKLSQEKRKADSKSRDKGSSFSASRTEYRRSESGPSRSRPYEQYTPTTIPISEILTNIEESEMEKLLKRPEKLQGDPEKRNKDK--------------
        D+ + S +  KAD KS+DKG SFS+ R E+RR+ +GP+RSRPYE++TPTTIPISEILTNIEES MEKLLKRPEKL+G PE+RNKDK              
Subjt:  DQKKLSQEKRKADSKSRDKGSSFSASRTEYRRSESGPSRSRPYEQYTPTTIPISEILTNIEESEMEKLLKRPEKLQGDPEKRNKDK--------------

Query:  -------------------------SNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKVLAREARREVCIIREQKPTCSITFGDTDLEGV
                                 ++S EKKEERK SRTP RR DRPAVINTIFGGPSGGQSG+KRK LAR ARREVCIIREQ+PTC ITF   DLE V
Subjt:  -------------------------SNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKVLAREARREVCIIREQKPTCSITFGDTDLEGV

Query:  HLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGETVSPEGCIDLPVTIGQEATQVTQMAEFVVIDGMSAYNAI
        HLPHNDALVIAPLIDHV+VRRVL+D G SANI+SL TYLALGWTRSQLKKS TPLVGFS E+V PEGCIDLPVT+G + TQVTQMAEFVVIDG SAYNAI
Subjt:  HLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGETVSPEGCIDLPVTIGQEATQVTQMAEFVVIDGMSAYNAI

Query:  FGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQD-------DLPRKSKSQFSPPTEELELVPLL
        FGRPIIHSFRA+PSTLHQVLKYST NGVG VRGEQ  SRECYASALKGSSVCALE   S+D       +LPR+   +F+ PTEELELVPLL
Subjt:  FGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQD-------DLPRKSKSQFSPPTEELELVPLL

A0A6J1DHB3 uncharacterized protein LOC1110204791.8e-21357.76Show/hide
Query:  MVQPANSANSTERRGVNADNGTQRDLDARIVEDQVRAGQEGYLLHRSARHANQELPPAHPKPSKANRGRGGTTRKTSQRASQAADPEALSTLQRELDDMH
        MVQPANS N+ +RR + A++G QR++ A +VE Q         L RSAR     LPPAHPKPS                                     
Subjt:  MVQPANSANSTERRGVNADNGTQRDLDARIVEDQVRAGQEGYLLHRSARHANQELPPAHPKPSKANRGRGGTTRKTSQRASQAADPEALSTLQRELDDMH

Query:  HRLRTMEEMYVEATRANRTTSPSRVPGAPGQKGAPSIQPGDREPIPNDEGVDYSLWDNNLRKYLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYEPLT
                                                                                                  KA+S Y P+T
Subjt:  HRLRTMEEMYVEATRANRTTSPSRVPGAPGQKGAPSIQPGDREPIPNDEGVDYSLWDNNLRKYLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYEPLT

Query:  PEAVITRKEFDLMKHKFDEQVEALKARCEKKECSFDDGDLGESPFTSDILEAPIPSKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIA
        P  VITR+EFD +K KFD QVEALKARCEKKE SFDDGDLGE  F+SDILEA IP KFKTPTMKPYDGSKDPKDYVEVFE LMDFQAATDAIKC AFQIA
Subjt:  PEAVITRKEFDLMKHKFDEQVEALKARCEKKECSFDDGDLGESPFTSDILEAPIPSKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIA

Query:  LTGSARLWYRRLPARSISTYSQLRKEFISQFSSQHYDKKTATHLATIRQKEGETLREY------------------------------------------
        LTGSARLWYRRLPAR ISTYSQLRKEFISQFSS+HYD+KT THLATIRQKEGETLREY                                          
Subjt:  LTGSARLWYRRLPARSISTYSQLRKEFISQFSSQHYDKKTATHLATIRQKEGETLREY------------------------------------------

Query:  -------KAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSFSASRTEYRRSESGPSRSRPYEQYTPTTIPISEILTNIEESEMEKLLK
               K KKVIDGQELLRTKTGRPEK IDQ +  ++K KADSKSRDKG S S+SR +YRRS S  ++SRPYE YTPTTIPI EILTNIEE+ MEKLLK
Subjt:  -------KAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSFSASRTEYRRSESGPSRSRPYEQYTPTTIPISEILTNIEESEMEKLLK

Query:  RPEKLQGDPEKRNKDK---------------------------------------SNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKVL
        RPEKL+GDPEKRN DK                                       SNSVEKKEERKR RTPPRRDDRPAVI             NK+K L
Subjt:  RPEKLQGDPEKRNKDK---------------------------------------SNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKVL

Query:  AREARREVCIIREQKPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGETVSPEGCID
        AREARREVCIIREQ+PT SI F   DLEGVHLPHNDALVIAPLID VLVRR+L+DGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGE++S EGCID
Subjt:  AREARREVCIIREQKPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGETVSPEGCID

Query:  LPVTIGQEATQVTQMAEFVVIDGMSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQDDL
        LPV+I Q+ TQVTQMAEFVVIDG SAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGE KTSRECYAS  K SSVCALEEQT +D+L
Subjt:  LPVTIGQEATQVTQMAEFVVIDGMSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQDDL

A0A6J1DPC9 uncharacterized protein LOC1110222802.1e-25979.42Show/hide
Query:  VPGAPGQKGAPSIQPGDREPIPNDEGVDYSLWDNNLRKYLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYEPLTPEAVITRKEFDLMKHKFDEQVEAL
        +PGAPG+KGAPSIQPG+REPIPNDEGVDYSL DN+LRK+LT+KKK+AS EPEDS SYSREFSNSNLKAQSKY+PL PEAVI R+EFDLMKH+FDEQVEAL
Subjt:  VPGAPGQKGAPSIQPGDREPIPNDEGVDYSLWDNNLRKYLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYEPLTPEAVITRKEFDLMKHKFDEQVEAL

Query:  KARCEKKECSFDDGDLGESPFTSDILEAPIPSKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR
        KARCEKKE  FDD DLGESPFTSDI+EAPIP KFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGSARLW RRLPARSISTYSQLR
Subjt:  KARCEKKECSFDDGDLGESPFTSDILEAPIPSKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR

Query:  KEFISQFSSQHYDKKTATHLATIRQKEGETLRE--------------YKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSFSASRTE
        KEFI QFS +HYD+KTATHLATIRQKE ETL                  AKKVIDGQELLRTKT RPEKQIDQK+LSQ+KRK DSKS+DKGSS S SRTE
Subjt:  KEFISQFSSQHYDKKTATHLATIRQKEGETLRE--------------YKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSFSASRTE

Query:  YRRSESGPSRSRPYEQYTPTTIPISEILTNIEESEMEKLLKRPEKLQGDPEKRNKDKSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRK
        YRRSESGPSRSRPYE+       I ++   I++S  +K + +P             +SNSVEKKEERKRSRTPPRR+DRPAVINTIFGGPSGGQ  NKRK
Subjt:  YRRSESGPSRSRPYEQYTPTTIPISEILTNIEESEMEKLLKRPEKLQGDPEKRNKDKSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRK

Query:  VLAREARREVCIIREQKPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGETVSPEGC
         LA EARR+V IIREQKPTCSITF DTDLEGVHLPHNDALVIAPLIDHVLVRRVL+DGGASANILSLPTYLAL  TRSQLKKSPTPLVGFS E+VSPEGC
Subjt:  VLAREARREVCIIREQKPTCSITFGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGETVSPEGC

Query:  IDLPVTIGQEATQVTQMAEFVVIDGMSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQDDLPRKSK
        IDLPVTIGQ++TQVTQMAEFVVIDG  AYNAIF RPIIHSF+AVPS LHQVLKYST NGVGTVRGEQKTSRECYASALK SSVCALEEQTSQDDLPR++K
Subjt:  IDLPVTIGQEATQVTQMAEFVVIDGMSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQDDLPRKSK

Query:  SQFSPPTEELELVPLLS
                   L P L+
Subjt:  SQFSPPTEELELVPLLS

A0A6J1DPX9 uncharacterized protein LOC1110220069.7e-18881.04Show/hide
Query:  KAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSFSASRTEYRRSESGPSRSRPYEQYTPTTIPISEILTNIEESEMEKLLKRPEKLQG
        KAKKVIDGQELLRTKTGRPEKQ+DQKK  Q K +AD +S+DKG S S+SRTEYRR+ESGP+RSRP+E+YTPTTIPISE+LTNIEES MEKLLKRPEKL+G
Subjt:  KAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSFSASRTEYRRSESGPSRSRPYEQYTPTTIPISEILTNIEESEMEKLLKRPEKLQG

Query:  DPEKRNKD-------------------------KSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKVLAREARREVCIIREQKPTCSIT
        DPEK NKD                         +SNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQ GNKR  LAR  RREVCIIREQKPTC IT
Subjt:  DPEKRNKD-------------------------KSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKVLAREARREVCIIREQKPTCSIT

Query:  FGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGETVSPEGCIDLPVTIGQEATQVTQMAEFVVI
        FGD DLEGVHLPHNDALVIAPLIDH+LVRRVLIDGGASANI SLPTYLALGWTRSQLKKSPTPLVGFSGE+VSPEGCIDL VTIGQ+ATQVTQMAEFVVI
Subjt:  FGDTDLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGETVSPEGCIDLPVTIGQEATQVTQMAEFVVI

Query:  DGMSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSSVCALEEQT-------SQDDLPRKSKSQFSPPTEELELVPLL
        D  SAYNAIFGRPIIHSF AV STLHQVLKYST NGVGTVRGEQKTSR+CYAS LKG +VC LEEQT       S+ DLP+KSK QFSPPTEELELVPLL
Subjt:  DGMSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASALKGSSVCALEEQT-------SQDDLPRKSKSQFSPPTEELELVPLL

Query:  SPEKQVSIGTKLGATNRKELINFLRSNSDVFAWSHEDMSGIDP
        SPEK V+IGTKL AT+RKELINFLRSNSDVFAWSHEDM GIDP
Subjt:  SPEKQVSIGTKLGATNRKELINFLRSNSDVFAWSHEDMSGIDP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGCAACCAGCAAACTCTGCTAATTCGACAGAACGGAGGGGTGTGAACGCTGATAATGGCACTCAGCGAGACCTCGACGCAAGAATAGTCGAGGACCAGGTCCGAGC
AGGGCAAGAGGGATATCTGCTGCACAGATCTGCCCGCCATGCGAACCAAGAGTTACCCCCTGCTCACCCGAAACCCTCAAAGGCCAATCGAGGCCGAGGTGGGACCACGA
GAAAGACCTCCCAAAGGGCCAGCCAGGCAGCAGACCCTGAAGCTTTGTCTACTCTCCAGCGCGAGCTGGATGATATGCACCATCGGTTGCGCACAATGGAAGAAATGTAC
GTCGAGGCAACGCGTGCTAACCGAACTACGTCTCCCTCTAGGGTCCCGGGCGCACCCGGTCAAAAGGGAGCTCCATCTATCCAACCCGGTGACCGCGAGCCCATTCCTAA
TGATGAAGGAGTGGATTACAGCTTGTGGGATAACAATCTGAGAAAGTATCTCACTGAAAAGAAGAAGAGAGCATCTCGGGAGCCGGAAGACTCTCCTTCTTACTCCCGAG
AGTTCTCGAACTCGAACCTAAAGGCTCAGTCAAAATACGAGCCTCTGACGCCAGAAGCTGTGATAACTAGAAAAGAGTTCGACCTGATGAAGCACAAGTTCGATGAGCAG
GTCGAGGCGCTTAAGGCCAGGTGCGAGAAGAAAGAGTGCTCGTTCGACGATGGCGACTTGGGAGAATCGCCATTCACCTCGGACATCTTGGAGGCTCCAATCCCTTCGAA
GTTCAAAACTCCCACGATGAAGCCTTATGATGGGTCTAAGGATCCAAAGGATTATGTTGAGGTCTTTGAGGGCCTCATGGACTTTCAAGCGGCAACAGATGCAATTAAAT
GCCGCGCCTTCCAGATCGCGCTTACCGGTAGCGCGCGCCTGTGGTATCGGAGACTGCCGGCTAGGTCGATCTCGACCTACTCTCAGCTGAGAAAAGAGTTCATTAGTCAA
TTCTCTTCTCAGCATTATGATAAGAAGACAGCGACTCACCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGAGAGAGTATAAGGCGAAGAAGGTTATTGATGGGCA
AGAGCTCCTCCGAACCAAAACTGGCCGACCTGAGAAGCAGATCGACCAGAAGAAGTTGAGCCAGGAGAAGAGGAAGGCTGATTCCAAGTCTAGAGATAAGGGATCGTCCT
TTTCCGCCAGCAGAACAGAGTACCGTAGGTCGGAGAGCGGCCCCAGCCGGAGCCGACCTTATGAACAGTATACCCCAACCACCATCCCCATCTCCGAGATACTCACAAAC
ATCGAGGAGAGCGAGATGGAAAAGCTCCTCAAGCGACCTGAAAAGCTCCAAGGAGACCCAGAAAAGCGCAACAAAGATAAGTCTAACTCGGTCGAAAAGAAAGAAGAGAG
GAAGCGTTCAAGAACGCCGCCTCGCCGGGATGACCGACCTGCGGTCATCAACACTATTTTCGGGGGCCCAAGTGGGGGCCAGTCCGGAAACAAAAGGAAAGTGCTAGCTC
GCGAAGCCAGGCGCGAGGTATGCATCATTAGGGAGCAGAAACCTACTTGCTCCATCACCTTTGGCGATACCGACCTGGAGGGGGTCCACTTGCCCCATAACGATGCACTG
GTGATCGCCCCTCTGATCGATCATGTCCTGGTCAGAAGAGTGTTGATAGATGGAGGCGCGTCTGCCAACATCTTATCCCTCCCAACATATCTTGCCTTGGGGTGGACCAG
GTCTCAGTTGAAGAAAAGTCCAACACCTTTGGTTGGATTCTCTGGGGAAACGGTCTCCCCTGAAGGGTGTATCGATCTACCAGTCACGATTGGACAAGAGGCTACCCAAG
TAACGCAGATGGCTGAGTTCGTGGTAATCGACGGCATGTCGGCCTACAACGCCATCTTCGGGAGACCCATCATCCACTCATTCCGGGCCGTCCCCTCCACATTGCATCAA
GTCCTGAAGTACTCAACCCTTAATGGAGTGGGCACGGTCCGAGGTGAACAAAAAACTTCAAGGGAGTGTTACGCGTCCGCGCTCAAAGGATCGTCAGTATGTGCCCTGGA
AGAGCAAACCAGTCAAGACGACCTTCCGAGGAAAAGCAAAAGCCAGTTCTCTCCACCAACAGAGGAGCTCGAGCTTGTTCCCCTACTTAGCCCTGAAAAACAAGTAAGCA
TAGGAACCAAGCTGGGGGCCACTAACAGGAAAGAACTGATCAACTTCCTCAGGTCTAACTCGGACGTCTTCGCATGGTCTCACGAGGACATGTCTGGCATCGACCCAAAG
ATCATGACCGACCTGGCTAGATCGGTCCCGGTCGAGATCTTAGACAGTCCTTCAATCTTGGAGCCAGATGTGATGGAGGTTGATACTCCATCACCCTCTTGGATGGACCC
AATCGTGGAGTTCATCAAAGGAAACTCACCGCAAGATCCGAAGGAGCAAAGGAAGATGGCTCGGAGGGTAGCTCGGTTCACACTCCGTGAAGGAGCGTTGTACCAACGTG
GCTTCTCCCTGCCCCTGCTTAACCATGGTACCAGTAGAGATCGGCATACCAACAGACAGGGTAGAACAGTACGAGCCAACAAAGAACGAGGACGATCTACTCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTGCAACCAGCAAACTCTGCTAATTCGACAGAACGGAGGGGTGTGAACGCTGATAATGGCACTCAGCGAGACCTCGACGCAAGAATAGTCGAGGACCAGGTCCGAGC
AGGGCAAGAGGGATATCTGCTGCACAGATCTGCCCGCCATGCGAACCAAGAGTTACCCCCTGCTCACCCGAAACCCTCAAAGGCCAATCGAGGCCGAGGTGGGACCACGA
GAAAGACCTCCCAAAGGGCCAGCCAGGCAGCAGACCCTGAAGCTTTGTCTACTCTCCAGCGCGAGCTGGATGATATGCACCATCGGTTGCGCACAATGGAAGAAATGTAC
GTCGAGGCAACGCGTGCTAACCGAACTACGTCTCCCTCTAGGGTCCCGGGCGCACCCGGTCAAAAGGGAGCTCCATCTATCCAACCCGGTGACCGCGAGCCCATTCCTAA
TGATGAAGGAGTGGATTACAGCTTGTGGGATAACAATCTGAGAAAGTATCTCACTGAAAAGAAGAAGAGAGCATCTCGGGAGCCGGAAGACTCTCCTTCTTACTCCCGAG
AGTTCTCGAACTCGAACCTAAAGGCTCAGTCAAAATACGAGCCTCTGACGCCAGAAGCTGTGATAACTAGAAAAGAGTTCGACCTGATGAAGCACAAGTTCGATGAGCAG
GTCGAGGCGCTTAAGGCCAGGTGCGAGAAGAAAGAGTGCTCGTTCGACGATGGCGACTTGGGAGAATCGCCATTCACCTCGGACATCTTGGAGGCTCCAATCCCTTCGAA
GTTCAAAACTCCCACGATGAAGCCTTATGATGGGTCTAAGGATCCAAAGGATTATGTTGAGGTCTTTGAGGGCCTCATGGACTTTCAAGCGGCAACAGATGCAATTAAAT
GCCGCGCCTTCCAGATCGCGCTTACCGGTAGCGCGCGCCTGTGGTATCGGAGACTGCCGGCTAGGTCGATCTCGACCTACTCTCAGCTGAGAAAAGAGTTCATTAGTCAA
TTCTCTTCTCAGCATTATGATAAGAAGACAGCGACTCACCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGAGAGAGTATAAGGCGAAGAAGGTTATTGATGGGCA
AGAGCTCCTCCGAACCAAAACTGGCCGACCTGAGAAGCAGATCGACCAGAAGAAGTTGAGCCAGGAGAAGAGGAAGGCTGATTCCAAGTCTAGAGATAAGGGATCGTCCT
TTTCCGCCAGCAGAACAGAGTACCGTAGGTCGGAGAGCGGCCCCAGCCGGAGCCGACCTTATGAACAGTATACCCCAACCACCATCCCCATCTCCGAGATACTCACAAAC
ATCGAGGAGAGCGAGATGGAAAAGCTCCTCAAGCGACCTGAAAAGCTCCAAGGAGACCCAGAAAAGCGCAACAAAGATAAGTCTAACTCGGTCGAAAAGAAAGAAGAGAG
GAAGCGTTCAAGAACGCCGCCTCGCCGGGATGACCGACCTGCGGTCATCAACACTATTTTCGGGGGCCCAAGTGGGGGCCAGTCCGGAAACAAAAGGAAAGTGCTAGCTC
GCGAAGCCAGGCGCGAGGTATGCATCATTAGGGAGCAGAAACCTACTTGCTCCATCACCTTTGGCGATACCGACCTGGAGGGGGTCCACTTGCCCCATAACGATGCACTG
GTGATCGCCCCTCTGATCGATCATGTCCTGGTCAGAAGAGTGTTGATAGATGGAGGCGCGTCTGCCAACATCTTATCCCTCCCAACATATCTTGCCTTGGGGTGGACCAG
GTCTCAGTTGAAGAAAAGTCCAACACCTTTGGTTGGATTCTCTGGGGAAACGGTCTCCCCTGAAGGGTGTATCGATCTACCAGTCACGATTGGACAAGAGGCTACCCAAG
TAACGCAGATGGCTGAGTTCGTGGTAATCGACGGCATGTCGGCCTACAACGCCATCTTCGGGAGACCCATCATCCACTCATTCCGGGCCGTCCCCTCCACATTGCATCAA
GTCCTGAAGTACTCAACCCTTAATGGAGTGGGCACGGTCCGAGGTGAACAAAAAACTTCAAGGGAGTGTTACGCGTCCGCGCTCAAAGGATCGTCAGTATGTGCCCTGGA
AGAGCAAACCAGTCAAGACGACCTTCCGAGGAAAAGCAAAAGCCAGTTCTCTCCACCAACAGAGGAGCTCGAGCTTGTTCCCCTACTTAGCCCTGAAAAACAAGTAAGCA
TAGGAACCAAGCTGGGGGCCACTAACAGGAAAGAACTGATCAACTTCCTCAGGTCTAACTCGGACGTCTTCGCATGGTCTCACGAGGACATGTCTGGCATCGACCCAAAG
ATCATGACCGACCTGGCTAGATCGGTCCCGGTCGAGATCTTAGACAGTCCTTCAATCTTGGAGCCAGATGTGATGGAGGTTGATACTCCATCACCCTCTTGGATGGACCC
AATCGTGGAGTTCATCAAAGGAAACTCACCGCAAGATCCGAAGGAGCAAAGGAAGATGGCTCGGAGGGTAGCTCGGTTCACACTCCGTGAAGGAGCGTTGTACCAACGTG
GCTTCTCCCTGCCCCTGCTTAACCATGGTACCAGTAGAGATCGGCATACCAACAGACAGGGTAGAACAGTACGAGCCAACAAAGAACGAGGACGATCTACTCCTTAA
Protein sequenceShow/hide protein sequence
MVQPANSANSTERRGVNADNGTQRDLDARIVEDQVRAGQEGYLLHRSARHANQELPPAHPKPSKANRGRGGTTRKTSQRASQAADPEALSTLQRELDDMHHRLRTMEEMY
VEATRANRTTSPSRVPGAPGQKGAPSIQPGDREPIPNDEGVDYSLWDNNLRKYLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYEPLTPEAVITRKEFDLMKHKFDEQ
VEALKARCEKKECSFDDGDLGESPFTSDILEAPIPSKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQ
FSSQHYDKKTATHLATIRQKEGETLREYKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSFSASRTEYRRSESGPSRSRPYEQYTPTTIPISEILTN
IEESEMEKLLKRPEKLQGDPEKRNKDKSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKVLAREARREVCIIREQKPTCSITFGDTDLEGVHLPHNDAL
VIAPLIDHVLVRRVLIDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGETVSPEGCIDLPVTIGQEATQVTQMAEFVVIDGMSAYNAIFGRPIIHSFRAVPSTLHQ
VLKYSTLNGVGTVRGEQKTSRECYASALKGSSVCALEEQTSQDDLPRKSKSQFSPPTEELELVPLLSPEKQVSIGTKLGATNRKELINFLRSNSDVFAWSHEDMSGIDPK
IMTDLARSVPVEILDSPSILEPDVMEVDTPSPSWMDPIVEFIKGNSPQDPKEQRKMARRVARFTLREGALYQRGFSLPLLNHGTSRDRHTNRQGRTVRANKERGRSTP