; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g01220 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g01220
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:1001821..1007406
RNA-Seq ExpressionMoc07g01220
SyntenyMoc07g01220
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]3.4e-22581.63Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDG+LGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIMQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA F SRHYDKKTATHLATI QKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIMQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKRSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILRNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD E ADPKSKDK SFSSGRAEYRRAE+GPTRSRPYERFTPTTIPISEIL NIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKRSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILRNIE

Query:  DSGIEKLLKRPKKLRGVPERRSKDK---------------------------------------TSSAEKKEERKRSRTPLRRTDRPAVINTIFGGPNGG
        +SG+EKLLKRP+KLRG PERRSKDK                                       TSSAEKKEERKRSRTP RRTDRPAVINTIFGGP+GG
Subjt:  DSGIEKLLKRPKKLRGVPERRSKDK---------------------------------------TSSAEKKEERKRSRTPLRRTDRPAVINTIFGGPNGG

Query:  QSGLKRKELARAARREVCIIREQGLTCPITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL------
        QSG KRKELARAARREVCIIREQ  TCPITFDGADL+EVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPL      
Subjt:  QSGLKRKELARAARREVCIIREQGLTCPITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL------

Query:  ----------------DQTRVTQMAEFV
                        DQT+VTQMAEFV
Subjt:  ----------------DQTRVTQMAEFV

XP_022150613.1 uncharacterized protein LOC111018708, partial [Momordica charantia]1.2e-18584.12Show/hide
Query:  KDDSLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
        KDDSLNDG+LGES FTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDF AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
Subjt:  KDDSLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FFSRHYDKKTATHLATIMQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
        F SR Y KKT THLATI QKEG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAKKVIDGQELLRTKTGRP+RKI
Subjt:  FFSRHYDKKTATHLATIMQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGRSGKD-ERADPKSKDKRSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILRNIEDSGIEKLLKRPKKLRGVPERRSKDK---------------
        GRGRSGKD ERADPKSKDK SFSSGRAEYRRAESGPT+SRPYERFTPTTIPISEIL NIE+SG+EKLLKRP+KLRG PERRSKDK               
Subjt:  GRGRSGKD-ERADPKSKDKRSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILRNIEDSGIEKLLKRPKKLRGVPERRSKDK---------------

Query:  ------------------------TSSAEKKEERKRSRTPLRRTDRPAVINTIFGGPNGGQSGLKRKELARAARREVCIIREQGLTCPITFDGADLKEVH
                                TSSAEKKEERKRSRTP RRTDRPAVINTIFGGP+GGQSG KRKELARAARREVCIIREQG TCPITFDGAD +EVH
Subjt:  ------------------------TSSAEKKEERKRSRTPLRRTDRPAVINTIFGGPNGGQSGLKRKELARAARREVCIIREQGLTCPITFDGADLKEVH

Query:  LPHNDALVIAPLIDHVVVRRVL
        LPHNDA VIAPLIDHVVVRRVL
Subjt:  LPHNDALVIAPLIDHVVVRRVL

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]7.2e-22868.76Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDG+LGESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIMQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIMQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKRSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILR
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKDE+AD KSKDK SFSSGRAE+RRA +GPTRSRPYERFTPTTIPISEIL 
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKRSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILR

Query:  NIEDSGIEKLLKRPKKLRGVPERRSKDK---------------------------------------TSSAEKKEERKRSRTPLRRTDRPAVINTIFGGP
        NIE+SG+EKLLKRP+KLRG PERR+KDK                                       TSSAEKKEERK SRTPLRR DRPAVINTIFGGP
Subjt:  NIEDSGIEKLLKRPKKLRGVPERRSKDK---------------------------------------TSSAEKKEERKRSRTPLRRTDRPAVINTIFGGP

Query:  NGGQSGLKRKELARAARREVCIIREQGLTCPITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL---
        +GGQSG KRKELARAARREVCIIREQ  TCPITFD ADL+EVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPL   
Subjt:  NGGQSGLKRKELARAARREVCIIREQGLTCPITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL---

Query:  -------------------DQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALKTL-
                           DQT+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYA+ALKGSSVCAL+TL 
Subjt:  -------------------DQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALKTL-

Query:  -RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILE--PDLMEIG
         RDGTLEF+A+LPR+EFAAPTEELELVPLL  +      +E +L     +  +D+   +E  P+ + +G
Subjt:  -RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILE--PDLMEIG

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]1.8e-22358.73Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKAIRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR
        MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L TEPL RSARIT P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKAIRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR

Query:  TQMRSMEAMYNEMVLAAGVGCRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMRSMEAMYNEMVLAAGVGCRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DG+LGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIMQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT
        TGSARLWYRRLPAR ISTYSQLR+EF++QF SRHYD+KT THLATI QKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPAT
Subjt:  TGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIMQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDK-RSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILRNIEDSGIEKLLKR
        FAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD+ +AD KS+DK  S SS R +YRR+ S   +SRPYE +TPTTIPI EIL NIE++G+EKLLKR
Subjt:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDK-RSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILRNIEDSGIEKLLKR

Query:  PKKLRGVPERRSKDK---------------------------------------TSSAEKKEERKRSRTPLRRTDRPAVINTIFGGPNGGQSGLKRKELA
        P+KLRG PE+R+ DK                                       ++S EKKEERKR RTP RR DRPAVIN             K+KELA
Subjt:  PKKLRGVPERRSKDK---------------------------------------TSSAEKKEERKRSRTPLRRTDRPAVINTIFGGPNGGQSGLKRKELA

Query:  RAARREVCIIREQGLTCPITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL----------------
        R ARREVCIIREQ  T  I F+ ADL+ VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPL                
Subjt:  RAARREVCIIREQGLTCPITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL----------------

Query:  ------DQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCAL--KTLRD
              D T+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYA+  K SSVCAL  +T+RD
Subjt:  ------DQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCAL--KTLRD

XP_022157676.1 uncharacterized protein LOC111024332 [Momordica charantia]2.5e-18889.39Show/hide
Query:  LADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKRSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEI
        +ADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDK SFSSGRAEYRRAE+GPTRSRPYERFTPTTIPISEI
Subjt:  LADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKRSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEI

Query:  LRNIEDSGIEKLLKRPKKLRGVPERRSKDKTSSAEKKEERKRSRTPLRRTDRPAVINTIFGGPNGGQSGLKRKELARAARREVCIIREQGLTCPITFDGA
        L NIEDSG+EKLLKRP+KLRG PERRSKDKTSSAEKKEERKRSRTP RRTDRPAVINTIFGGP+GGQSG KRKELAR ARREVCIIREQG TCPITFDGA
Subjt:  LRNIEDSGIEKLLKRPKKLRGVPERRSKDKTSSAEKKEERKRSRTPLRRTDRPAVINTIFGGPNGGQSGLKRKELARAARREVCIIREQGLTCPITFDGA

Query:  DLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL----------------------DQTRVTQMAEFVVVDGRS
        DL+EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLK+SPTPL                      DQTRVTQM EFVVVDGRS
Subjt:  DLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL----------------------DQTRVTQMAEFVVVDGRS

Query:  AYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALKTLRDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQ
         YNAIFGRPIIHSFR IPSTLHQVLKYSTPNGVGTVRGEQT SRECYAAALKGSSVCAL+TLRDGTLE EADLPRKEFAAPTEELELVPLLSPEKQ
Subjt:  AYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALKTLRDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQ

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088131.6e-22581.63Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDG+LGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIMQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA F SRHYDKKTATHLATI QKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIMQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKRSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILRNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD E ADPKSKDK SFSSGRAEYRRAE+GPTRSRPYERFTPTTIPISEIL NIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKRSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILRNIE

Query:  DSGIEKLLKRPKKLRGVPERRSKDK---------------------------------------TSSAEKKEERKRSRTPLRRTDRPAVINTIFGGPNGG
        +SG+EKLLKRP+KLRG PERRSKDK                                       TSSAEKKEERKRSRTP RRTDRPAVINTIFGGP+GG
Subjt:  DSGIEKLLKRPKKLRGVPERRSKDK---------------------------------------TSSAEKKEERKRSRTPLRRTDRPAVINTIFGGPNGG

Query:  QSGLKRKELARAARREVCIIREQGLTCPITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL------
        QSG KRKELARAARREVCIIREQ  TCPITFDGADL+EVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPL      
Subjt:  QSGLKRKELARAARREVCIIREQGLTCPITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL------

Query:  ----------------DQTRVTQMAEFV
                        DQT+VTQMAEFV
Subjt:  ----------------DQTRVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188233.5e-22868.76Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDG+LGESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIMQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIMQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKRSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILR
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKDE+AD KSKDK SFSSGRAE+RRA +GPTRSRPYERFTPTTIPISEIL 
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKRSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILR

Query:  NIEDSGIEKLLKRPKKLRGVPERRSKDK---------------------------------------TSSAEKKEERKRSRTPLRRTDRPAVINTIFGGP
        NIE+SG+EKLLKRP+KLRG PERR+KDK                                       TSSAEKKEERK SRTPLRR DRPAVINTIFGGP
Subjt:  NIEDSGIEKLLKRPKKLRGVPERRSKDK---------------------------------------TSSAEKKEERKRSRTPLRRTDRPAVINTIFGGP

Query:  NGGQSGLKRKELARAARREVCIIREQGLTCPITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL---
        +GGQSG KRKELARAARREVCIIREQ  TCPITFD ADL+EVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPL   
Subjt:  NGGQSGLKRKELARAARREVCIIREQGLTCPITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL---

Query:  -------------------DQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALKTL-
                           DQT+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYA+ALKGSSVCAL+TL 
Subjt:  -------------------DQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALKTL-

Query:  -RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILE--PDLMEIG
         RDGTLEF+A+LPR+EFAAPTEELELVPLL  +      +E +L     +  +D+   +E  P+ + +G
Subjt:  -RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILE--PDLMEIG

A0A6J1D9W7 uncharacterized protein LOC1110187085.7e-18684.12Show/hide
Query:  KDDSLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
        KDDSLNDG+LGES FTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDF AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
Subjt:  KDDSLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FFSRHYDKKTATHLATIMQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
        F SR Y KKT THLATI QKEG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAKKVIDGQELLRTKTGRP+RKI
Subjt:  FFSRHYDKKTATHLATIMQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGRSGKD-ERADPKSKDKRSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILRNIEDSGIEKLLKRPKKLRGVPERRSKDK---------------
        GRGRSGKD ERADPKSKDK SFSSGRAEYRRAESGPT+SRPYERFTPTTIPISEIL NIE+SG+EKLLKRP+KLRG PERRSKDK               
Subjt:  GRGRSGKD-ERADPKSKDKRSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILRNIEDSGIEKLLKRPKKLRGVPERRSKDK---------------

Query:  ------------------------TSSAEKKEERKRSRTPLRRTDRPAVINTIFGGPNGGQSGLKRKELARAARREVCIIREQGLTCPITFDGADLKEVH
                                TSSAEKKEERKRSRTP RRTDRPAVINTIFGGP+GGQSG KRKELARAARREVCIIREQG TCPITFDGAD +EVH
Subjt:  ------------------------TSSAEKKEERKRSRTPLRRTDRPAVINTIFGGPNGGQSGLKRKELARAARREVCIIREQGLTCPITFDGADLKEVH

Query:  LPHNDALVIAPLIDHVVVRRVL
        LPHNDA VIAPLIDHVVVRRVL
Subjt:  LPHNDALVIAPLIDHVVVRRVL

A0A6J1DHB3 uncharacterized protein LOC1110204798.9e-22458.73Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKAIRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR
        MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L TEPL RSARIT P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKAIRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR

Query:  TQMRSMEAMYNEMVLAAGVGCRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMRSMEAMYNEMVLAAGVGCRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DG+LGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIMQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT
        TGSARLWYRRLPAR ISTYSQLR+EF++QF SRHYD+KT THLATI QKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPAT
Subjt:  TGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIMQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDK-RSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILRNIEDSGIEKLLKR
        FAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD+ +AD KS+DK  S SS R +YRR+ S   +SRPYE +TPTTIPI EIL NIE++G+EKLLKR
Subjt:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDK-RSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILRNIEDSGIEKLLKR

Query:  PKKLRGVPERRSKDK---------------------------------------TSSAEKKEERKRSRTPLRRTDRPAVINTIFGGPNGGQSGLKRKELA
        P+KLRG PE+R+ DK                                       ++S EKKEERKR RTP RR DRPAVIN             K+KELA
Subjt:  PKKLRGVPERRSKDK---------------------------------------TSSAEKKEERKRSRTPLRRTDRPAVINTIFGGPNGGQSGLKRKELA

Query:  RAARREVCIIREQGLTCPITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL----------------
        R ARREVCIIREQ  T  I F+ ADL+ VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPL                
Subjt:  RAARREVCIIREQGLTCPITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL----------------

Query:  ------DQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCAL--KTLRD
              D T+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYA+  K SSVCAL  +T+RD
Subjt:  ------DQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCAL--KTLRD

A0A6J1DYW5 uncharacterized protein LOC1110243321.2e-18889.39Show/hide
Query:  LADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKRSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEI
        +ADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDK SFSSGRAEYRRAE+GPTRSRPYERFTPTTIPISEI
Subjt:  LADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKRSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEI

Query:  LRNIEDSGIEKLLKRPKKLRGVPERRSKDKTSSAEKKEERKRSRTPLRRTDRPAVINTIFGGPNGGQSGLKRKELARAARREVCIIREQGLTCPITFDGA
        L NIEDSG+EKLLKRP+KLRG PERRSKDKTSSAEKKEERKRSRTP RRTDRPAVINTIFGGP+GGQSG KRKELAR ARREVCIIREQG TCPITFDGA
Subjt:  LRNIEDSGIEKLLKRPKKLRGVPERRSKDKTSSAEKKEERKRSRTPLRRTDRPAVINTIFGGPNGGQSGLKRKELARAARREVCIIREQGLTCPITFDGA

Query:  DLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL----------------------DQTRVTQMAEFVVVDGRS
        DL+EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLK+SPTPL                      DQTRVTQM EFVVVDGRS
Subjt:  DLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL----------------------DQTRVTQMAEFVVVDGRS

Query:  AYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALKTLRDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQ
         YNAIFGRPIIHSFR IPSTLHQVLKYSTPNGVGTVRGEQT SRECYAAALKGSSVCAL+TLRDGTLE EADLPRKEFAAPTEELELVPLLSPEKQ
Subjt:  AYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALKTLRDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCGAATTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGGTAGAGGGACAAGGTCACGA
CGGCCTAGCAACGGAACCCCTCCGTAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCATCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGCAATGTAT
AACGAAATGGTGCTAGCTGCAGGCGTAGGGTGCCGATCTGAAAATCGAGTGACGCGCATGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAAGAACG
TCCCGAAGACAACGAGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGCGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCT
CCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCC
TTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGAATTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGC
TCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCT
TTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTTCTCT
CGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCATGCAGAAGGAGGGTGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGC
ACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCC
AGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCC
AAGTCCAAGGACAAGAGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAGCGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCC
AATTTCCGAGATCCTAAGGAACATCGAGGATTCTGGAATCGAAAAACTACTCAAGCGTCCGAAGAAACTTCGGGGAGTCCCGGAGAGGCGCAGCAAGGACAAGACCAGCT
CAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCACTCCGGCGCACCGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAACGGGGGTCAATCCGGA
CTTAAAAGAAAGGAGTTAGCCCGTGCAGCTAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCTGACCTGCCCAATCACCTTCGACGGTGCAGACTTGAAGGAGGTCCA
CTTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCT
ACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAAAAGCCCGACGCCGCTGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAGTTGACGGTAGATCGGCC
TATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACAGTCCGAGG
AGAACAGACCGCTTCGAGGGAGTGCTATGCCGCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCAAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGA
GGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTCGCCAGGTCGGTCCCTGTC
GAGATCCTAGATAATCCCTCGATCTTAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATCACGGACTTCATTAGGGGCAACTCACCACA
AGATCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGAGTAGAGCATTACGAGCCTACGACGAATGAGGATGAGCTGCTCCTCAACCTCGACTTGTTGGAAGAAA
GGAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATTGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGG
AGGGTCCAAGCGCATGTGGGTGCCCTTGATCCGACCTGGGAGGGCCCGTTTGAGATCAAGGGCATAGTCCGACCTGGGGCATACATATTGGCCGATATGAAAGGAGACGT
CCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCGAATTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGGTAGAGGGACAAGGTCACGA
CGGCCTAGCAACGGAACCCCTCCGTAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCATCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGCAATGTAT
AACGAAATGGTGCTAGCTGCAGGCGTAGGGTGCCGATCTGAAAATCGAGTGACGCGCATGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAAGAACG
TCCCGAAGACAACGAGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGCGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCT
CCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCC
TTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGAATTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGC
TCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCT
TTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTTCTCT
CGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCATGCAGAAGGAGGGTGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGC
ACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCC
AGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCC
AAGTCCAAGGACAAGAGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAGCGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCC
AATTTCCGAGATCCTAAGGAACATCGAGGATTCTGGAATCGAAAAACTACTCAAGCGTCCGAAGAAACTTCGGGGAGTCCCGGAGAGGCGCAGCAAGGACAAGACCAGCT
CAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCACTCCGGCGCACCGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAACGGGGGTCAATCCGGA
CTTAAAAGAAAGGAGTTAGCCCGTGCAGCTAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCTGACCTGCCCAATCACCTTCGACGGTGCAGACTTGAAGGAGGTCCA
CTTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCT
ACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAAAAGCCCGACGCCGCTGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAGTTGACGGTAGATCGGCC
TATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACAGTCCGAGG
AGAACAGACCGCTTCGAGGGAGTGCTATGCCGCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCAAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGA
GGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTCGCCAGGTCGGTCCCTGTC
GAGATCCTAGATAATCCCTCGATCTTAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATCACGGACTTCATTAGGGGCAACTCACCACA
AGATCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGAGTAGAGCATTACGAGCCTACGACGAATGAGGATGAGCTGCTCCTCAACCTCGACTTGTTGGAAGAAA
GGAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATTGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGG
AGGGTCCAAGCGCATGTGGGTGCCCTTGATCCGACCTGGGAGGGCCCGTTTGAGATCAAGGGCATAGTCCGACCTGGGGCATACATATTGGCCGATATGAAAGGAGACGT
CCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKAIRGRGGTSKKGARGPAPAPTSENFDALQREMEAMRTQMRSMEAMY
NEMVLAAGVGCRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVEA
LKAKCEQKDDSLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFFS
RHYDKKTATHLATIMQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADP
KSKDKRSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILRNIEDSGIEKLLKRPKKLRGVPERRSKDKTSSAEKKEERKRSRTPLRRTDRPAVINTIFGGPNGGQSG
LKRKELARAARREVCIIREQGLTCPITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLDQTRVTQMAEFVVVDGRSA
YNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALKTLRDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPV
EILDNPSILEPDLMEIGAPESSWMDPITDFIRGNSPQDPKERRKLARRAARVEHYEPTTNEDELLLNLDLLEERRAMAQLRLAEYQGRIARHYNARVRPRAFQVGHLVLR
RVQAHVGALDPTWEGPFEIKGIVRPGAYILADMKGDVLAHPWNAEHLKRYYP