; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g09940 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g09940
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:7590126..7596269
RNA-Seq ExpressionMoc07g09940
SyntenyMoc07g09940
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]9.9e-15455.23Show/hide
Query:  YDGSKDPKDYVQVFEGLMNFQVATDAIKCRAFQIALTDSARLW--------------LRKEFISQFSSRHYDRKTATHLVTIRQKEGEMLREYVTRFQEE
        YDGSKDPKDYV+VFE LM+FQ A+DAIKCRAF+IALT SARLW              LR+EF++ FSSRHYD+KTATHL TIRQKEGE LREYVTRFQEE
Subjt:  YDGSKDPKDYVQVFEGLMNFQVATDAIKCRAFQIALTDSARLW--------------LRKEFISQFSSRHYDRKTATHLVTIRQKEGEMLREYVTRFQEE

Query:  QLKVAHCSNDSAMCYFLTGLADETLTVKLGEEASATLAEVLQKAKKVIDGQELLQTKTGRPEKQRSTDDDLRCSGSPSPVDRSRRRGRQQWRATPGSSGA
        QLKVAHCS+DSAMCYFLTGLADE LTVKLGEEA AT AEVLQKAKKVIDGQELL+TKTGRPE                     R+ GR       G SG 
Subjt:  QLKVAHCSNDSAMCYFLTGLADETLTVKLGEEASATLAEVLQKAKKVIDGQELLQTKTGRPEKQRSTDDDLRCSGSPSPVDRSRRRGRQQWRATPGSSGA

Query:  RRQAATDARIPAAAGVRGVRQQQVAAACVGAVFGVLSGGAALPSRAGSVSLGGRKSWRGSLLHTVSLVVLGGLVEIVKSDVMVQDLGDKWQGRTPSVVEG
                                                                                  +I  +D   +D G    GR       
Subjt:  RRQAATDARIPAAAGVRGVRQQQVAAACVGAVFGVLSGGAALPSRAGSVSLGGRKSWRGSLLHTVSLVVLGGLVEIVKSDVMVQDLGDKWQGRTPSVVEG

Query:  GRYLGNKCQEPLRLEELDLEYQHRGSTRKIPAERRGRQSSSLGTKGHPPSAAELSIEGRIAAPTGADLMSDIPRPPSQSSKSLQTLRRLGWKSYSKKLRG
                         +      G TR  P ER              P+   +S           +++++I     + S   + L+R       +KLRG
Subjt:  GRYLGNKCQEPLRLEELDLEYQHRGSTRKIPAERRGRQSSSLGTKGHPPSAAELSIEGRIAAPTGADLMSDIPRPPSQSSKSLQTLRRLGWKSYSKKLRG

Query:  DPGKRSKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPSSNLAEKKEERKRSRTPPRRDDRPAVINTIFGRPSGGQSGNKRKELAREARRE
         P +RSKDKYCRFHR+HGHNTS+ WELKRQIE+LIQDGYFKKFVGKP ++ AEKKEERKRSRTPPRR DRPAVINTIFG PSGGQSG KRKELAR ARRE
Subjt:  DPGKRSKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPSSNLAEKKEERKRSRTPPRRDDRPAVINTIFGRPSGGQSGNKRKELAREARRE

Query:  VCIIREQRPTCSITFDDTDLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGTSANILSLPTYLALGWTRSQLKRSLTPLVGFSGESVSPEVCIDLPVANGQ
        VCIIREQRPTC ITFD  DLE VHLPHNDALVIAPLIDHV+V RVLVD GTSANILSLPTYLALGWTRSQLK+S TPLVGFSGESV PE  IDLPV  GQ
Subjt:  VCIIREQRPTCSITFDDTDLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGTSANILSLPTYLALGWTRSQLKRSLTPLVGFSGESVSPEVCIDLPVANGQ

Query:  DSTQVTQMAEFM
        D TQVTQMAEF+
Subjt:  DSTQVTQMAEFM

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]9.2e-16852.92Show/hide
Query:  TYDGSKDPKDYVQVFEGLMNFQVATDAIKCRAFQIALTDSARLWLRKEFISQFSSRHYDRKTATHLVTIRQKEGEMLREYVTRFQEEQLKVAHCSNDSAM
        +YDGSKDPKDYV+VFEGLM+FQ A+DAIKCRAFQIALT SARLW                                       FQE+QLKVA  S+DSAM
Subjt:  TYDGSKDPKDYVQVFEGLMNFQVATDAIKCRAFQIALTDSARLWLRKEFISQFSSRHYDRKTATHLVTIRQKEGEMLREYVTRFQEEQLKVAHCSNDSAM

Query:  CYFLTGLADETLTVKLGEEASATLAEVLQKAKKVIDGQELLQTKTGRPEKQRSTDDDLRCSGSPSPVDRSRRRGRQQWRATPGSSGARRQAATDARIPAA
        CYFLTGLADE LTVKLG+EA AT AEVLQKAKKVIDGQELL+TKTGRPE+                +DR  R G+ +                       
Subjt:  CYFLTGLADETLTVKLGEEASATLAEVLQKAKKVIDGQELLQTKTGRPEKQRSTDDDLRCSGSPSPVDRSRRRGRQQWRATPGSSGARRQAATDARIPAA

Query:  AGVRGVRQQQVAAACVGAVFGVLSGGAALPSRAGSVSLGGRKSWRGSLLHTVSLVVLGGLVEIVKSDVMVQDLGDKWQGRTPSVVEGGRYLGNKCQEPLR
                                                                        K+D+  +D G    GR                    
Subjt:  AGVRGVRQQQVAAACVGAVFGVLSGGAALPSRAGSVSLGGRKSWRGSLLHTVSLVVLGGLVEIVKSDVMVQDLGDKWQGRTPSVVEGGRYLGNKCQEPLR

Query:  LEELDLEYQHRGSTRKIPAERRGRQSSSLGTKGHPPSAAELSIEGRIAAPTGADLMSDIPRPPSQSSKSLQTLRRLGWKSYSKKLRGDPGKRSKDKYCRF
            +      G TR  P ER              P+   +S           +++++I     + S   + L+R       +KLRG P +R+KDKYCRF
Subjt:  LEELDLEYQHRGSTRKIPAERRGRQSSSLGTKGHPPSAAELSIEGRIAAPTGADLMSDIPRPPSQSSKSLQTLRRLGWKSYSKKLRGDPGKRSKDKYCRF

Query:  HRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPSSNLAEKKEERKRSRTPPRRDDRPAVINTIFGRPSGGQSGNKRKELAREARREVCIIREQRPTCSI
        HR+H HNTS+ WELKRQIEDLIQD YFKKFVGKP ++ AEKKEERK SRTP RR DRPAVINTIFG PSGGQSG+KRKELAR ARREVCIIREQRPTC I
Subjt:  HRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPSSNLAEKKEERKRSRTPPRRDDRPAVINTIFGRPSGGQSGNKRKELAREARREVCIIREQRPTCSI

Query:  TFDDTDLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGTSANILSLPTYLALGWTRSQLKRSLTPLVGFSGESVSPEVCIDLPVANGQDSTQVTQMAEFMV
        TFD  DLE VHLPHNDALVIAPLIDHV+VRRVLVDEG SANI+SL TYLALGWTRSQLK+S TPLVGFS ESV PE CIDLPV  G D TQVTQMAEF+V
Subjt:  TFDDTDLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGTSANILSLPTYLALGWTRSQLKRSLTPLVGFSGESVSPEVCIDLPVANGQDSTQVTQMAEFMV

Query:  VDGRSAYNAIFGRPIIHSFRVVPSMLHQVFKYSTPNGVDTVRGEQKTSRECYASTLKGSSVCALKEQIIRDKLQESEADLPREAKRQFSALTEELELVPL
        +DGRSAYNAIFGRPIIHSFR +PS LHQV KYSTPNGV  VRGEQ  SRECYAS LKGSSVCAL+  + RD   E +A+LPR   R+F+A TEELELVPL
Subjt:  VDGRSAYNAIFGRPIIHSFRVVPSMLHQVFKYSTPNGVDTVRGEQKTSRECYASTLKGSSVCALKEQIIRDKLQESEADLPREAKRQFSALTEELELVPL

Query:  L
        L
Subjt:  L

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]2.0e-17857.06Show/hide
Query:  YDGSKDPKDYVQVFEGLMNFQVATDAIKCRAFQIALTDSARLW--------------LRKEFISQFSSRHYDRKTATHLVTIRQKEGEMLREYVTRFQEE
        YDGSKDPKDYV+VFE LM+FQ ATDAIKC AFQIALT SARLW              LRKEFISQFSSRHYDRKT THL TIRQKEGE LREYVTRF EE
Subjt:  YDGSKDPKDYVQVFEGLMNFQVATDAIKCRAFQIALTDSARLW--------------LRKEFISQFSSRHYDRKTATHLVTIRQKEGEMLREYVTRFQEE

Query:  QLKVAHCSNDSAMCYFLTGLADETLTVKLGEEASATLAEVLQKAKKVIDGQELLQTKTGRPEKQRSTDDDLRCSGSPSPVD-RSRRRGRQQWRATPGSSG
        QLKVAHCS+DSAMCYFLTGLADETLTVKL EEA AT AEVLQK KKVIDGQELL+TKTGRPEK     D  R        D +SR +G       P SS 
Subjt:  QLKVAHCSNDSAMCYFLTGLADETLTVKLGEEASATLAEVLQKAKKVIDGQELLQTKTGRPEKQRSTDDDLRCSGSPSPVD-RSRRRGRQQWRATPGSSG

Query:  ARRQAATDARIPAAAGVRGVRQQQVAAACVGAVFGVLSGGAALPSRAGSVSLGGRKSWRGSLLHTVSLVVLGGLVEIVKSDVMVQDLGDKWQGRTPSVVE
        +R     D R                                                R +  H  S                       ++  TP+ + 
Subjt:  ARRQAATDARIPAAAGVRGVRQQQVAAACVGAVFGVLSGGAALPSRAGSVSLGGRKSWRGSLLHTVSLVVLGGLVEIVKSDVMVQDLGDKWQGRTPSVVE

Query:  GGRYLGNKCQEPLRLEELDLEYQHRGSTRKIPAERRGRQSSSLGTKGHPPSAAELSIEGRIAAPTGADLMSDIPRPPSQSSKSLQTLRRLGWKSYSKKLR
            L N       +EE  +E                                                               + L+R       +KLR
Subjt:  GGRYLGNKCQEPLRLEELDLEYQHRGSTRKIPAERRGRQSSSLGTKGHPPSAAELSIEGRIAAPTGADLMSDIPRPPSQSSKSLQTLRRLGWKSYSKKLR

Query:  GDPGKRSKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPSSNLAEKKEERKRSRTPPRRDDRPAVINTIFGRPSGGQSGNKRKELAREARR
        GDP KR+ DKYCRFHRDHGHNTSN WELKRQIEDLIQDGYFKKFVGKP SN  EKKEERKR RTPPRRDDRPAVI             NK+KELAREARR
Subjt:  GDPGKRSKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPSSNLAEKKEERKRSRTPPRRDDRPAVINTIFGRPSGGQSGNKRKELAREARR

Query:  EVCIIREQRPTCSITFDDTDLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGTSANILSLPTYLALGWTRSQLKRSLTPLVGFSGESVSPEVCIDLPVANG
        EVCIIREQRPT SI F+  DLEGVHLPHNDALVIAPLID VLVRR+LVD G SANILSL TYLALGWTRSQLK+S TPLVGFSGES+S E CIDLPV+  
Subjt:  EVCIIREQRPTCSITFDDTDLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGTSANILSLPTYLALGWTRSQLKRSLTPLVGFSGESVSPEVCIDLPVANG

Query:  QDSTQVTQMAEFMVVDGRSAYNAIFGRPIIHSFRVVPSMLHQVFKYSTPNGVDTVRGEQKTSRECYASTLKGSSVCALKEQIIRDKL
        QD TQVTQMAEF+V+DGRSAYNAIFGRPIIHSFR VPS LHQV KYST NGV TVRGE KTSRECYAS  K SSVCAL+EQ IRD+L
Subjt:  QDSTQVTQMAEFMVVDGRSAYNAIFGRPIIHSFRVVPSMLHQVFKYSTPNGVDTVRGEQKTSRECYASTLKGSSVCALKEQIIRDKL

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]5.8e-14648.95Show/hide
Query:  YDGSKDPKDYVQVFEGLMNFQVATDAIKCRAFQIALTDSARLW--------------LRKEFISQFSSRHYDRKTATHLVTIRQKEGEMLREYVTRFQEE
        YDGSKDPKDYV+VFEGLM+FQ ATDAIKC AFQIALT SARLW              LRKEFI QFS RHYDRKTATHL TIRQKE              
Subjt:  YDGSKDPKDYVQVFEGLMNFQVATDAIKCRAFQIALTDSARLW--------------LRKEFISQFSSRHYDRKTATHLVTIRQKEGEMLREYVTRFQEE

Query:  QLKVAHCSNDSAMCYFLTGLADETLTVKLGEEASATLAEVLQKAKKVIDGQELLQTKTGRPEKQRSTDDDLRCSGSPSPVDRSRRRGRQQWRATPGSSGA
                             DETLTVKLGEEA AT AEVLQ AKKVIDGQELL+TKT RPEKQ               +D+ R               +
Subjt:  QLKVAHCSNDSAMCYFLTGLADETLTVKLGEEASATLAEVLQKAKKVIDGQELLQTKTGRPEKQRSTDDDLRCSGSPSPVDRSRRRGRQQWRATPGSSGA

Query:  RRQAATDARIPAAAGVRGVRQQQVAAACVGAVFGVLSGGAALPSRAGSVSLGGRKSWRGSLLHTVSLVVLGGLVEIVKSDVMVQDLGDKWQGRTPSVVEG
        +++   D++                                     GS S G R  +R S                                        
Subjt:  RRQAATDARIPAAAGVRGVRQQQVAAACVGAVFGVLSGGAALPSRAGSVSLGGRKSWRGSLLHTVSLVVLGGLVEIVKSDVMVQDLGDKWQGRTPSVVEG

Query:  GRYLGNKCQEPLRLEELDLEYQHRGSTRKIPAERRGRQSSSLGTKGHPPSAAELSIEGRIAAPTGADLMSDIPRPPSQSSKSLQTLRRLGWKSYSKKLRG
                                G +R  P ER                                                                  
Subjt:  GRYLGNKCQEPLRLEELDLEYQHRGSTRKIPAERRGRQSSSLGTKGHPPSAAELSIEGRIAAPTGADLMSDIPRPPSQSSKSLQTLRRLGWKSYSKKLRG

Query:  DPGKRSKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPSSNLAEKKEERKRSRTPPRRDDRPAVINTIFGRPSGGQSGNKRKELAREARRE
                               CWELKRQIEDLIQD YFKKFVGKP SN  EKKEERKRSRTPPRR+DRPAVINTIFG PSGGQ  NKRKELA EARR+
Subjt:  DPGKRSKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPSSNLAEKKEERKRSRTPPRRDDRPAVINTIFGRPSGGQSGNKRKELAREARRE

Query:  VCIIREQRPTCSITFDDTDLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGTSANILSLPTYLALGWTRSQLKRSLTPLVGFSGESVSPEVCIDLPVANGQ
        V IIREQ+PTCSITF DTDLEGVHLPHNDALVIAPLIDHVLVRRVLVD G SANILSLPTYLAL  TRSQLK+S TPLVGFS ESVSPE CIDLPV  GQ
Subjt:  VCIIREQRPTCSITFDDTDLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGTSANILSLPTYLALGWTRSQLKRSLTPLVGFSGESVSPEVCIDLPVANGQ

Query:  DSTQVTQMAEFMVVDGRSAYNAIFGRPIIHSFRVVPSMLHQVFKYSTPNGVDTVRGEQKTSRECYASTLKGSSVCALKEQIIRDKLQESEADLPREAKRQ
        DSTQVTQMAEF+V+DGR AYNAIF RPIIHSF+ VPS+LHQV KYSTPNGV TVRGEQKTSRECYAS LK SSVCAL+EQ  +D       DLPREAK  
Subjt:  DSTQVTQMAEFMVVDGRSAYNAIFGRPIIHSFRVVPSMLHQVFKYSTPNGVDTVRGEQKTSRECYASTLKGSSVCALKEQIIRDKLQESEADLPREAKRQ

Query:  FSALTEELELVPLLS
                 L P L+
Subjt:  FSALTEELELVPLLS

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]1.2e-17254.62Show/hide
Query:  MNFQVATDAIKCRAFQIALTDSARLW--------------LRKEFISQFSSRHYDRKTATHLVTIRQKEGEMLREYVTRFQEEQLKVAHCSNDSAMCYFL
        M+FQ ATDAIKCRAFQIALT SARLW              LRKEFISQFSS HYDRKTATHL TIRQKE E LREYVTRFQEEQLKVAHCS+DSAMCYFL
Subjt:  MNFQVATDAIKCRAFQIALTDSARLW--------------LRKEFISQFSSRHYDRKTATHLVTIRQKEGEMLREYVTRFQEEQLKVAHCSNDSAMCYFL

Query:  TGLADETLTVKLGEEASATLAEVLQKAKKVIDGQELLQTKTGRPEKQRSTDDDLRCSGSPSPVDRSRRRGRQQWRATPGSSGARRQAATDARIPAAAGVR
        T LADETLTVKLGEEA  T  EVLQKAKKVIDGQELL+TKTGRPEKQ               +D+ +             S  +R+A + +R        
Subjt:  TGLADETLTVKLGEEASATLAEVLQKAKKVIDGQELLQTKTGRPEKQRSTDDDLRCSGSPSPVDRSRRRGRQQWRATPGSSGARRQAATDARIPAAAGVR

Query:  GVRQQQVAAACVGAVFGVLSGGAALPSRAGSVSLGGRKSWRGSLLHTVSLVVLGGLVEIVKSDVMVQDLGDKWQGRTPSVVEGGRYLGNKCQEPLRLEEL
                                     GS S   R  +R                                                      RLE  
Subjt:  GVRQQQVAAACVGAVFGVLSGGAALPSRAGSVSLGGRKSWRGSLLHTVSLVVLGGLVEIVKSDVMVQDLGDKWQGRTPSVVEGGRYLGNKCQEPLRLEEL

Query:  DLEYQHRGSTRKIPAERRGRQSSSLGTKGHPPSAAELSIEGRIAAPTGADLMSDIPRPPSQSSKSLQTLRRLGWKSYSKKLRGDPGKRSKDKYCRFHRDH
               G +R  P ER            +  S   +S           +++++I     + S   + L+R       +KLRGD  KR+K+KYCRFHRDH
Subjt:  DLEYQHRGSTRKIPAERRGRQSSSLGTKGHPPSAAELSIEGRIAAPTGADLMSDIPRPPSQSSKSLQTLRRLGWKSYSKKLRGDPGKRSKDKYCRFHRDH

Query:  GHNTSNCWELKRQIEDLIQDGYFKKFVGKPSSNLAEKKEERKRSRTPPRRDDRPAVINTIFGRPSGGQSGNKRKELAREARREVCIIREQRPTCSITFDD
        GHNT++CWELKRQIEDLIQDGYFKKFVGKP SN  EKKEERKRSRTPPRR+DRPAVINTIFG P+GGQSGNKRKELAREARREVCIIRE +PTCSITF D
Subjt:  GHNTSNCWELKRQIEDLIQDGYFKKFVGKPSSNLAEKKEERKRSRTPPRRDDRPAVINTIFGRPSGGQSGNKRKELAREARREVCIIREQRPTCSITFDD

Query:  TDLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGTSANILSLPTYLALGWTRSQLKRSLTPLVGFSGESVSPEVCIDLPVANGQDSTQVTQMAEFMVVDGR
         DLEGVHLPHNDALVIA LIDH LVRRVL+D G                                        CIDLPV  GQD+TQVTQMAEF+V+DGR
Subjt:  TDLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGTSANILSLPTYLALGWTRSQLKRSLTPLVGFSGESVSPEVCIDLPVANGQDSTQVTQMAEFMVVDGR

Query:  SAYNAIFGRPIIHSFRVVPSMLHQVFKYSTPNGVDTVRGEQKTSRECYASTLKGSSVCALKEQIIRDKLQESEADLPREAKRQFSALTEELELVPLLSPE
        SAYNAIFGRPIIHSFR VPS LHQV KYSTPN V  VRGEQKTSRECYAS LKGS+VCAL+EQ  R KLQESEADLP+E KRQF   TEELELVPLLSPE
Subjt:  SAYNAIFGRPIIHSFRVVPSMLHQVFKYSTPNGVDTVRGEQKTSRECYASTLKGSSVCALKEQIIRDKLQESEADLPREAKRQFSALTEELELVPLLSPE

Query:  KQA
        +QA
Subjt:  KQA

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088134.8e-15455.23Show/hide
Query:  YDGSKDPKDYVQVFEGLMNFQVATDAIKCRAFQIALTDSARLW--------------LRKEFISQFSSRHYDRKTATHLVTIRQKEGEMLREYVTRFQEE
        YDGSKDPKDYV+VFE LM+FQ A+DAIKCRAF+IALT SARLW              LR+EF++ FSSRHYD+KTATHL TIRQKEGE LREYVTRFQEE
Subjt:  YDGSKDPKDYVQVFEGLMNFQVATDAIKCRAFQIALTDSARLW--------------LRKEFISQFSSRHYDRKTATHLVTIRQKEGEMLREYVTRFQEE

Query:  QLKVAHCSNDSAMCYFLTGLADETLTVKLGEEASATLAEVLQKAKKVIDGQELLQTKTGRPEKQRSTDDDLRCSGSPSPVDRSRRRGRQQWRATPGSSGA
        QLKVAHCS+DSAMCYFLTGLADE LTVKLGEEA AT AEVLQKAKKVIDGQELL+TKTGRPE                     R+ GR       G SG 
Subjt:  QLKVAHCSNDSAMCYFLTGLADETLTVKLGEEASATLAEVLQKAKKVIDGQELLQTKTGRPEKQRSTDDDLRCSGSPSPVDRSRRRGRQQWRATPGSSGA

Query:  RRQAATDARIPAAAGVRGVRQQQVAAACVGAVFGVLSGGAALPSRAGSVSLGGRKSWRGSLLHTVSLVVLGGLVEIVKSDVMVQDLGDKWQGRTPSVVEG
                                                                                  +I  +D   +D G    GR       
Subjt:  RRQAATDARIPAAAGVRGVRQQQVAAACVGAVFGVLSGGAALPSRAGSVSLGGRKSWRGSLLHTVSLVVLGGLVEIVKSDVMVQDLGDKWQGRTPSVVEG

Query:  GRYLGNKCQEPLRLEELDLEYQHRGSTRKIPAERRGRQSSSLGTKGHPPSAAELSIEGRIAAPTGADLMSDIPRPPSQSSKSLQTLRRLGWKSYSKKLRG
                         +      G TR  P ER              P+   +S           +++++I     + S   + L+R       +KLRG
Subjt:  GRYLGNKCQEPLRLEELDLEYQHRGSTRKIPAERRGRQSSSLGTKGHPPSAAELSIEGRIAAPTGADLMSDIPRPPSQSSKSLQTLRRLGWKSYSKKLRG

Query:  DPGKRSKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPSSNLAEKKEERKRSRTPPRRDDRPAVINTIFGRPSGGQSGNKRKELAREARRE
         P +RSKDKYCRFHR+HGHNTS+ WELKRQIE+LIQDGYFKKFVGKP ++ AEKKEERKRSRTPPRR DRPAVINTIFG PSGGQSG KRKELAR ARRE
Subjt:  DPGKRSKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPSSNLAEKKEERKRSRTPPRRDDRPAVINTIFGRPSGGQSGNKRKELAREARRE

Query:  VCIIREQRPTCSITFDDTDLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGTSANILSLPTYLALGWTRSQLKRSLTPLVGFSGESVSPEVCIDLPVANGQ
        VCIIREQRPTC ITFD  DLE VHLPHNDALVIAPLIDHV+V RVLVD GTSANILSLPTYLALGWTRSQLK+S TPLVGFSGESV PE  IDLPV  GQ
Subjt:  VCIIREQRPTCSITFDDTDLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGTSANILSLPTYLALGWTRSQLKRSLTPLVGFSGESVSPEVCIDLPVANGQ

Query:  DSTQVTQMAEFM
        D TQVTQMAEF+
Subjt:  DSTQVTQMAEFM

A0A6J1D9E1 uncharacterized protein LOC1110188234.4e-16852.92Show/hide
Query:  TYDGSKDPKDYVQVFEGLMNFQVATDAIKCRAFQIALTDSARLWLRKEFISQFSSRHYDRKTATHLVTIRQKEGEMLREYVTRFQEEQLKVAHCSNDSAM
        +YDGSKDPKDYV+VFEGLM+FQ A+DAIKCRAFQIALT SARLW                                       FQE+QLKVA  S+DSAM
Subjt:  TYDGSKDPKDYVQVFEGLMNFQVATDAIKCRAFQIALTDSARLWLRKEFISQFSSRHYDRKTATHLVTIRQKEGEMLREYVTRFQEEQLKVAHCSNDSAM

Query:  CYFLTGLADETLTVKLGEEASATLAEVLQKAKKVIDGQELLQTKTGRPEKQRSTDDDLRCSGSPSPVDRSRRRGRQQWRATPGSSGARRQAATDARIPAA
        CYFLTGLADE LTVKLG+EA AT AEVLQKAKKVIDGQELL+TKTGRPE+                +DR  R G+ +                       
Subjt:  CYFLTGLADETLTVKLGEEASATLAEVLQKAKKVIDGQELLQTKTGRPEKQRSTDDDLRCSGSPSPVDRSRRRGRQQWRATPGSSGARRQAATDARIPAA

Query:  AGVRGVRQQQVAAACVGAVFGVLSGGAALPSRAGSVSLGGRKSWRGSLLHTVSLVVLGGLVEIVKSDVMVQDLGDKWQGRTPSVVEGGRYLGNKCQEPLR
                                                                        K+D+  +D G    GR                    
Subjt:  AGVRGVRQQQVAAACVGAVFGVLSGGAALPSRAGSVSLGGRKSWRGSLLHTVSLVVLGGLVEIVKSDVMVQDLGDKWQGRTPSVVEGGRYLGNKCQEPLR

Query:  LEELDLEYQHRGSTRKIPAERRGRQSSSLGTKGHPPSAAELSIEGRIAAPTGADLMSDIPRPPSQSSKSLQTLRRLGWKSYSKKLRGDPGKRSKDKYCRF
            +      G TR  P ER              P+   +S           +++++I     + S   + L+R       +KLRG P +R+KDKYCRF
Subjt:  LEELDLEYQHRGSTRKIPAERRGRQSSSLGTKGHPPSAAELSIEGRIAAPTGADLMSDIPRPPSQSSKSLQTLRRLGWKSYSKKLRGDPGKRSKDKYCRF

Query:  HRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPSSNLAEKKEERKRSRTPPRRDDRPAVINTIFGRPSGGQSGNKRKELAREARREVCIIREQRPTCSI
        HR+H HNTS+ WELKRQIEDLIQD YFKKFVGKP ++ AEKKEERK SRTP RR DRPAVINTIFG PSGGQSG+KRKELAR ARREVCIIREQRPTC I
Subjt:  HRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPSSNLAEKKEERKRSRTPPRRDDRPAVINTIFGRPSGGQSGNKRKELAREARREVCIIREQRPTCSI

Query:  TFDDTDLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGTSANILSLPTYLALGWTRSQLKRSLTPLVGFSGESVSPEVCIDLPVANGQDSTQVTQMAEFMV
        TFD  DLE VHLPHNDALVIAPLIDHV+VRRVLVDEG SANI+SL TYLALGWTRSQLK+S TPLVGFS ESV PE CIDLPV  G D TQVTQMAEF+V
Subjt:  TFDDTDLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGTSANILSLPTYLALGWTRSQLKRSLTPLVGFSGESVSPEVCIDLPVANGQDSTQVTQMAEFMV

Query:  VDGRSAYNAIFGRPIIHSFRVVPSMLHQVFKYSTPNGVDTVRGEQKTSRECYASTLKGSSVCALKEQIIRDKLQESEADLPREAKRQFSALTEELELVPL
        +DGRSAYNAIFGRPIIHSFR +PS LHQV KYSTPNGV  VRGEQ  SRECYAS LKGSSVCAL+  + RD   E +A+LPR   R+F+A TEELELVPL
Subjt:  VDGRSAYNAIFGRPIIHSFRVVPSMLHQVFKYSTPNGVDTVRGEQKTSRECYASTLKGSSVCALKEQIIRDKLQESEADLPREAKRQFSALTEELELVPL

Query:  L
        L
Subjt:  L

A0A6J1DHB3 uncharacterized protein LOC1110204799.6e-17957.06Show/hide
Query:  YDGSKDPKDYVQVFEGLMNFQVATDAIKCRAFQIALTDSARLW--------------LRKEFISQFSSRHYDRKTATHLVTIRQKEGEMLREYVTRFQEE
        YDGSKDPKDYV+VFE LM+FQ ATDAIKC AFQIALT SARLW              LRKEFISQFSSRHYDRKT THL TIRQKEGE LREYVTRF EE
Subjt:  YDGSKDPKDYVQVFEGLMNFQVATDAIKCRAFQIALTDSARLW--------------LRKEFISQFSSRHYDRKTATHLVTIRQKEGEMLREYVTRFQEE

Query:  QLKVAHCSNDSAMCYFLTGLADETLTVKLGEEASATLAEVLQKAKKVIDGQELLQTKTGRPEKQRSTDDDLRCSGSPSPVD-RSRRRGRQQWRATPGSSG
        QLKVAHCS+DSAMCYFLTGLADETLTVKL EEA AT AEVLQK KKVIDGQELL+TKTGRPEK     D  R        D +SR +G       P SS 
Subjt:  QLKVAHCSNDSAMCYFLTGLADETLTVKLGEEASATLAEVLQKAKKVIDGQELLQTKTGRPEKQRSTDDDLRCSGSPSPVD-RSRRRGRQQWRATPGSSG

Query:  ARRQAATDARIPAAAGVRGVRQQQVAAACVGAVFGVLSGGAALPSRAGSVSLGGRKSWRGSLLHTVSLVVLGGLVEIVKSDVMVQDLGDKWQGRTPSVVE
        +R     D R                                                R +  H  S                       ++  TP+ + 
Subjt:  ARRQAATDARIPAAAGVRGVRQQQVAAACVGAVFGVLSGGAALPSRAGSVSLGGRKSWRGSLLHTVSLVVLGGLVEIVKSDVMVQDLGDKWQGRTPSVVE

Query:  GGRYLGNKCQEPLRLEELDLEYQHRGSTRKIPAERRGRQSSSLGTKGHPPSAAELSIEGRIAAPTGADLMSDIPRPPSQSSKSLQTLRRLGWKSYSKKLR
            L N       +EE  +E                                                               + L+R       +KLR
Subjt:  GGRYLGNKCQEPLRLEELDLEYQHRGSTRKIPAERRGRQSSSLGTKGHPPSAAELSIEGRIAAPTGADLMSDIPRPPSQSSKSLQTLRRLGWKSYSKKLR

Query:  GDPGKRSKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPSSNLAEKKEERKRSRTPPRRDDRPAVINTIFGRPSGGQSGNKRKELAREARR
        GDP KR+ DKYCRFHRDHGHNTSN WELKRQIEDLIQDGYFKKFVGKP SN  EKKEERKR RTPPRRDDRPAVI             NK+KELAREARR
Subjt:  GDPGKRSKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPSSNLAEKKEERKRSRTPPRRDDRPAVINTIFGRPSGGQSGNKRKELAREARR

Query:  EVCIIREQRPTCSITFDDTDLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGTSANILSLPTYLALGWTRSQLKRSLTPLVGFSGESVSPEVCIDLPVANG
        EVCIIREQRPT SI F+  DLEGVHLPHNDALVIAPLID VLVRR+LVD G SANILSL TYLALGWTRSQLK+S TPLVGFSGES+S E CIDLPV+  
Subjt:  EVCIIREQRPTCSITFDDTDLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGTSANILSLPTYLALGWTRSQLKRSLTPLVGFSGESVSPEVCIDLPVANG

Query:  QDSTQVTQMAEFMVVDGRSAYNAIFGRPIIHSFRVVPSMLHQVFKYSTPNGVDTVRGEQKTSRECYASTLKGSSVCALKEQIIRDKL
        QD TQVTQMAEF+V+DGRSAYNAIFGRPIIHSFR VPS LHQV KYST NGV TVRGE KTSRECYAS  K SSVCAL+EQ IRD+L
Subjt:  QDSTQVTQMAEFMVVDGRSAYNAIFGRPIIHSFRVVPSMLHQVFKYSTPNGVDTVRGEQKTSRECYASTLKGSSVCALKEQIIRDKL

A0A6J1DPC9 uncharacterized protein LOC1110222802.8e-14648.95Show/hide
Query:  YDGSKDPKDYVQVFEGLMNFQVATDAIKCRAFQIALTDSARLW--------------LRKEFISQFSSRHYDRKTATHLVTIRQKEGEMLREYVTRFQEE
        YDGSKDPKDYV+VFEGLM+FQ ATDAIKC AFQIALT SARLW              LRKEFI QFS RHYDRKTATHL TIRQKE              
Subjt:  YDGSKDPKDYVQVFEGLMNFQVATDAIKCRAFQIALTDSARLW--------------LRKEFISQFSSRHYDRKTATHLVTIRQKEGEMLREYVTRFQEE

Query:  QLKVAHCSNDSAMCYFLTGLADETLTVKLGEEASATLAEVLQKAKKVIDGQELLQTKTGRPEKQRSTDDDLRCSGSPSPVDRSRRRGRQQWRATPGSSGA
                             DETLTVKLGEEA AT AEVLQ AKKVIDGQELL+TKT RPEKQ               +D+ R               +
Subjt:  QLKVAHCSNDSAMCYFLTGLADETLTVKLGEEASATLAEVLQKAKKVIDGQELLQTKTGRPEKQRSTDDDLRCSGSPSPVDRSRRRGRQQWRATPGSSGA

Query:  RRQAATDARIPAAAGVRGVRQQQVAAACVGAVFGVLSGGAALPSRAGSVSLGGRKSWRGSLLHTVSLVVLGGLVEIVKSDVMVQDLGDKWQGRTPSVVEG
        +++   D++                                     GS S G R  +R S                                        
Subjt:  RRQAATDARIPAAAGVRGVRQQQVAAACVGAVFGVLSGGAALPSRAGSVSLGGRKSWRGSLLHTVSLVVLGGLVEIVKSDVMVQDLGDKWQGRTPSVVEG

Query:  GRYLGNKCQEPLRLEELDLEYQHRGSTRKIPAERRGRQSSSLGTKGHPPSAAELSIEGRIAAPTGADLMSDIPRPPSQSSKSLQTLRRLGWKSYSKKLRG
                                G +R  P ER                                                                  
Subjt:  GRYLGNKCQEPLRLEELDLEYQHRGSTRKIPAERRGRQSSSLGTKGHPPSAAELSIEGRIAAPTGADLMSDIPRPPSQSSKSLQTLRRLGWKSYSKKLRG

Query:  DPGKRSKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPSSNLAEKKEERKRSRTPPRRDDRPAVINTIFGRPSGGQSGNKRKELAREARRE
                               CWELKRQIEDLIQD YFKKFVGKP SN  EKKEERKRSRTPPRR+DRPAVINTIFG PSGGQ  NKRKELA EARR+
Subjt:  DPGKRSKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPSSNLAEKKEERKRSRTPPRRDDRPAVINTIFGRPSGGQSGNKRKELAREARRE

Query:  VCIIREQRPTCSITFDDTDLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGTSANILSLPTYLALGWTRSQLKRSLTPLVGFSGESVSPEVCIDLPVANGQ
        V IIREQ+PTCSITF DTDLEGVHLPHNDALVIAPLIDHVLVRRVLVD G SANILSLPTYLAL  TRSQLK+S TPLVGFS ESVSPE CIDLPV  GQ
Subjt:  VCIIREQRPTCSITFDDTDLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGTSANILSLPTYLALGWTRSQLKRSLTPLVGFSGESVSPEVCIDLPVANGQ

Query:  DSTQVTQMAEFMVVDGRSAYNAIFGRPIIHSFRVVPSMLHQVFKYSTPNGVDTVRGEQKTSRECYASTLKGSSVCALKEQIIRDKLQESEADLPREAKRQ
        DSTQVTQMAEF+V+DGR AYNAIF RPIIHSF+ VPS+LHQV KYSTPNGV TVRGEQKTSRECYAS LK SSVCAL+EQ  +D       DLPREAK  
Subjt:  DSTQVTQMAEFMVVDGRSAYNAIFGRPIIHSFRVVPSMLHQVFKYSTPNGVDTVRGEQKTSRECYASTLKGSSVCALKEQIIRDKLQESEADLPREAKRQ

Query:  FSALTEELELVPLLS
                 L P L+
Subjt:  FSALTEELELVPLLS

A0A6J1DZB9 uncharacterized protein LOC1110249046.0e-17354.62Show/hide
Query:  MNFQVATDAIKCRAFQIALTDSARLW--------------LRKEFISQFSSRHYDRKTATHLVTIRQKEGEMLREYVTRFQEEQLKVAHCSNDSAMCYFL
        M+FQ ATDAIKCRAFQIALT SARLW              LRKEFISQFSS HYDRKTATHL TIRQKE E LREYVTRFQEEQLKVAHCS+DSAMCYFL
Subjt:  MNFQVATDAIKCRAFQIALTDSARLW--------------LRKEFISQFSSRHYDRKTATHLVTIRQKEGEMLREYVTRFQEEQLKVAHCSNDSAMCYFL

Query:  TGLADETLTVKLGEEASATLAEVLQKAKKVIDGQELLQTKTGRPEKQRSTDDDLRCSGSPSPVDRSRRRGRQQWRATPGSSGARRQAATDARIPAAAGVR
        T LADETLTVKLGEEA  T  EVLQKAKKVIDGQELL+TKTGRPEKQ               +D+ +             S  +R+A + +R        
Subjt:  TGLADETLTVKLGEEASATLAEVLQKAKKVIDGQELLQTKTGRPEKQRSTDDDLRCSGSPSPVDRSRRRGRQQWRATPGSSGARRQAATDARIPAAAGVR

Query:  GVRQQQVAAACVGAVFGVLSGGAALPSRAGSVSLGGRKSWRGSLLHTVSLVVLGGLVEIVKSDVMVQDLGDKWQGRTPSVVEGGRYLGNKCQEPLRLEEL
                                     GS S   R  +R                                                      RLE  
Subjt:  GVRQQQVAAACVGAVFGVLSGGAALPSRAGSVSLGGRKSWRGSLLHTVSLVVLGGLVEIVKSDVMVQDLGDKWQGRTPSVVEGGRYLGNKCQEPLRLEEL

Query:  DLEYQHRGSTRKIPAERRGRQSSSLGTKGHPPSAAELSIEGRIAAPTGADLMSDIPRPPSQSSKSLQTLRRLGWKSYSKKLRGDPGKRSKDKYCRFHRDH
               G +R  P ER            +  S   +S           +++++I     + S   + L+R       +KLRGD  KR+K+KYCRFHRDH
Subjt:  DLEYQHRGSTRKIPAERRGRQSSSLGTKGHPPSAAELSIEGRIAAPTGADLMSDIPRPPSQSSKSLQTLRRLGWKSYSKKLRGDPGKRSKDKYCRFHRDH

Query:  GHNTSNCWELKRQIEDLIQDGYFKKFVGKPSSNLAEKKEERKRSRTPPRRDDRPAVINTIFGRPSGGQSGNKRKELAREARREVCIIREQRPTCSITFDD
        GHNT++CWELKRQIEDLIQDGYFKKFVGKP SN  EKKEERKRSRTPPRR+DRPAVINTIFG P+GGQSGNKRKELAREARREVCIIRE +PTCSITF D
Subjt:  GHNTSNCWELKRQIEDLIQDGYFKKFVGKPSSNLAEKKEERKRSRTPPRRDDRPAVINTIFGRPSGGQSGNKRKELAREARREVCIIREQRPTCSITFDD

Query:  TDLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGTSANILSLPTYLALGWTRSQLKRSLTPLVGFSGESVSPEVCIDLPVANGQDSTQVTQMAEFMVVDGR
         DLEGVHLPHNDALVIA LIDH LVRRVL+D G                                        CIDLPV  GQD+TQVTQMAEF+V+DGR
Subjt:  TDLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGTSANILSLPTYLALGWTRSQLKRSLTPLVGFSGESVSPEVCIDLPVANGQDSTQVTQMAEFMVVDGR

Query:  SAYNAIFGRPIIHSFRVVPSMLHQVFKYSTPNGVDTVRGEQKTSRECYASTLKGSSVCALKEQIIRDKLQESEADLPREAKRQFSALTEELELVPLLSPE
        SAYNAIFGRPIIHSFR VPS LHQV KYSTPN V  VRGEQKTSRECYAS LKGS+VCAL+EQ  R KLQESEADLP+E KRQF   TEELELVPLLSPE
Subjt:  SAYNAIFGRPIIHSFRVVPSMLHQVFKYSTPNGVDTVRGEQKTSRECYASTLKGSSVCALKEQIIRDKLQESEADLPREAKRQFSALTEELELVPLLSPE

Query:  KQA
        +QA
Subjt:  KQA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTAATCATCCGGTCATGAATCACGTTTCGTCGCATGTCACAGAGCAAGATCTTCCAGAGCTCCGATCTATGTACAAAATCCCCGACTATATTCTTATGCGCCTTCC
GACGGAGGAGGAAATGATAGACAATCCTCCACCTGGGTGGGTGTGCGCTTGCATCTATAGAATTGTTACCACGCTTAGTGCGTGCGTTCAGCATAAACCCCCAGTCACAT
ATGACGGATCTAAGGACCCTAAAGATTACGTTCAGGTCTTTGAAGGCCTCATGAACTTTCAAGTGGCAACAGATGCTATCAAGTGCCGCGCCTTCCAGATTGCCCTAACT
GACAGCGCACGTTTATGGTTGAGGAAAGAGTTCATTAGTCAATTCTCTTCTCGGCATTATGATAGAAAGACAGCAACTCACCTCGTCACCATTAGACAGAAGGAAGGCGA
GATGCTTAGAGAGTACGTCACAAGGTTCCAAGAGGAGCAGCTAAAAGTCGCGCACTGCTCCAATGATTCGGCCATGTGTTACTTCCTCACCGGCCTGGCCGATGAGACCC
TTACTGTGAAGCTTGGAGAGGAGGCTTCAGCCACATTAGCCGAAGTATTGCAAAAGGCAAAGAAAGTTATCGACGGACAAGAACTCCTCCAGACCAAGACTGGCCGACCT
GAAAAGCAGCGCTCAACGGACGACGACCTCCGATGCAGCGGCTCACCTTCTCCGGTTGACAGAAGCAGACGGCGCGGCAGGCAGCAGTGGCGTGCGACTCCGGGCAGCAG
CGGTGCGCGGCGTCAGGCAGCAACGGACGCGCGGATTCCGGCAGCAGCAGGCGTGCGGGGCGTTAGGCAGCAGCAAGTGGCGGCGGCGTGCGTGGGTGCTGTCTTCGGGG
TGCTTTCCGGCGGCGCAGCCCTTCCCTCGCGAGCGGGTTCGGTCTCCCTTGGTGGTCGAAAGTCGTGGCGCGGGTCCTTACTGCACACAGTGTCGTTGGTTGTCTTGGGA
GGATTAGTGGAAATTGTCAAGAGTGATGTGATGGTTCAAGACCTTGGGGATAAATGGCAAGGTCGAACACCAAGCGTGGTAGAGGGAGGTCGGTATCTTGGGAATAAATG
TCAAGAGCCGTTACGCCTCGAAGAGTTAGACCTGGAGTACCAACATAGGGGGTCGACCAGAAAAATCCCAGCCGAGAGAAGAGGAAGGCAGAGTTCAAGTCTAGGGACAA
AGGGTCATCCTCCTTCAGCGGCCGAGCTGAGTATCGAAGGTCGGATAGCGGCCCCAACCGGAGCCGACCTTATGAGCGATATACCCCGACCACCATCCCAATCTTCGAAA
TCCTTACAAACATTGAGGAGACTGGGATGGAAAAGCTACTCAAAGAAGCTCCGAGGGGACCCAGGAAAGCGCAGCAAGGATAAGTATTGTCGGTTTCATCGCGACCATGG
CCACAATACATCAAATTGCTGGGAACTGAAGCGCCAGATTGAAGACCTTATTCAAGATGGCTACTTCAAAAAATTCGTTGGCAAACCGAGTTCCAACTTGGCGGAGAAAA
AAGAAGAGAGGAAGCGTTCGAGAACGCCCCCTCGCCGAGATGACCGACCTGCGGTCATCAACACTATTTTCGGACGCCCAAGTGGCGGCCAATCTGGAAATAAAAGAAAA
GAACTAGCTCGCGAAGCCAGGCGCGAGGTATGCATCATTAGGGAGCAGAGACCGACCTGCTCCATTACCTTTGATGATACCGACCTAGAGGGGGTCCACTTGCCCCATAA
TGACGCACTGGTGATCGCCCCTCTTATCGATCACGTCCTGGTCAGAAGAGTATTGGTAGATGAAGGTACATCTGCCAACATCTTGTCCCTTCCAACATATCTCGCTCTTG
GGTGGACCAGGTCGCAGCTGAAAAGGAGTCTGACGCCCTTGGTTGGATTCTCTGGAGAGTCGGTCTCCCCAGAAGTGTGTATTGACTTGCCGGTCGCGAACGGACAAGAT
AGTACACAAGTAACGCAGATGGCCGAGTTCATGGTGGTCGACGGTAGGTCAGCCTACAACGCCATCTTTGGGAGACCCATCATCCACTCATTCCGGGTCGTTCCTTCAAT
GCTTCATCAAGTCTTCAAATACTCAACTCCCAATGGAGTGGACACGGTCCGAGGGGAGCAGAAGACTTCAAGGGAGTGTTATGCCTCCACGCTTAAAGGGTCGTCAGTAT
GCGCCCTGAAAGAACAAATAATTCGGGACAAGCTGCAAGAGTCTGAGGCCGACCTACCTAGAGAAGCCAAAAGGCAGTTCTCCGCACTAACAGAGGAGCTCGAGCTTGTT
CCCTTGCTTAGCCCTGAAAAACAAGCATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTAATCATCCGGTCATGAATCACGTTTCGTCGCATGTCACAGAGCAAGATCTTCCAGAGCTCCGATCTATGTACAAAATCCCCGACTATATTCTTATGCGCCTTCC
GACGGAGGAGGAAATGATAGACAATCCTCCACCTGGGTGGGTGTGCGCTTGCATCTATAGAATTGTTACCACGCTTAGTGCGTGCGTTCAGCATAAACCCCCAGTCACAT
ATGACGGATCTAAGGACCCTAAAGATTACGTTCAGGTCTTTGAAGGCCTCATGAACTTTCAAGTGGCAACAGATGCTATCAAGTGCCGCGCCTTCCAGATTGCCCTAACT
GACAGCGCACGTTTATGGTTGAGGAAAGAGTTCATTAGTCAATTCTCTTCTCGGCATTATGATAGAAAGACAGCAACTCACCTCGTCACCATTAGACAGAAGGAAGGCGA
GATGCTTAGAGAGTACGTCACAAGGTTCCAAGAGGAGCAGCTAAAAGTCGCGCACTGCTCCAATGATTCGGCCATGTGTTACTTCCTCACCGGCCTGGCCGATGAGACCC
TTACTGTGAAGCTTGGAGAGGAGGCTTCAGCCACATTAGCCGAAGTATTGCAAAAGGCAAAGAAAGTTATCGACGGACAAGAACTCCTCCAGACCAAGACTGGCCGACCT
GAAAAGCAGCGCTCAACGGACGACGACCTCCGATGCAGCGGCTCACCTTCTCCGGTTGACAGAAGCAGACGGCGCGGCAGGCAGCAGTGGCGTGCGACTCCGGGCAGCAG
CGGTGCGCGGCGTCAGGCAGCAACGGACGCGCGGATTCCGGCAGCAGCAGGCGTGCGGGGCGTTAGGCAGCAGCAAGTGGCGGCGGCGTGCGTGGGTGCTGTCTTCGGGG
TGCTTTCCGGCGGCGCAGCCCTTCCCTCGCGAGCGGGTTCGGTCTCCCTTGGTGGTCGAAAGTCGTGGCGCGGGTCCTTACTGCACACAGTGTCGTTGGTTGTCTTGGGA
GGATTAGTGGAAATTGTCAAGAGTGATGTGATGGTTCAAGACCTTGGGGATAAATGGCAAGGTCGAACACCAAGCGTGGTAGAGGGAGGTCGGTATCTTGGGAATAAATG
TCAAGAGCCGTTACGCCTCGAAGAGTTAGACCTGGAGTACCAACATAGGGGGTCGACCAGAAAAATCCCAGCCGAGAGAAGAGGAAGGCAGAGTTCAAGTCTAGGGACAA
AGGGTCATCCTCCTTCAGCGGCCGAGCTGAGTATCGAAGGTCGGATAGCGGCCCCAACCGGAGCCGACCTTATGAGCGATATACCCCGACCACCATCCCAATCTTCGAAA
TCCTTACAAACATTGAGGAGACTGGGATGGAAAAGCTACTCAAAGAAGCTCCGAGGGGACCCAGGAAAGCGCAGCAAGGATAAGTATTGTCGGTTTCATCGCGACCATGG
CCACAATACATCAAATTGCTGGGAACTGAAGCGCCAGATTGAAGACCTTATTCAAGATGGCTACTTCAAAAAATTCGTTGGCAAACCGAGTTCCAACTTGGCGGAGAAAA
AAGAAGAGAGGAAGCGTTCGAGAACGCCCCCTCGCCGAGATGACCGACCTGCGGTCATCAACACTATTTTCGGACGCCCAAGTGGCGGCCAATCTGGAAATAAAAGAAAA
GAACTAGCTCGCGAAGCCAGGCGCGAGGTATGCATCATTAGGGAGCAGAGACCGACCTGCTCCATTACCTTTGATGATACCGACCTAGAGGGGGTCCACTTGCCCCATAA
TGACGCACTGGTGATCGCCCCTCTTATCGATCACGTCCTGGTCAGAAGAGTATTGGTAGATGAAGGTACATCTGCCAACATCTTGTCCCTTCCAACATATCTCGCTCTTG
GGTGGACCAGGTCGCAGCTGAAAAGGAGTCTGACGCCCTTGGTTGGATTCTCTGGAGAGTCGGTCTCCCCAGAAGTGTGTATTGACTTGCCGGTCGCGAACGGACAAGAT
AGTACACAAGTAACGCAGATGGCCGAGTTCATGGTGGTCGACGGTAGGTCAGCCTACAACGCCATCTTTGGGAGACCCATCATCCACTCATTCCGGGTCGTTCCTTCAAT
GCTTCATCAAGTCTTCAAATACTCAACTCCCAATGGAGTGGACACGGTCCGAGGGGAGCAGAAGACTTCAAGGGAGTGTTATGCCTCCACGCTTAAAGGGTCGTCAGTAT
GCGCCCTGAAAGAACAAATAATTCGGGACAAGCTGCAAGAGTCTGAGGCCGACCTACCTAGAGAAGCCAAAAGGCAGTTCTCCGCACTAACAGAGGAGCTCGAGCTTGTT
CCCTTGCTTAGCCCTGAAAAACAAGCATAG
Protein sequenceShow/hide protein sequence
MANHPVMNHVSSHVTEQDLPELRSMYKIPDYILMRLPTEEEMIDNPPPGWVCACIYRIVTTLSACVQHKPPVTYDGSKDPKDYVQVFEGLMNFQVATDAIKCRAFQIALT
DSARLWLRKEFISQFSSRHYDRKTATHLVTIRQKEGEMLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADETLTVKLGEEASATLAEVLQKAKKVIDGQELLQTKTGRP
EKQRSTDDDLRCSGSPSPVDRSRRRGRQQWRATPGSSGARRQAATDARIPAAAGVRGVRQQQVAAACVGAVFGVLSGGAALPSRAGSVSLGGRKSWRGSLLHTVSLVVLG
GLVEIVKSDVMVQDLGDKWQGRTPSVVEGGRYLGNKCQEPLRLEELDLEYQHRGSTRKIPAERRGRQSSSLGTKGHPPSAAELSIEGRIAAPTGADLMSDIPRPPSQSSK
SLQTLRRLGWKSYSKKLRGDPGKRSKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKPSSNLAEKKEERKRSRTPPRRDDRPAVINTIFGRPSGGQSGNKRK
ELAREARREVCIIREQRPTCSITFDDTDLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGTSANILSLPTYLALGWTRSQLKRSLTPLVGFSGESVSPEVCIDLPVANGQD
STQVTQMAEFMVVDGRSAYNAIFGRPIIHSFRVVPSMLHQVFKYSTPNGVDTVRGEQKTSRECYASTLKGSSVCALKEQIIRDKLQESEADLPREAKRQFSALTEELELV
PLLSPEKQA