; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g33430 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g33430
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr6:25361241..25362559
RNA-Seq ExpressionMoc06g33430
SyntenyMoc06g33430
Gene Ontology termsNA
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFY98609.1 haloacid dehalogenase-like hydrolase (HAD) superfamily protein [Actinidia rufa]1.2e-7646.28Show/hide
Query:  MLIALSGKNKVGFIDGTIKKPNG---NLLAAWKCNNDIITSWIINSVSKEIAASIIYTGSAKDIWDELKERFQQSDAPRIFQLRKELVTTIQGTLSIEAY
        M+IALS KNK+GFIDG+I KP G   NLL +W  NN+++ SWI+NSVSKEI+ASII++ SA +IW +LK+RFQQS+ PRIFQLR+EL+  +Q    +  Y
Subjt:  MLIALSGKNKVGFIDGTIKKPNG---NLLAAWKCNNDIITSWIINSVSKEIAASIIYTGSAKDIWDELKERFQQSDAPRIFQLRKELVTTIQGTLSIEAY

Query:  YTKLKTVWQELTDYRPTI---DCTCSGLKSLSEFFQSEYVMTFLMGLNESYAKIRAQILLMDPIPPMNKVFSLLIQEERQRAIG----TINPPLPSMAMA
        +TKLKT+W+EL +YRP     +CTC G+K L+  +Q EY+M+FLM L+ S+A+IR Q+LLMDP+PP+NKVFSL+ QEE QR IG    +I+    +MA A
Subjt:  YTKLKTVWQELTDYRPTI---DCTCSGLKSLSEFFQSEYVMTFLMGLNESYAKIRAQILLMDPIPPMNKVFSLLIQEERQRAIG----TINPPLPSMAMA

Query:  VAEIS------------------KRNSATQFRRKDNRSFCTHCGLRGHVIDKCYKLHGYPPGYRANNPAARIGQLHNPNGTSHSNGVVANQVSEKNVDIT
        +   +                   +NSA+   +K +R+FCTHC   GH I+KCYK HGYPPG++   P +R     +   TS+S+  V NQVS  +  I+
Subjt:  VAEIS------------------KRNSATQFRRKDNRSFCTHCGLRGHVIDKCYKLHGYPPGYRANNPAARIGQLHNPNGTSHSNGVVANQVSEKNVDIT

Query:  SSPAIQRPSNSSPAFFNSLNSSQYSQLMEMLQSHL-QAAKPETITP-MNHVAGQVTIENDWQG
         +   Q+ +  +  F  +LNS+QY QLM ML +H+  + K +   P  ++  G    E+DWQG
Subjt:  SSPAIQRPSNSSPAFFNSLNSSQYSQLMEMLQSHL-QAAKPETITP-MNHVAGQVTIENDWQG

KAA0065480.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]7.3e-9055.05Show/hide
Query:  MLIALSGKNKVGFIDGTIKKP-NGNLLAAWKCNNDIITSWIINSVSKEIAASIIYTGSAKDIWDELKERFQQSDAPRIFQLRKELVTTIQGTLSIEAYYT
        ML+A+SG+NK GFI G I+KP +G LL AW CNNDI+ SWI+NSVSKEIAASIIY GS K+IWDEL++RF+QS+ P I+QLRKE VT  QG L+IE YYT
Subjt:  MLIALSGKNKVGFIDGTIKKP-NGNLLAAWKCNNDIITSWIINSVSKEIAASIIYTGSAKDIWDELKERFQQSDAPRIFQLRKELVTTIQGTLSIEAYYT

Query:  KLKTVWQELTDYRPTIDCTCSGLKSLSEFFQSEYVMTFLMGLNESYAKIRAQILLMDPIPPMNKVFSLLIQEERQRAIGTINPPLPSMAMAVAEISKRNS
        KLKT+WQ L +YR T DCTC GLK   +  +SEY+M FLMGLN+SYA +RAQILLM P+P +N VFSLLIQEE+QR+ G + PP+  +A+ +A  S    
Subjt:  KLKTVWQELTDYRPTIDCTCSGLKSLSEFFQSEYVMTFLMGLNESYAKIRAQILLMDPIPPMNKVFSLLIQEERQRAIGTINPPLPSMAMAVAEISKRNS

Query:  ATQFRRKDNRSFCTHCGLRGHVIDKCYKLHGYPPGYRANNPAARIGQLHNPNGTSHSNGVVANQVSEKNVDITSSPAIQRPSNSSPAFFNSLNSSQYSQL
        +T   RK  R  C++CG++GH+ DKCYK HGYPPGY+  N              S+S     +     NV  T+S A    +N SP FF+SLNS QYSQL
Subjt:  ATQFRRKDNRSFCTHCGLRGHVIDKCYKLHGYPPGYRANNPAARIGQLHNPNGTSHSNGVVANQVSEKNVDITSSPAIQRPSNSSPAFFNSLNSSQYSQL

Query:  MEMLQSHLQAAKP---ETITPMNHVAG
        M +L +HLQAA      T T + H +G
Subjt:  MEMLQSHLQAAKP---ETITPMNHVAG

XP_020965447.1 uncharacterized protein LOC110266048 [Arachis ipaensis]4.8e-7345Show/hide
Query:  MLIALSGKNKVGFIDGTIKKPNGN----LLAAWKCNNDIITSWIINSVSKEIAASIIYTGSAKDIWDELKERFQQSDAPRIFQLRKELVTTIQGTLSIEA
        M +ALS K K+GFIDG++ KP+      L+  W+C NDI+T+W++NS+SK+IAAS+IY GSA  +W +L+ RF QS+APRIF+L+K L+T  QG+L++  
Subjt:  MLIALSGKNKVGFIDGTIKKPNGN----LLAAWKCNNDIITSWIINSVSKEIAASIIYTGSAKDIWDELKERFQQSDAPRIFQLRKELVTTIQGTLSIEA

Query:  YYTKLKTVWQELTDYRPTIDCTCSGLKSLSEFFQSEYVMTFLMGLNESYAKIRAQILLMDPIPPMNKVFSLLIQEERQRAIGTINPPLPSMAMAVAEISK
        Y+TKLK +W+EL  ++P + C+C G+K +  +   EYVM FLMGLN++ A +R+QILL DP+PP+ KVFSL++QEE+Q+A+ +  PP   MA AV +  +
Subjt:  YYTKLKTVWQELTDYRPTIDCTCSGLKSLSEFFQSEYVMTFLMGLNESYAKIRAQILLMDPIPPMNKVFSLLIQEERQRAIGTINPPLPSMAMAVAEISK

Query:  --RNSATQFRRKDNRSFCTHCGLRGHVIDKCYKLHGYPPGYRANNPAARIGQLHNPNGTSHSNGVVANQVSEKNVDITSSPAIQRPSNSSPAFFNSLNSS
            S ++ + K +R  C HCG  GH  +KCYKLHGYPPGY          Q  N    +  N V   + S++N D        +PSN       SL +S
Subjt:  --RNSATQFRRKDNRSFCTHCGLRGHVIDKCYKLHGYPPGYRANNPAARIGQLHNPNGTSHSNGVVANQVSEKNVDITSSPAIQRPSNSSPAFFNSLNSS

Query:  QYSQLMEMLQSH--LQAAKPETITPMNHVAGQVTIENDWQ
        QY+QLM +LQ+   +QA +PE        AGQ   E DWQ
Subjt:  QYSQLMEMLQSH--LQAAKPETITPMNHVAGQVTIENDWQ

XP_022145891.1 uncharacterized protein LOC111015239 [Momordica charantia]3.4e-188100Show/hide
Query:  MLIALSGKNKVGFIDGTIKKPNGNLLAAWKCNNDIITSWIINSVSKEIAASIIYTGSAKDIWDELKERFQQSDAPRIFQLRKELVTTIQGTLSIEAYYTK
        MLIALSGKNKVGFIDGTIKKPNGNLLAAWKCNNDIITSWIINSVSKEIAASIIYTGSAKDIWDELKERFQQSDAPRIFQLRKELVTTIQGTLSIEAYYTK
Subjt:  MLIALSGKNKVGFIDGTIKKPNGNLLAAWKCNNDIITSWIINSVSKEIAASIIYTGSAKDIWDELKERFQQSDAPRIFQLRKELVTTIQGTLSIEAYYTK

Query:  LKTVWQELTDYRPTIDCTCSGLKSLSEFFQSEYVMTFLMGLNESYAKIRAQILLMDPIPPMNKVFSLLIQEERQRAIGTINPPLPSMAMAVAEISKRNSA
        LKTVWQELTDYRPTIDCTCSGLKSLSEFFQSEYVMTFLMGLNESYAKIRAQILLMDPIPPMNKVFSLLIQEERQRAIGTINPPLPSMAMAVAEISKRNSA
Subjt:  LKTVWQELTDYRPTIDCTCSGLKSLSEFFQSEYVMTFLMGLNESYAKIRAQILLMDPIPPMNKVFSLLIQEERQRAIGTINPPLPSMAMAVAEISKRNSA

Query:  TQFRRKDNRSFCTHCGLRGHVIDKCYKLHGYPPGYRANNPAARIGQLHNPNGTSHSNGVVANQVSEKNVDITSSPAIQRPSNSSPAFFNSLNSSQYSQLM
        TQFRRKDNRSFCTHCGLRGHVIDKCYKLHGYPPGYRANNPAARIGQLHNPNGTSHSNGVVANQVSEKNVDITSSPAIQRPSNSSPAFFNSLNSSQYSQLM
Subjt:  TQFRRKDNRSFCTHCGLRGHVIDKCYKLHGYPPGYRANNPAARIGQLHNPNGTSHSNGVVANQVSEKNVDITSSPAIQRPSNSSPAFFNSLNSSQYSQLM

Query:  EMLQSHLQAAKPETITPMNHVAGQVTIENDWQG
        EMLQSHLQAAKPETITPMNHVAGQVTIENDWQG
Subjt:  EMLQSHLQAAKPETITPMNHVAGQVTIENDWQG

XP_022155284.1 uncharacterized protein LOC111022420 [Momordica charantia]1.1e-7461.78Show/hide
Query:  TIKKPNGNLLAAWKCNNDIITSWIINSVSKEIAASIIYTGSAKDIWDELKERFQQSDAPRIFQLRKELVTTIQGTLSIEAYYTKLKTVWQELTDYRPTID
        TI KP  N+L+AWKCNND+I  WI+NSVS++IAAS++Y+ SA DIW+EL++RFQQS+ PRI+QLRKE VT       IEAYYTKLKTVWQEL++Y  +  
Subjt:  TIKKPNGNLLAAWKCNNDIITSWIINSVSKEIAASIIYTGSAKDIWDELKERFQQSDAPRIFQLRKELVTTIQGTLSIEAYYTKLKTVWQELTDYRPTID

Query:  CTCSGLKSLSEFFQSEYVMTFLMGLNESYAKIRAQILLMDPIPPMNKVFSLLIQEERQRAIGTINPPLPSMAMAVAEISKRNSATQFRRKD-NRSFCTHC
        CTC GLK +   F SEYVM FLMGLNESYA +RAQIL MDP+PP+NKVFSLLIQEE  R++  +     S+A+A  E+SKR    +FR+K+  R FCTHC
Subjt:  CTCSGLKSLSEFFQSEYVMTFLMGLNESYAKIRAQILLMDPIPPMNKVFSLLIQEERQRAIGTINPPLPSMAMAVAEISKRNSATQFRRKD-NRSFCTHC

Query:  GLRGHVIDKCYKLHGYPPGYRANNP
        G++GH+I+ CYKLHGYPP YR  +P
Subjt:  GLRGHVIDKCYKLHGYPPGYRANNP

TrEMBL top hitse value%identityAlignment
A0A2Z7AXW3 Uncharacterized protein8.2e-7148.48Show/hide
Query:  MLIALSGKNKVGFIDGTIKKPNGN---LLAAWKCNNDIITSWIINSVSKEIAASIIYTGSAKDIWDELKERFQQSDAPRIFQLRKELVTTIQGTLSIEAY
        MLIALS KNK+GFIDG+I +P+ +   LL +W  NN+I+ SWI+NSVSKEI+ASII+  SA  IW +LK+RFQQS+ PRIFQLR+EL+   Q  LS+  Y
Subjt:  MLIALSGKNKVGFIDGTIKKPNGN---LLAAWKCNNDIITSWIINSVSKEIAASIIYTGSAKDIWDELKERFQQSDAPRIFQLRKELVTTIQGTLSIEAY

Query:  YTKLKTVWQELTDYRPTI---DCTCSGLKSLSEFFQSEYVMTFLMGLNESYAKIRAQILLMDPIPPMNKVFSLLIQEERQRAIGTINPPLP-----SMAM
        +TKLK +W EL+++RP      CTC G+K L+   Q EYVM FLMGLN++YA+IR Q+LL+DP+PP+NKVFSL+ QEERQR IG    P P     +MA 
Subjt:  YTKLKTVWQELTDYRPTI---DCTCSGLKSLSEFFQSEYVMTFLMGLNESYAKIRAQILLMDPIPPMNKVFSLLIQEERQRAIGTINPPLP-----SMAM

Query:  AV-AEISKRNSAT--------QFRRKDNRSFCTHCGLRGHVIDKCYKLHGYPPGYRANNPAARIGQLHNPNGTSHSNGVVANQVSEKNVDITSSPAIQRP
        AV  + +++N AT          R  +NR FCT C + GH +D CYK+HGYPPGY          Q       S+ + V  NQV        S P    P
Subjt:  AV-AEISKRNSAT--------QFRRKDNRSFCTHCGLRGHVIDKCYKLHGYPPGYRANNPAARIGQLHNPNGTSHSNGVVANQVSEKNVDITSSPAIQRP

Query:  SNSSPAFFNSLNSSQYSQLMEMLQSHLQAA
        +         LNS+Q  QL+ ML +HL  A
Subjt:  SNSSPAFFNSLNSSQYSQLMEMLQSHLQAA

A0A5A7VE66 Cysteine-rich RLK (Receptor-like protein kinase) 83.5e-9055.05Show/hide
Query:  MLIALSGKNKVGFIDGTIKKP-NGNLLAAWKCNNDIITSWIINSVSKEIAASIIYTGSAKDIWDELKERFQQSDAPRIFQLRKELVTTIQGTLSIEAYYT
        ML+A+SG+NK GFI G I+KP +G LL AW CNNDI+ SWI+NSVSKEIAASIIY GS K+IWDEL++RF+QS+ P I+QLRKE VT  QG L+IE YYT
Subjt:  MLIALSGKNKVGFIDGTIKKP-NGNLLAAWKCNNDIITSWIINSVSKEIAASIIYTGSAKDIWDELKERFQQSDAPRIFQLRKELVTTIQGTLSIEAYYT

Query:  KLKTVWQELTDYRPTIDCTCSGLKSLSEFFQSEYVMTFLMGLNESYAKIRAQILLMDPIPPMNKVFSLLIQEERQRAIGTINPPLPSMAMAVAEISKRNS
        KLKT+WQ L +YR T DCTC GLK   +  +SEY+M FLMGLN+SYA +RAQILLM P+P +N VFSLLIQEE+QR+ G + PP+  +A+ +A  S    
Subjt:  KLKTVWQELTDYRPTIDCTCSGLKSLSEFFQSEYVMTFLMGLNESYAKIRAQILLMDPIPPMNKVFSLLIQEERQRAIGTINPPLPSMAMAVAEISKRNS

Query:  ATQFRRKDNRSFCTHCGLRGHVIDKCYKLHGYPPGYRANNPAARIGQLHNPNGTSHSNGVVANQVSEKNVDITSSPAIQRPSNSSPAFFNSLNSSQYSQL
        +T   RK  R  C++CG++GH+ DKCYK HGYPPGY+  N              S+S     +     NV  T+S A    +N SP FF+SLNS QYSQL
Subjt:  ATQFRRKDNRSFCTHCGLRGHVIDKCYKLHGYPPGYRANNPAARIGQLHNPNGTSHSNGVVANQVSEKNVDITSSPAIQRPSNSSPAFFNSLNSSQYSQL

Query:  MEMLQSHLQAAKP---ETITPMNHVAG
        M +L +HLQAA      T T + H +G
Subjt:  MEMLQSHLQAAKP---ETITPMNHVAG

A0A6J1CXR2 uncharacterized protein LOC1110152391.7e-188100Show/hide
Query:  MLIALSGKNKVGFIDGTIKKPNGNLLAAWKCNNDIITSWIINSVSKEIAASIIYTGSAKDIWDELKERFQQSDAPRIFQLRKELVTTIQGTLSIEAYYTK
        MLIALSGKNKVGFIDGTIKKPNGNLLAAWKCNNDIITSWIINSVSKEIAASIIYTGSAKDIWDELKERFQQSDAPRIFQLRKELVTTIQGTLSIEAYYTK
Subjt:  MLIALSGKNKVGFIDGTIKKPNGNLLAAWKCNNDIITSWIINSVSKEIAASIIYTGSAKDIWDELKERFQQSDAPRIFQLRKELVTTIQGTLSIEAYYTK

Query:  LKTVWQELTDYRPTIDCTCSGLKSLSEFFQSEYVMTFLMGLNESYAKIRAQILLMDPIPPMNKVFSLLIQEERQRAIGTINPPLPSMAMAVAEISKRNSA
        LKTVWQELTDYRPTIDCTCSGLKSLSEFFQSEYVMTFLMGLNESYAKIRAQILLMDPIPPMNKVFSLLIQEERQRAIGTINPPLPSMAMAVAEISKRNSA
Subjt:  LKTVWQELTDYRPTIDCTCSGLKSLSEFFQSEYVMTFLMGLNESYAKIRAQILLMDPIPPMNKVFSLLIQEERQRAIGTINPPLPSMAMAVAEISKRNSA

Query:  TQFRRKDNRSFCTHCGLRGHVIDKCYKLHGYPPGYRANNPAARIGQLHNPNGTSHSNGVVANQVSEKNVDITSSPAIQRPSNSSPAFFNSLNSSQYSQLM
        TQFRRKDNRSFCTHCGLRGHVIDKCYKLHGYPPGYRANNPAARIGQLHNPNGTSHSNGVVANQVSEKNVDITSSPAIQRPSNSSPAFFNSLNSSQYSQLM
Subjt:  TQFRRKDNRSFCTHCGLRGHVIDKCYKLHGYPPGYRANNPAARIGQLHNPNGTSHSNGVVANQVSEKNVDITSSPAIQRPSNSSPAFFNSLNSSQYSQLM

Query:  EMLQSHLQAAKPETITPMNHVAGQVTIENDWQG
        EMLQSHLQAAKPETITPMNHVAGQVTIENDWQG
Subjt:  EMLQSHLQAAKPETITPMNHVAGQVTIENDWQG

A0A6J1DPT8 uncharacterized protein LOC1110224205.5e-7561.78Show/hide
Query:  TIKKPNGNLLAAWKCNNDIITSWIINSVSKEIAASIIYTGSAKDIWDELKERFQQSDAPRIFQLRKELVTTIQGTLSIEAYYTKLKTVWQELTDYRPTID
        TI KP  N+L+AWKCNND+I  WI+NSVS++IAAS++Y+ SA DIW+EL++RFQQS+ PRI+QLRKE VT       IEAYYTKLKTVWQEL++Y  +  
Subjt:  TIKKPNGNLLAAWKCNNDIITSWIINSVSKEIAASIIYTGSAKDIWDELKERFQQSDAPRIFQLRKELVTTIQGTLSIEAYYTKLKTVWQELTDYRPTID

Query:  CTCSGLKSLSEFFQSEYVMTFLMGLNESYAKIRAQILLMDPIPPMNKVFSLLIQEERQRAIGTINPPLPSMAMAVAEISKRNSATQFRRKD-NRSFCTHC
        CTC GLK +   F SEYVM FLMGLNESYA +RAQIL MDP+PP+NKVFSLLIQEE  R++  +     S+A+A  E+SKR    +FR+K+  R FCTHC
Subjt:  CTCSGLKSLSEFFQSEYVMTFLMGLNESYAKIRAQILLMDPIPPMNKVFSLLIQEERQRAIGTINPPLPSMAMAVAEISKRNSATQFRRKD-NRSFCTHC

Query:  GLRGHVIDKCYKLHGYPPGYRANNP
        G++GH+I+ CYKLHGYPP YR  +P
Subjt:  GLRGHVIDKCYKLHGYPPGYRANNP

A0A7J0FKC9 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein5.9e-7746.28Show/hide
Query:  MLIALSGKNKVGFIDGTIKKPNG---NLLAAWKCNNDIITSWIINSVSKEIAASIIYTGSAKDIWDELKERFQQSDAPRIFQLRKELVTTIQGTLSIEAY
        M+IALS KNK+GFIDG+I KP G   NLL +W  NN+++ SWI+NSVSKEI+ASII++ SA +IW +LK+RFQQS+ PRIFQLR+EL+  +Q    +  Y
Subjt:  MLIALSGKNKVGFIDGTIKKPNG---NLLAAWKCNNDIITSWIINSVSKEIAASIIYTGSAKDIWDELKERFQQSDAPRIFQLRKELVTTIQGTLSIEAY

Query:  YTKLKTVWQELTDYRPTI---DCTCSGLKSLSEFFQSEYVMTFLMGLNESYAKIRAQILLMDPIPPMNKVFSLLIQEERQRAIG----TINPPLPSMAMA
        +TKLKT+W+EL +YRP     +CTC G+K L+  +Q EY+M+FLM L+ S+A+IR Q+LLMDP+PP+NKVFSL+ QEE QR IG    +I+    +MA A
Subjt:  YTKLKTVWQELTDYRPTI---DCTCSGLKSLSEFFQSEYVMTFLMGLNESYAKIRAQILLMDPIPPMNKVFSLLIQEERQRAIG----TINPPLPSMAMA

Query:  VAEIS------------------KRNSATQFRRKDNRSFCTHCGLRGHVIDKCYKLHGYPPGYRANNPAARIGQLHNPNGTSHSNGVVANQVSEKNVDIT
        +   +                   +NSA+   +K +R+FCTHC   GH I+KCYK HGYPPG++   P +R     +   TS+S+  V NQVS  +  I+
Subjt:  VAEIS------------------KRNSATQFRRKDNRSFCTHCGLRGHVIDKCYKLHGYPPGYRANNPAARIGQLHNPNGTSHSNGVVANQVSEKNVDIT

Query:  SSPAIQRPSNSSPAFFNSLNSSQYSQLMEMLQSHL-QAAKPETITP-MNHVAGQVTIENDWQG
         +   Q+ +  +  F  +LNS+QY QLM ML +H+  + K +   P  ++  G    E+DWQG
Subjt:  SSPAIQRPSNSSPAFFNSLNSSQYSQLMEMLQSHL-QAAKPETITP-MNHVAGQVTIENDWQG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).2.2e-2333.14Show/hide
Query:  KVGFIDGTIKKPN--GNLLAAWKCNNDIITSWIINSVSKEIAASIIYTGSAKDIWDELKERFQQSDAPRIFQLRKELVTTIQGTLSIEAYYTKLKTVWQE
        K GFIDGT+ KP+    L   W+  N ++  W++NS++ ++  S++Y  +A  +W++L+  F      +I+QLR+ L T  QG  S+E Y+ KL  VW E
Subjt:  KVGFIDGTIKKPN--GNLLAAWKCNNDIITSWIINSVSKEIAASIIYTGSAKDIWDELKERFQQSDAPRIFQLRKELVTTIQGTLSIEAYYTKLKTVWQE

Query:  LTDYRPTIDCTCSG-----LKSLSEFFQSEYVMTFLMG--LNESYAKIRAQILLMDPIPPMNKVFSLLIQEE
        L++Y P  +C C G      K   E  + E    FLMG  LN+ +  +  +I+   P P +++ F+++   E
Subjt:  LTDYRPTIDCTCSG-----LKSLSEFFQSEYVMTFLMG--LNESYAKIRAQILLMDPIPPMNKVFSLLIQEE

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)4.7e-1029.45Show/hide
Query:  VGFIDGTIKKPNGNLLAAWKCNNDIITSWIINSVSKEIAASIIYTG-SAKDIWDELKERFQQSDAPRIFQLRKELVTTIQGTLSIEAYYTKLKTVWQELT
        +G IDG+   P       WK  + ++  WI  +++  +  +II  G +A+D+W  L+  F+ +   R  Q   EL TT    LS+  Y  KLK+    L+
Subjt:  VGFIDGTIKKPNGNLLAAWKCNNDIITSWIINSVSKEIAASIIYTG-SAKDIWDELKERFQQSDAPRIFQLRKELVTTIQGTLSIEAYYTKLKTVWQELT

Query:  DYRPTIDCTCSGLKSLSEFFQSEYVMTFLMGLNESYAKIRAQILLMDPIPPMNKVFSLLIQEE
        D    +D         S       VM  L GL E Y  I   I    P P   +  S+L+ EE
Subjt:  DYRPTIDCTCSGLKSLSEFFQSEYVMTFLMGLNESYAKIRAQILLMDPIPPMNKVFSLLIQEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGATCGCACTCTCTGGAAAGAACAAGGTCGGATTCATTGACGGCACTATCAAGAAACCTAATGGAAATTTGCTTGCAGCGTGGAAGTGCAACAATGACATAATAAC
TTCCTGGATTATCAATTCGGTTTCAAAGGAGATAGCCGCGAGCATCATCTATACTGGATCTGCGAAGGATATTTGGGATGAGCTTAAGGAGCGCTTTCAACAAAGCGATG
CTCCGCGAATCTTCCAACTTCGTAAGGAATTGGTTACTACAATCCAAGGAACTCTGTCTATTGAAGCTTATTATACGAAATTAAAGACTGTCTGGCAAGAACTTACGGAT
TATCGACCGACTATTGACTGCACATGCTCTGGATTGAAGTCTCTTTCGGAATTCTTTCAATCAGAGTATGTTATGACTTTCCTCATGGGGCTTAACGAATCTTATGCCAA
AATTAGGGCCCAGATTCTTCTTATGGATCCAATTCCACCTATGAACAAGGTTTTTTCATTACTCATCCAAGAGGAACGTCAACGTGCTATAGGCACCATCAATCCTCCTC
TTCCTTCGATGGCCATGGCTGTTGCTGAAATTTCCAAGCGAAATAGTGCTACTCAATTTCGCCGAAAGGATAATCGATCCTTCTGTACTCATTGCGGCCTTCGTGGACAT
GTAATTGACAAATGTTACAAATTGCATGGTTATCCTCCAGGATATCGTGCTAATAATCCTGCTGCTAGGATCGGCCAACTGCATAATCCCAATGGAACCTCTCATTCCAA
TGGCGTTGTGGCTAATCAGGTCTCTGAGAAGAATGTTGATATTACTTCATCCCCTGCCATTCAACGACCATCAAATAGTTCGCCTGCTTTCTTTAATAGCCTCAATTCCA
GCCAATACTCACAACTTATGGAGATGCTTCAATCTCACCTTCAGGCTGCTAAACCAGAGACCATCACGCCTATGAATCATGTTGCAGGACAGGTCACCATTGAGAATGAT
TGGCAGGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGATCGCACTCTCTGGAAAGAACAAGGTCGGATTCATTGACGGCACTATCAAGAAACCTAATGGAAATTTGCTTGCAGCGTGGAAGTGCAACAATGACATAATAAC
TTCCTGGATTATCAATTCGGTTTCAAAGGAGATAGCCGCGAGCATCATCTATACTGGATCTGCGAAGGATATTTGGGATGAGCTTAAGGAGCGCTTTCAACAAAGCGATG
CTCCGCGAATCTTCCAACTTCGTAAGGAATTGGTTACTACAATCCAAGGAACTCTGTCTATTGAAGCTTATTATACGAAATTAAAGACTGTCTGGCAAGAACTTACGGAT
TATCGACCGACTATTGACTGCACATGCTCTGGATTGAAGTCTCTTTCGGAATTCTTTCAATCAGAGTATGTTATGACTTTCCTCATGGGGCTTAACGAATCTTATGCCAA
AATTAGGGCCCAGATTCTTCTTATGGATCCAATTCCACCTATGAACAAGGTTTTTTCATTACTCATCCAAGAGGAACGTCAACGTGCTATAGGCACCATCAATCCTCCTC
TTCCTTCGATGGCCATGGCTGTTGCTGAAATTTCCAAGCGAAATAGTGCTACTCAATTTCGCCGAAAGGATAATCGATCCTTCTGTACTCATTGCGGCCTTCGTGGACAT
GTAATTGACAAATGTTACAAATTGCATGGTTATCCTCCAGGATATCGTGCTAATAATCCTGCTGCTAGGATCGGCCAACTGCATAATCCCAATGGAACCTCTCATTCCAA
TGGCGTTGTGGCTAATCAGGTCTCTGAGAAGAATGTTGATATTACTTCATCCCCTGCCATTCAACGACCATCAAATAGTTCGCCTGCTTTCTTTAATAGCCTCAATTCCA
GCCAATACTCACAACTTATGGAGATGCTTCAATCTCACCTTCAGGCTGCTAAACCAGAGACCATCACGCCTATGAATCATGTTGCAGGACAGGTCACCATTGAGAATGAT
TGGCAGGGCTGA
Protein sequenceShow/hide protein sequence
MLIALSGKNKVGFIDGTIKKPNGNLLAAWKCNNDIITSWIINSVSKEIAASIIYTGSAKDIWDELKERFQQSDAPRIFQLRKELVTTIQGTLSIEAYYTKLKTVWQELTD
YRPTIDCTCSGLKSLSEFFQSEYVMTFLMGLNESYAKIRAQILLMDPIPPMNKVFSLLIQEERQRAIGTINPPLPSMAMAVAEISKRNSATQFRRKDNRSFCTHCGLRGH
VIDKCYKLHGYPPGYRANNPAARIGQLHNPNGTSHSNGVVANQVSEKNVDITSSPAIQRPSNSSPAFFNSLNSSQYSQLMEMLQSHLQAAKPETITPMNHVAGQVTIEND
WQG