CuGenDBv2

Moc02g18410 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc02g18410
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase
Genome location: chr2:13717952..13721364
RNA-Seq Expression: Moc02g18410
Synteny: Moc02g18410
Gene Ontology terms:
  GO:0090304 - nucleic acid metabolic process (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0016740 - transferase activity (molecular function)
  GO:0016787 - hydrolase activity (molecular function)
InterPro domains:
  IPR005162 - Retrotransposon gag domain
  IPR021109 - Aspartic peptidase domain superfamily
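
The genome location above uses a `chromosome:start..end` notation. The following is a minimal sketch of how such a string could be split into its parts and the span length computed. It assumes the coordinates are 1-based and inclusive, which this record does not state; the names `Locus` and `parse_location` are hypothetical and not part of any database API.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints are counted.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a 'chr:start..end' string into a Locus."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.groups()
    return Locus(chrom, int(start), int(end))


locus = parse_location("chr2:13717952..13721364")
print(locus.chrom, locus.start, locus.end, locus.length)
# prints: chr2 13717952 13721364 3413
```

Under the inclusive-coordinate assumption the gene spans 3413 bp; with half-open coordinates the length would be one less.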


Homology
GenBank top hits | e value | % identity
XP_022150099.1 uncharacterized protein LOC111018360 [Momordica charantia] | 0.0e+00 | 88.89
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKATRP----------------------
        MS TKQLSKSHVDRLVEIEEQLLYLREVPD LRLLEARVDEFSEKFGEIDAVNAR+DGLPIQDIAMRVETLESKATRP                      
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKATRP----------------------

Query:  ----------------EDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
                        EDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
Subjt:  ----------------EDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSCDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART
        LATMHLTDDAKLWWRSKVNDIQNGRCTINS DDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVM+DIRDMSEKDKVFVFI+GLK WART
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSCDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART

Query:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLISCFLCKGPHRVAECPHRAAL
        KLYEQRVQDLATAMA+AERLLDY+SEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRL+S FLCKGPHRVAECPHRAAL
Subjt:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLISCFLCKGPHRVAECPHRAAL

Query:  TALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSGKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK
        TALQASVQSCNEPEV TDCEKEEDEETPRM ALKFLSAIQKRVNGPKGTS KGLMFVDATINCN AKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK
Subjt:  TALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSGKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK

Query:  AVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLRMEFLIEHKVIPMPLAKCMIVTSNSPIVVTASIKQPGGIRMIFALQLKKGLNREEPTFM
         VN EALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVL MEFLIEHKVIPMPLAKCMIVTSNSP VVT SIKQPGGIRMI ALQLKKGLNREEPTFM
Subjt:  AVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLRMEFLIEHKVIPMPLAKCMIVTSNSPIVVTASIKQPGGIRMIFALQLKKGLNREEPTFM

Query:  AIPMVEQPVETRDVPPEIQVVMREWCPSPRRSPDRLRKS
        AIPMVEQPVETRDVPPEIQVVM+E+       PD L K+
Subjt:  AIPMVEQPVETRDVPPEIQVVMREWCPSPRRSPDRLRKS

XP_022150099.1 uncharacterized protein LOC111018360 [Momordica charantia] | 2.1e-20 | 63.95
Query:  QARWQELLAEFDFKFEHKAGKFNQATNALSRKGEHAALCMLAHIHASKVDGSIRDLIREIFKETPPPELWSSWLRPGRPASFGLKG
        QARWQELLAEFDFKFEHKAGK NQA +ALSRKGEHA LCMLAHIH SK DGSIRDLI E  +  P         +  +   F ++G
Subjt:  QARWQELLAEFDFKFEHKAGKFNQATNALSRKGEHAALCMLAHIHASKVDGSIRDLIREIFKETPPPELWSSWLRPGRPASFGLKG

XP_022150099.1 uncharacterized protein LOC111018360 [Momordica charantia] | 8.2e-304 | 87.32
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKATRP----------------------
        MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVET ESKATRP                      
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKATRP----------------------

Query:  ----------------EDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
                        EDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQ NMGFNKLKVPEPKPFNGNR  KDLENF FDVEQYFK TGT SE MKVT
Subjt:  ----------------EDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSCDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART
        LATMHLTDDAKLWWRSKVNDIQNGRCTINS DDLKKELRGQFF DNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLK WART
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSCDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART

Query:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLISCFLCKGPHRVAECPHRAAL
        KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPK GGADKRP GPNPGPSRGPYPQSQNAQR  SCFLC+GPH+VAECPHRAAL
Subjt:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLISCFLCKGPHRVAECPHRAAL

Query:  TALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSGKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK
        TALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTS KGLMFVDA INCNPAKS MVDSGATHNFISEQEA RLKLTIEKDTGKMK
Subjt:  TALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSGKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK

Query:  AVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLRMEFLIEHKVIPMPLAKCMIVTSNSPIVVTASIKQPGGIRMIFALQLKKGLNREEPTFM
        AVNSEALPIVGVSKRV LKLGTWTGS DFVVVRMDDFDVVL MEFLIEHKVIPMPLAKCMIVTSNSP VVTASIKQPGGIRMI ALQLKKGLNREEPTFM
Subjt:  AVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLRMEFLIEHKVIPMPLAKCMIVTSNSPIVVTASIKQPGGIRMIFALQLKKGLNREEPTFM

Query:  AIPMVEQPVETRDVPPEIQVVMREWCPSPRRSPDRLRKS
              QPVETRDVPPEIQVVMRE+       PD L K+
Subjt:  AIPMVEQPVETRDVPPEIQVVMREWCPSPRRSPDRLRKS

XP_023524533.1 uncharacterized protein LOC111788429 [Cucurbita pepo subsp. pepo] | 2.1e-182 | 54.37
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESK-----ATRPED---------------
        MS TK   K+  DRL  IEE++L+L+EVPD+LR LE RV E SEK  ++DA+  R+DGLPI ++  RV +LE +     + +P D               
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESK-----ATRPED---------------

Query:  ------------------FKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
                          F+ T+D L+  M+ +STR+ +TM+AV      Q + G NKL+ P+P+ F GNRDAK+LENF+FDVEQYFKAT   +++ KVT
Subjt:  ------------------FKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSCDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART
        +A M+L DDAKLWWR+KV DI++G CTI+S +DLK+ELR QF P+N   +A  K+  L+HTG IRDYV+QFS +MLDIR  +EKDKVF FI GL+PWA+T
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSCDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART

Query:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGG-------ADKRPQGPNPGPSRGPYPQSQNAQRLISCFLCKGPHRVAE
        K++E RVQ LA AMA AERL+D  +E    ++    P  G K ++P   ++G         D+ PQ    G SRGPY Q  +    + C LCKGPH+V+ 
Subjt:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGG-------ADKRPQGPNPGPSRGPYPQSQNAQRLISCFLCKGPHRVAE

Query:  CPHRAALTALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSGKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIE
        CPHRA+LTALQ S+Q  NE  V T  +K+ED + PRMGALKFLSA+Q++V  PK    KGLMFVDATIN   +KST++DSGATHNFI++QEARRL LTI 
Subjt:  CPHRAALTALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSGKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIE

Query:  KDTGKMKAVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLRMEFLIEHKVIPMPLAKCMIVTSNSPIVVTASIKQPGGIRMIFALQLKKGLN
        KD GKMKAVNSEALPIVGVSK V  K+G WTG +D VVVRMDDFDVVL MEFL+EHKVIPMPLAKC+++T  +P V+ ASIKQPG +RMI A+QLK+GL 
Subjt:  KDTGKMKAVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLRMEFLIEHKVIPMPLAKCMIVTSNSPIVVTASIKQPGGIRMIFALQLKKGLN

Query:  REEPTFMAIPMVEQPVETRDVPPEIQVVM
        REEPTFMAIP++E+      VP EI+ V+
Subjt:  REEPTFMAIPMVEQPVETRDVPPEIQVVM

XP_023524533.1 uncharacterized protein LOC111788429 [Cucurbita pepo subsp. pepo] | 1.9e-21 | 62.79
Query:  QARWQELLAEFDFKFEHKAGKFNQATNALSRKGEHAALCMLAHIHASKVDGSIRDLIREIFKETPPPELWSSWLRPGRPASFGLKG
        QARWQE LAEFDFKFEHKAGK NQA +ALSRKGEHAALCMLAHIH+SK+DGS+RD+I+E   + P  ++     + G+   F ++G
Subjt:  QARWQELLAEFDFKFEHKAGKFNQATNALSRKGEHAALCMLAHIHASKVDGSIRDLIREIFKETPPPELWSSWLRPGRPASFGLKG

XP_023524533.1 uncharacterized protein LOC111788429 [Cucurbita pepo subsp. pepo] | 2.7e-182 | 54.37
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESK-----ATRPED---------------
        MS TK   K+  DRL  IEE++L+L+EVPD+LR LE RV E SEK  ++DA+  R+DGLPI ++  RV +LE +     + +P D               
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESK-----ATRPED---------------

Query:  ------------------FKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
                          F+ T+D L+  M+ +STR+ +TM+AV      Q + G NKL+ P+P+ F GNRDAK+LENF+FDVEQYFKAT   +++ KVT
Subjt:  ------------------FKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSCDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART
        +A M+L DDAKLWWR+KV DI++G CTI+S +DLK+ELR QF P+N   +A  K+  L+HTG IRDYV+QFS +MLDIR  +EKDKVF FI GL+PWA+T
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSCDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART

Query:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGG-------ADKRPQGPNPGPSRGPYPQSQNAQRLISCFLCKGPHRVAE
        K++E RVQ LA AMA AERL+D  +E    ++    P  G K ++P   ++G         D+ PQ    G SRGPY Q  +    + C LCKGPH+V+ 
Subjt:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGG-------ADKRPQGPNPGPSRGPYPQSQNAQRLISCFLCKGPHRVAE

Query:  CPHRAALTALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSGKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIE
        CPHRA+LTALQ S+Q  NE  V T  +K+ED + PRMGALKFLSA+Q++V  PK    KGLMFVDATIN   +KST++DSGATHNFI++QEARRL LTI 
Subjt:  CPHRAALTALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSGKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIE

Query:  KDTGKMKAVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLRMEFLIEHKVIPMPLAKCMIVTSNSPIVVTASIKQPGGIRMIFALQLKKGLN
        KD GKMKAVNSEALPIVGVSK V  K+G WTG +D VVVRMDDFDVVL MEFL+EHKVIPMPLAKC+++T  +P V+ ASIKQPG +RMI A+QLK+GL 
Subjt:  KDTGKMKAVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLRMEFLIEHKVIPMPLAKCMIVTSNSPIVVTASIKQPGGIRMIFALQLKKGLN

Query:  REEPTFMAIPMVEQPVETRDVPPEIQVVM
        REEPTFMAIP++E+      VP EI+ V+
Subjt:  REEPTFMAIPMVEQPVETRDVPPEIQVVM

XP_023537907.1 uncharacterized protein LOC111798805 [Cucurbita pepo subsp. pepo] | 3.2e-21 | 62.79
Query:  QARWQELLAEFDFKFEHKAGKFNQATNALSRKGEHAALCMLAHIHASKVDGSIRDLIREIFKETPPPELWSSWLRPGRPASFGLKG
        QARWQE LAEFDFKFEHKAGK NQA +ALSRKGEHAALCMLAHIH+SK+DGS+RD+I+E   + P  +      + G+   F ++G
Subjt:  QARWQELLAEFDFKFEHKAGKFNQATNALSRKGEHAALCMLAHIHASKVDGSIRDLIREIFKETPPPELWSSWLRPGRPASFGLKG

XP_023537907.1 uncharacterized protein LOC111798805 [Cucurbita pepo subsp. pepo] | 5.2e-181 | 53.52
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESK-----ATRP-----------------
        MS TK   K+  DRL  IEE++L+L+EVP +LR LEARV E S+K   IDA+  R+DGLPI ++  RV +LE +     + +P                 
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESK-----ATRP-----------------

Query:  ----------------EDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
                        ++F+ TID ++  M  + TR+ +TM+AV N    Q N G +KL+ P+P+ F GNRDAK+LENF+FDVEQYFKAT   +++ KVT
Subjt:  ----------------EDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSCDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART
        +A+M+LTDDAKLWWR+KV DI++G CTI+S +DLKKELR QF P+N   +A  KL  L+HTG IRDYV+QFS +MLDIR  SEKDKVF FI GL+PWA+T
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSCDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART

Query:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPK--------------SGGADKRPQGPNPGPSRGPYPQSQNAQRLISCFLCK
        KL+E +VQ LA AMA  ERLLDY +E    ++    P  G K +KP + +              SG  D+ PQ    G SR PY Q  +    + C LCK
Subjt:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPK--------------SGGADKRPQGPNPGPSRGPYPQSQNAQRLISCFLCK

Query:  GPHRVAECPHRAALTALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSGKGLMFVDATINCNPAKSTMVDSGATHNFISEQEAR
        GPH+V+ CPHR +LTALQ S+Q  N+  V T  +K+ED++ PRMGALKFLSA+Q++V  PK    KGLMFVDATIN  P +ST++DSGATHNFI++QEAR
Subjt:  GPHRVAECPHRAALTALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSGKGLMFVDATINCNPAKSTMVDSGATHNFISEQEAR

Query:  RLKLTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLRMEFLIEHKVIPMPLAKCMIVTSNSPIVVTASIKQPGGIRMIFAL
        RL LTI +D GKMKA+NSEALPIVGVSKRV  K+G WTG +D VV RMDDFDVVL MEFL+EHKVIPMPLAKC+++T  +P V+ ASIKQPG +RMI A+
Subjt:  RLKLTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLRMEFLIEHKVIPMPLAKCMIVTSNSPIVVTASIKQPGGIRMIFAL

Query:  QLKKGLNREEPTFMAIPMVEQPVETRDVPPEIQVVMREW
        QLK+GL REEPTF+AIP++E       VP EI  V+ ++
Subjt:  QLKKGLNREEPTFMAIPMVEQPVETRDVPPEIQVVMREW

TrEMBL top hits | e value | % identity
A0A5A7UIP7 Reverse transcriptase | 6.3e-15 | 51.22
Query:  QARWQELLAEFDFKFEHKAGKFNQATNALSRKGEHAALCMLAHIHASKVDGSIRDLIREIFKETPPPELWSSWLRPGRPASF
        QARWQE LAEFDF+FEHK G  NQA +ALSRK EHAA+C+LAH+  S++ GS+RD +RE  ++    +   +  + G+   F
Subjt:  QARWQELLAEFDFKFEHKAGKFNQATNALSRKGEHAALCMLAHIHASKVDGSIRDLIREIFKETPPPELWSSWLRPGRPASF

A0A5D3BYE6 Reverse transcriptase | 6.2e-180 | 52.31
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKAT------------------------
        MS++  L K+  DRLVE+EEQ+LYL EVPDS+R LE+R++E SEK   IDAV  RV+G PIQ++  RV+ LE+                           
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKAT------------------------

Query:  ----------------RPEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
                          EDF+ T+D +R E+ +++ R++LTMRA+ NQAP    +  +++K+PEPKPF G RDAK LEN++FD+EQYF+AT T +EE K
Subjt:  ----------------RPEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSCDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWA
        VTLATMHL++DAKLWWRS+  DIQ GRCTI++ D LK+ELR QFFP+NVE +ARRKLREL+HTG+IR+YVKQF+ +MLDIRDMSEKDKVF F+EGLKPWA
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSCDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWA

Query:  RTKLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRLISCFLCKGPHRVA
        +TKLYEQRVQDL +A A+AERL D S++    +++ ++ +GG++  +P +PK+ G D+R  G       N G S         + R +SCF+CKGPH   
Subjt:  RTKLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRLISCFLCKGPHRVA

Query:  ECPHRAALTALQASVQSCN---EPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSGKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLK
        ECP++ A  A QAS+ S +   + +   +  + E+ + PRMGALKFLS++QK+V        +GLM+VD  IN  P KSTMVDSGATHNFI+E EA+RL 
Subjt:  ECPHRAALTALQASVQSCN---EPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSGKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLK

Query:  LTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLRMEFLIEHKVIPMPLAKCMIVTSNSPIVVTASIKQPGGIRMIFALQLK
        L  EKD G+MKAVNS ALPI+G+ KR M++LG W+G VDFVVV+MDDFDVVL MEFL+EH+VIPMPLAKC+++T  +P VV   ++QP G++MI A+QLK
Subjt:  LTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLRMEFLIEHKVIPMPLAKCMIVTSNSPIVVTASIKQPGGIRMIFALQLK

Query:  KGLNREEPTFMAIPMVEQPVETRDVPPEIQVVMREWCPSPRRSPDRLRKS
        KGL+R+EPTFMAIP+         VP EI  V+ ++       PD L KS
Subjt:  KGLNREEPTFMAIPMVEQPVETRDVPPEIQVVMREWCPSPRRSPDRLRKS

A0A5D3BYE6 Reverse transcriptase | 6.3e-15 | 51.22
Query:  QARWQELLAEFDFKFEHKAGKFNQATNALSRKGEHAALCMLAHIHASKVDGSIRDLIREIFKETPPPELWSSWLRPGRPASF
        QARWQE LAEFDF+FEHK G  NQA +ALSRK EHAA+C+LAH+  S++ GS+RD +RE  ++    +   +  + G+   F
Subjt:  QARWQELLAEFDFKFEHKAGKFNQATNALSRKGEHAALCMLAHIHASKVDGSIRDLIREIFKETPPPELWSSWLRPGRPASF

A0A5D3BYE6 Reverse transcriptase | 1.4e-179 | 52.15
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKAT------------------------
        MS++    K+  DRLVE+EEQ+LYL EVPDS+R LE+R++E SEK   IDAV  RV+G PIQ++  RV+ LE+                           
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKAT------------------------

Query:  ----------------RPEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
                          EDF+ T+D +R E+ +++ R++LTMRA+ NQAP    +  +++K+PEPKPF G RDAK LEN++FD+EQYF+AT T +EE K
Subjt:  ----------------RPEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSCDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWA
        VTLATMHL++DAKLWWRS+  DIQ GRCTI++ D LK+ELR QFFP+NVE +ARRKLREL+HTG+IR+YVKQF+ +MLDIRDMSEKDKVF F+EGLKPWA
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSCDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWA

Query:  RTKLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRLISCFLCKGPHRVA
        +TKLYEQRVQDL +A A+AERL D S++    +++ ++ +GG++  +P +PK+ G D+R  G       N G S         + R +SCF+CKGPH   
Subjt:  RTKLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPSRGPYPQSQNAQRLISCFLCKGPHRVA

Query:  ECPHRAALTALQASVQSCN---EPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSGKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLK
        ECP++ A  A QAS+ S +   + +   +  + E+ + PRMGALKFLS++QK+V        +GLM+VD  IN  P KSTMVDSGATHNFI+E EA+RL 
Subjt:  ECPHRAALTALQASVQSCN---EPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSGKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLK

Query:  LTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLRMEFLIEHKVIPMPLAKCMIVTSNSPIVVTASIKQPGGIRMIFALQLK
        L  EKD G+MKAVNS ALPI+G+ KR M++LG W+G VDFVVV+MDDFDVVL MEFL+EH+VIPMPLAKC+++T  +P VV   ++QP G++MI A+QLK
Subjt:  LTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLRMEFLIEHKVIPMPLAKCMIVTSNSPIVVTASIKQPGGIRMIFALQLK

Query:  KGLNREEPTFMAIPMVEQPVETRDVPPEIQVVMREWCPSPRRSPDRLRKS
        KGL+R+EPTFMAIP+         VP EI  V+ ++       PD L KS
Subjt:  KGLNREEPTFMAIPMVEQPVETRDVPPEIQVVMREWCPSPRRSPDRLRKS

A0A6J1D906 Reverse transcriptase | 0.0e+00 | 89.05
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKATRP----------------------
        MS TKQLSKSHVDRLVEIEEQLLYLREVPD LRLLEARVDEFSEKFGEIDAVNAR+DGLPIQDIAMRVETLESKATRP                      
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKATRP----------------------

Query:  ----------------EDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
                        EDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
Subjt:  ----------------EDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSCDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART
        LATMHLTDDAKLWWRSKVNDIQNGRCTINS DDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVM+DIRDMSEKDKVFVFIEGLK WART
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSCDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART

Query:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLISCFLCKGPHRVAECPHRAAL
        KLYEQRVQDLATAMA+AERLLDY+SEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRL+S FLCKGPHRVAECPHRAAL
Subjt:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLISCFLCKGPHRVAECPHRAAL

Query:  TALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSGKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK
        TALQASVQSCNEPEV TDCEKEEDEETPRM ALKFLSAIQKRVNGPKGTS KGLMFVDATINCN AKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK
Subjt:  TALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSGKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK

Query:  AVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLRMEFLIEHKVIPMPLAKCMIVTSNSPIVVTASIKQPGGIRMIFALQLKKGLNREEPTFM
         VN EALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVL MEFLIEHKVIPMPLAKCMIVTSNSP VVT SIKQPGGIRMI ALQLKKGLNREEPTFM
Subjt:  AVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLRMEFLIEHKVIPMPLAKCMIVTSNSPIVVTASIKQPGGIRMIFALQLKKGLNREEPTFM

Query:  AIPMVEQPVETRDVPPEIQVVMREWCPSPRRSPDRLRKS
        AIPMVEQPVETRDVPPEIQVVM+E+       PD L K+
Subjt:  AIPMVEQPVETRDVPPEIQVVMREWCPSPRRSPDRLRKS

A0A6J1D906 Reverse transcriptase | 1.0e-20 | 63.95
Query:  QARWQELLAEFDFKFEHKAGKFNQATNALSRKGEHAALCMLAHIHASKVDGSIRDLIREIFKETPPPELWSSWLRPGRPASFGLKG
        QARWQELLAEFDFKFEHKAGK NQA +ALSRKGEHA LCMLAHIH SK DGSIRDLI E  +  P         +  +   F ++G
Subjt:  QARWQELLAEFDFKFEHKAGKFNQATNALSRKGEHAALCMLAHIHASKVDGSIRDLIREIFKETPPPELWSSWLRPGRPASFGLKG

A0A6J1D906 Reverse transcriptase | 4.0e-304 | 87.32
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKATRP----------------------
        MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVET ESKATRP                      
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKATRP----------------------

Query:  ----------------EDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
                        EDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQ NMGFNKLKVPEPKPFNGNR  KDLENF FDVEQYFK TGT SE MKVT
Subjt:  ----------------EDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSCDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART
        LATMHLTDDAKLWWRSKVNDIQNGRCTINS DDLKKELRGQFF DNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLK WART
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSCDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART

Query:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLISCFLCKGPHRVAECPHRAAL
        KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPK GGADKRP GPNPGPSRGPYPQSQNAQR  SCFLC+GPH+VAECPHRAAL
Subjt:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNAQRLISCFLCKGPHRVAECPHRAAL

Query:  TALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSGKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK
        TALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTS KGLMFVDA INCNPAKS MVDSGATHNFISEQEA RLKLTIEKDTGKMK
Subjt:  TALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSGKGLMFVDATINCNPAKSTMVDSGATHNFISEQEARRLKLTIEKDTGKMK

Query:  AVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLRMEFLIEHKVIPMPLAKCMIVTSNSPIVVTASIKQPGGIRMIFALQLKKGLNREEPTFM
        AVNSEALPIVGVSKRV LKLGTWTGS DFVVVRMDDFDVVL MEFLIEHKVIPMPLAKCMIVTSNSP VVTASIKQPGGIRMI ALQLKKGLNREEPTFM
Subjt:  AVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLRMEFLIEHKVIPMPLAKCMIVTSNSPIVVTASIKQPGGIRMIFALQLKKGLNREEPTFM

Query:  AIPMVEQPVETRDVPPEIQVVMREWCPSPRRSPDRLRKS
              QPVETRDVPPEIQVVMRE+       PD L K+
Subjt:  AIPMVEQPVETRDVPPEIQVVMREWCPSPRRSPDRLRKS

A0A6J1IEY4 uncharacterized protein LOC111475733 | 2.5e-181 | 53.52
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESK-----ATRP-----------------
        MS TK   K+  DRL  IEE++L+L+EVP +LR LEARV E S+K   IDA+  R+DGLPI ++  RV +LE +     + +P                 
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESK-----ATRP-----------------

Query:  ----------------EDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
                        ++F+ TID ++  M  + TR+ +TM+AV N    Q N G +KL+ P+P+ F GNRDAK+LENF+FDVEQYFKAT   +++ KVT
Subjt:  ----------------EDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSCDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART
        +A+M+LTDDAKLWWR+KV DI++G CTI+S +DLKKELR QF P+N   +A  KL  L+HTG IRDYV+QFS +MLDIR  SEKDKVF FI GL+PWA+T
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSCDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWART

Query:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPK--------------SGGADKRPQGPNPGPSRGPYPQSQNAQRLISCFLCK
        KL+E +VQ LA AMA  ERLLDY +E    ++    P  G K +KP + +              SG  D+ PQ    G SR PY Q  +    + C LCK
Subjt:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPK--------------SGGADKRPQGPNPGPSRGPYPQSQNAQRLISCFLCK

Query:  GPHRVAECPHRAALTALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSGKGLMFVDATINCNPAKSTMVDSGATHNFISEQEAR
        GPH+V+ CPHR +LTALQ S+Q  N+  V T  +K+ED++ PRMGALKFLSA+Q++V  PK    KGLMFVDATIN  P +ST++DSGATHNFI++QEAR
Subjt:  GPHRVAECPHRAALTALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSGKGLMFVDATINCNPAKSTMVDSGATHNFISEQEAR

Query:  RLKLTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLRMEFLIEHKVIPMPLAKCMIVTSNSPIVVTASIKQPGGIRMIFAL
        RL LTI +D GKMKA+NSEALPIVGVSKRV  K+G WTG +D VV RMDDFDVVL MEFL+EHKVIPMPLAKC+++T  +P V+ ASIKQPG +RMI A+
Subjt:  RLKLTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLRMEFLIEHKVIPMPLAKCMIVTSNSPIVVTASIKQPGGIRMIFAL

Query:  QLKKGLNREEPTFMAIPMVEQPVETRDVPPEIQVVMREW
        QLK+GL REEPTF+AIP++E       VP EI  V+ ++
Subjt:  QLKKGLNREEPTFMAIPMVEQPVETRDVPPEIQVVMREW

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
No hits found
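
The % identity column in the hit tables above can be approximated from an alignment's midline. This is a minimal sketch, assuming the usual BLAST-style convention in which the midline carries a letter for an identical residue, `+` for a conservative substitution and a space otherwise, and that identity is the number of identical columns divided by the alignment length (gaps included). The function name `percent_identity` is hypothetical; the example strings are copied from the first A0A5A7UIP7 alignment listed above.

```python
def percent_identity(query: str, midline: str) -> float:
    """Return identical columns / alignment length, as a percentage.

    ASSUMPTION: letters in the midline mark identical residues, while
    '+' and spaces mark non-identical columns (BLAST-style convention).
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(query), 2)


query = (
    "QARWQELLAEFDFKFEHKAGKFNQATNALSRKGEHAALCMLAHIHASKVDGSIRDLIREIF"
    "KETPPPELWSSWLRPGRPASF"
)
midline = (
    "QARWQE LAEFDF+FEHK G  NQA +ALSRK EHAA+C+LAH+  S++ GS+RD +RE  "
    "++    +   +  + G+   F"
)
print(percent_identity(query, midline))
# prints: 51.22
```

Here 42 identical columns over an 82-column alignment reproduce the 51.22 reported for that hit. Only the letters in the midline are counted, so collapsed whitespace in a copied midline does not change the result.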

Sequences
CDS sequence
ATGTCGGCGACAAAACAACTGAGCAAATCGCACGTCGACCGACTGGTAGAGATAGAAGAACAACTTCTCTACCTAAGAGAAGTCCCAGATTCCCTCCGTCTGCTGGAGGC
GCGAGTTGATGAATTCTCCGAGAAGTTTGGAGAAATAGACGCAGTGAATGCCCGTGTAGACGGGTTGCCAATACAAGATATAGCCATGAGGGTTGAGACCCTAGAAAGCA
AAGCTACGCGTCCTGAAGACTTCAAAGTAACCATCGACACCCTCCGAGCTGAGATGACTGAAATAAGCACTAGAGTGAACCTAACCATGCGAGCCGTGGGAAATCAGGCT
CCCAACCAAGCGAACATGGGGTTCAACAAGTTGAAGGTCCCAGAGCCCAAACCATTCAATGGCAATAGAGACGCAAAAGATCTCGAGAACTTCCTGTTCGATGTAGAACA
GTACTTCAAGGCTACGGGGACAACGTCAGAAGAGATGAAAGTGACTTTGGCCACCATGCATCTTACTGATGATGCAAAGCTGTGGTGGAGATCTAAAGTCAACGACATTC
AGAATGGTCGATGCACGATCAATAGCTGTGATGATCTGAAGAAAGAATTGAGGGGTCAGTTCTTCCCCGACAATGTCGAGTTCATGGCTAGAAGGAAGCTACGTGAACTC
CGACACACTGGAACAATCCGGGACTACGTGAAACAATTCTCTGCCGTGATGCTGGATATTCGCGACATGTCAGAGAAAGACAAGGTGTTCGTCTTTATCGAAGGATTGAA
ACCGTGGGCCAGAACCAAGCTGTATGAACAAAGGGTACAAGACCTTGCCACCGCCATGGCCTCTGCCGAAAGATTGCTAGACTATAGCAGTGAACCATCCCACCCAAAAA
AGAATGCTACAAACCCCACTGGGGGAAACAAGACGTTCAAACCCTTTACCCCAAAGAGTGGGGGAGCTGACAAGAGACCTCAAGGCCCGAATCCCGGTCCTTCCCGAGGA
CCCTATCCACAAAGTCAAAACGCTCAAAGGCTGATATCATGCTTCTTGTGCAAAGGTCCCCACCGGGTAGCTGAATGCCCACACCGAGCTGCCCTTACTGCCCTTCAAGC
ATCCGTTCAGTCGTGCAATGAACCTGAAGTTGGAACGGATTGTGAGAAGGAAGAAGACGAAGAAACACCTAGAATGGGGGCGTTAAAATTCCTATCTGCCATTCAAAAGA
GGGTCAATGGACCCAAAGGAACGTCCGGAAAGGGGCTTATGTTCGTAGATGCTACGATCAATTGTAACCCTGCAAAGAGCACCATGGTGGATTCGGGTGCAACCCACAAC
TTCATATCAGAACAGGAAGCCCGCCGACTGAAATTGACTATTGAGAAGGATACGGGTAAGATGAAGGCCGTCAACTCCGAAGCTCTACCCATTGTGGGAGTTTCTAAAAG
AGTGATGTTAAAATTGGGGACGTGGACAGGCAGTGTCGATTTCGTAGTAGTTCGCATGGACGACTTCGACGTTGTACTGAGGATGGAGTTCCTCATCGAACATAAAGTCA
TACCGATGCCTCTGGCCAAGTGTATGATTGTCACTAGTAATAGTCCCATTGTGGTCACAGCTAGCATCAAACAACCTGGTGGCATAAGGATGATATTCGCGTTACAGTTG
AAGAAAGGCCTCAACCGGGAGGAACCTACTTTTATGGCCATTCCGATGGTCGAACAGCCGGTGGAGACAAGAGATGTCCCACCTGAAATTCAAGTTGTCATGAGAGAGTG
GTGTCCTTCTCCAAGACGGTCACCCGATCGCTTACGAAAGTCGCAAGCTAAATACTGCGGAACGACGTTATACCGTATCAGAGAAAGAAATACTAGCAGTCGTTCACTGC
CTCAGGCCAGGTGGCAGGAATTGTTAGCCGAGTTTGATTTTAAGTTTGAACACAAGGCAGGAAAGTTCAACCAAGCTACCAACGCTCTTAGCCGTAAAGGAGAACATGCG
GCCCTGTGCATGTTAGCCCACATTCACGCCAGCAAAGTGGACGGGTCGATCCGTGACCTCATTAGAGAAATCTTCAAAGAGACCCCTCCGCCCGAACTGTGGTCGAGCTG
GCTAAGACCGGGAAGACCCGCCAGTTTTGGGTTGAAGGGGACCTATTATTTACAAGAGGAAACAGATTGTACGTGCCAAGAATGGGAGACCTGA
mRNA sequence
ATGTCGGCGACAAAACAACTGAGCAAATCGCACGTCGACCGACTGGTAGAGATAGAAGAACAACTTCTCTACCTAAGAGAAGTCCCAGATTCCCTCCGTCTGCTGGAGGC
GCGAGTTGATGAATTCTCCGAGAAGTTTGGAGAAATAGACGCAGTGAATGCCCGTGTAGACGGGTTGCCAATACAAGATATAGCCATGAGGGTTGAGACCCTAGAAAGCA
AAGCTACGCGTCCTGAAGACTTCAAAGTAACCATCGACACCCTCCGAGCTGAGATGACTGAAATAAGCACTAGAGTGAACCTAACCATGCGAGCCGTGGGAAATCAGGCT
CCCAACCAAGCGAACATGGGGTTCAACAAGTTGAAGGTCCCAGAGCCCAAACCATTCAATGGCAATAGAGACGCAAAAGATCTCGAGAACTTCCTGTTCGATGTAGAACA
GTACTTCAAGGCTACGGGGACAACGTCAGAAGAGATGAAAGTGACTTTGGCCACCATGCATCTTACTGATGATGCAAAGCTGTGGTGGAGATCTAAAGTCAACGACATTC
AGAATGGTCGATGCACGATCAATAGCTGTGATGATCTGAAGAAAGAATTGAGGGGTCAGTTCTTCCCCGACAATGTCGAGTTCATGGCTAGAAGGAAGCTACGTGAACTC
CGACACACTGGAACAATCCGGGACTACGTGAAACAATTCTCTGCCGTGATGCTGGATATTCGCGACATGTCAGAGAAAGACAAGGTGTTCGTCTTTATCGAAGGATTGAA
ACCGTGGGCCAGAACCAAGCTGTATGAACAAAGGGTACAAGACCTTGCCACCGCCATGGCCTCTGCCGAAAGATTGCTAGACTATAGCAGTGAACCATCCCACCCAAAAA
AGAATGCTACAAACCCCACTGGGGGAAACAAGACGTTCAAACCCTTTACCCCAAAGAGTGGGGGAGCTGACAAGAGACCTCAAGGCCCGAATCCCGGTCCTTCCCGAGGA
CCCTATCCACAAAGTCAAAACGCTCAAAGGCTGATATCATGCTTCTTGTGCAAAGGTCCCCACCGGGTAGCTGAATGCCCACACCGAGCTGCCCTTACTGCCCTTCAAGC
ATCCGTTCAGTCGTGCAATGAACCTGAAGTTGGAACGGATTGTGAGAAGGAAGAAGACGAAGAAACACCTAGAATGGGGGCGTTAAAATTCCTATCTGCCATTCAAAAGA
GGGTCAATGGACCCAAAGGAACGTCCGGAAAGGGGCTTATGTTCGTAGATGCTACGATCAATTGTAACCCTGCAAAGAGCACCATGGTGGATTCGGGTGCAACCCACAAC
TTCATATCAGAACAGGAAGCCCGCCGACTGAAATTGACTATTGAGAAGGATACGGGTAAGATGAAGGCCGTCAACTCCGAAGCTCTACCCATTGTGGGAGTTTCTAAAAG
AGTGATGTTAAAATTGGGGACGTGGACAGGCAGTGTCGATTTCGTAGTAGTTCGCATGGACGACTTCGACGTTGTACTGAGGATGGAGTTCCTCATCGAACATAAAGTCA
TACCGATGCCTCTGGCCAAGTGTATGATTGTCACTAGTAATAGTCCCATTGTGGTCACAGCTAGCATCAAACAACCTGGTGGCATAAGGATGATATTCGCGTTACAGTTG
AAGAAAGGCCTCAACCGGGAGGAACCTACTTTTATGGCCATTCCGATGGTCGAACAGCCGGTGGAGACAAGAGATGTCCCACCTGAAATTCAAGTTGTCATGAGAGAGTG
GTGTCCTTCTCCAAGACGGTCACCCGATCGCTTACGAAAGTCGCAAGCTAAATACTGCGGAACGACGTTATACCGTATCAGAGAAAGAAATACTAGCAGTCGTTCACTGC
CTCAGGCCAGGTGGCAGGAATTGTTAGCCGAGTTTGATTTTAAGTTTGAACACAAGGCAGGAAAGTTCAACCAAGCTACCAACGCTCTTAGCCGTAAAGGAGAACATGCG
GCCCTGTGCATGTTAGCCCACATTCACGCCAGCAAAGTGGACGGGTCGATCCGTGACCTCATTAGAGAAATCTTCAAAGAGACCCCTCCGCCCGAACTGTGGTCGAGCTG
GCTAAGACCGGGAAGACCCGCCAGTTTTGGGTTGAAGGGGACCTATTATTTACAAGAGGAAACAGATTGTACGTGCCAAGAATGGGAGACCTGA
Protein sequence
MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKATRPEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQA
PNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVTLATMHLTDDAKLWWRSKVNDIQNGRCTINSCDDLKKELRGQFFPDNVEFMARRKLREL
RHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKPWARTKLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRG
PYPQSQNAQRLISCFLCKGPHRVAECPHRAALTALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSGKGLMFVDATINCNPAKSTMVDSGATHN
FISEQEARRLKLTIEKDTGKMKAVNSEALPIVGVSKRVMLKLGTWTGSVDFVVVRMDDFDVVLRMEFLIEHKVIPMPLAKCMIVTSNSPIVVTASIKQPGGIRMIFALQL
KKGLNREEPTFMAIPMVEQPVETRDVPPEIQVVMREWCPSPRRSPDRLRKSQAKYCGTTLYRIRERNTSSRSLPQARWQELLAEFDFKFEHKAGKFNQATNALSRKGEHA
ALCMLAHIHASKVDGSIRDLIREIFKETPPPELWSSWLRPGRPASFGLKGTYYLQEETDCTCQEWET
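
The protein sequence above corresponds to a codon-by-codon translation of the CDS. The sketch below shows one way to check this, assuming the standard genetic code (NCBI translation table 1); the record itself does not say which table was used, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code (ASSUMED: NCBI translation table 1), with codons
# enumerated in TCAG order so they line up with the amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGTCGGCGACAAAACAACTGAGC"))
# prints: MSATKQLS
```

Applied to the first 24 nucleotides of the CDS, the sketch returns `MSATKQLS`, which matches the start of the protein sequence. Passing the full CDS with its line breaks intact also works, because whitespace is stripped before translation.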