CuGenDBv2

Moc04g22590 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g22590
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr4:16417124..16434275
RNA-Seq Expression: Moc04g22590
Synteny: Moc04g22590
Gene Ontology terms: NA
InterPro domains: IPR025724 - GAG-pre-integrase domain
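
The genome location follows a `chromosome:start..end` pattern. A minimal sketch of how such a string could be split into its parts, assuming 1-based inclusive coordinates (an assumption, not stated on this page); `parse_location` is a hypothetical helper name:

```python
import re

def parse_location(location):
    """Split a 'chromosome:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("chr4:16417124..16434275")
# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption the locus spans 17,152 bp.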


Homology
GenBank top hits | e value | % identity
CAA7057786.1 unnamed protein product [Microthlaspi erraticum] | 4.6e-07 | 31.41
Query:  WILDYGASVYICCSKSLFDVISPIFPVYVTLSNKSWFLVEFSSSVRLTT-----DVF--SSWRSIYPPDKFTSKTIDRGSLKHELFLFDATPCP---TLC
        WILD GAS +IC   ++F  IS      +TL NK+     FS +V L+      +VF   S+      D      ID+G   ++L+L +    P     C
Subjt:  WILDYGASVYICCSKSLFDVISPIFPVYVTLSNKSWFLVEFSSSVRLTT-----DVF--SSWRSIYPPDKFTSKTIDRGSLKHELFLFDATPCP---TLC

Query:  AVVSDDTWHRRFGHPSFSHLNALKNALSFSSFSHEDGLPYTSCDAVGSPSIVENAT
         V S + WH+R GHPS S + +L N +        + +P +S   + SPS   + T
Subjt:  AVVSDDTWHRRFGHPSFSHLNALKNALSFSSFSHEDGLPYTSCDAVGSPSIVENAT

KAG7533590.1 Retrotransposon Copia-like N-terminal [Arabidopsis thaliana x Arabidopsis arenosa] | 1.6e-07 | 26.18
Query:  IISPIDPKSMESPSTIQSTPDVLHQCQQLLNLLQSQFSTQNVNNSEASLSHVAGTFGSWILDYGASVYICCSKSLFDVISPIFPVYVTLSNKSWFLVEFS
        + + +  KS  S  +I  TP V +      N++ + FS   +N            F +WI+D GA+ ++CC+ SLF  I+ I    V L N S  ++  S
Subjt:  IISPIDPKSMESPSTIQSTPDVLHQCQQLLNLLQSQFSTQNVNNSEASLSHVAGTFGSWILDYGASVYICCSKSLFDVISPIFPVYVTLSNKSWFLVEFS

Query:  SSVRLTTDV-------FSSWRSIYPPDKFTSKTIDRGSLKHELFLFD--------ATPCPTLCAVVSDDT---WHRRFGHPSFSHLNALKNALSFSSFSH
         +V L+  +         S+      +      I +G L+++L+  D          P P +C + + D+   WH R GHPSF  L  +   LS S    
Subjt:  SSVRLTTDV-------FSSWRSIYPPDKFTSKTIDRGSLKHELFLFD--------ATPCPTLCAVVSDDT---WHRRFGHPSFSHLNALKNALSFSSFSH

Query:  EDGLPY--------TSCDAVGSPSIVENATVVD
        +D   Y        T+   +G PS  +   V+D
Subjt:  EDGLPY--------TSCDAVGSPSIVENATVVD

KYP60497.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] | 1.2e-07 | 28.57
Query:  PKSMESPSTIQSTPDVLHQCQQLLNLLQSQFSTQNVNNSEASLSHVAG---------TFGSWILDYGASVYICCSKSLFDVISPIFPVYVTLSNKSWFLV
        P ++ +  +   TPD   QCQQL+N L +Q   Q     +A  ++V G          + +WI+D GA+ +ICC K LF   + I   +V L N +   V
Subjt:  PKSMESPSTIQSTPDVLHQCQQLLNLLQSQFSTQNVNNSEASLSHVAG---------TFGSWILDYGASVYICCSKSLFDVISPIFPVYVTLSNKSWFLV

Query:  EFSSSVRLTTDVF-----------------------SSWRSIYPPDKFT---SKTIDR-GSLKHE--LFLFD------ATPCPTLCAVVSDDTWHRRFGH
        E   S+++  D+F                       +S+R    P+ FT    KT+ + G+ K +  L +F+       + C   C VV+ DTWH+R GH
Subjt:  EFSSSVRLTTDVF-----------------------SSWRSIYPPDKFT---SKTIDR-GSLKHE--LFLFD------ATPCPTLCAVVSDDTWHRRFGH

Query:  PSFSHLNALKNALSFSS
           S    + N    SS
Subjt:  PSFSHLNALKNALSFSS

KYP66809.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] | 1.2e-07 | 26.91
Query:  IISPIDPKSMESPSTIQSTPDVLHQCQQLLNLLQSQFSTQNVNNSEASLSHVAGT---------FGSWILDYGASVYICCSKSLFDVISPIFPVYVTLSN
        I++ +D  S+ES +T Q++     QCQQLL  + +Q  T    N +A  S+V GT           +WI+D GA+ +ICCSKSL++  + I   +V L N
Subjt:  IISPIDPKSMESPSTIQSTPDVLHQCQQLLNLLQSQFSTQNVNNSEASLSHVAGT---------FGSWILDYGASVYICCSKSLFDVISPIFPVYVTLSN

Query:  KSWFLVEFSSSVRLTTDVFSSWRSIYPPDKF-----------------------------TSKTIDRGSLKHELFLFDAT------PCPTLCAVVSDDTW
         +   VE   S+++  D+F       P  +F                             T +TI     +  L +F+         C + C  V+ +TW
Subjt:  KSWFLVEFSSSVRLTTDVFSSWRSIYPPDKF-----------------------------TSKTIDRGSLKHELFLFDAT------PCPTLCAVVSDDTW

Query:  HRRFGHPSFSHLNALKNALSFSS
        H+R GH        + N +  +S
Subjt:  HRRFGHPSFSHLNALKNALSFSS

XP_022143573.1 uncharacterized protein LOC111013441 [Momordica charantia] | 2.6e-18 | 39.42
Query:  QQLLNLLQSQFST-QNVNNSEASLSHVAGT-FGSWILDYGASVYICCSKSLFDVISPIFPVYVTLSNKSWFLVEFSSSVRLTTDVFSSWRSIYPP-----
        QQL  LLQSQ ST + V +++ + S+   T   S ILD+GAS +IC  + LFD I  I PV+V L NK  F+VE+S  VRL +D  S    +Y P     
Subjt:  QQLLNLLQSQFST-QNVNNSEASLSHVAGT-FGSWILDYGASVYICCSKSLFDVISPIFPVYVTLSNKSWFLVEFSSSVRLTTDVFSSWRSIYPP-----

Query:  --------------------------DKFTSKTIDRGSLKHELFLFDATP---------CPTLCAVVSDDTWHRRFGHPSFSHLNALKNALSFSSFSHED
                                  DK  SKTID+G L H L+L D T          C +   +VS D WH R GHPSF+ L ALK+ L   + S ED
Subjt:  --------------------------DKFTSKTIDRGSLKHELFLFDATP---------CPTLCAVVSDDTWHRRFGHPSFSHLNALKNALSFSSFSHED

Query:  GLPYTSCD
         L   SCD
Subjt:  GLPYTSCD

TrEMBL top hits | e value | % identity
A0A151T0D0 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 5.8e-08 | 28.57
Query:  PKSMESPSTIQSTPDVLHQCQQLLNLLQSQFSTQNVNNSEASLSHVAG---------TFGSWILDYGASVYICCSKSLFDVISPIFPVYVTLSNKSWFLV
        P ++ +  +   TPD   QCQQL+N L +Q   Q     +A  ++V G          + +WI+D GA+ +ICC K LF   + I   +V L N +   V
Subjt:  PKSMESPSTIQSTPDVLHQCQQLLNLLQSQFSTQNVNNSEASLSHVAG---------TFGSWILDYGASVYICCSKSLFDVISPIFPVYVTLSNKSWFLV

Query:  EFSSSVRLTTDVF-----------------------SSWRSIYPPDKFT---SKTIDR-GSLKHE--LFLFD------ATPCPTLCAVVSDDTWHRRFGH
        E   S+++  D+F                       +S+R    P+ FT    KT+ + G+ K +  L +F+       + C   C VV+ DTWH+R GH
Subjt:  EFSSSVRLTTDVF-----------------------SSWRSIYPPDKFT---SKTIDR-GSLKHE--LFLFD------ATPCPTLCAVVSDDTWHRRFGH

Query:  PSFSHLNALKNALSFSS
           S    + N    SS
Subjt:  PSFSHLNALKNALSFSS

A0A151TIF3 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 5.8e-08 | 26.91
Query:  IISPIDPKSMESPSTIQSTPDVLHQCQQLLNLLQSQFSTQNVNNSEASLSHVAGT---------FGSWILDYGASVYICCSKSLFDVISPIFPVYVTLSN
        I++ +D  S+ES +T Q++     QCQQLL  + +Q  T    N +A  S+V GT           +WI+D GA+ +ICCSKSL++  + I   +V L N
Subjt:  IISPIDPKSMESPSTIQSTPDVLHQCQQLLNLLQSQFSTQNVNNSEASLSHVAGT---------FGSWILDYGASVYICCSKSLFDVISPIFPVYVTLSN

Query:  KSWFLVEFSSSVRLTTDVFSSWRSIYPPDKF-----------------------------TSKTIDRGSLKHELFLFDAT------PCPTLCAVVSDDTW
         +   VE   S+++  D+F       P  +F                             T +TI     +  L +F+         C + C  V+ +TW
Subjt:  KSWFLVEFSSSVRLTTDVFSSWRSIYPPDKF-----------------------------TSKTIDRGSLKHELFLFDAT------PCPTLCAVVSDDTW

Query:  HRRFGHPSFSHLNALKNALSFSS
        H+R GH        + N +  +S
Subjt:  HRRFGHPSFSHLNALKNALSFSS

A0A438EW68 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 8.4e-07 | 31.47
Query:  SPSTIQSTPDVLHQCQQLLNLLQSQFSTQNVNNSE-----ASLSHVAGT-----FGSWILDYGASVYICCSKSLFDVISPIFPVYVTLSNKSWFLVEFSS
        S ST+ S   +  QCQQL+ LL +Q S+ +  ++E      S+S+ AG         WI+D GA+ ++C   SLFD    +  V VTL       ++   
Subjt:  SPSTIQSTPDVLHQCQQLLNLLQSQFSTQNVNNSE-----ASLSHVAGT-----FGSWILDYGASVYICCSKSLFDVISPIFPVYVTLSNKSWFLVEFSS

Query:  SVRLTTDV-------FSSWRSIYPPDKFTSKTIDRGSLKHELFLFD------------ATPCPTLCAVVSDDTWHRRFGHPSFSHLNALKNALSFSS
        SV L+ DV         ++R     +    K I +GS K +L+  D            A+  PT   +     WH R GHPSFS L  L++ L F S
Subjt:  SVRLTTDV-------FSSWRSIYPPDKFTSKTIDRGSLKHELFLFD------------ATPCPTLCAVVSDDTWHRRFGHPSFSHLNALKNALSFSS

A0A6D2KLF9 Uncharacterized protein | 2.2e-07 | 31.41
Query:  WILDYGASVYICCSKSLFDVISPIFPVYVTLSNKSWFLVEFSSSVRLTT-----DVF--SSWRSIYPPDKFTSKTIDRGSLKHELFLFDATPCP---TLC
        WILD GAS +IC   ++F  IS      +TL NK+     FS +V L+      +VF   S+      D      ID+G   ++L+L +    P     C
Subjt:  WILDYGASVYICCSKSLFDVISPIFPVYVTLSNKSWFLVEFSSSVRLTT-----DVF--SSWRSIYPPDKFTSKTIDRGSLKHELFLFDATPCP---TLC

Query:  AVVSDDTWHRRFGHPSFSHLNALKNALSFSSFSHEDGLPYTSCDAVGSPSIVENAT
         V S + WH+R GHPS S + +L N +        + +P +S   + SPS   + T
Subjt:  AVVSDDTWHRRFGHPSFSHLNALKNALSFSSFSHEDGLPYTSCDAVGSPSIVENAT

A0A6J1CR17 uncharacterized protein LOC111013441 | 1.3e-18 | 39.42
Query:  QQLLNLLQSQFST-QNVNNSEASLSHVAGT-FGSWILDYGASVYICCSKSLFDVISPIFPVYVTLSNKSWFLVEFSSSVRLTTDVFSSWRSIYPP-----
        QQL  LLQSQ ST + V +++ + S+   T   S ILD+GAS +IC  + LFD I  I PV+V L NK  F+VE+S  VRL +D  S    +Y P     
Subjt:  QQLLNLLQSQFST-QNVNNSEASLSHVAGT-FGSWILDYGASVYICCSKSLFDVISPIFPVYVTLSNKSWFLVEFSSSVRLTTDVFSSWRSIYPP-----

Query:  --------------------------DKFTSKTIDRGSLKHELFLFDATP---------CPTLCAVVSDDTWHRRFGHPSFSHLNALKNALSFSSFSHED
                                  DK  SKTID+G L H L+L D T          C +   +VS D WH R GHPSF+ L ALK+ L   + S ED
Subjt:  --------------------------DKFTSKTIDRGSLKHELFLFDATP---------CPTLCAVVSDDTWHRRFGHPSFSHLNALKNALSFSSFSHED

Query:  GLPYTSCD
         L   SCD
Subjt:  GLPYTSCD
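
In the alignments above, the line between Query and Subjt marks each identical residue with its letter and each conservative substitution with `+`. A rough sketch of counting identities for one alignment segment, assuming that BLAST-style midline convention; `count_identities` is a hypothetical helper, and the count for a single segment is not the whole-alignment % identity reported in the table:

```python
def count_identities(query, midline):
    """Return (identical columns, total columns) for one alignment segment.

    Assumes a BLAST-style midline: a letter marks an identical residue,
    '+' marks a conservative substitution, and a space marks a mismatch or gap.
    """
    # Trailing spaces are often stripped from midlines, so pad to the query length.
    padded = midline.ljust(len(query))[:len(query)]
    identical = sum(1 for symbol in padded if symbol.isalpha())
    return identical, len(query)

# Last segment of the A0A6J1CR17 alignment, with the 8-character line prefix removed.
identical, columns = count_identities("GLPYTSCD", " L   SCD")
print(f"{identical}/{columns} identical columns")
```

Summing both numbers over every segment of one hit before dividing would give a per-hit estimate, though the database's own figure may be computed over a longer alignment than the excerpt shown.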

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
No hits found

Sequences
CDS sequence
ATGGCTACTCCGTCGGATTCTCTCGTAGATCCACTGATTCTTTCTCCTGTTGATGCTACTGATGTCTCTTCTCAGTCTATCATCAATGGCGGAAATGTTTCTACT
AATCCGTATTATCTTCACCATATAATTATATCGCCCATTGATCCAAAATCTATGGAGTCTCCCTCAACTATACAGTCTACTCCTGATGTTCTTCATCAGTGTCAG
CAGTTGCTCAATCTCTTGCAGTCTCAGTTTTCGACTCAGAACGTTAATAATAGTGAAGCTTCTCTTTCACATGTGGCAGGTACTTTTGGTTCCTGGATTTTAGAT
TATGGTGCATCTGTGTACATTTGCTGTTCCAAATCCCTTTTTGATGTTATTTCACCCATTTTTCCTGTATATGTTACTTTATCGAATAAGTCTTGGTTTTTGGTG
GAATTTTCTAGTTCAGTTCGGCTTACTACTGATGTTTTTTCTTCATGGCGTTCTATATATCCCCCAGACAAGTTTACTTCGAAGACGATTGACAGGGGTAGTTTA
AAGCATGAATTGTTTCTGTTTGATGCCACACCATGTCCTACCCTTTGTGCTGTTGTCTCTGATGACACTTGGCATCGTCGATTTGGTCATCCCTCCTTTAGCCAT
TTAAATGCTTTAAAGAATGCCTTATCTTTTAGCTCGTTTTCACATGAAGATGGTTTACCTTATACTTCTTGTGATGCTGTTGGTTCTCCTTCTATTGTTGAGAAT
GCTACTGTTGTTGACCATGCTTCTGTTGGTGCTCCTTCAGCTCTTGCTAGTGATGTTTTCAGCCCTCCTAATGTTGTCACAGATGGGTCTCTTCTGCCTCTTACT
AATGCTGGTAATACTTCTGGGGCTTTCTTGAATGAGAATACTTTGATTGATGCTTCTGATACATTTGAGGTCCATTTGAATGAGTCTGCTGTCACAGAGTTCTGC
AAAAACACAGAACACAGAAGAAGAACGCCCAACCCGTTTTCTCTCTATCAAGAAGAACTCTCTCAAGCTCTCCCTCTCGTTCCAAAGGATTGCTCCCACCAGCAC
GATCCCAAGACCCAAGAGGATAGCGAGGAAGACACAGTGGTGGTGTTTGAGGAAAACTCGTTGAAGAAACTCCGGGTCGCTCGAAAGTCGTTCGTGGCGTCGGAT
TGGGCAAAAGTTGCAGAAAACAGCGAAGAGACAGAGCCATGGCGCTATGCAGCAGCGCCATGGAGCTGCGGGACAGCACATAGCGCCACGGCGCTGCCCTTAGGC
GCCGAGGCGCTGTCCCGGTTCTGCAAAAACACAGAACACAGAAGAAGAACGCCCAACCCGTTTTCTCTCTATCAAGAAGAACTCTCTCAAGCTCTCCCTCTCGTT
CCAAAGGATTGCTCCCACCAGCACGATCCCAAGACTCAAGAGGATAGCGAGGAAGACACAGTGGTGGTGTTCGAGGGAAACTCGTTGAAGAAACGTTCTTCAAAG
AGCTTGAAAAGACGAATCCAACGCAATTTGAATCGTTCAAATCAGAGTTCAAAAGAAGAAGTTATGACCAAAATAAAATTGGGAAGGAAAATCCTACGTGGATAC
GCGGACCAACCAGAAGATGACACGTGGCAAGATGCTGACGTGGCAGCTGACTTGGATTCTCAATGGTCAATTTCTGATGACGTGGTAGATGACGTGGCACTCCAA
CAGTCAGATTCGATGATGTGGCATATGACGTGGCAATGA
mRNA sequence
ATGGCTACTCCGTCGGATTCTCTCGTAGATCCACTGATTCTTTCTCCTGTTGATGCTACTGATGTCTCTTCTCAGTCTATCATCAATGGCGGAAATGTTTCTACT
AATCCGTATTATCTTCACCATATAATTATATCGCCCATTGATCCAAAATCTATGGAGTCTCCCTCAACTATACAGTCTACTCCTGATGTTCTTCATCAGTGTCAG
CAGTTGCTCAATCTCTTGCAGTCTCAGTTTTCGACTCAGAACGTTAATAATAGTGAAGCTTCTCTTTCACATGTGGCAGGTACTTTTGGTTCCTGGATTTTAGAT
TATGGTGCATCTGTGTACATTTGCTGTTCCAAATCCCTTTTTGATGTTATTTCACCCATTTTTCCTGTATATGTTACTTTATCGAATAAGTCTTGGTTTTTGGTG
GAATTTTCTAGTTCAGTTCGGCTTACTACTGATGTTTTTTCTTCATGGCGTTCTATATATCCCCCAGACAAGTTTACTTCGAAGACGATTGACAGGGGTAGTTTA
AAGCATGAATTGTTTCTGTTTGATGCCACACCATGTCCTACCCTTTGTGCTGTTGTCTCTGATGACACTTGGCATCGTCGATTTGGTCATCCCTCCTTTAGCCAT
TTAAATGCTTTAAAGAATGCCTTATCTTTTAGCTCGTTTTCACATGAAGATGGTTTACCTTATACTTCTTGTGATGCTGTTGGTTCTCCTTCTATTGTTGAGAAT
GCTACTGTTGTTGACCATGCTTCTGTTGGTGCTCCTTCAGCTCTTGCTAGTGATGTTTTCAGCCCTCCTAATGTTGTCACAGATGGGTCTCTTCTGCCTCTTACT
AATGCTGGTAATACTTCTGGGGCTTTCTTGAATGAGAATACTTTGATTGATGCTTCTGATACATTTGAGGTCCATTTGAATGAGTCTGCTGTCACAGAGTTCTGC
AAAAACACAGAACACAGAAGAAGAACGCCCAACCCGTTTTCTCTCTATCAAGAAGAACTCTCTCAAGCTCTCCCTCTCGTTCCAAAGGATTGCTCCCACCAGCAC
GATCCCAAGACCCAAGAGGATAGCGAGGAAGACACAGTGGTGGTGTTTGAGGAAAACTCGTTGAAGAAACTCCGGGTCGCTCGAAAGTCGTTCGTGGCGTCGGAT
TGGGCAAAAGTTGCAGAAAACAGCGAAGAGACAGAGCCATGGCGCTATGCAGCAGCGCCATGGAGCTGCGGGACAGCACATAGCGCCACGGCGCTGCCCTTAGGC
GCCGAGGCGCTGTCCCGGTTCTGCAAAAACACAGAACACAGAAGAAGAACGCCCAACCCGTTTTCTCTCTATCAAGAAGAACTCTCTCAAGCTCTCCCTCTCGTT
CCAAAGGATTGCTCCCACCAGCACGATCCCAAGACTCAAGAGGATAGCGAGGAAGACACAGTGGTGGTGTTCGAGGGAAACTCGTTGAAGAAACGTTCTTCAAAG
AGCTTGAAAAGACGAATCCAACGCAATTTGAATCGTTCAAATCAGAGTTCAAAAGAAGAAGTTATGACCAAAATAAAATTGGGAAGGAAAATCCTACGTGGATAC
GCGGACCAACCAGAAGATGACACGTGGCAAGATGCTGACGTGGCAGCTGACTTGGATTCTCAATGGTCAATTTCTGATGACGTGGTAGATGACGTGGCACTCCAA
CAGTCAGATTCGATGATGTGGCATATGACGTGGCAATGA
Protein sequence
MATPSDSLVDPLILSPVDATDVSSQSIINGGNVSTNPYYLHHIIISPIDPKSMESPSTIQSTPDVLHQCQQLLNLLQSQFSTQNVNNSEASLSHVAGTFGSWILD
YGASVYICCSKSLFDVISPIFPVYVTLSNKSWFLVEFSSSVRLTTDVFSSWRSIYPPDKFTSKTIDRGSLKHELFLFDATPCPTLCAVVSDDTWHRRFGHPSFSH
LNALKNALSFSSFSHEDGLPYTSCDAVGSPSIVENATVVDHASVGAPSALASDVFSPPNVVTDGSLLPLTNAGNTSGAFLNENTLIDASDTFEVHLNESAVTEFC
KNTEHRRRTPNPFSLYQEELSQALPLVPKDCSHQHDPKTQEDSEEDTVVVFEENSLKKLRVARKSFVASDWAKVAENSEETEPWRYAAAPWSCGTAHSATALPLG
AEALSRFCKNTEHRRRTPNPFSLYQEELSQALPLVPKDCSHQHDPKTQEDSEEDTVVVFEGNSLKKRSSKSLKRRIQRNLNRSNQSSKEEVMTKIKLGRKILRGY
ADQPEDDTWQDADVAADLDSQWSISDDVVDDVALQQSDSMMWHMTWQ
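
The protein sequence can be cross-checked against the CDS by translating it codon by codon. A minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper, and only the first 30 and last 18 nucleotides of the CDS above are used as input:

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

cds_start = "ATGGCTACTCCGTCGGATTCTCTCGTAGAT"  # first 10 codons of the CDS
cds_end = "CATATGACGTGGCAATGA"               # last 6 codons, including the stop
print(translate(cds_start))  # MATPSDSLVD
print(translate(cds_end))    # HMTWQ
```

Both fragments agree with the start (`MATPSDSLVD`) and end (`HMTWQ`) of the listed protein sequence; running the full CDS through the same function would extend the check to every residue.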