Protein: MCA_00151_1

Length: 1,662 amino acids

Description: Retrotransposon Ty3/Gypsy Gag-Pol polyprotein

Browser: contigA:402223-407287-
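
The browser locus string above can be split into its parts with a small parser. This is a minimal sketch, not the database's own code: it assumes the trailing `-` is a strand marker and that the coordinates are 1-based and inclusive, neither of which the page states. The names `Locus` and `parse_locus` are hypothetical.

```python
import re
from typing import NamedTuple, Optional


class Locus(NamedTuple):
    """Parsed genome-browser locus (hypothetical helper type)."""
    contig: str
    start: int
    end: int
    strand: Optional[str]  # "+", "-", or None when no strand suffix is given

    @property
    def span(self) -> int:
        # Assumes 1-based inclusive coordinates (an assumption, not stated on the page).
        return self.end - self.start + 1


# contig name, a colon, start-end, then an optional trailing strand character.
_LOCUS_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)(?P<strand>[+-])?$"
)


def parse_locus(text: str) -> Locus:
    """Parse strings such as 'contigA:402223-407287-' into a Locus."""
    match = _LOCUS_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized locus string: {text!r}")
    return Locus(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


locus = parse_locus("contigA:402223-407287-")
print(locus.contig, locus.start, locus.end, locus.strand, locus.span)
```

Under those assumptions the locus spans 5,065 bp on the minus strand of contigA.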

RNA-seq: read pairs 917, FPKM 6.8, percentile rank 20.3% (100% = highest expression)
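
For orientation, the conventional FPKM definition (fragments per kilobase of transcript per million mapped fragments) can be sketched as below. The page does not say how its FPKM value was computed, and it gives neither the transcript length nor the total number of mapped fragments, so the inputs in the example call are made-up round numbers chosen only to show the arithmetic.

```python
def fpkm(fragments: int, transcript_length_bp: int, total_mapped_fragments: int) -> float:
    """Conventional FPKM: fragments per kilobase per million mapped fragments.

    Equivalent to fragments / (length_kb * total_millions).
    """
    if transcript_length_bp <= 0 or total_mapped_fragments <= 0:
        raise ValueError("length and total fragment count must be positive")
    return fragments * 1e9 / (transcript_length_bp * total_mapped_fragments)


# Hypothetical inputs (NOT taken from this record): 100 fragments on a
# 1,000 bp transcript in a library of 1,000,000 mapped fragments.
example = fpkm(fragments=100, transcript_length_bp=1_000, total_mapped_fragments=1_000_000)
print(example)
```

Reproducing the reported 6.8 for this gene would require the library size and transcript length used by the original pipeline.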

Protein function

Annotation: Retrotransposon Ty3/Gypsy Gag-Pol polyprotein
EGGNOG: 0PGPZ; to reverse transcriptase
SGD closest match: S000007347; TY3B-G; Transposon Ty3-G Gag-Pol polyprotein
CGD closest match: CAL0000191508; ORF298; Uncharacterized protein

Protein alignments

Hit                  %id     Aln length  E-value  Description
A0A060T683_BLAAD     42.14%  1139        0.0      ARAD1B13926p OS=Blastobotrys adeninivorans GN=GNLVRS02_ARAD1B13926g PE=4 SV=1
UniRef50_A0A060T683  42.14%  1139        0.0      ARAD1B13926p n=1 Tax=Blastobotrys adeninivorans TaxID=409370 RepID=A0A060T683_BLAAD
B5FVH8_YARLI         34.94%  1119        0.0      YALI0E14388p2 OS=Yarrowia lipolytica (strain CLIB 122 / E 150) GN=YALI0_E14388g2 PE=4 SV=1
YG31B_YEAST          39.07%  942         0.0      Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY3B-G PE=1 SV=3
A0A167D9C8_9ASCO     33.04%  905         2e-134   Gag-pol fusion protein OS=Sugiyamaella lignohabitans GN=AWJ20_901 PE=4 SV=1
A0A1D8PI28_CANAL     24.97%  781         1e-57    Uncharacterized protein OS=Candida albicans (strain SC5314 / ATCC MYA-2876) GN=ORF298 PE=4 SV=1
A0A0J9XCS2_GEOCN     33.85%  130         4e-09    Uncharacterized protein OS=Geotrichum candidum GN=BN980_GECA10s00604g PE=4 SV=1

Mitochondrial localization by MitoProt II

Probability of mitochondrial location: 0.0539

Protein family membership

None predicted.

Domains and repeats

[Domain diagram: residue axis 1 to 1662]

Detailed signature matches

    1. PF00078 (RVT_1)
    2. PS50878 (RT_POL)
    1. SSF53098 (Ribonucle...)
    1. PS50994 (INTEGRASE)
Unintegrated signatures (no IPR)
  1. SSF56672 (DNA/RNA p...)
  2. cd00303 (retropepsi...)
  3. cd01647 (RT_LTR)
  4. cd09274 (RNase_HI_R...)
  5. mobidb-lite (disord...)

Residue annotation

  1. Catalytic residue ...
  2. catalytic motif cd...
  3. inhibitor binding ...
  4. Active site flap c...
  5. putative NTP bindi...
  6. putative active si...
  7. putative nucleic a...
  8. active site cd09274
  9. RNA/DNA hybrid bin...

Protein sequence

>MCA_00151_1
MSSSIKNENPTTHTTEQYNSIIIECEKVPRIAKITAYKGQSDPSIINEWISKIESAMRTRFVDSRYWASAAFDLALFDGP
AEKWAKDGLNVVRANNGPIAPLIEWKTFCDAMRVFFVPEAVERALEAKVYRMQKSDTGSVIQYVADLRNCLYLLKPEVRR
KINVKQIFFHGLDSEMRDKLEVFYDNLSSDDLMQRALRMCHEEGRWKFRHFFDPNQTEMTLVDPLAIKAVQASPEPTVGV
DNINYNRNNSKNRNNKGSDNGKKNNNQRNAAGFPANISLPKQEEECFFELQFQSKKLRKPTLKADVGSINKALSVHPPEL
VPTPVPDPDPDPDPEGFIDPCNRFSCLPLDEAIDDDTYEGHNMCCNPIMMDVPRPGEEAPLSLDAGEGDHTRQTLNNNSV
QIARSSDVDAVWRRVANTGGECTKGPNSVPPMKNCSLNFLVDVEEVTDKNLNTKPQLVGKGPPTPKFSVKVSARPIVPPV
TPPISEMEISKIDINAVTPDDGPEIGKKIVVPTCIKLQSRDWKPRFNALIDSGAASYFVSHEAIQRNHLEHLTVPCPPIK
LSSAFSNSQTCKRKVQLTFHIGPIEATKSFFVVHGLSHDMILGTPFTAEFADFVDLKSFSVAGVGSVADDPLFVSAIEFE
RLAKSKENAIGICVLKFDEPVKEPASLEVLPFDASEYKDVLTNDPPTGLPPERDVTHPIDVVPGSTPPYRSYYRLSKAES
DYLTEELSKMANTGMIRPSSSPYSAPVLFVKKKDGSLRLCVDYRMLNNITIKNRYPLPFIEEMLNKVEDARVFSKLDLRS
GYHQIRIRPQDIEKTAFSTSTGHWEFLVMPFGLTNAPATFQNFMNDIFRPYIDKFVNVYLDDILVFSKTKEEHVQHLKKV
LDVLKENKLVANLKKCEFFKTELDFLGYKLDAKGIHITDEKVRAVKEFPIPKTVKECQRFLGMTNYYRKFVPNFSGIAAP
LHDFTSEKTKWTETQTQAFETLKEKLIDAPILISPNSKHTYVLTTDASNKNIGATLEQYEGSRLLGVIAYFSRKLKPTEV
NYAVRDLEFLAVVDALQHFRPLLLGRHFILRTDHFSLTQLQSQTKPPHGRIARWFDALADYDFTTQYIKGKQNVVADALS
RQPSVNAIDTNDATNEEEELMSTPSDDIIRRVRDKLDSDDYFANVIRILNDENDPAHRTISSKYHLDDGLLYFKITPTVP
YEPTLLLWLTLMVIMDHTAPFLRLAHFYYWPVMFPEVKRIVDHCPDCAYTQPNKKTHGYLQPLQIPQQRWDSISLDFVSG
LPEVDGYDMILVVVDRLSKRARFLPTTKTLTGKDCAYLLHREIFKIHGFPLDIVSDRDVRFQELFWKTMHLINGTKNLQS
TSFHPETDGQTERVNRVLRALMSSHAVQYGFNWPFHLATLEFSYNSTFQATIRASPFMADLGRVPRNPSFITPPMSFTGK
IEAEDFAIITQAILARTRDFIAEAQDKQSLYANKTRTALVLKENDEVLVRYAYSLAPNSNKYIDSVNLFTGPYRVVKKID
DNVYEIDLPEYDRSRRNINVSRLKKFEPSEAFPRSPPLNLEAVKARITEISGIVGETDDAYLLQFSNCRPGHIVAVNKQV
LNLLSPAEQERIKRNANAMTRNLMTHPDLAKEQFTHNQHLWRNQALSFFNQEQINHMNRDDS
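
A quick consistency check on a FASTA record like the one above is to join the wrapped sequence lines and compare the residue count with the stated length. The sketch below is a minimal plain-Python reader; `read_fasta` is a hypothetical helper name, and the record in the example is a shortened toy, not the full sequence.

```python
from typing import Dict, List


def read_fasta(text: str) -> Dict[str, str]:
    """Return {record_id: sequence} from FASTA-formatted text.

    The record id is the first whitespace-delimited token after '>'.
    Wrapped sequence lines are concatenated; blank lines are skipped.
    """
    chunks: Dict[str, List[str]] = {}
    current_id = None
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line:
            continue
        if line.startswith(">"):
            tokens = line[1:].split()
            if not tokens:
                raise ValueError("FASTA header line has no identifier")
            current_id = tokens[0]
            chunks[current_id] = []
        elif current_id is not None:
            chunks[current_id].append(line)
        else:
            raise ValueError("sequence data found before any FASTA header")
    return {record_id: "".join(parts) for record_id, parts in chunks.items()}


# Toy record for illustration only (two 10-residue lines).
toy = ">toy_record\nMSSSIKNENP\nTTHTTEQYNS\n"
records = read_fasta(toy)
print(len(records["toy_record"]))
```

Applied to the full MCA_00151_1 record, the count should match the 1,662 amino acids reported at the top of this page.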

GO term prediction

Biological Process

GO:0015074 DNA integration

Molecular Function

GO:0003676 nucleic acid binding

Cellular Component

None predicted.